BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 007737
         (591 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255545664|ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 774

 Score = 1006 bits (2600), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 474/579 (81%), Positives = 520/579 (89%), Gaps = 7/579 (1%)

Query: 3   FHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSR 62
           F     +FPLL L       ++ ST+PPFSCDPSNPST +F FCKT+LPISQR RDLVSR
Sbjct: 8   FFTFFTIFPLLIL-------QITSTEPPFSCDPSNPSTSSFLFCKTSLPISQRVRDLVSR 60

Query: 63  LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
           LTLDEKISQLV+SAP+IPRLGIPAYEWWSEALHGVA VG+GI F G I+ ATSFPQVILT
Sbjct: 61  LTLDEKISQLVSSAPSIPRLGIPAYEWWSEALHGVANVGRGIHFEGAIKAATSFPQVILT 120

Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLV 182
           AASFD+Y WYRIGQ IG EARA+YNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDPLV
Sbjct: 121 AASFDAYQWYRIGQVIGREARAVYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLV 180

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
           TGKYAVSYVRGVQGD+F GGKLKG LQASACCKHFTAYDLDNWKG  R+ FDARVTMQDL
Sbjct: 181 TGKYAVSYVRGVQGDSFQGGKLKGHLQASACCKHFTAYDLDNWKGVNRFVFDARVTMQDL 240

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
           ADTYQPPF+SCV+QG+ASGIMCAYNRVNGIPSCAD NLLS+TAR QW FHGYI SDCDAV
Sbjct: 241 ADTYQPPFQSCVQQGKASGIMCAYNRVNGIPSCADFNLLSRTARGQWDFHGYIASDCDAV 300

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
           SIIYD +GYAKSPEDAVVDVLKAGMDVNCGS+LQKHTKAAV+QKKLPE+ IDRALHNLFS
Sbjct: 301 SIIYDNQGYAKSPEDAVVDVLKAGMDVNCGSYLQKHTKAAVEQKKLPEASIDRALHNLFS 360

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
           VRMRLGLFNGNPT QPF  IG D VCS  HQ+LAL+AA++GIVLLKNS  LLPL KSK+V
Sbjct: 361 VRMRLGLFNGNPTEQPFSNIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLQKSKTV 420

Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAV 482
           SLA+IGPNANS +TLLGNYAGP C+++TPLQALQ YV+NT+YY GCDTV CSSASIDKAV
Sbjct: 421 SLAVIGPNANSVQTLLGNYAGPPCKTVTPLQALQYYVKNTIYYSGCDTVKCSSASIDKAV 480

Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
           DIAKG D VV++MGLDQTQE+EELDR+DLVLPG+QQELIT VA++AK P++LVLL GGPV
Sbjct: 481 DIAKGVDRVVMIMGLDQTQEREELDRLDLVLPGKQQELITNVAKSAKNPIVLVLLSGGPV 540

Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           DI+FAKYD NIGSILWAGYPGEAG +ALAE+IFGDHNPG
Sbjct: 541 DISFAKYDENIGSILWAGYPGEAGGIALAEIIFGDHNPG 579


>gi|224066931|ref|XP_002302285.1| predicted protein [Populus trichocarpa]
 gi|222844011|gb|EEE81558.1| predicted protein [Populus trichocarpa]
          Length = 773

 Score =  973 bits (2516), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 458/559 (81%), Positives = 503/559 (89%)

Query: 23  RVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
           RVDSTQPPFSCD SNPST+ FPFC+TTLPISQRARDLVSRLTLDEKISQLVNSAP IPRL
Sbjct: 20  RVDSTQPPFSCDSSNPSTKAFPFCETTLPISQRARDLVSRLTLDEKISQLVNSAPPIPRL 79

Query: 83  GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
           GIP YEWWSEALHGV+  G GI FN  I+GATSFPQVILTAASFD+Y WYRIGQAIG EA
Sbjct: 80  GIPGYEWWSEALHGVSNAGPGIHFNDNIKGATSFPQVILTAASFDAYQWYRIGQAIGKEA 139

Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
           RALYNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDPLVTG YA SYV+GVQGD+F GG
Sbjct: 140 RALYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGLYAASYVKGVQGDSFEGG 199

Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
           K+KG LQASACCKHFTAYDLDNWKG  R+ FDARVTMQDLADTYQPPF+SCV+QGRASGI
Sbjct: 200 KIKGHLQASACCKHFTAYDLDNWKGMNRFVFDARVTMQDLADTYQPPFKSCVEQGRASGI 259

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           MCAYN+VNG+PSCAD NLLSKTAR QWGF GYITSDCDAVSII+D +GYAKSPEDAVVDV
Sbjct: 260 MCAYNKVNGVPSCADSNLLSKTARAQWGFRGYITSDCDAVSIIHDDQGYAKSPEDAVVDV 319

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           LKAGMDVNCGS+L KH K AV+QKKL ES+ID+ALHNLFSVRMRLGLFNG P  Q FG I
Sbjct: 320 LKAGMDVNCGSYLLKHAKVAVEQKKLSESDIDKALHNLFSVRMRLGLFNGRPEGQLFGNI 379

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
           G D VCS  HQ+LAL+AA++GIVLLKNS  LLPL KSK+ SLA+IGPNANS + LLGNYA
Sbjct: 380 GPDQVCSQEHQILALEAARNGIVLLKNSARLLPLSKSKTKSLAVIGPNANSGQMLLGNYA 439

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
           GP CR +TPLQALQ+Y++ TVY+P CDTV CSSAS+D+AVD+AKGAD+VVLMMGLDQTQE
Sbjct: 440 GPPCRFVTPLQALQSYIKQTVYHPACDTVQCSSASVDRAVDVAKGADNVVLMMGLDQTQE 499

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
           +EELDR DL+LPG+QQELI  VA+AAK PV+LVL  GGPVDI+FAK D+NIGSILWAGYP
Sbjct: 500 REELDRTDLLLPGKQQELIIAVAKAAKNPVVLVLFSGGPVDISFAKNDKNIGSILWAGYP 559

Query: 563 GEAGAVALAEVIFGDHNPG 581
           GE GA+ALAE++FGDHNPG
Sbjct: 560 GEGGAIALAEIVFGDHNPG 578


>gi|224082152|ref|XP_002306583.1| predicted protein [Populus trichocarpa]
 gi|222856032|gb|EEE93579.1| predicted protein [Populus trichocarpa]
          Length = 745

 Score =  957 bits (2474), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 456/565 (80%), Positives = 508/565 (89%)

Query: 17  FTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA 76
           FT ++ +VDSTQPPFSCD SNPST+TFPFCKTTLPISQRA DLVSRLTL+EKISQLVNSA
Sbjct: 14  FTVIVLQVDSTQPPFSCDSSNPSTKTFPFCKTTLPISQRANDLVSRLTLEEKISQLVNSA 73

Query: 77  PAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQ 136
             IPRLGIP Y+WWSEALHGVA  G GI FNGTI+ ATSFPQVIL+AASFD+  WYRI Q
Sbjct: 74  QPIPRLGIPGYQWWSEALHGVAYAGPGIRFNGTIKRATSFPQVILSAASFDANQWYRISQ 133

Query: 137 AIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG 196
           AIG EARALYNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDPL+TGKYAVSYVRG+QG
Sbjct: 134 AIGKEARALYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLMTGKYAVSYVRGLQG 193

Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
           D+F GG++KG LQASACCKHFTAYDL+NW GT+RY FDA VT QDLADTYQPPF+SCV++
Sbjct: 194 DSFKGGEIKGPLQASACCKHFTAYDLENWNGTSRYVFDAYVTAQDLADTYQPPFKSCVEE 253

Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
           GRASGIMCAYNRVNGIP+CAD N LS+TAR QWGF GYI SDCDAVSII+DA+GYAK+PE
Sbjct: 254 GRASGIMCAYNRVNGIPNCADSNFLSRTARAQWGFDGYIASDCDAVSIIHDAQGYAKTPE 313

Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
           DAVV VLKAGMDVNCGS+LQ+HTKAAV QKKL  SEIDRALHNLFSVRMRLGLFNGNPT 
Sbjct: 314 DAVVAVLKAGMDVNCGSYLQQHTKAAVDQKKLTISEIDRALHNLFSVRMRLGLFNGNPTG 373

Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
           Q FG IG D VCS  +Q+LAL AA++GIVLLKNS GLLPL KSK++SLA+IGPNANS +T
Sbjct: 374 QQFGNIGPDQVCSQENQILALDAARNGIVLLKNSAGLLPLSKSKTMSLAVIGPNANSVQT 433

Query: 437 LLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           LLGNYAGP C+ +TPLQALQ+Y+++T+ YPGCD+V CSSASI  AV++AKGADHVVL+MG
Sbjct: 434 LLGNYAGPPCKLVTPLQALQSYIKHTIPYPGCDSVQCSSASIVGAVNVAKGADHVVLIMG 493

Query: 497 LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
           LD TQEKE LDR DLVLPG+QQELI  VA+AAK PV+LVLL GGPVDI+FAK D+NIGSI
Sbjct: 494 LDDTQEKEGLDRRDLVLPGKQQELIISVAKAAKNPVVLVLLSGGPVDISFAKNDKNIGSI 553

Query: 557 LWAGYPGEAGAVALAEVIFGDHNPG 581
           LWAGYPGEAGA+ALAE+IFGDHNPG
Sbjct: 554 LWAGYPGEAGAIALAEIIFGDHNPG 578


>gi|224066929|ref|XP_002302284.1| predicted protein [Populus trichocarpa]
 gi|222844010|gb|EEE81557.1| predicted protein [Populus trichocarpa]
          Length = 742

 Score =  933 bits (2412), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 443/585 (75%), Positives = 506/585 (86%), Gaps = 8/585 (1%)

Query: 1   MIFHKLSL-VFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDL 59
           M    L L +  L+ +  TSL   V+STQPP+SCD S+PST+ +PFC+T LPISQR  DL
Sbjct: 1   MRLQNLCLRILILIAIHTTSLHLYVESTQPPYSCDSSDPSTKLYPFCQTKLPISQRVEDL 60

Query: 60  VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA---GVGKGIFFNGTIRGATSF 116
           VSRLTLDEK+SQLV++APAIPRLGIPAYEWWSEALHGVA    V +GI FNGTIR ATSF
Sbjct: 61  VSRLTLDEKVSQLVDTAPAIPRLGIPAYEWWSEALHGVALQTTVRQGIRFNGTIRFATSF 120

Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETP 176
           PQVILTAASFD++LWYRIGQ IG EAR +YNAGQA GMTFWAPNINIFRDPRWGRGQETP
Sbjct: 121 PQVILTAASFDAHLWYRIGQVIGKEARGIYNAGQATGMTFWAPNINIFRDPRWGRGQETP 180

Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
           GEDPLV GKYAVSYVRGVQGD+F GG L  +LQASACCKHFTAYDLD WKG  R+ FDA 
Sbjct: 181 GEDPLVAGKYAVSYVRGVQGDSFGGGTLGEQLQASACCKHFTAYDLDKWKGMNRFVFDA- 239

Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
              QDLADTYQPPF+SC+++G+ASGIMCAYNRVNG+P+CAD NLLSK AR QWGF+GYIT
Sbjct: 240 ---QDLADTYQPPFQSCIQEGKASGIMCAYNRVNGVPNCADYNLLSKKARGQWGFYGYIT 296

Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
           SDCDAV+II+D +GYAKSPEDAV DVLKAGMDVNCG +L+ +TK+AVK+KKLPESEIDRA
Sbjct: 297 SDCDAVAIIHDDQGYAKSPEDAVADVLKAGMDVNCGDYLKNYTKSAVKKKKLPESEIDRA 356

Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
           LHNLFS+RMRLGLFNGNPT QP+G I  D VCS  HQ LAL+AAQDGIVLLKN   LLPL
Sbjct: 357 LHNLFSIRMRLGLFNGNPTKQPYGNIAPDQVCSQEHQALALKAAQDGIVLLKNPDKLLPL 416

Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSA 476
            K ++ SLA+IGPNAN++  LLGNY GP C+++TPLQ LQNY++NT Y+PGC  VACSSA
Sbjct: 417 SKLETKSLAVIGPNANNSTKLLGNYFGPPCKTVTPLQGLQNYIKNTRYHPGCSRVACSSA 476

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
           SI++AV IAKGAD V+L+MGLDQTQEKEE DRVDLVLPG+Q+ELIT VA+AAKKPV+LVL
Sbjct: 477 SINQAVKIAKGADQVILVMGLDQTQEKEEQDRVDLVLPGKQRELITAVAKAAKKPVVLVL 536

Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            CGGPVD++FAKYD+NIGSI+WAGYPGEAG  ALA++IFGDHNPG
Sbjct: 537 FCGGPVDVSFAKYDQNIGSIIWAGYPGEAGGTALAQIIFGDHNPG 581


>gi|302141935|emb|CBI19138.3| unnamed protein product [Vitis vinifera]
          Length = 1411

 Score =  912 bits (2357), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 448/582 (76%), Positives = 504/582 (86%), Gaps = 4/582 (0%)

Query: 1   MIFHKLSLVFPLLCLCFTSLLTRVDSTQ-PPFSCDPSNPSTETFPFCKTTLPISQRARDL 59
           M  H L L+     +  T +L  V+STQ PPFSCD SNPST+++ FCKTTLPI  R RDL
Sbjct: 1   MKLHSLLLIN---LIYVTVILVGVESTQSPPFSCDSSNPSTKSYHFCKTTLPIPDRVRDL 57

Query: 60  VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
           VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA  G GI FNGTIR ATSFPQV
Sbjct: 58  VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQV 117

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
           ILTAASFD +LWYRIG+AIG+EARA+YNAGQ  GMTFWAPNINIFRDPRWGRGQETPGED
Sbjct: 118 ILTAASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGED 177

Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
           PLVTG YAVSYVRGVQGD   G K  G+LQASACCKHFTAYDLD+WKG  R+KFDARVTM
Sbjct: 178 PLVTGSYAVSYVRGVQGDCLRGLKRCGELQASACCKHFTAYDLDDWKGIDRFKFDARVTM 237

Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
           QDLADTYQPPF  C+++GRASGIMCAYNRVNG+PSCAD NLL+ TAR++W F GYITSDC
Sbjct: 238 QDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSDC 297

Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
           DAVS+I+D+ G+AK+PEDAVVDVLKAGMDVNCG++L  HTK+AV QKKLPESE+DRAL N
Sbjct: 298 DAVSLIHDSYGFAKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRALEN 357

Query: 360 LFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS 419
           LF+VRMRLGLFNGNP  QP+G IG + VCS  HQ LAL AA+DGIVLLKNS  LLPLPK 
Sbjct: 358 LFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLPKG 417

Query: 420 KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID 479
           K++SLA+IGPNANS KTL+GNYAGP C+ ITPLQALQ+YV++T+Y+PGCD VACSS SI+
Sbjct: 418 KTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVACSSPSIE 477

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
           KAV+IA+ AD+VVL+MGLDQTQE+E  DR+DLVLPG+QQ+LI  VA AAKKPV+LVLL G
Sbjct: 478 KAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVLLSG 537

Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           GPVDI+FAKY  NIGSILWAGYPG AG  A+AE IFGDHNPG
Sbjct: 538 GPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPG 579



 Score =  892 bits (2304), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/556 (76%), Positives = 482/556 (86%), Gaps = 2/556 (0%)

Query: 26   STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
            S+ PPF+CD S+P T+++ FC TTL ISQRA DL+SRLTLDEKISQL++SA +IPRLGIP
Sbjct: 693  SSSPPFACDSSDPLTKSYAFCNTTLRISQRASDLISRLTLDEKISQLISSAASIPRLGIP 752

Query: 86   AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
            AYEWWSEALHG+     GI FNGTIR ATSFPQVILTAASFD++LWYRIGQAIG+E RA+
Sbjct: 753  AYEWWSEALHGIRDR-HGIRFNGTIRSATSFPQVILTAASFDAHLWYRIGQAIGIETRAM 811

Query: 146  YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
            YNAGQA+GMTFWAPNINIFRDPRWGRGQETPGEDP+V GKYAVSYVRG+QGDTF GGK+ 
Sbjct: 812  YNAGQAMGMTFWAPNINIFRDPRWGRGQETPGEDPVVAGKYAVSYVRGLQGDTFEGGKVD 871

Query: 206  GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
              LQASACCKHFTAYDLDNW    RY FDARVTMQDLADTYQPPF SC+++GRASG+MCA
Sbjct: 872  -VLQASACCKHFTAYDLDNWTSIDRYTFDARVTMQDLADTYQPPFRSCIEEGRASGLMCA 930

Query: 266  YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
            YN VNG+P+CAD NLLSKTAR QWGF GYI SDCDAVS+++D +GYAKSPEDAV  VL A
Sbjct: 931  YNLVNGVPNCADFNLLSKTARGQWGFDGYIVSDCDAVSLVHDVQGYAKSPEDAVAIVLTA 990

Query: 326  GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
            GMDV CG +LQKH K+AV QKKL ESEIDRAL NLF+VRMRLGLFNGNP   PFG IG D
Sbjct: 991  GMDVACGGYLQKHAKSAVSQKKLTESEIDRALLNLFTVRMRLGLFNGNPRKLPFGNIGPD 1050

Query: 386  VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
             VCS  HQ LAL+AA+ GIVLLKNS  LLPL K +++SLA+IGPNAN+  TLLGNYAGP 
Sbjct: 1051 QVCSTEHQTLALEAARSGIVLLKNSDRLLPLSKGETLSLAVIGPNANATDTLLGNYAGPP 1110

Query: 446  CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
            C+ I+PLQ LQ+YV NT+Y+ GC+ VACSSASI+ AVD+AK AD+VVL+MGLDQTQE+E+
Sbjct: 1111 CKFISPLQGLQSYVNNTMYHAGCNDVACSSASIENAVDVAKQADYVVLVMGLDQTQEREK 1170

Query: 506  LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
             DR+DLVLPG+Q++LIT VA+AAKKPV+LVLLCGGPVDI+FAK   NIGSILWAGYPGEA
Sbjct: 1171 YDRLDLVLPGKQEQLITGVAKAAKKPVVLVLLCGGPVDISFAKGSSNIGSILWAGYPGEA 1230

Query: 566  GAVALAEVIFGDHNPG 581
            G  A+AE IFGDHNPG
Sbjct: 1231 GGAAIAETIFGDHNPG 1246


>gi|225459350|ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
          Length = 774

 Score =  911 bits (2355), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 448/582 (76%), Positives = 504/582 (86%), Gaps = 4/582 (0%)

Query: 1   MIFHKLSLVFPLLCLCFTSLLTRVDSTQ-PPFSCDPSNPSTETFPFCKTTLPISQRARDL 59
           M  H L L+     +  T +L  V+STQ PPFSCD SNPST+++ FCKTTLPI  R RDL
Sbjct: 1   MKLHSLLLIN---LIYVTVILVGVESTQSPPFSCDSSNPSTKSYHFCKTTLPIPDRVRDL 57

Query: 60  VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
           VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA  G GI FNGTIR ATSFPQV
Sbjct: 58  VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQV 117

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
           ILTAASFD +LWYRIG+AIG+EARA+YNAGQ  GMTFWAPNINIFRDPRWGRGQETPGED
Sbjct: 118 ILTAASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGED 177

Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
           PLVTG YAVSYVRGVQGD   G K  G+LQASACCKHFTAYDLD+WKG  R+KFDARVTM
Sbjct: 178 PLVTGSYAVSYVRGVQGDCLRGLKRCGELQASACCKHFTAYDLDDWKGIDRFKFDARVTM 237

Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
           QDLADTYQPPF  C+++GRASGIMCAYNRVNG+PSCAD NLL+ TAR++W F GYITSDC
Sbjct: 238 QDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSDC 297

Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
           DAVS+I+D+ G+AK+PEDAVVDVLKAGMDVNCG++L  HTK+AV QKKLPESE+DRAL N
Sbjct: 298 DAVSLIHDSYGFAKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRALEN 357

Query: 360 LFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS 419
           LF+VRMRLGLFNGNP  QP+G IG + VCS  HQ LAL AA+DGIVLLKNS  LLPLPK 
Sbjct: 358 LFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLPKG 417

Query: 420 KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID 479
           K++SLA+IGPNANS KTL+GNYAGP C+ ITPLQALQ+YV++T+Y+PGCD VACSS SI+
Sbjct: 418 KTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVACSSPSIE 477

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
           KAV+IA+ AD+VVL+MGLDQTQE+E  DR+DLVLPG+QQ+LI  VA AAKKPV+LVLL G
Sbjct: 478 KAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVLLSG 537

Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           GPVDI+FAKY  NIGSILWAGYPG AG  A+AE IFGDHNPG
Sbjct: 538 GPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPG 579


>gi|15218202|ref|NP_177929.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
 gi|259585708|sp|Q9SGZ5.2|BXL7_ARATH RecName: Full=Probable beta-D-xylosidase 7; Short=AtBXL7; Flags:
           Precursor
 gi|18086336|gb|AAL57631.1| At1g78060/F28K19_32 [Arabidopsis thaliana]
 gi|332197942|gb|AEE36063.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
          Length = 767

 Score =  889 bits (2297), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 419/559 (74%), Positives = 489/559 (87%), Gaps = 1/559 (0%)

Query: 24  VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
           V+S  PP SCDPSNP+T+ + FC+T LPI +RARDLVSRLT+DEKISQLVN+AP IPRLG
Sbjct: 17  VESAPPPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLG 76

Query: 84  IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
           +PAYEWWSEALHGVA  G GI FNGT++ ATSFPQVILTAASFDSY W+RI Q IG EAR
Sbjct: 77  VPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEAR 136

Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
            +YNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDP++TG YAV+YVRG+QGD+F+G K
Sbjct: 137 GVYNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRK 196

Query: 204 -LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
            L   LQASACCKHFTAYDLD WKG TRY F+A+V++ DLA+TYQPPF+ C+++GRASGI
Sbjct: 197 TLSNHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGI 256

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           MCAYNRVNGIPSCAD NLL++TAR QW F GYITSDCDAVSIIYDA+GYAKSPEDAV DV
Sbjct: 257 MCAYNRVNGIPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYDAQGYAKSPEDAVADV 316

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           LKAGMDVNCGS+LQKHTK+A++QKK+ E++IDRAL NLFSVR+RLGLFNG+PT  P+G I
Sbjct: 317 LKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNI 376

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
             + VCSPAHQ LAL AA++GIVLLKN+  LLP  K    SLA+IGPNA+  KTLLGNYA
Sbjct: 377 SPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVVKTLLGNYA 436

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
           GP C+++TPL AL++YV+N VY+ GCD+VACS+A+ID+AV IAK ADHVVL+MGLDQTQE
Sbjct: 437 GPPCKTVTPLDALRSYVKNAVYHQGCDSVACSNAAIDQAVAIAKNADHVVLIMGLDQTQE 496

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
           KE+ DRVDL LPG+QQELIT VA AAKKPV+LVL+CGGPVDI+FA  +  IGSI+WAGYP
Sbjct: 497 KEDFDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNKIGSIIWAGYP 556

Query: 563 GEAGAVALAEVIFGDHNPG 581
           GEAG +A++E+IFGDHNPG
Sbjct: 557 GEAGGIAISEIIFGDHNPG 575


>gi|297842585|ref|XP_002889174.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335015|gb|EFH65433.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 766

 Score =  883 bits (2282), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 418/559 (74%), Positives = 490/559 (87%), Gaps = 1/559 (0%)

Query: 24  VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
           V+S  PP SCDPSNP+T+ + FC+T LPISQRARDLVSRL +DEKISQL N+AP IPRLG
Sbjct: 16  VESAPPPHSCDPSNPTTKLYQFCRTDLPISQRARDLVSRLNIDEKISQLGNTAPGIPRLG 75

Query: 84  IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
           +PAYEWWSEALHGVA  G GI FNGT++ ATSFPQVILTAASFDSY W+RI Q IG EAR
Sbjct: 76  VPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEAR 135

Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
            +YNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDP++TG YAV+YVRG+QGD+F+G K
Sbjct: 136 GVYNAGQAQGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGTYAVAYVRGLQGDSFDGRK 195

Query: 204 -LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
            L   LQASACCKHFTAYDLD WKG TRY F+A+V++ DLA+TYQPPF+ C+++GRASGI
Sbjct: 196 TLSIHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCIEEGRASGI 255

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           MCAYNRVNGIPSCAD NLL++TAR  W F GYITSDCDAVSII+DA+GYAK+PEDAV DV
Sbjct: 256 MCAYNRVNGIPSCADPNLLTRTARGLWRFRGYITSDCDAVSIIHDAQGYAKTPEDAVADV 315

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           LKAGMDVNCGS+LQKHTK+A++QKK+ E++IDRAL NLFSVR+RLGLFNG+PT  P+G I
Sbjct: 316 LKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGDPTKLPYGNI 375

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
             + VCSPAHQ LAL+AA++GIVLLKN+  LLP  K    SLA+IGPNA+ AKTLLGNYA
Sbjct: 376 SPNDVCSPAHQALALEAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHVAKTLLGNYA 435

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
           GP C+++TPL AL++YV+N VY+ GCD+VACS+A+ID+AV IA+ ADHVVL+MGLDQTQE
Sbjct: 436 GPPCKTVTPLDALRSYVKNAVYHNGCDSVACSNAAIDQAVAIARNADHVVLIMGLDQTQE 495

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
           KE++DRVDL LPG+QQELIT VA AAKKPV+LVL+CGGPVDI+FA  +  IGSI+WAGYP
Sbjct: 496 KEDMDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFATNNDKIGSIMWAGYP 555

Query: 563 GEAGAVALAEVIFGDHNPG 581
           GEAG +ALAE+IFGDHNPG
Sbjct: 556 GEAGGIALAEIIFGDHNPG 574


>gi|449465962|ref|XP_004150696.1| PREDICTED: probable beta-D-xylosidase 7-like [Cucumis sativus]
          Length = 783

 Score =  880 bits (2273), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 419/578 (72%), Positives = 484/578 (83%)

Query: 4   HKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
           HK+ L+  LL   F SL+    S+QPP++CD SNP T+T PFCKT LPI  RARDLVSRL
Sbjct: 11  HKIKLLTLLLSAAFLSLIVAGSSSQPPYACDSSNPLTKTLPFCKTYLPIKLRARDLVSRL 70

Query: 64  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
           TLDEK+ QLVN+ P IPRLGIPAYEWWSEALHGVA VG GI  NGTI  ATSFPQVILTA
Sbjct: 71  TLDEKVLQLVNTVPPIPRLGIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTA 130

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
           ASFD  LWY+IGQAIG EARA+YNAGQA GMTFW PNINIFRDPRWGRGQETPGEDPL+T
Sbjct: 131 ASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMT 190

Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
           GKY+V+YVRG+QGD   GGKL  +L+ASACCKHFTAYDLD W G TRY FDA+VTMQD+A
Sbjct: 191 GKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMA 250

Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
           DTYQPPFESCV++G+ASGIMCAYNRVNG+PSCAD +LL+ TAR+QW F+GYITSDCDAVS
Sbjct: 251 DTYQPPFESCVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVS 310

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
           II+DA+GYAK PEDAV DVL+AGMDVNCG++L++HTK+AV+ KK+P   IDRAL NLFSV
Sbjct: 311 IIHDAQGYAKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSV 370

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
           RMRLGLF+GNPT  PFG+IG D VCS  HQ LALQAA++GIVLLKNS  LLPL KS + S
Sbjct: 371 RMRLGLFDGNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHS 430

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
           LA+IG N N  KTL GNYAG  C+S TP Q L NYV+NTVY+ GC+   C+ A+I +AV 
Sbjct: 431 LAVIGHNGNDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANCTEATIYQAVK 490

Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
           IAK  D+VVL+MGLDQTQE+E+ DR +L LPG+Q +LI  VA+AAK+PVILV+L GGPVD
Sbjct: 491 IAKSVDYVVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKRPVILVILSGGPVD 550

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           I+ AKY+  IGSILWAGYPG+AG  A+AE+IFGDHNPG
Sbjct: 551 ISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPG 588


>gi|449508468|ref|XP_004163321.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 7-like
           [Cucumis sativus]
          Length = 783

 Score =  878 bits (2269), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 419/578 (72%), Positives = 483/578 (83%)

Query: 4   HKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
           HK+ L+  LL   F SL+    S+QPP++CD SNP T+T PFCKT LPI  RARDLVSRL
Sbjct: 11  HKIKLLTLLLSAAFLSLIVAGSSSQPPYACDSSNPLTKTLPFCKTYLPIKLRARDLVSRL 70

Query: 64  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
           TLDEK+ QLVN+ P IPRLGIPAYEWWSEALHGVA VG GI  NGTI  ATSFPQVILTA
Sbjct: 71  TLDEKVLQLVNTVPPIPRLGIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTA 130

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
           ASFD  LWY+IGQAIG EARA+YNAGQA GMTFW PNINIFRDPRWGRGQETPGEDPL+T
Sbjct: 131 ASFDENLWYQIGQAIGTEARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMT 190

Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
           GKY+V+YVRG+QGD   GGKL  +L+ASACCKHFTAYDLD W G TRY FDA+VTMQD+A
Sbjct: 191 GKYSVAYVRGIQGDAIEGGKLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMA 250

Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
           DTYQPPFESCV++G+ASGIMCAYNRVNG+PSCAD +LL+ TAR+QW F+GYITSDCDAVS
Sbjct: 251 DTYQPPFESCVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVS 310

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
           II+DA+GYAK PEDAV DVL+AGMDVNCG++L++HTK+AV+ KK+P   IDRAL NLFSV
Sbjct: 311 IIHDAQGYAKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSV 370

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
           RMRLGLF+GNPT  PFG+IG D VCS  HQ LALQAA++GIVLLKNS  LLPL KS + S
Sbjct: 371 RMRLGLFDGNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHS 430

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
           LA+IG N N  KTL GNYAG  C+S TP Q L NYV+NTVY+ GC+   C+ A+I +AV 
Sbjct: 431 LAVIGHNGNDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANCTEATIYQAVK 490

Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
           IAK  D+VVL+MGLDQTQE+E+ DR +L LPG+Q +LI  VA+AAK PVILV+L GGPVD
Sbjct: 491 IAKSVDYVVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKXPVILVILSGGPVD 550

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           I+ AKY+  IGSILWAGYPG+AG  A+AE+IFGDHNPG
Sbjct: 551 ISSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPG 588


>gi|358349509|ref|XP_003638778.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355504713|gb|AES85916.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 776

 Score =  874 bits (2258), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 411/577 (71%), Positives = 482/577 (83%), Gaps = 3/577 (0%)

Query: 7   SLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLD 66
           +  F  +   F +L   V +  PPF+CD SNPST ++PFC   LPI+QR +DLVSRLTLD
Sbjct: 6   TFTFVTIISLFLTLTYSVLAQLPPFACDYSNPSTRSYPFCNPKLPITQRTKDLVSRLTLD 65

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
           EK++QLVNSAP IPRLGIPAYEWWSEALHG+  VG+GIFFNG+I  ATSFPQVILTAASF
Sbjct: 66  EKLAQLVNSAPPIPRLGIPAYEWWSEALHGIGNVGRGIFFNGSITSATSFPQVILTAASF 125

Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
           DS+LWYRIGQAIG+EARA+YN GQA+GMTFWAPNINIFRDPRWGRGQET GEDP++T  Y
Sbjct: 126 DSHLWYRIGQAIGVEARAIYNGGQAMGMTFWAPNINIFRDPRWGRGQETAGEDPMMTSNY 185

Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
           AVSYVRG+QGD+F GGKL+G LQASACCKHFTAYDLDNWKG  R+ FDARV++QDLADTY
Sbjct: 186 AVSYVRGLQGDSFQGGKLRGHLQASACCKHFTAYDLDNWKGVNRFHFDARVSLQDLADTY 245

Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
           QPPF SC++QGRASGIMCAYNRVNGIPSCAD NLL+ T R+QW FHGYI SDC AV II+
Sbjct: 246 QPPFRSCIEQGRASGIMCAYNRVNGIPSCADFNLLTNTVRKQWEFHGYIVSDCGAVGIIH 305

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMR 366
           D +GYAKS EDAV DVL AGMD+ CGS+L  H K+AV+QKKLP   IDRALHNLFS+R+R
Sbjct: 306 DEQGYAKSAEDAVADVLHAGMDLECGSYLTDHAKSAVQQKKLPIVRIDRALHNLFSIRIR 365

Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
           LG F+GNP   PFG IG + VCS  H  LAL+AA++GIVLLKN+  LLPLPK+ S+SLA+
Sbjct: 366 LGQFDGNPAKLPFGMIGPNHVCSENHLYLALEAARNGIVLLKNTASLLPLPKT-SISLAV 424

Query: 427 IGPNANSAK-TLLGNYAGPSCRSITPLQALQNYVENTVYYPGCD-TVACSSASIDKAVDI 484
           IGPNAN++  TLLGNYAGP C+SIT LQ  Q+YV+N V++PGCD    C+SA IDKAV +
Sbjct: 425 IGPNANASPLTLLGNYAGPPCKSITILQGFQHYVKNAVFHPGCDGGPKCASAPIDKAVKV 484

Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
           AK AD+VVL+MGLDQ+ E+EE DRV L LPG+Q ELI  VA+A+K+PVILVLLCGGP+DI
Sbjct: 485 AKNADYVVLVMGLDQSVEREERDRVHLDLPGKQLELINSVAKASKRPVILVLLCGGPIDI 544

Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           + AK +  IG I+WAGYPGE G +ALA++IFGDHNPG
Sbjct: 545 SSAKNNDKIGGIIWAGYPGELGGIALAQIIFGDHNPG 581


>gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum lycopersicum]
          Length = 775

 Score =  854 bits (2207), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 401/582 (68%), Positives = 481/582 (82%), Gaps = 3/582 (0%)

Query: 1   MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
           M  H  +L+  +L     SL++ V STQPPFSCD SNP T++  FC+T LPIS R  DLV
Sbjct: 1   MKLHISTLITTILISL--SLVSIVQSTQPPFSCDSSNPQTKSLKFCQTGLPISVRVLDLV 58

Query: 61  SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVI 120
           SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE+LHGV   GKGIFFNG+I GATSFPQVI
Sbjct: 59  SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSESLHGVGSAGKGIFFNGSIAGATSFPQVI 118

Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
           LTAA+FD  LWYRIGQ IG+EAR +YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP
Sbjct: 119 LTAATFDENLWYRIGQVIGVEARGVYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 178

Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLK-GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
           ++TGKYA+ YVRGVQGD+FNGG+LK G LQASACCKHFTAYDLD WK   R+ F+A VT 
Sbjct: 179 IMTGKYAIRYVRGVQGDSFNGGQLKKGHLQASACCKHFTAYDLDQWKNLDRFSFNAIVTP 238

Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
           QD+ADT+QPPF+ C+++ +ASGIMC+YN VNGIPSCA+ NLL+KTAR+QWGFHGYITSDC
Sbjct: 239 QDMADTFQPPFQDCIQKAQASGIMCSYNSVNGIPSCANYNLLTKTARQQWGFHGYITSDC 298

Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
           DAV +++D   Y  +PED+    LKAGMD++CG +L+K+TK+AV +KK+ +  IDRALHN
Sbjct: 299 DAVQVMHDNHRYGNTPEDSTAFALKAGMDIDCGDYLKKYTKSAVMKKKVSQVHIDRALHN 358

Query: 360 LFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS 419
           LFS+RMRLGLFNG+P  Q +G I    VC+P HQ LAL+AA++GIVLLKN+  LLPL K+
Sbjct: 359 LFSIRMRLGLFNGDPRKQLYGNISPSQVCAPQHQQLALEAARNGIVLLKNTGKLLPLSKA 418

Query: 420 KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID 479
           K+ SLA+IG NAN+A  L GNY GP C+ I  L+AL  Y ++  Y  GC+   C+SA+ID
Sbjct: 419 KTNSLAVIGHNANNAYILRGNYDGPPCKYIEILKALVGYAKSVQYQQGCNAANCTSANID 478

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
           +AV+IA+ AD+VVL+MGLDQTQE+E+ DR DLVLPG+Q+ LI  VA+AAKKPVILV+L G
Sbjct: 479 QAVNIARNADYVVLIMGLDQTQEREQFDRDDLVLPGQQENLINSVAKAAKKPVILVILSG 538

Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           GPVDI+FAKY+  IGSILWAGYPGEAG +ALAE+IFG+HNPG
Sbjct: 539 GPVDISFAKYNPKIGSILWAGYPGEAGGIALAEIIFGEHNPG 580


>gi|356515806|ref|XP_003526589.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 772

 Score =  834 bits (2154), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 403/565 (71%), Positives = 471/565 (83%), Gaps = 11/565 (1%)

Query: 24  VDSTQP--PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR 81
           V S++P  PF+CD SNPS+ ++PFC   LPI QR +DL+SRLTLDEK+SQLVN+AP IPR
Sbjct: 17  VQSSKPEAPFACDFSNPSSRSYPFCNPKLPIPQRTKDLLSRLTLDEKLSQLVNTAPPIPR 76

Query: 82  LGIPAYEWWSEALHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIG 139
           LGIPAY+WWSEALHGV+GVG GI F  N TI  ATSFPQVILTAASFDS LWYRIG AIG
Sbjct: 77  LGIPAYQWWSEALHGVSGVGPGILFDNNSTISSATSFPQVILTAASFDSRLWYRIGHAIG 136

Query: 140 LEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
           +EARA++NAGQA G+TFWAPNINIFRDPRWGRGQET GEDPL+T +YAVS+VRG+QGD+F
Sbjct: 137 IEARAIFNAGQANGLTFWAPNINIFRDPRWGRGQETAGEDPLLTSRYAVSFVRGLQGDSF 196

Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
            G      L ASACCKHFTAYDLDNWKG  R+ FDARV++QDLADTYQPPF+SCV+QGRA
Sbjct: 197 KG----AHLLASACCKHFTAYDLDNWKGVDRFVFDARVSLQDLADTYQPPFQSCVQQGRA 252

Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
           SGIMCAYNRVNG+P+CAD  LL++TAR QW F+GYITSDC AV  I+D + YAKSPED V
Sbjct: 253 SGIMCAYNRVNGVPNCADYGLLTQTARNQWDFNGYITSDCGAVGFIHDRQRYAKSPEDVV 312

Query: 320 VDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
            DVL+AGMD+ CGS+L  H K+AV QKKL  SEIDRAL NLFS+RMRLGLF+GNPT   F
Sbjct: 313 ADVLRAGMDLECGSYLTYHAKSAVLQKKLGMSEIDRALQNLFSIRMRLGLFDGNPTRLSF 372

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK-SKSVSLALIGPNANSAK-TL 437
           G IG++ VCS  HQ LAL+AA++GIVLLKNS  LLPLPK S S+SLA+IGPNANS+  TL
Sbjct: 373 GLIGSNHVCSKEHQYLALEAARNGIVLLKNSPTLLPLPKTSPSISLAVIGPNANSSPLTL 432

Query: 438 LGNYAGPSCRSITPLQALQNYVENTVYYPGCD-TVACSSASIDKAVDIAKGADHVVLMMG 496
           LGNYAGP C+ +T LQ  ++YV+N  Y+PGCD    CSSA ID+AV++AK  D+VVL+MG
Sbjct: 433 LGNYAGPPCKYVTILQGFRHYVKNAFYHPGCDGGPKCSSAQIDQAVEVAKKVDYVVLVMG 492

Query: 497 LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
           LDQ++E+EE DRV L LPG+Q ELI  VAEA+KKPVILVLL GGP+DIT AKY+  IG I
Sbjct: 493 LDQSEEREERDRVHLDLPGKQLELINGVAEASKKPVILVLLSGGPLDITSAKYNHKIGGI 552

Query: 557 LWAGYPGEAGAVALAEVIFGDHNPG 581
           LWAGYPGE G +ALA++IFGDHNPG
Sbjct: 553 LWAGYPGELGGIALAQIIFGDHNPG 577


>gi|356552866|ref|XP_003544783.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 776

 Score =  824 bits (2128), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 390/572 (68%), Positives = 468/572 (81%), Gaps = 4/572 (0%)

Query: 12  LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
            +     +L    +STQPP+SCD S+ ++  +PFC T LPIS+RA+DLVSRLTLDEK++Q
Sbjct: 11  FISFLLLTLHHHAESTQPPYSCDSSS-NSPYYPFCNTRLPISKRAQDLVSRLTLDEKLAQ 69

Query: 72  LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
           LVN+APAIPRLGIP+Y+WWSEALHGVA  G GI FNGTI+ ATSFPQVILTAASFD  LW
Sbjct: 70  LVNTAPAIPRLGIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASFDPNLW 129

Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
           Y+I + IG EARA+YNAGQA GMTFWAPNIN+FRDPRWGRGQET GEDPL+  KY V+YV
Sbjct: 130 YQISKTIGKEARAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYV 189

Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
           RG+QGD+F GGKL  +LQASACCKHFTAYDLD+WKG  R+ +DARVT QDLADTYQPPF+
Sbjct: 190 RGLQGDSFEGGKLGERLQASACCKHFTAYDLDHWKGLDRFVYDARVTSQDLADTYQPPFQ 249

Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
           SC++QGRASGIMCAYNRVNG+P+CA+ NLL+KTAR+QW F GYITSDC AVSII+D +GY
Sbjct: 250 SCIEQGRASGIMCAYNRVNGVPNCANFNLLTKTARQQWKFDGYITSDCGAVSIIHDEQGY 309

Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
           AK+ EDA+ DV +AGMDV CG ++ KH K+AV QKKLP S+IDRAL NLFS+R+RLGL +
Sbjct: 310 AKTAEDAIADVFRAGMDVECGDYITKHGKSAVSQKKLPISQIDRALQNLFSIRIRLGLLD 369

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
           GNPT  PFG IG D VCS     LAL+AA+DGIVLLKN++ LLPLPK+   ++ALIGPNA
Sbjct: 370 GNPTKLPFGTIGPDQVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKTNP-TIALIGPNA 428

Query: 432 N-SAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDT-VACSSASIDKAVDIAKGAD 489
           N S+K  LGNY G  C  +T LQ  + Y ++TVY+PGCD    C+ A I+ AV++AK  D
Sbjct: 429 NASSKVFLGNYYGRPCNLVTLLQGFEGYAKDTVYHPGCDDGPQCAYAQIEGAVEVAKKVD 488

Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
           +VVL+MGLDQ+QE+E  DR  L LPG+Q+ELI  VA A+K+PV+LVLLCGGPVDIT AK+
Sbjct: 489 YVVLVMGLDQSQERESHDREYLGLPGKQEELIKSVARASKRPVVLVLLCGGPVDITSAKF 548

Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           D  +G ILWAGYPGE G VALA+V+FGDHNPG
Sbjct: 549 DDKVGGILWAGYPGELGGVALAQVVFGDHNPG 580


>gi|371917284|dbj|BAL44718.1| SlArf/Xyl3 [Solanum lycopersicum]
          Length = 777

 Score =  815 bits (2106), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/581 (65%), Positives = 463/581 (79%), Gaps = 3/581 (0%)

Query: 2   IFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVS 61
           I +K +++F  + L    L  R +ST+PPFSCD SNP+T ++PFC   LPI QR  DLVS
Sbjct: 5   ITNKFTVMFIFVILVL--LFRRTESTKPPFSCDSSNPNTSSYPFCNAALPIPQRVNDLVS 62

Query: 62  RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
           RLT+DEKI QLVN AP IPRLGI AYEWWSE LHG++  GKG  FNGTI+ AT FPQ+IL
Sbjct: 63  RLTVDEKILQLVNGAPEIPRLGISAYEWWSEGLHGISRHGKGTLFNGTIKAATQFPQIIL 122

Query: 122 TAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL 181
           TA+SFD  LWYRI QAIG EARA+YNAGQ  G+T WAPNINI RDPRWGRGQETPGEDP+
Sbjct: 123 TASSFDENLWYRIAQAIGREARAVYNAGQLKGITLWAPNINILRDPRWGRGQETPGEDPM 182

Query: 182 VTGKYAVSYVRGVQGDTFNGGKLK-GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
           + GKY V+YVRG+QGD+F GGKLK G LQ SACCKHF A D+DNW   +RY FDA+V  Q
Sbjct: 183 MVGKYGVAYVRGLQGDSFEGGKLKDGHLQTSACCKHFIAQDMDNWHNFSRYTFDAQVLKQ 242

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           DLAD+Y+PPF+ CV+QG+AS +MCAYN VNGIP+CA+ +LL+ TAR +WG  GYI SDCD
Sbjct: 243 DLADSYEPPFKDCVEQGKASSVMCAYNLVNGIPNCANFDLLTTTARGKWGLQGYIVSDCD 302

Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNL 360
           AV  +Y  + YAK PEDAV   LKAGMDVNCGS L+ +TK+A++++K+ ES+IDRALHNL
Sbjct: 303 AVDKMYSEQHYAKEPEDAVAATLKAGMDVNCGSHLKTYTKSALEKQKVKESDIDRALHNL 362

Query: 361 FSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
           FSVRMRLGLFNG+P+   +G I A  VCS  H+ LA++AA+ G VLLKNS+ LLPL K K
Sbjct: 363 FSVRMRLGLFNGDPSKLEYGDISAAEVCSEEHRALAVEAARSGSVLLKNSNRLLPLSKMK 422

Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDK 480
           + SLA+IGP AN ++ LLGNY G SC+++T  Q LQ YV NT+Y+PGCD + C+S +ID+
Sbjct: 423 TASLAVIGPKANDSEVLLGNYEGFSCKNVTLFQGLQGYVANTMYHPGCDFINCTSPAIDE 482

Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
           AV+IAK AD+VVL+MGLDQT E+E+ DR +L LPG Q++LIT +AEAA KPVILVL+CGG
Sbjct: 483 AVNIAKKADYVVLVMGLDQTLEREKFDRTELGLPGMQEKLITSIAEAASKPVILVLMCGG 542

Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           PVD+TFAK +  IG ILW GYPGE GA ALA+++FG+HNPG
Sbjct: 543 PVDVTFAKDNPKIGGILWVGYPGEGGAAALAQILFGEHNPG 583


>gi|356548162|ref|XP_003542472.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 778

 Score =  814 bits (2103), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/559 (69%), Positives = 464/559 (83%), Gaps = 5/559 (0%)

Query: 25  DSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGI 84
           +ST+PP+SCD S+ ++  + FC T LPI++RA+DLVSRLTLDEK++QLVN+APAIPRLGI
Sbjct: 25  ESTRPPYSCDSSS-NSPYYSFCNTKLPITKRAQDLVSRLTLDEKLAQLVNTAPAIPRLGI 83

Query: 85  PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
           P+Y+WWSEALHGVA  G GI FNGTI+ ATSFPQVILTAASFD  LWY+I + IG EARA
Sbjct: 84  PSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGREARA 143

Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
           +YNAGQA GMTFWAPNIN+FRDPRWGRGQET GEDPL+  KY V+YVRG+QGD+F GGKL
Sbjct: 144 VYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQGDSFEGGKL 203

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
             +LQASACCKHFTAYDLD WKG  R+ FDARVT QDLADTYQPPF+SC++QGRASGIMC
Sbjct: 204 AERLQASACCKHFTAYDLDQWKGLDRFVFDARVTSQDLADTYQPPFQSCIEQGRASGIMC 263

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
           AYNRVNG+P+CAD NLL+KTAR+QW F GYITSDC AVSII++ +GYAK+ EDA+ DV +
Sbjct: 264 AYNRVNGVPNCADFNLLTKTARQQWKFDGYITSDCGAVSIIHEKQGYAKTAEDAIADVFR 323

Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
           AGMDV CG ++ KH K+AV QKKLP S+IDRAL NLFS+R+RLGLF+GNPT  PFG IG 
Sbjct: 324 AGMDVECGDYITKHAKSAVFQKKLPISQIDRALQNLFSIRIRLGLFDGNPTKLPFGTIGP 383

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN-SAKTLLGNYAG 443
           + VCS     LAL+AA+DGIVLLKN++ LLPLPK+   ++ALIGPNAN S+K  LGNY G
Sbjct: 384 NEVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKTNP-TIALIGPNANASSKVFLGNYYG 442

Query: 444 PSCRSITPLQALQNYVENTVYYPGCDT-VACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
             C  +T LQ  + Y + TVY+PGCD    C+ A I++AV++AK  D+VVL+MGLDQ+QE
Sbjct: 443 RPCNLVTLLQGFEGYAK-TVYHPGCDDGPQCAYAQIEEAVEVAKKVDYVVLVMGLDQSQE 501

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
           +E  DR  L LPG+Q+ELI  VA AAK+PV++VLLCGGPVDIT AK+D  +G ILWAGYP
Sbjct: 502 RESHDREYLGLPGKQEELIKSVARAAKRPVVVVLLCGGPVDITSAKFDDKVGGILWAGYP 561

Query: 563 GEAGAVALAEVIFGDHNPG 581
           GE G VALA+V+FGDHNPG
Sbjct: 562 GELGGVALAQVVFGDHNPG 580


>gi|357489431|ref|XP_003615003.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516338|gb|AES97961.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 780

 Score =  810 bits (2091), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/582 (65%), Positives = 465/582 (79%), Gaps = 8/582 (1%)

Query: 6   LSLVFPLLCLCFTSLLTRVDS---TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSR 62
           +++ F  L L     L   DS     PP+SCD SNP T++FPFC   L I+QRA+D+VSR
Sbjct: 6   ITITFIFLFLTRYHRLVHADSLATNVPPYSCDTSNPLTKSFPFCNLNLTITQRAKDIVSR 65

Query: 63  LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
           LTLDEKISQLVN+APAIPRLGIP+Y+WW+EALHGV+ VGKGI  NG+I  ATSFPQ+IL 
Sbjct: 66  LTLDEKISQLVNTAPAIPRLGIPSYQWWNEALHGVSYVGKGIRLNGSITAATSFPQIILI 125

Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLV 182
           AASFD  LWYRI + IG EAR +YNAGQA GMTFWAPNINIFRDPRWGRGQET GEDPLV
Sbjct: 126 AASFDPKLWYRISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLV 185

Query: 183 TGKYAVSYVRGVQGDTFNGGKL-KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQD 241
             KY VSYVRG+QGD+F GGKL  G+L+ASACCKHFTAYDL+NWKG  RY FDA+VT+QD
Sbjct: 186 NSKYGVSYVRGLQGDSFEGGKLIGGRLKASACCKHFTAYDLENWKGVNRYVFDAKVTLQD 245

Query: 242 LADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDA 301
           LADTYQP F SCV QGR+SGIMCAYNRVNG+P+CAD NLL+ TAR++W F+GYI SDCDA
Sbjct: 246 LADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLLTNTARKKWNFNGYIASDCDA 305

Query: 302 VSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLF 361
           V  IY+ +GYAK+PED V DVL+AGMDV CG+++ KH K+AV QKK+P S+IDRALHNLF
Sbjct: 306 VRFIYEKQGYAKTPEDVVADVLRAGMDVECGNYMTKHAKSAVLQKKIPISQIDRALHNLF 365

Query: 362 SVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS 421
           ++R+RLGLF+GNPT   +G+IG + VCS  +  LAL+AA+ GIVLLKN+  +LPLP+   
Sbjct: 366 TIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTASILPLPRVN- 424

Query: 422 VSLALIGPNAN-SAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC-DTVACSSASID 479
            +L +IGPNAN S+  LLGNY G  C+ ++ L+    Y   T Y  GC D V C+SA ID
Sbjct: 425 -TLGVIGPNANKSSIVLLGNYFGQPCKQVSILKGFYTYASQTHYRSGCTDGVKCASAEID 483

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
           +AV++AK +D+V+L+MGLDQ+QE E LDR  L LPG+QQ+LI  VA+A+KKPVILV+LCG
Sbjct: 484 RAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQKLINSVAKASKKPVILVILCG 543

Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           GPVDITFAK +  IG I+WAGYPGE G  ALA+V+FGD+NPG
Sbjct: 544 GPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPG 585


>gi|357489463|ref|XP_003615019.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
 gi|355516354|gb|AES97977.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
          Length = 785

 Score =  809 bits (2090), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/581 (66%), Positives = 468/581 (80%), Gaps = 8/581 (1%)

Query: 8   LVFPLLCLCFTSLLTRVDSTQ--PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTL 65
           + F  L L   + L   DS    PP+SCD +NP T+++ FC   L   QRA+D+VSRLTL
Sbjct: 11  ISFIFLFLTRYNQLVHADSPTLVPPYSCDITNPLTKSYTFCNLNLTTIQRAKDIVSRLTL 70

Query: 66  DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNG--TIRGATSFPQVILTA 123
           DEK++QLVN+APAIPRLGI +Y+WWSEALHGVA  GKGI  NG  TI+ AT FPQVILTA
Sbjct: 71  DEKLAQLVNTAPAIPRLGIHSYQWWSEALHGVADYGKGIRLNGNVTIKAATIFPQVILTA 130

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
           ASFDS LWYRI + IG EARA+YNAGQA GMTFWAPNINIFRDPRWGRGQET GEDPLV+
Sbjct: 131 ASFDSKLWYRISKVIGTEARAVYNAGQAEGMTFWAPNINIFRDPRWGRGQETAGEDPLVS 190

Query: 184 GKYAVSYVRGVQGDTFNGGKL-KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            KYAVS+VRG+QGD+F GGKL + +L+ASACCKHFTAYDLDNWKG  R+ FDA VT+QDL
Sbjct: 191 AKYAVSFVRGLQGDSFEGGKLNEDRLKASACCKHFTAYDLDNWKGVDRFDFDANVTLQDL 250

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
           ADTYQPPF SC+ QGR+SGIMCAYNRVNGIP+CAD NLL+ TAR++W F+GYITSDC AV
Sbjct: 251 ADTYQPPFHSCIVQGRSSGIMCAYNRVNGIPNCADYNLLTNTARKKWNFNGYITSDCSAV 310

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
            II+D +GYAK+PEDAV DVL+AGMDV CG +   H+K+AV QKK+P S+IDRALHNLFS
Sbjct: 311 DIIHDRQGYAKAPEDAVADVLQAGMDVECGDYFTSHSKSAVLQKKVPISQIDRALHNLFS 370

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
           +R+RLGLF+G+PT   +GKIG + VCS  +  +AL+AA+ GIVLLKN+  +LPLPKS   
Sbjct: 371 IRIRLGLFDGHPTKLKYGKIGPNRVCSKQNLNIALEAARSGIVLLKNAASILPLPKSTD- 429

Query: 423 SLALIGPNAN-SAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC-DTVACSSASIDK 480
           S+ +IGPNAN S++ +LGNY G  C  +T LQ  +NY +N +Y+PGC D   C SA ID+
Sbjct: 430 SIVVIGPNANSSSQVVLGNYFGRPCNLVTILQGFENYSDNLLYHPGCSDGTKCVSAEIDR 489

Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
           AV++AK  D+VVL+MGLDQ+QE E  DR DL LPG+QQELI  VA+A+K+PVILVL CGG
Sbjct: 490 AVEVAKVVDYVVLVMGLDQSQESEGHDRDDLELPGKQQELINSVAKASKRPVILVLFCGG 549

Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           PVDI+FAK D  IG ILWAGYPGE G +ALA+V+FGD+NPG
Sbjct: 550 PVDISFAKVDDKIGGILWAGYPGELGGMALAQVVFGDYNPG 590


>gi|357489441|ref|XP_003615008.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516343|gb|AES97966.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 798

 Score =  808 bits (2086), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 391/595 (65%), Positives = 467/595 (78%), Gaps = 23/595 (3%)

Query: 8   LVFPLLCLCFTSLLTRVDSTQ--PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTL 65
           + F  L L     L   DS    PP+SCD +NP T++ PFC   L I+QRA+D+VSRLTL
Sbjct: 11  ISFIFLFLTRYHRLVHADSPTHVPPYSCDTTNPLTKSLPFCNLNLTITQRAKDIVSRLTL 70

Query: 66  DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAAS 125
           DEKISQLVN+AP+IPRLGIP+Y+WW EALHGVA  GKGI  NG++ GATSFPQVILTAAS
Sbjct: 71  DEKISQLVNTAPSIPRLGIPSYQWWDEALHGVANAGKGIRLNGSVAGATSFPQVILTAAS 130

Query: 126 FDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
           FDS LWY+I + IG EAR +YNAGQA GMTFWAPNINIFRDPRWGRGQET GEDPLV  K
Sbjct: 131 FDSKLWYQISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVNSK 190

Query: 186 YAVSYVRGVQGDTFNGGKLKG-KLQASACCKHFTAYDLDNWKGTTRYKFDAR-------- 236
           Y VSYVRG+QGD+F GGKL G +L+ASACCKHFTAYDLDNWKG  R+ FDA+        
Sbjct: 191 YGVSYVRGLQGDSFEGGKLIGDRLKASACCKHFTAYDLDNWKGLDRFDFDAKVSFLFSMA 250

Query: 237 --------VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
                   VT+QDLADTYQPPF SC+ QGR+SGIMCAYNRVNG+P+CAD NLL+KTAR++
Sbjct: 251 YSPWMINYVTLQDLADTYQPPFHSCIVQGRSSGIMCAYNRVNGVPNCADYNLLTKTARQK 310

Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKL 348
           W F+GYITSDC+AV IIYD +GYAK+PEDAV DVL+AGMDV CG +L KH KAAV QKK+
Sbjct: 311 WNFNGYITSDCEAVRIIYDNQGYAKTPEDAVADVLQAGMDVECGDYLTKHAKAAVLQKKV 370

Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
           P S+IDRALHNLF++R+RLGLF+GNPT   +G+IG + VCS  +  LAL+AA+ GIVLLK
Sbjct: 371 PISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLK 430

Query: 409 NSHGLLPLPKSKSVSLALIGPNAN-SAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPG 467
           N+  +LPLP+    +L +IGPNAN S+K +LGNY G  CR +  L+    Y   T Y  G
Sbjct: 431 NTASILPLPRVN--TLGVIGPNANKSSKVVLGNYFGRPCRLVPILKGFYTYASQTHYRSG 488

Query: 468 C-DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAE 526
           C D   C+SA ID+AV++AK +D+V+L+MGLDQ+QE+E  DR DL LPG+QQELI  VA+
Sbjct: 489 CLDGTKCASAEIDRAVEVAKISDYVILVMGLDQSQERESRDRDDLELPGKQQELINSVAK 548

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           A+KKPVILVLLCGGPVDITFAK +  IG I+WAGYPGE G  ALA+V+FGD+NPG
Sbjct: 549 ASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPG 603


>gi|242062502|ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
 gi|241932371|gb|EES05516.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
          Length = 784

 Score =  801 bits (2068), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/556 (67%), Positives = 452/556 (81%), Gaps = 2/556 (0%)

Query: 26  STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
           +++PP++C    P     PFC T LPI +R  DLVSRLT+ EKISQL + +PAIPRLG+P
Sbjct: 35  ASEPPYTCGAGAPPN--IPFCDTALPIDRRVDDLVSRLTVAEKISQLGDESPAIPRLGVP 92

Query: 86  AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
           AY+WWSEALHGVA  G+GI  +G +R ATSFPQVILTAASF+ +LWYRIGQ IG+EARA+
Sbjct: 93  AYKWWSEALHGVANAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEARAV 152

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
           YN GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP +TGKYA  +VRGVQG    G    
Sbjct: 153 YNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGVAGPVNS 212

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
             L+ASACCKHFTAYDL+NWKG TRY +DA+VT QDL DTY PPF+SCV+ G ASGIMC+
Sbjct: 213 TDLEASACCKHFTAYDLENWKGITRYVYDAKVTAQDLEDTYNPPFKSCVEDGHASGIMCS 272

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
           YNRVNG+P+CAD NLLSKTAR+ WGF+GYITSDCDAVSII+DA+GYAK+ EDAV DVLKA
Sbjct: 273 YNRVNGVPTCADYNLLSKTARQSWGFYGYITSDCDAVSIIHDAQGYAKTSEDAVADVLKA 332

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           GMDVNCG ++QK+  +A++Q K+ E +I+RALHNLF+VRMRLGLFNG+P    +G IG D
Sbjct: 333 GMDVNCGGYVQKYGASALQQGKITEQDINRALHNLFTVRMRLGLFNGDPRRNRYGNIGPD 392

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
            VC+  HQ LAL+AAQDGIVLLKN  G LPL KS   SLA+IG NAN+A +LLGNY GP 
Sbjct: 393 QVCTQEHQDLALEAAQDGIVLLKNDGGALPLSKSGVASLAVIGFNANNATSLLGNYFGPP 452

Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
           C ++TPLQ LQ YV++T +  GC++ AC+  +I +AV  A  AD VVL MGLDQ QE+EE
Sbjct: 453 CVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIPEAVQAASSADSVVLFMGLDQNQEREE 512

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
           +DR+DL LPG+QQ LI  VA AAKKPVILVLLCGGPVD++FAK +  IG+ILWAGYPGEA
Sbjct: 513 VDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPGEA 572

Query: 566 GAVALAEVIFGDHNPG 581
           G +A+A+V+FG+HNPG
Sbjct: 573 GGIAIAQVLFGEHNPG 588


>gi|218191593|gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
          Length = 774

 Score =  794 bits (2051), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/553 (67%), Positives = 445/553 (80%), Gaps = 4/553 (0%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           PPFSC    PS+  F  C   LPI QRA DLVSRLTL+EKISQL + +PA+ RLG+PAY+
Sbjct: 28  PPFSC--GAPSSAAF--CNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYK 83

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           WWSEALHGV+  G+GI  +G +R ATSFPQVILTAASF+ +LWYRIGQ IG EARA+YN 
Sbjct: 84  WWSEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNN 143

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP VTGKYA  +VRGVQG    G      L
Sbjct: 144 GQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAINSTDL 203

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           +ASACCKHFTAYDL+NWKG TRY FDA+VT QDLADTY PPF SCV+ G ASGIMC+YNR
Sbjct: 204 EASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNR 263

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG+P+CAD NLLSKTAR  W F+GYITSDCDAVSII+D +GYAK+ EDAV DVLKAGMD
Sbjct: 264 VNGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLKAGMD 323

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           VNCGS++Q+H  +A++Q K+ E +I+RALHNLF+VRMRLGLFNGNP    +G IG D VC
Sbjct: 324 VNCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVC 383

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           +  HQ LAL+AAQ G+VLLKN    LPL KS+  S+A+IG NAN A  LLGNY GP C S
Sbjct: 384 TQEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCIS 443

Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           +TPLQ LQ YV++T +  GC++ AC+ +SI +A  +A   D+VVL MGLDQ QE+EE+DR
Sbjct: 444 VTPLQVLQGYVKDTRFLAGCNSAACNVSSIGEAAQLASSVDYVVLFMGLDQDQEREEVDR 503

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
           ++L LPG Q+ LI  VA AAKKPVILVLLCGGPVD+TFAKY+  IG+ILWAGYPGEAG +
Sbjct: 504 LELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGI 563

Query: 569 ALAEVIFGDHNPG 581
           A+A+V+FG+HNPG
Sbjct: 564 AIAQVLFGEHNPG 576


>gi|115448721|ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
 gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|46390225|dbj|BAD15656.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa Japonica Group]
 gi|125583710|gb|EAZ24641.1| hypothetical protein OsJ_08409 [Oryza sativa Japonica Group]
          Length = 780

 Score =  794 bits (2051), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/553 (67%), Positives = 445/553 (80%), Gaps = 4/553 (0%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           PPFSC    PS+  F  C   LPI QRA DLVSRLTL+EKISQL + +PA+ RLG+PAY+
Sbjct: 34  PPFSC--GAPSSAAF--CNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYK 89

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           WWSEALHGV+  G+GI  +G +R ATSFPQVILTAASF+ +LWYRIGQ IG EARA+YN 
Sbjct: 90  WWSEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNN 149

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP VTGKYA  +VRGVQG    G      L
Sbjct: 150 GQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQGYALAGAINSTDL 209

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           +ASACCKHFTAYDL+NWKG TRY FDA+VT QDLADTY PPF SCV+ G ASGIMC+YNR
Sbjct: 210 EASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNR 269

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG+P+CAD NLLSKTAR  W F+GYITSDCDAVSII+D +GYAK+ EDAV DVLKAGMD
Sbjct: 270 VNGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGYAKTAEDAVADVLKAGMD 329

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           VNCGS++Q+H  +A++Q K+ E +I+RALHNLF+VRMRLGLFNGNP    +G IG D VC
Sbjct: 330 VNCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVC 389

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           +  HQ LAL+AAQ G+VLLKN    LPL KS+  S+A+IG NAN A  LLGNY GP C S
Sbjct: 390 TQEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCIS 449

Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           +TPLQ LQ YV++T +  GC++ AC+ +SI +A  +A   D+VVL MGLDQ QE+EE+DR
Sbjct: 450 VTPLQVLQGYVKDTRFLAGCNSAACNVSSIGEAAQLASSVDYVVLFMGLDQDQEREEVDR 509

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
           ++L LPG Q+ LI  VA AAKKPVILVLLCGGPVD+TFAKY+  IG+ILWAGYPGEAG +
Sbjct: 510 LELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGI 569

Query: 569 ALAEVIFGDHNPG 581
           A+A+V+FG+HNPG
Sbjct: 570 AIAQVLFGEHNPG 582


>gi|18025342|gb|AAK38482.1| beta-D-xylosidase [Hordeum vulgare]
          Length = 777

 Score =  788 bits (2036), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/556 (66%), Positives = 445/556 (80%), Gaps = 4/556 (0%)

Query: 26  STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
           +  PPFSC    PS+  F  C   LPI QRA DLVS+LTL+EKISQL + +PA+ RLG+P
Sbjct: 29  AADPPFSC--GAPSSAAF--CDRRLPIEQRAADLVSKLTLEEKISQLGDESPAVDRLGVP 84

Query: 86  AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
           AY+WWSEALHGVA  G+G+  +G +R ATSFPQVILTAASF+ +LWYRIGQ IG EAR +
Sbjct: 85  AYKWWSEALHGVANAGRGVHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARGV 144

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
           YN GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP +TGKYA  +VRGVQG   +G    
Sbjct: 145 YNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGMSGAINS 204

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
             L+ASACCKHFTAYDL+NWKG TR+ FDA+VT QDLADTY PPF+SCV+ G ASGIMC+
Sbjct: 205 SDLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCS 264

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
           YNRVNG+P+CAD NLLSKTAR  W F+GYITSDCDAV+II+D +GYAK+PEDAV DVLKA
Sbjct: 265 YNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLKA 324

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           GMDVNCG ++Q H  +A +Q K+   +IDRAL NLF++RMRLGLF+GNP    +G IGAD
Sbjct: 325 GMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFDGNPKYNRYGNIGAD 384

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
            VCS  HQ LALQAA+DGIVLLKN    LPL KSK  SLA+IGPN N+A  LLGNY GP 
Sbjct: 385 QVCSKEHQDLALQAARDGIVLLKNDGAALPLSKSKVSSLAVIGPNGNNASLLLGNYFGPP 444

Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
           C S+TPLQALQ YV++  +  GC+   C+ ++I +AV  A  AD+VVL MGLDQ QE+EE
Sbjct: 445 CISVTPLQALQGYVKDARFVQGCNAAVCNVSNIGEAVHAAGSADYVVLFMGLDQNQEREE 504

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
           +DR++L LPG Q+ L+  VA+AAKKPVILVLLCGGPVD+TFAK +  IG+I+WAGYPG+A
Sbjct: 505 VDRLELGLPGMQESLVNSVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQA 564

Query: 566 GAVALAEVIFGDHNPG 581
           G +A+A+V+FGDHNPG
Sbjct: 565 GGIAIAQVLFGDHNPG 580


>gi|212275712|ref|NP_001130324.1| uncharacterized protein LOC100191418 precursor [Zea mays]
 gi|194688848|gb|ACF78508.1| unknown [Zea mays]
 gi|413938927|gb|AFW73478.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 780

 Score =  786 bits (2031), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/582 (64%), Positives = 455/582 (78%), Gaps = 7/582 (1%)

Query: 5   KLSLVFPLLCLCFTSLL---TRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVS 61
           +LSLV     L     L     V +++PP++C    P     PFC   LPI +R  DLVS
Sbjct: 6   RLSLVHAAAALVVLLQLHGGAVVVASEPPYTCGAGAPPN--IPFCDAGLPIDRRVDDLVS 63

Query: 62  RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
           R+T+ EKISQL + +PAIPRLG+PAY+WWSEALHG++  G+GI  +G +R ATSFPQVIL
Sbjct: 64  RMTVAEKISQLGDQSPAIPRLGVPAYKWWSEALHGISNQGRGIHLDGPLRAATSFPQVIL 123

Query: 122 TAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL 181
           TAASF+ +LWYRIGQ IG+EARA+YN GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP 
Sbjct: 124 TAASFNPHLWYRIGQVIGVEARAVYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPT 183

Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQD 241
           +TGKYA  +VRGVQG    G      L+ASACCKHFTAYDL+NWKG TRY FDA+VT QD
Sbjct: 184 MTGKYAAVFVRGVQGYGLAGPVNSTGLEASACCKHFTAYDLENWKGVTRYVFDAKVTAQD 243

Query: 242 LADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDA 301
           LADTY PPF+SCV+ G ASGIMC+YNRVNG+P+CAD NLLS TAR+ WGF+GYITSDCDA
Sbjct: 244 LADTYNPPFKSCVEDGHASGIMCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDA 303

Query: 302 VSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLF 361
           V+II+DA+GYAK+ EDAV DVLKAGMDVNCGS++Q H  +A++Q K+ E +I+RALHNLF
Sbjct: 304 VAIIHDAQGYAKTAEDAVADVLKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLF 363

Query: 362 SVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS--HGLLPLPKS 419
           +VRMRLGLFNG+P    +G IG D VC+  HQ LAL+AAQDGIVLLKN    G LPL K 
Sbjct: 364 AVRMRLGLFNGDPRRNLYGDIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKP 423

Query: 420 KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID 479
              SLA+IG NAN A  L GNY GP C ++TPLQ LQ YV++T +  GC++ AC+  +I 
Sbjct: 424 NVASLAVIGFNANDAIRLRGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIP 483

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
           +AV  A  AD VVL MGLDQ QE+EE+DR+DL LPG+QQ LI  VA AAKKPVILVLLCG
Sbjct: 484 EAVQAASSADSVVLFMGLDQDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCG 543

Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           GPVD++FAK +  IG+ILWAGYPGEAG +A+A+V+FG+HNPG
Sbjct: 544 GPVDVSFAKTNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPG 585


>gi|357138088|ref|XP_003570630.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 1026

 Score =  769 bits (1985), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/585 (63%), Positives = 450/585 (76%), Gaps = 16/585 (2%)

Query: 1   MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
           +IFH   +V PL CL   + +    +T PPFSC     S  ++PFC   LPI QRA DL 
Sbjct: 6   IIFH---VVLPL-CLVLQATM----ATDPPFSCG----SPSSYPFCDRKLPIGQRAADLA 53

Query: 61  SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV---GKGIFFN-GTIRGATSF 116
           SRLT++EK+S L + +P +PRLG+PAY+WWSEALHGVA       G+ F+ G +R ATSF
Sbjct: 54  SRLTVEEKVSLLGDVSPGVPRLGVPAYKWWSEALHGVANAPADRAGVRFDDGPVRAATSF 113

Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETP 176
           PQV++TAASF+ +LWYRIGQ IG EAR +YN+GQA G+TFWAPNIN+FRDPRWGRGQETP
Sbjct: 114 PQVLVTAASFNPHLWYRIGQVIGREARGIYNSGQAEGLTFWAPNINVFRDPRWGRGQETP 173

Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
           GEDP +TGKYA  +VRGVQG   +G      L+ASACCKHFTAYDL+NW G TR+ F+A+
Sbjct: 174 GEDPTMTGKYAAVFVRGVQGYGASGAVNSSGLEASACCKHFTAYDLENWNGVTRFAFNAK 233

Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
           V+ QDLADTY PPF SCV+ G ASGIMC+YNRVNG+P+CAD NLLSKTAR  W F+GYIT
Sbjct: 234 VSEQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCADHNLLSKTARGDWRFNGYIT 293

Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
           SDCDAV+II+D +GYAK PEDAV DVLKAGMDVNCG ++QKH  +A  Q K+ E +IDRA
Sbjct: 294 SDCDAVAIIHDVQGYAKEPEDAVADVLKAGMDVNCGDYVQKHGVSAFHQGKITEQDIDRA 353

Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
           L NLF++RMRLGLF+GNP    +G IGAD VC   HQ LAL+AAQDGIVLLKN  G LPL
Sbjct: 354 LQNLFAIRMRLGLFDGNPKYNRYGNIGADQVCKKEHQDLALEAAQDGIVLLKNDAGTLPL 413

Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSA 476
           PK K  SLA+IG NAN A+ L GNY GP C S++PLQALQ YV  T +  GC+   C+ +
Sbjct: 414 PKQKISSLAVIGHNANDAQRLQGNYFGPPCISVSPLQALQGYVRETKFVAGCNAAVCNVS 473

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
            I  A   A  A++VVL MGLDQ QE+E+LDR++L LPG Q+ L+  VA+AAKKPV+LVL
Sbjct: 474 DIAGAAKAASEAEYVVLFMGLDQDQEREDLDRIELGLPGMQESLVNAVADAAKKPVVLVL 533

Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           LCGGPVD+TFAK +  IG+I+WAGYPG+AG +A+A+V+FG+HNPG
Sbjct: 534 LCGGPVDVTFAKGNPKIGAIIWAGYPGQAGGIAIAQVLFGEHNPG 578


>gi|326517420|dbj|BAK00077.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 781

 Score =  704 bits (1818), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 326/557 (58%), Positives = 422/557 (75%), Gaps = 5/557 (0%)

Query: 26  STQPPFSCDPSN-PSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGI 84
           +  PPFSC PS+  +T+ + FC  TLP++QRA DLV+RLT  EK++QL + A  +PRLG+
Sbjct: 33  AADPPFSCGPSSTAATQGYAFCDATLPVAQRAADLVARLTTAEKVAQLGDEAAGVPRLGV 92

Query: 85  PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
           PAY+WW+EALHG+A  GKG+ FNG +R ATSFPQV LTAA+FD  LW RIGQAIG EARA
Sbjct: 93  PAYKWWNEALHGLATSGKGLHFNGAVRSATSFPQVSLTAAAFDDDLWLRIGQAIGREARA 152

Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
           LYN GQA G+T W+PN+NI+RDPRWGRGQETPGEDP    +Y V++V+G+QG++      
Sbjct: 153 LYNVGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYGVAFVKGLQGNS----TS 208

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
              LQ SACCKH TAYDL++W G  RY FDARVT QDL DTY PPF SCV  G+AS +MC
Sbjct: 209 SSLLQTSACCKHATAYDLEDWGGVARYNFDARVTAQDLEDTYNPPFRSCVVDGKASCVMC 268

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
           AY  +NG+P+CA+  LL+ T R  WG  GY+ SDCDAV+I+ DA+ YA +PEDAV   LK
Sbjct: 269 AYTAINGVPACANSGLLTNTVRADWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVALALK 328

Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
           AG+D++CG+++Q+H  AA++Q K+ E ++D+AL NLF++RMRLG F+G+P    +G + A
Sbjct: 329 AGLDIDCGTYMQQHAPAALQQGKITEDDVDKALKNLFAIRMRLGHFDGDPRANIYGGLNA 388

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
             +C+P H+ LAL+AAQDGIVLLKN  G+LPL ++   S A+IGPNAN+   L+GNY GP
Sbjct: 389 AHICTPEHRSLALEAAQDGIVLLKNDAGILPLDRAAIASAAVIGPNANNPGLLIGNYFGP 448

Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
            C S+TPL+ +Q YV++  +  GC + AC  A  D+A  +A  +D+V+L MGL Q QE E
Sbjct: 449 PCESVTPLKGVQGYVKDVRFMAGCGSAACDVADTDQAATLAGSSDYVLLFMGLSQQQESE 508

Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
             DR  L+LPG+QQ LIT VA+AAK+PVILVLL GGPVD+TFAK +  IG+ILWAGYPG+
Sbjct: 509 GRDRTSLLLPGQQQSLITAVADAAKRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQ 568

Query: 565 AGAVALAEVIFGDHNPG 581
           AG +A+A V+FGDHNPG
Sbjct: 569 AGGLAIARVLFGDHNPG 585


>gi|253761872|ref|XP_002489310.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
 gi|241946958|gb|EES20103.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
          Length = 772

 Score =  699 bits (1804), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/555 (59%), Positives = 417/555 (75%), Gaps = 8/555 (1%)

Query: 29  PPFSCDP-SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
           PPFSC P S  ++E   FC  TL  +QRA DLVSRLT  EKI+QL + A  +PRLG+P Y
Sbjct: 26  PPFSCGPTSAEASEGLAFCDVTLSPAQRAADLVSRLTPAEKIAQLGDQATGVPRLGVPGY 85

Query: 88  EWWSEALHGVAGVGKGIFFN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
           +WW+EALHG+A  GKG+ F+  G +R ATSFPQV+LTAA+FD  LW+RIGQAIG EARAL
Sbjct: 86  KWWNEALHGLATSGKGLHFDVVGGVRAATSFPQVLLTAAAFDDDLWFRIGQAIGREARAL 145

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
           +N GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V  +YAV++VRG+QG++ +     
Sbjct: 146 FNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSS---- 201

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
             LQ SACCKH TAYDL++W G  RY F ARVT QDL DT+ PPF SCV +G+AS IMCA
Sbjct: 202 -LLQTSACCKHATAYDLEDWNGVARYSFVARVTAQDLEDTFNPPFRSCVVEGKASCIMCA 260

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
           Y  +NG+P+CA+ +LL+ T R  WG  GY+ SDCDAV+I+ DA+ YA +PEDAV   LKA
Sbjct: 261 YTAINGVPACANTDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLKA 320

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           G+D++CGS++Q+H  AA++Q KL E +ID+AL NLF+VRMRLG F+G+P    +G + A 
Sbjct: 321 GLDIDCGSYIQQHATAAIQQGKLTELDIDKALVNLFAVRMRLGHFDGDPRKNMYGALSAA 380

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
            +C+P H+ LAL+AAQDGIVLLKN  G+LPL +S   S A+IGPN+N    L+ NY GP 
Sbjct: 381 DICTPEHRSLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNSNDGMALIANYFGPP 440

Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
           C S TPLQ LQ+YV N  +  GC + AC  A  D+AV ++   D+V L MGL Q QE E 
Sbjct: 441 CESTTPLQGLQSYVNNVRFLAGCSSAACDVAVTDQAVVLSGSEDYVFLFMGLSQQQESEG 500

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            DR  L+LPG QQ LIT VA+A+K+PVILVLL GGPVDITFA+ +  IG+ILWAGYPG+A
Sbjct: 501 KDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQA 560

Query: 566 GAVALAEVIFGDHNP 580
           G +A+A+V+FGDHNP
Sbjct: 561 GGLAIAKVLFGDHNP 575


>gi|357152329|ref|XP_003576084.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 779

 Score =  698 bits (1802), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/553 (58%), Positives = 419/553 (75%), Gaps = 8/553 (1%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           PPFSC P + +T+ + FC   LP+ +RA DLVSRLTL EK+SQL + A A+PRLG+PAY+
Sbjct: 34  PPFSCGPGS-ATQGYAFCDKALPVERRAADLVSRLTLAEKVSQLGDEADAVPRLGVPAYK 92

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           WWSE LHG++  G G+ F+G +R  TSFPQV+LTAASFD  +WYRIGQAIG EARALYN 
Sbjct: 93  WWSEGLHGLSFWGHGMHFDGAVRAITSFPQVLLTAASFDQDIWYRIGQAIGTEARALYNL 152

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           GQA G+T W+PN+NI+RDPRWGRGQETPGEDP    KYAV++V+G+QG +         L
Sbjct: 153 GQAQGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGTSAT------TL 206

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           Q SACCKH TAYDL++W G  RY F+A+VT+QDLADT+ PPF+SCV++G+A+ +MCAY  
Sbjct: 207 QTSACCKHATAYDLEDWNGVVRYNFNAKVTLQDLADTFNPPFKSCVEEGKATCVMCAYTN 266

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           +NG+P+CA  +L++KT +  WG +GY++SDCDAV+++ DA+ Y  +PED V   LKAG+D
Sbjct: 267 INGVPACASSDLITKTFKGDWGLNGYVSSDCDAVALLRDAQRYRATPEDTVAVALKAGLD 326

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP-TMQPFGKIGADVV 387
           +NCG++ Q H  +A++Q K+ E ++D AL NLF+VRMRLG F+G+P T   +G +GA  V
Sbjct: 327 LNCGNYTQVHGMSALQQGKMTEQDVDNALKNLFAVRMRLGHFDGDPRTSALYGSLGAADV 386

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
           CSPAH+ LAL+AAQ GIVLLKN  G+LPL  S   S A IG NAN    L GNY GP C 
Sbjct: 387 CSPAHKNLALEAAQSGIVLLKNDAGILPLDPSAVASAAAIGHNANDPAALNGNYFGPPCE 446

Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
           + TPLQ LQ YV+N  +  GCD+ AC  A+  +AV +A  +D+V+L MGL Q +E+E +D
Sbjct: 447 TTTPLQGLQGYVKNVKFLAGCDSAACGFAATGQAVTLASSSDYVILFMGLSQKEEQEGID 506

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R  L+LPG+QQ LIT VA A+K+PVILVLL GG VDITFAK +  IG+ILWAGYPG+AG 
Sbjct: 507 RTSLLLPGKQQNLITAVASASKRPVILVLLTGGSVDITFAKSNPKIGAILWAGYPGQAGG 566

Query: 568 VALAEVIFGDHNP 580
           +A+A V+FGDHNP
Sbjct: 567 LAIARVLFGDHNP 579


>gi|357156390|ref|XP_003577440.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 755

 Score =  697 bits (1799), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/556 (58%), Positives = 418/556 (75%), Gaps = 13/556 (2%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           P FSC P  P    + FC   LP  QRA DLV++LTL+EK+SQL + AP +PR G+P Y 
Sbjct: 12  PAFSCGP--PQQAQYAFCNRALPAEQRAADLVAKLTLEEKVSQLGDQAPGVPRFGVPGYN 69

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           WWSE LHGV+  G G+ FNG +RG T+FPQV+LT ASFD  +WYRIGQAIG EARA++N 
Sbjct: 70  WWSEGLHGVSMWGHGMHFNGAVRGVTTFPQVLLTTASFDDSIWYRIGQAIGTEARAMFNL 129

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           GQA G+T W+PN+NI+RDPRWGRGQETPGEDP    KYAV++VRG+QG +         L
Sbjct: 130 GQADGLTIWSPNVNIYRDPRWGRGQETPGEDPATASKYAVAFVRGLQGTSTT------TL 183

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           Q SACCKH TAYDLD+W    RY F+A+VT QDL +T+ PPF+SCV +G+A+ +MCAY  
Sbjct: 184 QTSACCKHATAYDLDDWNRIGRYNFNAKVTAQDLEETFNPPFKSCVVEGKATCVMCAYTS 243

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNGIP+CAD  LL+KT + +WG +GYI+SDCDAV+++Y    Y+ +PEDAV   +KAG+D
Sbjct: 244 VNGIPACADSGLLTKTIKGEWGMNGYISSDCDAVALLYGTR-YSGTPEDAVAAAIKAGLD 302

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGADVV 387
           +NCG+F Q H  AA++Q+K+ E ++D+AL NLF++RMRLG F+G+P   P +G++GA  V
Sbjct: 303 MNCGNFSQVHGMAALQQRKMSEQDVDKALRNLFAIRMRLGHFDGDPLQSPLYGRLGAQDV 362

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPL--PKSKSVSLALIGPNANSAKTLLGNYAGPS 445
           CSPAH+ LAL+AAQ+GIVLLKN    LPL  P + S S A+IGPNAN    LLGNY GP 
Sbjct: 363 CSPAHKDLALEAAQNGIVLLKNDAATLPLSRPTAASASFAVIGPNANEPGALLGNYFGPP 422

Query: 446 CRSITPLQALQN-YVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
           C + TPLQALQ  Y +N  + PGCD+ AC+ A   +A  +A  +D+ +L MGL Q QE+E
Sbjct: 423 CETTTPLQALQKFYSKNVRFVPGCDSAACNVADTYQASGLAATSDYTILFMGLSQKQEQE 482

Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
            LDR  L+LPG+Q+ LIT VA AAK+P+ILVLL GGPVDITFAK++  IG+ILWAGYPG+
Sbjct: 483 GLDRTSLLLPGKQESLITAVAAAAKRPIILVLLTGGPVDITFAKFNPKIGAILWAGYPGQ 542

Query: 565 AGAVALAEVIFGDHNP 580
           AG +A+A+V+FG+HNP
Sbjct: 543 AGGLAIAKVLFGEHNP 558


>gi|413925162|gb|AFW65094.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 774

 Score =  689 bits (1778), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/553 (59%), Positives = 419/553 (75%), Gaps = 6/553 (1%)

Query: 29  PPFSCDPSN-PSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
           PPFSC PS+  ++E   FC  TL  +QRA DLVSRLT  EKI+QL + AP +PRLG+P Y
Sbjct: 29  PPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLGVPGY 88

Query: 88  EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
           +WW+EALHG+A  GKG+ F+  +R ATSFPQV+LTAA+FD  LW RIGQAIG EARAL+N
Sbjct: 89  KWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREARALFN 148

Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
            GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V  +YAV++VRG+QG++ +       
Sbjct: 149 VGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSS-----L 203

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           LQ SACCKH TAYDL++W G  RY F ARVT QDL DT+ PPF SCV + +AS +MCAY 
Sbjct: 204 LQTSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYT 263

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
            +NG+P+CA+ +LL+ T R  WG  GY+ SDCDAV+I+ DA+ YA +PEDAV   LKAG+
Sbjct: 264 AINGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLKAGL 323

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D++CGS++Q+H  AA++Q KL E +ID+AL NL++VRMRLG F+G+P    +G +GA  +
Sbjct: 324 DIDCGSYVQQHAAAAIQQGKLTEQDIDKALTNLYAVRMRLGHFDGDPRKNMYGVLGAADI 383

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
           C+P H+ LAL+AAQDGIVLLKN  G+LPL +S   S A+IGPNAN    L+ NY GP C 
Sbjct: 384 CTPEHRNLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNANDGMALIANYFGPPCE 443

Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
           S TPL+ LQ+YV +  +  GC++ AC  A+ D+AV +A   D+V L MGL Q QE E  D
Sbjct: 444 STTPLKGLQSYVNDVRFLAGCNSAACDVAATDQAVALAGSEDYVFLFMGLSQKQESEGKD 503

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R  L+LPG QQ LIT VA+A+K+PVILVLL GGPVDITFA+ +  IG+ILWAGYPG+AG 
Sbjct: 504 RTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWAGYPGQAGG 563

Query: 568 VALAEVIFGDHNP 580
           +A+A+V+FGDHNP
Sbjct: 564 LAIAKVLFGDHNP 576


>gi|296084630|emb|CBI25718.3| unnamed protein product [Vitis vinifera]
          Length = 768

 Score =  686 bits (1770), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/575 (58%), Positives = 420/575 (73%), Gaps = 11/575 (1%)

Query: 8   LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
            +F  L L    L +  +ST P F C P  P+   +PFC T+LPIS RA+ LVS LTL E
Sbjct: 7   FLFICLFLQVLPLFSISESTHPQFPCMP--PTNSDYPFCNTSLPISTRAQSLVSLLTLSE 64

Query: 68  KISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFD 127
           KI QL + A AIPRL IPAYEWWSE+LHG+A  G G+ FNGT+  ATSFPQV+LTAASF+
Sbjct: 65  KIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLLTAASFN 124

Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
             LW+ IG AI +EARA+YN GQA G+TFWAPNINIFRDPRWGRGQETPGEDP+V   YA
Sbjct: 125 RSLWFSIGSAIAVEARAMYNVGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYA 183

Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
           V +VRG QGD+   G     L  SACCKH TAYDL+ W   +RY FDA V+ QDL DTYQ
Sbjct: 184 VEFVRGFQGDSDGDG-----LMLSACCKHLTAYDLEKWGNFSRYSFDAVVSNQDLEDTYQ 238

Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
           PPF SCV+QG+AS +MC+YNRVNG+P+CA ++L  K A+ +WGF GYITSDCDAV+ +Y+
Sbjct: 239 PPFRSCVQQGKASCLMCSYNRVNGVPACARQDLFQK-AKTEWGFKGYITSDCDAVATVYE 297

Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
            + YA SPEDAV DVLKAG D+NCGS++ +HT++A+ Q K+ E +IDRAL NLFSV+MRL
Sbjct: 298 YQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQGKVKEEDIDRALFNLFSVQMRL 357

Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALI 427
           GLF+G+P    +G +G   VC+  H+ LAL+AA+ GIVLLKN    LPL KS+  SLA+I
Sbjct: 358 GLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGIVLLKNDKKFLPLDKSRISSLAII 417

Query: 428 GPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAK 486
           GP A+    L G Y G  C+  + ++ L+ YVE T +  GC  V C S    D+AV IA+
Sbjct: 418 GPQADQ-PFLGGGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDVPCLSDTGFDEAVSIAR 476

Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
            AD VV++ GLD +QE E+ DRV L+LPG+Q  LI+ VA A +KP++LVL  GGP+D++F
Sbjct: 477 KADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISSVASAIQKPLVLVLTGGGPLDVSF 536

Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           A+ D  I SILW GYPGEAGA ALAE+IFGD NPG
Sbjct: 537 AEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPG 571


>gi|357156904|ref|XP_003577615.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 767

 Score =  686 bits (1769), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/552 (58%), Positives = 418/552 (75%), Gaps = 9/552 (1%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           PPFSC     ++ ++ FC   LP++QRA DLVSRLT  EK++QL + A  +PRLG+P Y+
Sbjct: 25  PPFSC---GQASSSYAFCDAALPVAQRAADLVSRLTAAEKVAQLGDEAAGVPRLGVPGYK 81

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           WW+EALHG+A  GKG+ F+G +R ATSFPQV LTAA+FD  LW+RIGQAIG EARALYN 
Sbjct: 82  WWNEALHGLATSGKGLHFDGAVRSATSFPQVCLTAAAFDDDLWFRIGQAIGREARALYNL 141

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           GQA G+T W+PN+NI+RDPRWGRGQETPGEDP    +YAV++VRG+QG++ +       L
Sbjct: 142 GQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYAVAFVRGMQGNSTS------LL 195

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           QASACCKH TAYDL++W G  RY FDA+VT QDL DT+ PPF SCV  G+AS +MCAY  
Sbjct: 196 QASACCKHATAYDLEDWNGVARYNFDAKVTAQDLEDTFNPPFRSCVVDGKASCVMCAYTG 255

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           +NG+P+CA+ +LL+KT R  WG  GY  SDCDAV+I+ DA+ YA+SPEDAV   LKAG+D
Sbjct: 256 INGVPACANADLLTKTVRGDWGLDGYTASDCDAVAIMRDAQRYAQSPEDAVALALKAGLD 315

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           ++CG+++Q+H  AA++Q K+ E +ID+AL NLF++RMRLG F+G+P    +G +GA  +C
Sbjct: 316 IDCGTYMQQHAAAAIQQGKITEEDIDKALKNLFAIRMRLGHFDGDPRTNMYGGLGAADIC 375

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           +  H+ LAL AAQDGIVLLKN  G+LPL ++   S A+IGPNAN+   L+ NY GP C S
Sbjct: 376 TAEHRSLALDAAQDGIVLLKNDAGILPLDRAAVASTAVIGPNANNPGALIANYFGPPCES 435

Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
            TPL+ +Q YV++  +  GC + AC  A+ D+A  +A  +D+V L MGL Q QE E  DR
Sbjct: 436 TTPLKGIQGYVKDARFLAGCSSTACDVATTDQAAALASTSDYVFLFMGLGQRQESEGRDR 495

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
             L+LPG+QQ LIT VA+AA++PVILVLL GGPVD+TFA+ +  IG+ILWAGYPG+AG +
Sbjct: 496 TSLLLPGKQQSLITAVADAAQRPVILVLLSGGPVDVTFAQTNPKIGAILWAGYPGQAGGL 555

Query: 569 ALAEVIFGDHNP 580
           A+A V+FGDHNP
Sbjct: 556 AIARVLFGDHNP 567


>gi|62701898|gb|AAX92971.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|62733926|gb|AAX96035.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|77550045|gb|ABA92842.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
           expressed [Oryza sativa Japonica Group]
 gi|125576900|gb|EAZ18122.1| hypothetical protein OsJ_33667 [Oryza sativa Japonica Group]
          Length = 771

 Score =  683 bits (1763), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/554 (58%), Positives = 416/554 (75%), Gaps = 8/554 (1%)

Query: 28  QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
           +PP+SC P +PS+  + FC   LP ++RA DLVSRLT  EK++QL + A  +PRLG+P Y
Sbjct: 24  EPPYSCGPRSPSS-GYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVPRLGVPPY 82

Query: 88  EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
           +WWSE LHG++  G G+ FNG +   TSFPQV+LTAA+FD  LW+RIGQAIG EARALYN
Sbjct: 83  KWWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYN 142

Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
            GQA G+T W+PN+NI+RDPRWGRGQETPGEDP    KYAV++V+G+QG T       G 
Sbjct: 143 LGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGST------PGT 196

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           LQ SACCKH TAYDL+ W G  RY F+A+VT QDLADT+ PPF+SCV   +AS +MCAY 
Sbjct: 197 LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYT 256

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
            +NG+P+CA  +LLSKT R QWG  GY++SDCDAV+++ DA+ YA +PED V   +KAG+
Sbjct: 257 DINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYAPTPEDTVAVAIKAGL 316

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP-TMQPFGKIGADV 386
           D+NCG++ Q H  AA++Q K+ ES++DRAL NLF+VRMRLG F+G+P +   +G +GA  
Sbjct: 317 DLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAAD 376

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           VC+ AH+ LAL+AAQDGIVLLKN  G LPL ++   S A+IGPNAN    L GNY GP C
Sbjct: 377 VCTQAHRDLALEAAQDGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPC 436

Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
            + TPLQ +Q Y+ +  +  GCD+ AC  A+  +A  +A  +D V++ MGL Q QEKE L
Sbjct: 437 ETTTPLQGVQRYISSVRFLAGCDSPACGFAATGQAAALASSSDQVIMFMGLSQDQEKEGL 496

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  L+LPG+QQ LIT VA AA++PVILVLL GGPVD+TFAK +  IG+ILWAGYPG+AG
Sbjct: 497 DRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAG 556

Query: 567 AVALAEVIFGDHNP 580
            +A+A+V+FGDHNP
Sbjct: 557 GLAIAKVLFGDHNP 570


>gi|357489437|ref|XP_003615006.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516341|gb|AES97964.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 685

 Score =  683 bits (1763), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/482 (66%), Positives = 391/482 (81%), Gaps = 5/482 (1%)

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           GI  NG+I  ATSFPQVILTAASFD  LWY+I + IG EAR +YNAGQA GM FWAPNIN
Sbjct: 2   GIILNGSIPAATSFPQVILTAASFDPKLWYQISKVIGTEARGVYNAGQAQGMNFWAPNIN 61

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL-KGKLQASACCKHFTAYD 221
           IFRDPRWGRGQET GEDPLV  KY VSYVRG+QGD+F GGKL  G+L+ASACCKHFTAYD
Sbjct: 62  IFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSFEGGKLIGGRLKASACCKHFTAYD 121

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
           L+NWKG  RY FDA+VT+QDLADTYQP F SCV QGR+SGIMCAYNRVNG+P+CAD NLL
Sbjct: 122 LENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLL 181

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
           + TAR++W F+GYI SDCDAV  IY+ +GYAK+PED V DVL+AGMD+ CG+++ KH K+
Sbjct: 182 TNTARKKWNFNGYIASDCDAVRFIYEKQGYAKTPEDVVADVLRAGMDLECGNYMTKHAKS 241

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           AV QKK+P S+IDRALHNLF++R+RLGLF+GNPT   +G+IG + VCS  +  LAL+AA+
Sbjct: 242 AVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAAR 301

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNAN-SAKTLLGNYAGPSCRSITPLQALQNYVE 460
            GIVLLKN+  +LPLP+    +L +IGPNAN S+  LLGNY GP C++++ L+    Y  
Sbjct: 302 SGIVLLKNTASILPLPRVN--TLGVIGPNANKSSIVLLGNYIGPPCKNVSILKGFYTYAS 359

Query: 461 NTVYYPGC-DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
            T Y+ GC D   C+SA ID+AV++AK +D+V+L+MGLDQ+QE E LDR  L LPG+QQ+
Sbjct: 360 QTHYHSGCTDGTKCASAEIDRAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQK 419

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           LI  VA+A+KKPVILVLLCGGPVDITFAK +  IG I+WAGYPGE G  ALA+V+FGD+N
Sbjct: 420 LINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYN 479

Query: 580 PG 581
           PG
Sbjct: 480 PG 481


>gi|414588273|tpg|DAA38844.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 775

 Score =  683 bits (1762), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/556 (59%), Positives = 419/556 (75%), Gaps = 3/556 (0%)

Query: 26  STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
           ++ P FSC PS+ S   +PFC  +LP ++RA DLVSRLT+ EK+SQL + A  +PRLG+P
Sbjct: 25  ASDPMFSCGPSSAS-RAYPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLGVP 83

Query: 86  AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
            Y+WWSE LHG+A  G G+ FNGT+   TSFPQV+LT ASFD  LW+RIGQAIG EARAL
Sbjct: 84  PYKWWSEGLHGLAFWGHGMRFNGTVSAVTSFPQVLLTTASFDESLWFRIGQAIGREARAL 143

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
           YN GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V  KYAV++VRG+QG     G   
Sbjct: 144 YNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSN-PAGAAA 202

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
             LQASACCKH TAYDL++W G  RY FDARVT+QDLADT+ PPF+SCV  G+AS +MCA
Sbjct: 203 APLQASACCKHATAYDLEDWNGVARYNFDARVTLQDLADTFNPPFQSCVVDGKASCVMCA 262

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
           Y  +NG+P+CA  +LL+KT R  WG  GY++SDCDAV+I+ DA+ Y  +PED V   LKA
Sbjct: 263 YTVINGVPACASSDLLTKTFRGAWGLDGYVSSDCDAVAIMRDAQRYEPTPEDTVAVALKA 322

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGA 384
           G+D+NCG++ Q+H  AA++Q K+ E ++D+AL NLF+VRMRLG F+G+P     +G++GA
Sbjct: 323 GLDLNCGTYTQQHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGRLGA 382

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
             VC+  H+ LAL+AAQDGIVLLKN  G+LPL +S   S A+IG NAN    L GNY GP
Sbjct: 383 ADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAVGSAAVIGHNANDPLVLSGNYFGP 442

Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
           +C + TPL+ LQ+YV N  +  GC + AC  A+  +A  +A  A++V L MGL Q QEKE
Sbjct: 443 ACETTTPLEGLQSYVRNVRFLAGCSSAACGYAATGQAAALASSAEYVFLFMGLSQDQEKE 502

Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
            LDR  L+LPG+QQ L+T VA AAK+PV+LVLL GGPVDITFA+ +  IG+ILWAGYPG+
Sbjct: 503 GLDRTSLLLPGKQQSLVTAVASAAKRPVVLVLLTGGPVDITFAQSNPKIGAILWAGYPGQ 562

Query: 565 AGAVALAEVIFGDHNP 580
           AG +A+A V+FGDHNP
Sbjct: 563 AGGLAIARVLFGDHNP 578


>gi|225469218|ref|XP_002264031.1| PREDICTED: probable beta-D-xylosidase 6-like [Vitis vinifera]
          Length = 789

 Score =  682 bits (1761), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/591 (57%), Positives = 423/591 (71%), Gaps = 22/591 (3%)

Query: 8   LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
            +F  L L    L +  +ST P F C P  P+   +PFC T+LPIS RA+ LVS LTL E
Sbjct: 7   FLFICLFLQVLPLFSISESTHPQFPCMP--PTNSDYPFCNTSLPISTRAQSLVSLLTLSE 64

Query: 68  KISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFD 127
           KI QL + A AIPRL IPAYEWWSE+LHG+A  G G+ FNGT+  ATSFPQV+LTAASF+
Sbjct: 65  KIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLLTAASFN 124

Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
             LW+ IG AI +EARA+YN GQA G+TFWAPNINIFRDPRWGRGQETPGEDP+V   YA
Sbjct: 125 RSLWFSIGSAIAVEARAMYNVGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYA 183

Query: 188 VSYVRGVQGDTFNGGK-----------LKGK-----LQASACCKHFTAYDLDNWKGTTRY 231
           V +VRG QG  + GG            L+G      L  SACCKH TAYDL+ W   +RY
Sbjct: 184 VEFVRGFQGGNWKGGDEIRGAVGKKRVLRGDSDGDGLMLSACCKHLTAYDLEKWGNFSRY 243

Query: 232 KFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF 291
            FDA V+ QDL DTYQPPF SCV+QG+AS +MC+YNRVNG+P+CA ++L  K A+ +WGF
Sbjct: 244 SFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDLFQK-AKTEWGF 302

Query: 292 HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPES 351
            GYITSDCDAV+ +Y+ + YA SPEDAV DVLKAG D+NCGS++ +HT++A+ Q K+ E 
Sbjct: 303 KGYITSDCDAVATVYEYQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQGKVKEE 362

Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
           +IDRAL NLFSV+MRLGLF+G+P    +G +G   VC+  H+ LAL+AA+ GIVLLKN  
Sbjct: 363 DIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGIVLLKNDK 422

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTV 471
             LPL KS+  SLA+IGP A+    L G Y G  C+  + ++ L+ YVE T +  GC  V
Sbjct: 423 KFLPLDKSRISSLAIIGPQADQP-FLGGGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDV 481

Query: 472 AC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKK 530
            C S    D+AV IA+ AD VV++ GLD +QE E+ DRV L+LPG+Q  LI+ VA A +K
Sbjct: 482 PCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISSVASAIQK 541

Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           P++LVL  GGP+D++FA+ D  I SILW GYPGEAGA ALAE+IFGD NPG
Sbjct: 542 PLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPG 592


>gi|115485165|ref|NP_001067726.1| Os11g0297800 [Oryza sativa Japonica Group]
 gi|62734696|gb|AAX96805.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|77549999|gb|ABA92796.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
           expressed [Oryza sativa Japonica Group]
 gi|113644948|dbj|BAF28089.1| Os11g0297800 [Oryza sativa Japonica Group]
 gi|125534139|gb|EAY80687.1| hypothetical protein OsI_35869 [Oryza sativa Indica Group]
 gi|215766717|dbj|BAG98945.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 782

 Score =  679 bits (1751), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/563 (57%), Positives = 414/563 (73%), Gaps = 11/563 (1%)

Query: 24  VDSTQPPFSCDPSNPSTETFP-FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
           V   +PPFSC  +         FC  TLP  QRA DLV+RLT  EK++QL + A  +PRL
Sbjct: 30  VAGGEPPFSCGGAAAGGGQGYAFCDATLPAEQRAADLVARLTAAEKVAQLGDQAAGVPRL 89

Query: 83  GIPAYEWWSEALHGVAGVGKGIFFNG---TIRGATSFPQVILTAASFDSYLWYRIGQAIG 139
           G+PAY+WWSEALHG+A  G+G+ F+      R ATSFPQV+LTAA+FD  LW+RIGQAIG
Sbjct: 90  GVPAYKWWSEALHGLATSGRGLHFDAPGSAARAATSFPQVLLTAAAFDDDLWFRIGQAIG 149

Query: 140 LEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
            EARALYN GQA G+T W+PN+NIFRDPRWGRGQETPGEDP +  KYAV++V+G+QG++ 
Sbjct: 150 TEARALYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPTMASKYAVAFVKGMQGNS- 208

Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
                   LQ SACCKH TAYDL++W G  RY F+A+VT QDL DTY PPF SCV   +A
Sbjct: 209 -----SAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKA 263

Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
           + IMCAY  +NG+P+CA+ +LL+KT R  WG  GYI SDCDAV+I+ DA+ Y ++PEDAV
Sbjct: 264 TCIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASDCDAVAIMRDAQRYTQTPEDAV 323

Query: 320 VDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP- 378
              LKAG+D+NCG+++Q+H  AA++Q KL E +ID+AL NLF++RMRLG F+G+P     
Sbjct: 324 AVALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSV 383

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
           +G +GA  +C+P H+ LAL+AA DGIVLLKN  G+LPL ++   S A+IGPNAN    L+
Sbjct: 384 YGGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALI 443

Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
           GNY GP C S TPL  +  Y++N  +  GC++ AC  A+ D+A  +A  +D+V L MGL 
Sbjct: 444 GNYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVAATDQAAAVASSSDYVFLFMGLS 503

Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
           Q QE E  DR  L+LPG QQ LIT VA+AAK+PVILVLL GGPVD+TFA+ +  IG+ILW
Sbjct: 504 QKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILW 563

Query: 559 AGYPGEAGAVALAEVIFGDHNPG 581
           AGYPG+AG +A+A V+FGDHNPG
Sbjct: 564 AGYPGQAGGLAIARVLFGDHNPG 586


>gi|125534112|gb|EAY80660.1| hypothetical protein OsI_35838 [Oryza sativa Indica Group]
          Length = 771

 Score =  677 bits (1748), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/554 (58%), Positives = 414/554 (74%), Gaps = 8/554 (1%)

Query: 28  QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
           +PP+SC P +PS   + FC   LP ++RA DLVSRLT  EK++QL + A  + RLG+P Y
Sbjct: 24  EPPYSCGPRSPSL-GYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVARLGVPPY 82

Query: 88  EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
           +WWSE LHG++  G G+ FNG +   TSFPQV+LTAA+FD  LW+RIGQAIG EARALYN
Sbjct: 83  KWWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYN 142

Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
            GQA G+T W+PN+NI+RDPRWGRGQETPGEDP    KYAV++V+G+QG T       G 
Sbjct: 143 LGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGST------PGT 196

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           LQ SACCKH TAYDL+ W G  RY F+A+VT QDLADT+ PPF+SCV   +AS +MCAY 
Sbjct: 197 LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKASCVMCAYT 256

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
            +NG+P+CA  +LLSKT R QWG  GY++SDCDAV+++ DA+ YA +PED V   +KAG+
Sbjct: 257 DINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRYAPTPEDTVAVAIKAGL 316

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP-TMQPFGKIGADV 386
           D+NCG++ Q H  AA++Q K+ ES++DRAL NLF+VRMRLG F+G+P +   +G +GA  
Sbjct: 317 DLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAAYGHLGAAD 376

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           VC+ AH+ LAL+AAQ+GIVLLKN  G LPL ++   S A+IGPNAN    L GNY GP C
Sbjct: 377 VCTQAHRDLALEAAQNGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALNGNYFGPPC 436

Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
            + TPLQ +Q Y+ +  +  GCD+ AC  A+  +A  +A  +D V++ MGL Q QEKE L
Sbjct: 437 ETTTPLQGVQRYISSVRFLAGCDSPACGFAATGQAAALASSSDQVIMFMGLSQDQEKEGL 496

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  L+LPG+QQ LIT VA AA++PVILVLL GGPVD+TFAK +  IG+ILWAGYPG+AG
Sbjct: 497 DRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAILWAGYPGQAG 556

Query: 567 AVALAEVIFGDHNP 580
            +A+A+V+FGDHNP
Sbjct: 557 GLAIAKVLFGDHNP 570


>gi|253761874|ref|XP_002489311.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
 gi|241946959|gb|EES20104.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
          Length = 791

 Score =  676 bits (1743), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/569 (56%), Positives = 413/569 (72%), Gaps = 2/569 (0%)

Query: 13  LCLCFTSLLTRVD-STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
           + L   +LL RV  +  PPFSC PS+PS +  PFC   LP SQRA DLVSR+T  EK SQ
Sbjct: 30  VLLLVPALLMRVAVAGAPPFSCGPSSPS-KGLPFCNMKLPASQRAADLVSRMTPAEKASQ 88

Query: 72  LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
           L + A  +PRLG+P+Y+WW+EALHGVA  GKGI  N  +R ATSFPQV+ TAASF+  LW
Sbjct: 89  LGDIANGVPRLGVPSYKWWNEALHGVAISGKGIHMNQGVRSATSFPQVLHTAASFNDNLW 148

Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
           +RIGQA G EARA YN GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V  +Y  ++V
Sbjct: 149 FRIGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYGAAFV 208

Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
           RG+QG + N   +   LQ SACCKH TAYDL++WKG +RY F A VT+QDLADT+ PPF 
Sbjct: 209 RGLQGSSSNTKSVPPVLQTSACCKHATAYDLEDWKGVSRYSFKATVTIQDLADTFNPPFR 268

Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
           SCV  G+AS +MCAY  VNG+PSCA+ +LL+KT R  WG  GY+ +DCDAV+I+ +++ Y
Sbjct: 269 SCVVDGKASCVMCAYTIVNGVPSCANGDLLTKTFRGSWGLDGYVAADCDAVAIMRNSQFY 328

Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
             + ED V   LKAG+D++CG ++Q++  AA+++ KL + ++D+A+ NL + RMRLG F+
Sbjct: 329 RPTAEDTVAATLKAGLDIDCGPYIQQYAMAAIQKGKLTQQDVDKAVKNLLTTRMRLGHFD 388

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
           G+P    +G +GA  +C+  H+ LAL+AA DGIVLLKNS G+LPL +    S A+IG NA
Sbjct: 389 GDPKTNVYGNLGAGHICTAEHKNLALEAALDGIVLLKNSAGVLPLKRGTVNSAAVIGHNA 448

Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHV 491
           N    LLGNY GP C   TPLQ +Q YV+N  +  GC+  AC+ A+  +A  +A  +D V
Sbjct: 449 NDVLALLGNYWGPPCAPTTPLQGIQGYVKNVKFLAGCNKAACNVAATPQATALASSSDAV 508

Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
           +L MGL Q QE E  DR  L+LPG QQ LI  VA AAK+PVILVLL GGPVDITFA+ + 
Sbjct: 509 ILFMGLSQEQESEGKDRTTLLLPGNQQSLINAVANAAKRPVILVLLTGGPVDITFAQANP 568

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            IG+ILWAGYPG+AG +A+A+V+FG+ NP
Sbjct: 569 KIGAILWAGYPGQAGGLAIAKVLFGEKNP 597


>gi|9294427|dbj|BAB02547.1| beta-1,4-xylosidase [Arabidopsis thaliana]
          Length = 876

 Score =  669 bits (1727), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/573 (56%), Positives = 415/573 (72%), Gaps = 6/573 (1%)

Query: 10  FPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
           F  L L   +L++ +  +Q  F+CD S P+T  + FC  +L    RA+DLVSRL+L EK+
Sbjct: 6   FVRLSLLIIALVSSLCESQKNFACDISAPATAKYGFCNVSLSYEARAKDLVSRLSLKEKV 65

Query: 70  SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
            QLVN A  +PRLG+P YEWWSEALHGV+ VG G+ FNGT+ GATSFP  ILTAASF++ 
Sbjct: 66  QQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAASFNTS 125

Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
           LW ++G+ +  EARA++N G A G+T+W+PN+N+FRDPRWGRGQETPGEDPLV  KYAV+
Sbjct: 126 LWLKMGEVVSTEARAMHNVGLA-GLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVN 184

Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
           YV+G+Q D  + GK + +L+ S+CCKH+TAYDLDNWKG  R+ FDA+VT QDL DTYQ P
Sbjct: 185 YVKGLQ-DVHDAGKSR-RLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTP 242

Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
           F+SCV++G  S +MC+YNRVNGIP+CAD NLL    R QW   GYI SDCD++ + ++  
Sbjct: 243 FKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDI 302

Query: 310 GYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
            Y K+ EDAV   LKAG+++NCG FL K+T+ AVK KKL  S++D AL   + V MRLG 
Sbjct: 303 HYTKTREDAVALALKAGLNMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGF 362

Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP 429
           F+G+P   PFG +G   VCS  HQ+LAL+AA+ GIVLL+N  G LPLPK+    LA+IGP
Sbjct: 363 FDGDPKSLPFGNLGPSDVCSKDHQMLALEAAKQGIVLLEN-RGDLPLPKTTVKKLAVIGP 421

Query: 430 NANSAKTLLGNYAGPSCRSITPLQALQNYV-ENTVYYPGCDTVACSSAS-IDKAVDIAKG 487
           NAN+ K ++ NYAG  C+  +P+Q LQ YV E  VY PGC  V C   + I  AV     
Sbjct: 422 NANATKVMISNYAGVPCKYTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSE 481

Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
           AD  VL++GLDQT E E LDRV+L LPG Q++L+  VA AAKK V+LV++  GP+DI+FA
Sbjct: 482 ADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFA 541

Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           K    I ++LW GYPGEAG  A+A+VIFGD+NP
Sbjct: 542 KNLSTIRAVLWVGYPGEAGGDAIAQVIFGDYNP 574


>gi|15230897|ref|NP_188596.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
 gi|259585724|sp|Q9LJN4.2|BXL5_ARATH RecName: Full=Probable beta-D-xylosidase 5; Short=AtBXL5; Flags:
           Precursor
 gi|332642747|gb|AEE76268.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
          Length = 781

 Score =  668 bits (1723), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/573 (56%), Positives = 415/573 (72%), Gaps = 6/573 (1%)

Query: 10  FPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
           F  L L   +L++ +  +Q  F+CD S P+T  + FC  +L    RA+DLVSRL+L EK+
Sbjct: 6   FVRLSLLIIALVSSLCESQKNFACDISAPATAKYGFCNVSLSYEARAKDLVSRLSLKEKV 65

Query: 70  SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
            QLVN A  +PRLG+P YEWWSEALHGV+ VG G+ FNGT+ GATSFP  ILTAASF++ 
Sbjct: 66  QQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAASFNTS 125

Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
           LW ++G+ +  EARA++N G A G+T+W+PN+N+FRDPRWGRGQETPGEDPLV  KYAV+
Sbjct: 126 LWLKMGEVVSTEARAMHNVGLA-GLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSKYAVN 184

Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
           YV+G+Q D  + GK + +L+ S+CCKH+TAYDLDNWKG  R+ FDA+VT QDL DTYQ P
Sbjct: 185 YVKGLQ-DVHDAGKSR-RLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQTP 242

Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
           F+SCV++G  S +MC+YNRVNGIP+CAD NLL    R QW   GYI SDCD++ + ++  
Sbjct: 243 FKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFNDI 302

Query: 310 GYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
            Y K+ EDAV   LKAG+++NCG FL K+T+ AVK KKL  S++D AL   + V MRLG 
Sbjct: 303 HYTKTREDAVALALKAGLNMNCGDFLGKYTENAVKLKKLNGSDVDEALIYNYIVLMRLGF 362

Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP 429
           F+G+P   PFG +G   VCS  HQ+LAL+AA+ GIVLL+N  G LPLPK+    LA+IGP
Sbjct: 363 FDGDPKSLPFGNLGPSDVCSKDHQMLALEAAKQGIVLLEN-RGDLPLPKTTVKKLAVIGP 421

Query: 430 NANSAKTLLGNYAGPSCRSITPLQALQNYV-ENTVYYPGCDTVACSSAS-IDKAVDIAKG 487
           NAN+ K ++ NYAG  C+  +P+Q LQ YV E  VY PGC  V C   + I  AV     
Sbjct: 422 NANATKVMISNYAGVPCKYTSPIQGLQKYVPEKIVYEPGCKDVKCGDQTLISAAVKAVSE 481

Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
           AD  VL++GLDQT E E LDRV+L LPG Q++L+  VA AAKK V+LV++  GP+DI+FA
Sbjct: 482 ADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFA 541

Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           K    I ++LW GYPGEAG  A+A+VIFGD+NP
Sbjct: 542 KNLSTIRAVLWVGYPGEAGGDAIAQVIFGDYNP 574


>gi|253761860|ref|XP_002489304.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
 gi|241946952|gb|EES20097.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
          Length = 750

 Score =  667 bits (1722), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 326/556 (58%), Positives = 410/556 (73%), Gaps = 16/556 (2%)

Query: 26  STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
           ++ P FSC PS+PS   +PFC  +LP ++RA DLVSRLT+ EK+SQL + A  +PRLG+P
Sbjct: 14  ASDPLFSCGPSSPS-RAYPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLGVP 72

Query: 86  AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
            Y+WWSE LHG+A  G G+ FNGT+ G TSFPQV+LT ASFD  LW+RIGQAIG EARAL
Sbjct: 73  PYKWWSEGLHGLAFWGHGMRFNGTVTGVTSFPQVLLTTASFDDGLWFRIGQAIGREARAL 132

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
           YN GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V  KYAV++VRG+QG +       
Sbjct: 133 YNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSSAA--GAA 190

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
             LQASACCKH TAYDL++W G  RY FDARVT QDLADT+ PPF+SCV  G+A+ +MCA
Sbjct: 191 APLQASACCKHATAYDLEDWNGVARYNFDARVTAQDLADTFNPPFQSCVVDGKATCVMCA 250

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
           Y  +NG+P+CA  +LL+KT R  WG  GY++SDCDAV+I++DA+ Y  +PED V   LK 
Sbjct: 251 YTGINGVPACASSDLLTKTFRGAWGHDGYVSSDCDAVAIMHDAQRYVPTPEDTVAVALK- 309

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGA 384
                      +H  AA++Q K+ E ++D+AL NLF+VRMRLG F+G+P     +G +GA
Sbjct: 310 -----------EHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRGNALYGHLGA 358

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
             VC+  H+ LAL+AAQDGIVLLKN  G+LPL +S   S A+IG NAN A  L GNY GP
Sbjct: 359 ADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAMGSAAVIGHNANDALVLRGNYFGP 418

Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
           +C + TPLQ +Q+YV N  +  GC + AC  A+  +A  +A  +++V L MGL Q QEKE
Sbjct: 419 ACETTTPLQGVQSYVSNVRFLAGCSSAACGYAATGQAAALASSSEYVFLFMGLSQDQEKE 478

Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
            LDR  L+LPG+QQ LIT VA AAK+PVILVLL GGPVDITFA+ +  IG+ILWAGYPG+
Sbjct: 479 GLDRTSLLLPGKQQSLITAVASAAKRPVILVLLTGGPVDITFAQSNPKIGAILWAGYPGQ 538

Query: 565 AGAVALAEVIFGDHNP 580
           AG +A+A V+FGDHNP
Sbjct: 539 AGGLAIARVLFGDHNP 554


>gi|15238197|ref|NP_196618.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
 gi|75264319|sp|Q9LXA8.1|BXL6_ARATH RecName: Full=Probable beta-D-xylosidase 6; Short=AtBXL6; Flags:
           Precursor
 gi|7671447|emb|CAB89387.1| beta-xylosidase-like protein [Arabidopsis thaliana]
 gi|15982753|gb|AAL09717.1| AT5g10560/F12B17_90 [Arabidopsis thaliana]
 gi|332004180|gb|AED91563.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
          Length = 792

 Score =  666 bits (1719), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/588 (55%), Positives = 417/588 (70%), Gaps = 26/588 (4%)

Query: 15  LCFTSLLTR----VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
           L FTS +      +DS  P F C P  P   ++PFC  +L I QRA  LVS L L EKI 
Sbjct: 11  LFFTSAIAETFKNLDS-HPQFPCKP--PHFSSYPFCNVSLSIKQRAISLVSLLMLPEKIG 67

Query: 71  QLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYL 130
           QL N+A ++PRLGIP YEWWSE+LHG+A  G G+ FNG+I  ATSFPQVI++AASF+  L
Sbjct: 68  QLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAASFNRTL 127

Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
           WY IG A+ +E RA+YN GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP V  +Y V +
Sbjct: 128 WYEIGSAVAVEGRAMYNGGQA-GLTFWAPNINVFRDPRWGRGQETPGEDPKVVSEYGVEF 186

Query: 191 VRGVQ----------------GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
           VRG Q                 D  +     GKL  SACCKHFTAYDL+ W   TRY F+
Sbjct: 187 VRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNFTRYDFN 246

Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
           A VT QD+ DTYQPPFE+C++ G+AS +MC+YN VNG+P+CA  +LL K AR +WGF GY
Sbjct: 247 AVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQK-ARVEWGFEGY 305

Query: 295 ITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEID 354
           ITSDCDAV+ I+  +GY KSPE+AV D +KAG+D+NCG+++ +HT++A++Q K+ E  +D
Sbjct: 306 ITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKVSEELVD 365

Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
           RAL NLF+V++RLGLF+G+P    +GK+G++ +CS  H+ LAL+A + GIVLLKN H LL
Sbjct: 366 RALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLKNDHKLL 425

Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACS 474
           PL K+   SLA++GP AN+   + G Y G  C+  T    L  YV+ T Y  GC  V+C 
Sbjct: 426 PLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGCSDVSCD 485

Query: 475 SAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
           S +   +AV IAKGAD V+++ GLD +QE E+ DRV L LPG+Q++L++ VA  +KKPVI
Sbjct: 486 SDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAVSKKPVI 545

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           LVL  GGPVD+TFAK D  IGSI+W GYPGE G  ALAE+IFGD NPG
Sbjct: 546 LVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPG 593


>gi|225437531|ref|XP_002270249.1| PREDICTED: probable beta-D-xylosidase 2 [Vitis vinifera]
 gi|297743965|emb|CBI36935.3| unnamed protein product [Vitis vinifera]
          Length = 768

 Score =  666 bits (1718), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 309/569 (54%), Positives = 412/569 (72%), Gaps = 10/569 (1%)

Query: 15  LCFTSLLTRVD-STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLV 73
           L F  +L  V    + PF+CDP + +   FPFC+ ++ I +R +DL+ RLTL+EK+  LV
Sbjct: 12  LIFLVVLAVVSGEARDPFACDPKDGANAGFPFCRKSIGIGERVKDLIGRLTLEEKVRLLV 71

Query: 74  NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYR 133
           N+A  +PRLGI  YEWWSEALHGV+ VG G  F+G   GATSFPQVI TAASF+S LW  
Sbjct: 72  NNAAGVPRLGIKGYEWWSEALHGVSNVGPGTKFSGDFPGATSFPQVITTAASFNSSLWEA 131

Query: 134 IGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRG 193
           IGQ +  EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP++ GKYA  YVRG
Sbjct: 132 IGQVVSDEARAMYNGGAA-GLTFWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAARYVRG 190

Query: 194 VQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESC 253
           +QG+  +      +L+ +ACCKHFTAYDLDNW G  R+ FDARV+ Q++ DT+  PF SC
Sbjct: 191 LQGNAGD------RLKVAACCKHFTAYDLDNWNGVDRFHFDARVSKQEMEDTFDVPFRSC 244

Query: 254 VKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK 313
           V +G+ + +MC+YN+VNG+P+CAD NLL  T R+QW  +GY+ SDCD+V + YD + Y  
Sbjct: 245 VVEGKVASVMCSYNQVNGVPTCADPNLLRNTVRKQWHLNGYVVSDCDSVGVFYDNQHYTN 304

Query: 314 SPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
           +PE+A  D +KAG+D++CG FL  HT+ A+K+  + E+++D AL N  +V+MRLG+F+G 
Sbjct: 305 TPEEAAADAIKAGLDLDCGPFLAVHTQDAIKKGLVSEADVDSALVNTVTVQMRLGMFDGE 364

Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
           P+ QPFG +G   VCSPAHQ LA++AA+ GIVLLKN    LPL      S+A+IGPN+++
Sbjct: 365 PSAQPFGDLGPKDVCSPAHQELAIEAARQGIVLLKNHGHSLPLSTRSHRSIAVIGPNSDA 424

Query: 434 AKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVV 492
             T++GNYAG  C   TPLQ +  Y   T++  GC  VACS   +   A+D A  AD  V
Sbjct: 425 NVTMIGNYAGIPCEYTTPLQGIGRY-SRTIHQKGCADVACSEDQLFAGAIDAASQADATV 483

Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
           L+MGLDQ+ E E  DR DL+LPGRQQEL+++VA A++ P +LVL+ GGPVD++FAK D  
Sbjct: 484 LVMGLDQSIEAEAKDRADLLLPGRQQELVSKVAMASRGPTVLVLMSGGPVDVSFAKKDPR 543

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           I +I+WAGYPG+AG  A+A+++FG  NPG
Sbjct: 544 IAAIVWAGYPGQAGGAAIADILFGVANPG 572


>gi|413925164|gb|AFW65096.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 829

 Score =  664 bits (1713), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/553 (57%), Positives = 405/553 (73%), Gaps = 3/553 (0%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           PPFSC    PS    PFC T LP +QRA DLVSR+T  EK SQL + A  +PRLG+P+Y+
Sbjct: 84  PPFSCG-GGPSL-GLPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSYK 141

Query: 89  WWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
           WW+EALHGVA  GKGI  + G +R ATSFPQV+LTAASF+  LW+RIGQA G EARA YN
Sbjct: 142 WWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFYN 201

Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
            GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V  +YA ++VRG+QG + N   +   
Sbjct: 202 IGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQGSSSNTKSVPPV 261

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           L  SACCKH TAYDL++WKG TRY F A VT+QDLADT+ PPF SCV  G+AS +MCAY 
Sbjct: 262 LLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYT 321

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
            VNG+PSCA+ +LL+KT R  WG  GY+ +DCDAVSI+ +++ Y  + ED V   LKAG+
Sbjct: 322 SVNGVPSCANADLLTKTFRGSWGLDGYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAGL 381

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D++CG ++Q+H  AA+++ KL + ++D+A+ NLF+ RMRLG F+G+P    +G +GA  +
Sbjct: 382 DIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAHI 441

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
           C+  H+ LAL+AA DGIVLLKNS G+LPL +    S A+IG NAN    LLGNY GP C 
Sbjct: 442 CTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPCA 501

Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
             TPLQ +Q YV+N  +  GC   AC+ A+  +A  +A  +D V+L MGL Q QE E  D
Sbjct: 502 PTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALASTSDSVILFMGLSQEQESEGKD 561

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R  L+LPG QQ LIT VA AAK+PVILVLL GGPVDITFA+ +  IG+ILWAGYPG+AG 
Sbjct: 562 RTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAGG 621

Query: 568 VALAEVIFGDHNP 580
           +A+A+V+FG+ NP
Sbjct: 622 LAIAKVLFGEKNP 634


>gi|297811163|ref|XP_002873465.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319302|gb|EFH49724.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 796

 Score =  664 bits (1713), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 326/598 (54%), Positives = 419/598 (70%), Gaps = 25/598 (4%)

Query: 4   HKLSLV-FPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSR 62
           H+L+L+               +DS  P F C P  P   ++PFC  +L I QRA  LVS 
Sbjct: 3   HQLTLISLVFFTSAIAETFKNLDS-HPQFPCKP--PHFSSYPFCNVSLSIKQRAISLVSL 59

Query: 63  LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
           LTL EKI QL  +A ++PRLGIP YEWWSE+LHG+A  G G+ FNG+I  ATSFPQVI++
Sbjct: 60  LTLPEKIGQLSTTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVS 119

Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLV 182
           AASF+  LWY IG A+ +EARA+YN GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP V
Sbjct: 120 AASFNRTLWYEIGSAVAVEARAMYNGGQA-GLTFWAPNINLFRDPRWGRGQETPGEDPKV 178

Query: 183 TGKYAVSYVRGVQGDT--------FNGGKLK----------GKLQASACCKHFTAYDLDN 224
             +Y V +VRG Q           F    +           GKL  SACCKHFTAYDL+ 
Sbjct: 179 VSEYGVEFVRGFQEKKKRKVLKTRFGSDNVDDDARYDDDADGKLMLSACCKHFTAYDLEK 238

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           W   TRY F+A VT QD+ DTYQPPFE+C+K G+AS +MC+YN VNG+P+CA  +LL K 
Sbjct: 239 WGNFTRYDFNAVVTEQDMEDTYQPPFETCIKDGKASCLMCSYNAVNGVPACAQGDLLQK- 297

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
           AR +WGF GYITSDCDAV+ I++ +GY KSPE+AV D +KAG+D+NCG+++ ++T++A++
Sbjct: 298 ARVEWGFDGYITSDCDAVATIFEYQGYTKSPEEAVADAIKAGVDINCGTYMLRNTQSAIE 357

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           Q K+ E  +DRAL NLF+V++RLGLF+G+P    +GK+G++ +CS  H+ LAL+AA+ GI
Sbjct: 358 QGKVSEELVDRALLNLFAVQLRLGLFDGDPRGGHYGKLGSNDICSSDHRKLALEAARQGI 417

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
           VLLKN + LLPL K+   SLA++GP AN+   + G Y G  C+  T    L  YV+ T Y
Sbjct: 418 VLLKNDYKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSY 477

Query: 465 YPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITR 523
             GC  V+C S     +AV IAKGAD V+++ GLD +QE E+ DR  L LPG+Q++L++ 
Sbjct: 478 ASGCSDVSCVSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRFSLSLPGKQKDLVSS 537

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           VA  +KKPVILVL  GGPVD+TFAK D  IGSI+W GYPGE G  ALAE+IFGD NPG
Sbjct: 538 VAAVSKKPVILVLTGGGPVDVTFAKTDPRIGSIIWIGYPGETGGQALAEIIFGDFNPG 595


>gi|302786124|ref|XP_002974833.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
 gi|300157728|gb|EFJ24353.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
          Length = 784

 Score =  664 bits (1712), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/561 (55%), Positives = 408/561 (72%), Gaps = 6/561 (1%)

Query: 23  RVDSTQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR 81
           R  + Q  ++CD  SN S  +FPFC T L I  R +DLVSRLTLDEK+ ++VN+A  IPR
Sbjct: 28  RASTAQLRYACDVSSNASLGSFPFCDTKLGIDVRVQDLVSRLTLDEKVDEMVNAAQGIPR 87

Query: 82  LGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLE 141
           LG+P+Y+WW EALHGVA    G+ F G    ATSFP  I TAASF+S L+Y IG+A+  E
Sbjct: 88  LGVPSYQWWQEALHGVAS-SPGVQFGGLAPAATSFPMPIATAASFNSTLFYSIGEAVSSE 146

Query: 142 ARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG 201
           ARAL+N G+A G+TFW+PN+NIFRDPRWGRGQETPGEDPL+  K+A  YVRG+QG  + G
Sbjct: 147 ARALHNLGRA-GLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYEG 205

Query: 202 GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
               G L+ SACCKH TAYD+DNWKG  RY F+A V+ QDL DTY PPF+SC++ GR S 
Sbjct: 206 SASDGFLKVSACCKHLTAYDVDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSS 265

Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVD 321
           +MC+YNRVNG+P+CADRNLL++T R  WGF+GYI SDCDA+ ++++   YA S EDAV D
Sbjct: 266 VMCSYNRVNGVPTCADRNLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPSAEDAVAD 325

Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
            + AG+D+NCG+FL KH K+A++  K+ E+++D A+ NL   RMRLGLF+G+P  QP+  
Sbjct: 326 SILAGLDLNCGTFLGKHAKSALQAGKITEADLDHAVSNLMRTRMRLGLFDGDPNSQPYSS 385

Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
           +GA  +CS  HQ LAL AA  G+VLLKN  G LPL  +   ++ALIGPNAN+  T+LGNY
Sbjct: 386 LGATDICSNDHQQLALDAALQGVVLLKND-GSLPLSTALK-TVALIGPNANATYTMLGNY 443

Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQT 500
            G  C+ I+PLQ +Q Y  N +Y PGC  VAC+    +  AV++A  AD VVL++GLDQ+
Sbjct: 444 EGIPCKYISPLQGMQIYSSNILYSPGCRNVACNEGDLVASAVEVATKADAVVLVVGLDQS 503

Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
           QE+E  DR  L+LPG Q +L++ +A A   P++LV++  GPVDI+  K +  I S++W G
Sbjct: 504 QERETFDRTSLLLPGMQSQLVSNIANAVTSPIVLVIMSAGPVDISTFKDNSRISSVIWLG 563

Query: 561 YPGEAGAVALAEVIFGDHNPG 581
           YPG++G  ALA V+FG +NPG
Sbjct: 564 YPGQSGGAALAHVVFGAYNPG 584


>gi|125534137|gb|EAY80685.1| hypothetical protein OsI_35867 [Oryza sativa Indica Group]
          Length = 779

 Score =  662 bits (1708), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/555 (56%), Positives = 404/555 (72%), Gaps = 9/555 (1%)

Query: 28  QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
            P F+C P++ + + F FC   LP  QRA DLV+RLT  EK+ QL + AP +PRLGIP Y
Sbjct: 34  NPGFTCGPAS-AQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVY 92

Query: 88  EWWSEALHGVAGVGKGIFF-NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           +WWSEALHG+A  GKGI F NG  R ATSFPQVI TAA+FD  LW+RIGQAIG E RA Y
Sbjct: 93  KWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFY 152

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
           N GQA G+  W+PN+NIFRDPRWGRGQETPGEDP    KY  ++V+G+QG +        
Sbjct: 153 NLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLT------ 206

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            LQ SACCKH TAYD++ WKG +RY F+A+VT QDLADTY PPF SCV  G+AS IMCAY
Sbjct: 207 NLQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAY 266

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
             +NG+P+CA  +LL+KT R +W   GY  SDCDAV+I++ +E + ++ E+AV   LKAG
Sbjct: 267 TLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAG 326

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGAD 385
           +D+NCG ++Q++  +A++Q K+ E ++D+AL NLF++RMRLG F+G+P     +G++GA 
Sbjct: 327 LDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLGAA 386

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
            VC+P H+ LAL+AA+ G+VLLKN   LLPL      S A+IG NAN    LLGNY G  
Sbjct: 387 DVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVSSAAVIGHNANDILALLGNYYGLP 446

Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
           C + TP   +Q YV++  + PGC + AC  A+ D+A  +AK +D+V L+MGL Q QE+E 
Sbjct: 447 CETTTPFGGIQKYVKSAKFLPGCSSAACDVAATDQATALAKSSDYVFLVMGLSQKQEQEG 506

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
           LDR  L+LPG+QQ LIT VA A+K+PVIL+LL GGPVDITFA+ +  IG+ILWAGYPG+A
Sbjct: 507 LDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQA 566

Query: 566 GAVALAEVIFGDHNP 580
           G  A+A+V+FG+ NP
Sbjct: 567 GGQAIADVLFGEFNP 581


>gi|297611657|ref|NP_001067709.2| Os11g0291000 [Oryza sativa Japonica Group]
 gi|255680005|dbj|BAF28072.2| Os11g0291000 [Oryza sativa Japonica Group]
          Length = 764

 Score =  662 bits (1708), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/538 (59%), Positives = 403/538 (74%), Gaps = 11/538 (2%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           FC   L   QRA DLV+ LTL EK+SQL + A  + RLG+PAYEWWSE LHG++  G+GI
Sbjct: 31  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
            FNGT+R  TSFPQVILTAA+FD+ LW R+G+A+G EARALYN GQA G+T W+PN+NIF
Sbjct: 91  RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRGQETPGEDP+   +YAV++V G+QG    GG      +ASACCKH TAYDLD 
Sbjct: 151 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGI---GG------EASACCKHATAYDLDY 201

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           W    RY +D++VT+QDL DTY PPF+SCV +G+A+ IMC YN +NG+P+CA  +LL+K 
Sbjct: 202 WNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKK 261

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R++WG +GY+ SDCDAV+ I DA  Y  SPED V   +K GMDVNCG++ Q H  AAV+
Sbjct: 262 VRQEWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQ 321

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGADVVCSPAHQVLALQAAQDG 403
           +  L E +IDRAL NLF+VRMRLG F+G+P     +G +GA  VCSPAH+ LAL+AAQDG
Sbjct: 322 KGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDG 381

Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV-ENT 462
           IVLLKN  G LPL  S   SLA+IGPNA++   L GNY GP C + TPLQ ++ Y+ +  
Sbjct: 382 IVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRA 441

Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
            +  GCD+ AC+ A+ ++A  +A  +DHVVL MGL Q QE++ LDR  L+LPG QQ LIT
Sbjct: 442 RFLAGCDSPACAVAATNEAAALASSSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLIT 501

Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            VA AA++PVILVLL GGPVD+TFAK +  IG+ILWAGYPG+AG +A+A+V+FGDHNP
Sbjct: 502 AVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNP 559


>gi|255548487|ref|XP_002515300.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223545780|gb|EEF47284.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 768

 Score =  662 bits (1707), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 307/553 (55%), Positives = 408/553 (73%), Gaps = 9/553 (1%)

Query: 30  PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           PF+CD  + +T+  PFC+  LPI  R +DL+ RLTL EK+  LVN+A A+ RLGI  YEW
Sbjct: 27  PFACDSKDGTTKNLPFCQVKLPIQDRVKDLIGRLTLAEKVGLLVNNAGAVSRLGIKGYEW 86

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSEALHGV+ VG G  F G+  GATSFPQVI TAASF+S LW  IG+ +  EARA+YN G
Sbjct: 87  WSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNSTLWEAIGRVVSDEARAMYNGG 146

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            A G+T+W+PN+NI RDPRWGRGQETPGEDPL+ GKYA SYV+G+QG   N G+   +L+
Sbjct: 147 AA-GLTYWSPNVNILRDPRWGRGQETPGEDPLLVGKYAASYVKGLQG---NDGE---RLK 199

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
            +ACCKHFTAYDLDNW G  R+ F+A+V+ QD+ DT+  PF  CVK+G+ + +MC+YN+V
Sbjct: 200 VAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMKDTFDVPFRMCVKEGKVASVMCSYNQV 259

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           NGIP+CAD NLL KT R QWG +GYI SDCD+V + YD + Y  +PE+A  D +KAG+D+
Sbjct: 260 NGIPTCADPNLLRKTVRTQWGLNGYIVSDCDSVGVFYDKQHYTSTPEEAAADAIKAGLDL 319

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           +CG FL  HT+ AVK+  + E++++ AL N  +V+MRLG+F+G P+ QP+G +G   VC+
Sbjct: 320 DCGPFLAVHTQDAVKRGLISEADVNGALFNTLTVQMRLGMFDGEPSAQPYGNLGPKDVCT 379

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
           PAHQ LAL+A + GIVLLKN    LPL   +  ++A+IGPN+N   T++GNYAG +C+  
Sbjct: 380 PAHQELALEAGRQGIVLLKNHGPSLPLSPRRHRTVAIIGPNSNVTVTMIGNYAGVACQYT 439

Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           TPLQ + +Y + T++  GC  V C +  +   A+D A+ AD  VL+MGLDQ+ E E  DR
Sbjct: 440 TPLQGIGSYAK-TIHQQGCADVGCVTDQLFSGAIDAARQADATVLVMGLDQSIEAEFRDR 498

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
             L+LPGRQQEL+++VA A+K P ILVL+ GGP+D++FAK D  I +ILWAGYPG+AG  
Sbjct: 499 TGLLLPGRQQELVSKVAMASKGPTILVLMSGGPIDVSFAKKDPKIAAILWAGYPGQAGGA 558

Query: 569 ALAEVIFGDHNPG 581
           A+A+V+FG  NPG
Sbjct: 559 AIADVLFGTINPG 571


>gi|18378991|ref|NP_563659.1| beta-glucosidase [Arabidopsis thaliana]
 gi|75250279|sp|Q94KD8.1|BXL2_ARATH RecName: Full=Probable beta-D-xylosidase 2; Short=AtBXL2; Flags:
           Precursor
 gi|14194121|gb|AAK56255.1|AF367266_1 At1g02640/T14P4_11 [Arabidopsis thaliana]
 gi|23506063|gb|AAN28891.1| At1g02640/T14P4_11 [Arabidopsis thaliana]
 gi|332189332|gb|AEE27453.1| beta-glucosidase [Arabidopsis thaliana]
          Length = 768

 Score =  661 bits (1706), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/582 (53%), Positives = 417/582 (71%), Gaps = 10/582 (1%)

Query: 1   MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
           MI HK++ +  +L    +S    V S +  F+CD  + +T T  FC+ ++PI +R RDL+
Sbjct: 1   MILHKMAFLAVILFFLISSSSVCVHSRET-FACDTKDAATATLRFCQLSVPIPERVRDLI 59

Query: 61  SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVI 120
            RLTL EK+S L N+A AIPRLGI  YEWWSEALHGV+ VG G  F G    ATSFPQVI
Sbjct: 60  GRLTLAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVI 119

Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
            T ASF++ LW  IG+ +  EARA+YN G   G+T+W+PN+NI RDPRWGRGQETPGEDP
Sbjct: 120 TTVASFNASLWESIGRVVSNEARAMYNGGVG-GLTYWSPNVNILRDPRWGRGQETPGEDP 178

Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
           +V GKYA SYVRG+QG+       + +L+ +ACCKHFTAYDLDNW G  R+ F+A+V+ Q
Sbjct: 179 VVAGKYAASYVRGLQGND------RSRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQ 232

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           D+ DT+  PF  CVK+G  + IMC+YN+VNG+P+CAD NLL KT R QWG +GYI SDCD
Sbjct: 233 DIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQWGLNGYIVSDCD 292

Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNL 360
           +V ++YD + Y  +PE+A  D +KAG+D++CG FL  HT  AVK+  L ES++D AL N 
Sbjct: 293 SVGVLYDTQHYTGTPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNALINT 352

Query: 361 FSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
            +V+MRLG+F+G+   QP+G +G   VC+P H+ LAL+AAQ GIVLLKN    LPL   +
Sbjct: 353 LTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQR 412

Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-D 479
             ++A+IGPN+++  T++GNYAG +C   +P+Q +  Y   T++  GC  V C    + D
Sbjct: 413 HRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGITGYAR-TIHQKGCVDVHCMDDRLFD 471

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
            AV+ A+GAD  VL+MGLDQ+ E E  DR  L+LPG+QQEL++RVA+AAK PVILVL+ G
Sbjct: 472 AAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAAKGPVILVLMSG 531

Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           GP+DI+FA+ DR I +I+WAGYPG+ G  A+A+++FG  NPG
Sbjct: 532 GPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPG 573


>gi|62734691|gb|AAX96800.1| Glycosyl hydrolase family 3 C terminal domain, putative [Oryza
           sativa Japonica Group]
 gi|77549994|gb|ABA92791.1| beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 853

 Score =  660 bits (1704), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/555 (55%), Positives = 403/555 (72%), Gaps = 9/555 (1%)

Query: 28  QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
            P F+C P++ + + F FC   LP  QRA DLV+RLT  EK+ QL + AP +PRLGIP Y
Sbjct: 108 NPGFTCGPAS-AQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVY 166

Query: 88  EWWSEALHGVAGVGKGIFF-NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           +WWSEALHG+A  GKGI F NG  R ATSFPQVI TAA+FD  LW+RIGQAIG E RA Y
Sbjct: 167 KWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFY 226

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
           N GQA G+  W+PN+NIFRDPRWGRGQETPGEDP    KY  ++V+G+QG +        
Sbjct: 227 NLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLT------ 280

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            LQ SACCKH TAYD++ WKG +RY F+A+VT QDLADTY PPF SCV  G+AS IMCAY
Sbjct: 281 NLQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAY 340

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
             +NG+P+CA  +LL+KT R +W   GY  SDCDAV+I++ +E + ++ E+AV   LKAG
Sbjct: 341 TLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAG 400

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGAD 385
           +D+NCG ++Q++  +A++Q K+ E ++D+AL NLF++RMRLG F+G+P     +G++ A 
Sbjct: 401 LDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAA 460

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
            VC+P H+ LAL+AA+ G+VLLKN   LLPL      S A+IG NAN    LLGNY G  
Sbjct: 461 DVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLP 520

Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
           C + TP   +Q YV++  + PGC + AC  A+ D+A  +AK +D+V L+MGL Q QE+E 
Sbjct: 521 CETTTPFGGIQKYVKSAKFLPGCSSAACDVAATDQATALAKSSDYVFLVMGLSQKQEQEG 580

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
           LDR  L+LPG+QQ LIT VA A+K+PVIL+LL GGPVDITFA+ +  IG+ILWAGYPG+A
Sbjct: 581 LDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQA 640

Query: 566 GAVALAEVIFGDHNP 580
           G  A+A+V+FG+ NP
Sbjct: 641 GGQAIADVLFGEFNP 655


>gi|115485163|ref|NP_001067725.1| Os11g0297300 [Oryza sativa Japonica Group]
 gi|113644947|dbj|BAF28088.1| Os11g0297300 [Oryza sativa Japonica Group]
          Length = 779

 Score =  660 bits (1703), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/555 (55%), Positives = 403/555 (72%), Gaps = 9/555 (1%)

Query: 28  QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
            P F+C P++ + + F FC   LP  QRA DLV+RLT  EK+ QL + AP +PRLGIP Y
Sbjct: 34  NPGFTCGPAS-AQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVY 92

Query: 88  EWWSEALHGVAGVGKGIFF-NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           +WWSEALHG+A  GKGI F NG  R ATSFPQVI TAA+FD  LW+RIGQAIG E RA Y
Sbjct: 93  KWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFY 152

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
           N GQA G+  W+PN+NIFRDPRWGRGQETPGEDP    KY  ++V+G+QG +        
Sbjct: 153 NLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLT------ 206

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            LQ SACCKH TAYD++ WKG +RY F+A+VT QDLADTY PPF SCV  G+AS IMCAY
Sbjct: 207 NLQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAY 266

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
             +NG+P+CA  +LL+KT R +W   GY  SDCDAV+I++ +E + ++ E+AV   LKAG
Sbjct: 267 TLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAG 326

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGAD 385
           +D+NCG ++Q++  +A++Q K+ E ++D+AL NLF++RMRLG F+G+P     +G++ A 
Sbjct: 327 LDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAA 386

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
            VC+P H+ LAL+AA+ G+VLLKN   LLPL      S A+IG NAN    LLGNY G  
Sbjct: 387 DVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLP 446

Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
           C + TP   +Q YV++  + PGC + AC  A+ D+A  +AK +D+V L+MGL Q QE+E 
Sbjct: 447 CETTTPFGGIQKYVKSAKFLPGCSSAACDVAATDQATALAKSSDYVFLVMGLSQKQEQEG 506

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
           LDR  L+LPG+QQ LIT VA A+K+PVIL+LL GGPVDITFA+ +  IG+ILWAGYPG+A
Sbjct: 507 LDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQA 566

Query: 566 GAVALAEVIFGDHNP 580
           G  A+A+V+FG+ NP
Sbjct: 567 GGQAIADVLFGEFNP 581


>gi|413925166|gb|AFW65098.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 830

 Score =  659 bits (1700), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/554 (57%), Positives = 405/554 (73%), Gaps = 4/554 (0%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           PPFSC    PS    PFC T LP +QRA DLVSR+T  EK SQL + A  +PRLG+P+Y+
Sbjct: 84  PPFSCG-GGPSL-GLPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSYK 141

Query: 89  WWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
           WW+EALHGVA  GKGI  + G +R ATSFPQV+LTAASF+  LW+RIGQA G EARA YN
Sbjct: 142 WWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFYN 201

Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
            GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V  +YA ++VRG+QG + N   +   
Sbjct: 202 IGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQGSSSNTKSVPPV 261

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           L  SACCKH TAYDL++WKG TRY F A VT+QDLADT+ PPF SCV  G+AS +MCAY 
Sbjct: 262 LLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYT 321

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHG-YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
            VNG+PSCA+ +LL+KT R  WG  G Y+ +DCDAVSI+ +++ Y  + ED V   LKAG
Sbjct: 322 SVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAG 381

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
           +D++CG ++Q+H  AA+++ KL + ++D+A+ NLF+ RMRLG F+G+P    +G +GA  
Sbjct: 382 LDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAH 441

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           +C+  H+ LAL+AA DGIVLLKNS G+LPL +    S A+IG NAN    LLGNY GP C
Sbjct: 442 ICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPC 501

Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
              TPLQ +Q YV+N  +  GC   AC+ A+  +A  +A  +D V+L MGL Q QE E  
Sbjct: 502 APTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALASTSDSVILFMGLSQEQESEGK 561

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  L+LPG QQ LIT VA AAK+PVILVLL GGPVDITFA+ +  IG+ILWAGYPG+AG
Sbjct: 562 DRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAG 621

Query: 567 AVALAEVIFGDHNP 580
            +A+A+V+FG+ NP
Sbjct: 622 GLAIAKVLFGEKNP 635


>gi|183579871|dbj|BAG28345.1| arabinofuranosidase [Citrus unshiu]
          Length = 769

 Score =  659 bits (1699), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/567 (54%), Positives = 412/567 (72%), Gaps = 12/567 (2%)

Query: 16  CFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS 75
           CF S   R   ++ PF+CDP N  T +  FC+T++PI  R +DL+ RLTL EKI  LVN+
Sbjct: 16  CFGS---RFVESRVPFACDPRNGLTRSLRFCRTSVPIHVRVQDLIGRLTLQEKIRLLVNN 72

Query: 76  APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIG 135
           A A+PRLGI  YEWWSEALHGV+ VG G  F G   GATSFPQVI TAA+F+  LW  IG
Sbjct: 73  AAAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAAAFNESLWEEIG 132

Query: 136 QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ 195
           + +  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ GKYA SYVR +Q
Sbjct: 133 RVVSDEARAMYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRRLQ 191

Query: 196 GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVK 255
           G+T        +L+ +ACCKH+TAYDLDNW G  RY F+ARV+ QDL DTY  PF++CV 
Sbjct: 192 GNT------GSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVV 245

Query: 256 QGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP 315
           +G+ + +MC+YN+VNG P+CAD ++L  T R QW   GYI SDCD+V ++Y+ + Y ++P
Sbjct: 246 EGKVASVMCSYNQVNGKPTCADPDILKNTIRGQWRLDGYIVSDCDSVGVLYNTQHYTRTP 305

Query: 316 EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
           E+A  D +KAG+D++CG FL  HT+ AV+   L E +++ A     +V+MRLG+F+G P+
Sbjct: 306 EEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLASAYTITVQMRLGMFDGEPS 365

Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
            QPFG +G   VC+PAHQ LALQAA  GIVLLKNS   LPL   +  ++A+IGPN++   
Sbjct: 366 AQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTV 425

Query: 436 TLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI-AKGADHVVLM 494
           T++GNYAG +C   TPLQ +  Y + T++  GC  VAC+   +  A ++ A+ AD  VL+
Sbjct: 426 TMIGNYAGVACGYTTPLQGISRYAK-TIHQAGCLGVACNGNQLIGAAEVAARQADATVLV 484

Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
           MGLDQ+ E E +DR  L+LPGRQQEL++RVA+A++ PV+LVL+CGGPVD++FAK D  IG
Sbjct: 485 MGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIG 544

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPG 581
           +ILW GYPG+AG  A+A+V+FG  NPG
Sbjct: 545 AILWVGYPGQAGGAAIADVLFGRANPG 571


>gi|224058158|ref|XP_002299457.1| predicted protein [Populus trichocarpa]
 gi|222846715|gb|EEE84262.1| predicted protein [Populus trichocarpa]
          Length = 780

 Score =  659 bits (1699), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/560 (57%), Positives = 411/560 (73%), Gaps = 10/560 (1%)

Query: 28  QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
            P F C P  P+  T+ FC  +LPI++RA+ L+S LTL EKI QL ++A  IPRLGIP Y
Sbjct: 27  NPQFPCKP--PTHNTYSFCNKSLPITRRAQSLISHLTLQEKIQQLSDNASGIPRLGIPHY 84

Query: 88  EWWSEALHGVAGVGKGIFFN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
           EWWSE+LHG++  G G+ F   G +  AT FPQVI++AASF+  LW+ IG AI +EARA+
Sbjct: 85  EWWSESLHGISINGPGVSFKNGGPVTSATGFPQVIVSAASFNRTLWFLIGSAIAIEARAM 144

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF---NGG 202
           YN GQA G+TFWAPNINIFRDPRWGRGQETPGEDP+V   YA+ +V+G QG  +   +G 
Sbjct: 145 YNVGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAIEFVKGFQGGHWKNEDGE 203

Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
               KL  SACCKH TAYDL+ W   +RY F+A VT QD+ DTYQPPF SC+++G+AS +
Sbjct: 204 INDDKLMLSACCKHSTAYDLEKWGNFSRYSFNAVVTEQDMEDTYQPPFRSCIQKGKASCL 263

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           MC+YN VNG+P+CA  +LL K  R +WGF GYITSDCDAV+ I++ + Y+KSPEDAV   
Sbjct: 264 MCSYNEVNGVPACAREDLLQK-PRTEWGFKGYITSDCDAVATIFEYQNYSKSPEDAVAIA 322

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           LKAGMD+NCG+++ ++ ++AV++ KL E +IDRALHNLFSV++RLGLF+G+P    FGK+
Sbjct: 323 LKAGMDINCGTYVLRNAQSAVEKGKLQEEDIDRALHNLFSVQLRLGLFDGDPRKGQFGKL 382

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
           G   VC+  H+ LAL+AA+ GIVLLKN   LLPL K    SLA+IGP AN A +L G+Y 
Sbjct: 383 GPKNVCTKEHKTLALEAARQGIVLLKNDKKLLPLNKKAVSSLAIIGPLANMANSLGGDYT 442

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQ 501
           G  C   +  + L+ YV+ T Y  GC  VAC S     KA+ +AK AD V+++ GLD +Q
Sbjct: 443 GYPCDPQSLFEGLKAYVKKTSYAIGCLDVACVSDTQFHKAIIVAKRADFVIIVAGLDLSQ 502

Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
           E EE DRV L+LPG+Q  L++ VA A+KKPVILVL  GGP+D++FAK D  I SILW GY
Sbjct: 503 ETEEHDRVSLLLPGKQMSLVSSVAAASKKPVILVLTGGGPLDVSFAKGDPRIASILWIGY 562

Query: 562 PGEAGAVALAEVIFGDHNPG 581
           PGEAGA ALAE+IFG++NPG
Sbjct: 563 PGEAGAKALAEIIFGEYNPG 582


>gi|357442285|ref|XP_003591420.1| Beta xylosidase [Medicago truncatula]
 gi|355480468|gb|AES61671.1| Beta xylosidase [Medicago truncatula]
          Length = 765

 Score =  658 bits (1697), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 307/571 (53%), Positives = 407/571 (71%), Gaps = 10/571 (1%)

Query: 12  LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
           L+ + F  LL   ++ + PF+CDP N ST  FPFCK +LPI  R  DL+ RLTL EK+S 
Sbjct: 5   LITIVFLLLLMSSEA-RDPFACDPKNTSTNNFPFCKASLPIPTRVNDLIGRLTLQEKVSM 63

Query: 72  LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
           LVN+A A+PR+GI  YEWWSEALHGV+ VG G  F G    ATSFPQVI T ASF++ LW
Sbjct: 64  LVNNAAAVPRVGIKGYEWWSEALHGVSNVGPGTKFAGQFPAATSFPQVITTVASFNASLW 123

Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
             IG+    EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ GKYA SYV
Sbjct: 124 EAIGRVASDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYV 182

Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
           RG+QG          +L+ +A CKHFTAYDLDNW G  R+ F+A+V+ QD+ DT+  PF 
Sbjct: 183 RGLQGTD------SSRLKVAASCKHFTAYDLDNWNGVDRFHFNAKVSKQDMEDTFNVPFR 236

Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
            CVK+G  + +MC+YN+VNG+P+CAD NLL +T R QW   GYI SDCD+V + Y  + Y
Sbjct: 237 MCVKEGNVASVMCSYNQVNGVPTCADPNLLKRTIRGQWHLDGYIVSDCDSVGVFYTNQHY 296

Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
             +PE+A  D +KAG+D++CG FL +HT+ AVK+  L E++++ AL N  +V+MRLG+F+
Sbjct: 297 TSTPEEAAADAIKAGLDLDCGPFLAQHTQNAVKKGLLTETDVNGALANTLTVQMRLGMFD 356

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
           G P+ QP+G +G   VC+P HQ LAL AA+ GIVLLKN+   LPL      ++A+IGPN+
Sbjct: 357 GEPSAQPYGNLGPTDVCTPTHQELALDAARQGIVLLKNTGPSLPLSTKNHQTVAVIGPNS 416

Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACS-SASIDKAVDIAKGADH 490
           N+  T++GNYAG +C   +PLQ +  Y   T++ PGC  VAC+       A++ A+ AD 
Sbjct: 417 NATVTMIGNYAGIACGYTSPLQGIGKYAR-TIHEPGCANVACNDDKQFGSALNAARQADA 475

Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
            VL+MGLDQ+ E E +DR  L+LPG QQ+L+++VA A++ P ILVL+ GGP+DITFAK D
Sbjct: 476 TVLVMGLDQSIEAEMVDRTGLLLPGHQQDLVSKVAAASRGPTILVLMSGGPIDITFAKND 535

Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             I  ILWAGYPG+AG  A+A+++FG  NPG
Sbjct: 536 PRIMGILWAGYPGQAGGAAIADILFGTTNPG 566


>gi|302760655|ref|XP_002963750.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
 gi|300169018|gb|EFJ35621.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
          Length = 785

 Score =  658 bits (1697), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 308/561 (54%), Positives = 408/561 (72%), Gaps = 6/561 (1%)

Query: 23  RVDSTQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR 81
           RV + QP ++CD  SN S  +FPFC T L +  R +DLVSRLTLDEK+ ++VN+A  IPR
Sbjct: 29  RVSTAQPRYACDVSSNASLGSFPFCDTKLGVDVRVQDLVSRLTLDEKVDEMVNAAQGIPR 88

Query: 82  LGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLE 141
           LG+P+Y+WW EALHGVA    G+ F G    ATSFP  I  AASF+S L+Y IG+A+  E
Sbjct: 89  LGVPSYQWWQEALHGVAS-SPGVQFGGLAPAATSFPMPIAMAASFNSTLFYSIGEAVSSE 147

Query: 142 ARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG 201
           ARAL+N G+A G+TFW+PN+NIFRDPRWGRGQETPGEDPL+  K+A  YVRG+QG  + G
Sbjct: 148 ARALHNLGRA-GLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYGG 206

Query: 202 GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
               G L+ SACCKH TAYD+DNWKG  RY F+A V+ QDL DTY PPF+SC++ GR S 
Sbjct: 207 SASDGFLKVSACCKHLTAYDMDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSS 266

Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVD 321
           +MC+YNRVNG+P+CADR+LL++T R  WGF+GYI SDCDA+ ++++   YA S EDAV D
Sbjct: 267 VMCSYNRVNGVPTCADRSLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPSAEDAVAD 326

Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
            + AG+D+NCG+FL KH K+A++  K+ E+++D A+ NL   RMRLGLF+G+   +P+  
Sbjct: 327 SILAGLDLNCGTFLGKHAKSALQAGKVTEADLDHAISNLMRTRMRLGLFDGDLNTRPYSS 386

Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
           +GA  +CS  HQ LAL AA  G+VLLKN  G LPL  +   ++ALIGPNAN+  T+LGNY
Sbjct: 387 LGATDICSNDHQQLALDAALQGVVLLKND-GSLPLSTALK-TVALIGPNANATYTMLGNY 444

Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQT 500
            G  C+ ++PLQ +Q Y  N +Y PGC  VACS    +  AV++A  AD VVL++GLDQ+
Sbjct: 445 EGIPCKYVSPLQGMQIYNNNILYSPGCRDVACSEGDLVASAVEVATKADAVVLVVGLDQS 504

Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
           QE+E  DR  L+LPG Q +L++ +A A   P++LV++  GPVDI+  K +  I S++W G
Sbjct: 505 QERETFDRTSLLLPGMQSQLVSNIANAVTCPIVLVIMSAGPVDISTFKDNSRISSVIWIG 564

Query: 561 YPGEAGAVALAEVIFGDHNPG 581
           YPG++G  ALA V+FG +NPG
Sbjct: 565 YPGQSGGAALAHVVFGAYNPG 585


>gi|86553064|gb|AAS17751.2| beta xylosidase [Fragaria x ananassa]
          Length = 772

 Score =  656 bits (1692), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/578 (53%), Positives = 420/578 (72%), Gaps = 10/578 (1%)

Query: 4   HKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
           +KLSL+   L LC ++LL  +   +PPF+CDP NP T  F FC+T +P+  R +DL+ RL
Sbjct: 7   NKLSLI--ALVLCVSALLFNLVHARPPFACDPRNPLTRGFKFCRTRVPVHVRVQDLIGRL 64

Query: 64  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
           TL EKI  LVN+A A+PRLGI  YEWWSEALHGV+ VG G  F G   GATSFPQVI TA
Sbjct: 65  TLQEKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTA 124

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
           ASF+  LW  IGQ +  EARA+YN GQA G+T+W+PN+NIFRDPRWGRGQETPGEDP+++
Sbjct: 125 ASFNQSLWQEIGQVVSDEARAMYNGGQA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLS 183

Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
            KYA SYV+G+QGD         +L+ +ACCKH+TAYDLDNW G  R+ F+ARV+ QDLA
Sbjct: 184 AKYAASYVKGLQGD-----GAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLA 238

Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
           DTY  PF  CV +G+ + +MC+YN+VNG P+CAD +LL  T R +W  +GYI SDCD+V 
Sbjct: 239 DTYDVPFRGCVLEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGEWKLNGYIVSDCDSVG 298

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
           + YD + Y ++PE+A  + +KAG+D++CG FL  HT+ A+K   LPE ++D AL N  +V
Sbjct: 299 VFYDQQHYTRTPEEAAAEAIKAGLDLDCGPFLAIHTEGAIKAGLLPEIDVDYALANTLTV 358

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
           +MRLG+F+G P+ Q +G +G   VC+PAHQ LAL+A++ GIVLL+N+   LPL   +  +
Sbjct: 359 QMRLGMFDGEPSAQQYGNLGPRDVCTPAHQELALEASRQGIVLLQNNGHTLPLSTVRHRT 418

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAV 482
           +A++GPN++  +T++GNYAG +C   TPLQ +  Y + T++  GC  VAC++  +   A 
Sbjct: 419 VAVVGPNSDVTETMIGNYAGVACGYTTPLQGIGRYTK-TIHQQGCTNVACTTNQLFGAAE 477

Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
             A+ AD  VL+MGLDQ+ E E  DR DLV+PG QQEL++RVA A++ P +LVL+ GGP+
Sbjct: 478 AAARQADATVLVMGLDQSIEAEFRDRTDLVMPGHQQELVSRVARASRGPTVLVLMSGGPI 537

Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           D++FAK D  IG+I+W GYPG+AG  A+A+V+FG  NP
Sbjct: 538 DVSFAKNDPKIGAIIWVGYPGQAGGTAMADVLFGTTNP 575


>gi|302791321|ref|XP_002977427.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
 gi|300154797|gb|EFJ21431.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
          Length = 772

 Score =  655 bits (1691), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 304/553 (54%), Positives = 413/553 (74%), Gaps = 11/553 (1%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           ++  + +FPFC  +LP+  R  DLV R+ L EKI+Q+V++A  IPRLGIP Y+WW EALH
Sbjct: 26  TDSRSSSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALH 85

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT 155
           GVA    G+ F   +  ATSFPQVILT ASF+S LW +I QAI +EA A+YNAG++ G+T
Sbjct: 86  GVAE-SPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRS-GLT 143

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK----LKG---KL 208
           FW+PNINIFRDPRWGRGQETPGEDPL++ KYA  +VRG+Q   ++ G     ++G   +L
Sbjct: 144 FWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQGSPTRL 203

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           + S+CCKHFTAYD++  +GT  + F+A+VT+QDL DT+ PPF SC+  G+ASG+MC+YNR
Sbjct: 204 KVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCSYNR 263

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG+PSCAD   L++T R  WGF GYI SDCDAV+++Y+   Y  + EDAV DVL AGMD
Sbjct: 264 VNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINYTTTAEDAVADVLSAGMD 323

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           +NCG+FL +HT AA++Q K+ E+ +DRAL N+ +VRMRLGLF+GN + + +  IG D VC
Sbjct: 324 LNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGN-SGETYNSIGPDAVC 382

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           +P H+ L+L+AA+ GIVLLKNS  +LP P++  +++A+IGP+ N+ +T+LGNYAG  C+ 
Sbjct: 383 TPEHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVPCQY 442

Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASID-KAVDIAKGADHVVLMMGLDQTQEKEELD 507
           ITP Q LQ Y +  V+ PGC  + C+  ++   AV  A+ +D VV+++GLD+ QE+E LD
Sbjct: 443 ITPFQGLQEYTKCVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQEREGLD 502

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R  L+LPG QQ L+  V++ AK PVILV++ GGP+D+TFAK +  I ++LW GYPGEAG 
Sbjct: 503 RTSLLLPGNQQGLVLEVSKVAKGPVILVVMSGGPIDVTFAKENCKISNVLWVGYPGEAGG 562

Query: 568 VALAEVIFGDHNP 580
            A+A VIFGDHNP
Sbjct: 563 KAIARVIFGDHNP 575


>gi|449451581|ref|XP_004143540.1| PREDICTED: probable beta-D-xylosidase 6-like [Cucumis sativus]
          Length = 777

 Score =  655 bits (1691), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/564 (56%), Positives = 409/564 (72%), Gaps = 18/564 (3%)

Query: 31  FSCDPS-NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           F C+ S +PS   +PFC  +L  + RA+ LVS LTLDEKI QL N+A +IPRLGIP+Y+W
Sbjct: 20  FPCNSSLHPS---YPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQW 76

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSE LHG+A  G G+ FNG+I  ATSFPQV++TAASF+  LW+ IG AI +EARA++N G
Sbjct: 77  WSEGLHGIATNGPGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVG 136

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ-GDTFNGGKLK--- 205
           Q  G+T WAPNINIFRDPRWGRGQETPGEDP+V   Y++ +VRG+Q G+     +++   
Sbjct: 137 QC-GLTIWAPNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEV 195

Query: 206 -------GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
                  G L  SACCKHFTAYDL+ W   TRY FD+ VT QDL DTYQPPF SC++QG+
Sbjct: 196 LEEDNGMGSLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGK 255

Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
           AS +MC+YN VNG+P+CA+ +LL K AR  WG  GYITSDCDAV+ +Y+ + Y  +PEDA
Sbjct: 256 ASCLMCSYNAVNGVPACANPDLLKK-ARNDWGLKGYITSDCDAVATVYEYQKYTDTPEDA 314

Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
           + DVLKAGMD+NCG+F+ + TK+A+ Q K+ E E+D AL NLFSV+ RLG F+GNP    
Sbjct: 315 IADVLKAGMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGK 374

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
           FG++GA  VC+  H+ LAL+AA+ GIVLLKN +  LPL K+   SL +IG  AN +  LL
Sbjct: 375 FGELGAQDVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLL 434

Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGL 497
           G YAG  C  ++ ++  Q Y E   +  GC  V C+S +  + A+ IAK AD V+ + GL
Sbjct: 435 GGYAGVPCSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGL 494

Query: 498 DQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
           D +QE E+LDRV L+LPG+Q +L++ VA  +KKP+ILVL+ GGP+DI+FAK D  + SIL
Sbjct: 495 DASQETEDLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASIL 554

Query: 558 WAGYPGEAGAVALAEVIFGDHNPG 581
           W G PGEAG  ALAEVIFGD+NPG
Sbjct: 555 WIGNPGEAGGKALAEVIFGDYNPG 578


>gi|449496501|ref|XP_004160150.1| PREDICTED: probable beta-D-xylosidase 6-like, partial [Cucumis
           sativus]
          Length = 767

 Score =  655 bits (1690), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/564 (56%), Positives = 409/564 (72%), Gaps = 18/564 (3%)

Query: 31  FSCDPS-NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           F C+ S +PS   +PFC  +L  + RA+ LVS LTLDEKI QL N+A +IPRLGIP+Y+W
Sbjct: 10  FPCNSSLHPS---YPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQW 66

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSE LHG+A  G G+ FNG+I  ATSFPQV++TAASF+  LW+ IG AI +EARA++N G
Sbjct: 67  WSEGLHGIATNGPGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVG 126

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ-GDTFNGGKLK--- 205
           Q  G+T WAPNINIFRDPRWGRGQETPGEDP+V   Y++ +VRG+Q G+     +++   
Sbjct: 127 QC-GLTIWAPNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEV 185

Query: 206 -------GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
                  G L  SACCKHFTAYDL+ W   TRY FD+ VT QDL DTYQPPF SC++QG+
Sbjct: 186 LEEDNGMGSLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGK 245

Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
           AS +MC+YN VNG+P+CA+ +LL K AR  WG  GYITSDCDAV+ +Y+ + Y  +PEDA
Sbjct: 246 ASCLMCSYNAVNGVPACANPDLLKK-ARNDWGLKGYITSDCDAVATVYEYQKYTDTPEDA 304

Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
           + DVLKAGMD+NCG+F+ + TK+A+ Q K+ E E+D AL NLFSV+ RLG F+GNP    
Sbjct: 305 IADVLKAGMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGK 364

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
           FG++GA  VC+  H+ LAL+AA+ GIVLLKN +  LPL K+   SL +IG  AN +  LL
Sbjct: 365 FGELGAQDVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLL 424

Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGL 497
           G YAG  C  ++ ++  Q Y E   +  GC  V C+S +  + A+ IAK AD V+ + GL
Sbjct: 425 GGYAGVPCSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGL 484

Query: 498 DQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
           D +QE E+LDRV L+LPG+Q +L++ VA  +KKP+ILVL+ GGP+DI+FAK D  + SIL
Sbjct: 485 DASQETEDLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASIL 544

Query: 558 WAGYPGEAGAVALAEVIFGDHNPG 581
           W G PGEAG  ALAEVIFGD+NPG
Sbjct: 545 WIGNPGEAGGKALAEVIFGDYNPG 568


>gi|302786474|ref|XP_002975008.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
 gi|300157167|gb|EFJ23793.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
          Length = 772

 Score =  655 bits (1689), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 303/553 (54%), Positives = 411/553 (74%), Gaps = 11/553 (1%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           ++  + +FPFC  +LP+  R  DLV R+ L EKI+Q+V++A  IPRLGIP Y+WW EALH
Sbjct: 26  TDSRSSSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALH 85

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT 155
           GVA    G+ F   +  ATSFPQVILT ASF+S LW +I QAI +EA A+YNAG++ G+T
Sbjct: 86  GVAE-SPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRS-GLT 143

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG-------KLKGKL 208
           FW+PNINIFRDPRWGRGQETPGEDPL++ KYA  +VRG+Q   ++ G       +   +L
Sbjct: 144 FWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQRRPTRL 203

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           + S+CCKHFTAYD++  +GT  + F+A+VT+QDL DT+ PPF SC+  G+ASG+MC+YNR
Sbjct: 204 KVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCSYNR 263

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG+PSCAD   L++T R  WGF GYI SDCDAV+++Y+   Y  + EDAV DVL AGMD
Sbjct: 264 VNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINYTTTAEDAVADVLSAGMD 323

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           +NCG+FL +HT AA++Q K+ E+ +DRAL N+ +VRMRLGLF+GN + + +  IG D VC
Sbjct: 324 LNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGN-SGETYNSIGPDAVC 382

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           +  H+ L+L+AA+ GIVLLKNS  +LP P++  +++A+IGP+ N+ +T+LGNYAG  C+ 
Sbjct: 383 TREHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVPCQY 442

Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASID-KAVDIAKGADHVVLMMGLDQTQEKEELD 507
           ITP Q LQ Y +  V+ PGC  + C+  ++   AV  A+ +D VV+++GLD+ QE+E LD
Sbjct: 443 ITPFQGLQEYTKGVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQEREGLD 502

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R  L+LPG QQ+L+  V++ AK PVILV++ GGP+D+TFAK +  I S+LW GYPGEAG 
Sbjct: 503 RTSLLLPGYQQDLVLEVSKVAKGPVILVVMSGGPIDVTFAKGNCKISSVLWVGYPGEAGG 562

Query: 568 VALAEVIFGDHNP 580
            A+A VIFGDHNP
Sbjct: 563 KAIARVIFGDHNP 575


>gi|9972374|gb|AAG10624.1|AC022521_2 Similar to xylosidase [Arabidopsis thaliana]
          Length = 763

 Score =  653 bits (1685), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 304/556 (54%), Positives = 405/556 (72%), Gaps = 9/556 (1%)

Query: 27  TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
           ++  F+CD  + +T T  FC+ ++PI +R RDL+ RLTL EK+S L N+A AIPRLGI  
Sbjct: 21  SRETFACDTKDAATATLRFCQLSVPIPERVRDLIGRLTLAEKVSLLGNTAAAIPRLGIKG 80

Query: 87  YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           YEWWSEALHGV+ VG G  F G    ATSFPQVI T ASF++ LW  IG+ +  EARA+Y
Sbjct: 81  YEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVASFNASLWESIGRVVSNEARAMY 140

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
           N G   G+T+W+PN+NI RDPRWGRGQETPGEDP+V GKYA SYVRG+QG+       + 
Sbjct: 141 NGGVG-GLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGND------RS 193

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
           +L+ +ACCKHFTAYDLDNW G  R+ F+A+V+ QD+ DT+  PF  CVK+G  + IMC+Y
Sbjct: 194 RLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSY 253

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
           N+VNG+P+CAD NLL KT R QWG +GYI SDCD+V ++YD + Y  +PE+A  D +KAG
Sbjct: 254 NQVNGVPTCADPNLLKKTIRNQWGLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAG 313

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
           +D++CG FL  HT  AVK+  L ES++D AL N  +V+MRLG+F+G+   QP+G +G   
Sbjct: 314 LDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAH 373

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           VC+P H+ LAL+AAQ GIVLLKN    LPL   +  ++A+IGPN+++  T++GNYAG +C
Sbjct: 374 VCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVTMIGNYAGVAC 433

Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEKEE 505
              +P+Q +  Y   T++  GC  V C    + D AV+ A+GAD  VL+MGLDQ+ E E 
Sbjct: 434 GYTSPVQGITGYAR-TIHQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEF 492

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            DR  L+LPG+QQEL++RVA+AAK PVILVL+ GGP+DI+FA+ DR I +I+WAGYPG+ 
Sbjct: 493 KDRNSLLLPGKQQELVSRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQE 552

Query: 566 GAVALAEVIFGDHNPG 581
           G  A+A+++FG  NPG
Sbjct: 553 GGTAIADILFGSANPG 568


>gi|357485313|ref|XP_003612944.1| Beta-D-xylosidase [Medicago truncatula]
 gi|355514279|gb|AES95902.1| Beta-D-xylosidase [Medicago truncatula]
          Length = 783

 Score =  652 bits (1682), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/583 (54%), Positives = 419/583 (71%), Gaps = 25/583 (4%)

Query: 12  LLCLCFTSLLTRVDS--TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
           ++   F+ LL  +    T P + C P +     +PFC  +LPIS R   L+S LTL +KI
Sbjct: 10  IIIFLFSLLLIHLPKFFTTPDYPCKPPH---SHYPFCNISLPISTRTTSLISLLTLSDKI 66

Query: 70  SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
           +QL N+A +I  LGIP+Y+WWSEALHG+A  G G+ FNG+++ AT+FPQVI++AA+F+  
Sbjct: 67  NQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAAFNRS 126

Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
           LW+ IG A+G+E RA++N GQA G++FWAPN+N+FRDPRWGRGQETPGEDP+V   YAV 
Sbjct: 127 LWFLIGYAVGVEGRAMFNVGQA-GLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVE 185

Query: 190 YVRGVQG-----------DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
           +VRG+QG           D+ + G     L  SACCKHFTAYDL+ W   +RY F+A VT
Sbjct: 186 FVRGIQGVDGIKKVLNDHDSDDDG-----LMVSACCKHFTAYDLEKWGEFSRYNFNAVVT 240

Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
            QDL DTYQPPF  CV+QG+AS +MC+YN VNG+P+CA ++LL    R +WGF GYI SD
Sbjct: 241 QQDLEDTYQPPFRGCVQQGKASCLMCSYNEVNGVPACASKDLLG-LVRNKWGFEGYIASD 299

Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALH 358
           CDAV+ +++ + YAKS EDAV DVLKAGMD+NCG+F+ +HT++A++Q  + E ++DRAL 
Sbjct: 300 CDAVATVFEYQKYAKSAEDAVADVLKAGMDINCGTFMLRHTESAIEQGLVKEEDLDRALF 359

Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
           NLFSV+MRLGLFNG+P    FGK+G   VC+P H+ LAL+AA+ GIVLLKN +  LPL K
Sbjct: 360 NLFSVQMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDK 419

Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS- 477
              VSLA+IGP A +++ L G Y+G  C   +    L+ YV+   Y  GC  V C S   
Sbjct: 420 KDRVSLAIIGPMATTSE-LGGGYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDG 478

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
              A+DIAK AD VV++ GLD T E E+LDRV L+LPG+Q +L++RVA A+K+PVILVL 
Sbjct: 479 FAVAIDIAKQADFVVIVAGLDTTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLT 538

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            GGP+D++FA+ ++ I SILW GYPGEAG  ALAE+IFG+ NP
Sbjct: 539 GGGPLDVSFAESNQLITSILWIGYPGEAGGKALAEIIFGEFNP 581


>gi|255556320|ref|XP_002519194.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223541509|gb|EEF43058.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 782

 Score =  651 bits (1679), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/587 (53%), Positives = 415/587 (70%), Gaps = 17/587 (2%)

Query: 4   HKLSLVFPLLCLC--------FTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQR 55
           H L    PLL +          +SL   +  ++ PF+CDP N  T    FC+  LPI  R
Sbjct: 7   HILPTFPPLLIIMFFFFFFFFLSSLPIHLVESRAPFACDPRNGVTRNLKFCRANLPIHVR 66

Query: 56  ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
            RDL+SRLTL EKI  LVN+A A+PRLGI  YEWWSEALHGV+ VG G+ F G   GATS
Sbjct: 67  VRDLISRLTLQEKIRLLVNNAAAVPRLGIQGYEWWSEALHGVSNVGPGVKFGGAFPGATS 126

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQET 175
           FPQVI TAASF+  LW +IG+ +  EARA+YN G A G+T+W+PN+N+FRDPRWGRGQET
Sbjct: 127 FPQVITTAASFNQSLWEQIGRVVSDEARAMYNGGLA-GLTYWSPNVNVFRDPRWGRGQET 185

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
           PGEDP++ GKYA SYVRG+Q  T        KL+ +ACCKH+TAYDLDNW G  RY F+A
Sbjct: 186 PGEDPVLAGKYAASYVRGLQSST------GLKLKVAACCKHYTAYDLDNWNGVDRYHFNA 239

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
           RV+ QDL DTY  PF++CV +G+ + +MC+YN+VNG P+CAD  LL  T R QWG +GYI
Sbjct: 240 RVSKQDLEDTYDVPFKACVVEGKVASVMCSYNQVNGKPTCADPILLKNTIRGQWGLNGYI 299

Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDR 355
            SDCD+V ++YD + Y  +PE+A    +KAG+D++CG FL  HT+ AVK+  L E +++ 
Sbjct: 300 VSDCDSVGVLYDNQHYTSTPEEAAAATIKAGLDLDCGPFLAIHTENAVKKGLLVEEDVNL 359

Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLP 415
           AL N  +V+MRLG+F+G P+  P+G +G   VC+PAHQ LAL+AA+ GIVLL+N    LP
Sbjct: 360 ALANTITVQMRLGMFDGEPSAHPYGNLGPRDVCTPAHQELALEAARQGIVLLENRGQALP 419

Query: 416 LPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-S 474
           L  S+  ++A+IGPN++   T++GNYAG +C+  +PLQ +  Y + T++  GC  VAC S
Sbjct: 420 LSSSRHHTIAVIGPNSDVTVTMIGNYAGIACKYTSPLQGISRYAK-TLHQNGCGDVACHS 478

Query: 475 SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVIL 534
           +     A   A+ AD  VL+MGLDQ+ E E  DRV L+LPG QQEL++RVA A++ P IL
Sbjct: 479 NQQFGAAEAAARQADATVLVMGLDQSIEAEFRDRVGLLLPGHQQELVSRVARASRGPTIL 538

Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           VL+ GGP+D++FAK D  +G+ILWAGYPG+AG  A+A+V+FG  NPG
Sbjct: 539 VLMSGGPIDVSFAKNDPRVGAILWAGYPGQAGGAAIADVLFGTTNPG 585


>gi|356503923|ref|XP_003520749.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 775

 Score =  650 bits (1678), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 306/580 (52%), Positives = 413/580 (71%), Gaps = 11/580 (1%)

Query: 3   FHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSR 62
           F  L  +  +  L F  L+      + PF+CDP N +TE  PFCK +L I +R +DLV R
Sbjct: 5   FSPLLNLIAVFLLLF--LVRHTCEARDPFACDPKNGATENMPFCKASLAIPERVKDLVGR 62

Query: 63  LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
           LTL EK+  LVN+A A+PRLG+  YEWWSEALHGV+ VG G+ FN    GATSFPQVI T
Sbjct: 63  LTLQEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNVGPGVKFNAQFPGATSFPQVITT 122

Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLV 182
           AASF++ LW  IGQ +  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++
Sbjct: 123 AASFNASLWEAIGQVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVL 181

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            G YA SYVRG+QG   N      +L+ +ACCKHFTAYDLDNW G  R+ F+A+V+ QD+
Sbjct: 182 AGTYAASYVRGLQGTDGN------RLKVAACCKHFTAYDLDNWNGMDRFHFNAQVSKQDI 235

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
            +T+  PF  CV +G+ + +MC+YN+VNG+P+CAD NLL KT R  W   GYI SDCD+V
Sbjct: 236 EETFDVPFRMCVSEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGLWQLDGYIVSDCDSV 295

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
            + YD + Y  +PE+A  D +KAG+D++CG FL  HT+ AV++  L E++++ AL N  +
Sbjct: 296 GVFYDNQHYTPTPEEAAADAIKAGLDLDCGPFLAVHTQNAVEKGLLSEADVNGALVNTLT 355

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
           V+MRLG+F+G P+   +GK+G   VC PAHQ LAL+AA+ GIVLLKN+  +LPL   +  
Sbjct: 356 VQMRLGMFDGEPSAHAYGKLGPKDVCKPAHQELALEAARQGIVLLKNTGPVLPLSPQRHH 415

Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKA 481
           ++A+IGPN+ +  T++GNYAG +C    PLQ +  Y + T++  GC+ VAC +  +   A
Sbjct: 416 TVAVIGPNSKATVTMIGNYAGVACGYTNPLQGIGRYAK-TIHQLGCENVACKNDKLFGSA 474

Query: 482 VDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGP 541
           ++ A+ AD  VL+MGLDQ+ E E +DR  L+LPGRQQ+L+++VA A+K P ILV++ GG 
Sbjct: 475 INAARQADATVLVMGLDQSIEAETVDRTGLLLPGRQQDLVSKVAAASKGPTILVIMSGGS 534

Query: 542 VDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           VDITFAK +  I  ILWAGYPG+AG  A+A+++FG  NPG
Sbjct: 535 VDITFAKNNPRIVGILWAGYPGQAGGAAIADILFGTTNPG 574


>gi|225428983|ref|XP_002264114.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 818

 Score =  650 bits (1677), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/584 (53%), Positives = 409/584 (70%), Gaps = 16/584 (2%)

Query: 1   MIFHKLSLVFPLLCLCFTSLLTRVDSTQPP--FSCDPSNPSTETFPFCKTTLPISQRARD 58
           +I   + L  PLL +           +QP   ++CD ++P++  FPFC T+LP   RA D
Sbjct: 30  LILLPIFLSLPLLAI-----------SQPTHRYACDRTDPNSSQFPFCNTSLPYQDRASD 78

Query: 59  LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
           LVSRLTL EK  QL+NSA  I RLG+P YEWWSEALHGV+  G G+ F+  I   T FP 
Sbjct: 79  LVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVTIFPA 138

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGE 178
           VIL+AASF+  LWY +GQ +  E RA+YN GQA G+T+W+PN+NIFRDPRWGRGQETPGE
Sbjct: 139 VILSAASFNESLWYTMGQVVSTEGRAMYNVGQA-GLTYWSPNVNIFRDPRWGRGQETPGE 197

Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
           DPLV  +YAV+YVRG+Q     G     +L+ S+CCKH+TAYD+D WKG  R+ FDA+VT
Sbjct: 198 DPLVVSRYAVNYVRGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVT 257

Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
           +QDL DTYQPPF+SCV++G  S +MC+YNRVNG+P+CA+  LL    R QWG  GYI SD
Sbjct: 258 LQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSD 317

Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALH 358
           CD++ + ++   Y ++PEDAV   LKAG+++NCGS+L  +TK AV   K+ ES +++AL 
Sbjct: 318 CDSIMVYHERMNYTETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQALI 377

Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
             + V MRLG F+G+PTM PFGK+G   VC+  HQ+LAL AA+ GIVLL N +G LPL  
Sbjct: 378 YNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHN-NGALPLSP 436

Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS- 477
           + + +LA+IGPNA++  T+L NYAG  CR  +PLQ LQ YV    Y  GC  V+CS  + 
Sbjct: 437 NTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETL 496

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           I+ A  IA  AD  V+++GLD   E E+LDRV+L LPG Q++L+   A+AA   VILV++
Sbjct: 497 IEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVM 556

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             GPVDI+F K    IG ILW GYPG+AG  A+++VIFGD+NPG
Sbjct: 557 SAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFGDYNPG 600


>gi|255545293|ref|XP_002513707.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223547158|gb|EEF48654.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 777

 Score =  650 bits (1676), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/555 (57%), Positives = 405/555 (72%), Gaps = 6/555 (1%)

Query: 27  TQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
           + P F+CD  SNPS  +F FC  +L IS R  DLV+RLTL EKI  LVNSA ++ RLGIP
Sbjct: 37  SSPVFACDVKSNPSLASFGFCNVSLGISDRVTDLVNRLTLQEKIGFLVNSAGSVSRLGIP 96

Query: 86  AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
            YEWWSEALHGV+ VG G  F+  + GATSFPQVILTAASF++ L+  IG+ +  EARA+
Sbjct: 97  KYEWWSEALHGVSYVGPGTHFSNIVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAM 156

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
           YN G A G+TFW+PNINIFRDPRWGRGQETPGEDPL++ KY   YVRG+Q  T NG    
Sbjct: 157 YNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVRGLQ-QTDNGDS-- 212

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
            +L+ +ACCKH+TAYDLDNWKGT RY F+A VT QDL DT+QPPF+SCV  G  + +MC+
Sbjct: 213 ERLKVAACCKHYTAYDLDNWKGTDRYHFNAVVTKQDLDDTFQPPFKSCVIDGNVASVMCS 272

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
           YN+VNG P+CAD +LL+   R +W  +GYI SDCD+V +IY+++ Y K+PE+A    + A
Sbjct: 273 YNQVNGKPTCADPDLLAGIIRGEWKLNGYIVSDCDSVDVIYNSQHYTKTPEEAAAITILA 332

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           G+D+NCGSFL KHT+AAV    L  S +D+A+ N F+  MRLG F+G+P+ Q +GK+G  
Sbjct: 333 GLDLNCGSFLGKHTEAAVNAGLLNVSAVDKAVSNNFATLMRLGFFDGDPSKQLYGKLGPK 392

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
            VC+  +Q LA +AA+ GIVLLKNS G LPL  +   +LA+IGPNAN  KT++GNY G  
Sbjct: 393 DVCTAVNQELAREAARQGIVLLKNSPGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTP 452

Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
           C+  TPLQ L   V  T Y  GC  VAC++A +D A  +A  AD  VL+MG DQ+ E E 
Sbjct: 453 CKYTTPLQGLTASVA-TTYLAGCSNVACAAAQVDDAKKLAASADATVLVMGADQSIEAES 511

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            DRVD++LPG+QQ LIT+VA  +K PVILV++ GG +D++FAK +  I SILW GYPGEA
Sbjct: 512 RDRVDVLLPGQQQLLITQVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEA 571

Query: 566 GAVALAEVIFGDHNP 580
           G  A+A+VIFG +NP
Sbjct: 572 GGAAIADVIFGYYNP 586


>gi|297843058|ref|XP_002889410.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335252|gb|EFH65669.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 763

 Score =  649 bits (1675), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 303/556 (54%), Positives = 404/556 (72%), Gaps = 9/556 (1%)

Query: 27  TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
           ++  F+CD  + +T T  FC+ ++PI++R +DL+ RLTL EK+S L N+A AIPRLGI  
Sbjct: 21  SRETFACDIKDAATATLRFCQLSVPITERVKDLIGRLTLVEKVSLLGNTAAAIPRLGIKG 80

Query: 87  YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           YEWWSEALHGV+ VG G  F G    ATSFPQVI T ASF++ LW  IG+ +  EARA+Y
Sbjct: 81  YEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVASFNASLWESIGRVVSNEARAMY 140

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
           N G   G+T+W+PN+NI RDPRWGRGQETPGEDP+V GKYA SYVRG+QG+       + 
Sbjct: 141 NGGVG-GLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGND------RS 193

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
           +L+ +ACCKHFTAYDLDNW G  R+ F+A+V+ QD+ DT+  PF  CVK+G  + IMC+Y
Sbjct: 194 RLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSY 253

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
           N VNG+P+CAD NLL KT R +WG +GYI SDCD+V ++YD + Y  +PE+A  D +KAG
Sbjct: 254 NEVNGVPTCADPNLLKKTIRNEWGLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAG 313

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
           +D++CG FL  HT  AVK+  L ES++D AL N  +V+MRLG+F+G+   QP+G +G   
Sbjct: 314 LDLDCGPFLGAHTIDAVKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAH 373

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           VC+P H+ LAL+AAQ GIVLLKN    LPL   +  ++A+IGPN+++   ++GNYAG +C
Sbjct: 374 VCTPVHKGLALEAAQQGIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVAMIGNYAGIAC 433

Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEKEE 505
              +P+Q +  Y   TV+  GC  V C    + D AV+ A+GAD  VL+MGLDQ+ E E 
Sbjct: 434 GYTSPVQGITGYAR-TVHQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEF 492

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            DR  L+LPG+QQELI+RVA+AAK PVILVL+ GGP+DI+FA+ DR I +I+WAGYPG+ 
Sbjct: 493 KDRNSLLLPGKQQELISRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQE 552

Query: 566 GAVALAEVIFGDHNPG 581
           G  A+A+++FG  NPG
Sbjct: 553 GGTAIADILFGSANPG 568


>gi|296083056|emb|CBI22460.3| unnamed protein product [Vitis vinifera]
          Length = 896

 Score =  649 bits (1673), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/584 (53%), Positives = 409/584 (70%), Gaps = 16/584 (2%)

Query: 1   MIFHKLSLVFPLLCLCFTSLLTRVDSTQPP--FSCDPSNPSTETFPFCKTTLPISQRARD 58
           +I   + L  PLL +           +QP   ++CD ++P++  FPFC T+LP   RA D
Sbjct: 30  LILLPIFLSLPLLAI-----------SQPTHRYACDRTDPNSSQFPFCNTSLPYQDRASD 78

Query: 59  LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
           LVSRLTL EK  QL+NSA  I RLG+P YEWWSEALHGV+  G G+ F+  I   T FP 
Sbjct: 79  LVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNSGIGVHFHDPIPAVTIFPA 138

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGE 178
           VIL+AASF+  LWY +GQ +  E RA+YN GQA G+T+W+PN+NIFRDPRWGRGQETPGE
Sbjct: 139 VILSAASFNESLWYTMGQVVSTEGRAMYNVGQA-GLTYWSPNVNIFRDPRWGRGQETPGE 197

Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
           DPLV  +YAV+YVRG+Q     G     +L+ S+CCKH+TAYD+D WKG  R+ FDA+VT
Sbjct: 198 DPLVVSRYAVNYVRGLQEVGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVT 257

Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
           +QDL DTYQPPF+SCV++G  S +MC+YNRVNG+P+CA+  LL    R QWG  GYI SD
Sbjct: 258 LQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSD 317

Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALH 358
           CD++ + ++   Y ++PEDAV   LKAG+++NCGS+L  +TK AV   K+ ES +++AL 
Sbjct: 318 CDSIMVYHERMNYTETPEDAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQALI 377

Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
             + V MRLG F+G+PTM PFGK+G   VC+  HQ+LAL AA+ GIVLL N +G LPL  
Sbjct: 378 YNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHN-NGALPLSP 436

Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS- 477
           + + +LA+IGPNA++  T+L NYAG  CR  +PLQ LQ YV    Y  GC  V+CS  + 
Sbjct: 437 NTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETL 496

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           I+ A  IA  AD  V+++GLD   E E+LDRV+L LPG Q++L+   A+AA   VILV++
Sbjct: 497 IEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVM 556

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             GPVDI+F K    IG ILW GYPG+AG  A+++VIFGD+NPG
Sbjct: 557 SAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFGDYNPG 600


>gi|147844622|emb|CAN82161.1| hypothetical protein VITISV_035506 [Vitis vinifera]
          Length = 925

 Score =  648 bits (1672), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 305/552 (55%), Positives = 397/552 (71%), Gaps = 3/552 (0%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
           ++CD ++P++  FPFC T+LP   RA DLVSRLTL EK  QL+NSA  I RLG+P YEWW
Sbjct: 27  YACDRTDPNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWW 86

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
           SEALHGV+  G G+ F+  I   T FP VIL+AASF+  LWY +GQ +  E RA+YN GQ
Sbjct: 87  SEALHGVSNSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQ 146

Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
           A G+T+W+PN+NIFRDPRWGRGQETPGEDPLV  +YAV+YVRG+Q     G     +L+ 
Sbjct: 147 A-GLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGKEGNFAADRLKV 205

Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
           S+CCKH+TAYD+D WKG  R+ FDA+VT+QDL DTYQPPF+ CV++G  S +MC+YNRVN
Sbjct: 206 SSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKXCVEEGHVSSVMCSYNRVN 265

Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
           G+P+CA+  LL    R QWG  GYI SDCD++ + ++   Y ++PEDAV   LKAG+++N
Sbjct: 266 GVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNYTETPEDAVALALKAGLNLN 325

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
           CGS+L  +TK AV   K+ ES +B+AL   + V MRLG F+G+PTM PFGK+G   VC+ 
Sbjct: 326 CGSYLGDYTKNAVNLGKVKESIVBQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTV 385

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            HQ+LAL AA+ GIVLL N+ G LPL  + + +LA+IGPNA++  T+L NYAG  CR  +
Sbjct: 386 DHQLLALDAAKQGIVLLHNN-GALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTS 444

Query: 451 PLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
           PLQ LQ YV    Y  GC  V+CS  + I+ A  IA  AD  V+++GLD   E E+LDRV
Sbjct: 445 PLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRV 504

Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
           +L LPG Q++L+   A+AA   VILV++  GPVDI+F K    IG ILW GYPG+AG  A
Sbjct: 505 NLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDA 564

Query: 570 LAEVIFGDHNPG 581
           +++VIFGD+NPG
Sbjct: 565 ISQVIFGDYNPG 576


>gi|356534827|ref|XP_003535953.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 771

 Score =  647 bits (1670), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 297/553 (53%), Positives = 407/553 (73%), Gaps = 9/553 (1%)

Query: 30  PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           PF+CDP N +T+  PFCK  L    R +DL+ RLTL EK++ LVN+A A+PRLGI  YEW
Sbjct: 26  PFACDPKNTATKNLPFCKAWLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEW 85

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSEALHGV+ VG G  F G    ATSFPQVI TAASF++ LW  IG+    EARA+YN G
Sbjct: 86  WSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGG 145

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ GKYA SYVRG+Q    N      +L+
Sbjct: 146 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQETDGN------RLK 198

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
            +A CKHFTAYDLDNW G  R+ F+A+V+ QD+ DT+  PF  CVK+G+ + +MC+YN+V
Sbjct: 199 VAASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQV 258

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           NG+P+CAD  LL +T R QWG +GYI SDCD+V + Y+++ Y  +PE+A  D +KAG+D+
Sbjct: 259 NGVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTSTPEEAAADAIKAGLDL 318

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           +CG FL +HT+ AVK+  + E++++ AL N  +V+MRLG+++G P+  P+GK+G   VC+
Sbjct: 319 DCGPFLGQHTQNAVKKGLISETDVNGALLNTLTVQMRLGMYDGEPSSHPYGKLGPRDVCT 378

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
           P+HQ LAL+AA+ GIVLLKN    LPL   +  ++A+IGPN+N   T++GNYAG +C   
Sbjct: 379 PSHQELALEAARQGIVLLKNKGPSLPLSTRRHPTVAVIGPNSNVTVTMIGNYAGIACGYT 438

Query: 450 TPLQALQNYVENTVYYPGCDTVACSS-ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           +PL+ +  Y + T++  GC  VAC++     +A+++A+ AD  VL+MGLDQ+ E E +DR
Sbjct: 439 SPLEGIGRYTK-TIHELGCANVACTNDKQFGRAINVAQQADATVLVMGLDQSIEAETVDR 497

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
             L+LPGRQQ+L+++VA A+K P ILV++ GGPVDITFAK +  I +ILWAGYPG+AG  
Sbjct: 498 AGLLLPGRQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNNPRIQAILWAGYPGQAGGA 557

Query: 569 ALAEVIFGDHNPG 581
           A+A+++FG  NPG
Sbjct: 558 AIADILFGTSNPG 570


>gi|62701894|gb|AAX92967.1| beta-xylosidase, putative [Oryza sativa Japonica Group]
 gi|77550041|gb|ABA92838.1| Glycosyl hydrolase family 3 C terminal domain containing protein
           [Oryza sativa Japonica Group]
          Length = 793

 Score =  647 bits (1669), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/566 (56%), Positives = 403/566 (71%), Gaps = 39/566 (6%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           FC   L   QRA DLV+ LTL EK+SQL + A  + RLG+PAYEWWSE LHG++  G+GI
Sbjct: 32  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 91

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
            FNGT+R  TSFPQVILTAA+FD+ LW R+G+A+G EARALYN GQA G+T W+PN+NIF
Sbjct: 92  RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 151

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRGQETPGEDP+   +YAV++V G+QG    GG      +ASACCKH TAYDLD 
Sbjct: 152 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGI---GG------EASACCKHATAYDLDY 202

Query: 225 WKGTTRYKFDAR----------------------------VTMQDLADTYQPPFESCVKQ 256
           W    RY +D++                            VT+QDL DTY PPF+SCV +
Sbjct: 203 WNNVVRYNYDSKDGASTGKSGETSSQVEKKHGPYEKGYFAVTLQDLEDTYNPPFKSCVAE 262

Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
           G+A+ IMC YN +NG+P+CA  +LL+K  R++WG +GY+ SDCDAV+ I DA  Y  SPE
Sbjct: 263 GKATCIMCGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTLSPE 322

Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
           D V   +K GMDVNCG++ Q H  AAV++  L E +IDRAL NLF+VRMRLG F+G+P  
Sbjct: 323 DTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRS 382

Query: 377 QP-FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
              +G +GA  VCSPAH+ LAL+AAQDGIVLLKN  G LPL  S   SLA+IGPNA++  
Sbjct: 383 NAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLG 442

Query: 436 TLLGNYAGPSCRSITPLQALQNYV-ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLM 494
            L GNY GP C + TPLQ ++ Y+ +   +  GCD+ AC+ A+ ++A  +A  +DHVVL 
Sbjct: 443 ALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSPACAVAATNEAAALASSSDHVVLF 502

Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
           MGL Q QE++ LDR  L+LPG QQ LIT VA AA++PVILVLL GGPVD+TFAK +  IG
Sbjct: 503 MGLSQKQEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIG 562

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNP 580
           +ILWAGYPG+AG +A+A+V+FGDHNP
Sbjct: 563 AILWAGYPGQAGGLAIAKVLFGDHNP 588


>gi|297834874|ref|XP_002885319.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
 gi|297331159|gb|EFH61578.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
          Length = 865

 Score =  647 bits (1668), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/573 (54%), Positives = 408/573 (71%), Gaps = 17/573 (2%)

Query: 10  FPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
           F  + L   +L++ +  +Q  F+CD ++P+T  + FC  +L    RA+DLVSRL+L EK+
Sbjct: 6   FVGVSLLIAALVSSLCESQKNFACDRNDPATAKYGFCNVSLSYEARAKDLVSRLSLKEKV 65

Query: 70  SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
            QLVN A  + RLG+P YEWWSEALHGV+ VG G+ FNGT+ GATSFP  ILTAASF++ 
Sbjct: 66  QQLVNKATGVSRLGVPPYEWWSEALHGVSDVGPGVRFNGTVPGATSFPATILTAASFNTS 125

Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
           LW ++G+ +  EARA++N G A G+T+W+PN+NIFRDPRWGRGQETPGEDPLV  KYAV+
Sbjct: 126 LWLKMGEVVSTEARAMHNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSKYAVN 184

Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
           YV+G+Q D  + GK + +L+ S+CCKH+TAYDLDNWKG  R+ FDA+VT QDL DTYQPP
Sbjct: 185 YVKGLQ-DVQDAGKSR-RLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTYQPP 242

Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
           F+SCV++G  S +MC+YNRVNGIP+CAD NLL    R QW   GYI SDCD++ + +D  
Sbjct: 243 FKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYFDDI 302

Query: 310 GYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
            Y K+            +++NCG FL K+T+ AVK KKL  SE+D AL   + V MRLG 
Sbjct: 303 HYTKTR-----------LNMNCGDFLGKYTENAVKLKKLNGSEVDEALIYNYIVLMRLGF 351

Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP 429
           F+G+P   PFG++G   VCS  HQ+LAL+AA+ GIVLL+N  G LPL K+    +A+IGP
Sbjct: 352 FDGDPKSLPFGQLGPSDVCSKDHQMLALEAAKQGIVLLEN-RGDLPLSKTAVKKIAVIGP 410

Query: 430 NANSAKTLLGNYAGPSCRSITPLQALQNYV-ENTVYYPGCDTVACSSAS-IDKAVDIAKG 487
           NAN+ K ++ NYAG  C+  +PLQ LQ YV E  VY PGC  V C   + I  AV     
Sbjct: 411 NANATKVMISNYAGVPCKYTSPLQGLQKYVPEKVVYEPGCKDVNCGEQTLISAAVKAVSE 470

Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
           AD  VL++GLDQT E E LDRV+L LPG Q++L+  VA AAKK V+LV++  GP+DI+FA
Sbjct: 471 ADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDISFA 530

Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           K    I ++LW GYPGEAG  A+A+VIFGD+NP
Sbjct: 531 KNLSTISAVLWVGYPGEAGGDAIAQVIFGDYNP 563


>gi|224111912|ref|XP_002316021.1| predicted protein [Populus trichocarpa]
 gi|222865061|gb|EEF02192.1| predicted protein [Populus trichocarpa]
          Length = 768

 Score =  647 bits (1668), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 306/561 (54%), Positives = 405/561 (72%), Gaps = 10/561 (1%)

Query: 22  TRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR 81
             +  ++ PF+CDP    T +  FC+  LPI  R RDL+ RLTL EKI  LVN+A A+PR
Sbjct: 19  NHIVESREPFACDPKLGLTRSLKFCRVNLPIHVRVRDLIGRLTLQEKIRLLVNNAAAVPR 78

Query: 82  LGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLE 141
           LGI  YEWWSEALHGV+ VG G  F G   GAT+FPQVI TAASF+  LW  IG+ +  E
Sbjct: 79  LGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATAFPQVITTAASFNESLWEEIGRVVSDE 138

Query: 142 ARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG 201
           ARA+YN G A G+T+W+PN+N+FRDPRWGRGQETPGEDP+V GKYA SYVRG+QG+  NG
Sbjct: 139 ARAMYNGGMA-GLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGN--NG 195

Query: 202 GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
            +LK     +ACCKH+TAYDLDNW G  RY F+ARV+ QDL DTY  PF+SCV  G+ + 
Sbjct: 196 LRLK----VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKSCVVAGKVAS 251

Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVD 321
           +MC+YN+VNG P+CAD  LL  T R +WG +GYI SDCD+V +++D + Y  +PE+A   
Sbjct: 252 VMCSYNQVNGKPTCADPYLLKNTIRGEWGLNGYIVSDCDSVGVLFDTQHYTATPEEAAAS 311

Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
            ++AG+D++CG FL  HT+ AVK   L E +++ AL N  +V+MRLG+F+G P+ QPFG 
Sbjct: 312 TIRAGLDLDCGPFLAIHTENAVKGGLLKEEDVNMALANTITVQMRLGMFDGEPSAQPFGN 371

Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
           +G   VC+PAHQ LALQAA+ GIVLL+N    LPL ++   ++A+IGPN++   T++GNY
Sbjct: 372 LGPRDVCTPAHQQLALQAARQGIVLLQNRGRTLPLSRTLQ-TVAVIGPNSDVTVTMIGNY 430

Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI-AKGADHVVLMMGLDQT 500
           AG +C   TPLQ ++ Y + TV++PGC+ V C+      A ++ A+ AD  +L+MGLDQ+
Sbjct: 431 AGVACGYTTPLQGIRRYAK-TVHHPGCNDVFCNGNQQFNAAEVAARHADATILVMGLDQS 489

Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
            E E  DR  L+LPG QQEL++ VA A++ P ILVL+ GGP+D++FAK D  IG+ILW G
Sbjct: 490 IEAEFRDRKGLLLPGYQQELVSIVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWVG 549

Query: 561 YPGEAGAVALAEVIFGDHNPG 581
           YPG+AG  A+A+V+FG  NPG
Sbjct: 550 YPGQAGGAAIADVLFGTANPG 570


>gi|357511337|ref|XP_003625957.1| Beta-xylosidase [Medicago truncatula]
 gi|355500972|gb|AES82175.1| Beta-xylosidase [Medicago truncatula]
          Length = 771

 Score =  645 bits (1665), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 299/572 (52%), Positives = 410/572 (71%), Gaps = 10/572 (1%)

Query: 11  PLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
           PL+ L F  LL      +  F+CD  + +T+  PFC   L I +R +DL+ RLT+ EK++
Sbjct: 9   PLITL-FILLLQSSCDARDSFACDAKDAATKNLPFCNVKLAIPERVKDLIGRLTMQEKVN 67

Query: 71  QLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYL 130
            LVN+APA+PR+G+ +YEWWSEALHGV+ VG G  F G    ATSFPQVI TAASF++ L
Sbjct: 68  LLVNNAPAVPRVGMKSYEWWSEALHGVSNVGPGTRFGGVFPAATSFPQVITTAASFNASL 127

Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
           W  IG+ +  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ G+YA SY
Sbjct: 128 WEAIGRVVSDEARAMYNGGAA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGRYAASY 186

Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
           V+G+QG   N      KL+ +ACCKHFTAYD+DNW G  R+ F+A V+ QD+ DT+  PF
Sbjct: 187 VKGLQGTDGN------KLKVAACCKHFTAYDVDNWNGVDRFHFNALVSKQDIEDTFDVPF 240

Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG 310
             CVK+G+ + +MC+YN+VNG+P+CAD NLL KT R  WG  GYI SDCD+V ++Y+++ 
Sbjct: 241 RMCVKEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGVWGLDGYIVSDCDSVGVLYNSQH 300

Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
           Y  +PE+A  D +KAG+D++CG FL  HT+ AVK+  L E++++ AL N   V+MRLG+F
Sbjct: 301 YTSTPEEAAADAIKAGLDLDCGPFLGVHTQDAVKKGLLTEADVNNALVNTLKVQMRLGMF 360

Query: 371 NGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPN 430
           +G P+ Q +G++G   VC PAHQ LAL+AA+ GIVLLKN+   LPL   +  ++A+IGPN
Sbjct: 361 DGEPSAQAYGRLGPKDVCKPAHQELALEAARQGIVLLKNTGPTLPLSPQRHRTVAVIGPN 420

Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGAD 489
           ++   T++GNYAG +C   +PLQ +  Y + T++  GC  VAC        A+D A+ AD
Sbjct: 421 SDVTVTMIGNYAGIACGYTSPLQGIGRYAK-TIHQQGCSNVACRDDKQFGPALDAARHAD 479

Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
             +L++GLDQ+ E E +DR  L+LPG QQ+L+++VA A+K P ILVL+ GGPVDITFAK 
Sbjct: 480 ATILVIGLDQSIEAETVDRTSLLLPGHQQDLVSKVAAASKGPTILVLMSGGPVDITFAKN 539

Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           D  +  ILWAGYPG+AG  A+A+++FG  +PG
Sbjct: 540 DPKVAGILWAGYPGQAGGAAIADILFGTASPG 571


>gi|356501877|ref|XP_003519750.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 772

 Score =  645 bits (1664), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 297/553 (53%), Positives = 402/553 (72%), Gaps = 9/553 (1%)

Query: 30  PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           PF+CDP N +T+  PFCK +L    R +DL+ RLTL EK++ LVN+A A+PRLGI  YEW
Sbjct: 27  PFACDPKNTATKNLPFCKASLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEW 86

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSEALHGV+ VG G  F G    ATSFPQVI TAASF++ LW  IG+    EARA+YN G
Sbjct: 87  WSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGG 146

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ GKYA SYVRG+QG   N      +L+
Sbjct: 147 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTDGN------RLK 199

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
            +A CKHFTAYDLDNW G  R+ F+A+V+ QD+ DT+  PF  CVK+G+ + +MC+YN+V
Sbjct: 200 VAASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEGKVASVMCSYNQV 259

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           NG+P+CAD  LL +T R QWG +GYI SDCD+V + Y+++ Y  +PE+A  D +KAG+D+
Sbjct: 260 NGVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTSTPEEAAADAIKAGLDL 319

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           +CG FL +HT+ AVK+  + E++++ AL N  +V+MRLG+++G P+  P+  +G   VC+
Sbjct: 320 DCGPFLGQHTQNAVKKGLISEADVNGALLNTLTVQMRLGMYDGEPSSHPYNNLGPRDVCT 379

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
            +HQ LAL+AA+ GIVLLKN    LPL   +  ++A+IGPN+N   T++GNYAG +C   
Sbjct: 380 QSHQELALEAARQGIVLLKNKGPSLPLSTRRGRTVAVIGPNSNVTFTMIGNYAGIACGYT 439

Query: 450 TPLQALQNYVENTVYYPGCDTVACS-SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           +PLQ +  Y + T+Y  GC  VAC+      +A++ A+ AD  VL+MGLDQ+ E E +DR
Sbjct: 440 SPLQGIGTYTK-TIYEHGCANVACTDDKQFGRAINAAQQADATVLVMGLDQSIEAETVDR 498

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
             L+LPG QQ+L+++VA A+K P ILV++ GGPVDITFAK D  I  ILWAGYPG+AG  
Sbjct: 499 ASLLLPGHQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNDPRIQGILWAGYPGQAGGA 558

Query: 569 ALAEVIFGDHNPG 581
           A+A+++FG  NPG
Sbjct: 559 AIADILFGTSNPG 571


>gi|371917282|dbj|BAL44717.1| SlArf/Xyl2 [Solanum lycopersicum]
          Length = 774

 Score =  645 bits (1664), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/568 (54%), Positives = 404/568 (71%), Gaps = 8/568 (1%)

Query: 15  LCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVN 74
             F  L   + + +PPF+CD  N +   FPFC+T LPI  R RDL+ RLTL EK+  L N
Sbjct: 15  FIFIFLFVSIQAARPPFACDQKNRAFRNFPFCQTNLPIGDRVRDLIGRLTLQEKVKLLGN 74

Query: 75  SAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRI 134
           +A A+PRLGI  YEWWSEALHGV+ VG G  F G   GATSFPQVI TAASF++ LW  I
Sbjct: 75  NAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGEFPGATSFPQVITTAASFNASLWEEI 134

Query: 135 GQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
           G+ +  EARA+YN G+  G+T+W+PN+NIFRDPRWGRGQETPGEDP+V   YA  YVRG+
Sbjct: 135 GRVVSDEARAMYN-GEMGGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAALYAERYVRGL 193

Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
           QG+  +G  LK     +ACCKH+TAYDLDNW G  R+ F+A+VT QD+ DT+  PF SCV
Sbjct: 194 QGNE-DGDSLK----VAACCKHYTAYDLDNWGGVDRFHFNAKVTKQDIEDTFDVPFRSCV 248

Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKS 314
           KQG+ + IMC+YN+VNGIP+CAD  LL KT R  WG +GYI SDCD+V + YD + Y  +
Sbjct: 249 KQGKVASIMCSYNQVNGIPTCADPQLLRKTIRGGWGLNGYIVSDCDSVGVFYDTQHYTST 308

Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
           PE+A    +KAG+D++CG FL +HT+ AV    L E+ ID  L N  +V+MRLG+F+G P
Sbjct: 309 PEEAAAAAIKAGLDLDCGPFLSQHTENAVHIGILKEAAIDTNLANTVAVQMRLGMFDGEP 368

Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
           + Q +G +G   VCSPAHQ LA++AA+ GIVLLKN    LPL   +  ++A+IGPN++  
Sbjct: 369 SAQQYGHLGPRDVCSPAHQELAVEAARQGIVLLKNHGPALPLSPRRHRTVAVIGPNSDVT 428

Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVL 493
            T++GNYAG +C   +PLQ +  Y + T++  GC  VACS   +   AV+ A+ AD  VL
Sbjct: 429 VTMIGNYAGVACGYTSPLQGISKYAK-TIHEKGCGDVACSDDKLFAGAVNAARQADATVL 487

Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           +MGLDQ+ E E  DR  L+LPG QQELI+ V++A++ PV+LVL+ GGPVD+TFA  D  I
Sbjct: 488 VMGLDQSIEAEFRDRTGLLLPGFQQELISEVSKASRGPVVLVLMSGGPVDVTFANNDPRI 547

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
           G+I+WAGYPG+ G  A+A+V+FG HNPG
Sbjct: 548 GAIVWAGYPGQGGGAAIADVLFGAHNPG 575


>gi|356556038|ref|XP_003546334.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
          Length = 775

 Score =  645 bits (1663), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/568 (54%), Positives = 410/568 (72%), Gaps = 9/568 (1%)

Query: 16  CFT-SLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVN 74
           C T S   RV   + PF+CDP N  T  F FC T +PI  R +DL++RLTL EKI  +VN
Sbjct: 21  CVTLSFFPRVTEGRVPFACDPRNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVN 80

Query: 75  SAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRI 134
           +A A+PRLGI  YEWWSEALHGV+ VG G  F G   GAT FPQVI TAASF+  LW  I
Sbjct: 81  NAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEI 140

Query: 135 GQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
           G+ +  EARA+YN GQA G+T+W+PN+NIFRDPRWGRGQETPGEDP +  KYA SYV+G+
Sbjct: 141 GRVVSDEARAMYNGGQA-GLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGL 199

Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
           QGD+         L+ +ACCKH+TAYDLDNW G  R+ F+A+V+ QDL DTY  PF++CV
Sbjct: 200 QGDSAG-----NHLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACV 254

Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKS 314
            +G+ + +MC+YN+VNG P+CAD +LL  T R QW  +GYI SDCD+V + +D + Y K+
Sbjct: 255 LEGQVASVMCSYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVFFDNQHYTKT 314

Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
           PE+A  + +KAG+D++CG FL  HT +A+++  + E++++ AL NL SV+MRLG+F+G P
Sbjct: 315 PEEAAAEAIKAGLDLDCGPFLAIHTDSAIRKGLISENDLNLALANLISVQMRLGMFDGEP 374

Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
           + QP+G +G   VC+ AHQ LAL+AA++ IVLL+N    LPL  S+  ++ ++GPNA++ 
Sbjct: 375 STQPYGNLGPRDVCTSAHQQLALEAARESIVLLQNKGNSLPLSPSRLRTIGVVGPNADAT 434

Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD-IAKGADHVVL 493
            T++GNYAG +C   TPLQ +  YV+ T +  GC  VAC    +  A + IA+ AD +VL
Sbjct: 435 VTMIGNYAGVACGYTTPLQGIARYVK-TAHQVGCRGVACRGNELFGAAETIARQADAIVL 493

Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           +MGLDQT E E  DRV L+LPG QQEL+TRVA AAK PVIL+++ GGPVDI+FAK D  I
Sbjct: 494 VMGLDQTVEAETRDRVGLLLPGLQQELVTRVARAAKGPVILLIMSGGPVDISFAKNDPKI 553

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +ILW GYPG+AG  A+A+VIFG  NPG
Sbjct: 554 SAILWVGYPGQAGGTAIADVIFGTTNPG 581


>gi|357130854|ref|XP_003567059.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
           distachyon]
          Length = 779

 Score =  644 bits (1662), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 309/561 (55%), Positives = 402/561 (71%), Gaps = 14/561 (2%)

Query: 27  TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
           T+PPF+C P  PST   PFC+  LP   RARDLV+RLT  EK+  LVN+A  +PRLG+  
Sbjct: 23  TRPPFACAPGGPSTR-LPFCRQALPPRARARDLVARLTRAEKVRLLVNNAAGVPRLGVEG 81

Query: 87  YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           YEWWSEALHGV+  G G+ F G   GAT+FPQVI TAASF++ LW  IG+A+  E RA+Y
Sbjct: 82  YEWWSEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASFNASLWELIGRAVSDEGRAIY 141

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
           N  QA G+TFW+PN+NIFRDPRWGRGQETPGEDP V+G+YA +YVRG+Q       +  G
Sbjct: 142 NGRQA-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVSGRYAAAYVRGLQQ------QHAG 194

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
           +L+ +ACCKHFTAYDLD W G  R+ F+A VT QDL DT+  PF +CV +GRA+ +MC+Y
Sbjct: 195 RLKTAACCKHFTAYDLDRWSGADRFHFNAIVTPQDLEDTFNAPFRACVVEGRAAAVMCSY 254

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
           N+VNG+P+CAD+  L  T R +W   GYI SDCD+V + Y  + Y ++ EDAV   L+AG
Sbjct: 255 NQVNGVPTCADQGFLRGTIRGKWKLDGYIVSDCDSVDVFYREQHYTRTREDAVAATLRAG 314

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
           +D++CG FL ++T+AAV Q K+ E++ID A+ N  +V+MRLG+F+G+   QPFG +G   
Sbjct: 315 LDLDCGPFLAQYTEAAVAQGKVKEADIDAAVVNTVTVQMRLGMFDGDVAAQPFGHLGPQH 374

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHG---LLPL-PKSKSVSLALIGPNANSAKTLLGNYA 442
           VC+PAH+ LAL+AA   IVLLKN  G    LPL    +  ++A++GP++ +   ++GNYA
Sbjct: 375 VCTPAHRELALEAACQSIVLLKNGGGNNMRLPLSSHHRRGTVAVVGPHSEATVAMIGNYA 434

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVAC--SSASIDKAVDIAKGADHVVLMMGLDQT 500
           G  C   TPLQ +  Y   TV+  GC  VAC  S   ID AVD A+ AD  V+++GLDQ+
Sbjct: 435 GKPCAYTTPLQGVGRYARATVHQAGCTDVACQGSGQPIDAAVDAARHADATVVVVGLDQS 494

Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
            E E LDR  L+LPGRQ EL++ VA A+K PVILVL+ GGPVDI FA+ DRN+ +ILWAG
Sbjct: 495 VEAEGLDRTTLLLPGRQAELVSAVARASKGPVILVLMSGGPVDIAFAQNDRNVAAILWAG 554

Query: 561 YPGEAGAVALAEVIFGDHNPG 581
           YPG+AG  A+A+VIFG HNPG
Sbjct: 555 YPGQAGGQAIADVIFGHHNPG 575


>gi|356572781|ref|XP_003554544.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 771

 Score =  644 bits (1660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 299/553 (54%), Positives = 400/553 (72%), Gaps = 9/553 (1%)

Query: 30  PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           PF+CDP N  T+   FCK +L I++R +DL+ RLTL+EK+  LVN+A A+PRLG+  YEW
Sbjct: 26  PFACDPKNGGTKKMAFCKVSLAIAERVKDLIGRLTLEEKVRLLVNNAAAVPRLGMKGYEW 85

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSEALHGV+ +G  + FN     ATSFPQVI TAASF++ LW  IGQ +  EARA+YN G
Sbjct: 86  WSEALHGVSNLGPAVKFNAQFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGG 145

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ G YA +YVRG+QG   N      +L+
Sbjct: 146 TA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGTYAATYVRGLQGTHAN------RLK 198

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
            +ACCKHFTAYDLDNW G  R+ F+A+V+ QD+ DT+  PF+ CV +G+ + +MC+YN+V
Sbjct: 199 VAACCKHFTAYDLDNWNGMDRFHFNAQVSKQDIEDTFDVPFKMCVSEGKVASVMCSYNQV 258

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           NG+P+CAD NLL KT R  W   GYI SDCD+V + YD + Y  +PE+A  D +KAG+D+
Sbjct: 259 NGVPTCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHYTPTPEEAAADAIKAGLDL 318

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           +CG FL  HT+ AVK+  L E++++ AL N  +V+MRLG+F+G PT  P+G +G   VC 
Sbjct: 319 DCGPFLAVHTQNAVKKGLLSEADVNGALVNTLTVQMRLGMFDGEPTAHPYGHLGPKDVCK 378

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
           PAHQ LAL+AA+ GIVLLKN+  +LPL      ++A+IGPN+ +  T++GNYAG +C   
Sbjct: 379 PAHQELALEAARQGIVLLKNTGPVLPLSSQLHRTVAVIGPNSKATITMIGNYAGVACGYT 438

Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
            PLQ +  Y   TV+  GC  VAC +  +   A++ A+ AD  VL+MGLDQ+ E E +DR
Sbjct: 439 NPLQGIGRYAR-TVHQLGCQNVACKNDKLFGPAINAARQADATVLVMGLDQSIEAETVDR 497

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
             L+LPGRQ +L+++VA A+K P ILVL+ GGPVDITFAK +  I  ILWAGYPG+AG  
Sbjct: 498 TGLLLPGRQPDLVSKVAAASKGPTILVLMSGGPVDITFAKNNPRIVGILWAGYPGQAGGA 557

Query: 569 ALAEVIFGDHNPG 581
           A+A+++FG  NPG
Sbjct: 558 AIADILFGTANPG 570


>gi|356529243|ref|XP_003533205.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
          Length = 774

 Score =  643 bits (1658), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 309/569 (54%), Positives = 408/569 (71%), Gaps = 9/569 (1%)

Query: 14  CLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLV 73
           C+  T    RV   + PF+CDP N  T  F FC T +PI  R +DL++RLTL EKI  +V
Sbjct: 20  CVTLT-FFPRVTEGRVPFACDPRNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVV 78

Query: 74  NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYR 133
           N+A A+PRLGI  YEWWSEALHGV+ VG G  F G   GAT FPQVI TAASF+  LW  
Sbjct: 79  NNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQE 138

Query: 134 IGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRG 193
           IG+ +  EARA+YN GQA G+T+W+PN+NIFRDPRWGRGQETPGEDP +  KYA SYV+G
Sbjct: 139 IGRVVSDEARAMYNGGQA-GLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKG 197

Query: 194 VQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESC 253
           +QGD         +L+ +ACCKH+TAYDLDNW G  R+ F+A+V+ QDL DTY  PF++C
Sbjct: 198 LQGD-----GAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKAC 252

Query: 254 VKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK 313
           V +G+ + +MC+YN+VNG P+CAD +LL  T R QWG +GYI SDCD+V + +D + Y +
Sbjct: 253 VLEGQVASVMCSYNQVNGKPTCADPDLLRNTIRGQWGLNGYIVSDCDSVGVFFDNQHYTR 312

Query: 314 SPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
           +PE+A  + +KAG+D++CG FL  HT +A+++  + E++++ AL NL +V+MRLG+F+G 
Sbjct: 313 TPEEAAAEAIKAGLDLDCGPFLAIHTDSAIRKGLISENDLNLALANLITVQMRLGMFDGE 372

Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
           P+ QPFG +G   VC+PAHQ LAL+AA++ IVLL+N    LPL  S+   + +IGPN ++
Sbjct: 373 PSTQPFGNLGPRDVCTPAHQQLALEAARESIVLLQNKGNSLPLSPSRLRIVGVIGPNTDA 432

Query: 434 AKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD-IAKGADHVV 492
             T++GNYAG +C   TPLQ +  YV+ T +  GC  VAC    +  A + IA+  D  V
Sbjct: 433 TVTMIGNYAGVACGYTTPLQGIARYVK-TAHQVGCRGVACRGNELFGAAEIIARQVDATV 491

Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
           L+MGLDQT E E  DRV L+LPG QQEL+TRVA AAK PVILV++ GGPVD++FAK +  
Sbjct: 492 LVMGLDQTIEAETRDRVGLLLPGLQQELVTRVARAAKGPVILVIMSGGPVDVSFAKNNPK 551

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           I +ILW GYPG+AG  A+A+VIFG  NPG
Sbjct: 552 ISAILWVGYPGQAGGTAIADVIFGATNPG 580


>gi|32481073|gb|AAP83934.1| auxin-induced beta-glucosidase [Chenopodium rubrum]
          Length = 767

 Score =  640 bits (1652), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 306/571 (53%), Positives = 404/571 (70%), Gaps = 8/571 (1%)

Query: 12  LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
             C     +L   ++   P +CDP +  T    FC+  LPI  R +DL+ RL L EK+  
Sbjct: 7   FFCFLVLFILLSAEARAAPLACDPKSGLTRALRFCRVNLPIRARVQDLIGRLNLQEKVKL 66

Query: 72  LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
           LVN+A  +PRLGI  YEWWSEALHGV+ VG G  F G    ATSFPQVI TAASF++ LW
Sbjct: 67  LVNNAAPVPRLGISGYEWWSEALHGVSNVGPGTKFRGAFPAATSFPQVITTAASFNASLW 126

Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
             IGQ +  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP +  +YA SYV
Sbjct: 127 EAIGQVVSDEARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPTLASQYAASYV 185

Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
           RG+QG  +N    K +L+ +ACCKH+TAYDLDNW    R+ F+A+V+ QDL DTY  PF+
Sbjct: 186 RGLQG-IYN----KNRLKVAACCKHYTAYDLDNWNAVDRFHFNAKVSKQDLEDTYNVPFK 240

Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
            CV++GR + +MC+YN+VNG P+CAD +LL  T R QW  +GYI SDCD+V ++YD + Y
Sbjct: 241 GCVQEGRVASVMCSYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVLYDDQHY 300

Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
            ++PE+A  D +KAG+D++CG FL  HT+AAVK+  L E+++++AL N F+V+MRLG+F+
Sbjct: 301 TRTPEEAAADTIKAGLDLDCGPFLAVHTEAAVKRGLLTEADVNQALTNTFTVQMRLGMFD 360

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
           G    QPFG +G   VCSPAHQ LALQAA+ GIVLL+N    LPL  ++  ++A+IGPNA
Sbjct: 361 GEAAAQPFGHLGPKDVCSPAHQDLALQAARQGIVLLQNRGRSLPLSTARHRNIAVIGPNA 420

Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSS-ASIDKAVDIAKGADH 490
           ++  T++GNYAG +C   +PLQ +  Y + TV+  GC  VAC+S      A   A  AD 
Sbjct: 421 DATVTMIGNYAGVACGYTSPLQGIARYAK-TVHQAGCIGVACTSNQQFGAATAAAAHADA 479

Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
            VL+MGLDQ+ E E  DR  ++LPG QQEL+++VA A++ P ILVL+CGGPVD+TFAK D
Sbjct: 480 TVLVMGLDQSIEAEFRDRASVLLPGHQQELVSKVALASRGPTILVLMCGGPVDVTFAKND 539

Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             I +ILW GYPG+AG  A+A+V+FG  NPG
Sbjct: 540 PKISAILWVGYPGQAGGTAIADVLFGTTNPG 570


>gi|297745522|emb|CBI40687.3| unnamed protein product [Vitis vinifera]
          Length = 751

 Score =  640 bits (1651), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/577 (55%), Positives = 412/577 (71%), Gaps = 10/577 (1%)

Query: 9   VFPLLCLCFTSLLTR----VDSTQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRL 63
           VF     CF+  L+     +  + P F+CD  +NP+   F FC T+L  + R  DLV RL
Sbjct: 13  VFLCFLSCFSHFLSSPKWVLAQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKRL 72

Query: 64  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
           TL+EKI  LVNSA ++ RLGIP YEWWSEALHGV+ VG G  FN  + GATSFPQVILTA
Sbjct: 73  TLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPGATSFPQVILTA 132

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
           ASF++ L+  IG+A+  EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDPL++
Sbjct: 133 ASFNASLFEAIGKAVSTEARAMYNVGLA-GLTFWSPNVNIFRDPRWGRGQETPGEDPLLS 191

Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
            KYA  YVRG+Q  + +G     +L+ +ACCKH+TAYDLDNWKG  R+ F+A VT QD+ 
Sbjct: 192 SKYASGYVRGLQ-QSDDGSP--DRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMD 248

Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
           DT+QPPF+SCV  G  + +MC+YN+VNG P+CAD +LLS   R +W  +GYI SDCD+V 
Sbjct: 249 DTFQPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVD 308

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
           + Y+++ Y K+PE+A    + AG+D+NCGSFL +HT+AAVK   + ES +D+A+ N F+ 
Sbjct: 309 VFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFAT 368

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
            MRLG F+GNP+   +GK+G   VC+  HQ LA +AA+ GIVLLKNS G LPL  +   +
Sbjct: 369 LMRLGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKT 428

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
           LA+IGPNAN  KT++GNY G  C+  TPLQ L   V  T Y PGC  VAC +A ID+A  
Sbjct: 429 LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVA-TTYLPGCSNVACGTAQIDEAKK 487

Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
           IA  AD  VL++G+DQ+ E E  DRV++ LPG+Q  LIT VA+A+K  VILV++ GG  D
Sbjct: 488 IAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFD 547

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I+FAK D  I SILW GYPGEAG  A+A+VIFG +NP
Sbjct: 548 ISFAKNDDKITSILWVGYPGEAGGAAIADVIFGFYNP 584


>gi|359485890|ref|XP_002264183.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Vitis vinifera]
          Length = 774

 Score =  640 bits (1650), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/577 (55%), Positives = 412/577 (71%), Gaps = 10/577 (1%)

Query: 9   VFPLLCLCFTSLLTR----VDSTQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRL 63
           VF     CF+  L+     +  + P F+CD  +NP+   F FC T+L  + R  DLV RL
Sbjct: 13  VFLCFLSCFSHFLSSPKWVLAQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKRL 72

Query: 64  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
           TL+EKI  LVNSA ++ RLGIP YEWWSEALHGV+ VG G  FN  + GATSFPQVILTA
Sbjct: 73  TLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPGATSFPQVILTA 132

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
           ASF++ L+  IG+A+  EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDPL++
Sbjct: 133 ASFNASLFEAIGKAVSTEARAMYNVGLA-GLTFWSPNVNIFRDPRWGRGQETPGEDPLLS 191

Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
            KYA  YVRG+Q  + +G     +L+ +ACCKH+TAYDLDNWKG  R+ F+A VT QD+ 
Sbjct: 192 SKYASGYVRGLQ-QSDDGSP--DRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMD 248

Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
           DT+QPPF+SCV  G  + +MC+YN+VNG P+CAD +LLS   R +W  +GYI SDCD+V 
Sbjct: 249 DTFQPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVD 308

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
           + Y+++ Y K+PE+A    + AG+D+NCGSFL +HT+AAVK   + ES +D+A+ N F+ 
Sbjct: 309 VFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFAT 368

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
            MRLG F+GNP+   +GK+G   VC+  HQ LA +AA+ GIVLLKNS G LPL  +   +
Sbjct: 369 LMRLGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKT 428

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
           LA+IGPNAN  KT++GNY G  C+  TPLQ L   V  T Y PGC  VAC +A ID+A  
Sbjct: 429 LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVA-TTYLPGCSNVACGTAQIDEAKK 487

Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
           IA  AD  VL++G+DQ+ E E  DRV++ LPG+Q  LIT VA+A+K  VILV++ GG  D
Sbjct: 488 IAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFD 547

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I+FAK D  I SILW GYPGEAG  A+A+VIFG +NP
Sbjct: 548 ISFAKNDDKITSILWVGYPGEAGGAAIADVIFGFYNP 584


>gi|449436749|ref|XP_004136155.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 772

 Score =  638 bits (1646), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 300/574 (52%), Positives = 409/574 (71%), Gaps = 10/574 (1%)

Query: 9   VFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEK 68
           + P+L +  +++       + PF+CDP + +   +PFC+  LPI +R +DL+ RLTL EK
Sbjct: 8   IIPILII-LSAIFRHGGGAREPFACDPKDAALSRYPFCRVALPIPERVKDLIGRLTLQEK 66

Query: 69  ISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDS 128
           +  LVN+A A+PRLGI  YEWWSEALHGV+ VG G  F G   GATSFPQVI T ASF+ 
Sbjct: 67  VRLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVASFNV 126

Query: 129 YLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAV 188
            LW  IG+ +  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+V G+YA 
Sbjct: 127 SLWEAIGRVVSDEARAMYNGGAA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAA 185

Query: 189 SYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQP 248
            Y++G+QG+  +G +LK     +ACCKHFTAYDLDNW GT R+ F+A+VT QD+ DT++ 
Sbjct: 186 RYIKGLQGN--DGDRLK----VAACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEV 239

Query: 249 PFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDA 308
           PF  CVK+G+ + +MC+YN+VNG+P+CAD NLL  T R QWG +GYI SDCD+V + YD 
Sbjct: 240 PFRKCVKEGKVASVMCSYNQVNGVPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDN 299

Query: 309 EGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG 368
           + Y  + E+A  D +KAG+D++CG FL  HT+ AVK+  L ++ I+ AL N  +V+MRLG
Sbjct: 300 QHYTSTAEEAAADAIKAGLDLDCGPFLAVHTEDAVKKGLLTQTHINNALANTITVQMRLG 359

Query: 369 LFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
           +F+G P+   +GK+G   VCSP+HQ LAL AA+ GIVLLKN    LPL      ++A+IG
Sbjct: 360 MFDGAPSSHAYGKLGPKNVCSPSHQQLALDAARQGIVLLKNRLPGLPLSADHHRTVAVIG 419

Query: 429 PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSS-ASIDKAVDIAKG 487
           PN++   T++GNYAG +C  +TPL+ ++ Y    V+  GCD VAC++  S   A+  A  
Sbjct: 420 PNSDVNVTMIGNYAGVACGYVTPLEGIKRYT-TVVHRKGCDNVACATDYSFTDALAAAST 478

Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
           AD  VL+MGLDQ+ E E  DR  L+LPGRQQEL+ +VA A++ P +++L+ GGP+D++FA
Sbjct: 479 ADATVLVMGLDQSVEAETKDRDGLLLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFA 538

Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             D  I +ILW GYPG+AG  A+A+V+FG  NPG
Sbjct: 539 DNDPRISAILWVGYPGQAGGAAIADVLFGTTNPG 572


>gi|449505346|ref|XP_004162442.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
           [Cucumis sativus]
          Length = 772

 Score =  638 bits (1645), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 300/574 (52%), Positives = 409/574 (71%), Gaps = 10/574 (1%)

Query: 9   VFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEK 68
           + P+L +  +++       + PF+CDP + +   +PFC+  LPI +R +DL+ RLTL EK
Sbjct: 8   IIPILII-LSAIFRHGGGAREPFACDPKDAALSRYPFCRVALPIPERVKDLIGRLTLQEK 66

Query: 69  ISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDS 128
           +  LVN+A A+PRLGI  YEWWSEALHGV+ VG G  F G   GATSFPQVI T ASF+ 
Sbjct: 67  VRLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVASFNV 126

Query: 129 YLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAV 188
            LW  IG+ +  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+V G+YA 
Sbjct: 127 SLWEAIGRVVSDEARAMYNGGAA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEYAA 185

Query: 189 SYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQP 248
            Y++G+QG+  +G +LK     +ACCKHFTAYDLDNW GT R+ F+A+VT QD+ DT++ 
Sbjct: 186 RYIKGLQGN--DGDRLK----VAACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMVDTFEV 239

Query: 249 PFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDA 308
           PF  CVK+G+ + +MC+YN+VNG+P+CAD NLL  T R QWG +GYI SDCD+V + YD 
Sbjct: 240 PFRKCVKEGKVASVMCSYNQVNGVPTCADPNLLKGTIRNQWGLNGYIVSDCDSVGVFYDN 299

Query: 309 EGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG 368
           + Y  + E+A  D +KAG+D++CG FL  HT+ AVK+  L ++ I+ AL N  +V+MRLG
Sbjct: 300 QHYTSTAEEAAADAIKAGLDLDCGPFLAVHTEDAVKKXLLTQTHINNALANTITVQMRLG 359

Query: 369 LFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
           +F+G P+   +GK+G   VCSP+HQ LAL AA+ GIVLLKN    LPL      ++A+IG
Sbjct: 360 MFDGAPSSHAYGKLGPKNVCSPSHQQLALDAARQGIVLLKNRLPGLPLSAXHHRTVAVIG 419

Query: 429 PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSS-ASIDKAVDIAKG 487
           PN++   T++GNYAG +C  +TPL+ ++ Y    V+  GCD VAC++  S   A+  A  
Sbjct: 420 PNSDVNVTMIGNYAGVACGYVTPLEGIKRYT-TVVHRKGCDNVACATDYSFTDALAAAST 478

Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
           AD  VL+MGLDQ+ E E  DR  L+LPGRQQEL+ +VA A++ P +++L+ GGP+D++FA
Sbjct: 479 ADATVLVMGLDQSVEAETKDRDGLLLPGRQQELVLKVAAASRGPTVVILMSGGPIDVSFA 538

Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             D  I +ILW GYPG+AG  A+A+V+FG  NPG
Sbjct: 539 DNDPRISAILWVGYPGQAGGAAIADVLFGTTNPG 572


>gi|357444469|ref|XP_003592512.1| Xylosidase [Medicago truncatula]
 gi|355481560|gb|AES62763.1| Xylosidase [Medicago truncatula]
          Length = 781

 Score =  638 bits (1645), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 315/571 (55%), Positives = 403/571 (70%), Gaps = 5/571 (0%)

Query: 12  LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
            LC+ F+  +  + S +   +CD  +P T  FPFC T+L    RA+DLVSRLTL EK  Q
Sbjct: 7   FLCIFFSFFILPITSQK--HACDKGSPKTSNFPFCNTSLSYETRAKDLVSRLTLQEKAQQ 64

Query: 72  LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
           LVN +  I RLG+PAYEWWSEALHGV+ VG G  F+  + GATSFP VIL+AASF+  LW
Sbjct: 65  LVNPSTGISRLGVPAYEWWSEALHGVSNVGPGTRFDSRVPGATSFPAVILSAASFNETLW 124

Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
           Y +GQ +  EARA+YN   A G+TFW+PN+N+FRDPRWGRGQETPGEDPLV  +YAV+YV
Sbjct: 125 YTMGQVVSNEARAMYNVDLA-GLTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYV 183

Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
           RG+Q           +L+ S+CCKH+TAYD+DNWKG  R+ FDA+VT QDL DTYQPPF+
Sbjct: 184 RGLQEVGDEASAKGDRLKVSSCCKHYTAYDVDNWKGVDRFHFDAKVTKQDLEDTYQPPFK 243

Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
           SCV +G  S +MC+YNRVNGIP+CAD +LL    R QWG  GYI SDCD+V + Y++  Y
Sbjct: 244 SCVLEGHVSSVMCSYNRVNGIPTCADPDLLQGVIRGQWGLDGYIVSDCDSVEVYYNSIHY 303

Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
            K+PEDAV   LKAG+++NCG FL+K+T  AV  KK+  S +D+AL   + V MRLG F 
Sbjct: 304 TKTPEDAVALALKAGLNMNCGDFLKKYTANAVNLKKVDVSIVDQALVYNYIVLMRLGFFE 363

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
            NP   PF  +G   VC+  +Q LAL+AA+ GIVLL+N+ G LPL K+K  +LA+IGPNA
Sbjct: 364 -NPKSLPFANLGPSDVCTKENQQLALEAAKQGIVLLENNKGALPLSKTKIKNLAVIGPNA 422

Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADH 490
           N+   ++ NYAG  CR  +PLQ LQ Y+ +  Y  GC  V CS+ ++   AV  A  AD 
Sbjct: 423 NATTVMISNYAGIPCRYSSPLQGLQKYISSVTYARGCSDVKCSNQNLFAAAVKAAASADA 482

Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
           VVL++GLDQ+ E E LDRV+L LPG Q++L+  VA A K  +ILV++  GP+DI+F K  
Sbjct: 483 VVLVVGLDQSIEAEGLDRVNLTLPGFQEKLVKDVAAATKGTLILVIMAAGPIDISFTKSV 542

Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            NIG ILW GYPG+ G  A+A+VIFGD+NPG
Sbjct: 543 SNIGGILWVGYPGQDGGNAIAQVIFGDYNPG 573


>gi|255573163|ref|XP_002527511.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223533151|gb|EEF34909.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 810

 Score =  637 bits (1644), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 305/551 (55%), Positives = 391/551 (70%), Gaps = 3/551 (0%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
           F+CD ++P T  + FC T+L    RA+DL+SRLTL EK+ Q+VN A  IPRLGIPAYEWW
Sbjct: 25  FACDKNSPQTNDYSFCNTSLSYQDRAKDLISRLTLQEKVQQVVNHAAGIPRLGIPAYEWW 84

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
           SEALHGV+ VG G+ FNGT+ GATSFP +IL+AASF+  LW ++GQ +  EAR +++ G 
Sbjct: 85  SEALHGVSNVGFGVRFNGTVPGATSFPAMILSAASFNETLWLKMGQVVSTEARTMHSVGL 144

Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
           A G+T+W+PN+N+FRDPRWGRGQETPGEDPLV  +YAV+YVRG+Q     G     KL+ 
Sbjct: 145 A-GLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEGNSTADKLKV 203

Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
           S+CCKH+TAYDLD WKG  R+ FDA+VT QDL DTYQPPF SCV++   S +MC+YNRVN
Sbjct: 204 SSCCKHYTAYDLDKWKGVDRFHFDAKVTKQDLEDTYQPPFRSCVEEAHVSSVMCSYNRVN 263

Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
           GIP+CAD +LL    R +W   GYI SDCD++ + YD+  Y  +PEDAV   LKAG+++N
Sbjct: 264 GIPTCADPDLLKGIIRGEWNLDGYIVSDCDSIEVYYDSINYTATPEDAVALALKAGLNMN 323

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
           CG FL K+T  AVK  K+ ES +D+AL   F V MRLG F+G+P    FG +G   VCS 
Sbjct: 324 CGEFLGKYTVDAVKLNKVEESVVDQALIYNFIVLMRLGFFDGDPKSLLFGNLGPSDVCSD 383

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            HQ LAL AA+ GIVLL N  G LPL K+ + +LA+IGPNAN   T++ NYAG  C+  T
Sbjct: 384 GHQKLALDAARQGIVLLYNK-GALPLSKNNTRNLAVIGPNANVTTTMISNYAGIPCKYTT 442

Query: 451 PLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
           PLQ LQ YV    Y  GC +V+CS  + ID A   A  AD VVL++GLDQ+ E+E LDR 
Sbjct: 443 PLQGLQKYVSTVTYAAGCKSVSCSDDTLIDAATQAAAAADAVVLLVGLDQSIEREGLDRE 502

Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
           +L LPG Q++L+  V  A    V+LV++   P+D++FA     I  ILW GYPG+AG  A
Sbjct: 503 NLTLPGFQEKLVVDVVNATNGTVVLVVMSSSPIDVSFAVNKSKIKGILWVGYPGQAGGDA 562

Query: 570 LAEVIFGDHNP 580
           +A+V+FGD+NP
Sbjct: 563 VAQVMFGDYNP 573


>gi|359481045|ref|XP_002268626.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Vitis vinifera]
 gi|296089342|emb|CBI39114.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  637 bits (1643), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 319/578 (55%), Positives = 410/578 (70%), Gaps = 12/578 (2%)

Query: 9   VFPLLCLCFTSLLTR----VDSTQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRL 63
           VF     CF+  L+     +  + P F+CD  +NP+   F FC T+L  + R  DLV RL
Sbjct: 13  VFLCFLSCFSHFLSSPKWVLGQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKRL 72

Query: 64  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
           TL+EKI  LVNSA ++ RLGIP YEWWSEALHGV+ VG G  FN  + GATSFPQVILTA
Sbjct: 73  TLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSIVPGATSFPQVILTA 132

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
           ASF++ L+  IG+ +  EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDPL++
Sbjct: 133 ASFNASLFEAIGKVVSTEARAMYNVGLA-GLTFWSPNVNIFRDPRWGRGQETPGEDPLLS 191

Query: 184 GKYAVSYVRGVQ-GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            KYA +YVRG+Q GD  +  +LK     +ACCKH+TAYDLDNWKG  R  F+A VT QD+
Sbjct: 192 SKYASAYVRGLQQGDDGSPDRLK----VAACCKHYTAYDLDNWKGVDRLHFNAVVTKQDM 247

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
            DT+QPPF+SCV  G  + +MC++N+VNG P+CAD +LLS   R +W  +GYI SDCD+V
Sbjct: 248 DDTFQPPFKSCVIDGNVASVMCSFNQVNGKPTCADPDLLSGIVRGEWKLNGYIVSDCDSV 307

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
            + Y+++ Y K+PE+A    + AG+D+NCGSFL +HT+AAVK   + ES +D+A+ N F+
Sbjct: 308 DVFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFA 367

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
             MRLG F+GNP+   +GK+G   VC+  HQ +A +AA+ GIVLLKNS G LPL  +   
Sbjct: 368 TLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQEMAREAARQGIVLLKNSKGSLPLSPTAIK 427

Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAV 482
           +LA+IGPNAN  KT++GNY G  C+  TPLQ L   V  T Y PGC  VAC +A ID+A 
Sbjct: 428 TLAIIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVA-TTYLPGCSNVACGTAQIDEAK 486

Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
            IA  AD  VL++G+DQ+ E E  DRV + LPG+Q  LIT VA+A+K  VILV++ GG  
Sbjct: 487 KIAAAADATVLIVGIDQSIEAEGRDRVSIQLPGQQPLLITEVAKASKGNVILVVMSGGGF 546

Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           DI+FAK D  I SILW GYPGEAG  A+A+VIFG +NP
Sbjct: 547 DISFAKNDDKIASILWVGYPGEAGGAAIADVIFGFYNP 584


>gi|449484229|ref|XP_004156823.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
           [Cucumis sativus]
          Length = 769

 Score =  636 bits (1640), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 300/572 (52%), Positives = 410/572 (71%), Gaps = 11/572 (1%)

Query: 13  LCLCFTSLLTRVD--STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
           L L    LL+ +   +++ PF+CDP+N  T  +PFC+ +L + +R +DL+ RLTL+EK+ 
Sbjct: 7   LILSIFILLSAIHGRASRAPFACDPNNSVTTDYPFCRRSLVVEERVKDLIGRLTLEEKVK 66

Query: 71  QLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYL 130
            LV++A  +PRLGI AY+WWSEALHGV+ VG G  F G    ATSFPQVI TAASF++ L
Sbjct: 67  LLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTAASFNASL 126

Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
           W  IG+ +  EARA+YN G   G+T+W+PN+NIFRDPRWGRGQETPGEDP++ G YAV+Y
Sbjct: 127 WEAIGRVVSDEARAMYNGGVG-GLTYWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNY 185

Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
           VRG+QG   N      +L+ +ACCKHFTAYDLDNW G  R+ F+A+V+ QD+ DT++ PF
Sbjct: 186 VRGLQGTEGN------RLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPF 239

Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG 310
             CVK G+ S +MC+YN+VNG+P+CAD NLL+ T R QW   GYI SDCD+V + Y+++ 
Sbjct: 240 RMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQH 299

Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
           Y  +PE+A    +KAG+D++CGSFL+ HT+ AVK+  L ES I+ AL N  SV+MRLG+F
Sbjct: 300 YTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMF 359

Query: 371 NGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPN 430
           +G+   QP+  +GA  VCS  ++ LA+ AA+ GIVLL+N  G LPL  ++   +A++GPN
Sbjct: 360 DGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPN 419

Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGAD 489
           +N+  T++GNYAG +C  ITPLQ +  Y   T++  GC  VAC S+     A++ A+ AD
Sbjct: 420 SNATLTMIGNYAGIACEYITPLQGISKYTR-TIHQEGCRGVACRSNKFFGGAIEAARVAD 478

Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
            VVL+MGLDQ+ E E  DR  L+LPG Q +L+ +VA  AK PVILVL+ GGP+D++FAK 
Sbjct: 479 AVVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKD 538

Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
              I  I+W GYPG+AG +A+A+V+FG  NPG
Sbjct: 539 HPKISGIIWGGYPGQAGGLAIADVLFGQTNPG 570


>gi|449469042|ref|XP_004152230.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 769

 Score =  635 bits (1639), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 300/572 (52%), Positives = 410/572 (71%), Gaps = 11/572 (1%)

Query: 13  LCLCFTSLLTRVD--STQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
           L L    LL+ +   +++ PF+CDP+N  T  +PFC+ +L + +R +DL+ RLTL+EK+ 
Sbjct: 7   LILSIFILLSAIHGRASRAPFACDPNNSVTTDYPFCRRSLVVGERVKDLIGRLTLEEKVK 66

Query: 71  QLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYL 130
            LV++A  +PRLGI AY+WWSEALHGV+ VG G  F G    ATSFPQVI TAASF++ L
Sbjct: 67  LLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTAASFNASL 126

Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
           W  IG+ +  EARA+YN G   G+T+W+PN+NIFRDPRWGRGQETPGEDP++ G YAV+Y
Sbjct: 127 WEAIGRVVSDEARAMYNGGVG-GLTYWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNY 185

Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
           VRG+QG   N      +L+ +ACCKHFTAYDLDNW G  R+ F+A+V+ QD+ DT++ PF
Sbjct: 186 VRGLQGTEGN------RLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFEVPF 239

Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG 310
             CVK G+ S +MC+YN+VNG+P+CAD NLL+ T R QW   GYI SDCD+V + Y+++ 
Sbjct: 240 RMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFYNSQH 299

Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
           Y  +PE+A    +KAG+D++CGSFL+ HT+ AVK+  L ES I+ AL N  SV+MRLG+F
Sbjct: 300 YTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMRLGMF 359

Query: 371 NGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPN 430
           +G+   QP+  +GA  VCS  ++ LA+ AA+ GIVLL+N  G LPL  ++   +A++GPN
Sbjct: 360 DGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAVVGPN 419

Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGAD 489
           +N+  T++GNYAG +C  ITPLQ +  Y   T++  GC  VAC S+     A++ A+ AD
Sbjct: 420 SNATLTMIGNYAGIACEYITPLQGISKYTR-TIHQEGCRGVACRSNKFFGGAIEAARVAD 478

Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
            VVL+MGLDQ+ E E  DR  L+LPG Q +L+ +VA  AK PVILVL+ GGP+D++FAK 
Sbjct: 479 AVVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVSFAKD 538

Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
              I  I+W GYPG+AG +A+A+V+FG  NPG
Sbjct: 539 HPKISGIIWGGYPGQAGGLAIADVLFGQTNPG 570


>gi|408354266|gb|AFU54452.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
          Length = 775

 Score =  635 bits (1638), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 302/559 (54%), Positives = 401/559 (71%), Gaps = 8/559 (1%)

Query: 24  VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
           V   +PPF+CDP NP T    FC+ T+PI  R +DL+ RLTL EKI  LVN+A A+PRLG
Sbjct: 25  VVHARPPFACDPHNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLG 84

Query: 84  IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
           I  YEWWSEALHGV+ VG G  F G   GATSFPQVI TAASF+  LW  IG+ +  EAR
Sbjct: 85  IQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRVVPDEAR 144

Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
           A+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++  KYA  YV+G+QGD      
Sbjct: 145 AMYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQGD-----G 198

Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
              +L+ +ACCKH+TAYDLDNW G  R+ F+ARV+ QDLADTY  PF++CV +G  + +M
Sbjct: 199 AGNRLKVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVM 258

Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
           C+YN+VNG P+CAD +LL  T R QW  +GYI SDCD+V ++Y+ + Y ++PE+A  D +
Sbjct: 259 CSYNQVNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHYTRTPEEAAADAI 318

Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
           KAG+D++CG FL  HT+AAV++  + + EI+ AL N  +V+MRLG+F+G P+   +G +G
Sbjct: 319 KAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLG 378

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
              VC+PAHQ LAL+AA+ GIVLL+N    LPL   +  ++A+IGPN++   T++GNYAG
Sbjct: 379 PRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAG 438

Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQE 502
            +C   TPLQ +  Y   T++  GC  V C+   +   A   A+ AD  VL+MGLDQ+ E
Sbjct: 439 VACGYTTPLQGIGRYTR-TIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIE 497

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E +DRV L+LPG QQEL++RVA A++ P ILVL+ GGP+D+TFAK D  I +I+W GYP
Sbjct: 498 AEFVDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYP 557

Query: 563 GEAGAVALAEVIFGDHNPG 581
           G+AG  A+A+V+FG  NPG
Sbjct: 558 GQAGGTAIADVLFGTTNPG 576


>gi|408354264|gb|AFU54451.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
          Length = 775

 Score =  635 bits (1637), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 302/559 (54%), Positives = 401/559 (71%), Gaps = 8/559 (1%)

Query: 24  VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
           V   +PPF+CDP NP T    FC+ T+PI  R +DL+ RLTL EKI  LVN+A A+PRLG
Sbjct: 25  VVHARPPFACDPHNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLG 84

Query: 84  IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
           I  YEWWSEALHGV+ VG G  F G   GATSFPQVI TAASF+  LW  IG+ +  EAR
Sbjct: 85  IQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRGVPDEAR 144

Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
           A+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++  KYA  YV+G+QGD      
Sbjct: 145 AMYNGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQGD-----G 198

Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
              +L+ +ACCKH+TAYDLDNW G  R+ F+ARV+ QDLADTY  PF++CV +G  + +M
Sbjct: 199 AGNRLKVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHVASVM 258

Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
           C+YN+VNG P+CAD +LL  T R QW  +GYI SDCD+V ++Y+ + Y ++PE+A  D +
Sbjct: 259 CSYNQVNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHYTRTPEEAAADAI 318

Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
           KAG+D++CG FL  HT+AAV++  + + EI+ AL N  +V+MRLG+F+G P+   +G +G
Sbjct: 319 KAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYGNLG 378

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
              VC+PAHQ LAL+AA+ GIVLL+N    LPL   +  ++A+IGPN++   T++GNYAG
Sbjct: 379 PRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGNYAG 438

Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQE 502
            +C   TPLQ +  Y   T++  GC  V C+   +   A   A+ AD  VL+MGLDQ+ E
Sbjct: 439 VACGYTTPLQGIGRYTR-TIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQSIE 497

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E +DRV L+LPG QQEL++RVA A++ P ILVL+ GGP+D+TFAK D  I +I+W GYP
Sbjct: 498 AEFVDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWVGYP 557

Query: 563 GEAGAVALAEVIFGDHNPG 581
           G+AG  A+A+V+FG  NPG
Sbjct: 558 GQAGGTAIADVLFGTTNPG 576


>gi|26449574|dbj|BAC41913.1| putative beta-xylosidase [Arabidopsis thaliana]
          Length = 732

 Score =  634 bits (1636), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 304/534 (56%), Positives = 389/534 (72%), Gaps = 19/534 (3%)

Query: 65  LDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAA 124
           L EKI QL N+A ++PRLGIP YEWWSE+LHG+A  G G+ FNG+I  ATSFPQVI++AA
Sbjct: 2   LPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAA 61

Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
           SF+  LWY IG A+ +E RA+YN GQA G+TFWAPNIN+FRDPRWGRGQETPGEDP V  
Sbjct: 62  SFNRTLWYEIGSAVAVEGRAMYNGGQA-GLTFWAPNINVFRDPRWGRGQETPGEDPKVVS 120

Query: 185 KYAVSYVRGVQ----------------GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGT 228
           +Y V +VRG Q                 D  +     GKL  SACCKHFTAYDL+ W   
Sbjct: 121 EYGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNF 180

Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
           TRY F+A VT QD+ DTYQPPFE+C++ G+AS +MC+YN VNG+P+CA  +LL K AR +
Sbjct: 181 TRYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQK-ARVE 239

Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKL 348
           WGF GYITSDCDAV+ I+  +GY KSPE+AV D +KAG+D+NCG+++ +HT++A++Q K+
Sbjct: 240 WGFEGYITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKV 299

Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
            E  +DRAL NLF+V++RLGLF+G+P    +GK+G++ +CS  H+ LAL+A + GIVLLK
Sbjct: 300 SEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLK 359

Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC 468
           N H LLPL K+   SLA++GP AN+   + G Y G  C+  T    L  YV+ T Y  GC
Sbjct: 360 NDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGC 419

Query: 469 DTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEA 527
             V+C S +   +AV IAKGAD V+++ GLD +QE E+ DRV L LPG+Q++L++ VA  
Sbjct: 420 SDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAV 479

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +KKPVILVL  GGPVD+TFAK D  IGSI+W GYPGE G  ALAE+IFGD NPG
Sbjct: 480 SKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPG 533


>gi|357449039|ref|XP_003594795.1| Beta xylosidase [Medicago truncatula]
 gi|355483843|gb|AES65046.1| Beta xylosidase [Medicago truncatula]
          Length = 762

 Score =  633 bits (1633), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 300/567 (52%), Positives = 402/567 (70%), Gaps = 10/567 (1%)

Query: 15  LCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVN 74
           +   S+L  +   + PF+CDP N  T ++ FC T +PI  R +DL+ RL L EKI  +VN
Sbjct: 11  MLLISILVTLSEGRVPFACDPKNGLTRSYKFCNTRVPIHARVQDLIGRLALPEKIRLVVN 70

Query: 75  SAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRI 134
           +A A+PRLGI  YEWWSEALHGV+ VG G  F G    ATSFPQVI TAASF+  LW  I
Sbjct: 71  NAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFSAATSFPQVITTAASFNQSLWLEI 130

Query: 135 GQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
           G+ +  EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP V GKYA SYV+G+
Sbjct: 131 GRIVSDEARAMYNGGAA-GLTFWSPNVNIFRDPRWGRGQETPGEDPTVAGKYAASYVQGL 189

Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
           QG   NG     +L+ +ACCKH+TAYDLDNW G  R+ F+A+V+ QDLADTY  PF++CV
Sbjct: 190 QG---NGAG--NRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLADTYDVPFKACV 244

Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKS 314
           + G+ + +MC+YN+VNG P+CAD  LL  T R +WG +GYI SDCD+V ++YD + Y ++
Sbjct: 245 RDGKVASVMCSYNQVNGKPTCADPELLRNTIRGEWGLNGYIVSDCDSVGVLYDNQHYTRT 304

Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
           PE A    +KAG+D++CG FL  HT  A+KQ  + E++++ AL NL +V+MRLG+F+G+ 
Sbjct: 305 PEQAAAAAIKAGLDLDCGPFLALHTDGAIKQGLISENDLNLALANLITVQMRLGMFDGD- 363

Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
             QP+G +G   VC P+H  +AL+AA+ GIVLL+N    LPL  ++  ++ +IGPN++  
Sbjct: 364 -AQPYGNLGTRDVCLPSHNDVALEAARQGIVLLQNKGNALPLSPTRYRTVGVIGPNSDVT 422

Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVL 493
            T++GNYAG +C   TPLQ +  YV+ T++  GC  V C    +   +  +A+ AD  VL
Sbjct: 423 VTMIGNYAGIACGYTTPLQGIARYVK-TIHQAGCKDVGCGGNQLFGLSEQVARQADATVL 481

Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           +MGLDQ+ E E  DR  L+LPG QQEL++RVA AA+ PVILVL+ GGP+D+TFAK D  I
Sbjct: 482 VMGLDQSIEAEFRDRTGLLLPGHQQELVSRVARAARGPVILVLMSGGPIDVTFAKNDPKI 541

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNP 580
            +ILW GYPG++G  A+A+VIFG  NP
Sbjct: 542 SAILWVGYPGQSGGTAIADVIFGRTNP 568


>gi|74355968|dbj|BAE44362.1| alpha-L-arabinofuranosidase [Raphanus sativus]
          Length = 780

 Score =  632 bits (1631), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 317/577 (54%), Positives = 403/577 (69%), Gaps = 15/577 (2%)

Query: 6   LSLVFPLLCLCFTSLLTRVDSTQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLT 64
           LSL+F  LCL  +S      ST P F+CD   NPS   + FC T + I  R  DLV+RLT
Sbjct: 20  LSLIF--LCLLDSS---NAQST-PVFACDVAGNPSLAAYGFCNTAIKIEYRVADLVARLT 73

Query: 65  LDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAA 124
           L EKI  L +    + RLGIP YEWWSEALHGV+ VG G  F+G + GATSFPQVILTAA
Sbjct: 74  LQEKIGVLTSKLHGVARLGIPTYEWWSEALHGVSYVGPGTRFSGQVPGATSFPQVILTAA 133

Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
           SF+  L+  IG+ +  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPL++ 
Sbjct: 134 SFNVSLFQAIGKVVSTEARAMYNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPLLSS 192

Query: 185 KYAVSYVRGVQ-GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
           KYA  YV+G+Q  D+ +  +LK     +ACCKH+TAYD+DNWKG  RY F+A V  QDL 
Sbjct: 193 KYASGYVKGLQETDSSDANRLK----VAACCKHYTAYDVDNWKGVERYSFNAVVNQQDLD 248

Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
           DTYQPPF+SCV  G  + +MC+YN+VNG P+CAD +LLS   R +W  +GYI SDCD+V 
Sbjct: 249 DTYQPPFKSCVVDGNVASVMCSYNKVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVD 308

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
           ++Y  + Y K+PE+A    + AG+D+NCG FL  HT+AAVK   + E+ ID+A+ N F  
Sbjct: 309 VLYKNQHYTKTPEEAAAISINAGLDLNCGYFLGDHTEAAVKAGLVKEAAIDKAITNNFLT 368

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
            MRLG F+G+P  Q +G +G   VC+PA+Q LA +AA+ GIVLLKN+ G LPL      +
Sbjct: 369 LMRLGFFDGDPKKQIYGGLGPKDVCTPANQELAAEAARQGIVLLKNT-GALPLSPKTIKT 427

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
           LA+IGPNAN  KT++GNY G  C+  TPLQ L   V +T Y PGC  VAC+ A +  +  
Sbjct: 428 LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGTV-HTTYLPGCSNVACAVADVAGSTK 486

Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
           +A  +D  VL++G DQ+ E E  DRVDL LPG+QQEL+T+VA+AAK PV LV++ GG  D
Sbjct: 487 LAAASDATVLVIGADQSIEAESRDRVDLNLPGQQQELVTQVAKAAKGPVFLVIMSGGGFD 546

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           ITFAK D  I  ILW GYPGEAG +A A+VIFG +NP
Sbjct: 547 ITFAKNDAKIAGILWVGYPGEAGGIATADVIFGRYNP 583


>gi|297797477|ref|XP_002866623.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
 gi|297312458|gb|EFH42882.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
          Length = 784

 Score =  632 bits (1630), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 315/578 (54%), Positives = 403/578 (69%), Gaps = 10/578 (1%)

Query: 6   LSLVFPLLCLCFTSLLTRVDSTQ--PPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSR 62
           +S VF L  LCF      + + Q  P F+CD  +NPS   + FC T L I  R  DLV+R
Sbjct: 18  VSSVF-LTFLCFFLYFLDLSNAQSSPVFACDVAANPSLAAYGFCNTVLKIEYRVADLVAR 76

Query: 63  LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
           LTL EKI  LV+ A  + RLGIP YEWWSEALHGV+ +G G  F+  + GATSFPQVILT
Sbjct: 77  LTLQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQVILT 136

Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLV 182
           AASF+  L+  IG+ +  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPL+
Sbjct: 137 AASFNVSLFQAIGKVVSTEARAMYNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPLL 195

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
             KYA  YV+G+Q    +GG    +L+ +ACCKH+TAYD+DNWKG  RY F+A VT QD+
Sbjct: 196 ASKYASGYVKGLQ--ETDGGD-SNRLKVAACCKHYTAYDVDNWKGVERYSFNAVVTQQDM 252

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
            DTYQPPF+SCV  G  + +MC+YN+VNG P+CAD +LLS   R +W  +GYI SDCD+V
Sbjct: 253 DDTYQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSV 312

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
            ++Y  + Y K+P +A    + AG+D+NCGSFL +HT+ AVK   + E+ ID+A+ N F 
Sbjct: 313 DVLYKNQHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFL 372

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
             MRLG F+GNP  Q +G +G   VC+ A+Q LA  AA+ GIVLLKN+ G LPL      
Sbjct: 373 TLMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNT-GFLPLSPKSIK 431

Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAV 482
           +LA+IGPNAN  KT++GNY G  C+  TPLQ L   V +T Y PGC  VAC+ A +  A 
Sbjct: 432 TLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGAV-STTYLPGCSNVACAVADVAGAT 490

Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
            +A  AD  VL++G DQ+ E E  DRVDL LPG+QQEL+ +VA+AAK PV+LV++ GG  
Sbjct: 491 KLAATADVTVLLIGADQSIEAESRDRVDLNLPGQQQELVIQVAKAAKGPVLLVIMSGGGF 550

Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           DITFAK D  I  ILW GYPGEAG +A+A++IFG +NP
Sbjct: 551 DITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNP 588


>gi|298364130|gb|ADI79208.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Malus x domestica]
          Length = 774

 Score =  632 bits (1629), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 299/558 (53%), Positives = 395/558 (70%), Gaps = 9/558 (1%)

Query: 24  VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
           V   +PPF+CDP NP T T  FC+  +PI  R +DL+ RLTL EKI  LVN+A A+PRLG
Sbjct: 25  VVHARPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLG 84

Query: 84  IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
           I  YEWWSEALHGV+ VG G  F GT  GATSFPQVI TAASF+  LW  IG+ +  EAR
Sbjct: 85  IQGYEWWSEALHGVSNVGPGTKF-GTFLGATSFPQVITTAASFNESLWEEIGRVVSDEAR 143

Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
           A+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP++  KY   YV+G+QGD      
Sbjct: 144 AMYNGGAA-GLTFWSPNVNIFRDPRWGRGQETPGEDPILAAKYGARYVKGLQGD-----G 197

Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
              +L+ +ACCKH+TAYDLDNW G  R+ F+ARV+ QDL DTY  PF +CV  G  + +M
Sbjct: 198 AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFRACVVDGNVASVM 257

Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
           C+YN+VNG P+CAD  LL  T R QW  +GYI SDCD+V + YD + Y K+PE+A    +
Sbjct: 258 CSYNQVNGKPTCADPELLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEEAAAYAI 317

Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
           KAG+D++CG FL  HT+AAV+  ++ E +I+ AL N  +V+MRLG+F+G P+ Q +G +G
Sbjct: 318 KAGLDLDCGPFLGIHTEAAVRFGQVNEIDINYALANTITVQMRLGMFDGEPSAQRYGNLG 377

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
              VC P+   LAL+AA+ GIVLL+N    LPL   +  ++A+IGPN++  +T++GNYAG
Sbjct: 378 LADVCKPSSNELALEAARQGIVLLENRGNSLPLSTMRHRTVAVIGPNSDVTETMIGNYAG 437

Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI-AKGADHVVLMMGLDQTQE 502
            +C   TPLQ +  Y   T++  GC  V C+   +  A ++ A+ AD  VL++GLDQ+ E
Sbjct: 438 IACGYTTPLQGIARYTR-TIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIE 496

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E  DR DL+LPG QQEL++RVA A++ P ILV++ GGP+D+TFAK D  IG+I+W GYP
Sbjct: 497 AEFRDRTDLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYP 556

Query: 563 GEAGAVALAEVIFGDHNP 580
           G+AG  A+A+V+FG  NP
Sbjct: 557 GQAGGTAIADVLFGTTNP 574


>gi|224054312|ref|XP_002298197.1| predicted protein [Populus trichocarpa]
 gi|222845455|gb|EEE83002.1| predicted protein [Populus trichocarpa]
          Length = 741

 Score =  632 bits (1629), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 315/556 (56%), Positives = 402/556 (72%), Gaps = 8/556 (1%)

Query: 28  QPPFSCDP-SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
            P F+CD  SNPS  +F FC T+L +S R  DLV RLTL EKI  LVNSA ++ RLGIP 
Sbjct: 1   SPVFACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRLGIPK 60

Query: 87  YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           YEWWSEALHGV+ VG G  F+  + GATSFPQVILTAASF++ L+  IG+ +  EARA+Y
Sbjct: 61  YEWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVVSTEARAMY 120

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
           N G A G+TFW+PNINIFRDPRWGRGQETPGEDPL++ KY   YV+G+Q    + G   G
Sbjct: 121 NVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQ--QRDDGNPDG 177

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            L+ +ACCKH+TAYDLDNWKG  RY F+A VT QD+ DT+QPPF+SCV  G  + +MC+Y
Sbjct: 178 -LKVAACCKHYTAYDLDNWKGVDRYHFNAVVTKQDMDDTFQPPFKSCVVDGNVASVMCSY 236

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
           N+VNGIP+CAD +LLS   R +W  +GYI +DCD++ + Y+++ Y K+PE+A    + AG
Sbjct: 237 NKVNGIPTCADPDLLSGVIRGEWKLNGYIVTDCDSIDVFYNSQHYTKTPEEAAAKAILAG 296

Query: 327 --MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
             +D+NCGSFL KHT+AAV    + ES IDRA+ N F+  MRLG F+G+P+ Q +GK+G 
Sbjct: 297 IRLDLNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGP 356

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
             VC+  +Q LA +AA+ GIVLLKN+ G LPL  +   +LA+IGPNAN  KT++GNY G 
Sbjct: 357 KDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGT 416

Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
            C+  TPLQ L   V  T Y PGC  VACS+A +D A  IA  AD  VL+MG D + E E
Sbjct: 417 PCKYTTPLQGLAALVA-TTYLPGCSNVACSTAQVDDAKKIAAAADATVLVMGADLSIEAE 475

Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
             DRVD++LPG+QQ LIT VA A+  PVILV++ GG +D++FAK +  I SILW GYPGE
Sbjct: 476 SRDRVDILLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGE 535

Query: 565 AGAVALAEVIFGDHNP 580
           AG  A+A++IFG +NP
Sbjct: 536 AGGAAIADIIFGSYNP 551


>gi|15239867|ref|NP_199747.1| beta-xylosidase 1 [Arabidopsis thaliana]
 gi|75262458|sp|Q9FGY1.1|BXL1_ARATH RecName: Full=Beta-D-xylosidase 1; Short=AtBXL1; AltName:
           Full=Alpha-L-arabinofuranosidase; Flags: Precursor
 gi|9759419|dbj|BAB09906.1| xylosidase [Arabidopsis thaliana]
 gi|21539545|gb|AAM53325.1| xylosidase [Arabidopsis thaliana]
 gi|332008419|gb|AED95802.1| beta-xylosidase 1 [Arabidopsis thaliana]
          Length = 774

 Score =  631 bits (1628), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 311/575 (54%), Positives = 408/575 (70%), Gaps = 16/575 (2%)

Query: 8   LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
           LVF LLCL  +S     +S +P F+CDP+N  T T  FC+  +PI  R +DL+ RLTL E
Sbjct: 18  LVF-LLCLVHSS-----ESLRPLFACDPANGLTRTLRFCRANVPIHVRVQDLLGRLTLQE 71

Query: 68  KISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFD 127
           KI  LVN+A A+PRLGI  YEWWSEALHG++ VG G  F G   GATSFPQVI TAASF+
Sbjct: 72  KIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFPQVITTAASFN 131

Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
             LW  IG+ +  EARA+YN G A G+T+W+PN+NI RDPRWGRGQETPGEDP+V  KYA
Sbjct: 132 QSLWEEIGRVVSDEARAMYNGGVA-GLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYA 190

Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
            SYVRG+QG T  G +LK     +ACCKH+TAYDLDNW G  R+ F+A+VT QDL DTY 
Sbjct: 191 ASYVRGLQG-TAAGNRLK----VAACCKHYTAYDLDNWNGVDRFHFNAKVTQQDLEDTYN 245

Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
            PF+SCV +G+ + +MC+YN+VNG P+CAD NLL  T R QW  +GYI SDCD+V + ++
Sbjct: 246 VPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGQWRLNGYIVSDCDSVDVFFN 305

Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
            + Y  +PE+A    +KAG+D++CG FL   T+ AVK+  L E++I+ AL N  +V+MRL
Sbjct: 306 QQHYTSTPEEAAARSIKAGLDLDCGPFLAIFTEGAVKKGLLTENDINLALANTLTVQMRL 365

Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALI 427
           G+F+GN  + P+  +G   VC+PAH+ LAL+AA  GIVLLKNS   LPL   +  ++A+I
Sbjct: 366 GMFDGN--LGPYANLGPRDVCTPAHKHLALEAAHQGIVLLKNSARSLPLSPRRHRTVAVI 423

Query: 428 GPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACS-SASIDKAVDIAK 486
           GPN++  +T++GNYAG +C   +PLQ +  Y   T++  GC  VAC  +     A   A+
Sbjct: 424 GPNSDVTETMIGNYAGKACAYTSPLQGISRYAR-TLHQAGCAGVACKGNQGFGAAEAAAR 482

Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
            AD  VL+MGLDQ+ E E  DR  L+LPG QQ+L+TRVA+A++ PVILVL+ GGP+D+TF
Sbjct: 483 EADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTF 542

Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           AK D  + +I+WAGYPG+AG  A+A +IFG  NPG
Sbjct: 543 AKNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPG 577


>gi|15242492|ref|NP_196535.1| beta-xylosidase 3 [Arabidopsis thaliana]
 gi|75264323|sp|Q9LXD6.1|BXL3_ARATH RecName: Full=Beta-D-xylosidase 3; Short=AtBXL3; AltName:
           Full=Alpha-L-arabinofuranosidase; Flags: Precursor
 gi|7671416|emb|CAB89357.1| beta-xylosidase-like protein [Arabidopsis thaliana]
 gi|9759004|dbj|BAB09531.1| beta-xylosidase [Arabidopsis thaliana]
 gi|15450735|gb|AAK96639.1| AT5g09730/F17I14_80 [Arabidopsis thaliana]
 gi|332004056|gb|AED91439.1| beta-xylosidase 3 [Arabidopsis thaliana]
          Length = 773

 Score =  631 bits (1628), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 313/571 (54%), Positives = 402/571 (70%), Gaps = 9/571 (1%)

Query: 13  LCLCFTSLLTRVDSTQ--PPFSCDPS-NPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
           L LCF   ++   + Q  P F+CD + NPS     FC   L I  R  DLV RLTL+EKI
Sbjct: 14  LFLCFIVCISEQSNNQSSPVFACDVTGNPSLAGLRFCNAGLSIKARVTDLVGRLTLEEKI 73

Query: 70  SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
             L + A  + RLGIP+Y+WWSEALHGV+ VG G  F G + GATSFPQVILTAASF+  
Sbjct: 74  GFLTSKAIGVSRLGIPSYKWWSEALHGVSNVGGGSRFTGQVPGATSFPQVILTAASFNVS 133

Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
           L+  IG+ +  EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP ++ KYAV+
Sbjct: 134 LFQAIGKVVSTEARAMYNVGSA-GLTFWSPNVNIFRDPRWGRGQETPGEDPTLSSKYAVA 192

Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
           YV+G+Q    +GG    +L+ +ACCKH+TAYD+DNW+   R  F+A V  QDLADT+QPP
Sbjct: 193 YVKGLQ--ETDGGD-PNRLKVAACCKHYTAYDIDNWRNVNRLTFNAVVNQQDLADTFQPP 249

Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
           F+SCV  G  + +MC+YN+VNG P+CAD +LLS   R QW  +GYI SDCD+V +++  +
Sbjct: 250 FKSCVVDGHVASVMCSYNQVNGKPTCADPDLLSGVIRGQWQLNGYIVSDCDSVDVLFRKQ 309

Query: 310 GYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
            YAK+PE+AV   L AG+D+NC  F  +H   AVK   + E+ ID+A+ N F+  MRLG 
Sbjct: 310 HYAKTPEEAVAKSLLAGLDLNCDHFNGQHAMGAVKAGLVNETAIDKAISNNFATLMRLGF 369

Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP 429
           F+G+P  Q +G +G   VC+  +Q LA   A+ GIVLLKNS G LPL  S   +LA+IGP
Sbjct: 370 FDGDPKKQLYGGLGPKDVCTADNQELARDGARQGIVLLKNSAGSLPLSPSAIKTLAVIGP 429

Query: 430 NANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
           NAN+ +T++GNY G  C+  TPLQ L   V +T Y  GC+ VAC  A I  AVD+A  AD
Sbjct: 430 NANATETMIGNYHGVPCKYTTPLQGLAETVSST-YQLGCN-VACVDADIGSAVDLAASAD 487

Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
            VVL++G DQ+ E+E  DRVDL LPG+QQEL+TRVA AA+ PV+LV++ GG  DITFAK 
Sbjct: 488 AVVLVVGADQSIEREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAKN 547

Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           D+ I SI+W GYPGEAG +A+A+VIFG HNP
Sbjct: 548 DKKITSIMWVGYPGEAGGLAIADVIFGRHNP 578


>gi|15237736|ref|NP_201262.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
 gi|75262663|sp|Q9FLG1.1|BXL4_ARATH RecName: Full=Beta-D-xylosidase 4; Short=AtBXL4; Flags: Precursor
 gi|10178060|dbj|BAB11424.1| beta-xylosidase [Arabidopsis thaliana]
 gi|332010539|gb|AED97922.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
          Length = 784

 Score =  630 bits (1625), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 312/572 (54%), Positives = 399/572 (69%), Gaps = 9/572 (1%)

Query: 12  LLCLCFTSLLTRVDSTQ--PPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEK 68
           L+ LCF        + Q  P F+CD  +NPS   + FC T L I  R  DLV+RLTL EK
Sbjct: 23  LIFLCFFLYFLNFSNAQSSPVFACDVAANPSLAAYGFCNTVLKIEYRVADLVARLTLQEK 82

Query: 69  ISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDS 128
           I  LV+ A  + RLGIP YEWWSEALHGV+ +G G  F+  + GATSFPQVILTAASF+ 
Sbjct: 83  IGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQVILTAASFNV 142

Query: 129 YLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAV 188
            L+  IG+ +  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPL+  KYA 
Sbjct: 143 SLFQAIGKVVSTEARAMYNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPLLASKYAS 201

Query: 189 SYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQP 248
            YV+G+Q    +GG    +L+ +ACCKH+TAYD+DNWKG  RY F+A VT QD+ DTYQP
Sbjct: 202 GYVKGLQ--ETDGGD-SNRLKVAACCKHYTAYDVDNWKGVERYSFNAVVTQQDMDDTYQP 258

Query: 249 PFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDA 308
           PF+SCV  G  + +MC+YN+VNG P+CAD +LLS   R +W  +GYI SDCD+V ++Y  
Sbjct: 259 PFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKN 318

Query: 309 EGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG 368
           + Y K+P +A    + AG+D+NCGSFL +HT+ AVK   + E+ ID+A+ N F   MRLG
Sbjct: 319 QHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLG 378

Query: 369 LFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
            F+GNP  Q +G +G   VC+ A+Q LA  AA+ GIVLLKN+ G LPL      +LA+IG
Sbjct: 379 FFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNT-GCLPLSPKSIKTLAVIG 437

Query: 429 PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGA 488
           PNAN  KT++GNY G  C+  TPLQ L   V +T Y PGC  VAC+ A +  A  +A  A
Sbjct: 438 PNANVTKTMIGNYEGTPCKYTTPLQGLAGTV-STTYLPGCSNVACAVADVAGATKLAATA 496

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           D  VL++G DQ+ E E  DRVDL LPG+QQEL+ +VA+AAK PV+LV++ GG  DITFAK
Sbjct: 497 DVSVLVIGADQSIEAESRDRVDLHLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAK 556

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            D  I  ILW GYPGEAG +A+A++IFG +NP
Sbjct: 557 NDPKIAGILWVGYPGEAGGIAIADIIFGRYNP 588


>gi|302796585|ref|XP_002980054.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
 gi|300152281|gb|EFJ18924.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
          Length = 779

 Score =  629 bits (1623), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 308/576 (53%), Positives = 394/576 (68%), Gaps = 33/576 (5%)

Query: 23  RVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
           +V + Q P++CD  N +   F FC T LP S R  DL+SR+TL EKI QLVN+A  IPRL
Sbjct: 24  QVVAGQAPYACDQRNATLLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRL 83

Query: 83  GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
           G+P YEWW EALHGVA V  G+ F G   GATSFP  ILTAASFD         A+  EA
Sbjct: 84  GLPRYEWWQEALHGVA-VSPGVKFGGKFPGATSFPMPILTAASFD---------AVSTEA 133

Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
           RA++N  +A G+T+W+PN+NI+RDPRWGRGQETPGEDPL++ KYA  YVRG+Q     G 
Sbjct: 134 RAMHNYQRA-GLTYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLGGD 192

Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
           KLK     SACCKH TAYD+DNWKGTTR+KF+A VT QDL+DTY PPF+SCV+  + S +
Sbjct: 193 KLK----VSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSV 248

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHG----------------YITSDCDAVSIIY 306
           MC+YNRVNG+P+CAD NLLS T R  W  +G                YI SDCD++   +
Sbjct: 249 MCSYNRVNGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFF 308

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMR 366
           D   YAK+ ED V D L AG++++CG FL  HT++A+   K+ E+ +++AL  L++V+MR
Sbjct: 309 DNTNYAKTAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMR 368

Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
           LGL++GNP  QP+G +G   VC+  +Q LAL AA++GIVLLKN+  +LP  KS   ++A 
Sbjct: 369 LGLYDGNPRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAA 428

Query: 427 IGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIA 485
           IGP+A + + ++GNY G  C+  TP   L  Y    VY  GC  VAC S S I  A   A
Sbjct: 429 IGPHAKATRAMIGNYQGIPCKYTTPHDGLSAYAR-VVYSAGCSDVACYSNSLIGSAASTA 487

Query: 486 KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
             AD VVL +GLD  QE E  DR  L+LPG+QQEL+T V +AAK PV+LV+  GG VD++
Sbjct: 488 SQADAVVLFVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPVVLVIFSGGSVDVS 547

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           FAKYD+ +  +LWAGYPGEAG  A+A+V+FGDHNPG
Sbjct: 548 FAKYDKKVQGMLWAGYPGEAGGAAIAQVLFGDHNPG 583


>gi|297795695|ref|XP_002865732.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
 gi|297311567|gb|EFH41991.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
          Length = 774

 Score =  629 bits (1623), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 312/575 (54%), Positives = 407/575 (70%), Gaps = 16/575 (2%)

Query: 8   LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
           LVF LLCL  +S     +S +P F+CDP+N  T T  FC+  +PI  R +DL+ RLTL E
Sbjct: 18  LVF-LLCLVHSS-----ESLRPLFACDPANGLTRTLRFCRVNVPIHVRVQDLIGRLTLQE 71

Query: 68  KISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFD 127
           KI  LVN+A A+PRLGI  YEWWSEALHGV+ VG G  F G   GATSFPQVI TAASF+
Sbjct: 72  KIRNLVNNAAAVPRLGIGGYEWWSEALHGVSDVGPGSKFGGAFPGATSFPQVITTAASFN 131

Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
             LW  IG+ +  EARA+YN G A G+T+W+PN+NI RDPRWGRGQETPGEDP+V  KYA
Sbjct: 132 QSLWEEIGRVVSDEARAMYNGGVA-GLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYA 190

Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
            SYVRG+QG T  G +LK     +ACCKH+TAYDLDNW G  R+ F+A+VT QDL DTY 
Sbjct: 191 ASYVRGLQG-TAAGNRLK----VAACCKHYTAYDLDNWNGVDRFHFNAKVTQQDLEDTYN 245

Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
            PF+SCV +G+ + +MC+YN+VNG P+CAD NLL  T R +W  +GYI SDCD+V + ++
Sbjct: 246 VPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGKWRLNGYIVSDCDSVDVFFN 305

Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
            + Y  +PE+A    +KAG+D++CG FL   T+ AVK+  L E++I+ AL N  +V+MRL
Sbjct: 306 QQHYTSTPEEAAAASIKAGLDLDCGPFLAIFTEGAVKKGLLTENDINLALANTLTVQMRL 365

Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALI 427
           G+F+GN  + P+  +G   VCS AH+ LAL+AA  GIVLLKNS   LPL   +  ++A+I
Sbjct: 366 GMFDGN--LGPYANLGPRDVCSLAHKHLALEAAHQGIVLLKNSGRSLPLSPRRHRTVAVI 423

Query: 428 GPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACS-SASIDKAVDIAK 486
           GPN++  +T++GNYAG +C   TPLQ +  Y   T++  GC  VAC  +     A   A+
Sbjct: 424 GPNSDVTETMIGNYAGKACAYTTPLQGISRYAR-TLHQAGCAGVACKGNQGFGAAEAAAR 482

Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
            AD  VL+MGLDQ+ E E  DR  L+LPG QQ+L+TRVA+A++ PVILVL+ GGP+D+TF
Sbjct: 483 EADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTF 542

Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           AK D  + +I+WAGYPG+AG  A+A +IFG  NPG
Sbjct: 543 AKNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPG 577


>gi|449438167|ref|XP_004136861.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Cucumis sativus]
          Length = 782

 Score =  629 bits (1622), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 317/562 (56%), Positives = 400/562 (71%), Gaps = 16/562 (2%)

Query: 26  STQPP--FSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
           S Q P  F+CD  +NPS   F FC ++L    R  DLV RLTL EKI  L+N+A  + RL
Sbjct: 40  SAQSPTAFACDAETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRL 99

Query: 83  GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
           GIP YEWWSEALHGV+ VG G  F+  + GATSFPQVILTAASF++ L+  IG+ +  EA
Sbjct: 100 GIPKYEWWSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEA 159

Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ----GDT 198
           RA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPL++ KYA  YVRG+Q    GD 
Sbjct: 160 RAMYNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRDDGD- 217

Query: 199 FNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
                   +L+ +ACCKH+TAYDLDNWKGT RY F+A V+ QDL DT+QPPF+SCV  G 
Sbjct: 218 ------PDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGN 271

Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
            + +MC+YN+VNG P+CAD +LL+   R QW  +GYI SDCD+V ++Y+++ Y KSPE+A
Sbjct: 272 VASVMCSYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEEA 331

Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
               + AG+D++CG FL KHT+AAV    + E+ I +A+ N     MRLG F+GNP+ Q 
Sbjct: 332 AAKTILAGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQL 391

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
           +GK+G   VC+P HQ LA +AA+ GIVLLKNS   LPL  S   SLA+IGPNAN  KT++
Sbjct: 392 YGKLGPKDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMI 451

Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
           GNY G  C+  TPLQ L   V +T + PGC  VAC+SA +D+A  IA  AD  VL++G D
Sbjct: 452 GNYEGTPCKYTTPLQGLSAVV-STSFQPGCANVACTSAQLDEAKKIAASADATVLVVGSD 510

Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
           Q+ E E  DRVDL LPG+Q  LIT VA+A+K PVILV++ GG +DITFAK D  I SILW
Sbjct: 511 QSIEAESRDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILW 570

Query: 559 AGYPGEAGAVALAEVIFGDHNP 580
            G+PGEAG  A+A+VIFG  NP
Sbjct: 571 VGFPGEAGGAAIADVIFGSFNP 592


>gi|302811514|ref|XP_002987446.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
 gi|300144852|gb|EFJ11533.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
          Length = 772

 Score =  629 bits (1621), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 314/554 (56%), Positives = 404/554 (72%), Gaps = 9/554 (1%)

Query: 28  QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
           Q P++CD SN +   FPFC T+LPI+ R  D V+RLTL+EKISQL+N+A  IPRLG+P Y
Sbjct: 27  QAPYACDQSNATLAAFPFCNTSLPITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKY 86

Query: 88  EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
           +WW EALHGVA    G+ F G++  ATSFP  I TAASF++ L+Y IGQA+  EARA++N
Sbjct: 87  QWWQEALHGVAS-SPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHN 145

Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
            GQ+ G+TFW+PNINI+RDPRWGRGQETPGEDPL++  +A  YVRG+Q       KLK  
Sbjct: 146 LGQS-GLTFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQAGSDKLK-- 202

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
              SACCKH TAYD+DNW GT RY F+A VT QDL DTY  PF+SCV+ G  S +MC+YN
Sbjct: 203 --VSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYN 260

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
           R+NG+P+CAD  LL+ T R  W  +GYI SDCD++ + +D   YA + EDA  D L AG+
Sbjct: 261 RLNGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADALLAGL 320

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           ++NCG+FL KHT +A++QKK+ E+ I++AL  L +V+MRLGL++G+P  Q +G +GA  V
Sbjct: 321 NLNCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDV 380

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
           C+  HQ LAL+AA+ G+VLLKN  G LPL  SK  SLA++GP+AN+ + ++GNYAG  C+
Sbjct: 381 CTSEHQTLALEAARQGMVLLKN-LGALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCK 439

Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
             +PLQA Q Y +   Y PGC  VACSS S I  AV  A  AD VV+ +GLD T E E L
Sbjct: 440 YTSPLQAFQKYAQ-VSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESL 498

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  L+LPG+QQEL+++V +AAK PV++V+L  G +DI FA  D  I  ILWAGYPG+AG
Sbjct: 499 DRTSLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAG 558

Query: 567 AVALAEVIFGDHNP 580
             A+AEVIFGDHNP
Sbjct: 559 GAAIAEVIFGDHNP 572


>gi|224099193|ref|XP_002311398.1| predicted protein [Populus trichocarpa]
 gi|222851218|gb|EEE88765.1| predicted protein [Populus trichocarpa]
          Length = 755

 Score =  628 bits (1620), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 309/553 (55%), Positives = 401/553 (72%), Gaps = 10/553 (1%)

Query: 30  PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           PF+CD  N  T +  FC+  +P+  R RDL+ RLTL EKI  LVN+A A+PRLGI  YEW
Sbjct: 19  PFACDAKNGLTRSLKFCRVNMPLHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQGYEW 78

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSEALHGV+ VG G  F G   GATSFPQVI TAASF+  LW  IG+ +  EARA++N G
Sbjct: 79  WSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNKSLWEEIGRVVSDEARAMFNGG 138

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            A G+T+W+PN+N+FRDPRWGRGQETPGEDP+V GKYA SYVRG+QG++        +L+
Sbjct: 139 MA-GLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNS------GFRLK 191

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
            +ACCKH+TAYDLDNW G  RY F+ARV+ QDL DTY  PF+SCV +G+ + +MC+YN+V
Sbjct: 192 VAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKSCVVEGKVASVMCSYNQV 251

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           NG P+CAD NLL  T R +W  +GYI SDCD+V ++Y+ + Y  +PE+A    +KAG+D+
Sbjct: 252 NGKPTCADPNLLKNTIRGEWRLNGYIVSDCDSVGVLYENQHYTATPEEAAAATIKAGLDL 311

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           +CG FL  HT+ AVK   L E +++ AL N  +V+MRLGLF+G P+ QPFGK+G   VC+
Sbjct: 312 DCGPFLAIHTENAVKGGLLNEEDVNMALANTITVQMRLGLFDGEPSAQPFGKLGPRDVCT 371

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
           PAHQ LAL AAQ GIVLL+NS   LPL +  ++++A+IGP A+   T++GNYAG +C   
Sbjct: 372 PAHQQLALHAAQQGIVLLQNSGRTLPLSRP-NLTVAVIGPIADVTVTMIGNYAGVACGYT 430

Query: 450 TPLQALQNYVENTVYYPGCDTVACS-SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           TPLQ +  Y + T++  GC  VAC+ +     A   A  AD  VL+MGLDQ+ E E  DR
Sbjct: 431 TPLQGISRYAK-TIHQSGCIDVACNGNQQFGMAEAAASQADATVLVMGLDQSIEAEFRDR 489

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
            DL+LPG QQELI+RVA A++ P ILVL+ GGP+D++FAK D  IG+ILWAGYPG+AG  
Sbjct: 490 KDLLLPGYQQELISRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWAGYPGQAGGA 549

Query: 569 ALAEVIFGDHNPG 581
           A+A+V+FG  NPG
Sbjct: 550 AIADVLFGTTNPG 562


>gi|302811516|ref|XP_002987447.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
 gi|300144853|gb|EFJ11534.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
          Length = 779

 Score =  628 bits (1619), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 308/576 (53%), Positives = 393/576 (68%), Gaps = 33/576 (5%)

Query: 23  RVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
           +V + Q P++CD  N +   F FC T LP S R  DL+SR+TL EKI QLVN+A  IPRL
Sbjct: 24  QVVAGQAPYACDQRNATLLQFGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRL 83

Query: 83  GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
           G+P YEWW EALHGVA V  G+ F G   GATSFP  ILTAASFD         A+  EA
Sbjct: 84  GLPRYEWWQEALHGVA-VSPGVKFGGKFPGATSFPMPILTAASFD---------AVSTEA 133

Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
           RA++N  +A G+T+W+PN+NI+RDPRWGRGQETPGEDPL++ KYA  YVRG+Q     G 
Sbjct: 134 RAMHNYQRA-GLTYWSPNVNIYRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLGGD 192

Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
           KLK     SACCKH TAYD+DNWKGTTR+KF+A VT QDL+DTY PPF+SCV+  + S +
Sbjct: 193 KLK----VSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSV 248

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHG----------------YITSDCDAVSIIY 306
           MC+YNRVNG+P+CAD NLLS T R  W  +G                YI SDCD++   +
Sbjct: 249 MCSYNRVNGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFF 308

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMR 366
           D   YAK+ ED V D L AG++++CG FL  HT++A+   K+ E+ +++AL  L++V+MR
Sbjct: 309 DNTNYAKTAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMR 368

Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
           LGL++GNP  QP+G +G   VC+  +Q LAL AA++GIVLLKN+  +LP  KS   ++A 
Sbjct: 369 LGLYDGNPRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAA 428

Query: 427 IGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIA 485
           IGP+A + + ++GNY G  C+  TP   L  Y    VY  GC  VAC S S I  AV  A
Sbjct: 429 IGPHAKATRAMIGNYQGIPCKYTTPHDGLSAYAR-VVYSAGCSDVACYSDSLIGSAVSTA 487

Query: 486 KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
             AD VVL +GLD  QE E  DR  L+LPG+QQEL+T V +AAK P +LV+  GG VD++
Sbjct: 488 SQADAVVLFVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPAVLVIFSGGSVDVS 547

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           FAKY+  +  ILWAGYPGEAG  A+A+V+FGDHNPG
Sbjct: 548 FAKYNNKVQGILWAGYPGEAGGAAIAQVLFGDHNPG 583


>gi|115460876|ref|NP_001054038.1| Os04g0640700 [Oryza sativa Japonica Group]
 gi|38344900|emb|CAE02971.2| OSJNBb0079B02.3 [Oryza sativa Japonica Group]
 gi|113565609|dbj|BAF15952.1| Os04g0640700 [Oryza sativa Japonica Group]
 gi|116310882|emb|CAH67823.1| OSIGBa0138H21-OSIGBa0138E01.14 [Oryza sativa Indica Group]
 gi|218195682|gb|EEC78109.1| hypothetical protein OsI_17615 [Oryza sativa Indica Group]
          Length = 765

 Score =  628 bits (1619), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 308/555 (55%), Positives = 396/555 (71%), Gaps = 9/555 (1%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           P F+CD SN +   + FC  T   + RA DL+ RLTL EK+  LVN   A+PRLGIPAYE
Sbjct: 29  PVFACDASNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYE 88

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           WWSEALHGV+ VG G  F+  + GATSFPQ ILTAASF++ L+  IG+ +  EARA++N 
Sbjct: 89  WWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 148

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+  KYAV YV G+Q    + G     L
Sbjct: 149 GLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ----DAGGGSDAL 203

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           + +ACCKH+TAYD+DNWKG  RY FDA V+ QDL DT+QPPF+SCV  G  + +MC+YN+
Sbjct: 204 KVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNK 263

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG P+CAD++LLS   R  W  +GYI SDCD+V ++Y+ + Y K+PEDA    +K+G+D
Sbjct: 264 VNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLD 323

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           +NCG+FL +HT AAV+  KL ES++DRA+ N F V MRLG F+G+P   PFG +G   VC
Sbjct: 324 LNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVC 383

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           + ++Q LA +AA+ GIVLLKN+ G LPL      S+A+IGPNAN++ T++GNY G  C+ 
Sbjct: 384 TSSNQELAREAARQGIVLLKNT-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKY 442

Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQEKEEL 506
            TPLQ L   V  TVY PGC  V CS  S+    A   A  AD  VL++G DQ+ E+E L
Sbjct: 443 TTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESL 501

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  L+LPG+Q +L++ VA A++ PVILV++ GGP DI+FAK    I +ILW GYPGEAG
Sbjct: 502 DRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPGEAG 561

Query: 567 AVALAEVIFGDHNPG 581
             ALA+++FG HNPG
Sbjct: 562 GAALADILFGYHNPG 576


>gi|449479116|ref|XP_004155509.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Cucumis sativus]
          Length = 809

 Score =  627 bits (1617), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 317/562 (56%), Positives = 400/562 (71%), Gaps = 16/562 (2%)

Query: 26  STQPP--FSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
           S Q P  F+CD  +NPS   F FC ++L    R  DLV RLTL EKI  L+N+A  + RL
Sbjct: 67  SAQSPTAFACDAETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRL 126

Query: 83  GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
           GIP YEWWSEALHGV+ VG G  F+  + GATSFPQVILTAASF++ L+  IG+ +  EA
Sbjct: 127 GIPKYEWWSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEA 186

Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ----GDT 198
           RA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDPL++ KYA  YVRG+Q    GD 
Sbjct: 187 RAMYNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRDDGD- 244

Query: 199 FNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
                   +L+ +ACCKH+TAYDLDNWKGT RY F+A V+ QDL DT+QPPF+SCV  G 
Sbjct: 245 ------PDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDGN 298

Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
            + +MC+YN+VNG P+CAD +LL+   R QW  +GYI SDCD+V ++Y+++ Y KSPE+A
Sbjct: 299 VASVMCSYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEEA 358

Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
               + AG+D++CG FL KHT+AAV    + E+ I +A+ N     MRLG F+GNP+ Q 
Sbjct: 359 AAKTILAGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQL 418

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
           +GK+G   VC+P HQ LA +AA+ GIVLLKNS   LPL  S   SLA+IGPNAN  KT++
Sbjct: 419 YGKLGPKDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTMI 478

Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
           GNY G  C+  TPLQ L   V +T + PGC  VAC+SA +D+A  IA  AD  VL++G D
Sbjct: 479 GNYEGTPCKYTTPLQGLSAVV-STSFQPGCANVACTSAQLDEAKKIAASADATVLVVGSD 537

Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
           Q+ E E  DRVDL LPG+Q  LIT VA+A+K PVILV++ GG +DITFAK D  I SILW
Sbjct: 538 QSIEAESRDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSILW 597

Query: 559 AGYPGEAGAVALAEVIFGDHNP 580
            G+PGEAG  A+A+VIFG  NP
Sbjct: 598 VGFPGEAGGAAIADVIFGSFNP 619


>gi|350534908|ref|NP_001233910.1| beta-D-xylosidase 1 precursor [Solanum lycopersicum]
 gi|37359706|dbj|BAC98298.1| LEXYL1 [Solanum lycopersicum]
          Length = 770

 Score =  626 bits (1615), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 308/568 (54%), Positives = 399/568 (70%), Gaps = 7/568 (1%)

Query: 15  LCFTSLLTRVDSTQPP-FSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
           +  +SLL +V +   P F+CD  SNP+     FC  +L +  R  DLV+RLTL EKI  L
Sbjct: 18  ILLSSLLKQVLAQNSPVFACDVTSNPALGNLTFCDASLAVENRVNDLVNRLTLGEKIGFL 77

Query: 73  VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
           V+ A  + RLGIP YEWWSEALHGVA  G G+ F   + GATSFPQVILTAASF+  L+ 
Sbjct: 78  VSGAGGVSRLGIPKYEWWSEALHGVAYTGPGVHFTSLVPGATSFPQVILTAASFNVTLFQ 137

Query: 133 RIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVR 192
            IG+ +  EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP +T KY V+YV 
Sbjct: 138 TIGKVVSTEARAMYNVGLA-GLTYWSPNVNIFRDPRWGRGQETPGEDPTLTSKYGVAYVE 196

Query: 193 GVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFES 252
           G+Q  T +G     KL+ +ACCKH+TAYD+DNWKG  RY F+A V  QDL DT+QPPF S
Sbjct: 197 GLQ-QTDDGST--NKLKVAACCKHYTAYDVDNWKGIERYSFNAVVRQQDLDDTFQPPFRS 253

Query: 253 CVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYA 312
           CV +G  + +MC+YN+VNG P+C D NLL+   R +W  +GYI +DCD++ +I+ ++ Y 
Sbjct: 254 CVLEGAVASVMCSYNQVNGKPTCGDPNLLAGIVRGEWKLNGYIVTDCDSLQVIFKSQNYT 313

Query: 313 KSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNG 372
           K+PE+A    L +G+D+NCGS+L  +T+ AV QK + ES IDRA+ N F+  MRLG F+G
Sbjct: 314 KTPEEAAALGLNSGVDLNCGSWLSTYTQGAVNQKLVNESVIDRAISNNFATLMRLGFFDG 373

Query: 373 NPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN 432
           NP  + +G +G   VC+P +Q LA +AA+ GIVLLKN+ G LPL  +   SLA+IGPNAN
Sbjct: 374 NPKSRIYGNLGPKDVCTPENQELAREAARQGIVLLKNTAGSLPLTPTAIKSLAVIGPNAN 433

Query: 433 SAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
             KT++GNY G  C+  TPLQ L   V  T+Y PGC  V+C++A ID A  IA  AD VV
Sbjct: 434 VTKTMIGNYEGIPCKYTTPLQGLTASVA-TIYKPGCADVSCNTAQIDDAKQIATTADAVV 492

Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
           L+MG DQ+ EKE LDR  + LPG+Q  L+  VA+ AK PVILV++ GG +D+ FA  +  
Sbjct: 493 LVMGSDQSIEKESLDRTSITLPGQQSILVAEVAKVAKGPVILVIMSGGGMDVQFAVDNPK 552

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I SILW G+PGEAG  ALA+VIFG +NP
Sbjct: 553 ITSILWVGFPGEAGGAALADVIFGYYNP 580


>gi|225431898|ref|XP_002276351.1| PREDICTED: beta-D-xylosidase 1-like [Vitis vinifera]
          Length = 770

 Score =  626 bits (1614), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 303/553 (54%), Positives = 404/553 (73%), Gaps = 9/553 (1%)

Query: 30  PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           PF+CDP N  T   PFC+ +LPI +RARDLV RLTL EKI  LVN+A  +PRLGI  YEW
Sbjct: 26  PFACDPRNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKGYEW 85

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSEALHGV+ VG G  F G+  GATSFPQVI TAASF++ LW  IG+ +  EARA+YN G
Sbjct: 86  WSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGG 145

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            A G+T+W+PN+NIFRDPRWGRGQETPGEDP V  KYA +YVRG+QG+       + +L+
Sbjct: 146 MA-GLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNA------RDRLK 198

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
            +ACCKH+TAYDLD+W G  R+ F+ARV+ QDL DTY  PF++CV +G  + +MC+YN+V
Sbjct: 199 VAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQV 258

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           NG P+CAD +LL  T R +W  +GYI SDCD+V + YD + Y  +PE+A    +KAG+D+
Sbjct: 259 NGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTATPEEAAAVAIKAGLDL 318

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           +CG FL  HT+AA++  KL E++++ AL N  SV+MRLG+F+G P+ QP+G +G   VC+
Sbjct: 319 DCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCT 378

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
           PAHQ LAL+AA+ GIVL++N    LPL  S+  ++A+IGPN++  +T++GNYAG +C   
Sbjct: 379 PAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYT 438

Query: 450 TPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           TPLQ +  Y   T++  GC  VAC        AV  A+ AD  VL+MGLDQ+ E E  DR
Sbjct: 439 TPLQGIGRYAR-TIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDR 497

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
           VD++LPGRQQEL+++VA A++ P +LVL+ GGP+D++FAK D  I +I+W GYPG+AG  
Sbjct: 498 VDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGT 557

Query: 569 ALAEVIFGDHNPG 581
           A+A+V+FG  NPG
Sbjct: 558 AIADVLFGRTNPG 570


>gi|302796583|ref|XP_002980053.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
 gi|300152280|gb|EFJ18923.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
          Length = 772

 Score =  625 bits (1612), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 313/554 (56%), Positives = 403/554 (72%), Gaps = 9/554 (1%)

Query: 28  QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
           Q P++CD SN +   FPFC T+L I+ R  D V+RLTL+EKISQL+N+A  IPRLG+P Y
Sbjct: 27  QAPYACDQSNATLAAFPFCNTSLAITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKY 86

Query: 88  EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
           +WW EALHGVA    G+ F G++  ATSFP  I TAASF++ L+Y IGQA+  EARA++N
Sbjct: 87  QWWQEALHGVAS-SPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHN 145

Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
            GQ+ G+TFW+PNINI+RDPRWGRGQETPGEDPL++  +A  YVRG+Q       KLK  
Sbjct: 146 LGQS-GLTFWSPNINIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQAGSDKLK-- 202

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
              SACCKH TAYD+DNW GT RY F+A VT QDL DTY  PF+SCV+ G  S +MC+YN
Sbjct: 203 --VSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYN 260

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
           R+NG+P+CAD  LL+ T R  W  +GYI SDCD++ + +D   YA + EDA  D L AG+
Sbjct: 261 RLNGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAADALLAGL 320

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           ++NCG+FL KHT +A++QKK+ E+ I++AL  L +V+MRLGL++G+P  Q +G +GA  V
Sbjct: 321 NLNCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDV 380

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
           C+  HQ LAL+AA+ G+VLLKN  G LPL  SK  SLA++GP+AN+ + ++GNYAG  C+
Sbjct: 381 CTSEHQTLALEAARQGMVLLKN-LGALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCK 439

Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
             +PLQA Q Y +   Y PGC  VACSS S I  AV  A  AD VV+ +GLD T E E L
Sbjct: 440 YTSPLQAFQKYAQ-VSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESL 498

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  L+LPG+QQEL+++V +AAK PV++V+L  G +DI FA  D  I  ILWAGYPG+AG
Sbjct: 499 DRTSLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAG 558

Query: 567 AVALAEVIFGDHNP 580
             A+AEVIFGDHNP
Sbjct: 559 GAAIAEVIFGDHNP 572


>gi|296083274|emb|CBI22910.3| unnamed protein product [Vitis vinifera]
          Length = 738

 Score =  625 bits (1612), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 303/553 (54%), Positives = 404/553 (73%), Gaps = 9/553 (1%)

Query: 30  PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           PF+CDP N  T   PFC+ +LPI +RARDLV RLTL EKI  LVN+A  +PRLGI  YEW
Sbjct: 26  PFACDPRNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKGYEW 85

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSEALHGV+ VG G  F G+  GATSFPQVI TAASF++ LW  IG+ +  EARA+YN G
Sbjct: 86  WSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGG 145

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            A G+T+W+PN+NIFRDPRWGRGQETPGEDP V  KYA +YVRG+QG+       + +L+
Sbjct: 146 MA-GLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQGNA------RDRLK 198

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
            +ACCKH+TAYDLD+W G  R+ F+ARV+ QDL DTY  PF++CV +G  + +MC+YN+V
Sbjct: 199 VAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVMCSYNQV 258

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           NG P+CAD +LL  T R +W  +GYI SDCD+V + YD + Y  +PE+A    +KAG+D+
Sbjct: 259 NGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTATPEEAAAVAIKAGLDL 318

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           +CG FL  HT+AA++  KL E++++ AL N  SV+MRLG+F+G P+ QP+G +G   VC+
Sbjct: 319 DCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNLGPRDVCT 378

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
           PAHQ LAL+AA+ GIVL++N    LPL  S+  ++A+IGPN++  +T++GNYAG +C   
Sbjct: 379 PAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYAGVACGYT 438

Query: 450 TPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           TPLQ +  Y   T++  GC  VAC        AV  A+ AD  VL+MGLDQ+ E E  DR
Sbjct: 439 TPLQGIGRYAR-TIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIEAEFRDR 497

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
           VD++LPGRQQEL+++VA A++ P +LVL+ GGP+D++FAK D  I +I+W GYPG+AG  
Sbjct: 498 VDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYPGQAGGT 557

Query: 569 ALAEVIFGDHNPG 581
           A+A+V+FG  NPG
Sbjct: 558 AIADVLFGRTNPG 570


>gi|242077366|ref|XP_002448619.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
 gi|241939802|gb|EES12947.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
          Length = 767

 Score =  625 bits (1611), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 310/554 (55%), Positives = 398/554 (71%), Gaps = 9/554 (1%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           P F+CD SN +  ++ FC  +   S RA DLVSRLTL EK+  LV+   A+PRLGIP YE
Sbjct: 32  PVFACDASNATLASYGFCNRSASASARAADLVSRLTLAEKVGFLVDKQAALPRLGIPLYE 91

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           WWSEALHGV+ VG G  F+  +  ATSFPQ ILTAASF++ L+  IG+ +  EARA++N 
Sbjct: 92  WWSEALHGVSYVGPGTRFSSLVPAATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNV 151

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+T KYAV YV G+Q    + G   G L
Sbjct: 152 GLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQ----DAGSGSGSL 206

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           + +ACCKH+TAYD+DNWKG  RY F+A V+ QDL DT+QPPF+SCV  G  + +MC+YN+
Sbjct: 207 KVAACCKHYTAYDVDNWKGVERYTFNAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQ 266

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG P+CAD++LLS   R  W  +GYI+SDCD+V ++Y+ + Y K+PEDA    +KAG+D
Sbjct: 267 VNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLD 326

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           +NCG+FL +HT AAV+  KL ES++DRA+ N F   MRLG F+G+P   PFG +G   VC
Sbjct: 327 LNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFITLMRLGFFDGDPRKLPFGNLGPSDVC 386

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           + ++Q LA +AA+ GIVLLKNS G LPL  S   SLA+IGPNAN++ T++GNY G  C+ 
Sbjct: 387 TSSNQELAREAARQGIVLLKNS-GALPLSASSIKSLAVIGPNANASFTMIGNYEGTPCKY 445

Query: 449 ITPLQALQNYVENTVYYPGCDTVACS--SASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
            TPLQ L   V  TVY PGC  V CS  S  +D A   A  AD  VL++G DQ+ E+E L
Sbjct: 446 TTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESL 504

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  L+LPG+Q +L++ VA A++ P ILV++ GGP DI+FAK    I +ILW GYPGEAG
Sbjct: 505 DRTSLLLPGQQPQLVSAVANASRGPCILVIMSGGPFDISFAKSSDKIAAILWVGYPGEAG 564

Query: 567 AVALAEVIFGDHNP 580
             A+A+V+FG HNP
Sbjct: 565 GAAIADVLFGHHNP 578


>gi|326494302|dbj|BAJ90420.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326521150|dbj|BAJ96778.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326527851|dbj|BAK08165.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 775

 Score =  624 bits (1609), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 308/558 (55%), Positives = 397/558 (71%), Gaps = 6/558 (1%)

Query: 25  DSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGI 84
           ++  P F+CD SN +   + FC      S RARDLVSRLTL EK+  LVN  PA+ RLGI
Sbjct: 31  EAQAPVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGI 90

Query: 85  PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
           PAYEWWSEALHGV+ VG G  F+  + GATSFPQ ILTAASF++ L+  IG+ +  EARA
Sbjct: 91  PAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARA 150

Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
           ++N G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+  KYAV YV G+Q D   GG  
Sbjct: 151 MHNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ-DAGAGGVT 208

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
            G L+ +ACCKH+TAYD+DNWKG  RY FDA+V+ QDL DT+QPPF+SCV  G  + +MC
Sbjct: 209 DGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMC 268

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
           +YN+VNG P+CAD++LL    R  W  +GYI SDCD+V ++Y  + Y K+PE+A    +K
Sbjct: 269 SYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIK 328

Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
           +G+D+NCG+FL +HT AAV+  +L E ++DRA+ N F + MRLG F+G+P    FG +G 
Sbjct: 329 SGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGP 388

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
             VC+ +++ LA + A+ GIVLLKNS G LPL      S+A+IGPNAN++ T++GNY G 
Sbjct: 389 KDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGT 447

Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQE 502
            C+  TPLQ L   V NTVY PGC  V CS  S+    AV  A  AD  VL++G DQ+ E
Sbjct: 448 PCKYTTPLQGLGAKV-NTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIE 506

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
           +E LDR  L+LPG+Q +L++ VA A+  PVILV++ GGP DI+FAK    I +ILW GYP
Sbjct: 507 RESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYP 566

Query: 563 GEAGAVALAEVIFGDHNP 580
           GEAG  ALA+++FG HNP
Sbjct: 567 GEAGGAALADILFGSHNP 584


>gi|326492918|dbj|BAJ90315.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 775

 Score =  624 bits (1609), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 308/558 (55%), Positives = 397/558 (71%), Gaps = 6/558 (1%)

Query: 25  DSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGI 84
           ++  P F+CD SN +   + FC      S RARDLVSRLTL EK+  LVN  PA+ RLGI
Sbjct: 31  EAQAPVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGI 90

Query: 85  PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
           PAYEWWSEALHGV+ VG G  F+  + GATSFPQ ILTAASF++ L+  IG+ +  EARA
Sbjct: 91  PAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARA 150

Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
           ++N G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+  KYAV YV G+Q D   GG  
Sbjct: 151 MHNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ-DAGAGGVT 208

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
            G L+ +ACCKH+TAYD+DNWKG  RY FDA+V+ QDL DT+QPPF+SCV  G  + +MC
Sbjct: 209 DGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMC 268

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
           +YN+VNG P+CAD++LL    R  W  +GYI SDCD+V ++Y  + Y K+PE+A    +K
Sbjct: 269 SYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIK 328

Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
           +G+D+NCG+FL +HT AAV+  +L E ++DRA+ N F + MRLG F+G+P    FG +G 
Sbjct: 329 SGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGP 388

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
             VC+ +++ LA + A+ GIVLLKNS G LPL      S+A+IGPNAN++ T++GNY G 
Sbjct: 389 KDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGT 447

Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQE 502
            C+  TPLQ L   V NTVY PGC  V CS  S+    AV  A  AD  VL++G DQ+ E
Sbjct: 448 PCKYTTPLQGLGAKV-NTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIE 506

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
           +E LDR  L+LPG+Q +L++ VA A+  PVILV++ GGP DI+FAK    I +ILW GYP
Sbjct: 507 RESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYP 566

Query: 563 GEAGAVALAEVIFGDHNP 580
           GEAG  ALA+++FG HNP
Sbjct: 567 GEAGGAALADILFGSHNP 584


>gi|413919688|gb|AFW59620.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 773

 Score =  623 bits (1606), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 308/554 (55%), Positives = 397/554 (71%), Gaps = 9/554 (1%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           P F+CD SN +  ++ FC  +   + RA DLVSRLTL EK+  LV+   A+PRLG+P YE
Sbjct: 38  PAFACDASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYE 97

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           WWSEALHGV+ VG G  F+  + GATSFPQ ILTAASF++ L+  IG+ +  EARA++N 
Sbjct: 98  WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNV 157

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+T KYAV YV G+QG     G LK   
Sbjct: 158 GLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGAGALK--- 213

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
             +ACCKH+TAYD+DNWKG  RY FDA V+ QDL DT+QPPF+SCV  G  + +MC+YN+
Sbjct: 214 -VAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQ 272

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG P+CAD++LLS   R  W  +GYI+SDCD+V ++Y+ + Y K+PEDA    +KAG+D
Sbjct: 273 VNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLD 332

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           +NCG+FL +HT AAV+  KL ES++DRA+ N     MRLG F+G+P   PFG +G   VC
Sbjct: 333 LNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVC 392

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           +P++Q LA +AA+ GIVLLKN+ G LPL  +   S+A+IGPNAN++ T++GNY G  C+ 
Sbjct: 393 TPSNQELAREAARQGIVLLKNT-GKLPLSATSIKSMAVIGPNANASFTMIGNYEGTPCKY 451

Query: 449 ITPLQALQNYVENTVYYPGCDTVACS--SASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
            TPLQ L   V  TVY PGC  V CS  S  +D A   A  AD  VL++G DQ+ E+E L
Sbjct: 452 TTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIERESL 510

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  L+LPG+Q +L++ VA A+  P ILV++ GGP DI+FAK    I +ILW GYPGEAG
Sbjct: 511 DRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPGEAG 570

Query: 567 AVALAEVIFGDHNP 580
             A+A+V+FG HNP
Sbjct: 571 GAAIADVLFGYHNP 584


>gi|157041199|dbj|BAF79669.1| beta-D-xylosidase [Pyrus pyrifolia]
          Length = 774

 Score =  622 bits (1604), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 297/558 (53%), Positives = 395/558 (70%), Gaps = 9/558 (1%)

Query: 24  VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
           V   +PPF+CDP NP T T  FC+  +PI  R +DL+ RLTL EKI  LVN+A A+PRLG
Sbjct: 25  VVHARPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLG 84

Query: 84  IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
           I  YEWWSEALHGV+ VG G  F GT  GATSFPQVI TAASF+  LW  IG+ +  EAR
Sbjct: 85  IQGYEWWSEALHGVSNVGPGTKF-GTFLGATSFPQVITTAASFNESLWEEIGRVVSDEAR 143

Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
           A+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP++  KY   YV+G+QGD      
Sbjct: 144 AMYNGGAA-GLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQGD-----G 197

Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
              +L+ +ACCKH+TAYDLDNW G  R+ F+ARV+ QDL DTY  PF++CV  G  + +M
Sbjct: 198 AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVM 257

Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
           C+YN+VNG P+CAD +LL  T R QW  +GYI SDCD+V + YD + Y K+PE A    +
Sbjct: 258 CSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEAAAAYAI 317

Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
           KAG+D++CG FL  HT+AA++  ++ E +I+ AL N  +V+MRLG+F+G P+ Q +G +G
Sbjct: 318 KAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLG 377

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
              VC P+   LAL+AA+ GIVLL+N    LPL   +  ++A+IGPN++  +T++GNYAG
Sbjct: 378 LADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAG 437

Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI-AKGADHVVLMMGLDQTQE 502
            +C   TPLQ +  Y   T++  GC  V C+   +  A ++ A+ AD  VL++GLDQ+ E
Sbjct: 438 IACGYTTPLQGIARYTR-TIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIE 496

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E  DR  L+LPG QQEL++RVA A++ P ILV++ GGP+D+TFAK D  IG+I+W GYP
Sbjct: 497 AEFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIWVGYP 556

Query: 563 GEAGAVALAEVIFGDHNP 580
           G+AG  A+A+V+FG  NP
Sbjct: 557 GQAGGTAIADVLFGTTNP 574


>gi|65736613|dbj|BAD98523.1| alpha-L-arabinofuranosidase / beta-D-xylosidase [Pyrus pyrifolia]
          Length = 774

 Score =  621 bits (1601), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 297/558 (53%), Positives = 395/558 (70%), Gaps = 9/558 (1%)

Query: 24  VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
           V   +PPF+CDP NP T T  FC+  +PI  R +DL+ RLTL EKI  LVN+A A+PRLG
Sbjct: 25  VVHARPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLG 84

Query: 84  IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
           I  YEWWSEALHGV+ VG G  F GT  GATSFPQVI TAASF+  LW  IG+ +  EAR
Sbjct: 85  IQGYEWWSEALHGVSNVGPGTKF-GTFLGATSFPQVITTAASFNESLWEEIGRVVSDEAR 143

Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
           A+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP++  KY   YV+G+QGD      
Sbjct: 144 AMYNGGAA-GLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQGD-----G 197

Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
              +L+ +ACCKH+TAYDLDNW G  R+ F+ARV+ QDL DTY  PF++CV  G  + +M
Sbjct: 198 AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNVASVM 257

Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
           C+YN+VNG P+CAD +LL  T R QW  +GYI SDCD+V + YD + Y K+PE A    +
Sbjct: 258 CSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHYTKTPEAAAAYAI 317

Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
           KAG+D++CG FL  HT+AA++  ++ E +I+ AL N  +V+MRLG+F+G P+ Q +G +G
Sbjct: 318 KAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLG 377

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
              VC P+   LAL+AA+ GIVLL+N    LPL   +  ++A+IGPN++  +T++GNYAG
Sbjct: 378 LADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAG 437

Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI-AKGADHVVLMMGLDQTQE 502
            +C   TPLQ +  Y   T++  GC  V C+   +  A ++ A+ AD  VL++GLDQ+ E
Sbjct: 438 IACGYTTPLQGIARYTR-TIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIE 496

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E  DR  L+LPG QQEL++RVA A++ P ILV++ GGP+D+TFAK D  IG+I+W GYP
Sbjct: 497 AEFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPCIGAIIWVGYP 556

Query: 563 GEAGAVALAEVIFGDHNP 580
           G+AG  A+A+V+FG  NP
Sbjct: 557 GQAGGTAIADVLFGTTNP 574


>gi|356531391|ref|XP_003534261.1| PREDICTED: probable beta-D-xylosidase 6-like [Glycine max]
          Length = 780

 Score =  620 bits (1600), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 313/546 (57%), Positives = 394/546 (72%), Gaps = 8/546 (1%)

Query: 40  TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
           +++ PFC T+LP   RAR LVS LTL EKI  L N+A +IPRLGIPAY+WWSE+LHG+A 
Sbjct: 36  SQSHPFCDTSLPTLTRARSLVSLLTLPEKILLLSNNASSIPRLGIPAYQWWSESLHGLAL 95

Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAP 159
            G G+ F G +  ATSFPQVIL+AASF+  LW R   AI  EARA++N GQA G+TFWAP
Sbjct: 96  NGPGVSFAGAVPSATSFPQVILSAASFNRSLWLRTAAAIAREARAMFNVGQA-GLTFWAP 154

Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG----DTFNGGKLKGKLQASACCK 215
           NIN+FRDPRWGRGQETPGEDP++   YAV YVRG+QG               L  SACCK
Sbjct: 155 NINLFRDPRWGRGQETPGEDPMLASAYAVEYVRGLQGLSGIQDAVVVDDDDTLMVSACCK 214

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           HFTAYDLD W   +RY F+A V+ QDL DTYQPPF SC++QG+AS +MC+YN VNG+P+C
Sbjct: 215 HFTAYDLDMWGQFSRYNFNAVVSQQDLEDTYQPPFRSCIQQGKASCLMCSYNEVNGVPAC 274

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
           A   LL   AR +WGF GYITSDCDAV+ +Y+ + YAKS EDAV DVLKAGMD+NCG+F+
Sbjct: 275 ASEELLG-LARDKWGFKGYITSDCDAVATVYEYQKYAKSQEDAVADVLKAGMDINCGTFM 333

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
            +HT++A++Q K+ E ++DRAL NLFSV++RLGLF+G+P    FGK+G   VC+  H+ L
Sbjct: 334 LRHTESAIEQGKVKEEDLDRALLNLFSVQLRLGLFDGDPIRGRFGKLGPKDVCTQEHKTL 393

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           AL AA+ GIVLLKN    LPL +    SLA+IGP A + K L G Y+G  C S +  + L
Sbjct: 394 ALDAARQGIVLLKNDKKFLPLDRDIGASLAVIGPLATTTK-LGGGYSGIPCSSSSLYEGL 452

Query: 456 QNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
             + E   Y  GC  V C S     +A+D AK AD VV++ GLD TQE E+ DRV L+LP
Sbjct: 453 GEFAERISYAFGCYDVPCDSDDGFAEAIDTAKQADFVVIVAGLDATQETEDHDRVSLLLP 512

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           G+Q  L++ VA+A+K PVILVL+ GGP+D++FA+ +  I SI+W GYPGEAG  ALAE+I
Sbjct: 513 GKQMNLVSSVADASKNPVILVLIGGGPLDVSFAEKNPQIASIIWLGYPGEAGGKALAEII 572

Query: 575 FGDHNP 580
           FG+ NP
Sbjct: 573 FGEFNP 578


>gi|224070626|ref|XP_002303181.1| predicted protein [Populus trichocarpa]
 gi|222840613|gb|EEE78160.1| predicted protein [Populus trichocarpa]
          Length = 773

 Score =  619 bits (1595), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 315/578 (54%), Positives = 407/578 (70%), Gaps = 15/578 (2%)

Query: 9   VFPLLCLCFTSLLTRVDS-TQPPFSCDP-SNPSTETFPFCKTTLPISQRARDLVSRLTLD 66
           VF L C+    L T V + + P F+CD  SNPS  +  FC T++ I+ R  DLV RLTL 
Sbjct: 15  VFLLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGINDRVVDLVKRLTLQ 74

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
           EKI  LVNSA  + RLGIP YEWWSEALHGV+ VG G  F+  + GATSFPQVILTAASF
Sbjct: 75  EKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVILTAASF 134

Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
           ++ L+  IG+ +  EARA+YN G A G+TFW+PNINIFRDPRWGRGQETPGEDPL++ KY
Sbjct: 135 NTSLFEAIGKVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKY 193

Query: 187 AVSYVRGVQ----GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
              YV+G+Q    GD         KL+ +ACCKH+TAYDLDNWKG+ RY F+A VT QD+
Sbjct: 194 GSCYVKGLQQRDDGD-------PDKLKVAACCKHYTAYDLDNWKGSDRYHFNAVVTKQDM 246

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
            DT+QPPF+SCV  G  + +MC+YN+VNG P+CAD +LLS   R +W  +GYI +DCD++
Sbjct: 247 DDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWNLNGYIVTDCDSL 306

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
            + Y ++ Y K+PE+A    + AG+D+NCGSFL +HT+AAVK   + E  ID A+ N F+
Sbjct: 307 DVFYKSQNYTKTPEEAAAAAILAGVDLNCGSFLGQHTEAAVKGGLVNEHAIDIAVSNNFA 366

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
             MRLG F+G+P+ Q +GK+G   VC+  +Q LA +AA+ GIVLLKN+ G LPL  +   
Sbjct: 367 TLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIK 426

Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAV 482
           +LA+IGPNAN  KT++GNY G  C+  TPLQ L   V  T Y PGC  VACS+A +D A 
Sbjct: 427 NLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAASVA-TTYLPGCSNVACSTAQVDDAK 485

Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
            +A  AD  VL+MG D + E E  DRVD++LPG+QQ LIT VA  +  PVILV++ GG +
Sbjct: 486 KLAAAADATVLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANVSCGPVILVIMSGGGM 545

Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           D++FA+ +  I SILW GYPGEAG  A+A++IFG +NP
Sbjct: 546 DVSFARTNDKITSILWVGYPGEAGGAAIADIIFGYYNP 583


>gi|371917280|dbj|BAL44716.1| SlArf/Xyl1 [Solanum lycopersicum]
          Length = 771

 Score =  618 bits (1593), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 298/571 (52%), Positives = 396/571 (69%), Gaps = 8/571 (1%)

Query: 12  LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
            + + F         ++ PF+CDP+N       FCKT+LPI  R +DL++RLTL EKI  
Sbjct: 6   FILIIFVLAFAYSGESRQPFACDPANAGIRNLRFCKTSLPIHVRVQDLIARLTLQEKIRL 65

Query: 72  LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
           LVN+A  + RLGI  YEWWSEALHGV+  G G+ F G   GATSFPQVI TAASF++ LW
Sbjct: 66  LVNNAAPVQRLGISGYEWWSEALHGVSNTGYGVKFGGAFPGATSFPQVITTAASFNASLW 125

Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
             IG+ +  E RA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP +  +Y VSYV
Sbjct: 126 EEIGRVVSEEGRAMYNGGAA-GLTFWSPNVNIFRDPRWGRGQETPGEDPHLVAQYGVSYV 184

Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
           +G+QG      +   +L+ +ACCKH+TAYDLD+W G  RY F+A+V+MQDL DTY  PF+
Sbjct: 185 KGLQGGGG---RGNTRLKVAACCKHYTAYDLDDWNGYDRYHFNAKVSMQDLEDTYNAPFK 241

Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
           +CV +G  + +MC+YN++NG PSCAD  LL  T R QW  +GYI SDCD+V ++++ + Y
Sbjct: 242 ACVVEGNVASVMCSYNQINGKPSCADPTLLRDTIRNQWHLNGYIVSDCDSVGVLFEKQHY 301

Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
            + PEDA    +KAG+D++CG FL  HT  AV   K+ + EI+ AL N  +V+MRLG+F+
Sbjct: 302 TRYPEDAAAITIKAGLDLDCGPFLAIHTDKAVHTGKVSQVEINNALANTITVQMRLGMFD 361

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
           G     P+  +G   VCSPAHQ LALQAA++GIVLLKN    LPL   +  ++A+IGPN+
Sbjct: 362 G--PNGPYANLGPKDVCSPAHQQLALQAAREGIVLLKNIGQALPLSTKRHRTVAVIGPNS 419

Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADH 490
           ++   ++GNYAG  C  I+PLQ +  Y   T++  GC  VAC  + +   A   A+ AD 
Sbjct: 420 DATLAMIGNYAGVPCGYISPLQGISRYAR-TIHQQGCMGVACPGNQNFGLAEVAARHADA 478

Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
            VL+MGLDQ+ E E  DRV L+LPG QQ+LI+RVA A+K PV+LVL+ GGP+D+TFAK D
Sbjct: 479 TVLVMGLDQSIEAEAKDRVTLLLPGHQQDLISRVAMASKGPVVLVLMSGGPIDVTFAKND 538

Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             + SI+W GYPG+AG  A+A+V+FG  NPG
Sbjct: 539 PRVSSIVWVGYPGQAGGAAIADVLFGATNPG 569


>gi|18025340|gb|AAK38481.1| alpha-L-arabinofuranosidase/beta-D-xylosidase isoenzyme ARA-I
           [Hordeum vulgare]
          Length = 777

 Score =  617 bits (1592), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 306/554 (55%), Positives = 392/554 (70%), Gaps = 6/554 (1%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           P F+CD SN +   + FC      S RARDLVSRLTL EK+  LVN  PA+ RLGIPAYE
Sbjct: 37  PVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYE 96

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           WWSEALHGV+ VG G  F+  + GATSFPQ ILTAASF++ L+  IG+ +  EARA++N 
Sbjct: 97  WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 156

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+  KYAV YV G+Q D   GG   G L
Sbjct: 157 GLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ-DAGAGGVTDGAL 214

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           + +ACCKH+TAYD+DNWKG  RY FDA+V+ QDL DT+QPPF+SCV  G  + +MC+YN+
Sbjct: 215 KVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNK 274

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG P+CAD++LL    R  W  +GYI SDCD+V ++Y  + Y K+PE+A    +K+G+D
Sbjct: 275 VNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGVD 334

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           +NCG+FL +HT AAV+  +L E ++DRA+ N F + MRLG F+G+P    FG +G   VC
Sbjct: 335 LNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVC 394

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           + +++ LA + A+ GIVLLKNS G LPL      S+A+IGPNAN++ T++GNY G  C+ 
Sbjct: 395 TSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKY 453

Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQEKEEL 506
            TPLQ L   V NTVY PGC  V CS  S+    AV  A  AD  VL++G DQ+ E+E L
Sbjct: 454 TTPLQGLGAKV-NTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESL 512

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  L+LPG+Q +L++ VA A+  PVILV++ GGP DI+FAK    I + LW GYPGEAG
Sbjct: 513 DRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAATLWVGYPGEAG 572

Query: 567 AVALAEVIFGDHNP 580
             AL + +FG HNP
Sbjct: 573 GAALDDTLFGSHNP 586


>gi|222615852|gb|EEE51984.1| hypothetical protein OsJ_33664 [Oryza sativa Japonica Group]
          Length = 753

 Score =  617 bits (1591), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 305/539 (56%), Positives = 385/539 (71%), Gaps = 24/539 (4%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           FC   L   QRA DLV+ LTL EK+SQL + A  + RLG+PAYEWWSE LHG++  G+GI
Sbjct: 31  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
            FNGT+R  TSFPQVILTAA+FD+ LW R+G+A+G EARALYN GQA G+T W+PN+NIF
Sbjct: 91  RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRG-VQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           RDP   R    PG+             RG   G+   GG      +ASACCKH TAYDLD
Sbjct: 151 RDPSGTR----PGD-----------ARRGPRHGEQGIGG------EASACCKHATAYDLD 189

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
            W    RY +D++VT+QDL DTY PPF+SCV +G+A+ IMC YN +NG+P+CA  +LL+K
Sbjct: 190 YWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTK 249

Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
             R++WG +GY+ SDCDAV+ I DA  Y  SPED V   +K GMDVNCG++ Q H  AAV
Sbjct: 250 KVRQEWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAV 309

Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGADVVCSPAHQVLALQAAQD 402
           ++  L E +IDRAL NLF+VRMRLG F+G+P     +G +GA  VCSPAH+ LAL+AAQD
Sbjct: 310 QKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQD 369

Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV-EN 461
           GIVLLKN  G LPL  S   SLA+IGPNA++   L GNY GP C + TPLQ ++ Y+ + 
Sbjct: 370 GIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDR 429

Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
             +  GCD+ AC+  + ++A  +A  +DHVVL MGL Q QE++ LDR  L+LPG QQ LI
Sbjct: 430 ARFLAGCDSPACAVDATNEAAALASSSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLI 489

Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           T VA AA++PVILVLL GGPVD+TFAK +  IG+ILWAGYPG+AG +A+A+V+FGDHNP
Sbjct: 490 TAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNP 548


>gi|115459584|ref|NP_001053392.1| Os04g0530700 [Oryza sativa Japonica Group]
 gi|38346629|emb|CAD41212.2| OSJNBa0074L08.23 [Oryza sativa Japonica Group]
 gi|38346760|emb|CAE03865.2| OSJNBa0081C01.11 [Oryza sativa Japonica Group]
 gi|113564963|dbj|BAF15306.1| Os04g0530700 [Oryza sativa Japonica Group]
 gi|218195263|gb|EEC77690.1| hypothetical protein OsI_16749 [Oryza sativa Indica Group]
          Length = 770

 Score =  617 bits (1590), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 301/548 (54%), Positives = 399/548 (72%), Gaps = 8/548 (1%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           ++P+   +PFC  TLP   RAR LVS LTLDEKI+QL N+A   PRLG+P +EWWSE+LH
Sbjct: 31  ASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLH 90

Query: 96  GVAGVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
           GV   G G+ F +G +R AT FPQVIL+AA+F+  LW    +AI +EARA++NAGQA G+
Sbjct: 91  GVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQA-GL 149

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           TFWAPNIN+FRDPRWGRGQETPGEDP V   Y+V YV+G Q D       +G++  SACC
Sbjct: 150 TFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGE----EGRMMLSACC 205

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KH+ AYDL+ W+G TRY F+A+V  QD+ DTYQPPF+SC+++GRAS +MC+YN+VNG+P+
Sbjct: 206 KHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNGVPA 265

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           CA +++L + AR +WGF GYITSDCDAV+II++ + Y  S ED++  VLKAGMD+NCGSF
Sbjct: 266 CARKDILQR-ARDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDINCGSF 324

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
           L +HTK+A+++ K+ E +I+ AL NLFSV++RLG F+     Q F ++G + VC+  H+ 
Sbjct: 325 LIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRE 384

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LA +A + G VLLKN +G LPL +S+   +ALIGP AN    L G+Y G  C S T ++ 
Sbjct: 385 LAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKG 444

Query: 455 LQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           +Q YV  T +  GC  V C+S     +A++ AK AD VVL+ GL+ T+E E+ DRV L+L
Sbjct: 445 MQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLL 504

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PGRQ +LI  VA   KKPV+LVL+ GGPVD++FAK+D  I SILW GYPGE G   L E+
Sbjct: 505 PGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEI 564

Query: 574 IFGDHNPG 581
           +FG +NPG
Sbjct: 565 LFGKYNPG 572


>gi|292630923|sp|A5JTQ3.1|XYL2_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 2;
           AltName: Full=Xylan
           1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 2;
           Short=MsXyl2; Includes: RecName: Full=Beta-xylosidase;
           AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
           Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
           Full=Alpha-N-arabinofuranosidase; AltName:
           Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
           Flags: Precursor
 gi|146762263|gb|ABQ45228.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
           varia]
          Length = 774

 Score =  615 bits (1585), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 300/577 (51%), Positives = 401/577 (69%), Gaps = 10/577 (1%)

Query: 9   VFPLLCLCFTSLLT---RVDS-TQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRL 63
           VF    + F +LL    RV S T   F+CD   NP+   + FC   L +  R +DLV RL
Sbjct: 13  VFLCFFVLFATLLLSGGRVSSQTSAVFACDVAKNPALANYGFCNKKLSVDARVKDLVRRL 72

Query: 64  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
           TL EK+  LVNSA  + RLGIP YEWWSEALHGV+ +G G  F+  I GATSFP  IL A
Sbjct: 73  TLQEKVGNLVNSAVDVSRLGIPKYEWWSEALHGVSNIGPGTHFSNVIPGATSFPMPILIA 132

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
           ASF++ L+  IG+ +  EARA++N G A G+T+W+PNINIFRDPRWGRGQETPGEDPL+ 
Sbjct: 133 ASFNASLFQTIGKVVSTEARAMHNVGLA-GLTYWSPNINIFRDPRWGRGQETPGEDPLLA 191

Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
            KYA  YV+G+Q  T +G     KL+ +ACCKH+TAYD+D+WKG  RY F+A VT QDL 
Sbjct: 192 SKYAAGYVKGLQ-QTDDGDS--NKLKVAACCKHYTAYDVDDWKGVQRYTFNAVVTQQDLD 248

Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
           DTYQPPF+SCV  G  + +MC+YN+VNG P+CAD +LL    R +W  +GYI SDCD+V 
Sbjct: 249 DTYQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVD 308

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
           +++  + Y K+PE+A    + AG+D+NCGSFL ++T+ AVKQ  + E+ I+ A++N F+ 
Sbjct: 309 VLFKNQHYTKTPEEAAAKSILAGLDLNCGSFLGRYTEGAVKQGLIGEASINNAVYNNFAT 368

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
            MRLG F+G+P+ QP+G +G   VC+ A+Q LA +AA+ GIVLLKN  G LPL      S
Sbjct: 369 LMRLGFFDGDPSKQPYGNLGPKDVCTSANQELAREAARQGIVLLKNCAGSLPLNAKAIKS 428

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
           LA+IGPNAN+ + ++GNY G  C+  +PLQ L   V  T +  GC  V C++A++D A  
Sbjct: 429 LAVIGPNANATRAMIGNYEGIPCKYTSPLQGLTALVP-TSFAAGCPDVQCTNAALDDAKK 487

Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
           IA  AD  V+++G +   E E  DR++++LPG+QQ+L+T VA  AK PVIL ++ GG +D
Sbjct: 488 IAASADATVIVVGANLAIEAESHDRINILLPGQQQQLVTEVANVAKGPVILAIMSGGGMD 547

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           ++FAK ++ I SILW GYPGEAG  A+A+VIFG HNP
Sbjct: 548 VSFAKTNKKITSILWVGYPGEAGGAAIADVIFGYHNP 584


>gi|356574315|ref|XP_003555294.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 5-like
           [Glycine max]
          Length = 901

 Score =  615 bits (1585), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 302/543 (55%), Positives = 385/543 (70%), Gaps = 3/543 (0%)

Query: 40  TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
           T  FPFC T+L    RA+DLVSRLTL EK  QLVN +  I RLG+PAYEWWSEALHGV+ 
Sbjct: 31  TSNFPFCDTSLSYEDRAKDLVSRLTLQEKTQQLVNPSAGISRLGVPAYEWWSEALHGVSN 90

Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAP 159
           +G G  F+  + GATSFP VIL+AASF++ LW ++GQ +  EARA+YN   A G+TFW+P
Sbjct: 91  LGPGTRFDKKVPGATSFPAVILSAASFNASLWQKMGQVVSTEARAMYNVDLA-GLTFWSP 149

Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
           N+N+FRDPRWGRGQETPGEDPLV  +YAV Y+RG+Q           +L+ S+CCKH+TA
Sbjct: 150 NVNVFRDPRWGRGQETPGEDPLVVSRYAVMYLRGLQEVEDEASAKADRLKVSSCCKHYTA 209

Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
           YDLDNWKG  R+ FDA+VT QDL D+YQPPF+SCV +G  S +MC+YNRVNGIP+CAD +
Sbjct: 210 YDLDNWKGIDRFHFDAKVTKQDLEDSYQPPFKSCVVEGHVSSVMCSYNRVNGIPTCADPD 269

Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
           LL    R QWG  GYI SDCD+V + Y+A  Y  +PEDAV   LKAG+++NCG FL+K+T
Sbjct: 270 LLKGIIRGQWGLDGYIVSDCDSVEVYYNAIHYTATPEDAVALALKAGLNMNCGDFLKKYT 329

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
             AV  KK+  + +D+AL   + V MRLG F+ +P   PF  +G   VC+  +Q LAL A
Sbjct: 330 ANAVNLKKVDVATVDQALVYNYIVLMRLGFFD-DPKSLPFANLGPSDVCTKDNQQLALDA 388

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A+ GIVLL+N++G LPL ++    LA+IGPNAN+   ++ NYAG  CR  +PLQ LQ Y+
Sbjct: 389 AKQGIVLLENNNGALPLSQTNIKKLAVIGPNANATTVMISNYAGIPCRYTSPLQGLQKYI 448

Query: 460 ENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
            +  Y PGC  V C + S I  AV  A  AD VVL++GLDQ+ E E LDR +L LPG Q+
Sbjct: 449 SSVNYAPGCSNVKCDNQSLIAAAVKAAASADAVVLVVGLDQSIEAEGLDRENLTLPGFQE 508

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           + +  VA A K  VILV++  GP+DI+  K   NIG ILW GYPG+AG  A+A+VIFGD+
Sbjct: 509 KFVKDVAGATKGKVILVIMAAGPIDISSTKSVSNIGGILWVGYPGQAGGDAIAQVIFGDY 568

Query: 579 NPG 581
           NPG
Sbjct: 569 NPG 571


>gi|414586138|tpg|DAA36709.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 769

 Score =  615 bits (1585), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 297/545 (54%), Positives = 401/545 (73%), Gaps = 8/545 (1%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           ST  +PFC  +L I  RAR LVS LTLDEKI+QL N+A  +PRLGIP Y+WWSE+LHG+A
Sbjct: 33  STSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLA 92

Query: 99  GVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFW 157
             G G+ F +G +R AT FPQVIL+ A+F+  LW  + +A+  EA  ++NAGQA G+T+W
Sbjct: 93  DNGPGVNFSSGPVRAATDFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQA-GLTYW 151

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           APNINIFRDPRWGRGQET GEDP V   Y++ YV+G QG+       +G+++ SACCKH+
Sbjct: 152 APNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEE----GEEGRIRLSACCKHY 207

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
           TAYD++ W+G +RY F+A+V  QDL DTYQPPF++C+++ RAS +MCAYN+VNG+P CA 
Sbjct: 208 TAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNGVPMCAH 267

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
           ++LL KT R +WGF GYITSDCDAV+II++ + Y KS ED++  VLKAGMD+NCGSFL +
Sbjct: 268 KDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSGEDSIAIVLKAGMDINCGSFLVR 326

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
           HTK+A+++ K+ E +IDRAL NLFSV++RLG+F+     Q F ++G + VC+  H+ LA 
Sbjct: 327 HTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCTKEHRELAA 386

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           +A + G VLLKN H  LPL +S+   +A+IGP+AN A  + G+Y G  C   T L+ +Q 
Sbjct: 387 EAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGVPCNPTTFLKGIQA 446

Query: 458 YVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
           Y   T + PGC   +C+S  +  +AV+ AK AD VV++ GL+ T+E+E+ DRV L+LPG+
Sbjct: 447 YATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDFDRVSLLLPGK 506

Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           Q  LI  +A  AKKP++LVLL GGPVD++FAK D  I SILW GYPGE G   L E++FG
Sbjct: 507 QMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVLPEILFG 566

Query: 577 DHNPG 581
           ++NPG
Sbjct: 567 EYNPG 571


>gi|242076578|ref|XP_002448225.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
 gi|241939408|gb|EES12553.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
          Length = 766

 Score =  614 bits (1584), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 295/551 (53%), Positives = 401/551 (72%), Gaps = 8/551 (1%)

Query: 33  CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
           C      +  +PFC  +L I  RAR LVS LTLDEKI+QL N+A  +PRLGIP Y+WWSE
Sbjct: 24  CASPYSGSSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSE 83

Query: 93  ALHGVAGVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
           +LHG+A  G G+ F +G +R AT+FPQVIL+ A+F+  LW  + +A+  EA  ++NAGQA
Sbjct: 84  SLHGLADNGPGVNFSSGPVRAATTFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQA 143

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
            G+T+WAPNINIFRDPRWGRGQET GEDP V   Y++ YV+G QG+       +G+++ S
Sbjct: 144 -GLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEQGE----EGRIRLS 198

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           ACCKH+TAYD++ W+G +RY F+A+V  QDL DTYQPPF++C+++ RAS +MCAYN+VNG
Sbjct: 199 ACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNG 258

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
           +P CA+++LL KT R +WGF GYITSDCDAV+II++ + Y KS ED++  VLKAGMD+NC
Sbjct: 259 VPMCANKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSDEDSIAIVLKAGMDINC 317

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           GSFL +HTK+AV++ K+ E +IDRAL NLFSV++RLG+F+     Q   ++G + VC+  
Sbjct: 318 GSFLVRHTKSAVEKGKVQEQDIDRALFNLFSVQLRLGIFDKPNNNQWSTQLGPNNVCTKE 377

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+ LA +A + G VLLKN H  LPL +S+   +A+IGP+AN    + G+Y G +C   T 
Sbjct: 378 HRELAAEAVRQGAVLLKNDHSFLPLKRSEVRHVAIIGPSANDVYAMGGDYTGVACNPTTF 437

Query: 452 LQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
           L+ +Q Y   T +  GC  V+C+S  +  +A+  AK AD VV++ GL+ T+E+E+ DRV 
Sbjct: 438 LKGIQAYATQTTFAAGCKDVSCNSTELFGEAIAAAKRADIVVVVAGLNLTEEREDFDRVS 497

Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
           L+LPG+Q  LI  VA  AKKP++LVLL GGPVD++FAK D  I SILW GYPGE G   L
Sbjct: 498 LLLPGKQMSLIHAVASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVL 557

Query: 571 AEVIFGDHNPG 581
            E++FG++NPG
Sbjct: 558 PEILFGEYNPG 568


>gi|356525896|ref|XP_003531557.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Glycine max]
          Length = 776

 Score =  614 bits (1584), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 301/577 (52%), Positives = 401/577 (69%), Gaps = 16/577 (2%)

Query: 15  LCFTSLL---------TRVD-STQPPFSCD-PSNPSTETFPFCKTTLPISQRARDLVSRL 63
           LCF S +          RV   T   F+CD   NP+   + FC  +L +  R  DLV RL
Sbjct: 15  LCFFSFMFVATVLLNCDRVSGQTSSVFACDVAKNPALAGYGFCDKSLSLEDRVADLVKRL 74

Query: 64  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
           TL EKI  LVNSA ++ RLGIP YEWWSEALHGV+ VG G  F+  + GATSFP  ILTA
Sbjct: 75  TLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGATSFPMPILTA 134

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
           ASF++ L+  IG+ +  EARA+YN G A G+T+W+PNINIFRDPRWGRGQETPGEDPL++
Sbjct: 135 ASFNASLFEAIGRVVSTEARAMYNVGLA-GLTYWSPNINIFRDPRWGRGQETPGEDPLLS 193

Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
            KYA  YV+G+Q  T +G     KL+ +ACCKH+TAYDLDNWKG  RY F+A VT QD+ 
Sbjct: 194 SKYATGYVKGLQ-QTDDGDS--NKLKVAACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMD 250

Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
           DT+QPPF+SCV  G  + +MC+YN+VNG P+CAD +LL    R +W  +GYI SDCD+V 
Sbjct: 251 DTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGEWKLNGYIVSDCDSVE 310

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
           +++  + Y K+PE+A  + + AG+D+NCG++L ++T+ AVKQ  L E+ I+ A+ N F+ 
Sbjct: 311 VLFKDQHYTKTPEEAAAETILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAVSNNFAT 370

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
            MRLG F+G+P+ Q +G +G + VC+  ++ LA +AA+ GIVLLKNS G LPL      S
Sbjct: 371 LMRLGFFDGDPSKQTYGNLGPNDVCTSENRELAREAARQGIVLLKNSLGSLPLNAKAIKS 430

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
           LA+IGPNAN+ + ++GNY G  C  I+PLQAL   V  T Y  GC  V C++A +D A  
Sbjct: 431 LAVIGPNANATRVMIGNYEGIPCNYISPLQALTALVP-TSYAAGCPNVQCANAELDDATQ 489

Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
           IA  AD  V+++G     E E LDR++++LPG+QQ L++ VA A+K PVILV++ GG +D
Sbjct: 490 IAASADATVIVVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMSGGGMD 549

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           ++FAK +  I SILW GYPGEAG  A+A+VIFG +NP
Sbjct: 550 VSFAKSNDKITSILWVGYPGEAGGAAIADVIFGFYNP 586


>gi|356558612|ref|XP_003547598.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Glycine max]
          Length = 776

 Score =  613 bits (1581), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 301/582 (51%), Positives = 401/582 (68%), Gaps = 12/582 (2%)

Query: 1   MIFHKLSLVFPLLCLCFTSLLTRVD-STQPPFSCD-PSNPSTETFPFCKTTLPISQRARD 58
           + F   + V  +L  C      RV   T   F+CD   NP+   + FC  +L +  R  D
Sbjct: 15  LCFFSFTFVASVLLNC-----NRVSGQTSAVFACDVAKNPALAGYGFCDKSLSVEDRVAD 69

Query: 59  LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
           LV RLTL EKI  LVNSA ++ RLGIP YEWWSEALHGV+ VG G  F+  + GATSFP 
Sbjct: 70  LVKRLTLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGATSFPM 129

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGE 178
            ILTAASF++ L+  IG+ +  EARA+YN G A G+T+W+PNINIFRDPRWGRGQETPGE
Sbjct: 130 PILTAASFNASLFEAIGRVVSTEARAMYNVGLA-GLTYWSPNINIFRDPRWGRGQETPGE 188

Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
           DPL++ KYA  YV+G+Q  T +G     KL+ +ACCKH+TAYDLDNWKG  RY F+A VT
Sbjct: 189 DPLLSSKYATGYVKGLQ-QTDDGDS--NKLKVAACCKHYTAYDLDNWKGIQRYTFNAVVT 245

Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
            QD+ DT+QPPF+SCV  G  + +MC+YN+VNG P+CAD +LL    R +W  +GYI SD
Sbjct: 246 QQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGIIRGEWKLNGYIVSD 305

Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALH 358
           CD+V +++  + Y K+PE+A    + AG+D+NCG++L ++T+ AVKQ  L E+ I+ A+ 
Sbjct: 306 CDSVEVLFKDQHYTKTPEEAAAQTILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAVS 365

Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
           N F+  MRLG F+G+P+ QP+G +G   VC+  ++ LA +AA+ GIVLLKNS G LPL  
Sbjct: 366 NNFATLMRLGFFDGDPSKQPYGNLGPKDVCTSENRELAREAARQGIVLLKNSPGSLPLNA 425

Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI 478
               SLA+IGPNAN+ + ++GNY G  C  I+PLQ L   V  T Y  GC  V C++A +
Sbjct: 426 KTIKSLAVIGPNANATRVMIGNYEGIPCNYISPLQTLTALVP-TSYAAGCPNVQCANAEL 484

Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
           D A  IA  AD  V+++G     E E LDR++++LPG+QQ L++ VA A+K PVILV++ 
Sbjct: 485 DDATQIAASADATVIIVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMS 544

Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           GG +D++FAK +  I SILW GYPGEAG  A+A+VIFG +NP
Sbjct: 545 GGGMDVSFAKSNDKITSILWVGYPGEAGGAAIADVIFGFYNP 586


>gi|449466797|ref|XP_004151112.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
          Length = 770

 Score =  612 bits (1577), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 293/550 (53%), Positives = 395/550 (71%), Gaps = 8/550 (1%)

Query: 33  CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
           CD  N  T    FC+ +L I +R +DL+ RLTL EKI  LVN+A A+PRLGI  YEWWSE
Sbjct: 34  CDKRNVGTRNMGFCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSE 93

Query: 93  ALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI 152
           ALHGV+ VG G  F GT  GATSFPQVI TAASF+  LW  IG+ +  EARA+YN G A 
Sbjct: 94  ALHGVSNVGPGTKFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTA- 152

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
           G+T+W+PN+NIFRDPRWGRGQETPGEDP++  KYA +YV+G+QG   N GK   +L+ +A
Sbjct: 153 GLTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQG---NDGK--KRLKVAA 207

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
           CCKH+TAYDLDNW G  RY F+A+V+ QDL DTY  PF++CV +G+ + +MC+YN+VNG 
Sbjct: 208 CCKHYTAYDLDNWNGVDRYHFNAKVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGK 267

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
           P+CAD +LL  T R  WG  GYI SDCD+V ++YD++ +  +PE+A    +KAG+D++CG
Sbjct: 268 PTCADPDLLKNTIRGAWGLDGYIVSDCDSVGVLYDSQHFTPTPEEAAASTIKAGLDLDCG 327

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
            FL  HT  AV +  L E +++ AL NL SV+MRLG+F+G P  QP+G +G   VC+PAH
Sbjct: 328 PFLAVHTATAVGRGLLKEVDLNNALANLLSVQMRLGMFDGEPAAQPYGNLGPKDVCTPAH 387

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + LAL+AA+ GIVLL+N  G LPL  ++  ++A+IGPN+++  T++GNYAG +C   TP+
Sbjct: 388 KHLALEAARQGIVLLQNRAGALPLSPTRHRTVAVIGPNSDATVTMIGNYAGVACEYTTPV 447

Query: 453 QALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
           Q +  YV+ T++  GC  VAC     I +A   A+ AD  V+++GLDQ+ E E  DR  +
Sbjct: 448 QGISKYVK-TIHAKGCANVACVGDQLIGEAEAAARVADAAVVVVGLDQSIEAESRDRNGV 506

Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
           +LPG+Q+EL+ R+  A K P ++VL+ GGP+D++FAK D  I  ILW GYPG+AG  A+A
Sbjct: 507 LLPGKQEELVRRIGLACKGPTVVVLMSGGPIDVSFAKNDGKISGILWVGYPGQAGGAAIA 566

Query: 572 EVIFGDHNPG 581
           +V+FG  NPG
Sbjct: 567 DVLFGATNPG 576


>gi|357166259|ref|XP_003580652.1| PREDICTED: beta-D-xylosidase 4-like [Brachypodium distachyon]
          Length = 774

 Score =  610 bits (1574), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 302/554 (54%), Positives = 390/554 (70%), Gaps = 8/554 (1%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           P F+CD +N +   + FC      S RA DLVSRLTL +K+  LVN  PA+ RLGIPAYE
Sbjct: 36  PVFACDAANSTVAGYAFCDRAKSASARAADLVSRLTLADKVGFLVNKQPALARLGIPAYE 95

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           WWSEALHGV+ VG G  F+  + GATSFPQ ILTAASF++ L+  IG+ +  EARA++N 
Sbjct: 96  WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSNEARAMHNV 155

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+  +YAV YV G+Q     G    G L
Sbjct: 156 GLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASRYAVGYVSGLQD---AGADADGPL 211

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           + +ACCKH+TAYD+DNWKG  RY FDA+V+ QDL DT+QPPF+SCV  G+ + +MC+YN+
Sbjct: 212 KVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVIDGKVASVMCSYNK 271

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG P+CAD++LLS   R  W  +GYI SDCD+V ++Y  + Y K+PE+A    +K+G+D
Sbjct: 272 VNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYSQQHYTKTPEEAAAITIKSGLD 331

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           +NCG FL KHT AAV+   L ES++DRA+ N F + MRLG F+G+P    +G +G   VC
Sbjct: 332 LNCGDFLAKHTVAAVQAGNLSESDVDRAITNNFIMLMRLGFFDGDPRKLAYGSLGPKDVC 391

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           + ++Q LA + A+ GIVLLKN  G LPL      S+A+IGPNAN++ T++GNY G  C+ 
Sbjct: 392 TSSNQELARETARQGIVLLKND-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKY 450

Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQEKEEL 506
            TPL  L N V  TVY PGC  V CS  S+    A   A  AD  VL++G DQ+ E+E L
Sbjct: 451 TTPLHGLGNNVA-TVYQPGCSNVGCSGNSLQLSAATAAAASADVTVLVVGADQSIEREAL 509

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  L+LPG+Q +LI+ VA A+K  VILV++ GGP DI+FAK    I +ILW GYPGEAG
Sbjct: 510 DRTSLLLPGQQPDLISAVANASKGHVILVVMSGGPFDISFAKASDKISAILWVGYPGEAG 569

Query: 567 AVALAEVIFGDHNP 580
             A+A++IFG +NP
Sbjct: 570 GAAIADIIFGKYNP 583


>gi|297811069|ref|XP_002873418.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
 gi|297319255|gb|EFH49677.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
          Length = 780

 Score =  610 bits (1573), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 309/572 (54%), Positives = 404/572 (70%), Gaps = 10/572 (1%)

Query: 13  LCLCFTSLLTRVDSTQ--PPFSCDPS-NPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
           L LCF   ++   + Q  P F+CD + NPS     FC T L I  R  DLV RLTL+EKI
Sbjct: 19  LLLCFLLCISEQSNAQSSPVFACDVTGNPSLAGLRFCNTGLNIKSRVTDLVGRLTLEEKI 78

Query: 70  SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
             L ++A  + RLGIPAY+WWSEALHGV+ VG G  F+G + GATSFPQVILTAASF+  
Sbjct: 79  GFLGSNAIGVSRLGIPAYKWWSEALHGVSNVGGGSSFSGQVPGATSFPQVILTAASFNVS 138

Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
           L+  IG+ +  EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP ++ KYAV+
Sbjct: 139 LFQAIGKVVSTEARAMYNVGSA-GLTFWSPNVNIFRDPRWGRGQETPGEDPELSSKYAVA 197

Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
           YVRG+Q    +GG    +L+ +ACCKH+TAYD+DNWK   R+ F+A V  QD+ADT+QPP
Sbjct: 198 YVRGLQ--ETDGGD-PNRLKVAACCKHYTAYDVDNWKDVHRFTFNAVVNQQDMADTFQPP 254

Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
           F+SCV  G  + +MC+YN+VNG P+CAD +LLS   R QW  +GYI SDCD+V ++Y  +
Sbjct: 255 FKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGQWKLNGYIVSDCDSVDVLYTKQ 314

Query: 310 GYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
            Y K+PE+AV   + AG+D+NC  F  ++   AVK   + E+ ID+A+ N F+  MRLG 
Sbjct: 315 HYTKTPEEAVAKSILAGLDLNCDHFTGQYAMKAVKVGLVNETAIDKAISNNFATLMRLGF 374

Query: 370 FNGNPTMQP-FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
           F+G+P  Q  +G +G + VC+  +Q LA  AA+ GIVLLKNS G LPL  S   +LA+IG
Sbjct: 375 FDGDPKKQQLYGGLGPNDVCTANNQELARDAARQGIVLLKNSAGSLPLSPSAIKTLAVIG 434

Query: 429 PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGA 488
           PNAN+ +T++GNY G  C+  TPLQ L   V +T Y  GC+ VAC+   +  A  +A  A
Sbjct: 435 PNANATETMIGNYNGIPCKYTTPLQGLAETVSST-YQLGCN-VACAEPDLGSAAALAASA 492

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           D VVL+MG DQ+ E+E LDR+DL LPG+QQEL+T+VA+ AK PV+LV++ GG  DITFAK
Sbjct: 493 DAVVLVMGADQSIEQENLDRLDLYLPGKQQELVTQVAKVAKGPVVLVIMSGGAFDITFAK 552

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +  I  I+W GYPGEAG +A+A+VIFG HNP
Sbjct: 553 NEEKITGIMWVGYPGEAGGLAIADVIFGRHNP 584


>gi|357164885|ref|XP_003580200.1| PREDICTED: probable beta-D-xylosidase 6-like [Brachypodium
           distachyon]
          Length = 771

 Score =  608 bits (1568), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 293/541 (54%), Positives = 395/541 (73%), Gaps = 6/541 (1%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +PFC  +LP   RAR LVS LTLDEKI+QL N+A  +PRLGIP YEWWSE+LHG+A  G 
Sbjct: 37  YPFCDASLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGIPPYEWWSESLHGLADNGP 96

Query: 103 GI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNI 161
           G+ F +G +  AT FPQVIL+AASF+  LW  + +A+ +EARA++NAGQA G+T+WAPNI
Sbjct: 97  GVNFSSGPVGAATIFPQVILSAASFNRSLWRAVAEAVAVEARAMHNAGQA-GLTYWAPNI 155

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+FRDPRWGRGQETPGEDP V   Y+V YV+G QG+ +  GK +G++  SACCKH+ AYD
Sbjct: 156 NVFRDPRWGRGQETPGEDPAVIAAYSVEYVKGFQGE-YGDGK-EGRMMLSACCKHYVAYD 213

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
           L+ W   TRY F+A+V  QD  DTY+PPF+SC+++GRAS +MC+YN+VNG+P+CA ++LL
Sbjct: 214 LEKWGNFTRYTFNAKVNEQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVPACARKDLL 273

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
            K  R +WGF GY+ SDCDAV IIY  + Y  S ED++  VLKAGMD+NCGSFL +HTK+
Sbjct: 274 QKV-RDEWGFQGYVVSDCDAVGIIYGYQNYTNSDEDSIAIVLKAGMDINCGSFLIRHTKS 332

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A+++ K+ E +I+ AL NLFSV++RLGLF+     Q F ++G   +C+  H+ LA +AA+
Sbjct: 333 AIQKGKITEEDINHALFNLFSVQLRLGLFDKTSGNQWFTQLGPSNICTKEHRELAAEAAR 392

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
            G VLLKN +  LPL +S+   +A+IGP AN A  + G+Y G  C   T L+ +Q  V  
Sbjct: 393 QGTVLLKNDNSFLPLKRSEVSHIAIIGPVANDAYIMGGDYTGVPCNPTTFLKGMQAVVPQ 452

Query: 462 TVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
           T    GC  ++C+S     +A+++AK AD VVL+ GL+ TQE E+LDRV L+LPG+Q +L
Sbjct: 453 TTIAAGCKDISCNSTDGFGEAIEVAKRADIVVLIAGLNLTQETEDLDRVSLLLPGKQMDL 512

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I  +A   KKP++LV+  GGPVD++FAK D+ I S+LW GYPGE G   L E++FG++NP
Sbjct: 513 INSIASVTKKPLVLVITGGGPVDVSFAKQDKRIASVLWIGYPGEVGGQVLPEILFGEYNP 572

Query: 581 G 581
           G
Sbjct: 573 G 573


>gi|384872601|gb|AFI25186.1| putative beta-D-xylosidase [Nicotiana tabacum]
          Length = 791

 Score =  608 bits (1567), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 296/560 (52%), Positives = 380/560 (67%), Gaps = 26/560 (4%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           + FC   LPIS R + L+S LT+DEKI  L ++  +IPRLG+PAYEWWSE+LHG+A  G 
Sbjct: 41  YTFCNKNLPISTRVQSLISLLTIDEKILHLSDNTTSIPRLGLPAYEWWSESLHGIATNGP 100

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
            + FNG I+G TSFPQVILTAA+F+  LW+ I  AI +EARA+YN GQA G+TFWAPNIN
Sbjct: 101 AVNFNGQIKGVTSFPQVILTAAAFNRTLWHSIATAIAVEARAMYNLGQA-GLTFWAPNIN 159

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG---------------- 206
           I RDPRWGRGQETPGEDP+V   YA+ YV G QG   N    KG                
Sbjct: 160 ILRDPRWGRGQETPGEDPMVVSAYAIEYVTGFQG--LNPKAKKGNRNGYGKKRRVLKEDD 217

Query: 207 ----KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
               +L  SACCKHFTAYDL+ W   TRY F+A VT QD+ DT+Q PF SC++QG+AS +
Sbjct: 218 NDGERLMLSACCKHFTAYDLEKWGDATRYDFNAVVTKQDMEDTFQAPFRSCIQQGKASCL 277

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           MC+YN VNG+P+CAD+ LL K  R  WGF GYITSDCDAV+ IY+ + Y K+PEDAV   
Sbjct: 278 MCSYNSVNGVPACADKELLDKV-RTDWGFDGYITSDCDAVATIYENQKYTKTPEDAVAVA 336

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           LKAG ++NCG+++ +H K+A +Q  + E ++DRAL  LFSV+ RLGLF+GNP    F   
Sbjct: 337 LKAGTNINCGTYMLRHMKSAFQQGSVLEEDLDRALQYLFSVQFRLGLFDGNPADGQFANF 396

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
           GA  VC+  H  LAL AA+ GIVLLKN    LPL K+   +LA++GP AN   +  G Y+
Sbjct: 397 GAQDVCTSNHLNLALDAARQGIVLLKNDQKFLPLDKTSVSTLAIVGPMAN-VSSPGGTYS 455

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQ 501
           G  C+  +  +    ++  T+Y  GC  V C S+A    A+ I K AD+V+++ G D ++
Sbjct: 456 GVPCKLKSIREGFHRHINRTLYAAGCLDVGCNSTAGFQDAISIVKEADYVIVVAGSDLSE 515

Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
           E E+ DR  L+LPG+Q  L+T +A A+KKP+ILVL  GGPVD++FA+ D  I SILW  Y
Sbjct: 516 ETEDHDRYSLLLPGQQTNLVTTLAAASKKPIILVLTGGGPVDVSFAEKDPRIASILWVAY 575

Query: 562 PGEAGAVALAEVIFGDHNPG 581
           PGE G  AL+E+IFG  NPG
Sbjct: 576 PGETGGKALSEIIFGYQNPG 595


>gi|226531269|ref|NP_001145980.1| uncharacterized protein LOC100279508 precursor [Zea mays]
 gi|219885199|gb|ACL52974.1| unknown [Zea mays]
 gi|413920228|gb|AFW60160.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 794

 Score =  608 bits (1567), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 295/560 (52%), Positives = 392/560 (70%), Gaps = 12/560 (2%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
           F+C P  P+  + PFC+ +LP+  RARDLVSRLT  EK+  LVN+A  +PRLG+  YEWW
Sbjct: 27  FACAPGGPAA-SLPFCRQSLPLRARARDLVSRLTRAEKVRLLVNNAAGVPRLGVAGYEWW 85

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
           SEALHGV+  G G+ F G   GAT+FPQVI TAAS ++ LW  +G+A+  EARA+YN G+
Sbjct: 86  SEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASLNATLWELVGRAVSDEARAMYNGGR 145

Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF--NGGKLKGKL 208
           A G+TFW+PN+NIFRDPRWGRGQETPGEDP V+ +YA +YVRG+Q      NGG  + +L
Sbjct: 146 A-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGH-RNRL 203

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           + +ACCKHFTAYDLD W GT R+ F+A V  QDL DT+  PF +CV+ GRA+ +MC+YN+
Sbjct: 204 KLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVMCSYNQ 263

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG+P+CAD   L  T R +WG  GYI SDCD+V + +  + Y ++PEDA    L+AG+D
Sbjct: 264 VNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTRTPEDAAAATLRAGLD 323

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           ++CG FL  +  +AV   K+ ++++D AL N  +V+MRLG+F+G+P   PFG++G   VC
Sbjct: 324 LDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRLGPADVC 383

Query: 389 SPAHQVLALQAAQDGIVLLKNSHG------LLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
           +  HQ LAL AA+ G+VLLKN  G      +LPL  +    +A++GP+A++   ++GNYA
Sbjct: 384 TREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVAMIGNYA 443

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQ 501
           G  CR  TPLQ +  Y     +  GC  VAC  +  I  AV+ A+ AD  V++ GLDQ  
Sbjct: 444 GKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVAGLDQRV 503

Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
           E E LDR  L+LPGRQ ELI+ VA+A+K PVILVL+ GGP+DI FA+ D  I  ILW GY
Sbjct: 504 EAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGY 563

Query: 562 PGEAGAVALAEVIFGDHNPG 581
           PG+AG  A+A+VIFG HNPG
Sbjct: 564 PGQAGGQAIADVIFGHHNPG 583


>gi|357445735|ref|XP_003593145.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
 gi|355482193|gb|AES63396.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
          Length = 775

 Score =  603 bits (1555), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 297/581 (51%), Positives = 402/581 (69%), Gaps = 10/581 (1%)

Query: 5   KLSLVFPLLCLCFTSLLTRVD----STQPPFSCD-PSNPSTETFPFCKTTLPISQRARDL 59
           K+S VF    + + ++L   +     T   F+CD   N +  ++ FC  +L +  R  DL
Sbjct: 10  KVSSVFLCFSIFYVAVLLNCNHVYGQTSTVFACDVAKNTNVSSYGFCDKSLSVEDRVSDL 69

Query: 60  VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
           V RLTL EKI  L NSA  + RLGIP YEWWSEALHGV+ +G G  F+  + GATSFP  
Sbjct: 70  VKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSNIGPGTHFSSLVPGATSFPMP 129

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
           ILTAASF++ L+  IG  +  EARA+YN G A G+T+W+PNINIFRDPRWGRGQETPGED
Sbjct: 130 ILTAASFNTSLFQAIGSVVSNEARAMYNVGLA-GLTYWSPNINIFRDPRWGRGQETPGED 188

Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
           PL++ KYA  YV+G+Q  T +G     KL+ +ACCKH+TAYD+DNWKG  RY FDA V+ 
Sbjct: 189 PLLSSKYAAGYVKGLQ-QTDDGDS--DKLKVAACCKHYTAYDVDNWKGVQRYTFDAVVSQ 245

Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
           QDL DT+QPPF+SCV  G  + +MC+YN+VNG P+CAD +LL    R +W  +GYI SDC
Sbjct: 246 QDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKLNGYIVSDC 305

Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
           D+V +++  + Y K+PE+A    + +G+D++CGS+L ++T  AVKQ  + E+ I+ A+ N
Sbjct: 306 DSVEVLFKDQHYTKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDEASINNAVSN 365

Query: 360 LFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS 419
            F+  MRLG F+G+P+ QP+G +G   VC+P +Q LA +AA+ GIVLLKNS G LPL   
Sbjct: 366 NFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNSPGSLPLSSK 425

Query: 420 KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID 479
              SLA+IGPNAN+ + ++GNY G  C+  +PLQ L  +V  T Y PGC  V C++A ID
Sbjct: 426 AIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQGLTAFVP-TSYAPGCPDVQCANAQID 484

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
            A  IA  AD  ++++G +   E E LDRV+++LPG+QQ+L+  VA  +K PVILV++ G
Sbjct: 485 DAAKIAASADATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKGPVILVIMSG 544

Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           G +D++FAK +  I SILW GYPGEAG  A+A+VIFG +NP
Sbjct: 545 GGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGSYNP 585


>gi|356524862|ref|XP_003531047.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Glycine max]
          Length = 765

 Score =  602 bits (1553), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 293/551 (53%), Positives = 386/551 (70%), Gaps = 6/551 (1%)

Query: 31  FSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           F+CD   +P+   + FC  +L +  R +DLV RLTL EKI  LVNSA  + RLGIP YEW
Sbjct: 30  FACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAVDVSRLGIPKYEW 89

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSEALHGV+ VG G  F+  I GATSFP  ILTAASF++ L+  IG+ +  EARA+YN G
Sbjct: 90  WSEALHGVSNVGPGTRFSNVIPGATSFPMPILTAASFNTSLFEVIGRVVSTEARAMYNVG 149

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            A G+T+W+PNINIFRDPRWGRG ETPGEDP++T KYA  YV+G+Q    +GG    KL+
Sbjct: 150 LA-GLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQ--QTDGGD-PNKLK 205

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
            +ACCKH+TAYD+DNWKG  RY F+A VT QD+ DT+QPPF+SCV  G  + +MC+YN+V
Sbjct: 206 VAACCKHYTAYDVDNWKGIQRYTFNAVVTKQDMEDTFQPPFKSCVIDGNVASVMCSYNKV 265

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           NG P+CAD +LL    R +W  +GYI SDCD+V ++Y  + Y K+PE+A    + AG+D+
Sbjct: 266 NGKPTCADPDLLKGVVRGEWKLNGYIVSDCDSVEVLYKDQHYTKTPEEAAAISILAGLDL 325

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           NCG FL ++T+ AVKQ  + E+ I+ A+ N F+  MRLG F+G+P  QP+G +G   VC+
Sbjct: 326 NCGRFLGQYTEGAVKQGLIDEASINNAVTNNFATLMRLGFFDGDPRKQPYGNLGPKDVCT 385

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             +Q LA +AA+ GIVLLKNS   LPL      SLA+IGPNAN+ + ++GNY G  C+ I
Sbjct: 386 QENQELAREAARQGIVLLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYI 445

Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
           +PLQ L  +   T Y  GC  V C +  +D A  IA  AD  V+++G     E E LDRV
Sbjct: 446 SPLQGLTAFAP-TSYAAGCLDVRCPNPVLDDAKKIAASADATVIVVGASLAIEAESLDRV 504

Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
           +++LPG+QQ L++ VA A+K PVILV++ GG +D++FAK +  I SILW GYPGEAG  A
Sbjct: 505 NILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKNNNKITSILWVGYPGEAGGAA 564

Query: 570 LAEVIFGDHNP 580
           +A+VIFG HNP
Sbjct: 565 IADVIFGFHNP 575


>gi|292630922|sp|A5JTQ2.1|XYL1_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 1;
           AltName: Full=Xylan
           1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 1;
           Short=MsXyl1; Includes: RecName: Full=Beta-xylosidase;
           AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
           Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
           Full=Alpha-N-arabinofuranosidase; AltName:
           Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
           Flags: Precursor
 gi|146762261|gb|ABQ45227.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
           varia]
          Length = 774

 Score =  599 bits (1545), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 296/581 (50%), Positives = 400/581 (68%), Gaps = 10/581 (1%)

Query: 5   KLSLVFPLLCLCFTSLLTRVD----STQPPFSCD-PSNPSTETFPFCKTTLPISQRARDL 59
           K+S VF    + + ++L   +     T   F+CD   N +  ++ FC  +L +  R  DL
Sbjct: 9   KVSSVFLCFSIFYVTVLLNCNHVYGQTSTVFACDVAKNTNVSSYGFCDNSLSVEDRVSDL 68

Query: 60  VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
           V RLTL EKI  L NSA  + RLGIP YEWWSEALHGV+ +G G  F+  + GAT+FP  
Sbjct: 69  VKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSNIGPGTHFSSLVPGATNFPMP 128

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
           ILTAASF++ L+  IG  +  EARA+YN G A G+T+W+PNINIFRDPRWGRGQETPGED
Sbjct: 129 ILTAASFNTSLFQAIGSVVSNEARAMYNVGLA-GLTYWSPNINIFRDPRWGRGQETPGED 187

Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
           PL++ KYA  YV+G+Q  T +G     KL+ +ACCKH+TAYD+DNWKG  RY FDA V+ 
Sbjct: 188 PLLSSKYAAGYVKGLQ-QTDDGDS--DKLKVAACCKHYTAYDVDNWKGVQRYTFDAVVSQ 244

Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
           QDL DT+QPPF+SCV  G  + +MC+YN+VNG P+CAD +LL    R +W  +GYI SDC
Sbjct: 245 QDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKLNGYIVSDC 304

Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
           D+V ++Y  + Y K+PE+A    + +G+D++CGS+L ++T  AVKQ  + E+ I  A+ N
Sbjct: 305 DSVEVLYKDQHYTKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDEASITNAVSN 364

Query: 360 LFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS 419
            F+  MRLG F+G+P+ QP+G +G   VC+P +Q LA +AA+ GIVLLKNS   LPL   
Sbjct: 365 NFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNSPRSLPLSSK 424

Query: 420 KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID 479
              SLA+IGPNAN+ + ++GNY G  C+  +PLQ L  +V  T Y PGC  V C++A ID
Sbjct: 425 AIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQGLTAFVP-TSYAPGCPDVQCANAQID 483

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
            A  IA  AD  ++++G +   E E LDRV+++LPG+QQ+L+  VA  +K PVILV++ G
Sbjct: 484 DAAKIAASADATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKGPVILVIMSG 543

Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           G +D++FAK +  I SILW GYPGEAG  A+A+VIFG +NP
Sbjct: 544 GGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGSYNP 584


>gi|326491679|dbj|BAJ94317.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 772

 Score =  599 bits (1544), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 294/555 (52%), Positives = 396/555 (71%), Gaps = 6/555 (1%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           PP +   ++    ++ FC  +LP   RAR LVS LTLDEKI+QL N+A  +PRLG+P YE
Sbjct: 24  PPNARACASAEANSYAFCDGSLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGVPPYE 83

Query: 89  WWSEALHGVAGVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
           WWSE+LHG+A  G G+ F +G +  AT FPQVIL+AA+F+  LW  + +A+ +EARA++N
Sbjct: 84  WWSESLHGLADNGPGVNFSSGPVAAATIFPQVILSAAAFNRSLWRAVAEAVAVEARAMHN 143

Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
           AGQA G+T+WAPNIN+FRDPRWGRGQETPGEDP +   Y+V YV+G QG+  +G   +G+
Sbjct: 144 AGQA-GLTYWAPNINVFRDPRWGRGQETPGEDPAMIAAYSVEYVKGFQGEYGDG--REGR 200

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           +  SACCKH+ AYDL+ W    RY F+A V  QD  DTY+PPF+SC+++GRAS +MC+YN
Sbjct: 201 MMLSACCKHYIAYDLEKWGKFARYTFNAEVNAQDFEDTYEPPFKSCIQEGRASCLMCSYN 260

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
           +VNG+P+CA ++LL K  R +WGF GYI SDCDAV+II++ + Y  S ED+V  VLKAGM
Sbjct: 261 QVNGVPACARKDLLQKI-RDEWGFKGYIVSDCDAVAIIHENQTYTSSDEDSVAIVLKAGM 319

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           DVNCGSFL +HTK+A+++ K+ E +I+ AL+NLFSV++RLGLF      Q F ++G   V
Sbjct: 320 DVNCGSFLIRHTKSAIEKGKIQEEDINHALYNLFSVQLRLGLFEKANENQWFTRLGPSNV 379

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
           C+  H+ LA +A + G VLLKN +  LPL +SK   +ALIG  AN A  + G+Y G  C 
Sbjct: 380 CTKEHRELAAEAVRQGTVLLKNDNSFLPLKRSKVSHIALIGAAANDAYIMGGDYTGVPCD 439

Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
            IT L+ +Q +V  T    GC  V+C S     +A++ AK AD VV++ GL+ TQE E+L
Sbjct: 440 PITFLKGMQAFVPQTTVAAGCKDVSCDSPDGFGEAIEAAKRADIVVVIAGLNLTQESEDL 499

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DRV L+LPGRQQ+L+  +A   KKP++LV+  GGPVD+ FAK D  I S+LW GYPGE G
Sbjct: 500 DRVTLLLPGRQQDLVNIIASVTKKPIVLVITGGGPVDVAFAKQDPRIASVLWIGYPGEVG 559

Query: 567 AVALAEVIFGDHNPG 581
              L E++FG++NPG
Sbjct: 560 GQVLPEILFGEYNPG 574


>gi|356510699|ref|XP_003524073.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Glycine max]
          Length = 613

 Score =  593 bits (1529), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 291/551 (52%), Positives = 385/551 (69%), Gaps = 7/551 (1%)

Query: 31  FSCD-PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           F+CD   +P+   + FC  +L +  R +DLV RLTL EKI  LVNSA  + RLGIP YEW
Sbjct: 30  FACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAGDVSRLGIPRYEW 89

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSEALHGV+ VG G  F+  + GATSFP  ILTAASF++ L+  IG+ +  EA A+YN G
Sbjct: 90  WSEALHGVSNVGLGTRFSNVVPGATSFPMPILTAASFNTSLFEVIGRVVSTEAGAMYNVG 149

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            A G+T+W+PNINIFRDPRWGRG ETPGEDP++T KYA  YV+G+Q    +GG    KL+
Sbjct: 150 LA-GLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQ--QTDGGD-PNKLK 205

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
            +ACCKH+TAYD+D WKG  RY F+A +T QDL DT+QPPF+SCV  G  + +MC+YN+V
Sbjct: 206 VAACCKHYTAYDVDKWKGIQRYTFNAVLTKQDLEDTFQPPFKSCVIDGNVASVMCSYNKV 265

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           NG P+CAD +LL    R +W  +GY+ SDCD+V ++Y  + Y K+PE+A    + AG+D+
Sbjct: 266 NGKPTCADPDLLKGVVRGEWKLNGYMVSDCDSVEVLYKYQHYTKTPEEAAAISILAGLDL 325

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           NCG FL ++T+ AVKQ  + ES I+ A+ N F+  MRLG F+G+P  QP+G +G   VC+
Sbjct: 326 NCGRFLGQYTEGAVKQGLIDES-INNAVSNNFATLMRLGFFDGDPRKQPYGNLGPKDVCT 384

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
           PA+Q LA +AA+ GIV LKNS   LPL      SLA+IGPNAN+ + ++GNY G  C+ I
Sbjct: 385 PANQELAREAARQGIVSLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCKYI 444

Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
           +PLQ L  +V  T Y  GC  V C +  +D A  I+   D  V+++G     E E LDRV
Sbjct: 445 SPLQGLTAFVP-TSYAAGCLDVRCPNPVLDDAKKISASGDATVIVVGASLAIEAESLDRV 503

Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
           +++LPG+QQ L+T VA A+K PVILV++ GG +D++FAK +  I SILW GYPGEAG  A
Sbjct: 504 NILLPGQQQLLVTEVANASKGPVILVIMSGGGMDVSFAKDNNKITSILWVGYPGEAGGAA 563

Query: 570 LAEVIFGDHNP 580
           +A+VIFG HNP
Sbjct: 564 IADVIFGFHNP 574


>gi|115486595|ref|NP_001068441.1| Os11g0673200 [Oryza sativa Japonica Group]
 gi|113645663|dbj|BAF28804.1| Os11g0673200 [Oryza sativa Japonica Group]
          Length = 822

 Score =  593 bits (1528), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 304/575 (52%), Positives = 390/575 (67%), Gaps = 40/575 (6%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
           T PFC+ +LP   RARDLV+RLT  EK+  LVN+A  +PRLG+  YEWWSEALHGV+  G
Sbjct: 40  TLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIG-------------------------- 135
            G+ F G   GAT+FPQVI TAASF++ LW  IG                          
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQVMPILKGGHARCNQRPSCIRISVFM 159

Query: 136 ------QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
                 QA+  E RA+YN GQA G+TFW+PN+NIFRDPRWGRGQETPGEDP V  +YA +
Sbjct: 160 YVYVCAQAVSDEGRAMYNGGQA-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAA 218

Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
           YVRG+Q    + G+LK     +ACCKHFTAYDLDNW GT R+ F+A VT QDL DT+  P
Sbjct: 219 YVRGLQQQQPSSGRLK----LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVP 274

Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
           F SCV  GRA+ +MC+YN+VNG+P+CAD   L  T RR+WG  GYI SDCD+V + Y  +
Sbjct: 275 FRSCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQ 334

Query: 310 GYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
            Y ++ EDAV   L+AG+D++CG FL ++T+ AV Q K+ + +ID A+ N  +V+MRLG+
Sbjct: 335 HYTRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGM 394

Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIG 428
           F+G+P  QPFG +G   VC+ AHQ LA++AA+ GIVLLKN    LPL P +   ++A++G
Sbjct: 395 FDGDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVG 454

Query: 429 PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS--IDKAVDIAK 486
           P+A +   ++GNYAG  CR  TPLQ +  Y     + PGC  VAC+ +   I  AVD A+
Sbjct: 455 PHAEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAAR 514

Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
            AD  +++ GLDQ  E E LDR  L+LPGRQ ELI+ VA+A+K PVILVL+ GGP+DI F
Sbjct: 515 RADATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGF 574

Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           A+ D  I  ILWAGYPG+AG  A+A+VIFG HNPG
Sbjct: 575 AQNDPKIAGILWAGYPGQAGGQAIADVIFGHHNPG 609


>gi|85813770|emb|CAJ65921.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 704

 Score =  592 bits (1525), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 316/599 (52%), Positives = 409/599 (68%), Gaps = 35/599 (5%)

Query: 8   LVFPLLCLCFTSLLTRVDS-TQPPFSCDP-SNPSTETFPFCKTTLPISQRARDLVSRLTL 65
           +VF L C+    L T V + + P F+CD  SNPS  +  FC T++ I+ R  DLV RLTL
Sbjct: 15  VVFLLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGINDRVVDLVKRLTL 74

Query: 66  DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAAS 125
            EKI  LVNSA  + RLGIP YEWWSEALHGV+ VG G  F+  + GATSFPQVILTAAS
Sbjct: 75  QEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVILTAAS 134

Query: 126 FDSYLWYRIG-----QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
           F++ L+  IG     Q +  EARA+YN G A G+TFW+PNINIFRDPRWGRGQETPGEDP
Sbjct: 135 FNTSLFEAIGKVYYTQVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDP 193

Query: 181 LVTGKYAVSYVRGVQ----GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
           L++ KY   YV+G+Q    GD         KL+ +ACCKH+TAYDLDNWKG+ RY F+A 
Sbjct: 194 LLSSKYGSCYVKGLQQRDDGD-------PDKLKVAACCKHYTAYDLDNWKGSDRYHFNAV 246

Query: 237 V-TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY- 294
           V T QD+ DT+QPPF+SCV  G  + +MC+YN+VNG P+CAD +LLS   R +W  +GY 
Sbjct: 247 VVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWNLNGYQ 306

Query: 295 ------ITSDCDAVSIIYDAEGYAKSPEDAVV------DVLKAGMDVNCGSFLQKHTKAA 342
                 I +DCD++ + Y ++ Y K+PE+A        + L  G+D+NCGSFL +HT+AA
Sbjct: 307 WGCCRYIVTDCDSLDVFYKSQNYTKTPEEAAAAAILAGNSLVTGVDLNCGSFLGQHTEAA 366

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
           VK   + E  ID A+ N F+  MRLG F+G+P+ Q +GK+G   VC+  +Q LA +AA+ 
Sbjct: 367 VKGGLVNEHAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQ 426

Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS-CRSITPLQALQNYVEN 461
           GIVLLKN+ G LPL  +   +LA+IGPNAN  KT++GNY G + C+  TPLQ L   V  
Sbjct: 427 GIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGGTPCKYTTPLQGLAASVA- 485

Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
           T Y PGC  VACS+A +D A  +A  AD  VL+MG D + E E  DRVD++LPG+QQ LI
Sbjct: 486 TTYLPGCSNVACSTAQVDDAKKLAAAADATVLVMGADLSIEAESRDRVDVLLPGQQQLLI 545

Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           T VA  +  PVILV++ GG +D++FA+ +  I SILW GYPGEAG  A+A++IFG +NP
Sbjct: 546 TAVANVSCGPVILVIMSGGGMDVSFARTNDKITSILWVGYPGEAGGAAIADIIFGYYNP 604


>gi|168065036|ref|XP_001784462.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663987|gb|EDQ50724.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 726

 Score =  592 bits (1525), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 303/543 (55%), Positives = 379/543 (69%), Gaps = 20/543 (3%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           FC T+L    R  DLVSRLTL+EK++QLVN+A AIPRL IPAYEWW E LHGVA V    
Sbjct: 3   FCDTSLSDEIRVFDLVSRLTLEEKVTQLVNTASAIPRLSIPAYEWWQEGLHGVAHVS--- 59

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
            F G++  ATSFP  ILT ASF+  LW +IGQA   EARA YN G A G+T+W+P INI 
Sbjct: 60  -FGGSLPRATSFPLPILTTASFNKDLWNQIGQAFSTEARAFYNDGIA-GLTYWSPVINIA 117

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGR QET GEDP  T  YA  +V+G+Q    N  +LK     SACCKHFTAYD+DN
Sbjct: 118 RDPRWGRIQETSGEDPYTTSAYATHFVQGMQEGDANSKRLK----LSACCKHFTAYDVDN 173

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           W+G  RY FDA+    +LADTY PPF+SCV++GR++ +MC+YN+VNG+P+CA+ + L  T
Sbjct: 174 WEGIDRYHFDAKA---NLADTYNPPFQSCVQEGRSASLMCSYNKVNGVPTCANYDFLENT 230

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            RR WG +GYI SDCD+V +++++  YA + EDA  D L AG+D+NCG +L  +T+ AV 
Sbjct: 231 VRRAWGLNGYIVSDCDSVLVMHESTNYAPTTEDAAADALNAGLDLNCGDYLASYTEGAVA 290

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
             K+  S +D A++N+F VRMRLG+F+GNP  Q FG IG   VC+PAHQ LA++AA+ GI
Sbjct: 291 MGKVNASRVDNAVYNVFLVRMRLGMFDGNPANQEFGNIGVADVCTPAHQELAVEAARQGI 350

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY----VE 460
           VLLKN   +LPL  SK+++ A+IGPNAN+  T+LGNY G  C+ ITPLQ L  +      
Sbjct: 351 VLLKNDGNILPL--SKNINTAVIGPNANATHTMLGNYEGIPCQYITPLQGLVKFGSGDYH 408

Query: 461 NTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
              +  GC   AC     I  AV  A  AD VVL++GL Q QE E LDR  L+LPG QQ 
Sbjct: 409 KVWFSEGCVNTACQQDDQISSAVSTAAVADAVVLVVGLSQVQESEALDRTSLLLPGYQQT 468

Query: 520 LITRVA-EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           LI  VA  AA +PV+LVL+C GPVDI FAK D+ I SILW GYPG++G  A+AEVIFG H
Sbjct: 469 LIDEVAGAAAGRPVVLVLMCAGPVDINFAKNDKRIQSILWVGYPGQSGGQAIAEVIFGAH 528

Query: 579 NPG 581
           NPG
Sbjct: 529 NPG 531


>gi|326489197|dbj|BAK01582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 709

 Score =  584 bits (1506), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 287/516 (55%), Positives = 372/516 (72%), Gaps = 6/516 (1%)

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
           +K+  LVN  PA+ RLGIPAYEWWSEALHGV+ VG G  F+  + GATSFPQ ILTAASF
Sbjct: 7   QKVGFLVNKQPALGRLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASF 66

Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
           ++ L+  IG+ +  EARA++N G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+  KY
Sbjct: 67  NASLFRAIGEVVSTEARAMHNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKY 125

Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
           AV YV G+Q D   GG   G L+ +ACCKH+TAYD+DNWKG  RY FDA+V+ QDL DT+
Sbjct: 126 AVGYVTGLQ-DAGAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTF 184

Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
           QPPF+SCV  G  + +MC+YN+VNG P+CAD++LL    R  W  +GYI SDCD+V ++Y
Sbjct: 185 QPPFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLY 244

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMR 366
             + Y K+PE+A    +K+G+D+NCG+FL +HT AAV+  +L E ++DRA+ N F + MR
Sbjct: 245 TQQHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMR 304

Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
           LG F+G+P    FG +G   VC+ +++ LA + A+ GIVLLKNS G LPL      S+A+
Sbjct: 305 LGFFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAV 363

Query: 427 IGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDI 484
           IGPNAN++ T++GNY G  C+  TPLQ L   V NTVY PGC  V CS  S+    AV  
Sbjct: 364 IGPNANASFTMIGNYEGTPCKYTTPLQGLGAKV-NTVYQPGCTNVGCSGNSLQLSTAVAA 422

Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
           A  AD  VL++G DQ+ E+E LDR  L+LPG+Q +L++ VA A+  PVILV++ GGP DI
Sbjct: 423 AASADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDI 482

Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +FAK    I +ILW GYPGEAG  ALA+++FG HNP
Sbjct: 483 SFAKASDKIAAILWVGYPGEAGGAALADILFGSHNP 518


>gi|242071935|ref|XP_002451244.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
 gi|241937087|gb|EES10232.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
          Length = 790

 Score =  581 bits (1498), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 287/556 (51%), Positives = 381/556 (68%), Gaps = 26/556 (4%)

Query: 35  PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEAL 94
           P  P+T T PFC+ +LP+  RARDLVSRLT  EK+  LVN+A  + RLG+  YEWWSEAL
Sbjct: 38  PGGPAT-TLPFCRQSLPLHARARDLVSRLTRAEKVRLLVNNAAGVARLGVGGYEWWSEAL 96

Query: 95  HGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
           HGV+  G G+ F G   GAT+FPQVI  AA+ ++ LW  IG+A+  EARA+YN G+A G+
Sbjct: 97  HGVSDTGPGVKFGGAFPGATAFPQVIGAAAALNATLWELIGRAVSDEARAMYNGGRA-GL 155

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           TFW+PN+NIFRDPRWGRGQETPGEDP ++ +YA +YVRG+Q   ++  +LK     +ACC
Sbjct: 156 TFWSPNVNIFRDPRWGRGQETPGEDPAISSRYAAAYVRGLQ-QPYDHNRLK----LAACC 210

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KHFTAYDLD+W GT R+ F+A V+ QDL DT+  PF +CV  GRA+ +MC+YN+VNG+P+
Sbjct: 211 KHFTAYDLDSWGGTDRFHFNAVVSPQDLEDTFNVPFRACVAGGRAASVMCSYNQVNGVPT 270

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           CAD+  L  T R+ WG  GYI SDCD+V + +  + Y ++ EDAV   L+AG+D++CG F
Sbjct: 271 CADQGFLRGTIRKAWGLDGYIVSDCDSVDVFFRDQHYTRTAEDAVAATLRAGLDLDCGPF 330

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
           L  +T+ AV +KK+ ++++D AL N  +V+MRLG+F+G+P   PFG +GA  VC+ AHQ 
Sbjct: 331 LALYTENAVARKKVSDADVDAALLNTVTVQMRLGMFDGDPASGPFGHLGAADVCTKAHQD 390

Query: 395 LALQAAQDGIVLLKNSHG-------LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
           LAL AA+  +VLLKN  G       +LPL  +    +A++GP+A++   ++GNYAG  CR
Sbjct: 391 LALDAARQSVVLLKNQRGRKHRDRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAGKPCR 450

Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSAS--IDKAVDIAKGADHVVLMMGLDQTQEKEE 505
             TPLQ +  Y    V+  GC  VAC   +  I  AVD A+          L        
Sbjct: 451 YTTPLQGVAAYAARVVHQAGCADVACQGKNQPIAAAVDAAR---------RLTPPSSSPG 501

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
           L R  L+LPGRQ ELI+ VA+AAK PVILVL+ GGP+DI FA+ D  I  ILW GYPG+A
Sbjct: 502 LTR-SLLLPGRQAELISAVAKAAKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYPGQA 560

Query: 566 GAVALAEVIFGDHNPG 581
           G  A+A+VIFG HNPG
Sbjct: 561 GGQAIADVIFGQHNPG 576


>gi|189380221|gb|ACD93208.1| beta xylosidase [Camellia sinensis]
          Length = 767

 Score =  580 bits (1496), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 292/564 (51%), Positives = 393/564 (69%), Gaps = 30/564 (5%)

Query: 27  TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
           ++P F+CD +   T   PFC+ +LPI  R RDL+ RLTL EKI  LVN+A A+PRLGI  
Sbjct: 27  SRPAFACDGA---TRNLPFCRVSLPIQDRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIKG 83

Query: 87  YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           YEWWSEALHGV+    G+ F G   GATSFPQVI TAASF++ LW  IG+ +  EARA+Y
Sbjct: 84  YEWWSEALHGVSNADPGVKFGGAFPGATSFPQVISTAASFNASLWEHIGRVVSDEARAMY 143

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
           N G A G+T+W+PN+NIFRDPRWGRGQETPGEDP++ GKYA SYVRG+QG++ N      
Sbjct: 144 NGGMA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQGNSGN------ 196

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
           +L+ +ACCKH+TAYDLDNW    RY+F+ARV+ QDLADTY  PF++CV +G+   + CA+
Sbjct: 197 QLKVAACCKHYTAYDLDNWNSVDRYRFNARVSKQDLADTYDVPFKACVVEGKYQ-VYCAH 255

Query: 267 NRVNGIPSCADRNLLSKTARRQ--WGFHGYITSDCDAVSIIYDAEGY------AKSPEDA 318
                I   A+  +L+  + +   W +H ++   C     +Y   G+        +PEDA
Sbjct: 256 T----IKLMANPLVLTLISPQHHPWSWHSWL--HCFR---LYRCWGFICHSTLHSTPEDA 306

Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
               +KAG+D+ CG FL  HT+ AV+Q KL E++++ AL N  SV+MRLG+F+G P+ QP
Sbjct: 307 AAATIKAGLDLECGPFLAIHTEQAVRQGKLGEADVNGALINTLSVQMRLGMFDGEPSSQP 366

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
           +G +G   VC+PAHQ LAL+AA+ GIVLL+N    LPL      ++A+IGPN++   T+L
Sbjct: 367 YGNLGPRDVCTPAHQQLALEAARQGIVLLQNRGRSLPLSTQLHRTVAVIGPNSDVTVTML 426

Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI-AKGADHVVLMMGL 497
           GNYAG +C   TPLQ ++ YV  T++  GCD+VACS+  +    +  A+ AD  VL+MGL
Sbjct: 427 GNYAGVACGFTTPLQGIERYVR-TIHQSGCDSVACSNNQLFGVAETAARQADATVLVMGL 485

Query: 498 DQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
           DQ+ E E  DRV L+LPG QQEL++RVA A++ PV+LVL+ GGP+D++FAK D  IG+IL
Sbjct: 486 DQSIETEFKDRVGLLLPGPQQELVSRVAMASRGPVVLVLMSGGPIDVSFAKNDPRIGAIL 545

Query: 558 WAGYPGEAGAVALAEVIFGDHNPG 581
           W GYPG+AG  A+A+V+FG  NPG
Sbjct: 546 WVGYPGQAGGTAIADVLFGRTNPG 569


>gi|85813772|emb|CAJ65922.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 757

 Score =  577 bits (1486), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 306/584 (52%), Positives = 391/584 (66%), Gaps = 48/584 (8%)

Query: 8   LVFPLLC-LCFTSLLTRVDSTQPPFSCDP-SNPSTETFPFCKTTLPISQRARDLVSRLTL 65
           L F L+C L F+S    +  + P F+CD  SNPS  +F FC T+L +S R  DLV RLTL
Sbjct: 17  LFFSLVCFLLFSSSHVVLAQSSPVFACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRLTL 76

Query: 66  DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAAS 125
            EKI  LVNSA ++ RLGIP YEWWSEALHGV+ VG G  F+  + GATSFPQVILTAAS
Sbjct: 77  QEKILFLVNSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAAS 136

Query: 126 FDSYLWYRIG----QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL 181
           F++ L+  IG    Q +  EARA+YN G A G+TFW+PNINIFRDPRWGRGQETPGEDPL
Sbjct: 137 FNTSLFVAIGKVISQVVSTEARAMYNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPL 195

Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA-RVTMQ 240
           ++ KY   YV+G+Q    + G   G L+ +ACCKH+TAYDLDNWKG  RY F+A  VT Q
Sbjct: 196 LSSKYGSGYVKGLQQR--DDGNPDG-LKVAACCKHYTAYDLDNWKGVDRYHFNAVVVTKQ 252

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG--YITSD 298
           D+ DT+QPPF+SCV  G  + +MC+YN+VNGIP+CAD +LLS   R +W  +G  YI +D
Sbjct: 253 DMDDTFQPPFKSCVVDGNVASVMCSYNKVNGIPTCADPDLLSGVIRGEWKLNGYVYIVTD 312

Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKA--GMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
           CD++ + Y+++ Y K+PE+A    + A  G+D+NCGSFL KHT+AAV    + ES IDRA
Sbjct: 313 CDSIDVFYNSQHYTKTPEEAAAKAILAGIGLDLNCGSFLGKHTEAAVTAGLVNESAIDRA 372

Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
           + N F+  MRLG F+G+P+ Q +GK+G   VC+  +Q LA +AA+ GIVLLKN+      
Sbjct: 373 VSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNT------ 426

Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSA 476
                                     G  C+  TPLQ L   V  T Y PGC  VACS+A
Sbjct: 427 --------------------------GTPCKYTTPLQGLAALVATT-YLPGCSNVACSTA 459

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
            +D A  IA  AD  VL+MG D + E E  DRVD++LPG+QQ LIT VA A+  PVILV+
Sbjct: 460 QVDDAKKIAAAADATVLVMGADLSIEAESRDRVDILLPGQQQLLITAVANASTGPVILVI 519

Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + GG +D++FAK +  I SILW GYPGEAG  A+A++IFG +NP
Sbjct: 520 MSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADIIFGSYNP 563


>gi|449433577|ref|XP_004134574.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
 gi|449530107|ref|XP_004172038.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 812

 Score =  572 bits (1475), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 283/561 (50%), Positives = 376/561 (67%), Gaps = 15/561 (2%)

Query: 31  FSCDPSNPST-----ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
           F CDPS          +F FC ++L   +RA+DL+ R+TL EK +QL + A  + RLG+P
Sbjct: 56  FVCDPSRYDKLGLDFSSFGFCDSSLSFPERAKDLIDRMTLSEKAAQLGHVASGVDRLGLP 115

Query: 86  AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
            Y WWSEALHGV+ VG G  F+  + GATSFP VI TA+SF+  LW  IGQA+  EARA+
Sbjct: 116 PYNWWSEALHGVSNVGPGTQFDKVVPGATSFPNVITTASSFNEDLWKTIGQAVSTEARAM 175

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGK 203
           YN G+A G+T+W+P IN+ RDPRWGR  ETPGEDP V GKYA +YVRG+Q    + N   
Sbjct: 176 YNLGRA-GLTYWSPTINVIRDPRWGRTVETPGEDPFVVGKYAKNYVRGLQDVEGSENVTD 234

Query: 204 LKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
           L  + L+ S+CCKH+ AYD+DNW G  RY FDARVT QD+ +T+  PFE CVK+G  S +
Sbjct: 235 LNSRPLKVSSCCKHYAAYDVDNWLGVERYSFDARVTEQDMLETFNKPFEMCVKEGDVSSV 294

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI-IYDAEGYAKSPEDAVVD 321
           MC+YNRVNGIP+CAD  LL  T R  WG HGYI SDCD+V + + DA     + EDAV  
Sbjct: 295 MCSYNRVNGIPTCADPVLLKDTIRGNWGLHGYIVSDCDSVKVMVEDAHYLQDTNEDAVAQ 354

Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
            LKAG+D++CG     +T++ V+Q K+    ID AL+NL+ V MRLG F+GN   +  GK
Sbjct: 355 TLKAGLDLDCGQIYPNYTESTVRQGKVGMRNIDNALNNLYVVLMRLGYFDGNTGFESLGK 414

Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
                +CS  H  LA +AA+ G VLLKN +  LP   S   +LA++GP+AN+   +LGNY
Sbjct: 415 PD---ICSDEHIELATEAARQGTVLLKNDNDTLPFDPSNYKTLAVVGPHANATSAMLGNY 471

Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQT 500
           AG  CR  +P+  L  Y +   Y  GCD+VAC + + I  A++ A+ +D  V+ +G+D +
Sbjct: 472 AGVPCRMNSPMDGLSEYAK-VKYQMGCDSVACKNDTFIFGAMEAARTSDATVIFVGIDLS 530

Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
            E E LDRVDL+LPG Q +L+ +VA  +K PV+LV+L  G +D++FAK + NI +I+WAG
Sbjct: 531 IEAESLDRVDLLLPGYQTQLVQQVATVSKGPVVLVILSAGGIDVSFAKNNSNIKAIIWAG 590

Query: 561 YPGEAGAVALAEVIFGDHNPG 581
           YPGE G  A+A+VIFG  NPG
Sbjct: 591 YPGEEGGRAIADVIFGKFNPG 611


>gi|168046596|ref|XP_001775759.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672911|gb|EDQ59442.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 784

 Score =  567 bits (1460), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 283/571 (49%), Positives = 382/571 (66%), Gaps = 14/571 (2%)

Query: 23  RVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
           R+   +  ++CDP  P+   FPFC T++    R  DL+SRLT+ EKI QLVN+A  + RL
Sbjct: 12  RIKVYKLQYACDPDGPADLLFPFCNTSISDDDRVEDLISRLTIQEKIEQLVNTAANVSRL 71

Query: 83  GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
           GIP Y+WW E LHGVA +   ++F G    ATSFP   L+  S++  LW +IGQ +  E 
Sbjct: 72  GIPPYQWWGEGLHGVA-ISPSVYFGGATPAATSFPLPCLSVCSYNRTLWNKIGQVVSTEG 130

Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
           RA+YN G++ G+T+W+PNINI RDPRWGR QETPGEDP ++  YAV +V+G+Q   ++  
Sbjct: 131 RAMYNQGRS-GLTYWSPNINIARDPRWGRTQETPGEDPKLSSGYAVHFVKGLQEGDYDQN 189

Query: 203 KLKG------KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
           + +       +L+ SACCKHFTA+DLD WK   R  FD++VT QDL DTY P F+SCVK+
Sbjct: 190 QPQAVSRGPRRLKISACCKHFTAHDLDRWKDYDRDHFDSKVTQQDLEDTYNPSFKSCVKE 249

Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
           G++S +MC+YNR+NGIP C    LL+ T R QWGF GYI SDCDAV++I+D   YA + E
Sbjct: 250 GQSSSVMCSYNRLNGIPMCTHYELLTLTVRNQWGFDGYIVSDCDAVALIHDYINYAPTSE 309

Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
           DAV  V+ AGMD+NCGS    H  AA+ +K + E  ID  L NLF VRMRLG+F+GNP+ 
Sbjct: 310 DAVSYVMLAGMDLNCGSTTLVHGLAALDKKLIWEGLIDMHLRNLFRVRMRLGMFDGNPST 369

Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
            P+G +G + +C+  +Q LAL+AA+  +VLLKN    LP  K+  + LA+IG +A++ + 
Sbjct: 370 LPYGSLGPEDMCTEDNQHLALEAARQSLVLLKNEKNALPWKKTHGLKLAVIGHHADATRE 429

Query: 437 LLGNYAGPSCRSITPLQA----LQNYVENTVYYPGCDTVACSSA-SIDKAVDIAKGADHV 491
           +LGNY G  C+ ++PLQ     L ++     +  GC   AC     I  A + A  AD V
Sbjct: 430 MLGNYEGYPCKFVSPLQGFAKVLSDHSPRISHERGCSDAACEDQFYIYAAKEAAAQADAV 489

Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEA-AKKPVILVLLCGGPVDITFAKYD 550
           VL++G+ Q QEKE  DR  L+LPGRQ EL++ V EA A +PV+LVLL G P+D++FA  D
Sbjct: 490 VLVLGISQAQEKEGRDRDSLLLPGRQMELVSSVVEASAGRPVVLVLLSGSPLDVSFANDD 549

Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             I SI+WAGYPG++G  A+AE IFG  NPG
Sbjct: 550 PRIQSIIWAGYPGQSGGEAIAEAIFGLVNPG 580


>gi|222629257|gb|EEE61389.1| hypothetical protein OsJ_15562 [Oryza sativa Japonica Group]
          Length = 771

 Score =  564 bits (1454), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 287/556 (51%), Positives = 384/556 (69%), Gaps = 23/556 (4%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR---------LGIPA 86
           ++P+   +PFC  TLP   RAR LVS LTLDEKI+QL+      P          +G+P+
Sbjct: 31  ASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLLQHRRGRPPPRRPALRVVVGVPS 90

Query: 87  YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
               + A  G           G +R AT FPQVIL+AA+F+  LW    +AI +EARA++
Sbjct: 91  T---ASATTGPGSTSP----RGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMH 143

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
           NAGQA G+TFWAPNIN+FRDPRWGRGQETPGEDP V   Y+V YV+G Q D       +G
Sbjct: 144 NAGQA-GLTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGE----EG 198

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
           ++  SACCKH+ AYDL+ W+G TRY F+A+V  QD+ DTYQPPF+SC+++GRAS +MC+Y
Sbjct: 199 RMMLSACCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSY 258

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
           N+VNG+P+CA +++L + AR +WGF GYITSDCDAV+II++ + Y  S ED++  VLKAG
Sbjct: 259 NQVNGVPACARKDILQR-ARDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAG 317

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
           MD+NCGSFL +HTK+A+++ K+ E +I+ AL NLFSV++RLG F+     Q F ++G + 
Sbjct: 318 MDINCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNN 377

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           VC+  H+ LA +A + G VLLKN +G LPL +S+   +ALIGP AN    L G+Y G  C
Sbjct: 378 VCTTEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPC 437

Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEE 505
            S T ++ +Q YV  T +  GC  V C+S     +A++ AK AD VVL+ GL+ T+E E+
Sbjct: 438 HSTTFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETED 497

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            DRV L+LPGRQ +LI  VA   KKPV+LVL+ GGPVD++FAK+D  I SILW GYPGE 
Sbjct: 498 HDRVSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEV 557

Query: 566 GAVALAEVIFGDHNPG 581
           G   L E++FG +NPG
Sbjct: 558 GGNVLPEILFGKYNPG 573


>gi|255557375|ref|XP_002519718.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223541135|gb|EEF42691.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 802

 Score =  561 bits (1446), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 278/544 (51%), Positives = 372/544 (68%), Gaps = 11/544 (2%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
           TF FC ++L    RA+DLV+++TL EK+ QL + A  +PRLGIP YEWWSEALHGV+ VG
Sbjct: 65  TFGFCDSSLSYEVRAKDLVNQMTLKEKVQQLGDLAYGVPRLGIPKYEWWSEALHGVSDVG 124

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNI 161
            G FF+  + GATSFP  ILT ASF+  LW  IGQA   +ARA+YN G+A G+T+W+PN+
Sbjct: 125 PGTFFDDLVPGATSFPTTILTTASFNESLWKNIGQA-SAKARAMYNLGRA-GLTYWSPNV 182

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGKLKGK-LQASACCKHFT 218
           N+ RDPRWGR  ETPGEDP V G+YAV+YVRG+Q    T N   L  + L+ S+CCKH+ 
Sbjct: 183 NVVRDPRWGRTVETPGEDPYVVGRYAVNYVRGLQDVEGTENYTDLNTRPLKVSSCCKHYA 242

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
           AYD++ W+G  R  FDARVT QD+ +T+  PFE CVK+G  S +MC++NRVNGIP+CAD 
Sbjct: 243 AYDVEKWQGVERLTFDARVTEQDMVETFLRPFEMCVKEGDVSSVMCSFNRVNGIPTCADP 302

Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY-AKSPEDAVVDVLKAGMDVNCGSFLQK 337
            LL++T R  W  HGYI SDCD++ ++ D   +   + EDAV  VLKAG+D++CG +   
Sbjct: 303 KLLNQTIRGDWDLHGYIVSDCDSIEVMVDNHKFLGDTNEDAVAQVLKAGLDLDCGGYYTN 362

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
            T+ +VKQ K  E  IDR+L  L+ V MRLG F+G P  Q   K+G   +C+  +  LA 
Sbjct: 363 FTETSVKQGKAREEYIDRSLKYLYVVLMRLGFFDGTPQYQ---KLGKKDICTKENVELAK 419

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           QAA++GIVLLKN+   LPL   K  +LA++GP+AN+ + ++GNYAG  CR ++P+     
Sbjct: 420 QAAREGIVLLKNND-TLPLSMDKVKNLAVVGPHANATRVMIGNYAGVPCRYVSPIDGFSI 478

Query: 458 YVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
           Y  N  Y  GCD    + + +  AV  AK AD  +++ GLD T E E LDR DL+LPG Q
Sbjct: 479 Y-SNVTYEIGCDVPCKNESLVFPAVHAAKNADATIIVAGLDLTIEAEGLDRNDLLLPGYQ 537

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
            +LI +VA AA  PVILV++  G VDI+FA+ +  I +ILW GYPG+ G  A+A+V+FG 
Sbjct: 538 TQLINQVAGAANGPVILVIMAAGGVDISFARDNEKIKAILWVGYPGQEGGHAIADVVFGK 597

Query: 578 HNPG 581
           +NPG
Sbjct: 598 YNPG 601


>gi|90399376|emb|CAJ86207.1| B1011H02.4 [Oryza sativa Indica Group]
          Length = 738

 Score =  551 bits (1419), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 281/548 (51%), Positives = 371/548 (67%), Gaps = 40/548 (7%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           ++P+   +PFC  TLP   RAR LVS LTLDEKI+QL N+A   PRLG+P +EWWSE+LH
Sbjct: 31  ASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLH 90

Query: 96  GVAGVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
           GV   G G+ F +G +R AT FPQVIL+AA+F+  LW    +AI +EARA++NAGQA G+
Sbjct: 91  GVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQA-GL 149

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           TFWAPNIN+FRDPRWGRGQETPGEDP V   Y+V YV+G Q D       +G++  SACC
Sbjct: 150 TFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGE----EGRMMLSACC 205

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KH+ AYDL+ W+G TRY F+A+V                                NG+P+
Sbjct: 206 KHYIAYDLEKWRGFTRYTFNAKV--------------------------------NGVPA 233

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           CA +++L + AR +WGF GYITSDCDAV+II++ + Y  S ED++  VLKAGMD+NCGSF
Sbjct: 234 CARKDILQR-ARDEWGFQGYITSDCDAVAIIHENQTYTASDEDSIAVVLKAGMDINCGSF 292

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
           L +HTK+A+++ K+ E +I+ AL NLFSV++RLG F+     Q F ++G + VC+  H+ 
Sbjct: 293 LIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRE 352

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LA +A + G VLLKN +G LPL +S+   +ALIGP AN    L G+Y G  C S T ++ 
Sbjct: 353 LAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKG 412

Query: 455 LQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           +Q YV  T +  GC  V C+S     +A++ AK AD VVL+ GL+ T+E E+ DRV L+L
Sbjct: 413 MQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLL 472

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PGRQ +LI  VA   KKPV+LVL+ GGPVD++FAK+D  I SILW GYPGE G   L E+
Sbjct: 473 PGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEI 532

Query: 574 IFGDHNPG 581
           +FG +NPG
Sbjct: 533 LFGKYNPG 540


>gi|348667575|gb|EGZ07400.1| xylosidase [Phytophthora sojae]
          Length = 751

 Score =  550 bits (1418), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 275/555 (49%), Positives = 365/555 (65%), Gaps = 16/555 (2%)

Query: 33  CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
           CD  +P   + PFC  +LPI  R  DLV+R+ L++ +  LVN A A P + +P+YEWW+E
Sbjct: 24  CD--DPKVSSLPFCDGSLPIDARVSDLVNRIPLEQAVGLLVNKASAAPSVNVPSYEWWNE 81

Query: 93  ALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI 152
           ALHGVA +  G+ F G +  ATSFPQV+ TAASF+  L+Y+I +AI  EARA YN   A 
Sbjct: 82  ALHGVA-LSPGVTFKGPLTAATSFPQVLSTAASFNRTLFYQIAEAISTEARAFYNEKNA- 139

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK---LQ 209
           G+TFW PN+NIFRDPRWGRGQETPGEDP +TG+YAV++VRG+QG+   G + K     L+
Sbjct: 140 GLTFWTPNVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGEAMEGHENKDDNKFLK 199

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
            S+CCKHF+AY  +      R++ DA VT QD ADTY P FE CVK+G  S IMC+YN V
Sbjct: 200 ISSCCKHFSAYSQE----VPRHRNDAIVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAV 255

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           NGIPSCAD+ LL+   R QW F GYITSDC+AV+ +     + +SPE      L AGMD+
Sbjct: 256 NGIPSCADKGLLTDLVRNQWKFDGYITSDCEAVADVIYRHHFTQSPEQTCATTLDAGMDL 315

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           NCG FL++H  +A++Q  +    +  AL N F V MRLG+F      QPF  I  D V +
Sbjct: 316 NCGEFLRQHLSSAIEQGIVSTEMVHNALKNQFRVMMRLGMFEKG--TQPFSNITKDAVDT 373

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPK---SKSVSLALIGPNANSAKTLLGNYAGPSC 446
            AH+ LAL+AA+  +VLLKN    LPL     SK  SLALIGP+ N++  LLGNY G   
Sbjct: 374 AAHRQLALEAARQSVVLLKNEDNTLPLATDVFSKDGSLALIGPHFNASTALLGNYFGIPS 433

Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
             +TPL+ + +YV N  Y  GC          D+A+++ K AD VV+ MGLDQ+QE+EE+
Sbjct: 434 HIVTPLKGVSSYVPNVAYSLGCKVSGEVLPDFDEAIEVVKKADRVVVFMGLDQSQEREEI 493

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  L LPG Q  L+ R+  AA  P++LVL+ GG VD++  K    +G+I++ GY G+AG
Sbjct: 494 DRYHLKLPGFQIALLNRILAAASHPIVLVLISGGSVDLSLYKNHPKVGAIVFGGYLGQAG 553

Query: 567 AVALAEVIFGDHNPG 581
             ALA+++FG ++P 
Sbjct: 554 GQALADMLFGKYSPA 568


>gi|326488213|dbj|BAJ89945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 525

 Score =  550 bits (1417), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 274/499 (54%), Positives = 354/499 (70%), Gaps = 6/499 (1%)

Query: 25  DSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGI 84
           ++  P F+CD SN +   + FC      S RARDLVSRLTL EK+  LVN  PA+ RLGI
Sbjct: 31  EAQAPVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGI 90

Query: 85  PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
           PAYEWWSEALHGV+ VG G  F+  + GATSFPQ ILTAASF++ L+  IG+ +  EARA
Sbjct: 91  PAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARA 150

Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
           ++N G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+T KYAV YV G+Q D   GG  
Sbjct: 151 MHNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQ-DAGAGGVT 208

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
            G L+ +ACCKH+TAYD+DNWKG  RY FDA+V+ QDL DT+QPPF+SCV  G  + +MC
Sbjct: 209 DGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMC 268

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
           +YN+VNG P+CAD++LL    R  W  +GYI SDCD+V ++Y  + Y K+PE+A    +K
Sbjct: 269 SYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIK 328

Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
           +G+D+NCG+FL +HT AAV+  +L E ++DRA+ N F + MRLG F+G+P    FG +G 
Sbjct: 329 SGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGP 388

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
             VC+ +++ LA + A+ GIVLLKNS G LPL      S+A+IGPNAN++ T++GNY G 
Sbjct: 389 KDVCTSSNRELARETARQGIVLLKNS-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGT 447

Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQE 502
            C+  TPLQ L   V NTVY PGC  V CS  S+    AV  A  AD  VL++G DQ+ E
Sbjct: 448 PCKYTTPLQGLGAKV-NTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIE 506

Query: 503 KEELDRVDLVLPGRQQELI 521
           +E LDR  L+LPG+Q +L+
Sbjct: 507 RESLDRTSLLLPGQQTQLV 525


>gi|163889365|gb|ABY48135.1| beta-D-xylosidase [Medicago truncatula]
          Length = 776

 Score =  548 bits (1411), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 281/578 (48%), Positives = 385/578 (66%), Gaps = 44/578 (7%)

Query: 12  LLCLCFTSLLTRVDS--TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
           ++   F+ LL  +    T P + C P +     +PFC  +LPIS R   L+S LTL +KI
Sbjct: 9   IIIFLFSLLLIHLPKFFTTPDYPCKPPH---SHYPFCNISLPISTRTTSLISLLTLSDKI 65

Query: 70  SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
           +QL N+A +I  LGIP+Y+WWSEALHG+A  G G+ FNG+++ AT+FPQVI++AA+F+  
Sbjct: 66  NQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAAFNRS 125

Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
           LW+ IG A+G+E RA++N GQA G++FWAPN+N+FRDPRWGRGQETPGEDP+V   YAV 
Sbjct: 126 LWFLIGYAVGVEGRAMFNVGQA-GLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVE 184

Query: 190 YVRGVQG-----------DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
           +VRG+QG           D+ + G     L  SACCKHFTAYDL+ W   +RY F+A V 
Sbjct: 185 FVRGIQGVDGIKKVLNDHDSDDDG-----LMVSACCKHFTAYDLEKWGEFSRYNFNAVV- 238

Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY-ITS 297
                +TYQPPF  CV+QG+AS +MC+YN VNG+P+CA ++LL    R +WGF G  I  
Sbjct: 239 -----NTYQPPFRGCVQQGKASCLMCSYNEVNGVPACASKDLLG-LVRNKWGFEGVGILP 292

Query: 298 DCDAVSIIY-DAEGYAKSPEDAVVDVLKA-----------GMDVNCGSFLQKHTKAAVKQ 345
               + +++   +     P+  ++  LK             MD+NCG+F+ +HT++A++Q
Sbjct: 293 QTVMLWLLFLSIKSMQNLPKMLLLMFLKQVFFYVFENLWFCMDINCGTFMLRHTESAIEQ 352

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E ++DRAL NLFSV+MRLGLFNG+P    FGK+G   VC+P H+ LAL+AA+ GIV
Sbjct: 353 GLVKEEDLDRALFNLFSVQMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLALEAARQGIV 412

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYY 465
           LLKN +  LPL K   VSLA+IGP A +++ L G Y+G  C   +    L+ YV+   Y 
Sbjct: 413 LLKNDNKFLPLDKKDRVSLAIIGPMATTSE-LGGGYSGIPCSPRSLYDGLKEYVKTISYA 471

Query: 466 PGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
            GC  V C S      A+DIAK AD VV++ GLD T E E+LDRV L+LPG+Q +L++RV
Sbjct: 472 FGCSDVKCDSDDGFAVAIDIAKQADFVVIVAGLDTTLETEDLDRVSLLLPGKQMDLVSRV 531

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
           A A+K+PVILVL  GGP+D++FA+ ++ I SILW GYP
Sbjct: 532 AAASKRPVILVLTGGGPLDVSFAESNQLITSILWIGYP 569


>gi|225432136|ref|XP_002274651.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 809

 Score =  546 bits (1408), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 271/561 (48%), Positives = 372/561 (66%), Gaps = 17/561 (3%)

Query: 31  FSCDPSNPST-----ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
           + CD S  +      + F +C ++ P   RA+DLV R+TL EK+ Q  + A  + R+G+P
Sbjct: 51  YVCDESRFAALGLDMKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLP 110

Query: 86  AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
            Y WWSEALHGV+  G+ +FF+  + GATSFP VIL+AASF+  LW  +GQA+  EARA+
Sbjct: 111 KYNWWSEALHGVSNFGRCVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAM 170

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGK 203
           YN+G A G+TFW+PNIN+ RDPRWGR  ETPGEDP + G YAV+YVRG+Q      N   
Sbjct: 171 YNSGNA-GLTFWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNYVRGLQDVVGAENTTD 229

Query: 204 LKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
           L  + L+ S+CCKH+ AYDLDNWKG  R  FDARV++QD+A+T+  PFE CVK+G  S +
Sbjct: 230 LNSRPLKVSSCCKHYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSV 289

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSII-YDAEGYAKSPEDAVVD 321
           MC+YN++NGIPSCAD  LL +T R +W  HGYI SDCD+V ++  D +    S  D+   
Sbjct: 290 MCSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQ 349

Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
            L AGM+++CG+F  +    AV Q K  ++++D +L  L+ + MR+G F+G P    F  
Sbjct: 350 ALNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPA---FAS 406

Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
           +G D +CS  H  LA +AA+ GIVLLKN +  LPL   K++  AL+GP+AN+   ++GNY
Sbjct: 407 LGKDDICSAEHIELAREAARQGIVLLKNDNATLPLKSVKNI--ALVGPHANATDAMIGNY 464

Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQT 500
           AG  C  ++PL A  +  E   Y  GC  V C +   I  A++ AK AD  ++  G D +
Sbjct: 465 AGIPCYYVSPLDAFSSMGE-VRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLS 523

Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
            E E LDRVDL+LPG Q +LI +VA+ +  PV+LV++ GG VDI+FA+ +  I +ILWAG
Sbjct: 524 IEAEALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAG 583

Query: 561 YPGEAGAVALAEVIFGDHNPG 581
           YPGE G  A+A+VI G +NPG
Sbjct: 584 YPGEQGGNAIADVILGKYNPG 604


>gi|40363751|dbj|BAD06320.1| putative beta-xylosidase [Triticum aestivum]
          Length = 573

 Score =  543 bits (1400), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 251/374 (67%), Positives = 305/374 (81%)

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           L+ASACCKHFTAYDL+NWKG TR+ FDA+VT QDLADTY PPF+SCV+ G ASGIMC+YN
Sbjct: 5   LEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCSYN 64

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
           RVNG+P+CAD NLLSKTAR  W F+GYITSDCDAV+II+D +GYAK+PEDAV DVLKAGM
Sbjct: 65  RVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLKAGM 124

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           DVNCG ++Q H  +A +Q K+   +IDRAL NLF++RMRLGLFNGNP    +G IGAD V
Sbjct: 125 DVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFNGNPKYNRYGNIGADQV 184

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
           C   HQ LALQAAQDGIVLLKN  G LPL KSK  S+A+IGPN N+A  LLGNY GP C 
Sbjct: 185 CKKEHQDLALQAAQDGIVLLKNDAGALPLSKSKVSSVAVIGPNGNNASLLLGNYFGPPCI 244

Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
           S+TP QALQ YV++  +  GC+   C+ ++I +AV  A  AD+VVL MGLDQ QE+EE+D
Sbjct: 245 SVTPFQALQGYVKDATFVQGCNAAVCNVSNIGEAVHAASSADYVVLFMGLDQNQEREEVD 304

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R++L LPG Q+ L+ +VA+AAKKPVILVLLCGGPVD+TFAK +  IG+I+WAGYPG+AG 
Sbjct: 305 RLELGLPGMQESLVNKVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQAGG 364

Query: 568 VALAEVIFGDHNPG 581
           +A+A+V+FG+HNPG
Sbjct: 365 IAIAQVLFGEHNPG 378


>gi|224093292|ref|XP_002309869.1| predicted protein [Populus trichocarpa]
 gi|222852772|gb|EEE90319.1| predicted protein [Populus trichocarpa]
          Length = 694

 Score =  540 bits (1390), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 265/528 (50%), Positives = 362/528 (68%), Gaps = 22/528 (4%)

Query: 58  DLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFP 117
           DLV+++TL+EK+ QL N A  +PRLG+  Y+WWSEALHGV+ VG G FF+  I G+TSFP
Sbjct: 2   DLVNQMTLNEKVLQLGNKAYGVPRLGLAEYQWWSEALHGVSNVGPGTFFDDLIPGSTSFP 61

Query: 118 QVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPG 177
            VI TAA+F+  LW  IGQA+  EARA+YN G+A G+T+W+PNIN+ RDPRWGR  ETPG
Sbjct: 62  TVITTAAAFNESLWKVIGQAVSTEARAMYNLGRA-GLTYWSPNINVVRDPRWGRAIETPG 120

Query: 178 EDPLVTGKYAVSYVRGVQ---GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
           EDP + G+YAV+YVRG+Q   G           L+ S+CCKH+ AYD+DNWKG  RY FD
Sbjct: 121 EDPYLVGRYAVNYVRGLQDVEGSENYTDPNSRPLKVSSCCKHYAAYDVDNWKGVERYTFD 180

Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
           ARV+ QD+ +T+  PFE CVK G  S +MC+YNRVNGIP+CAD  LL++T R  W  HGY
Sbjct: 181 ARVSEQDMVETFLRPFEMCVKDGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWDLHGY 240

Query: 295 ITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEID 354
           I SDCD++ ++ +   +              G+D++CG++  ++ +AAV+Q K+ E++ID
Sbjct: 241 IVSDCDSLQVMVENHKWL-------------GLDLDCGAYYTENVEAAVRQGKVREADID 287

Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
           ++L+ L+ V MRLG F+G P    FGK   + VCS  +  LA +AA++G VLLKN +  L
Sbjct: 288 KSLNFLYVVLMRLGFFDGIPQYNSFGK---NDVCSKENIELATEAAREGAVLLKNENDSL 344

Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACS 474
           PL   K  +LA+IGP++N+   ++GNYAG  C+ ITP++ L  Y +   Y  GC  +AC 
Sbjct: 345 PLSIEKVKTLAVIGPHSNATSAMIGNYAGIPCQIITPIEGLSKYAK-VDYQMGCSDIACK 403

Query: 475 SAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
             S I  A++ AK AD  +++ G+D + E E LDR DL+LPG Q +LI +VA  +  PV+
Sbjct: 404 DESFIFPAMESAKKADATIILAGIDLSIEAESLDRDDLLLPGYQTQLINQVASVSNGPVV 463

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           LVL+  G VDI+FAK + +I SILW GYPGE G  A+A+VIFG +NPG
Sbjct: 464 LVLMSAGGVDISFAKSNGDIKSILWVGYPGEEGGNAIADVIFGKYNPG 511


>gi|222629651|gb|EEE61783.1| hypothetical protein OsJ_16354 [Oryza sativa Japonica Group]
          Length = 771

 Score =  535 bits (1377), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 273/531 (51%), Positives = 353/531 (66%), Gaps = 51/531 (9%)

Query: 75  SAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRI 134
           S  A+PRLGIPAYEWWSEALHGV+ VG G  F+  + GATSFPQ ILTAASF++ L+  I
Sbjct: 41  SGRALPRLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAI 100

Query: 135 GQA------------------------------------------IGLEARALYNAGQAI 152
           G++                                          +  EARA++N G A 
Sbjct: 101 GESACNNTSQFFFSSKSPFSICIAMENLHCDFRSRLVRFYRGARVVSTEARAMHNVGLA- 159

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
           G+TFW+PNINIFRDPRWGRGQETPGEDPL+  KYAV YV G+Q    + G     L+ +A
Sbjct: 160 GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ----DAGGGSDALKVAA 215

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
           CCKH+TAYD+DNWKG  RY FDA V+ QDL DT+QPPF+SCV  G  + +MC+YN+VNG 
Sbjct: 216 CCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGK 275

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
           P+CAD++LLS   R  W  +GYI SDCD+V ++Y+ + Y K+PEDA    +K+G+D+NCG
Sbjct: 276 PTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNPEDAAAITIKSGLDLNCG 335

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
           +FL +HT AAV+  KL ES++DRA+ N F V MRLG F+G+P   PFG +G   VC+ ++
Sbjct: 336 NFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTSSN 395

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           Q LA +AA+ GIVLLKN+ G LPL      S+A+IGPNAN++ T++GNY G  C+  TPL
Sbjct: 396 QELAREAARQGIVLLKNT-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPL 454

Query: 453 QALQNYVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
           Q L   V  TVY PGC  V CS  S+    A   A  AD  VL++G DQ+ E+E LDR  
Sbjct: 455 QGLGANVA-TVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDRTS 513

Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
           L+LPG+Q +L++ VA A++ PVILV++ GGP DI+FAK    I +ILW GY
Sbjct: 514 LLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGY 564


>gi|225432132|ref|XP_002274591.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Vitis vinifera]
          Length = 805

 Score =  534 bits (1375), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 257/561 (45%), Positives = 369/561 (65%), Gaps = 17/561 (3%)

Query: 31  FSCDPSNPS-----TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
           + CD S  +      ++F FC  +L   +RA+DLVSR+TL EK+ Q V++A  + RLG+P
Sbjct: 47  YVCDESRYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLP 106

Query: 86  AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
            Y WWSEALHG++ +G G+FF+ TI GATS P VIL+ A+F+  LW  +G+ +  E RA+
Sbjct: 107 EYSWWSEALHGISNLGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAM 166

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGK 203
           YN G A G+TFW+PNIN+ RD RWGR QET GEDP + G++AV+YVRG+Q    T N   
Sbjct: 167 YNLGHA-GLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTD 225

Query: 204 LKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
           L  + L+ S+CCKH+ AYD+D+W    R+ FDARV+ QD+ +T+  PFE CV++G  S +
Sbjct: 226 LNSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSV 285

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK-SPEDAVVD 321
           MC++N++NGIP C+D  LL    R +W  HGYI SDC  + +I D + Y   S  DAV  
Sbjct: 286 MCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAK 345

Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
            L+AG+D+ CG +       +V   K+ + E+DRAL N++ + MR+G F+G P  +    
Sbjct: 346 TLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYE---S 402

Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
           +G   +C+  H  LA +AA+ GIVLLKN + +LPL   K +  AL+GP+AN+ + ++GNY
Sbjct: 403 LGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKPGKKI--ALVGPHANATEVMIGNY 460

Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQT 500
           AG  C+ ++PL+A    + N  Y  GC   +CS+ +   +A + AK A+  ++ +G D +
Sbjct: 461 AGLPCKYVSPLEAFSA-IGNVTYATGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLS 519

Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
            E E +DRVD +LPG Q ELI +VAE +  PVILV+L G  +DITFAK +  I +ILW G
Sbjct: 520 IEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVG 579

Query: 561 YPGEAGAVALAEVIFGDHNPG 581
           +PGE G  A+A+V+FG +NPG
Sbjct: 580 FPGEQGGHAIADVVFGKYNPG 600


>gi|225432134|ref|XP_002274619.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 805

 Score =  521 bits (1342), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 256/544 (47%), Positives = 358/544 (65%), Gaps = 12/544 (2%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           F +C ++LP   R +DLV R+TL+EK   +++ A  +PR+G+P Y+WWSEALHGVA VG 
Sbjct: 65  FVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANVGS 124

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
             FF+  + GATSFP VIL+AASF+  LW  +GQ +  EARA+YN G A G+TFW+PNIN
Sbjct: 125 ATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHA-GLTFWSPNIN 183

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGKLKGK-LQASACCKHFTA 219
           + RDPRWGR  ETPGEDPL  G Y V+YVRG+Q    T N   L  + L+ ++ CKHF A
Sbjct: 184 VARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASSCKHFAA 243

Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
           YDLD W    R  FDA+V+ QD+ +T+  PFE CVK+G  S +MC++N +NGIP CAD  
Sbjct: 244 YDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIPPCADPR 303

Query: 280 LLSKTARRQWGFHGYITSDCDAV-SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
            L    R QW  HGYI SDC A+ +I+ D +    + E+ V   +KAG+D+ CG +    
Sbjct: 304 FLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECGHYYNDS 363

Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
              AV++ ++ E ++D++L  L+ V MR+G F+G P++   GK     +C+  H  LA +
Sbjct: 364 LATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPSLASLGKKD---ICNDEHIELARE 420

Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
           AA+ GIVLLKN +  LPL   K   LAL+GP+AN+   ++GNYAG  C  ++PL A    
Sbjct: 421 AARQGIVLLKNDNATLPLKPVKK--LALVGPHANATVAMIGNYAGIPCHYVSPLDAFSE- 477

Query: 459 VENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
           + +  Y  GC  V C +   + KA + AK AD  ++++G D + E EE DR DL+LPG Q
Sbjct: 478 LGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPGYQ 537

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
            E++ +V + +  PVILV++CGGP+DI+FAK +  I +ILWAG+PGE G  A+A+++FG 
Sbjct: 538 TEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVFGK 597

Query: 578 HNPG 581
           +NPG
Sbjct: 598 YNPG 601


>gi|297736787|emb|CBI25988.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  521 bits (1342), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 256/544 (47%), Positives = 358/544 (65%), Gaps = 12/544 (2%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           F +C ++LP   R +DLV R+TL+EK   +++ A  +PR+G+P Y+WWSEALHGVA VG 
Sbjct: 65  FVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANVGS 124

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
             FF+  + GATSFP VIL+AASF+  LW  +GQ +  EARA+YN G A G+TFW+PNIN
Sbjct: 125 ATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHA-GLTFWSPNIN 183

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGKLKGK-LQASACCKHFTA 219
           + RDPRWGR  ETPGEDPL  G Y V+YVRG+Q    T N   L  + L+ ++ CKHF A
Sbjct: 184 VARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASSCKHFAA 243

Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
           YDLD W    R  FDA+V+ QD+ +T+  PFE CVK+G  S +MC++N +NGIP CAD  
Sbjct: 244 YDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIPPCADPR 303

Query: 280 LLSKTARRQWGFHGYITSDCDAV-SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
            L    R QW  HGYI SDC A+ +I+ D +    + E+ V   +KAG+D+ CG +    
Sbjct: 304 FLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECGHYYNDS 363

Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
              AV++ ++ E ++D++L  L+ V MR+G F+G P++   GK     +C+  H  LA +
Sbjct: 364 LATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPSLASLGKKD---ICNDEHIELARE 420

Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
           AA+ GIVLLKN +  LPL   K   LAL+GP+AN+   ++GNYAG  C  ++PL A    
Sbjct: 421 AARQGIVLLKNDNATLPLKPVKK--LALVGPHANATVAMIGNYAGIPCHYVSPLDAFSE- 477

Query: 459 VENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
           + +  Y  GC  V C +   + KA + AK AD  ++++G D + E EE DR DL+LPG Q
Sbjct: 478 LGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPGYQ 537

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
            E++ +V + +  PVILV++CGGP+DI+FAK +  I +ILWAG+PGE G  A+A+++FG 
Sbjct: 538 TEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVFGK 597

Query: 578 HNPG 581
           +NPG
Sbjct: 598 YNPG 601


>gi|297736788|emb|CBI25989.3| unnamed protein product [Vitis vinifera]
          Length = 746

 Score =  520 bits (1339), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 260/558 (46%), Positives = 357/558 (63%), Gaps = 39/558 (6%)

Query: 31  FSCDPSNPST-----ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
           + CD S  +      + F +C ++ P   RA+DLV R+TL EK+ Q  + A  + R+G+P
Sbjct: 51  YVCDESRFAALGLDMKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLP 110

Query: 86  AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
            Y WWSEALHGV+  G+ +FF+  + GATSFP VIL+AASF+  LW  +GQA+  EARA+
Sbjct: 111 KYNWWSEALHGVSNFGRCVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAM 170

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
           YN+G A G+TFW+PNIN+ RDPRWGR  ETPGEDP + G YAV+Y               
Sbjct: 171 YNSGNA-GLTFWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNY--------------- 214

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
                     H+ AYDLDNWKG  R  FDARV++QD+A+T+  PFE CVK+G  S +MC+
Sbjct: 215 ----------HYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVMCS 264

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSII-YDAEGYAKSPEDAVVDVLK 324
           YN++NGIPSCAD  LL +T R +W  HGYI SDCD+V ++  D +    S  D+    L 
Sbjct: 265 YNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQALN 324

Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
           AGM+++CG+F  +    AV Q K  ++++D +L  L+ + MR+G F+G P    F  +G 
Sbjct: 325 AGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPA---FASLGK 381

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
           D +CS  H  LA +AA+ GIVLLKN +  LPL   K++  AL+GP+AN+   ++GNYAG 
Sbjct: 382 DDICSAEHIELAREAARQGIVLLKNDNATLPLKSVKNI--ALVGPHANATDAMIGNYAGI 439

Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
            C  ++PL A  +  E   Y  GC  V C +   I  A++ AK AD  ++  G D + E 
Sbjct: 440 PCYYVSPLDAFSSMGE-VRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEA 498

Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
           E LDRVDL+LPG Q +LI +VA+ +  PV+LV++ GG VDI+FA+ +  I +ILWAGYPG
Sbjct: 499 EALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPG 558

Query: 564 EAGAVALAEVIFGDHNPG 581
           E G  A+A+VI G +NPG
Sbjct: 559 EQGGNAIADVILGKYNPG 576


>gi|326431595|gb|EGD77165.1| beta-glucosidase [Salpingoeca sp. ATCC 50818]
          Length = 900

 Score =  518 bits (1334), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 264/559 (47%), Positives = 370/559 (66%), Gaps = 13/559 (2%)

Query: 24  VDSTQP-PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
           +D+  P P +CD      ++ PFC T L    R RDL+SR+   +    LVNSA  +  L
Sbjct: 164 LDAGSPTPRTCDVE--PGKSLPFCNTALSYDDRIRDLISRINDSDLPGLLVNSATGVEHL 221

Query: 83  GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
            +PAY+WWSEALHGV G   G+ F G +  ATSFPQVI T A+F+  L+ +IG  I  EA
Sbjct: 222 NLPAYQWWSEALHGV-GHSPGVHFGGDVPAATSFPQVIHTGATFNKTLYRKIGTVISTEA 280

Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
           RA+ N  Q  G TFWAPNINI RDPRWGRGQETPGEDP  TG+YA ++V G     F  G
Sbjct: 281 RAMNNV-QRAGNTFWAPNINIIRDPRWGRGQETPGEDPFATGEYAANFVSG-----FQDG 334

Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
           +    ++AS+CCKHF  Y+L+NW G  R+ ++A  T QD+ADTY P FE+CV+ GRASG+
Sbjct: 335 EDMNYIKASSCCKHFFDYNLENWHGVDRHHYNAIATDQDIADTYLPSFEACVRYGRASGL 394

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           MC+YN VNG+PSCA+ ++++  AR  WGF GYITSDC AV+ + ++  + ++  + +  V
Sbjct: 395 MCSYNAVNGVPSCANGDIMTVMARESWGFDGYITSDCGAVADVLNSHKFTRNTSETIRAV 454

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           L+AGMD +CGSF+Q++   A+++  +P   ++ ALH LF V+ RLGLF+   + QP+   
Sbjct: 455 LEAGMDTDCGSFVQQYLAKAMQEGVVPRELVNTALHRLFMVQFRLGLFD-PVSKQPYTNY 513

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
               V +PA+Q LAL+AAQ GIVLLKN++  LPL     + +ALIGPNA++   + GNY 
Sbjct: 514 SVARVNTPANQQLALEAAQQGIVLLKNTNARLPL--KTGLHVALIGPNADATTVMQGNYQ 571

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
           G +   I+P++  +NY     Y  GCD     ++  D AV  AK AD VV+++GLDQ QE
Sbjct: 572 GTAPFLISPVRGFKNYSAAVTYAKGCDVACKDTSGFDAAVAAAKEADAVVVVVGLDQGQE 631

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E  DR  + LPG Q++L+ +VA AAK P+++ ++ GG VD++  K ++N+  ILW GYP
Sbjct: 632 SEGHDRTSITLPGHQEDLVAQVAAAAKSPIVVFVMTGGAVDLSTIKANKNVAGILWCGYP 691

Query: 563 GEAGAVALAEVIFGDHNPG 581
           G++G  A+A+V+FG  +PG
Sbjct: 692 GQSGGQAMADVVFGAVSPG 710


>gi|357153280|ref|XP_003576399.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
           distachyon]
          Length = 807

 Score =  514 bits (1324), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 269/575 (46%), Positives = 357/575 (62%), Gaps = 68/575 (11%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG- 101
           + +C   LP   R RDL+  +T++EK+S L + A   PR+G+P Y+WWSEALHG++  G 
Sbjct: 60  YRYCDAKLPYGDRVRDLIGWMTVEEKVSNLGDWAAGAPRVGLPPYKWWSEALHGLSSTGP 119

Query: 102 ---------------KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
                          +   FNGT+     F  VI +AASF+  LW  IGQAI  EARA+Y
Sbjct: 120 TTKFDDLKKPRLHSGRAAVFNGTV-----FANVINSAASFNESLWRSIGQAISTEARAMY 174

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ-----GDTFNG 201
           N G+  G+T+W+PNIN+ RDPRWGR  ETPGEDP V G+YAV++VRG+Q        FNG
Sbjct: 175 NLGKG-GLTYWSPNINVVRDPRWGRALETPGEDPFVVGRYAVNFVRGMQDVDDAAAGFNG 233

Query: 202 GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
             L   L+ SACCKH+ AYD+D+W G TR+KFDARVT +D+ +T+Q PFE CV+ G AS 
Sbjct: 234 DPLSRPLKTSACCKHYAAYDVDDWYGHTRFKFDARVTERDMVETFQRPFEMCVRDGDASA 293

Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD-AEGYAKSPEDAVV 320
           +MC+YNRVNGIP+CAD  LL+ T RR WG HGYI SDCDAV ++ D A     +P +A  
Sbjct: 294 VMCSYNRVNGIPACADARLLAGTLRRDWGLHGYIVSDCDAVRVMTDNATWLGYTPAEASA 353

Query: 321 DVLKAGMDVNCGS------------FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG 368
             LKAG+D++CG             FL  +  AAV+Q K+ ES+ID AL NL++  MRLG
Sbjct: 354 ASLKAGLDLDCGESWIVQKGKPVMDFLSTYGMAAVRQGKMRESDIDNALVNLYTTLMRLG 413

Query: 369 LFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
            F+G P  +    +    +CS AH+ LAL  A+  +VLLKN  GLLPL  SK  S+A+ G
Sbjct: 414 YFDGMPRYE---SLDEKDICSEAHRSLALDGARQSMVLLKNLDGLLPLDASKLASVAVRG 470

Query: 429 PNANS-AKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIA-K 486
           P+A +  K + G+Y GP CR ITP +                        I K V+I+ +
Sbjct: 471 PHAEAPEKVMDGDYTGPPCRYITPREG-----------------------ISKDVNISQQ 507

Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
           G D  + M G++   E+E  DR DL+LP  Q E I RVA A+  P++LV+L GG +D++F
Sbjct: 508 GGDVTIYMGGINMHIEREGNDREDLLLPKNQTEEILRVAAASPSPIVLVILSGGGIDVSF 567

Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           A+    IG+ILWAGYPG  G  A+A+VIFG +NPG
Sbjct: 568 AQSHPKIGAILWAGYPGGEGGHAIADVIFGRYNPG 602


>gi|297736786|emb|CBI25987.3| unnamed protein product [Vitis vinifera]
          Length = 745

 Score =  513 bits (1321), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 247/557 (44%), Positives = 353/557 (63%), Gaps = 42/557 (7%)

Query: 31  FSCDPSNPS-----TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
           + CD S  +      ++F FC  +L   +RA+DLVSR+TL EK+ Q V++A  + RLG+P
Sbjct: 47  YVCDESRYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLP 106

Query: 86  AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
            Y WWSEALHG++ +G G+FF+ TI GATS P VIL+ A+F+  LW  +G+ +  E RA+
Sbjct: 107 EYSWWSEALHGISNLGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAM 166

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
           YN G A G+TFW+PNIN+ RD RWGR QET GEDP + G++AV+YVRG+Q        ++
Sbjct: 167 YNLGHA-GLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQ-------DVE 218

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
           G    S+CCKH+ AYD+D+W    R+ FDARV+ QD+ +T+  PFE CV++G  S +MC+
Sbjct: 219 GTENVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVMCS 278

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK-SPEDAVVDVLK 324
           +N++NGIP C+D  LL    R +W  HGYI SDC  + +I D + Y   S  DAV   L+
Sbjct: 279 FNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKTLQ 338

Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
           AG+D+ CG +       +V   K+ + E+DRAL N++ + MR+G F+G P  +    +G 
Sbjct: 339 AGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYE---SLGL 395

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
             +C+  H  LA +AA+ GIVLLKN + +LPL   K +  AL+GP+AN+ + ++GNYAG 
Sbjct: 396 KDICAADHIELAREAARQGIVLLKNDYEVLPLKPGKKI--ALVGPHANATEVMIGNYAGL 453

Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
            C+ ++PL+A    + N  Y  G                        ++ +G D + E E
Sbjct: 454 PCKYVSPLEAFSA-IGNVTYATG----------------------FTIIFVGTDLSIEAE 490

Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
            +DRVD +LPG Q ELI +VAE +  PVILV+L G  +DITFAK +  I +ILW G+PGE
Sbjct: 491 FVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGE 550

Query: 565 AGAVALAEVIFGDHNPG 581
            G  A+A+V+FG +NPG
Sbjct: 551 QGGHAIADVVFGKYNPG 567


>gi|242052713|ref|XP_002455502.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
 gi|241927477|gb|EES00622.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
          Length = 825

 Score =  512 bits (1319), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 256/557 (45%), Positives = 361/557 (64%), Gaps = 23/557 (4%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           F +C  +LP ++R RDLV RL+L+EK+  L + A   PR+G+P Y+WW EALHGV+ VG 
Sbjct: 66  FRYCDASLPYAERVRDLVGRLSLEEKVRNLGDQAEGAPRVGLPPYKWWGEALHGVSDVGP 125

Query: 103 G-IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNI 161
           G  +F   + GATSFP VI +AA+F+  LW  IG  +  E RA+YN G A  +T+W+PNI
Sbjct: 126 GGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMYNLGHA-ELTYWSPNI 184

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK------LKGKLQASACCK 215
           N+ RDPRWGR  ETPGEDP V G+YAV++VRG+Q      G           ++ S+CCK
Sbjct: 185 NVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVVIAAGAAATADPFSRPIKVSSCCK 244

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           HF AYD+D W    R  FDA+V  +D+ +T++ PFE C++ G AS +MC+YNR+NGIP+C
Sbjct: 245 HFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDASCVMCSYNRINGIPAC 304

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSI-IYDAEGYAKSPEDAVVDVLKAGMDVNCG-- 332
           AD  LLS+T R QW  HGYI SDCD+V + + DA+    +  +A    +KAG+D++CG  
Sbjct: 305 ADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEATAAAMKAGLDLDCGMF 364

Query: 333 -----SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
                 F   +   AV+Q K+ E+++D AL N+++  MRLG F+G P    F  +GAD V
Sbjct: 365 WEGARDFFTTYGVDAVRQGKIKEADVDNALGNVYTTLMRLGFFDGMPE---FESLGADDV 421

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG--PNANSAKTLLGNYAGPS 445
           C+  H+ LA  AA+ G+VLLKN    LPL  SK  S++L+G   + N+   +LG+Y G  
Sbjct: 422 CTRDHKELAADAARQGMVLLKNDARRLPLDPSKINSVSLVGLLEHINATDVMLGDYRGKP 481

Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKE 504
           CR +TP  A++  V N  Y   CD+ ACS+A  + +A   AK AD  +++ GL+ + E+E
Sbjct: 482 CRIVTPYDAIRQVV-NATYVHACDSGACSTAEGMGRASRTAKIADATIVIAGLNMSVERE 540

Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
             DR DL+LP  Q   I  VAEA+  P++LV++  G VD++FA+ +  IG+I+WAGYPGE
Sbjct: 541 SNDREDLLLPWNQSSWINAVAEASTTPIVLVIMSAGGVDVSFAQNNTKIGAIVWAGYPGE 600

Query: 565 AGAVALAEVIFGDHNPG 581
            G  A+A+V+FG +NPG
Sbjct: 601 EGGTAIADVLFGKYNPG 617


>gi|301110280|ref|XP_002904220.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
 gi|262096346|gb|EEY54398.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
          Length = 709

 Score =  511 bits (1316), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 259/541 (47%), Positives = 350/541 (64%), Gaps = 18/541 (3%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
            C TT     R+   ++R+ LD+ +  LVN A   P + IP+YEWW+EALHGVA +  G+
Sbjct: 1   MCATT----PRSLHCLTRIPLDQAVGLLVNKAAPAPSVNIPSYEWWNEALHGVA-LSPGV 55

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
            F G+I  ATSFPQV+ TAASF+  L+Y+I   I  EARA +NA  A G+TFW PN+NIF
Sbjct: 56  TFKGSITAATSFPQVLSTAASFNRSLFYQIADVISTEARAFHNAKDA-GLTFWTPNVNIF 114

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK--LQASACCKHFTAYDL 222
           RDPRWGRGQETPGEDP +TG+YAV++VRG+QG+   G +++    L+ S+CCKHF+AY  
Sbjct: 115 RDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGEGMEGREVENSKFLKISSCCKHFSAYSQ 174

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           +      R++ +A VT QD ADTY P FE CVK+G  S IMC+YN VNGIPSCAD+ LL+
Sbjct: 175 E----VPRHRNNAMVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVNGIPSCADKGLLT 230

Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
              R QW F GYI SDC+AV+ + D   Y +SPE      L AGMD+NCG FL++H   A
Sbjct: 231 DLVRGQWKFDGYIASDCEAVADVIDHHHYTQSPEQTCATTLDAGMDLNCGEFLRQHLPKA 290

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
           ++Q  +    I  AL N F V MRLG+F     ++PF  I  D V +  H+ LAL+AA+ 
Sbjct: 291 LEQGIVTTEMIHNALKNQFRVLMRLGMFE---KVEPFANITKDSVDTTMHRQLALEAARQ 347

Query: 403 GIVLLKNSHGLLPLPK---SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
            IVLLKN    LPL     ++  SLALIGP+ N++  LLGNY G     +TPL+ +  +V
Sbjct: 348 SIVLLKNDGNTLPLATKDFTRDRSLALIGPHFNASAALLGNYFGIPSHIVTPLEGISQFV 407

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
            N  +  GC          D A+ +AK AD +++ +GLDQ+QE+EE+DR  + LP  Q  
Sbjct: 408 PNVAHSLGCKVSGEVLPDFDDAIAVAKKADRLIVFVGLDQSQEREEIDRYHIGLPAFQST 467

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           L+ RV E A  P++ V++ GG VD++  K    +G+I++ GY G+AG  ALA+V+FG +N
Sbjct: 468 LLKRVLEVASHPIVFVVISGGCVDLSAYKNHPKVGAIVFGGYLGQAGGQALADVLFGKYN 527

Query: 580 P 580
           P
Sbjct: 528 P 528


>gi|359477633|ref|XP_003632006.1| PREDICTED: LOW QUALITY PROTEIN: beta-D-xylosidase 3-like [Vitis
           vinifera]
          Length = 781

 Score =  510 bits (1313), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 264/565 (46%), Positives = 368/565 (65%), Gaps = 22/565 (3%)

Query: 33  CDPSNPST-----ETFPFCKTTLPISQ-RARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
           CDP+  +      + F +C ++LPI   R +DLV R+TL+EK + ++  A  + R+G+P 
Sbjct: 17  CDPARFAALGFDMKDFVYCNSSLPIYDVRVKDLVDRMTLEEKATNVIYKAAGVERIGLPP 76

Query: 87  YEWWSEALHGVAGVGKG--IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
           Y+WWSEALHGV+ V      FF+ T+ GATSFP VIL+AASF+  LW  I Q +  EARA
Sbjct: 77  YQWWSEALHGVSSVSINGPTFFDETVPGATSFPNVILSAASFNQSLWKTIRQVVSKEARA 136

Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGG 202
            YN G A G+TFW PN+N+ RDPRWGR QET GEDP     YAVSYVRG+Q    T N  
Sbjct: 137 TYNLGHA-GLTFWCPNVNVARDPRWGRTQETXGEDPFTVSVYAVSYVRGLQDVEGTENTT 195

Query: 203 KLKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
            L  + L+ S+  KHF AYDLDNW    R  F+ARV+ QD+A+T+  PFE+CV++G  SG
Sbjct: 196 DLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHFNARVSEQDMAETFLRPFEACVREGDVSG 255

Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV-SIIYDAEGYAKSPEDAVV 320
           +MC++N +NGIP CAD  L   T R +W  HGYI SDC ++ +I+ D +    + E+AV 
Sbjct: 256 VMCSFNNINGIPPCADPRLFKGTIRDEWNLHGYIVSDCWSIETIVEDQKFLDVTGEEAVA 315

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             LKAG+D+ CG +      +AV   ++ + ++D++L NL+ V MRLG F+G P +   G
Sbjct: 316 LNLKAGLDLECGHYYNDSPASAVMAGRVGQHDLDQSLSNLYVVLMRLGFFDGIPALASLG 375

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           K   D+  S  H  LA +AA+ GIVLLKN +  LPL   K  +LAL+GPNA++   ++GN
Sbjct: 376 K--DDICLSAEHIELAREAARQGIVLLKNDNATLPLKSVK--NLALVGPNADAYGAMMGN 431

Query: 441 YAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGL-D 498
           YAGP CRS++P  A    + N  Y  GC  V C + + + KAV+ AK AD  ++++G+ D
Sbjct: 432 YAGPPCRSVSPRDAFSA-IGNVTYEMGCGDVLCHNDTYVYKAVEAAKHADTTIIVVGITD 490

Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL--CGGPVDITFAKYDRNIGSI 556
            +   E+ DRVDL+LPG Q  L+ ++A+A   P+ILV+   CGGP+DI+FA+ +  I  I
Sbjct: 491 VSIGTEDKDRVDLLLPGYQTHLVNQIAKATTAPIILVVCGHCGGPIDISFARDNPGIEPI 550

Query: 557 LWAGYPGEAGAVALAEVIFGDHNPG 581
           LWAG+PGE G  A+A+V++G +NPG
Sbjct: 551 LWAGFPGEEGGNAIADVVYGKYNPG 575


>gi|32488698|emb|CAE03635.1| OSJNBb0003B01.27 [Oryza sativa Japonica Group]
          Length = 839

 Score =  507 bits (1306), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 248/456 (54%), Positives = 324/456 (71%), Gaps = 9/456 (1%)

Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
           S ++  I   +  EARA++N G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+  KYA
Sbjct: 202 SMMYNLIVLVVSTEARAMHNVGLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 260

Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
           V YV G+Q    + G     L+ +ACCKH+TAYD+DNWKG  RY FDA V+ QDL DT+Q
Sbjct: 261 VGYVTGLQ----DAGGGSDALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQ 316

Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
           PPF+SCV  G  + +MC+YN+VNG P+CAD++LLS   R  W  +GYI SDCD+V ++Y+
Sbjct: 317 PPFKSCVIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYN 376

Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
            + Y K+PEDA    +K+G+D+NCG+FL +HT AAV+  KL ES++DRA+ N F V MRL
Sbjct: 377 NQHYTKNPEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRL 436

Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALI 427
           G F+G+P   PFG +G   VC+ ++Q LA +AA+ GIVLLKN+ G LPL      S+A+I
Sbjct: 437 GFFDGDPRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKNT-GALPLSAKSIKSMAVI 495

Query: 428 GPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID--KAVDIA 485
           GPNAN++ T++GNY G  C+  TPLQ L   V  TVY PGC  V CS  S+    A   A
Sbjct: 496 GPNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLSAATQAA 554

Query: 486 KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
             AD  VL++G DQ+ E+E LDR  L+LPG+Q +L++ VA A++ PVILV++ GGP DI+
Sbjct: 555 ASADVTVLVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDIS 614

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           FAK    I +ILW GYPGEAG  ALA+++FG HNPG
Sbjct: 615 FAKSSDKISAILWVGYPGEAGGAALADILFGYHNPG 650



 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 62/108 (57%), Positives = 76/108 (70%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           P F+CD SN +   + FC  T   + RA DL+ RLTL EK+  LVN   A+PRLGIPAYE
Sbjct: 29  PVFACDASNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYE 88

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQ 136
           WWSEALHGV+ VG G  F+  + GATSFPQ ILTAASF++ L+  IG+
Sbjct: 89  WWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGE 136


>gi|14164501|dbj|BAB55751.1| putative alpha-L-arabinofuranosidase/beta-D- xylosidase isoenzyme
           ARA-I [Oryza sativa Japonica Group]
          Length = 818

 Score =  507 bits (1305), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 256/574 (44%), Positives = 357/574 (62%), Gaps = 35/574 (6%)

Query: 33  CDPSNPSTET-----FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
           CDP+  +        FP+C  +LP + R RDLV R+TL+EK++ L + A   PR+G+P Y
Sbjct: 47  CDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY 106

Query: 88  EWWSEALHGVAGVGKG-IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
            WW EALHGV+ VG G  +F   + GATSFP VI +AASF+  LW  IG  +  E RA+Y
Sbjct: 107 LWWGEALHGVSDVGPGGTWFGDAVPGATSFPLVINSAASFNETLWRAIGGVVSTEIRAMY 166

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG---------D 197
           N G A  +T+W+PNIN+ RDPRWGR  ETPGEDP V G+YAV++VRG+Q           
Sbjct: 167 NLGHA-ELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAASA 225

Query: 198 TFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQG 257
                     ++ S+CCKH+ AYD+D W GT R  FDARV  +D+ +T++ PFE C++ G
Sbjct: 226 AAATDAFSRPIKVSSCCKHYAAYDVDAWNGTDRLTFDARVQERDMVETFERPFEMCIRDG 285

Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI-IYDAEGYAKSPE 316
            AS +MC+YNR+NG+P+CAD  LL++T RR W  HGYI SDCD+V + + DA+    +  
Sbjct: 286 DASCVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGV 345

Query: 317 DAVVDVLKAGMDVNCG-------SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
           +A    +KAG+D++CG        F   +   AV+Q KL ES +D AL NL+   MRLG 
Sbjct: 346 EATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGF 405

Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG- 428
           F+G P ++    +GA  VC+  H+ LA  AA+ G+VLLKN   LLPL   K  S+AL G 
Sbjct: 406 FDGIPELE---SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQ 462

Query: 429 -PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKG 487
             + N+   +LG+Y G  CR +TP   ++  V +T  +      AC   S D A   AK 
Sbjct: 463 LQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVH------ACDKGSCDTAAAAAKT 516

Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
            D  +++ GL+ + E+E  DR DL+LP  Q   I  VAEA+  P++LV++  G VD++FA
Sbjct: 517 VDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFA 576

Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           + +  IG+++WAGYPGE G  A+A+V+FG +NPG
Sbjct: 577 QDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPG 610


>gi|226506870|ref|NP_001146482.1| uncharacterized protein LOC100280070 precursor [Zea mays]
 gi|219887469|gb|ACL54109.1| unknown [Zea mays]
 gi|413947917|gb|AFW80566.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 835

 Score =  506 bits (1303), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 257/576 (44%), Positives = 362/576 (62%), Gaps = 37/576 (6%)

Query: 33  CDPSN-----PSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
           CDP+           F +C  +LP + R RDLV RL L+EK+  L + A   PR+G+P Y
Sbjct: 63  CDPARFVALGLDMSRFRYCDASLPYADRVRDLVGRLALEEKVRNLGDQAEGAPRVGLPPY 122

Query: 88  EWWSEALHGVAGVGKG-IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           +WW EALHGV+ VG G  +F   + GATSFP VI +AA+F+  LW  IG  +  E RA+Y
Sbjct: 123 KWWGEALHGVSDVGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMY 182

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ----------G 196
           N G A  +T+W+PNIN+ RDPRWGR  ETPGEDP V G+YAV++VRG+Q           
Sbjct: 183 NLGHA-ELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVDDRPYAAAA 241

Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
           D F+       ++ S+CCKHF AYD+D W    R  FDA+V  +D+ +T++ PFE C++ 
Sbjct: 242 DPFSR-----PIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRD 296

Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI-IYDAEGYAKSP 315
           G AS +MC+YNR+NGIP+CAD  LLS+T R QW  HGYI SDCD+V + + DA+    + 
Sbjct: 297 GDASCVMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTG 356

Query: 316 EDAVVDVLKAGMDVNCG-------SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG 368
            +A    +KAG+D++CG        F   +   AV+Q K+ E ++D AL N+++  MRLG
Sbjct: 357 VEATAAAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEGDVDNALSNVYTTLMRLG 416

Query: 369 LFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
            F+G P    F  +GA  VC+  H+ LA  AA+ G+VLLKN    LPL  +K  S++L+G
Sbjct: 417 FFDGMPE---FESLGASNVCTDGHKELAADAARQGMVLLKNDARRLPLDPNKINSVSLVG 473

Query: 429 --PNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIA 485
              + N+   +LG+Y G  CR +TP  A++N V N  Y   CD+ AC++A  + +A   A
Sbjct: 474 LLEHINATDVMLGDYRGKPCRIVTPYNAIRNMV-NATYVHACDSGACNTAEGMGRASSTA 532

Query: 486 KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           K AD  +++ GL+ + E+E  DR DL+LP  Q   I  VA A+  P++LV++  G VD++
Sbjct: 533 KIADATIVIAGLNMSVERESNDREDLLLPWNQSSWINAVAMASPTPIVLVIMSAGGVDVS 592

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           FA  +  IG+I+WAGYPGE G  A+A+V+FG +NPG
Sbjct: 593 FAHNNTKIGAIVWAGYPGEEGGTAIADVLFGKYNPG 628


>gi|293336530|ref|NP_001167905.1| uncharacterized protein LOC100381616 [Zea mays]
 gi|223944757|gb|ACN26462.1| unknown [Zea mays]
          Length = 630

 Score =  506 bits (1302), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 238/438 (54%), Positives = 325/438 (74%), Gaps = 7/438 (1%)

Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
           ++NAGQA G+T+WAPNINIFRDPRWGRGQET GEDP V   Y++ YV+G QG+       
Sbjct: 1   MHNAGQA-GLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEE----GE 55

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
           +G+++ SACCKH+TAYD++ W+G +RY F+A+V  QDL DTYQPPF++C+++ RAS +MC
Sbjct: 56  EGRIRLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMC 115

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
           AYN+VNG+P CA ++LL KT R +WGF GYITSDCDAV+II++ + Y KS ED++  VLK
Sbjct: 116 AYNQVNGVPMCAHKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTYTKSGEDSIAIVLK 174

Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
           AGMD+NCGSFL +HTK+A+++ K+ E +IDRAL NLFSV++RLG+F+     Q F ++G 
Sbjct: 175 AGMDINCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGP 234

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
           + VC+  H+ LA +A + G VLLKN H  LPL +S+   +A+IGP+AN A  + G+Y G 
Sbjct: 235 NSVCTKEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGV 294

Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQTQEK 503
            C   T L+ +Q Y   T + PGC   +C+S  +  +AV+ AK AD VV++ GL+ T+E+
Sbjct: 295 PCNPTTFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEER 354

Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
           E+ DRV L+LPG+Q  LI  +A  AKKP++LVLL GGPVD++FAK D  I SILW GYPG
Sbjct: 355 EDFDRVSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPG 414

Query: 564 EAGAVALAEVIFGDHNPG 581
           E G   L E++FG++NPG
Sbjct: 415 EVGGQVLPEILFGEYNPG 432


>gi|167525174|ref|XP_001746922.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774702|gb|EDQ88329.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1620

 Score =  506 bits (1302), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 268/559 (47%), Positives = 355/559 (63%), Gaps = 33/559 (5%)

Query: 24   VDSTQP-PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
            +D+  P P  CD      + FPFC  +L +  R RD++SRL++ +K++   N+A A    
Sbjct: 906  IDAAPPRPRPCD--ELPAKNFPFCNASLDLDTRIRDVISRLSIQDKVALTANTAGAAADA 963

Query: 83   GIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
            G+PAY+WWSEALHGV G   G+ F G ++ ATSFPQVI T+ASF+  LW+ IG  I  EA
Sbjct: 964  GLPAYQWWSEALHGV-GFSPGVTFMGKVQAATSFPQVIHTSASFNKTLWHHIGMTISTEA 1022

Query: 143  RALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
            RA+ N  QA G+TFWAPNINI RDPRWGRGQETPGEDP  TG YA ++V G+Q      G
Sbjct: 1023 RAMNNVNQA-GLTFWAPNINIIRDPRWGRGQETPGEDPYATGLYAANFVPGMQE-----G 1076

Query: 203  KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
            +    ++AS+CCKHF  Y+L++W    R+ F+A  T QD+ADTY P FESCV+ GRAS +
Sbjct: 1077 EDTRYIKASSCCKHFFDYNLEDWHNVDRHHFNAIATDQDIADTYLPAFESCVRFGRASSL 1136

Query: 263  MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
            MC+YN VNG+PSCA+ ++++  AR  WGF GYITSDC AV  +Y    Y  +    V  V
Sbjct: 1137 MCSYNAVNGVPSCANADIMTTLAREAWGFDGYITSDCGAVEDVYSNHKYYNTTGATVNGV 1196

Query: 323  LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
            L AGMDV+CGSFL +H   A+    +  + +D+AL+NLF V+ RLG+F+     QP+  +
Sbjct: 1197 LSAGMDVDCGSFLSQHLADAIDSGDVTNATVDQALYNLFRVQFRLGMFDPAED-QPYLNL 1255

Query: 383  GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
              D V +P HQ LAL+AA+ G+ LL+N    LPL  S    LALIGPNAN+   + GNY 
Sbjct: 1256 TTDAVNTPEHQQLALEAARQGMTLLENRDSRLPLDASSIKQLALIGPNANATGVMQGNYN 1315

Query: 443  GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
            G +   I+P Q +Q YV N     G             AV  AK AD VV+++GLDQTQE
Sbjct: 1316 GKAPFLISPQQGVQQYVSNVALELG-------------AVTAAKAADTVVMVIGLDQTQE 1362

Query: 503  KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
             E  DR  + LPG Q EL+ +VA A+  P+++V++ GG VD+T  K   N+         
Sbjct: 1363 SEGHDREIIALPGMQAELVAQVANASSSPIVVVVMTGGAVDLTPVKDLDNV--------- 1413

Query: 563  GEAGAVALAEVIFGDHNPG 581
            G+AG  ALAE +FGD+NPG
Sbjct: 1414 GQAGGQALAETLFGDNNPG 1432


>gi|125576920|gb|EAZ18142.1| hypothetical protein OsJ_33692 [Oryza sativa Japonica Group]
          Length = 618

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 231/426 (54%), Positives = 310/426 (72%), Gaps = 7/426 (1%)

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
            W+PN+NIFRDPRWGRGQETPGEDP    KY  ++V+G+QG +         LQ SACCK
Sbjct: 1   MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLT------NLQTSACCK 54

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           H TAYD++ WKG +RY F+A+VT QDLADTY PPF SCV  G+AS IMCAY  +NG+P+C
Sbjct: 55  HITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVPAC 114

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
           A  +LL+KT R +W   GY  SDCDAV+I++ +E + ++ E+AV   LKAG+D+NCG ++
Sbjct: 115 ASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRTAEEAVAVALKAGLDINCGVYM 174

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGADVVCSPAHQV 394
           Q++  +A++Q K+ E ++D+AL NLF++RMRLG F+G+P     +G++ A  VC+P H+ 
Sbjct: 175 QQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTPVHKA 234

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LAL+AA+ G+VLLKN   LLPL      S A+IG NAN    LLGNY G  C + TP   
Sbjct: 235 LALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTTPFGG 294

Query: 455 LQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
           +Q YV++  + PGC + AC  A+ D+A  +AK +D+V L+MGL Q QE+E LDR  L+LP
Sbjct: 295 IQKYVKSAKFLPGCSSAACDVAATDQATALAKSSDYVFLVMGLSQKQEQEGLDRTSLLLP 354

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           G+QQ LIT VA A+K+PVIL+LL GGPVDITFA+ +  IG+ILWAGYPG+AG  A+A+V+
Sbjct: 355 GKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAIADVL 414

Query: 575 FGDHNP 580
           FG+ NP
Sbjct: 415 FGEFNP 420


>gi|326523729|dbj|BAJ93035.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 810

 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 261/571 (45%), Positives = 360/571 (63%), Gaps = 34/571 (5%)

Query: 33  CDPSNPST-----ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
           CDP+  +        F +C  +LP + R RDLV RLTL+EK+  L + A    R+G+P Y
Sbjct: 46  CDPARFAALGLEMAGFRYCDASLPYADRVRDLVGRLTLEEKVRNLGDRAEGAARVGLPPY 105

Query: 88  EWWSEALHGVAGVGKG-IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
            WW EALHGV+  G G   F   + GATSFP VI +AA+F+  LW  IG A+  E RA+Y
Sbjct: 106 LWWGEALHGVSDTGPGGTRFGDVVPGATSFPLVINSAAAFNETLWGAIGGAVSTEIRAMY 165

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
           N G A  +T+W+PNIN+ RDPRWGR  ETPGEDP V G+YAVS+VR +Q D    G   G
Sbjct: 166 NLGHA-ELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVSFVRAMQ-DIDGAGPGAG 223

Query: 207 K------LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
                  ++ S+CCKH+ AYD+D W    R  FDA+V  +D+ +T++ PFE CV+ G AS
Sbjct: 224 ADPFARPIKVSSCCKHYAAYDVDAWLTADRLTFDAQVEERDMIETFERPFEMCVRDGDAS 283

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI-IYDAEGYAKSPEDAV 319
            +MC+YNR+NG+P+CA+  LLS+T R +W  HGYI SDCD+V + + DA+    +  +A 
Sbjct: 284 CVMCSYNRINGVPACANARLLSETVRGEWQLHGYIVSDCDSVRVMVRDAKWLGYNGVEAT 343

Query: 320 VDVLKAGMDVNCGSFLQKHTK-------AAVKQKKLPESEIDRALHNLFSVRMRLGLFNG 372
              +KAG+D++CG F +            AV+Q KL ESE+D AL NL+   MRLG F+G
Sbjct: 344 AAAMKAGLDLDCGMFWEGAQDFFTAFGLDAVRQGKLRESEVDNALRNLYLTLMRLGFFDG 403

Query: 373 NPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG--PN 430
            P ++    +GA+ VC+  H+ LA  AA+ G+VL+KN HG LPL  SK  SL+L+G   +
Sbjct: 404 IPELE---SLGANDVCTEEHKELAADAARQGMVLIKNDHGRLPLDTSKVNSLSLVGLLQH 460

Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADH 490
            N+   +LG+Y G  CR +TP  A++  V  T     CD  ACS+A+  K VD       
Sbjct: 461 INATDVMLGDYRGKPCRVVTPYDAIRKVVSATSMQV-CDHGACSTAANGKTVDA------ 513

Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
            +++ GL+ + EKE  DR DL+LP  Q   I  VAEA+  P+ILV++  G VD++FA+ +
Sbjct: 514 TIVIAGLNMSVEKEGNDREDLLLPWNQTNWINAVAEASPYPIILVIISAGGVDVSFAQNN 573

Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             IG+I+WAGYPGE G  A+A+V+FG +NPG
Sbjct: 574 PKIGAIVWAGYPGEEGGTAIADVLFGKYNPG 604


>gi|413919687|gb|AFW59619.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 451

 Score =  496 bits (1277), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 238/419 (56%), Positives = 308/419 (73%), Gaps = 6/419 (1%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           P F+CD SN +  ++ FC  +   + RA DLVSRLTL EK+  LV+   A+PRLG+P YE
Sbjct: 38  PAFACDASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYE 97

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           WWSEALHGV+ VG G  F+  + GATSFPQ ILTAASF++ L+  IG+ +  EARA++N 
Sbjct: 98  WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNV 157

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           G A G+TFW+PNINIFRDPRWGRGQETPGEDPL+T KYAV YV G+QG     G LK   
Sbjct: 158 GLA-GLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGAGALK--- 213

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
             +ACCKH+TAYD+DNWKG  RY FDA V+ QDL DT+QPPF+SCV  G  + +MC+YN+
Sbjct: 214 -VAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGNVASVMCSYNQ 272

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG P+CAD++LLS   R  W  +GYI+SDCD+V ++Y+ + Y K+PEDA    +KAG+D
Sbjct: 273 VNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAISIKAGLD 332

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           +NCG+FL +HT AAV+  KL ES++DRA+ N     MRLG F+G+P   PFG +G   VC
Sbjct: 333 LNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGPSDVC 392

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
           +P++Q LA +AA+ GIVLLKN+ G LPL  +   S+A+IGPNAN++ T++GNY G SC 
Sbjct: 393 TPSNQELAREAARQGIVLLKNT-GKLPLSATSIKSMAVIGPNANASFTMIGNYEGTSCN 450


>gi|357128056|ref|XP_003565692.1| PREDICTED: beta-D-xylosidase 3-like [Brachypodium distachyon]
          Length = 821

 Score =  495 bits (1275), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 258/599 (43%), Positives = 365/599 (60%), Gaps = 39/599 (6%)

Query: 16  CFTSLLTRVDSTQPPFS---------CDPSNPST-----ETFPFCKTTLPISQRARDLVS 61
           C  S+    DS   P +         CDP+  ++       F +C  +LP ++R RDLV 
Sbjct: 20  CVVSVAFASDSVLGPITTNGRNYTKVCDPARFASLGLDMAGFRYCDASLPYAERVRDLVG 79

Query: 62  RLTLDEKISQLVNSAP-AIPRLGIPAYEWWSEALHGVAGVGKG-IFFNGTIRGATSFPQV 119
           RLTL+EK++ L + A  A  R+G+P Y WW EALHGV+    G   F   + GATSFP V
Sbjct: 80  RLTLEEKVANLGDQAKGAEQRVGLPRYMWWGEALHGVSDTNPGGTRFGDVVPGATSFPLV 139

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
           + +AA+F+  LW  IG A   E RA+YN G A  +T+W+PNIN+ RDPRWGR  ETPGED
Sbjct: 140 LNSAAAFNETLWRAIGGATSTEIRAMYNLGHA-ELTYWSPNINVVRDPRWGRASETPGED 198

Query: 180 PLVTGKYAVSYVRGVQG-------DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
           P + G++AVS+VR +Q                  +L+ S+CCKH+ AYD+D W G  R  
Sbjct: 199 PFLVGRFAVSFVRAMQDIDDGANAGAGAADPFARRLKVSSCCKHYAAYDVDKWFGADRLS 258

Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
           FDA V  +D+ +T++ PFE CV+ G AS +MC+YNR+NG+P+CA+  LL+ T RR W  H
Sbjct: 259 FDANVQERDMVETFERPFEMCVRDGDASCVMCSYNRINGVPACANGRLLTGTVRRDWQLH 318

Query: 293 GYITSDCDAVSI-IYDAEGYAKSPEDAVVDVLKAGMDVNCG-------SFLQKHTKAAVK 344
           GYI SDCD+V + + DA+        A    +KAG+D++CG        F   +   AV+
Sbjct: 319 GYIVSDCDSVRVMVRDAKWLGYDGVQATAAAMKAGLDLDCGMFWEGAKDFFTAYGLQAVR 378

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           Q KL E+E+D AL +L+   MRLG F+G+P  Q    +GA  VC+  H+ +A +AA+ G+
Sbjct: 379 QGKLKEAEVDEALGHLYLTLMRLGFFDGSPEFQ---SLGASDVCTEEHKEMAAEAARQGM 435

Query: 405 VLLKNSHGLLPLPKSKSVSLALIG--PNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
           VLLKN H  LPL  +K  SLAL+G   + N+   +LG+Y G  CR +TP +A++  V  T
Sbjct: 436 VLLKNDHDRLPLDANKVNSLALVGLLQHINATDVMLGDYRGKPCRVVTPYEAIRKVVSGT 495

Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
                CD  AC + ++  A+  AK  D  +++ GL+ + E+E  DR DL+LP  Q + I 
Sbjct: 496 S-MQACDKGACGTTALGAAI-AAKTVDATIVITGLNMSVEREGNDREDLLLPWDQTQWIN 553

Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            VAEA++ P+ LV++  G VDI+FA+ +  IG+ILWAGYPGE G   +A+V+FG +NPG
Sbjct: 554 AVAEASRDPITLVIISAGGVDISFAQNNPKIGAILWAGYPGEEGGTGIADVLFGKYNPG 612


>gi|78482949|emb|CAJ41429.1| beta (1,4)-xylosidase [Populus tremula x Populus alba]
          Length = 732

 Score =  494 bits (1272), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 254/527 (48%), Positives = 342/527 (64%), Gaps = 22/527 (4%)

Query: 13  LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
           L + F  +  +      PF+CDP + +    PFC+  LPI  R  DL+ R+TL EK+  L
Sbjct: 9   LSVLFLGVSLQTSKALDPFACDPKDGTNRDLPFCQVNLPIHTRVNDLIGRMTLQEKVGLL 68

Query: 73  VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
           VN+A A+PRLGI  YEWWSEALHGV+ VG G  F G    ATSFPQVI TAASF++ LW 
Sbjct: 69  VNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGAFPVATSFPQVITTAASFNATLWE 128

Query: 133 RIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVR 192
            IG+ +  EARA++N G A G+T+W+PN+     PRWGRGQETPGEDP+V GKYA SYVR
Sbjct: 129 AIGRVVSDEARAMFNGGVA-GLTYWSPNVTYSVYPRWGRGQETPGEDPVVVGKYAASYVR 187

Query: 193 GVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFES 252
           G+QG   +G +LK     +ACCKHFTAYDLDNW G  R+ F+A+V+ QD+ DT+  PF  
Sbjct: 188 GLQGS--DGIRLK----VAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMVDTFDVPFRM 241

Query: 253 CVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYA 312
           CVK+G+ + +MC+YN+VNGIP+CAD NLL KT R QW  +GYI SDCD+  + Y  + + 
Sbjct: 242 CVKEGKVASVMCSYNQVNGIPTCADPNLLKKTVRGQWRLNGYIVSDCDSFGVYYGQQHFT 301

Query: 313 KSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNG 372
            SP  + +   KAG+D++CG FL  H  A   +K   E+EI+ A     + ++ LG+F+G
Sbjct: 302 -SPRRSSLGCYKAGLDLDCGPFLVTHRDAV--KKAAEEAEINNAWLKTLTFQISLGIFDG 358

Query: 373 NPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA- 431
           +P +Q  G +    +  P +Q LA+ A +  + + KN   LL  P+       + GP A 
Sbjct: 359 SP-LQAVGDV-VPTMGPPTNQDLAVNAPKR-LFIFKNRAFLLYSPRH------IFGPVAL 409

Query: 432 -NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADH 490
             S   +LGNY G  C+ + PLQ L  +V + +Y PGC  V C+ A +  AVD+A  AD 
Sbjct: 410 FKSLPFMLGNYEGLPCKYLFPLQGLAGFV-SLLYLPGCSNVICAVADVGSAVDLAASADA 468

Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           VVL++G DQ+ E+E  DRVD  LPG+QQEL+TRVA AAK PV+LV++
Sbjct: 469 VVLVVGADQSIEREGHDRVDFYLPGKQQELVTRVAMAAKGPVLLVIM 515


>gi|125535275|gb|EAY81823.1| hypothetical protein OsI_36995 [Oryza sativa Indica Group]
          Length = 885

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 248/453 (54%), Positives = 322/453 (71%), Gaps = 8/453 (1%)

Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
           Y   QA+  E RA+YN GQA G+TFW+PN+NIFRDPRWGRGQETPGEDP V  +YA +YV
Sbjct: 223 YVCAQAVSDEGRAMYNGGQA-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYV 281

Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
           RG+Q    + G+LK     +ACCKHFTAYDLDNW GT R+ F+A VT QDL DT+  PF 
Sbjct: 282 RGLQQQQPSSGRLK----LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFR 337

Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
           SCV  GRA+ +MC+YN+VNG+P+CAD   L  T RR+WG  GYI SDCD+V + Y  + Y
Sbjct: 338 SCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHY 397

Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
            ++ EDAV   L+AG+D++CG FL ++T+ AV Q K+ + +ID A+ N  +V+MRLG+F+
Sbjct: 398 TRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFD 457

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIGPN 430
           G+P  QPFG +G   VC+ AHQ LA++AA+ GIVLLKN    LPL P +   ++A++GP+
Sbjct: 458 GDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPH 517

Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS--IDKAVDIAKGA 488
           A +   ++GNYAG  CR  TPLQ +  Y     + PGC  VAC+ +   I  AVD A+ A
Sbjct: 518 AEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRA 577

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           D  +++ GLDQ  E E LDR  L+LPGRQ ELI+ VA+A+K PVILVL+ GGP+DI FA+
Sbjct: 578 DATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQ 637

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            D  I  ILWAGYPG+AG  A+A+VIFG HNPG
Sbjct: 638 NDPKIAGILWAGYPGQAGGQAIADVIFGHHNPG 670



 Score =  125 bits (315), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 57/95 (60%), Positives = 70/95 (73%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
           T PFC+ +LP   RARDLV+R+T  EK+  LVN+A  +PRLG+  YEWWSEALHGV+  G
Sbjct: 40  TLPFCRRSLPARARARDLVARMTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQ 136
            G+ F G   GAT+FPQVI TAASF++ LW  IGQ
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQ 134


>gi|77552476|gb|ABA95273.1| Beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 883

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 248/453 (54%), Positives = 322/453 (71%), Gaps = 8/453 (1%)

Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
           Y   QA+  E RA+YN GQA G+TFW+PN+NIFRDPRWGRGQETPGEDP V  +YA +YV
Sbjct: 223 YVCAQAVSDEGRAMYNGGQA-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYV 281

Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
           RG+Q    + G+LK     +ACCKHFTAYDLDNW GT R+ F+A VT QDL DT+  PF 
Sbjct: 282 RGLQQQQPSSGRLK----LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFR 337

Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
           SCV  GRA+ +MC+YN+VNG+P+CAD   L  T RR+WG  GYI SDCD+V + Y  + Y
Sbjct: 338 SCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHY 397

Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
            ++ EDAV   L+AG+D++CG FL ++T+ AV Q K+ + +ID A+ N  +V+MRLG+F+
Sbjct: 398 TRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFD 457

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIGPN 430
           G+P  QPFG +G   VC+ AHQ LA++AA+ GIVLLKN    LPL P +   ++A++GP+
Sbjct: 458 GDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPH 517

Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS--IDKAVDIAKGA 488
           A +   ++GNYAG  CR  TPLQ +  Y     + PGC  VAC+ +   I  AVD A+ A
Sbjct: 518 AEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRA 577

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           D  +++ GLDQ  E E LDR  L+LPGRQ ELI+ VA+A+K PVILVL+ GGP+DI FA+
Sbjct: 578 DATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQ 637

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            D  I  ILWAGYPG+AG  A+A+VIFG HNPG
Sbjct: 638 NDPKIAGILWAGYPGQAGGQAIADVIFGHHNPG 670



 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 58/95 (61%), Positives = 70/95 (73%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
           T PFC+ +LP   RARDLV+RLT  EK+  LVN+A  +PRLG+  YEWWSEALHGV+  G
Sbjct: 40  TLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQ 136
            G+ F G   GAT+FPQVI TAASF++ LW  IGQ
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQ 134


>gi|413954831|gb|AFW87480.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 814

 Score =  491 bits (1263), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 250/568 (44%), Positives = 351/568 (61%), Gaps = 57/568 (10%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           FP+C  +LP + R RDL+  +T++EK+  L + +   PR+G+P Y+WWSEALHGV+  G 
Sbjct: 68  FPYCDASLPYADRVRDLIGWMTVEEKVGNLGDISHGAPRVGLPPYKWWSEALHGVSSTGP 127

Query: 103 GIFFN------------GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
            + F+             T+  AT F  VI +AASF+  LW  IGQA+  EARA+YN G+
Sbjct: 128 TMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWNSIGQAVSTEARAMYNLGK 187

Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGKLKGKL 208
             G+T+W+PNIN+ RDPRWGR  ETPGEDP V G+YAV++VRG+Q     ++G      +
Sbjct: 188 G-GLTYWSPNINVVRDPRWGRALETPGEDPYVAGRYAVNFVRGMQDIPGHYSGDPSARPI 246

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           + SACCKH  AYD+DNW   TR+ +DARV+ +D+A+T+  PFE CV++G  S +MC+YNR
Sbjct: 247 KTSACCKHHAAYDVDNWHNQTRFTYDARVSERDMAETFLRPFEMCVREGDVSSVMCSYNR 306

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK--SPEDAVVDVLKAG 326
           VNG+P+CAD  LLS T R +W  +GYI SDCDAV ++ D   +    + E + V  L+AG
Sbjct: 307 VNGVPACADARLLSGTVRGEWHLNGYIVSDCDAVRVMTDNATWLNFTAAESSAVS-LRAG 365

Query: 327 MDVNCGS------------FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
           MD++C              +L ++  AAV Q K+ ES+ID AL NL+   MRLG F+  P
Sbjct: 366 MDLDCAESWIEEEGRPLRDYLSEYGMAAVAQGKMRESDIDNALTNLYMTLMRLGYFDNIP 425

Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
               +  +    VC+  H+ LAL  A+ GIVLLKN HGLLPL   K++++A+ GP+A + 
Sbjct: 426 R---YASLNETDVCTDEHKSLALDGARQGIVLLKNDHGLLPLDPKKTLAVAVHGPHARAP 482

Query: 435 KTLL-GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVL 493
           + ++ G+Y GP CR +TP Q                        I + V I+  A   + 
Sbjct: 483 EKIMDGDYTGPPCRYVTPRQG-----------------------ISRDVKISHKAKMTIY 519

Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           + G++   E+E  DR DL+LP  Q E I   A+A+  P+ILV+L GG +DI+FA+    I
Sbjct: 520 LGGINLYIEREGNDREDLLLPKNQTEEILHFAQASPTPIILVILSGGGIDISFAQKHPKI 579

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
           G+ILWAGYPG  G  A+A+VIFG +NPG
Sbjct: 580 GAILWAGYPGGEGGNAIADVIFGRYNPG 607


>gi|340370206|ref|XP_003383637.1| PREDICTED: probable beta-D-xylosidase 5-like [Amphimedon
           queenslandica]
          Length = 728

 Score =  487 bits (1254), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 245/545 (44%), Positives = 351/545 (64%), Gaps = 16/545 (2%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
           T+ +C  T  I +R  DL+SR+T+ +KI QL+ SAPAIP L IPAY+WWSE LHGVAG  
Sbjct: 28  TYKYCDYTQSIPERVNDLLSRMTILDKIPQLITSAPAIPSLDIPAYQWWSEGLHGVAG-S 86

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNI 161
            G+ F G    ATSFPQVI   A+F+  L   + Q I  EARA  N GQA G+T++APNI
Sbjct: 87  PGVHFGGNFPNATSFPQVIGLGATFNMSLVLAMAQVISTEARAFANGGQA-GLTYFAPNI 145

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           NIFRDPRWGRGQETPGEDP ++ +YA ++V+G+Q     G      L+  A CKH+ AYD
Sbjct: 146 NIFRDPRWGRGQETPGEDPYLSSQYAANFVKGMQ----EGADDTRYLKTIATCKHYAAYD 201

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
           L+N+   +R+ F+A V+ QD  +TY P F SCV++G+   IMC+YN VNG+PSCA+  + 
Sbjct: 202 LENYLNLSRHTFNAIVSDQDFEETYFPAFRSCVEEGKVGSIMCSYNAVNGVPSCANDFIN 261

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
           ++ AR +WGF GY+ SDC A+S I ++  Y  + +D V   L+ G D+NCG F   H +A
Sbjct: 262 NEVARGKWGFEGYVVSDCGAISDIINSHKYTSNTDDTVAAGLRGGCDLNCGHFYSDHAQA 321

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A     + + +IDRA+  LF+ RMRLG+F+  P+MQPF     D V +  H+ LAL A++
Sbjct: 322 AYDNGAITDDDIDRAMTRLFTYRMRLGMFD-PPSMQPFRDYTNDKVDTKQHEALALDASR 380

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           + IVLL+N+  +LPL  +    +AL+GP+  +   + GNY G +   I+P+Q LQ+   +
Sbjct: 381 ESIVLLQNNKDILPLSLTTHRKIALVGPHGQAQGAMQGNYKGTAPYLISPMQGLQDLGLS 440

Query: 462 TVYYPGCDTVACSS----ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
             +  GC  VAC +    + + K V+     + ++ ++GLD++QE E  DR  L LPG+Q
Sbjct: 441 VTFAAGCTQVACPTIAGFSEVTKLVE-EHSIEAIIAVIGLDESQESEGHDRTSLTLPGQQ 499

Query: 518 QELITRVAEAAKK--PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIF 575
            +L+  + + A    P I+V++ GGPVD++  K   +  +ILWAGYPG++G  A+AEVI+
Sbjct: 500 VQLLEDIKKKAVPGIPFIVVVMSGGPVDLSGVKDIAD--AILWAGYPGQSGGQAIAEVIY 557

Query: 576 GDHNP 580
           G  NP
Sbjct: 558 GKVNP 562


>gi|242093144|ref|XP_002437062.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
 gi|241915285|gb|EER88429.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
          Length = 809

 Score =  484 bits (1246), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 249/569 (43%), Positives = 348/569 (61%), Gaps = 57/569 (10%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           FP+C  +LP + R RDL+  +T++EK+  L + +   PR+G+P Y+WWSEALHGV+  G 
Sbjct: 61  FPYCDASLPYADRVRDLIGWMTVEEKVGNLGDVSHGAPRVGLPPYKWWSEALHGVSSTGP 120

Query: 103 GIFFN------------GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
            + F+             T+  AT F  VI +AASF+  LW  IGQA+  EARA+YN G+
Sbjct: 121 TMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNLGK 180

Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ---GDTFNGGKLKGK 207
             G+T+W+PNIN+ RDPRWGR  ETPGEDP V G+YAV++VRG+Q   G    G     +
Sbjct: 181 G-GLTYWSPNINVVRDPRWGRALETPGEDPFVAGRYAVNFVRGMQDIPGHDGGGDDPSTR 239

Query: 208 -LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            ++ SACCKH+ AYD+D+W   TR+ FDARV+ +D+A+T+  PFE CV+ G ASG+MC+Y
Sbjct: 240 PIKTSACCKHYAAYDVDDWHNHTRFTFDARVSERDMAETFLRPFEMCVRDGDASGVMCSY 299

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK-SPEDAVVDVLKA 325
           NRVNGIP+CAD  LLS T R  W  HGYI SDCDAV ++ D   +   +  ++    ++A
Sbjct: 300 NRVNGIPACADARLLSGTIRGDWQLHGYIVSDCDAVRVMTDNATWLHFTGAESSAASIRA 359

Query: 326 GMDVNCGS------------FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
           G+D++C              FL ++ KAAV Q K+ ES+ID AL N +   MRLG F+  
Sbjct: 360 GLDLDCAESWIEEKGRPLRDFLSEYGKAAVAQGKMRESDIDSALRNQYMTLMRLGYFDNI 419

Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
           P    +  +    +C+  H+ LA   A+ G+VLLKN  GLLPL   K +++A+ GP+A +
Sbjct: 420 PR---YASLNETDICTDEHKSLAHDGARQGMVLLKNDDGLLPLDPEKILAVAVHGPHARA 476

Query: 434 AKTLL-GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
            + ++ G+Y GP CR +TP Q                        I K V I+  A+  +
Sbjct: 477 PEKIMDGDYTGPPCRYVTPRQG-----------------------ISKDVKISHRANTTI 513

Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
            + G++   E+E  DR DL+LP  Q E I   A+A+  P+ILV+L GG +DI+FA     
Sbjct: 514 YLGGINLHIEREGNDREDLLLPKNQTEEILHFAKASPNPIILVILSGGGIDISFAHKHPK 573

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           IG+ILWAGYPG  G  A+A+VIFG +NPG
Sbjct: 574 IGAILWAGYPGGEGGNAIADVIFGRYNPG 602


>gi|340370204|ref|XP_003383636.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 755

 Score =  483 bits (1242), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 248/555 (44%), Positives = 348/555 (62%), Gaps = 20/555 (3%)

Query: 28  QPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
           +PPF+          + +C  +  I++R +DL+SRLT+ EK+SQ   +A AI RL IPAY
Sbjct: 50  KPPFNA---------YLYCNYSASITERVKDLLSRLTVLEKMSQTATNASAIERLDIPAY 100

Query: 88  EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
           +WWSE LHG+A    G+FF   +  ATSFPQVI   A+F+  L   +GQ I  EARA  N
Sbjct: 101 DWWSECLHGLAQ-SPGVFFENDLTSATSFPQVIGLGATFNMSLVLAMGQVISTEARAFAN 159

Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
            GQ+ G+TF+APNINI+RDPRWGRGQETPGEDP +T +YA ++V+G+Q     G + +  
Sbjct: 160 NGQS-GLTFFAPNINIYRDPRWGRGQETPGEDPYLTSQYAANFVKGIQ----EGSEDRRY 214

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           L+A A CKH+ AY+L+ +    R  F+A V+ QDL +TY P F++CV++G+   IMC+YN
Sbjct: 215 LKAIATCKHYAAYNLERYLDVRRVNFNAIVSDQDLEETYLPAFKACVQEGQVGSIMCSYN 274

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
            +NG+P+CA+  + +K AR  WGF GYI SDC A+  I     Y       V D LK G 
Sbjct: 275 AINGVPNCANDFINNKIARDTWGFEGYIVSDCGAILDIQYKHNYTSDTNITVADALKGGC 334

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D+NCG F +K+ + A     + E +ID++L  LF+ RMRLG+F+  P +QPF +     V
Sbjct: 335 DLNCGHFYEKYMEDAFDNSTITEEDIDKSLTRLFTSRMRLGMFDP-PEIQPFRQYSVKDV 393

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            +P  Q LAL AA++GIVLL+N   +LPL   K  ++A IGPNA++   + GNY G +  
Sbjct: 394 NTPEAQDLALNAAREGIVLLQNKGSVLPLDIVKHSNIAAIGPNADATHIMQGNYHGIAPY 453

Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
            I+PLQ   N   N  Y  GC      +     AV   +G D V+ ++GL+ TQE E  D
Sbjct: 454 LISPLQGFSNLGINATYQIGCPVACNDTEGFPDAVKAVQGVDAVIAVIGLNNTQEGESHD 513

Query: 508 RVDLVLPGRQQELITRVAEAAKK--PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
           R  + LPG Q++L+  + + A K  P+I+V++ GG VD+T  K   +  +ILWAGYPG++
Sbjct: 514 RTSIALPGHQEDLLLELKKNAAKGTPLIVVVMSGGSVDLTGVKDIAD--AILWAGYPGQS 571

Query: 566 GAVALAEVIFGDHNP 580
           G  A+AEVI+G  NP
Sbjct: 572 GGQAIAEVIYGKVNP 586


>gi|318136853|gb|ADV41671.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Actinidia deliciosa
           var. deliciosa]
          Length = 634

 Score =  480 bits (1235), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 229/442 (51%), Positives = 315/442 (71%), Gaps = 9/442 (2%)

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EARA+YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDP++ G YA SYVRG+QG   N
Sbjct: 2   EARAMYNGGMA-GLTFWSPNVNIFRDPRWGRGQETPGEDPMLAGNYAASYVRGLQG---N 57

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            G+   +L+ +ACCKH+TAYDLDNW+G  R+ F+ARV+ QD+ DT++ PF  CV  G+ +
Sbjct: 58  DGE---RLKVAACCKHYTAYDLDNWRGVDRFHFNARVSKQDIKDTFEIPFRECVLGGKVA 114

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +MC+YN+VNGIP+CA+  LL  T R  W  +GYI SDCD+V + ++ + Y   PE+AV 
Sbjct: 115 SVMCSYNQVNGIPTCANPKLLKGTIRGSWRLNGYIVSDCDSVGVFFENQHYTSKPEEAVA 174

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             +KAG+D++CG FL  HT+AAV++  + + EI+ AL N  + +MRLG+F+G P+   +G
Sbjct: 175 AAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTAQMRLGMFDGEPSAHQYG 234

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
            +G   VC+PAHQ LAL+AA+ GIVLL+N    LPL   +  ++A+IGPN++   T++GN
Sbjct: 235 NLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMIGN 294

Query: 441 YAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQ 499
           YAG +C   TPLQ +  Y   T++  GC  V C+   +   A   A+ AD  VL+MGLDQ
Sbjct: 295 YAGVACGYTTPLQGIGRYTR-TIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQ 353

Query: 500 TQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
           + E E +DR   +LPG QQEL++RVA A++ P ILVL+ GGP+D+TFAK D  I +I+W 
Sbjct: 354 SIEAEFVDRAGPLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWV 413

Query: 560 GYPGEAGAVALAEVIFGDHNPG 581
           GYPG+AG  A+A+V+FG  NPG
Sbjct: 414 GYPGQAGGTAIADVLFGTTNPG 435


>gi|224128360|ref|XP_002320310.1| predicted protein [Populus trichocarpa]
 gi|222861083|gb|EEE98625.1| predicted protein [Populus trichocarpa]
          Length = 635

 Score =  477 bits (1228), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 233/448 (52%), Positives = 314/448 (70%), Gaps = 16/448 (3%)

Query: 136 QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ 195
           Q +  EARA++N G A G+T+W+PN+NIFRDPRWGRGQETPGEDP+V GKYA SYVRG+Q
Sbjct: 2   QVVSDEARAMFNGGVA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVVVGKYAASYVRGLQ 60

Query: 196 GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVK 255
           G   N      +L+ +ACCKHFTAYDLDNW G  R+ F+A V+ QD+ DT+  PF  CVK
Sbjct: 61  GSDGN------RLKVAACCKHFTAYDLDNWNGVDRFHFNAEVSKQDMEDTFDVPFRMCVK 114

Query: 256 QGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP 315
           +G+ + +MC+YN+VNGIP+CAD NLL KT R      G +      +  I  +    +  
Sbjct: 115 EGKVASVMCSYNQVNGIPTCADPNLLKKTVR------GTLFQTVTLLEFIMGSNTILQPR 168

Query: 316 EDAVVDVLK-AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
                 +LK A +D++CG FL +HT+ AVK+  L E+EI+ AL N  +V+MRLG+F+G P
Sbjct: 169 RKQPRMLLKQASLDLDCGPFLGQHTEDAVKKGLLNEAEINNALLNTLTVQMRLGMFDGEP 228

Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
           + Q +G +G + VC+PAHQ LAL+AA+ GIVLLKN    LPL   + +S+A++GPN+N  
Sbjct: 229 SSQLYGNLGPNDVCTPAHQELALEAARQGIVLLKNHGPSLPLSTRRHLSVAIVGPNSNVT 288

Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVL 493
            T++GNYAG +C   TPLQ +Q Y + T++  GC  VAC S      A+D A+ AD  VL
Sbjct: 289 ATMIGNYAGLACGYTTPLQGIQRYAQ-TIHRQGCADVACVSDQQFSAAIDAARQADATVL 347

Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           +MGLDQ+ E E  DR  L+LPGRQQEL+++VA A+K P ILVL+ GGP+D++FA+ D  I
Sbjct: 348 VMGLDQSIEAEFRDRTGLLLPGRQQELVSKVAAASKGPTILVLMSGGPIDVSFAENDPKI 407

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
           GSI+WAGYPG+AG  A+++V+FG  NPG
Sbjct: 408 GSIVWAGYPGQAGGAAISDVLFGITNPG 435


>gi|115486735|ref|NP_001068511.1| Os11g0696400 [Oryza sativa Japonica Group]
 gi|77552754|gb|ABA95551.1| Glycosyl hydrolase family 3 C terminal domain containing protein
           [Oryza sativa Japonica Group]
 gi|113645733|dbj|BAF28874.1| Os11g0696400 [Oryza sativa Japonica Group]
          Length = 816

 Score =  474 bits (1221), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 249/570 (43%), Positives = 343/570 (60%), Gaps = 59/570 (10%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           F +C  +LP + R RDL+ R+T++EK+  L +      R+G+PAY WWSEALHG++  G 
Sbjct: 68  FRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGP 127

Query: 103 GIFFN-----------GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
              F+             +  AT F  VI +AASF+  LW  IGQA+  EARA+YN G+ 
Sbjct: 128 TTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKG 187

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-----DTFNGGKLKG 206
            G+T+W+PNIN+ RDPRWGR  ETPGEDP V G+YAV++VRG+Q          GG    
Sbjct: 188 -GLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAAGGDPNT 246

Query: 207 K-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
           + L+ SACCKH+ AYDLD+W   TR++FDARV  +D+ +T+Q PFE CV+ G  S +MC+
Sbjct: 247 RPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDVSSVMCS 306

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD-AEGYAKSPEDAVVDVLK 324
           YNRVNGIP+CAD  LLS+T RR WG HGYI SDCDAV ++ D A     +  +A    LK
Sbjct: 307 YNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEASAAALK 366

Query: 325 AGMDVNCGS-------------FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
           AG+D++CG              FL  +   AV + K+ ES+ID AL N +   MRLG F+
Sbjct: 367 AGLDLDCGESWKNDTDGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLMRLGYFD 426

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
               +  +  +G   +C+  H+ LAL  A+ GIVLLKN + LLPL  +K   + + GP+ 
Sbjct: 427 ---DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVNVRGPHV 483

Query: 432 NSAKTLL-GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADH 490
            + + ++ G+Y GP CR +TP Q +  YV                         +  A+ 
Sbjct: 484 QAPEKIMDGDYTGPPCRYVTPRQGVSKYVR-----------------------FSHRANT 520

Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
            +   GL+   E+E  DR D++LP  Q E I RVA+A+  P+ILV+L GG +D++FA+ +
Sbjct: 521 TIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILSGGGIDVSFAQNN 580

Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             IG+ILWAGYPG  G  A+A+VIFG HNP
Sbjct: 581 PKIGAILWAGYPGGEGGNAIADVIFGKHNP 610


>gi|125535311|gb|EAY81859.1| hypothetical protein OsI_37025 [Oryza sativa Indica Group]
          Length = 816

 Score =  474 bits (1219), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 249/570 (43%), Positives = 343/570 (60%), Gaps = 59/570 (10%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           F +C  +LP + R RDL+ R+T++EK+  L +      R+G+PAY WWSEALHG++  G 
Sbjct: 67  FRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGP 126

Query: 103 GIFFN-----------GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
              F+             +  AT F  VI +AASF+  LW  IGQA+  EARA+YN G+ 
Sbjct: 127 TTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKG 186

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-----DTFNGGKLKG 206
            G+T+W+PNIN+ RDPRWGR  ETPGEDP V G+YAV++VRG+Q          GG    
Sbjct: 187 -GLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQDIPGHEAVAAGGDPNT 245

Query: 207 K-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
           + L+ SACCKH+ AYDLD+W   TR++FDARV  +D+ +T+Q PFE CV+ G  S +MC+
Sbjct: 246 RPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFEMCVRDGDVSSVMCS 305

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD-AEGYAKSPEDAVVDVLK 324
           YNRVNGIP+CAD  LLS+T RR WG HGYI SDCDAV ++ D A     +  +A    LK
Sbjct: 306 YNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATWLGYTGAEASAAALK 365

Query: 325 AGMDVNCGS-------------FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
           AG+D++CG              FL  +   AV + K+ ES+ID AL N +   MRLG F+
Sbjct: 366 AGLDLDCGESWKNDTEGHPLMDFLTTYGMEAVNKGKMRESDIDNALTNQYMTLMRLGYFD 425

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
               +  +  +G   +C+  H+ LAL  A+ GIVLLKN + LLPL  +K   + + GP+ 
Sbjct: 426 ---DITQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDANKVGFVNVRGPHV 482

Query: 432 NSAKTLL-GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADH 490
            + + ++ G+Y GP CR +TP Q +  YV                         +  A+ 
Sbjct: 483 QAPEKIMDGDYTGPPCRYVTPRQGVSKYVR-----------------------FSHRANT 519

Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
            +   GL+   E+E  DR D++LP  Q E I RVA+A+  P+ILV+L GG +D++FA+ +
Sbjct: 520 TIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILSGGGIDVSFAQNN 579

Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             IG+ILWAGYPG  G  A+A+VIFG HNP
Sbjct: 580 PKIGAILWAGYPGGEGGNAIADVIFGKHNP 609


>gi|195614824|gb|ACG29242.1| auxin-induced beta-glucosidase [Zea mays]
 gi|413920229|gb|AFW60161.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 655

 Score =  474 bits (1219), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 231/446 (51%), Positives = 310/446 (69%), Gaps = 11/446 (2%)

Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF--NGG 202
           +YN G+A G+TFW+PN+NIFRDPRWGRGQETPGEDP V+ +YA +YVRG+Q      NGG
Sbjct: 1   MYNGGRA-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGG 59

Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
             + +L+ +ACCKHFTAYDLD W GT R+ F+A V  QDL DT+  PF +CV+ GRA+ +
Sbjct: 60  H-RNRLKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASV 118

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           MC+YN+VNG+P+CAD   L  T R +WG  GYI SDCD+V + +  + Y ++PEDA    
Sbjct: 119 MCSYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHYTRTPEDAAAAT 178

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           L+AG+D++CG FL  +  +AV   K+ ++++D AL N  +V+MRLG+F+G+P   PFG++
Sbjct: 179 LRAGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRL 238

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHG------LLPLPKSKSVSLALIGPNANSAKT 436
           G   VC+  HQ LAL AA+ G+VLLKN  G      +LPL  +    +A++GP+A++   
Sbjct: 239 GPADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVA 298

Query: 437 LLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMM 495
           ++GNYAG  CR  TPLQ +  Y     +  GC  VAC  +  I  AV+ A+ AD  V++ 
Sbjct: 299 MIGNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVA 358

Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
           GLDQ  E E LDR  L+LPGRQ ELI+ VA+A+K PVILVL+ GGP+DI FA+ D  I  
Sbjct: 359 GLDQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDG 418

Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPG 581
           ILW GYPG+AG  A+A+VIFG HNPG
Sbjct: 419 ILWVGYPGQAGGQAIADVIFGHHNPG 444


>gi|340370208|ref|XP_003383638.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 732

 Score =  472 bits (1214), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 241/551 (43%), Positives = 341/551 (61%), Gaps = 22/551 (3%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           ++F +C  +LPIS R +DL+SR+TL EKI+QL N+A +I RL IPAY+WWSE LHGVA  
Sbjct: 30  QSFSYCNYSLPISDRVKDLLSRMTLAEKITQLGNTAGSIDRLDIPAYQWWSEGLHGVAD- 88

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWA-- 158
             G+ FNG    ATSFPQVI TA+SF+  L++ I   +  EARA  N G           
Sbjct: 89  SPGVHFNGMFHNATSFPQVITTASSFNKTLYHEIAAVMSTEARAFANQGIVYFKQHQQLL 148

Query: 159 PNINIF--RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
            N  +F  RDPRWGR QETPGEDP +  +YA+ +V G QGD+         L+    CKH
Sbjct: 149 SNYLLFYCRDPRWGRAQETPGEDPYLNSQYAIQFVTGAQGDS-------KYLKVVTTCKH 201

Query: 217 FTAYDLDNW-KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           F  YDL+++  G TR+ F+A++T QD  +TY P F++CV++   + IMC+YN VNG+PSC
Sbjct: 202 FAGYDLEDYVDGETRHSFNAKITPQDFEETYYPAFKACVEEANVASIMCSYNEVNGVPSC 261

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
           AD  + +K AR  WGF G+I SDC A+  I +   Y  + +D V   LK G D+NCGS+ 
Sbjct: 262 ADGQINNKLARDTWGFDGFIASDCGAIDDIQNKHHYTNNTDDTVAAALKGGCDLNCGSYY 321

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
           Q H ++A     +   EI+ AL  LF+ RM+LG+F+  P +QP+  I  DVV S  HQ L
Sbjct: 322 QSHAQSAFLNGTITIGEINLALTRLFTARMKLGMFDP-PELQPYNAISPDVVNSLEHQAL 380

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           AL AA++ IVLL+N++ +LPL   K  ++A++GP+A +   + GNY G +   I+P++  
Sbjct: 381 ALNAARESIVLLQNNNDVLPLNFEKHSTIAVVGPHAMATDVMQGNYNGVAPYLISPVEGF 440

Query: 456 QNY-VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
           +N  +++ +   GCD     +     A DIA  AD V+ ++GLDQ+ E E  DR DL LP
Sbjct: 441 ENLGIDSVLTASGCDVNCEVTDGFQDAFDIAVKADAVIAVLGLDQSHESEGHDREDLFLP 500

Query: 515 GRQQELI-----TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
             Q + +     T  A     P+I+V++ G  VD+T  K  ++  +ILWAGYPG++G  A
Sbjct: 501 NLQDKFVQDLKNTLKAAGTNAPLIVVVMSGSSVDLTVTK--KHADAILWAGYPGQSGGQA 558

Query: 570 LAEVIFGDHNP 580
           +AE+I+G  NP
Sbjct: 559 IAEIIYGKVNP 569


>gi|125576923|gb|EAZ18145.1| hypothetical protein OsJ_33695 [Oryza sativa Japonica Group]
          Length = 591

 Score =  469 bits (1208), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 223/398 (56%), Positives = 293/398 (73%), Gaps = 7/398 (1%)

Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
           KYAV++V+G+QG++         LQ SACCKH TAYDL++W G  RY F+A+VT QDL D
Sbjct: 4   KYAVAFVKGMQGNS------SAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLED 57

Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
           TY PPF SCV   +A+ IMCAY  +NG+P+CA+ +LL+KT R  WG  GYI SDCDAV+I
Sbjct: 58  TYNPPFRSCVVDAKATCIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASDCDAVAI 117

Query: 305 IYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVR 364
           + DA+ Y ++PEDAV   LKAG+D+NCG+++Q+H  AA++Q KL E +ID+AL NLF++R
Sbjct: 118 MRDAQRYTQTPEDAVAVALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIR 177

Query: 365 MRLGLFNGNPTMQP-FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
           MRLG F+G+P     +G +GA  +C+P H+ LAL+AA DGIVLLKN  G+LPL ++   S
Sbjct: 178 MRLGHFDGDPRSNSVYGGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPLDRTAVAS 237

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVD 483
            A+IGPNAN    L+GNY GP C S TPL  +  Y++N  +  GC++ AC  A+ D+A  
Sbjct: 238 AAVIGPNANDGLALIGNYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVAATDQAAA 297

Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
           +A  +D+V L MGL Q QE E  DR  L+LPG QQ LIT VA+AAK+PVILVLL GGPVD
Sbjct: 298 VASSSDYVFLFMGLSQKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVD 357

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +TFA+ +  IG+ILWAGYPG+AG +A+A V+FGDHNPG
Sbjct: 358 VTFAQTNPKIGAILWAGYPGQAGGLAIARVLFGDHNPG 395


>gi|340377241|ref|XP_003387138.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 733

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 252/560 (45%), Positives = 338/560 (60%), Gaps = 22/560 (3%)

Query: 27  TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
           T+PP+   P+     T  +C   L    R +DL+SRLTL+EKISQL NSA AI RLGIP 
Sbjct: 23  TEPPYKTRPN----FTPAYCNYRLSFKDRVKDLLSRLTLEEKISQLGNSASAIDRLGIPG 78

Query: 87  YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           Y+WWSE LHGVA V  G+   G +   TSFPQ+I TA+SF+  L+Y IG+A+  EAR   
Sbjct: 79  YQWWSEGLHGVA-VSPGLHLGGNLTCTTSFPQIITTASSFNKSLFYEIGEAVSTEARGFA 137

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
           + GQ  G+T++ PNINI RDPRWGRGQET GEDP +T +YAV+ VRG QG+     K+  
Sbjct: 138 DNGQG-GLTYFTPNINIVRDPRWGRGQETAGEDPYLTSQYAVNLVRGAQGNDSEYKKI-- 194

Query: 207 KLQASACCKHFTAYDLDNW-KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
                A CKHF AYDL+++  G  R  F+A VT QDL +TY P F SCV  G    IMC+
Sbjct: 195 ----IATCKHFAAYDLESYINGDVRDSFNAEVTKQDLEETYFPAFRSCVTAGGVGSIMCS 250

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
           YN VNG+PSC D    +K AR +W F GY+ SDC A+  + +   Y  +P D V   LK 
Sbjct: 251 YNSVNGVPSCVDGVFNNKIARNKWKFDGYLVSDCGAIDDVMNKHHYTSTPTDTVAAGLKG 310

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           G D+NCGSF Q H   A     + E +IDRA+  LF+ RMRLGLF+  P  QP+     D
Sbjct: 311 GTDLNCGSFYQTHAMDAFLNGSITEVDIDRAVGRLFTARMRLGLFDL-PKYQPYSYFNTD 369

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
           VV +  HQ LALQAA++ IVLL+N +G LPL       +A++GPN  +  T+ G     +
Sbjct: 370 VVNTKQHQDLALQAARESIVLLQN-NGKLPLSYEDHHKIAVVGPNILANVTMQGISQVIA 428

Query: 446 CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
              I+P+   ++   +  Y  GCD     +     A  + K A  VV +MGLDQ  E+E 
Sbjct: 429 PYLISPVDGFKSKGLHVTYSLGCDVKCIVTDGFHDAFKLVKDAKAVVAVMGLDQGIERET 488

Query: 506 LDRVDLVLPGRQQELITRVAEAAKK-----PVILVLLCGGPVDITFAKYDRNIGSILWAG 560
           +DR D+ LPG Q + +  + +         P+I+V++ G  VD++ +K   +  +ILW G
Sbjct: 489 VDREDIFLPGLQDKFLLGLRDTLTNLQSPVPLIVVIMSGSSVDLSESKSLAD--AILWVG 546

Query: 561 YPGEAGAVALAEVIFGDHNP 580
           YPG++G  A+AEVI+G+ NP
Sbjct: 547 YPGQSGGQAIAEVIYGEVNP 566


>gi|37359708|dbj|BAC98299.1| LEXYL2 [Solanum lycopersicum]
          Length = 633

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 233/448 (52%), Positives = 318/448 (70%), Gaps = 6/448 (1%)

Query: 134 IGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRG 193
           IG+ +  E RA+YN GQA G+T+W+PN+NI+RDPRWGRGQET GEDP ++ +Y V+YV+G
Sbjct: 2   IGKVVSTEGRAMYNVGQA-GLTYWSPNVNIYRDPRWGRGQETAGEDPTLSSRYGVAYVKG 60

Query: 194 VQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESC 253
           +Q    + GK K  L+ ++CCKH+TAYD+D+WKG  RY F+A+VT QDL DT+ PPF+SC
Sbjct: 61  LQ--QRDDGK-KDMLKVASCCKHYTAYDVDDWKGIQRYNFNAKVTQQDLDDTFNPPFKSC 117

Query: 254 VKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK 313
           V  G  + +MC+YN+V+G P+C D +LL+   R QW  +GYI +DCD+++ +Y A+ Y K
Sbjct: 118 VLDGNVASVMCSYNQVDGKPTCGDYDLLAGVIRGQWKLNGYIVTDCDSLNEMYWAQHYTK 177

Query: 314 SPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
           +PE+     L AG+ +NCGS+L K+T+ AV Q  + ES IDRA+ N F+  MRLG F+GN
Sbjct: 178 TPEETAALSLNAGLGLNCGSWLGKYTQGAVNQGLVNESVIDRAVTNNFATLMRLGFFDGN 237

Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
           P  Q +G +G   +C+  HQ LA +AA+ GIVLLKN+ G LPL      SLA+IGPNAN 
Sbjct: 238 PKNQLYGNLGPKDICTEDHQELAREAARQGIVLLKNTAGSLPLSPKSIKSLAVIGPNANL 297

Query: 434 AKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVL 493
           A T++G+Y G  C+  TPL  L   V +TVY  GCD +AC++A +D A  +A  AD VVL
Sbjct: 298 AYTMVGSYEGSPCKYTTPLDGLGASV-STVYQQGCD-IACATAQVDNAKKVAAAADAVVL 355

Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           +MG DQT E+E  DR ++ LPG+Q  L+T VA  +K PVILV++ GG +D+ FA  +  +
Sbjct: 356 VMGSDQTIERESKDRFNITLPGQQSLLVTEVASVSKGPVILVIMSGGGMDVKFAVDNPKV 415

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            SILW G+PGEAG  ALA+V+FG HNPG
Sbjct: 416 TSILWVGFPGEAGGAALADVVFGYHNPG 443


>gi|320170454|gb|EFW47353.1| beta-xylosidase [Capsaspora owczarzaki ATCC 30864]
          Length = 779

 Score =  467 bits (1202), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 253/581 (43%), Positives = 346/581 (59%), Gaps = 22/581 (3%)

Query: 10  FPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKI 69
           F   C+        V  T    +C+  + +    PFC   L   QRA DLV RLTL EKI
Sbjct: 4   FGWTCITIAVAALVVAPTARALTCE--DAALRNLPFCNPNLAWEQRADDLVGRLTLQEKI 61

Query: 70  SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVI--------L 121
           SQ   +AP + RLG+ AYEWWSEALHGVA    G+ F G    +T FPQ+I         
Sbjct: 62  SQFGTTAPGVARLGVNAYEWWSEALHGVAE-SPGVNFTGNTPVSTCFPQIIGNNCSSLSR 120

Query: 122 TAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL 181
             A+F+      + Q I  EARA  NAG A G+T++ PNINIFRDPRWGRGQETPGEDP 
Sbjct: 121 VGATFNLDSVAAMAQVISTEARAFANAGHA-GLTYFTPNINIFRDPRWGRGQETPGEDPY 179

Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQD 241
           +T +Y  + V+ +Q      G+    L+  A CKH+TAYD+++W G  R+ F+A V+ QD
Sbjct: 180 LTSRYVETLVQNLQN-----GEDARYLKVVATCKHYTAYDMEDWGGIDRFHFNAVVSDQD 234

Query: 242 LADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDA 301
           L +T+ PPFE+CV+ G+ + +MC+YN VNGIPSCAD  + ++ AR QWGF GYI SDC A
Sbjct: 235 LVETFMPPFEACVRVGKGASLMCSYNAVNGIPSCADDFINNEIAREQWGFDGYIVSDCGA 294

Query: 302 VSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLF 361
           +  I     Y  + +      ++ G D++CG F Q H   A+    L E+++D +L  LF
Sbjct: 295 IDCIQYTHNYTNTTQATCAAGIQGGCDLDCGDFYQSHLMDAIGNATLHEADLDFSLRRLF 354

Query: 362 SVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS 421
             R+RLG F+   ++QP+ +I    + S  HQ LALQ A++ IVLL N +  LP   +  
Sbjct: 355 GHRIRLGEFDA-ASIQPYRQIPVSAINSQEHQELALQIARESIVLLGNDNNTLPFSLATV 413

Query: 422 VSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY--VENTVYYPGCDTVACSSASID 479
             LA+IGPNA+ A+TLLGNY G +   ITPL+  Q      +  +  GCD  +  ++   
Sbjct: 414 RKLAIIGPNADDAETLLGNYYGDAPYLITPLKGFQQLDPTLSITFVKGCDVNSTDTSGFV 473

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
            A   AK AD  ++++GL+QT E E LDR  LVLPG Q ELI  +  AA+ PVILV++ G
Sbjct: 474 AAAAAAKAADATIVVVGLNQTVESENLDRTTLVLPGVQAELILALTAAARGPVILVVMSG 533

Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            P+D++   +   + + LW GYPG+AG  ALAE +FG  +P
Sbjct: 534 SPIDLSNVIHP--VRAALWIGYPGQAGGRALAEAVFGVFSP 572


>gi|343172466|gb|AEL98937.1| beta-xylosidase, partial [Silene latifolia]
 gi|343172468|gb|AEL98938.1| beta-xylosidase, partial [Silene latifolia]
          Length = 374

 Score =  463 bits (1191), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 208/380 (54%), Positives = 281/380 (73%), Gaps = 7/380 (1%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLG+  YEWWSEALHGV+ VG G  F G    ATSFPQVI TAASF++ LW  IGQA+  
Sbjct: 1   RLGLQGYEWWSEALHGVSNVGPGTKFQGAFPAATSFPQVITTAASFNASLWQAIGQAVSD 60

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EARA+YN G A G+T+W+PN+NIFRDPRWGRGQETPGEDP ++ +YA SYV G+QG+  N
Sbjct: 61  EARAMYNGGTA-GLTYWSPNVNIFRDPRWGRGQETPGEDPTLSAQYAASYVTGLQGNYGN 119

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
                 +L+ +ACCKH+TAYDLDNW G  R+ F+A+V+ QDL DTY  PF++CV +G+ +
Sbjct: 120 ------RLKVAACCKHYTAYDLDNWNGMDRFHFNAKVSKQDLEDTYNVPFKACVLEGKVA 173

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +MC+YN+VNG P+CAD ++L  T R QW  +GYI SDCD+V ++YD + Y ++PE+A  
Sbjct: 174 SVMCSYNQVNGKPTCADPDILRNTIRGQWHLNGYIVSDCDSVGVLYDDQHYTRTPEEAAA 233

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
           D + AG+D++CG FL  HT+ A++Q  + E+ +++AL N  +V+MRLG+F+G P+ QPFG
Sbjct: 234 DTINAGLDLDCGPFLAVHTEGAIRQGLVTEAAVNQALANTITVQMRLGMFDGEPSAQPFG 293

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
            +G   VC+PAHQ LALQAA++GIVLLKN  G LPL   +  ++A+IGPNA +  T++GN
Sbjct: 294 NLGPRDVCTPAHQDLALQAAREGIVLLKNQVGSLPLSTVRHRNIAVIGPNAQATTTMIGN 353

Query: 441 YAGPSCRSITPLQALQNYVE 460
           YAG +C   +PLQ +  Y  
Sbjct: 354 YAGIACGYTSPLQGISRYAR 373


>gi|300121549|emb|CBK22068.2| unnamed protein product [Blastocystis hominis]
          Length = 690

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 241/528 (45%), Positives = 330/528 (62%), Gaps = 21/528 (3%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           +RAR LV+ LTL EK+S + ++A  + RL IP Y+WWSEALHGVA    G+ F      A
Sbjct: 3   ERARALVAELTLAEKMSLMGHTASEVKRLNIPKYQWWSEALHGVAA-SPGVVFQEPTPFA 61

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
           T+FPQV LTA SFD  L++ I   I  EAR + NA +A  +T+W+PN+N++RDPRWGRGQ
Sbjct: 62  TAFPQVALTAQSFDKPLFHDIASIISTEARVMNNAERA-NLTYWSPNVNVYRDPRWGRGQ 120

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           ETPGEDP +   YAV +VRG+Q      G+    L+ SACCKH++AYDL+NW G  R++F
Sbjct: 121 ETPGEDPFLVATYAVEFVRGLQE-----GEDPRYLKVSACCKHYSAYDLENWHGVERFEF 175

Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
           DA V+ +D+ DT+Q PFE CVK+G  S +MC+YN +NGIP+CADR LL  TAR  WGF G
Sbjct: 176 DAIVSDRDMTDTFQVPFEQCVKKGHVSSLMCSYNAINGIPACADRELLYGTARGGWGFEG 235

Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
           YITSDC A+  I     Y    +   +  ++A  D++CG F Q+H   +V+  +L E+E+
Sbjct: 236 YITSDCGAIDTIIYNHHYTNDTDTTAMLGVRATCDLDCGGFYQQHILHSVESGRLKEAEV 295

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
           D AL NLF V+MRLGLF+     Q +   G D + +  HQ +AL+AA++GI LLKN +  
Sbjct: 296 DDALANLFKVQMRLGLFD-PVEQQVYTHYGLDKLNTKEHQAMALRAAREGIALLKNQNDF 354

Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC 473
           LPL   K   + ++GP A  A  +LGNY G     +T  Q L+N          CD V  
Sbjct: 355 LPL-SLKDKHVVVMGPYAEDAGVMLGNYNGIPEFIVTVAQGLRNV---------CDHVDV 404

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
             +   +A+   +G D +V+ +GL+Q  E+E LDR DL+LP  Q+ L+  +      PV+
Sbjct: 405 VKSL--EALSKLEGVDLIVVTVGLNQEIEREGLDREDLLLPASQRALLDGLLAQTDVPVV 462

Query: 534 LVLLC-GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           L LL  GG VDI+  + + ++  +L  GY G  G  A+AEVI GD NP
Sbjct: 463 LTLLSGGGSVDISAYEQNEHVVGVLAVGYGGMFGGQAIAEVIVGDVNP 510


>gi|223945397|gb|ACN26782.1| unknown [Zea mays]
          Length = 516

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 211/321 (65%), Positives = 257/321 (80%), Gaps = 2/321 (0%)

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           MC+YNRVNG+P+CAD NLLS TAR+ WGF+GYITSDCDAV+II+DA+GYAK+ EDAV DV
Sbjct: 1   MCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAKTAEDAVADV 60

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           LKAGMDVNCGS++Q H  +A++Q K+ E +I+RALHNLF+VRMRLGLFNG+P    +G I
Sbjct: 61  LKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDI 120

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNS--HGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           G D VC+  HQ LAL+AAQDGIVLLKN    G LPL K    SLA+IG NAN A  L GN
Sbjct: 121 GPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGN 180

Query: 441 YAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQT 500
           Y GP C ++TPLQ LQ YV++T +  GC++ AC+  +I +AV  A  AD VVL MGLDQ 
Sbjct: 181 YFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIPEAVQAASSADSVVLFMGLDQD 240

Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
           QE+EE+DR+DL LPG+QQ LI  VA AAKKPVILVLLCGGPVD++FAK +  IG+ILWAG
Sbjct: 241 QEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAG 300

Query: 561 YPGEAGAVALAEVIFGDHNPG 581
           YPGEAG +A+A+V+FG+HNPG
Sbjct: 301 YPGEAGGIAIAQVLFGEHNPG 321


>gi|326513064|dbj|BAK03439.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 694

 Score =  439 bits (1130), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 218/416 (52%), Positives = 289/416 (69%), Gaps = 5/416 (1%)

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           P +    ETPGEDPL+  KYAV YV G+Q D   GG   G L+ +ACCKH+TAYD+DNWK
Sbjct: 91  PAYEWWSETPGEDPLLASKYAVGYVTGLQ-DAGAGGVTDGALKVAACCKHYTAYDVDNWK 149

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
           G  RY FDA+V+ QDL DT+QPPF+SCV  G  + +MC+YN+VNG P+CAD++LL    R
Sbjct: 150 GVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIR 209

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
             W  +GYI SDCD+V ++Y  + Y K+PE+A    +K+G+D+NCG+FL +HT AAV+  
Sbjct: 210 GDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAG 269

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
           +L E ++DRA+ N F + MRLG F+G+P    FG +G   VC+ +++ LA + A+ GIVL
Sbjct: 270 ELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVL 329

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYP 466
           LKNS G LPL      S+A+IGPNAN++ T++GNY G  C+  TPLQ L   V NTVY P
Sbjct: 330 LKNS-GALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGAKV-NTVYQP 387

Query: 467 GCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
           GC  V CS  S+    AV  A  AD  VL++G DQ+ E+E LDR  L+LPG+Q +L++ V
Sbjct: 388 GCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAV 447

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           A A+  PVILV++ GGP DI+FAK    I +ILW GYPGEAG  ALA+++FG HNP
Sbjct: 448 ANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGAALADILFGSHNP 503



 Score = 85.1 bits (209), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 39/68 (57%), Positives = 46/68 (67%)

Query: 25 DSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGI 84
          ++  P F+CD SN +   + FC      S RARDLVSRLTL EK+  LVN  PA+ RLGI
Sbjct: 31 EAQAPVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGI 90

Query: 85 PAYEWWSE 92
          PAYEWWSE
Sbjct: 91 PAYEWWSE 98


>gi|440799679|gb|ELR20723.1| betaxylosidase [Acanthamoeba castellanii str. Neff]
          Length = 748

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 240/571 (42%), Positives = 326/571 (57%), Gaps = 49/571 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
            PFC T+L   QR  DLVSRLTLD+ I Q+ + APA+P LGIPAY WW+E LHGV     
Sbjct: 13  LPFCNTSLTAGQRTDDLVSRLTLDQLIGQMGHQAPAVPSLGIPAYNWWTECLHGV----- 67

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNI- 161
                      TSFP      A+F+  L +++ +AI  EARAL N G   G+ FWAPNI 
Sbjct: 68  --LTKCGTNCPTSFPAPCALGAAFNMKLIHKMARAISNEARALNNEGIG-GLDFWAPNIK 124

Query: 162 ----------------------NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
                                 +I RDPRWGR  E PGEDP +T +Y   ++RG+Q    
Sbjct: 125 YSTQPTNKTRQESQLRNAMVCISINRDPRWGRNMEVPGEDPFMTAQYVAHFMRGLQE--- 181

Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
             G+     Q    CKHF AY L+ WK   R+ FDA V+  D  +TY P F+ C+ +GRA
Sbjct: 182 --GEDSRYPQVVGTCKHFAAYSLEAWKDYDRFMFDAIVSDYDFVETYLPAFKGCIVEGRA 239

Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
             IMC+YN VNG+PSCA+  LL    R  W F GY+ SDCDAV  IY+   + K+PE A 
Sbjct: 240 RSIMCSYNSVNGVPSCANDFLLRTILRDSWSFDGYVVSDCDAVDTIYNNHHFTKTPEGAC 299

Query: 320 VDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
              L AG D+NCG F QKH   A  + ++ E E+  A+  LF  RM LG+++  P  QP+
Sbjct: 300 AVALHAGTDLNCGDFYQKHLGKAHSEGRVTEDEVRLAVKRLFRQRMELGMWD-PPAEQPY 358

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
            +    VV S  H  LALQAA++ +VLL+N  G+LPL KS    +A+IGPNAN+ +T+LG
Sbjct: 359 KQYPPSVVGSREHSDLALQAARESMVLLQNRRGVLPLRKSVR-RVAVIGPNANATETMLG 417

Query: 440 NYAGPSCRS------ITPLQALQNYVENTV--YYPGCDTVACSSASIDKAVDIAKGADHV 491
           NY G  C        ++P  A++  +   +  Y  GCD  + ++  I +AV  A+ AD  
Sbjct: 418 NYYGSRCHDGTYDCIVSPYLAIKAKLPQALVTYNLGCDVDSTNTTGIPEAVKAAQAADVA 477

Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
           ++++GL+ + E E  DRV + LPG Q  LI  +  A   P ++V++ GG V I + K   
Sbjct: 478 IVVLGLNTSVESEGKDRVAITLPGMQDHLIKSIV-ATNTPTVVVMMHGGAVAIEWIK--D 534

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGE 582
            +  I+ A YPGE G  A+A+V+FGD+NPG+
Sbjct: 535 QVDGIVDAFYPGENGGQAIADVLFGDYNPGD 565


>gi|407922988|gb|EKG16078.1| Glycoside hydrolase family 3 [Macrophomina phaseolina MS6]
          Length = 800

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 224/536 (41%), Positives = 325/536 (60%), Gaps = 14/536 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV--GKG 103
           C ++     RA  LV  LTL+EK++   N++P +PRLGIP Y+WW+EALHGVA    G+ 
Sbjct: 41  CDSSATPLARATALVKELTLEEKLNNTGNTSPGVPRLGIPEYQWWNEALHGVAFTYPGQP 100

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           +  +G    ATSFPQ IL  A+FD  L Y +   +  EARA  N G++ G+ +W PNIN 
Sbjct: 101 MTESGNFSSATSFPQPILMGAAFDDELIYEVASVVSTEARAYSNGGRS-GLDYWTPNINP 159

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           ++DPRWGRGQETPGEDP     Y  + +RG++G+  +  K     +  A CKHFT YD++
Sbjct: 160 YKDPRWGRGQETPGEDPFHLASYVQNLIRGLEGNQNDPYK-----KIVATCKHFTGYDME 214

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           NW G  RY+FDA++ M+D+ + Y PPF++C ++ +    MC+YN VNG+P+CAD  LL  
Sbjct: 215 NWNGNFRYQFDAQINMRDMVEYYMPPFQACAREAKVGAFMCSYNAVNGVPTCADPWLLQT 274

Query: 284 TARRQWGFH---GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R  WG++    ++ SDCDA+  +Y    +A+S E AV D L AG D+NCG++ Q++  
Sbjct: 275 VLREHWGWNQEDQWVVSDCDAIQNVYLPHEWAESREQAVADTLNAGTDLNCGTYYQRYLP 334

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A +Q  + ++ +DRAL   +S  ++LG F+ N   QP+ +IG   V S   Q LAL+AA
Sbjct: 335 GAYEQGLINDTTLDRALTRTYSSLIKLGYFD-NADSQPYRQIGWQDVNSQHAQELALKAA 393

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
           Q+GIVLLKN  GLLPL      S+ALIG  AN+ + + GNYAG +    +PL A +    
Sbjct: 394 QEGIVLLKND-GLLPLSLDGVSSIALIGSWANATEQMQGNYAGVAPYLHSPLYAAEQLGV 452

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
              Y  G      ++         A+ +D ++++ G+D   E EELDRV +   G Q ++
Sbjct: 453 KVNYAEGASQSNPTTDQWGAEYTAAENSDVIIVVGGIDNDIESEELDRVAIAWSGPQLDM 512

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           IT++A    KPVI+V +  G +D T    + NI ++LW GYPG+ G  AL ++I G
Sbjct: 513 ITKLATYG-KPVIVVQMGAGQLDSTPLVSNANISALLWGGYPGQDGGTALFDIITG 567


>gi|6573772|gb|AAF17692.1|AC009243_19 F28K19.27 [Arabidopsis thaliana]
          Length = 696

 Score =  427 bits (1099), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 208/284 (73%), Positives = 248/284 (87%)

Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRAL 357
           DCDAVSIIYDA+GYAKSPEDAV DVLKAGMDVNCGS+LQKHTK+A++QKK+ E++IDRAL
Sbjct: 221 DCDAVSIIYDAQGYAKSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRAL 280

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
            NLFSVR+RLGLFNG+PT  P+G I  + VCSPAHQ LAL AA++GIVLLKN+  LLP  
Sbjct: 281 LNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFS 340

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS 477
           K    SLA+IGPNA+  KTLLGNYAGP C+++TPL AL++YV+N VY+ GCD+VACS+A+
Sbjct: 341 KRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVACSNAA 400

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           ID+AV IAK ADHVVL+MGLDQTQEKE+ DRVDL LPG+QQELIT VA AAKKPV+LVL+
Sbjct: 401 IDQAVAIAKNADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVLI 460

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           CGGPVDI+FA  +  IGSI+WAGYPGEAG +A++E+IFGDHNPG
Sbjct: 461 CGGPVDISFAANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPG 504



 Score =  349 bits (896), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 159/204 (77%), Positives = 178/204 (87%), Gaps = 1/204 (0%)

Query: 24  VDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
           V+S  PP SCDPSNP+T+ + FC+T LPI +RARDLVSRLT+DEKISQLVN+AP IPRLG
Sbjct: 17  VESAPPPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLG 76

Query: 84  IPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
           +PAYEWWSEALHGVA  G GI FNGT++ ATSFPQVILTAASFDSY W+RI Q IG EAR
Sbjct: 77  VPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEAR 136

Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
            +YNAGQA GMTFWAPNINIFRDPRWGRGQETPGEDP++TG YAV+YVRG+QGD+F+G K
Sbjct: 137 GVYNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSFDGRK 196

Query: 204 -LKGKLQASACCKHFTAYDLDNWK 226
            L   LQASACCKHFTAYDLD WK
Sbjct: 197 TLSNHLQASACCKHFTAYDLDRWK 220


>gi|332982588|ref|YP_004464029.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
 gi|332700266|gb|AEE97207.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
           50-1 BON]
          Length = 714

 Score =  423 bits (1088), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 240/560 (42%), Positives = 333/560 (59%), Gaps = 45/560 (8%)

Query: 41  ETFPFCK-TTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
           E  P  K  +L    RA+DLVSR+TL EKISQ++  APAIPRL IPAY WW+E LHGVA 
Sbjct: 8   EKLPAYKDVSLSFEDRAKDLVSRMTLPEKISQMIYDAPAIPRLDIPAYNWWNECLHGVAR 67

Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-----NAGQAI-- 152
            G           AT FPQ I  AA+F+  L +++ +AI  EARA +     N  + I  
Sbjct: 68  AGI----------ATVFPQAIAMAATFNPELIHKVAEAISDEARAKHHEAVRNGDRGIYK 117

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
           G+TFW+PNINIFRDPRWGRG ET GEDP +T +  V++V+G+QGD          L+  A
Sbjct: 118 GLTFWSPNINIFRDPRWGRGHETYGEDPYLTSRMGVAFVKGLQGDD------PKYLKVVA 171

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
             KH+  +   +   + R+ FDARV+ +DL +TY P FE CVK+G+A  IM AYNR NG 
Sbjct: 172 TPKHYAVH---SGPESQRHSFDARVSQKDLRETYLPAFEECVKEGKAVSIMGAYNRTNGE 228

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
           P CA + LL    R +WGF GY+ SDC A+  I+      K+  ++    +  G ++NCG
Sbjct: 229 PCCASKTLLKDILRDEWGFDGYVVSDCGAIDDIHMHHKVTKTAAESAALAVNNGCELNCG 288

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
              + +   AV+Q  + E  ID+A+  LF+ RMRLG+F+  P M  +  I  DV  SP H
Sbjct: 289 KTYE-YLCQAVEQGLISEETIDQAVIKLFTARMRLGMFD-PPEMVRYAHIPYDVNDSPEH 346

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + LAL+ A+  IVLLKN   +LPL K K  ++A+IGPNA+    LL NY G   + +TPL
Sbjct: 347 RELALETARQSIVLLKNDENILPLSK-KLKTIAVIGPNADDLDVLLANYFGTPSKYVTPL 405

Query: 453 QALQNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---- 505
           + ++N V      +Y  GC+    S    D+AV+IA+ AD V++ +GL    E EE    
Sbjct: 406 EGIKNKVSPDTKVLYAKGCEVTGNSVDGFDEAVNIAEMADIVIMCLGLSPRIEGEEGDVA 465

Query: 506 -----LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
                 DR+ + LPG Q++L+  +     KP++LVLL G  + I +A    ++ +I+ A 
Sbjct: 466 DSDGGGDRLHIDLPGMQEQLLETIY-GTGKPIVLVLLNGSAIAINWAH--EHVPAIIEAW 522

Query: 561 YPGEAGAVALAEVIFGDHNP 580
           YPGE G  A+A+V+FGD+NP
Sbjct: 523 YPGEEGGTAIADVLFGDYNP 542


>gi|392590128|gb|EIW79457.1| glycoside hydrolase family 3 protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 770

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 232/539 (43%), Positives = 320/539 (59%), Gaps = 14/539 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+L  +QRA  LV   T++E I+  VN +P +PRLG+PAY+WWSE LHGVA    G+ 
Sbjct: 37  CDTSLNATQRAAALVELFTVEELINNTVNGSPGVPRLGLPAYQWWSEGLHGVAD-SPGVN 95

Query: 106 FN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F+  G    ATSFPQ I+ +A+FD  L   +G  +G+E R+  N G A G+ FW PNIN 
Sbjct: 96  FSTSGPFSYATSFPQPIVMSAAFDDALIKAVGGVVGMEGRSFNNYGHA-GLDFWTPNINP 154

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRGQETPGEDP    +Y  + ++G+QG    G   +   Q  A CKHF  YDL+
Sbjct: 155 FKDPRWGRGQETPGEDPYHIAQYVYNLIQGLQG----GVNPEPYFQVVATCKHFAGYDLE 210

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W+   RY FDA +T QDL++ Y P F+SC +  +A   MC+YN VNGIP+CAD  LL  
Sbjct: 211 DWENNFRYGFDALITTQDLSEFYLPSFQSCYRDAQAGASMCSYNAVNGIPTCADTYLLQD 270

Query: 284 TARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R  W F    ++TSDCDAV  IY+   Y   P+ A  D L+AG D++CG+F  ++   
Sbjct: 271 ILRDYWNFDETRWVTSDCDAVENIYNPHNYTALPQQAAADALRAGTDLDCGTFYTEYLPL 330

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A  Q  + E+E+  AL   ++  +RLG F+     QP+ + G   V +P  Q LA  AA 
Sbjct: 331 AYNQSLITETELRAALTRQYASLVRLGYFD-PAAQQPYRQYGWSNVDTPYAQQLAYTAAT 389

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           +GI LLKN  G LPLP +   ++ALIGP AN+   + GNY G +   ++PLQ       N
Sbjct: 390 EGITLLKND-GTLPLPSTLK-NIALIGPWANATNQMQGNYFGVAPYLVSPLQGALAAGYN 447

Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
             Y  G +  + S+A    A+  A+ AD VV   G+D T E E +DR ++  PG Q +LI
Sbjct: 448 VTYVFGTNITSNSTAGFAAAIAAAREADAVVYAGGIDVTVEAEAMDRYNVTWPGNQLQLI 507

Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             +A A  KP ++    GG VD T  K + ++ S++WAGYPG++G  AL ++I G   P
Sbjct: 508 GELA-ALGKPFVVAQFGGGQVDDTEIKANASVNSLIWAGYPGQSGGQALFDIISGKVAP 565


>gi|389748500|gb|EIM89677.1| glycoside hydrolase family 3 protein [Stereum hirsutum FP-91666
           SS1]
          Length = 770

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 230/539 (42%), Positives = 320/539 (59%), Gaps = 14/539 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+     RA+ LV+ +TL+E ++  VN++P +PRLG+P YEWWSEALHGVA    G+ 
Sbjct: 36  CNTSANFLDRAKALVNAMTLEEMVNNTVNTSPGVPRLGLPPYEWWSEALHGVAS-SPGVT 94

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F  +G   GATSFP+ IL +A+FD  L + +   I  EARA  N   + G+ F+ PNIN 
Sbjct: 95  FETSGDFSGATSFPEPILMSAAFDDDLIFSVASTISTEARAFGNTNHS-GLDFFTPNINP 153

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRGQETPGEDPL T +Y    + G+QG    G       +  A CKHF AYDL+
Sbjct: 154 FKDPRWGRGQETPGEDPLHTSRYVYQLITGLQG----GVGPSPYYKIIADCKHFAAYDLE 209

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           NW+G  R  F+A V+ QDLA+ Y P F+SCV+  +   +MC+YN VNG+P+C    LL  
Sbjct: 210 NWEGNNRMAFNAIVSTQDLAEFYTPSFQSCVRDAKVGSVMCSYNAVNGVPACGSPYLLQD 269

Query: 284 TARR--QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R   + G   +ITSDCDAV  I+D   Y  +  +A    L AG DV+CG+   +    
Sbjct: 270 LVRDYFELGNDTWITSDCDAVGNIFDPHNYTTTLTNASAVALLAGTDVDCGTSYSETLGE 329

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           AV +  + +S+++RAL  L+   +RLG F+   ++ P+  +GA  V +PA Q LA  AA 
Sbjct: 330 AVSEGLVSKSDVERALVRLYGSLVRLGYFDPEDSV-PYRALGASDVNTPAAQTLAYTAAV 388

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           +GIVLLKN  GLLPL  + S  +ALIGP AN+   + GNY G +   I+PL    +   N
Sbjct: 389 EGIVLLKN-DGLLPLSSNVS-HIALIGPWANATTQMQGNYEGIAPLLISPLDGFTSAGFN 446

Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
             +  G      S++    A+ +A  AD +V + G+D T E E  DR  +  PG Q ELI
Sbjct: 447 VSFTNGTTISGNSTSGFADALSMASAADVIVYIGGIDDTVEAEGQDRTSITWPGNQLELI 506

Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             +  A  KP +++ + GG VD T  K + ++ ++LW GYPG+AG  ALA++I G   P
Sbjct: 507 GELG-AFGKPFVVIQMGGGQVDDTELKANSSVNALLWGGYPGQAGGKALADIITGVQAP 564


>gi|297745533|emb|CBI40698.3| unnamed protein product [Vitis vinifera]
          Length = 461

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 207/379 (54%), Positives = 273/379 (72%), Gaps = 5/379 (1%)

Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
           +YN G A G+TFW+PN+NIFRDPRWGRGQETPGEDPL++ KYA  YVRG+Q  + +G   
Sbjct: 1   MYNVGLA-GLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQ-QSDDGSP- 57

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
             +L+ +ACCKH+TAYDLDNWKG  R+ F+A VT QD+ DT+QPPF+SCV  G  + +MC
Sbjct: 58  -DRLKIAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMC 116

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
           +YN+VNG P+CAD +LLS   R +W  +GYI SDCD+V + Y+++ Y K+PE+A    + 
Sbjct: 117 SYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHYTKTPEEAAAKAIL 176

Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
           AG+D+NCGSFL +HT+AAVK   + ES +D+A+ N F+  MRLG F+GNP+   +GK+G 
Sbjct: 177 AGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGP 236

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
             VC+  HQ LA +AA+ GI+LLKNS G LPL  +   +LA+IGPNAN  KT++GNY G 
Sbjct: 237 KDVCTLEHQELAREAARQGIMLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNYEGT 296

Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
            C+  TPLQ L   V  T Y  GC  VACS+A ID+A  IA  AD  VL++G+DQ+ E E
Sbjct: 297 PCKYTTPLQGLMALVA-TTYLSGCSNVACSTAQIDEAKKIAAAADATVLIVGIDQSIEAE 355

Query: 505 ELDRVDLVLPGRQQELITR 523
             DRV++ LPG+Q  LIT 
Sbjct: 356 GRDRVNIQLPGQQPLLITE 374


>gi|344303941|gb|EGW34190.1| hypothetical protein SPAPADRAFT_65353 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 788

 Score =  417 bits (1071), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 217/536 (40%), Positives = 326/536 (60%), Gaps = 8/536 (1%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C   LP  QRA+ +V   T+DE I+ + N++P + RLG+P Y+WWSEALHG+A       
Sbjct: 61  CNPHLPTEQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEALHGIAR--SNFT 118

Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
            +G    ATSFPQ IL   +F++ L+ ++G  IG EARA  N G+A G+ F++PNIN FR
Sbjct: 119 ASGEYSHATSFPQPILMGGAFNNDLYKQVGNVIGTEARAFNNVGRA-GLDFYSPNINPFR 177

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           D RWGRGQE   E P++ G YA++YV+G+QG   +  +    LQ +A CKHF  YD+++W
Sbjct: 178 DARWGRGQEVASESPVLVGNYALNYVQGLQGG-LDSNQNDDTLQVAATCKHFVGYDMESW 236

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
              +R  ++A ++ QDLAD Y P F+SCV+  +A+G MC+YN VNG+P+CA    L+   
Sbjct: 237 NQHSRLGYNAIISDQDLADFYLPTFQSCVRDAKAAGAMCSYNAVNGVPACASEFFLNTVL 296

Query: 286 RRQWGF-HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
           R  + F +G I SDCDA+  +++   YA+    A  D +KAG+DVNCG   Q +   A+ 
Sbjct: 297 RDGFDFQNGVIHSDCDAIYNVWNPHLYAQDLGGAAADAIKAGVDVNCGDTYQNNLGYALG 356

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
            K + E++I  ++   +S  +RLG F+ +P    + K   + V +P    LA QAA +GI
Sbjct: 357 NKTINENQIRTSVTRQYSNLIRLGYFD-SPQTNKYRKYDWNDVSTPQANQLAYQAAVEGI 415

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
            LLKN  G LP  K K   +A+IGP AN+   +LG+YAG     I+PLQ  Q+      Y
Sbjct: 416 ALLKND-GTLPFNKQKVRKVAVIGPWANATTQMLGDYAGTPPYMISPLQGAQSEGFQVEY 474

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
             G       ++    A++ AKGAD +V   G+D + E E LDR  L  PG Q +L++++
Sbjct: 475 ALGTQINTTDTSGYTAALNAAKGADAIVYFGGIDNSVENEALDRESLAWPGNQLDLVSKL 534

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +   KKP++++   GG +D T  K ++N+ +I++AGYPG++G  A+ +++ G + P
Sbjct: 535 S-GLKKPLVVLQFGGGQIDDTEIKNNKNVNAIVYAGYPGQSGGTAIWDILSGKYAP 589


>gi|336377735|gb|EGO18896.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 766

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 238/540 (44%), Positives = 326/540 (60%), Gaps = 16/540 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+L    RA  +V   T+DE I+  V+++P +PRLG+P Y+WWSE LHGVA    G+ 
Sbjct: 37  CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVAD-SPGVN 95

Query: 106 FN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F+  G    ATSFPQ I+  A+FD  L   +G  +G+E R+  N G+A G+ FW PNIN 
Sbjct: 96  FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRA-GLDFWTPNINP 154

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRGQETPGEDP    +Y  + V+G+QG    G   K   Q  + CKHF AYDL+
Sbjct: 155 FKDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG----GLDPKPYYQVISTCKHFAAYDLE 210

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W G  RY FDA VT QDL++ Y P F+SC +  +    MC+YN VNGIPSCA+  LL  
Sbjct: 211 DWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPSCANTYLLQS 270

Query: 284 TARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R  WGF    ++TSDCDAV  IYD   Y K+PE+AV D LKAG D++CG+F  ++   
Sbjct: 271 ILRDFWGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVADALKAGTDIDCGTFYSEYLPG 330

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT-MQPFGKIGADVVCSPAHQVLALQAA 400
           A  Q  + E+E+ +AL   ++  +RLG F  +PT +QP+ +   + V +P  Q LA QAA
Sbjct: 331 AYNQSLITETELRQALIRQYASLVRLGYF--DPTDIQPYRQYNWNNVDTPQAQQLAYQAA 388

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
            +GIVLLKN  G LPL  S   ++ALIGP  N+   + GNY G +   I+PL        
Sbjct: 389 AEGIVLLKND-GTLPL-SSDIKNIALIGPWGNATGEMQGNYYGVAPYLISPLMGAVATGY 446

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
           N  Y  G +  +  ++    A+  A+GAD V+   G+D+T E E  DR  +  PG Q +L
Sbjct: 447 NVTYVFGTNITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYITWPGNQLDL 506

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  +A A  KP+++V   GG VD T  K +  + ++LWAGYPG++G  AL ++I G   P
Sbjct: 507 VGELA-AVGKPLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALFDIISGKVAP 565


>gi|409079878|gb|EKM80239.1| hypothetical protein AGABI1DRAFT_120267 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 786

 Score =  416 bits (1069), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 227/552 (41%), Positives = 327/552 (59%), Gaps = 16/552 (2%)

Query: 34  DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
           D +N   ++ P C +    + RA+ L+   T DE I    N++P +PRLG+P YEWWSEA
Sbjct: 26  DCTNGPLKSTPVCDSAKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEA 85

Query: 94  LHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
           LHGV G   G+ F  +G    ATSFPQ I+  A+FD  L   +   +  EARA  N G+A
Sbjct: 86  LHGV-GHSPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRA 144

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
            G+ ++ PNIN F+DPRWGRGQETPGEDP    +Y    V G+QG    G      ++ +
Sbjct: 145 -GLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQG----GIDPWPYIKVA 199

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           A CKHF AYDL+NW+G  R+ FDA+V+ QDL++ Y PPF+SCV+  +A+ +MC+YN VNG
Sbjct: 200 ADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNG 259

Query: 272 IPSCADRNLLSKTARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           +P+CA   LL    R  WGF    ++TSDC A+  I+D+  + +S  +A    LKAG D+
Sbjct: 260 VPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDI 319

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           +CGS    H  AA+ Q  +   ++ RA    ++  +RLG F+ + + Q + +     V +
Sbjct: 320 DCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSDS-QTYRQFDWSDVNT 378

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           P  Q L+ +AA +G+VLLKN  GLLPL P  K++  A+IGP  N+  ++ GNY G +   
Sbjct: 379 PEAQALSRRAAVEGLVLLKND-GLLPLAPDGKTI--AIIGPYTNATSSMQGNYFGNAPII 435

Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
            +P Q  Q+     V   G      SSA   +A++ AK AD VV + G+D T E+E LDR
Sbjct: 436 TSPFQGAQDVGFKVVSAAGTTVNGTSSAGFAEAINTAKAADVVVFVGGIDNTLEREGLDR 495

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
             +  PG Q +L+  +A    KP+I+V   GG VD T    ++ + +I+WAGYPG++G  
Sbjct: 496 SSISWPGNQLDLVKDLASLG-KPLIVVQFGGGQVDDTEILANKKVQAIIWAGYPGQSGGT 554

Query: 569 ALAEVIFGDHNP 580
           A+ ++I G   P
Sbjct: 555 AIFDIIVGSTAP 566


>gi|336365124|gb|EGN93476.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 732

 Score =  416 bits (1069), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 238/540 (44%), Positives = 326/540 (60%), Gaps = 16/540 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+L    RA  +V   T+DE I+  V+++P +PRLG+P Y+WWSE LHGVA    G+ 
Sbjct: 22  CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVAD-SPGVN 80

Query: 106 FN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F+  G    ATSFPQ I+  A+FD  L   +G  +G+E R+  N G+A G+ FW PNIN 
Sbjct: 81  FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRA-GLDFWTPNINP 139

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRGQETPGEDP    +Y  + V+G+QG    G   K   Q  + CKHF AYDL+
Sbjct: 140 FKDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG----GLDPKPYYQVISTCKHFAAYDLE 195

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W G  RY FDA VT QDL++ Y P F+SC +  +    MC+YN VNGIPSCA+  LL  
Sbjct: 196 DWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPSCANTYLLQS 255

Query: 284 TARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R  WGF    ++TSDCDAV  IYD   Y K+PE+AV D LKAG D++CG+F  ++   
Sbjct: 256 ILRDFWGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVADALKAGTDIDCGTFYSEYLPG 315

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT-MQPFGKIGADVVCSPAHQVLALQAA 400
           A  Q  + E+E+ +AL   ++  +RLG F  +PT +QP+ +   + V +P  Q LA QAA
Sbjct: 316 AYNQSLITETELRQALIRQYASLVRLGYF--DPTDIQPYRQYNWNNVDTPQAQQLAYQAA 373

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
            +GIVLLKN  G LPL  S   ++ALIGP  N+   + GNY G +   I+PL        
Sbjct: 374 AEGIVLLKND-GTLPL-SSDIKNIALIGPWGNATGEMQGNYYGVAPYLISPLMGAVATGY 431

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
           N  Y  G +  +  ++    A+  A+GAD V+   G+D+T E E  DR  +  PG Q +L
Sbjct: 432 NVTYVFGTNITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYITWPGNQLDL 491

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  +A A  KP+++V   GG VD T  K +  + ++LWAGYPG++G  AL ++I G   P
Sbjct: 492 VGELA-AVGKPLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALFDIISGKVAP 550


>gi|426198356|gb|EKV48282.1| hypothetical protein AGABI2DRAFT_67675 [Agaricus bisporus var.
           bisporus H97]
          Length = 763

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 227/552 (41%), Positives = 327/552 (59%), Gaps = 16/552 (2%)

Query: 34  DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
           D +N   ++ P C +T   + RA+ L+   T DE I    N++P +PRLG+P YEWWSEA
Sbjct: 26  DCTNGPLKSTPVCDSTKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEA 85

Query: 94  LHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
           LHGV G   G+ F  +G    ATSFPQ I+  A+FD  L   +   +  EARA  N G+A
Sbjct: 86  LHGV-GHSPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRA 144

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
            G+ ++ PNIN F+DPRWGRGQETPGEDP    +Y    V G+QG    G      ++ +
Sbjct: 145 -GLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQG----GIDPWPYIKVA 199

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           A CKHF AYDL+NW+G  R+ FDA+V+ QDL++ Y PPF+SCV+  +A+ +MC+YN VNG
Sbjct: 200 ADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNG 259

Query: 272 IPSCADRNLLSKTARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           +P+CA   LL    R  WGF    ++TSDC A+  I+D+  + +S  +A    LKAG D+
Sbjct: 260 VPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDI 319

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           +CGS    H  AA+ Q  +   ++ RA    ++  +RLG F+ + + Q + +     V +
Sbjct: 320 DCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSHS-QTYRQFDWSDVNT 378

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           P  Q L+ +AA +G+VLLKN  GLLPL P  K++  A+IGP  N+  ++ GNY G +   
Sbjct: 379 PEAQALSRRAAVEGLVLLKND-GLLPLAPDGKTI--AIIGPYTNATSSMQGNYFGNAPFI 435

Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
            +P Q  Q+     V   G      SSA   +A++ A+ AD VV + G+D T E+E LDR
Sbjct: 436 TSPFQGAQDVGFKVVSAAGTIVNGTSSAGFAEAINTARAADVVVFVGGIDNTLEREGLDR 495

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
             +  PG Q +L+  +A    KP+I+V   GG VD T    +  + +I+WAGYPG++G  
Sbjct: 496 SSISWPGNQLDLVKDLASLG-KPLIVVQFGGGQVDDTEILANEKVQAIIWAGYPGQSGGT 554

Query: 569 ALAEVIFGDHNP 580
           A+ ++I G   P
Sbjct: 555 AIFDIIVGATAP 566


>gi|443893988|dbj|GAC71176.1| hypothetical protein PANT_1d00031 [Pseudozyma antarctica T-34]
          Length = 759

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 233/559 (41%), Positives = 331/559 (59%), Gaps = 25/559 (4%)

Query: 33  CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
           C P  P +     C T+L    RA  LV+  T  E I+  +N+AP +PRLGIP Y+WW+E
Sbjct: 24  CSPGTPLSAN-AVCDTSLDYWTRATSLVAEFTTQELINNTINTAPGVPRLGIPPYQWWTE 82

Query: 93  ALHGVAGVGKGIFFNGTIRG----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           ALHGVAG   G+ F   +      AT+FPQ+I   A+FD  L+ ++   I  E RA  NA
Sbjct: 83  ALHGVAG-SPGVNFADDVEAPYGSATNFPQIINLGATFDDALYEQVATHIANETRAFNNA 141

Query: 149 GQAIGMTFWAP-NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
           G+A G+  ++P NIN FRDPRWGRGQET GEDPL   +YAV  V+G+QG        + +
Sbjct: 142 GKA-GLNMYSPLNINCFRDPRWGRGQETTGEDPLHMSRYAVKMVQGLQGPN------QDE 194

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           L+ +A CKH+ AYDL+ W G  RY+FDA+V+ Q+LA+ Y P F +CV+ G+A  +M +YN
Sbjct: 195 LRLAATCKHYLAYDLEKWDGVERYQFDAQVSRQELAEFYLPQFRACVRDGKAVTLMTSYN 254

Query: 268 RVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
            VN +P  A R  L   AR++WG    H Y+TSDCDAV+ ++D   YA S   A  D + 
Sbjct: 255 AVNNVPPSASRYYLETLARKEWGLDKKHNYVTSDCDAVANVFDGHHYADSYVQAAADSIN 314

Query: 325 AGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
           AG D+NCG+    +   A++Q       I  A+  +++ ++RLGLF+     QP  ++G 
Sbjct: 315 AGTDLNCGATYSDNLGQALEQNLTDVETIRTAVARMYASQVRLGLFDPK-QGQPLRELGW 373

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
           + V + A Q LA  +A   + LLKN +G LP+  +  V  A+IGP +N+   L GNYAGP
Sbjct: 374 EHVNTKAAQDLAYSSAAASVTLLKN-NGTLPVDGATKV--AVIGPYSNATFALRGNYAGP 430

Query: 445 SCRSITPLQALQN-YVENTVYYPGCDTVACSSASID--KAVDIAKGADHVVLMMGLDQTQ 501
              +IT  +A Q  + + T+      T++ +    D   A+ +AK AD V+   G+D T 
Sbjct: 431 GPFAITMTEAAQRVFSQATISSANGTTISGTYNHTDAEAAMQLAKEADLVIFAGGIDPTI 490

Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
           E EELDR  +  P  Q +LI  +   AKK + +V   GG +D    K D NIG++LWAGY
Sbjct: 491 ESEELDRATIAWPPNQLQLIHALGGMAKK-MAVVQFGGGQIDGASIKADGNIGALLWAGY 549

Query: 562 PGEAGAVALAEVIFGDHNP 580
           PG++GA+A+ +VI G+  P
Sbjct: 550 PGQSGALAVMDVIAGNTAP 568


>gi|421060771|ref|ZP_15523202.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           B3]
 gi|421065248|ref|ZP_15527033.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A12]
 gi|421073214|ref|ZP_15534285.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A11]
 gi|392444242|gb|EIW21677.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A11]
 gi|392454445|gb|EIW31278.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           B3]
 gi|392459366|gb|EIW35779.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A12]
          Length = 724

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 235/566 (41%), Positives = 333/566 (58%), Gaps = 50/566 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           E F +   TL   QRA+DLVSR+TL+EK++Q+V  +PAIPRLG+PAY WWSEALHGVA  
Sbjct: 2   EIFAYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVARA 61

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIG 153
           G           AT FPQ I  AA+FD  L + + + I +E RA ++        G   G
Sbjct: 62  GV----------ATVFPQAIGLAATFDEKLIFNVAEVISIEGRAKFHEFQRKGDHGIYKG 111

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW+PN+NIFRDPRWGRGQET GEDP +TG+  VS+++G+QG        K  L+A+AC
Sbjct: 112 LTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQD------KKYLRAAAC 165

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KHF  +   +   + R+ FDA V+ +DL +TY P F+ CVK+     +M AYNRVNG P
Sbjct: 166 AKHFAVH---SGPESERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEP 222

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
            C    LL +T RR+WGF G++ SDC A+   ++      S  ++V   L  G D+NCG+
Sbjct: 223 CCGSNMLLKETLRREWGFTGHVVSDCWAIKDFHENHRVTSSAPESVAMALNNGCDLNCGN 282

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
            +  +   A ++  + E  I+ A+  L   RM+LGLF+    + P+ KIG        H+
Sbjct: 283 -MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENV-PYTKIGFHQNDCQEHR 340

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
             AL+ ++  +VLLKN + LLPL ++   S+A+IGPNANS + L GNY G +   IT L+
Sbjct: 341 EFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYITVLE 400

Query: 454 ALQNYV-ENTV--YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
            ++  V ++T+  Y  GC       + +  +     +AV  A+ AD VV+ MGLD + E 
Sbjct: 401 GIREAVGKDTMVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIVVMCMGLDASIEG 460

Query: 504 EE---------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
           EE          D++ L LPG QQEL+  + +   KP+ILVLL G  + +T+A     I 
Sbjct: 461 EEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTG-KPIILVLLAGSALAVTWAA--EKIP 517

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNP 580
           +I+ A YPG  G  ALA  IFG+++P
Sbjct: 518 AIIQAWYPGAEGGKALASAIFGEYSP 543


>gi|396473219|ref|XP_003839293.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
 gi|312215862|emb|CBX95814.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
          Length = 789

 Score =  410 bits (1054), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 230/555 (41%), Positives = 326/555 (58%), Gaps = 20/555 (3%)

Query: 34  DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
           D  N   +T   C T+     RA+ LV+  TL+EKI+   + +P +PRLGIP Y+WWSE 
Sbjct: 23  DCQNGPLQTETICNTSASPLDRAKSLVTLYTLEEKINATSSGSPGVPRLGIPPYQWWSEG 82

Query: 94  LHGVAGVGKGIFFNGT-IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI 152
           LHG+AG       +G     +TSFPQ IL  A+FD +L   + + I  EARA  NA +  
Sbjct: 83  LHGIAGPYTNFSTSGIEYSYSTSFPQPILMGAAFDDHLITDVAKVISTEARAFNNANRT- 141

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
           G+ FW PNIN FRDPRWGRGQETPGED      Y  + + G+QG+T +  K     +  A
Sbjct: 142 GLDFWTPNINPFRDPRWGRGQETPGEDAFHLSSYVKALIAGLQGETTDPYK-----RVVA 196

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
            CKHF  YD+++W G  RY+FDA+++ QDL + Y  PF++CV Q      MC+YN VNG+
Sbjct: 197 TCKHFAGYDIEDWNGNLRYQFDAQISQQDLVEYYLQPFQACV-QANVGAFMCSYNAVNGV 255

Query: 273 PSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           P+CAD  LL    R  WG+     ++TSDCDAV  IY    ++ + E AV D L AG D+
Sbjct: 256 PTCADPYLLQTILREHWGWTNEEQWVTSDCDAVQNIYLPHQWSATREQAVADALIAGTDL 315

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           +CG+++Q+H   A  Q  + E+ +D+AL   +S  +RLG F+ +   QP+ + G D V +
Sbjct: 316 DCGTYMQEHLPGAFAQGLVNENVLDQALVRQYSSLVRLGWFD-DAADQPYRQFGWDSVAT 374

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
            A Q LA +AA +GIVLLKN  G+LPL    SVSL + G  AN+   LLGNYAG      
Sbjct: 375 DASQALARRAAVEGIVLLKND-GVLPLSIDSSVSLGVFGDWANATSQLLGNYAGVPTYLH 433

Query: 450 TPLQALQNYVEN-TVYYPGCDTVACSSASIDKAVDIAKG---ADHVVLMMGLDQTQEKEE 505
           +PL ALQ   EN T+ Y G +       + ++   ++     +D ++ + G+D + E+E 
Sbjct: 434 SPLWALQQ--ENLTINYAGGNPGGQGDPTTNRWSSLSGAIATSDILIYIGGIDNSIEEEG 491

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            DR  L   G Q ++I ++A A  KP I+V++ GG +D      + NI +ILWAGYPG+ 
Sbjct: 492 HDRTSLAWTGAQLDVIFQLA-ATGKPTIVVVMGGGQIDSAPLANNANISAILWAGYPGQD 550

Query: 566 GAVALAEVIFGDHNP 580
           G  A+ +++ G   P
Sbjct: 551 GGPAIVDILTGKSPP 565


>gi|302683012|ref|XP_003031187.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
 gi|300104879|gb|EFI96284.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
          Length = 752

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 245/552 (44%), Positives = 321/552 (58%), Gaps = 16/552 (2%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P C  +L   +RAR LV   T+ E I+  VN+A  +PRLG+P YEWW+EALHGV G+  G
Sbjct: 34  PVCDASLGHVERARALVEEFTVPEMINNTVNAAFGVPRLGLPPYEWWNEALHGV-GLSPG 92

Query: 104 I-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           + FF      ATSFP  I   ++FD  L   +G  I  EARA  NAG+A G+ +W PNIN
Sbjct: 93  VVFFEPEPAVATSFPMPINMGSAFDDALMLAMGDVISTEARAFSNAGRA-GLDYWTPNIN 151

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
            F+DPRWGRG ETPGEDPL   +Y  S V G+QG     G     L+ +A CKH+ AYDL
Sbjct: 152 PFKDPRWGRGAETPGEDPLHAARYVRSLVEGLQG-----GIDPPSLKVAAACKHWAAYDL 206

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           +NW G TRY FDA VT QDLA+ Y PPF SCV+  RA+  MC+YN VNG+P+CA   LL 
Sbjct: 207 ENWGGVTRYAFDAVVTPQDLAEYYAPPFRSCVRDARAASAMCSYNAVNGVPACASPYLLK 266

Query: 283 KTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
              R  WG     ++TSDC AV  +YD  GY +   +A    LKAG D+NCG+   ++  
Sbjct: 267 TVLRDAWGLAEDRWVTSDCGAVGNVYDPHGYTEDLVNASTVSLKAGTDLNCGTNYTQYLP 326

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A  +  + E ++  AL  L++  + LG F+  P  QP+ +I    V +P  Q LA  AA
Sbjct: 327 EAYDRGLIDEDDLKAALTRLYASLVWLGYFDA-PEDQPYRQITWADVNTPEAQALAYTAA 385

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN-SAKTLLGNYAGPSCRSITPLQALQNYV 459
               VLLKN  G LPL  S ++SLALIGP AN SA  +LGNY G     I PLQ   +  
Sbjct: 386 IKSFVLLKND-GTLPLTDS-TLSLALIGPMANASALQMLGNYFGIPPFVIAPLQGFLDAG 443

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
            N  Y  G +     + S D AV  A+ AD V+ + G+D T E EE DR ++  P  Q  
Sbjct: 444 FNVTYVLGTNVTGNDAGSFDAAVAAAEAADVVIYVGGIDNTLEMEEKDRTEISWPDNQLA 503

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           L++ + E   KP+++V + GG +D T  K    + +ILWAGYPG++G  A+A+ + G   
Sbjct: 504 LLSAL-EGVGKPLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSGGTAIADTVTGKVA 562

Query: 580 P-GEFTVIAVEM 590
           P G   V  V M
Sbjct: 563 PAGRLYVDEVAM 574


>gi|393247584|gb|EJD55091.1| beta-xylosidase [Auricularia delicata TFB-10046 SS5]
          Length = 763

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 219/540 (40%), Positives = 311/540 (57%), Gaps = 13/540 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C TT     RA+ L+   T +E ++  VN +P +PRLG+P Y+WWSEALHGVAG   G+ 
Sbjct: 37  CNTTANFMDRAKALIDEFTTEELVNNTVNGSPGVPRLGLPPYQWWSEALHGVAGANPGVH 96

Query: 106 F---NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           F         ATSFPQ IL  A+FD  L + +   I  EARA  N G + G+ F+ PNIN
Sbjct: 97  FAPAGEDFDHATSFPQPILMGAAFDDELIHEVATVISTEARAFNNFGFS-GIDFFTPNIN 155

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
            FRDPRWGRGQETPGEDPL   +Y    V  +QG    G       +  A CKHF  YDL
Sbjct: 156 PFRDPRWGRGQETPGEDPLHISRYVFQLVTALQG----GLGPSPYYKIVADCKHFAGYDL 211

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           ++W+G  R+ FDA +T QDLA+ Y P F+SCV+  +   +MC+YN VNG+P+CA   LL 
Sbjct: 212 ESWEGIDRFHFDAVITTQDLAEFYTPSFQSCVRDAKVGSVMCSYNSVNGVPACASSYLLQ 271

Query: 283 KTARRQWGF-HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
              R  +G   G+ITSDCDAV  ++    +  +  +A    LKAG DV+CG+   +    
Sbjct: 272 DIVRDFYGLGDGWITSDCDAVQNVFTTHNFTTTQANASAISLKAGTDVDCGNVYAQSLGD 331

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A+ Q  + E ++ +AL  L+   +R G F+ +P  QPF ++G   V +PA + LAL AA+
Sbjct: 332 ALDQGLVEEDDLKQALVRLYGSLVRTGYFD-SPEEQPFRQLGWADVDTPASRRLALLAAE 390

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           +GIVLLKN  GLLPL      ++ ++GP  N+   + GNY G +   ++P Q   +   N
Sbjct: 391 EGIVLLKND-GLLPLSSRDVPNVIMVGPWGNATTMMQGNYFGNAPYLVSPRQGFVDAGFN 449

Query: 462 TVYYPG-CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
             ++ G   T    ++  D+AV  A   D +V + G D   E+E  DR+++  PG Q +L
Sbjct: 450 VTFFNGTVGTNGTDTSGFDEAVAAAGDTDLIVFVGGPDNVVERESRDRINITWPGVQLDL 509

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I  +A    KP+I++ +  G VD T+ K    I +++W GYPG++G  ALA ++ G   P
Sbjct: 510 IKELA-GVGKPMIVLQMGAGQVDDTWLKESDAINALIWGGYPGQSGGTALANIVTGKTAP 568


>gi|255590044|ref|XP_002535159.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
 gi|223523880|gb|EEF27223.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
          Length = 449

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 188/255 (73%), Positives = 225/255 (88%)

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
           MDVNCG++L+ +TK+AV++KK+ ESEIDRALHNLFS+RMRLGLFNGNPT  P+G I AD 
Sbjct: 1   MDVNCGNYLKNYTKSAVEKKKVSESEIDRALHNLFSIRMRLGLFNGNPTKLPYGDISADQ 60

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           VCS  HQ +AL+AA+DGIVLLKNS+ LLPL KSK+ SLA+IGPNA+++  L+GNYAGP C
Sbjct: 61  VCSQEHQAVALEAARDGIVLLKNSNQLLPLSKSKTTSLAIIGPNADNSTILVGNYAGPPC 120

Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
           +++TP Q LQNY++ T Y+PGC TVACSSA+ID+A+ IAK AD VVL+MGLDQTQE+EE 
Sbjct: 121 KTVTPFQGLQNYIKTTKYHPGCSTVACSSAAIDQAIKIAKEADQVVLVMGLDQTQEREEH 180

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DRVDLVLPG+QQELI  VA AAKKPV+LVLLCGGPVDI+FAKYDRNIG ILWAGYPGEAG
Sbjct: 181 DRVDLVLPGKQQELIISVARAAKKPVVLVLLCGGPVDISFAKYDRNIGGILWAGYPGEAG 240

Query: 567 AVALAEVIFGDHNPG 581
            +ALAE+IFG+HNPG
Sbjct: 241 GIALAEIIFGNHNPG 255


>gi|115436902|ref|XP_001217674.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|121734342|sp|Q0CB82.1|BXLB_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|114188489|gb|EAU30189.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 765

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 234/573 (40%), Positives = 327/573 (57%), Gaps = 19/573 (3%)

Query: 13  LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
           +  C  SL T++   Q PF  D  N        C TTL    RA+ L++ +TL+EKI+  
Sbjct: 12  ILACIVSL-TQLGFAQSPFP-DCENGPLSKNAVCDTTLDPVTRAQALLAAMTLEEKINNT 69

Query: 73  VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYL 130
             ++P +PRLG+PAY WWSEALHGVAG   G+ F  +G    ATSFP  I   A+FD  L
Sbjct: 70  QYNSPGVPRLGLPAYNWWSEALHGVAG-SPGVHFADSGNFSYATSFPSPITLGAAFDDDL 128

Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
             +I   IG E RA  NAG A G+ +W PNIN +RDPRWGRGQETPGEDP  T +Y    
Sbjct: 129 VKQIATVIGTEGRAFGNAGHA-GLDYWTPNINPYRDPRWGRGQETPGEDPFHTSRYVYHL 187

Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
           + G+Q      G  K K+ A+  CKHF  YD+++W+G  RY FDA ++ QD+A+ Y PPF
Sbjct: 188 IDGLQDGI---GPEKPKIVAT--CKHFAGYDIEDWEGNERYAFDAVISDQDMAEYYFPPF 242

Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYD 307
           ++C +  +   +MC+YN VNGIP+CAD  LL    R  W + G   ++TSDC A+  IY 
Sbjct: 243 KTCTRDAKVDAVMCSYNSVNGIPTCADPWLLQTVLREHWEWEGVGHWVTSDCGAIDNIYK 302

Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
              Y      A    + AG D++CGS   +   +A+ Q  L    +DRAL  L+S  ++L
Sbjct: 303 DHKYVADGAHAAAVAVNAGTDLDCGSVYPQFLGSAISQGLLGNRTLDRALTRLYSSLVKL 362

Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALI 427
           G F+     QP+  IG   V +P  + LA  AA +G VLLKN  G LPL K+ +V  A++
Sbjct: 363 GYFD-PAADQPYRSIGWSDVATPDAEQLAHTAAVEGTVLLKND-GTLPLKKNGTV--AIV 418

Query: 428 GPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKG 487
           GP AN+   L GNY G +    T L A         Y PG    + S++  ++A++ AKG
Sbjct: 419 GPYANATTQLQGNYEGTAKYIHTMLSAAAQQGYKVKYAPGTGINSNSTSGFEQALNAAKG 478

Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
           +D V+   G+D   E E LDR  +  PG Q +LI ++++  KKP+++V   GG VD +  
Sbjct: 479 SDLVIYFGGIDHEVEAEALDRTSIAWPGNQLDLIQQLSD-LKKPLVVVQFGGGQVDDSSL 537

Query: 548 KYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             +  +  +LWAGYP +AG  A+ +++ G   P
Sbjct: 538 LSNAGVNGLLWAGYPSQAGGAAVFDILTGKTAP 570


>gi|392962219|ref|ZP_10327666.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           DSM 17108]
 gi|392452977|gb|EIW29882.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           DSM 17108]
          Length = 724

 Score =  407 bits (1046), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 234/566 (41%), Positives = 333/566 (58%), Gaps = 50/566 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           E F +   TL   QRA+DLVSR+T++EK++Q+V S+PAI RLGIPAY WWSEALHGVA  
Sbjct: 2   EIFDYQDETLSFEQRAKDLVSRMTIEEKVTQMVYSSPAISRLGIPAYNWWSEALHGVARA 61

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG------- 153
           G           AT FPQ I  AA+FD  L Y + + I +EARA ++  Q  G       
Sbjct: 62  GV----------ATVFPQAIGLAATFDEKLIYDVAEIISIEARAKFHEFQRKGDHGIYKG 111

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW+PN+NIFRDPRWGRGQET GEDP +TG+  VS+++G+QG        K  L+A+AC
Sbjct: 112 LTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQD------KKYLRAAAC 165

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KHF  +   +   + R++FDA V+ +DL +TY P F+ CVK+     +M AYNRVNG P
Sbjct: 166 AKHFAVH---SGPESERHRFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEP 222

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
            C    LL +T R++WGF G++ SDC A+   ++      S  ++V   L  G D+NCG+
Sbjct: 223 CCGSNILLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPESVALALNNGCDLNCGN 282

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
            +  +   A ++  + E  I+ A+  L   RM+LGLF+    + P+  IG        H+
Sbjct: 283 -MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFDAAENV-PYTNIGFHQNDCQEHR 340

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
             AL+ ++  +VLLKN + LLPL ++   S+A+IGPNANS + L GNY G +   IT L+
Sbjct: 341 EFALEVSKKTLVLLKNENHLLPLDRNTISSIAVIGPNANSREALTGNYFGTASNYITVLE 400

Query: 454 ALQNYV-ENTV--YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
            ++  V ++T+  Y  GC       + +        +AV  A+ AD VV+ MGLD + E 
Sbjct: 401 GIREAVGKDTMVSYAQGCHLYRDKAENLGEERDRFAEAVSTAERADLVVMCMGLDASIEG 460

Query: 504 EE---------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
           EE          D++ L LPG QQEL+  + +   KP+ILVLL G  + +T+A     + 
Sbjct: 461 EEGDVSNEYASGDKLGLNLPGLQQELLEVIYKTG-KPIILVLLAGSALAVTWAA--EKVP 517

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNP 580
           +I+ A YPG  G  ALA  IFG+++P
Sbjct: 518 AIIQAWYPGAEGGKALASAIFGEYSP 543


>gi|238508313|ref|XP_002385353.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
 gi|296439537|sp|B8NYD8.1|BXLB_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|220688872|gb|EED45224.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
          Length = 776

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 236/584 (40%), Positives = 331/584 (56%), Gaps = 21/584 (3%)

Query: 2   IFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVS 61
           + H   L+ PL    F + L   +S  P  S  P + +      C T+L    RA+ LV+
Sbjct: 1   MVHLSPLLRPLAAFSFFTSLASTESIFPDCSTGPLSKNN----VCDTSLDPVSRAKSLVA 56

Query: 62  RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN--GTIRGATSFPQV 119
            +TL+EKI+     +   PRLG+PAY WW+EALHGVA  G G+ F+  G    ATSFP  
Sbjct: 57  AMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA-EGHGVSFSDSGNFSYATSFPMP 115

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
           IL  A+FD  L  ++   I  EARA  N G A G+ +W PNIN FRDPRWGRGQETPGED
Sbjct: 116 ILLGAAFDDDLVKQVATVISTEARAFANGGHA-GLDYWTPNINPFRDPRWGRGQETPGED 174

Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
           PL   +Y    V G+Q      G    + +  A CKHF AYDL+NW+G  RY FDA V+ 
Sbjct: 175 PLHLSRYVYHLVDGLQD-----GIGPERPKVVATCKHFAAYDLENWEGIERYAFDAVVSP 229

Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH---GYIT 296
           QDL++ Y P F++C +  +   +MC+YN +NGIP+CADR LL    R  WG+     ++T
Sbjct: 230 QDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVT 289

Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
            DC A+  IY    Y      A    L AG D++CGS   ++ ++A++Q       ++ A
Sbjct: 290 GDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEYLRSALQQGLYNNQTLNNA 349

Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
           L  L+S  ++LG F+     QP+  IG + V +PA + LA +A  +GIV+LKN  G LPL
Sbjct: 350 LIRLYSSLVKLGYFDPADD-QPYRSIGWNEVFTPAAEELAHKATVEGIVMLKND-GTLPL 407

Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSA 476
             + +V  A+IGP AN+   L GNY GP     T + A  +      +  G D  + SSA
Sbjct: 408 KSNGTV--AIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQGTDINSNSSA 465

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
              +A+  AK AD V+   G+D T EKE  DR  +V PG Q +LI ++++  +KP+I+V 
Sbjct: 466 GFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD-LEKPLIVVQ 524

Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             GG VD +    +  +G++LWAGYP +AG  A+ +++ G   P
Sbjct: 525 FGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTGKSAP 568


>gi|344302281|gb|EGW32586.1| hypothetical protein SPAPADRAFT_51129 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 788

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 220/536 (41%), Positives = 322/536 (60%), Gaps = 8/536 (1%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C   LP +QRA+ +V   T+DE I+ + N++P + RLG+P Y+WWSE LHG+A       
Sbjct: 61  CNPYLPNNQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEGLHGIAR--SNFT 118

Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
            +G    ATSFPQ IL   +F+S L+ ++G  IG EARA  N G+A G+ +++PNIN F+
Sbjct: 119 ASGEYSHATSFPQPILMGGAFNSDLYKQVGNVIGTEARAFNNVGRA-GLDYYSPNINPFK 177

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRGQE   E P++ G YA++YV+G+QG   +       LQ +A CKHF  YD+++W
Sbjct: 178 DPRWGRGQEVASESPVLVGNYALNYVQGLQGG-IDSNPNDDTLQVAATCKHFAGYDMESW 236

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
           K  +R  ++A ++ QDLAD Y P F+SCV+  +A+G MC+YN +NGIP CA    L    
Sbjct: 237 KQHSRLGYNAIISDQDLADYYFPTFQSCVRDAKAAGAMCSYNAINGIPVCASEFFLGTVI 296

Query: 286 RRQWGF-HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
           R  + F +G I SDCD++  I++   Y +    A  D +KAG+DVNCG   Q +   A+ 
Sbjct: 297 REGFDFQNGVIHSDCDSLYSIWNPHLYVQDLGAAAADGIKAGVDVNCGDTYQNNLGYALG 356

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
            K + E +I  ++   +S  +RLG F+   T +      +DV  S A+Q LA QAA +GI
Sbjct: 357 NKTINEDQIRASVTRQYSNLIRLGYFDSPQTNKYRTYNWSDVSTSQANQ-LAYQAAVEGI 415

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
            LLKN  G LP  K K  ++A+IGP AN+   +LG+YAG     I+PLQ  Q+      Y
Sbjct: 416 TLLKND-GTLPFNKDKVKNVAVIGPWANATTDMLGDYAGTPPYLISPLQGAQDSGFKVQY 474

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
             G       + +   A++ AKGAD +V   G+D + E E LDR  L  PG Q +L++++
Sbjct: 475 AYGTQINTTLTTNYTAALNAAKGADAIVYFGGIDNSIENEALDRESLAWPGNQLDLVSKL 534

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +    KP+++V    G VD T  K + N+ SI++AGYPG++G  A+ +V+ G + P
Sbjct: 535 S-GLNKPLVVVQFGAGQVDDTEIKNNNNVNSIVYAGYPGQSGGTAIWDVLNGIYAP 589


>gi|442803736|ref|YP_007371885.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
           DSM 8532]
 gi|442739586|gb|AGC67275.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
           DSM 8532]
          Length = 715

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 238/564 (42%), Positives = 335/564 (59%), Gaps = 55/564 (9%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +    +RA+DLVSR+T++EK+SQ++ ++PAI RLGIPAY WW+EALHGVA  G   
Sbjct: 7   YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT-- 64

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
                   AT FPQ I  AA+FD  L Y++   I  E RA Y+A       G   G+TFW
Sbjct: 65  --------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFW 116

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           +PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+          L+A+AC KHF
Sbjct: 117 SPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNH------PKYLKAAACAKHF 170

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +   + R++F+A V+ +DL +TY P F++ V++ +   +M AYNR NG P C  
Sbjct: 171 AVH---SGPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEPCCGS 227

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIY-DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
           + LLS   R +WGF G++ SDC A+   +      A +PE A + V + G D+NCG+   
Sbjct: 228 KTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESAALAV-RNGCDLNCGNMF- 285

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVL 395
            +   A+K+  + E EIDRA+  L   RM+LG+F  +P  Q P+  I  D V    H+ L
Sbjct: 286 GNLLIALKEGLITEEEIDRAVTRLMITRMKLGMF--DPEDQVPYASISYDFVDCKEHREL 343

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           AL  A+  IVLLKN  GLLPL + K  S+A+IGPNA+S + L+GNY G +   +T L  +
Sbjct: 344 ALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLDGI 402

Query: 456 QNYVENTV---YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
           +    + V   Y  GC       + +      I +AV  A+ AD V++ +GLD T E EE
Sbjct: 403 REMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLDSTIEGEE 462

Query: 506 L---------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
           +         D+ DL LPG+QQEL+  V  A  KP++LVLL G  + +T+A  D +I +I
Sbjct: 463 MHESNIYGSGDKPDLNLPGQQQELLEAVY-ATGKPIVLVLLTGSALAVTWA--DEHIPAI 519

Query: 557 LWAGYPGEAGAVALAEVIFGDHNP 580
           L A YPG  G  A+A V+FG+ NP
Sbjct: 520 LNAWYPGALGGRAIASVLFGETNP 543


>gi|421077748|ref|ZP_15538711.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           JBW45]
 gi|392524151|gb|EIW47314.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           JBW45]
          Length = 750

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 232/566 (40%), Positives = 332/566 (58%), Gaps = 50/566 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           E F +   TL   QRA+DLVSR+TL+EK++Q+V  +PAIPRLG+PAY WWSEALHGVA  
Sbjct: 28  EIFDYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVARA 87

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG------- 153
           G           AT FPQ I  AA+FD  L + + + I +E RA ++  Q  G       
Sbjct: 88  GV----------ATVFPQAIGLAATFDEKLIHDVAEVISIEGRAKFHEFQRKGDHGIYKG 137

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW+PN+NIFRDPRWGRGQET GEDP +TG+  VS+++G+QG        K  L+A+AC
Sbjct: 138 LTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQD------KKYLRAAAC 191

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KHF  +   +   + R+ FDA V+ +DL +TY P F+ CVK+     +M AYNRVNG P
Sbjct: 192 AKHFAVH---SGPESERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRVNGEP 248

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
            C    LL +T R++WGF G++ SDC A+   ++      S  ++V   L  G D+NCG+
Sbjct: 249 CCGSNMLLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSSAPESVALALNNGCDLNCGN 308

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
            +  +   A ++  + E  I+ A+  L   RM+LGLF+    + P+  IG        H+
Sbjct: 309 -MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENV-PYTNIGFHQNDCQEHR 366

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
             AL+ ++  +VLLKN + LLPL ++   S+A+IGPNANS + L GNY G +   IT L+
Sbjct: 367 EFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYITVLE 426

Query: 454 ALQNYV-ENTV--YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
            ++  V ++T+  Y  GC       + +  +     +AV  A+ AD VV+ MGLD + E 
Sbjct: 427 GIREAVGKDTIVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIVVMCMGLDASIEG 486

Query: 504 EE---------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
           EE          D++ L LPG QQEL+  + +   KP+ILVLL G  + +T+A     + 
Sbjct: 487 EEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTG-KPIILVLLAGSALAVTWAA--EKVP 543

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNP 580
           +I+ A YPG  G  ALA  IFG+++P
Sbjct: 544 AIIQAWYPGAEGGKALASAIFGEYSP 569


>gi|317158006|ref|XP_001826724.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
          Length = 776

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 236/584 (40%), Positives = 330/584 (56%), Gaps = 21/584 (3%)

Query: 2   IFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVS 61
           + H   L+ PL    F + L   +S  P  S  P + +      C T+L    RA+ LV+
Sbjct: 1   MVHLSPLLRPLAAFSFFTSLASTESIFPDCSTGPLSKNN----VCDTSLDPVSRAKSLVA 56

Query: 62  RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN--GTIRGATSFPQV 119
            +TL+EKI+     +   PRLG+PAY WW+EALHGVA  G G+ F+  G    ATSFP  
Sbjct: 57  AMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVA-EGHGVSFSDSGNFSYATSFPMP 115

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
           IL  A+FD  L  ++   I  EARA  N G A G+ +W PNIN FRDPRWGRGQETPGED
Sbjct: 116 ILLGAAFDDDLVKQVATVISTEARAFANGGHA-GLDYWTPNINPFRDPRWGRGQETPGED 174

Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
           PL   +Y    V G+Q      G    + +  A CKHF AYDL+NW+G  RY FDA V+ 
Sbjct: 175 PLHLSRYVYHLVDGLQD-----GIGPERPKVVATCKHFAAYDLENWEGIERYAFDAVVSP 229

Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH---GYIT 296
           QDL++ Y P F++C +  +   +MC+YN +NGIP+CADR LL    R  WG+     ++T
Sbjct: 230 QDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVT 289

Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
            DC A+  IY    Y      A    L AG D++CGS   ++  +A++Q       ++ A
Sbjct: 290 GDCGAIDNIYADHHYVADGAHAAAAALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLNNA 349

Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
           L  L+S  ++LG F+     QP+  IG + V +PA + LA +A  +GIV+LKN  G LPL
Sbjct: 350 LIRLYSSLVKLGYFDPADD-QPYRSIGWNEVFTPAAEELAHKATVEGIVMLKND-GTLPL 407

Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSA 476
             + +V  A+IGP AN+   L GNY GP     T + A  +      +  G D  + SSA
Sbjct: 408 KSNGTV--AIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQGTDINSNSSA 465

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
              +A+  AK AD V+   G+D T EKE  DR  +V PG Q +LI ++++  +KP+I+V 
Sbjct: 466 GFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD-LEKPLIVVQ 524

Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             GG VD +    +  +G++LWAGYP +AG  A+ +++ G   P
Sbjct: 525 FGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTGKSAP 568


>gi|121712174|ref|XP_001273702.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
 gi|119401854|gb|EAW12276.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
          Length = 803

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 232/540 (42%), Positives = 316/540 (58%), Gaps = 17/540 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+  ++ RA+ LV  ++  EK++     AP +PRLG+PAY WWSEALHGVAG   G+ 
Sbjct: 75  CDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSEALHGVAGA-PGVH 133

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F  +G    ATSF Q IL  ASFD  L  ++   +G E RA  NAG+A G+ +W PNIN 
Sbjct: 134 FADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGRA-GLDYWTPNINP 192

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           FRDPRWGRGQETPGEDPL   +Y    V G+QG     G    + Q +A CKHF AYD++
Sbjct: 193 FRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQG-----GIGPARPQIAATCKHFAAYDME 247

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W G +R++FDARV+ QDLA+ Y P F+SCV+  +   +MC+YN +NG+P+CAD  LL  
Sbjct: 248 DWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNALNGVPTCADPYLLQT 307

Query: 284 TARRQWGFH---GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R  W +     ++ SDC A+  IY    Y K+  +A    L AG D++CG+   KH  
Sbjct: 308 LLREHWDWDQPGHWVVSDCGAIDDIYIGHNYTKTGAEAAAVALNAGTDLDCGTVFPKHLG 367

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A +Q       +DRAL  L+S  ++LG F+     QP+G IG   V +PA + LA +AA
Sbjct: 368 EAAEQGLYTNQTLDRALVRLYSSLVKLGYFD-PAEKQPYGSIGWKDVDTPAAEQLAHKAA 426

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
            +GIVLLKN    LPL K+K  +LALIGP AN+ K + GNY GP     T   A   +  
Sbjct: 427 VEGIVLLKNDQ-TLPL-KAKG-TLALIGPYANATKQMQGNYQGPPKYIRTLEWAATQHGY 483

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
              Y PG      S+A    A+  AK AD V+   G+D T E E LDR  +  PG Q  L
Sbjct: 484 QVQYSPGTAINNSSTAGFAAALAAAKDADVVLYAGGIDNTIESETLDRTTITWPGNQLSL 543

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I+ ++    KP+I++   GG VD T    + ++ ++LWAGYP + G  A+ +++ G   P
Sbjct: 544 ISELSN-LHKPLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQEGGAAIFDILTGKAAP 602


>gi|343428088|emb|CBQ71612.1| related to Beta-xylosidase [Sporisorium reilianum SRZ2]
          Length = 698

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 239/558 (42%), Positives = 315/558 (56%), Gaps = 28/558 (5%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
            T P C T+L    RA  LV++ T  E I+  VN AP +PRLGIP Y+WW+EALHGVA  
Sbjct: 30  STLPVCDTSLDFYTRATSLVAQFTTAELINNTVNHAPGVPRLGIPQYQWWTEALHGVA-R 88

Query: 101 GKGIFFNGTIRG----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTF 156
             G+ FN    G    ATSFPQVI   A+FD  L+  +   I  E RA  NAG+A G+  
Sbjct: 89  SPGVNFNPDAAGEFGCATSFPQVINLGATFDDALYEAVAAHIANETRAFSNAGRA-GLNM 147

Query: 157 WAP-NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           ++P NIN FRDPRWGRGQET GEDPL   +YAV  VRG+QG      +   +L  +A CK
Sbjct: 148 YSPLNINAFRDPRWGRGQETVGEDPLHLSRYAVRVVRGLQGPAAQ-DEANPRLTLAATCK 206

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           H+ AYDL+   G  RY+FDA V+ QDLAD + P F +CV+ G A+ +M +YN VNG+P  
Sbjct: 207 HYLAYDLEASAGVERYQFDALVSNQDLADLHLPQFRACVRDGGATTLMTSYNAVNGVPPS 266

Query: 276 ADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
           A +  L   AR  WG    H Y+TSDCDAV+ +YDA  YA     A    L AG D++CG
Sbjct: 267 ASKYYLETLARDTWGLDKHHNYVTSDCDAVANVYDAHHYAADYVHAAAASLNAGTDLDCG 326

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
           +  +    AA+ Q     + I RA+  ++   +RLG F+     QP  ++G   V +PA 
Sbjct: 327 ATYRDSLAAALAQNLTDVATIRRAVTRMYGSLVRLGYFDAA-EAQPLRQLGWKDVNAPAA 385

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           Q LA +AA   I LLKN    LPL ++   ++ALIGP  N+   L GNYAGPS   ITP 
Sbjct: 386 QKLAYEAAAASITLLKNRQSTLPLRETAGKTIALIGPYTNATFALRGNYAGPSPLVITPF 445

Query: 453 QALQNYVENTVYYPGCDTVACSSASIDKAVD---------IAKGADHVVLMMGLDQTQEK 503
            A +    +         V+ +  SI    D          AK AD +V   G+D T E 
Sbjct: 446 DAARRTFSD------AHIVSANGTSIAGPYDTATASAALATAKSADIIVYAGGIDPTVEG 499

Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
           E LDR D+  P  Q  LI  +A A  K +++V   GG VD    K D  +G+++WAGYPG
Sbjct: 500 ESLDRRDIAWPANQLRLIQELA-ALGKVLVVVQFGGGQVDGALLKGDDGVGALVWAGYPG 558

Query: 564 EAGAVALAEVIFGDHNPG 581
           ++GA+AL +++ G   P 
Sbjct: 559 QSGALALMDILAGKRAPA 576


>gi|296439595|sp|A1CCL9.2|BXLB_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
          Length = 771

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 232/540 (42%), Positives = 316/540 (58%), Gaps = 17/540 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+  ++ RA+ LV  ++  EK++     AP +PRLG+PAY WWSEALHGVAG   G+ 
Sbjct: 43  CDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWSEALHGVAGA-PGVH 101

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F  +G    ATSF Q IL  ASFD  L  ++   +G E RA  NAG+A G+ +W PNIN 
Sbjct: 102 FADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAGRA-GLDYWTPNINP 160

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           FRDPRWGRGQETPGEDPL   +Y    V G+QG     G    + Q +A CKHF AYD++
Sbjct: 161 FRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQG-----GIGPARPQIAATCKHFAAYDME 215

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W G +R++FDARV+ QDLA+ Y P F+SCV+  +   +MC+YN +NG+P+CAD  LL  
Sbjct: 216 DWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVMCSYNALNGVPTCADPYLLQT 275

Query: 284 TARRQWGFH---GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R  W +     ++ SDC A+  IY    Y K+  +A    L AG D++CG+   KH  
Sbjct: 276 LLREHWDWDQPGHWVVSDCGAIDDIYIGHNYTKTGAEAAAVALNAGTDLDCGTVFPKHLG 335

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A +Q       +DRAL  L+S  ++LG F+     QP+G IG   V +PA + LA +AA
Sbjct: 336 EAAEQGLYTNQTLDRALVRLYSSLVKLGYFD-PAEKQPYGSIGWKDVDTPAAEQLAHKAA 394

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
            +GIVLLKN    LPL K+K  +LALIGP AN+ K + GNY GP     T   A   +  
Sbjct: 395 VEGIVLLKNDQ-TLPL-KAKG-TLALIGPYANATKQMQGNYQGPPKYIRTLEWAATQHGY 451

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
              Y PG      S+A    A+  AK AD V+   G+D T E E LDR  +  PG Q  L
Sbjct: 452 QVQYSPGTAINNSSTAGFAAALAAAKDADVVLYAGGIDNTIESETLDRTTITWPGNQLSL 511

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I+ ++    KP+I++   GG VD T    + ++ ++LWAGYP + G  A+ +++ G   P
Sbjct: 512 ISELSN-LHKPLIVIQFGGGQVDDTPLLTNPHVNALLWAGYPSQEGGAAIFDILTGKAAP 570


>gi|238923424|ref|YP_002936940.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
 gi|238875099|gb|ACR74806.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
          Length = 714

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 231/554 (41%), Positives = 328/554 (59%), Gaps = 53/554 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           + A+ LVS++T+DEKISQ++  +PAI RLGIP Y WW+EALHGVA  G           A
Sbjct: 7   EYAKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------A 56

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRD 166
           T FPQ I  AA+FD+ L  +IG  +  E R  +N        G   G+TFWAPN+NIFRD
Sbjct: 57  TVFPQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRD 116

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +TGK   +Y+RG+QGD  +       L+++AC KHF  +   +  
Sbjct: 117 PRWGRGHETYGEDPYLTGKLGCAYIRGLQGDDPD------HLKSAACAKHFAVH---SGP 167

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R++FDA+ +  D+ DTY   F+ CVK  +   +M AYNRVNG P+C  R LL    R
Sbjct: 168 EAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILR 227

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQ 345
            ++GF G++ SDC A+   ++      + E++    +  G D+NCGS FL  H K A  +
Sbjct: 228 DEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFL--HLKDAYDK 285

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + +  I  A+  L  VR+RLG+    P+  P+  I  +VV    H  L+++AA+  +V
Sbjct: 286 GMVSDEAITAAVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLV 343

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV-ENT-- 462
           LLKN    LPL +    ++A+IGPNANS   L+GNY G S R ITPL+ LQ Y+ E+T  
Sbjct: 344 LLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRV 403

Query: 463 VYYPGC----DTVACSSASIDK---AVDIAKGADHVVLMMGLDQTQEKEE---------L 506
           +Y  GC    D V   +   D+   A+ +A+ +D VV+ +GLD T E EE          
Sbjct: 404 LYAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASG 463

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D++ L+LPG Q+EL+  VA A  KPVILVL  G  +D+++A  + ++ +I+ + YPG  G
Sbjct: 464 DKLGLMLPGLQEELLEAVA-AVGKPVILVLSAGSAIDLSWA--EEHVDAIIDSWYPGARG 520

Query: 567 AVALAEVIFGDHNP 580
             A+AE IFG+++P
Sbjct: 521 GKAVAEAIFGEYSP 534


>gi|291525508|emb|CBK91095.1| Beta-glucosidase-related glycosidases [Eubacterium rectale DSM
           17629]
          Length = 714

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 231/554 (41%), Positives = 328/554 (59%), Gaps = 53/554 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           + A+ LVS++T+DEKISQ++  +PAI RLGIP Y WW+EALHGVA  G           A
Sbjct: 7   EYAKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------A 56

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRD 166
           T FPQ I  AA+FD+ L  +IG  +  E R  +N        G   G+TFWAPN+NIFRD
Sbjct: 57  TVFPQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRD 116

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +TGK   +Y+RG+QGD  +       L+++AC KHF  +   +  
Sbjct: 117 PRWGRGHETYGEDPYLTGKLGCAYIRGLQGDDPD------HLKSAACAKHFAVH---SGP 167

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R++FDA+ +  D+ DTY   F+ CVK  +   +M AYNRVNG P+C  R LL    R
Sbjct: 168 EAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILR 227

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQ 345
            ++GF G++ SDC A+   ++      + E++    +  G D+NCGS FL  H K A  +
Sbjct: 228 DEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFL--HLKDAYDK 285

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + +  I  A+  L  VR+RLG+    P+  P+  I  +VV    H  L+++AA+  +V
Sbjct: 286 GLVSDEAITAAVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLV 343

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV-ENT-- 462
           LLKN    LPL +    ++A+IGPNANS   L+GNY G S R ITPL+ LQ Y+ E+T  
Sbjct: 344 LLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRV 403

Query: 463 VYYPGC----DTVACSSASIDK---AVDIAKGADHVVLMMGLDQTQEKEE---------L 506
           +Y  GC    D V   +   D+   A+ +A+ +D VV+ +GLD T E EE          
Sbjct: 404 LYAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASG 463

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D++ L+LPG Q+EL+  VA A  KPVILVL  G  +D+++A  + ++ +I+ + YPG  G
Sbjct: 464 DKLGLMLPGLQEELLEAVA-AVGKPVILVLSAGSAIDLSWA--EEHVDAIIDSWYPGARG 520

Query: 567 AVALAEVIFGDHNP 580
             A+AE IFG+++P
Sbjct: 521 GKAVAEAIFGEYSP 534


>gi|452989371|gb|EME89126.1| glycoside hydrolase family 3 protein [Pseudocercospora fijiensis
           CIRAD86]
          Length = 790

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 217/540 (40%), Positives = 307/540 (56%), Gaps = 15/540 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV-GKGI 104
           C T      RA+ L++  TL EKI+   +++P +PRLG+  YEWW EALHGVA   G   
Sbjct: 39  CDTAADPLTRAKALIAEFTLAEKINNTGSTSPGVPRLGLLPYEWWQEALHGVASSPGVNF 98

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
             +G  R ATSFPQ IL  A+FD  L + +   I  EARA  N  +A G+ FW PNIN F
Sbjct: 99  SVSGEFRYATSFPQPILMGAAFDDQLIHDVASVISTEARAFSNDDRA-GLDFWTPNINPF 157

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           +DPRWGRGQETPGEDP     Y  S +RG+QGD  +  K+       A CKHF AYD++N
Sbjct: 158 KDPRWGRGQETPGEDPYHLSSYVHSLIRGLQGDNPSYKKV------VATCKHFVAYDVEN 211

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           W G  RY+ DA +  QDL + Y PPF SC +       MC+YN +NG+P+CAD  LL   
Sbjct: 212 WNGNFRYQLDAHINSQDLVEYYMPPFRSCARDSNVGAFMCSYNSLNGVPTCADPYLLQTV 271

Query: 285 ARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
            R  W +     ++TSDCD+V  ++    YA S E+A    LKAG D+NCG++ Q+H   
Sbjct: 272 LREHWNWTAEEQWVTSDCDSVQNVFLYHNYASSREEAAAISLKAGTDINCGTYYQEHLPR 331

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A +Q  + E+++D +L   +   +RLG F+G+    P+  +  + V +P  Q LAL+AA 
Sbjct: 332 AYEQGLINETDVDTSLIRQYGSLIRLGYFDGD--RVPYRNLTWNDVSTPYAQDLALKAAT 389

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
            GI LLKN  G+LPL  +    +ALIG  AN+   +LGNY G      +PL A Q     
Sbjct: 390 SGITLLKND-GILPLQITNGTKIALIGDWANATDQMLGNYHGIPPYFHSPLWAAQQTGAE 448

Query: 462 TVYYPGCDTVACSSASIDKAV-DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
             Y  G    +  +    + +   A  +D ++ + G+D+  E EE DRV +   G Q ++
Sbjct: 449 VTYVQGPGGQSDPTTYTWRPIWSAANKSDVIIYIGGMDERVEAEEKDRVSIAWSGPQLDV 508

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I ++A+   KP I+V + GG +D +    + NI ++LW GYPG+ G  A+ +++ G   P
Sbjct: 509 IGQLADYYDKPTIVVQMGGGSLDSSPLVKNPNIRALLWGGYPGQDGGKAIFDILQGISAP 568


>gi|359473427|ref|XP_002265788.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Vitis vinifera]
          Length = 464

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 199/442 (45%), Positives = 286/442 (64%), Gaps = 12/442 (2%)

Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGG 202
           +YN G A G+TFW+PNIN+ RD RWGR QET  EDP + G++AV+YVRG+Q    T N  
Sbjct: 1   MYNLGHA-GLTFWSPNINVVRDTRWGRTQETSREDPFMVGEFAVNYVRGLQDVEGTENVT 59

Query: 203 KLKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
            L  + L+ S+CCKH+ AYD+D+W    R+ FDARV+ QD+ +T+  PFE CV++G  S 
Sbjct: 60  DLNSRPLKVSSCCKHYAAYDIDSWLNIDRHTFDARVSEQDMKETFVSPFERCVREGDVSS 119

Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK-SPEDAVV 320
           +MC++N++NGIP C+D  LL    R +W  HGYI SDC  + +I D + Y   S  DAV 
Sbjct: 120 VMCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVA 179

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             L+AG+D+ CG +        V   K+ + E+DRAL N++ + MR+G F+G P  +   
Sbjct: 180 KTLQAGLDLECGHYYTDALNELVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYE--- 236

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
            +G   +C+  H  LA +AA+ GIVLLKN + + PL   K   LAL+GP+AN+ + ++GN
Sbjct: 237 SLGLKDICAADHIELAREAARQGIVLLKNDYEVFPLKPGKK--LALVGPHANATEVMIGN 294

Query: 441 YAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQ 499
           YAG   + ++PL+A  + + N  Y  GC   +CS+ +   +A + AK A+  ++ +G D 
Sbjct: 295 YAGLPRKYVSPLEAF-SAIGNVTYTTGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDL 353

Query: 500 TQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
           + E E +DRVD +LPG Q ELI +VAE +  PVILV+L G  +DITFAK +  I +ILW 
Sbjct: 354 SIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWV 413

Query: 560 GYPGEAGAVALAEVIFGDHNPG 581
           G+PGE G  A+A+V+FG +NPG
Sbjct: 414 GFPGEQGGHAIADVVFGKYNPG 435


>gi|330836687|ref|YP_004411328.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
 gi|329748590|gb|AEC01946.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
          Length = 709

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 235/553 (42%), Positives = 324/553 (58%), Gaps = 56/553 (10%)

Query: 56  ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
           AR +VSR+TLDEKISQ+   A AIPRL IP Y WW+EALHGVA  G           AT 
Sbjct: 14  ARRIVSRMTLDEKISQIDYRASAIPRLDIPEYNWWNEALHGVARAGI----------ATV 63

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI---------GMTFWAPNINIFRD 166
           FPQ I  AA FDS +  RIG  I  E RA YN  +A+         G+TFW+PN+NIFRD
Sbjct: 64  FPQAIGLAAMFDSDMMERIGAVISTEGRAKYN--EAVRHGDRDIYKGLTFWSPNVNIFRD 121

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRGQET GEDP +T + AV+++RG+QGD    GK    L+A+AC KHF  +   +  
Sbjct: 122 PRWGRGQETYGEDPYLTARLAVAFIRGIQGD----GKY---LKAAACAKHFAVH---SGP 171

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R++FDARV+ +DL +TY   F++ VK+ +   +M AYNRVNG+P+CA   LLS   R
Sbjct: 172 EALRHEFDARVSQKDLHETYLSAFKAAVKEAQVEIVMGAYNRVNGVPACASHELLSDILR 231

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
            +WGF G++ SD +A+  I+    Y       +   LKAG ++ C   + +H +++V + 
Sbjct: 232 SEWGFEGHVVSDYEALEDIFKHHHYVADEAHTMAVALKAGCNL-CAGKIARHLRSSVDEG 290

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            + E EI  A+  LF+ R+ +G+   +    P+  IG +   +P H  LA++AA    VL
Sbjct: 291 LISEDEITEAVERLFTTRIMMGMMADDC---PYDSIGYEENDTPEHHQLAVEAASRSFVL 347

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
           LKN  GLLPL   K  S+A+IGPNANS K L GNY G + R +T L+ +Q+ V ++V   
Sbjct: 348 LKND-GLLPLEMEKISSIAVIGPNANSRKMLEGNYNGTASRYVTVLEGIQDLVGDSVRVW 406

Query: 464 YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
           Y  GC        +++  +  + +AV  A+ AD VVL +GLD T E EE          D
Sbjct: 407 YSEGCHLYKNFHSSLSGRNDRLAEAVSAAQHADVVVLCLGLDATLEGEEGDVEVGFGSGD 466

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           + +L LPGRQQ L+  +     KPVIL+L  G  + +   + D N+ +IL   YPG  G 
Sbjct: 467 KPNLSLPGRQQLLLDTMLTVG-KPVILLLASGSALTLGGRENDENLKAILQIWYPGAMGG 525

Query: 568 VALAEVIFGDHNP 580
            A+A+V+FG   P
Sbjct: 526 KAVADVLFGRRAP 538


>gi|291528382|emb|CBK93968.1| Beta-glucosidase-related glycosidases [Eubacterium rectale M104/1]
          Length = 714

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 229/554 (41%), Positives = 326/554 (58%), Gaps = 53/554 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           + A+ LVS++T+DEKISQ++  +PAI RLGIP Y WW+EALHGVA  G           A
Sbjct: 7   EYAKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------A 56

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRD 166
           T FPQ I  AA+FD+ L  +IG  +  E R  +N        G   G+TFWAPN+NIFRD
Sbjct: 57  TVFPQAIGLAAAFDADLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRD 116

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +TGK   +Y+RG+QGD  +       L+++AC KHF  +   +  
Sbjct: 117 PRWGRGHETYGEDPYLTGKLGCAYIRGLQGDDPD------HLKSAACAKHFAVH---SGP 167

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R++FDA+ +  D+ DTY   F+ CVK  +   +M AYNRVNG P+C  R LL    R
Sbjct: 168 EAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDILR 227

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQ 345
            ++GF G++ SDC A+   ++      + E++    +  G D+NCGS FL  H K A  +
Sbjct: 228 DEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFL--HLKDAYDK 285

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + +  I  A+  L  VR+RLG+    P+  P+  I  +VV    H  L+++AA+  +V
Sbjct: 286 GLVSDEAITAAVERLMEVRIRLGMMKDYPS--PYEDISYEVVECKEHVELSVEAARRSLV 343

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN---T 462
           LLKN    LPL +    ++A+IGPNANS   L+GNY G S R ITPL+ LQ Y+ +    
Sbjct: 344 LLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGDDTRV 403

Query: 463 VYYPGC----DTVACSSASIDK---AVDIAKGADHVVLMMGLDQTQEKEE---------L 506
           +Y  GC    D V   +   D+   A+ +A+ +D VV+ +GLD T E EE          
Sbjct: 404 LYAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASG 463

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D++ L+LPG Q+EL+  VA A  KPVILVL  G  +D+++A  + ++ +I+ + YPG  G
Sbjct: 464 DKLGLMLPGLQEELLEAVA-AVGKPVILVLSAGSAIDLSWA--EEHVDAIIDSWYPGARG 520

Query: 567 AVALAEVIFGDHNP 580
             A+AE IFG+++P
Sbjct: 521 GKAVAEAIFGEYSP 534


>gi|347531439|ref|YP_004838202.1| beta-glucosidase [Roseburia hominis A2-183]
 gi|345501587|gb|AEN96270.1| beta-glucosidase [Roseburia hominis A2-183]
          Length = 716

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 234/559 (41%), Positives = 322/559 (57%), Gaps = 51/559 (9%)

Query: 49  TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNG 108
           +L   + A+ LV ++TL+EKISQ+   +PAI RL IPAY WW+EALHGVA  G       
Sbjct: 2   SLETKEYAKRLVEQMTLEEKISQMRYESPAIERLHIPAYNWWNEALHGVARSGV------ 55

Query: 109 TIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNI 161
               AT FPQ I  AA+FD  L  +IG  +  E RA + A       G   G+TFWAPNI
Sbjct: 56  ----ATMFPQAIALAATFDEELIEKIGDVVSTEGRAKFEAYSGRGDRGIYKGLTFWAPNI 111

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           NIFRDPRWGRG ET GEDP +T K   +Y+RG+QG      K    L+A+AC KHF  + 
Sbjct: 112 NIFRDPRWGRGHETYGEDPCLTAKLGCAYIRGIQG------KDPDHLKAAACAKHFAVH- 164

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             +     R++FDA+V++ DL DTY   F+ CVK      +M AYNRVNG P+C  + LL
Sbjct: 165 --SGPEALRHEFDAKVSLHDLYDTYLYAFKRCVKDAGVEAVMGAYNRVNGEPACGSKTLL 222

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
               R Q+GF G++ SDC A+   ++     K+ E++    +  G D+NCG +FL  +  
Sbjct: 223 QDILREQFGFEGHVVSDCWAILDFHEHHHVTKTVEESAAMAVNHGCDLNCGKAFL--YLS 280

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A +Q  + E  I  A+  L  VR+RLG+    P+  P+  I  DVV  P H  L+L+A+
Sbjct: 281 RACEQGLVEEKTITEAVERLMDVRIRLGMMEDYPS--PYANIPYDVVECPEHIALSLEAS 338

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV- 459
           +  +VLLKN +  LPL + +  ++A+IGPNANS   L+GNY G S R ITPL+ +Q Y  
Sbjct: 339 KRSMVLLKNDNHFLPLKQEQVHTIAVIGPNANSRAALVGNYEGTSSRYITPLEGIQEYTG 398

Query: 460 ENT--VYYPGC----DTVACSSASIDK---AVDIAKGADHVVLMMGLDQTQEKEE----- 505
           E T  +Y  GC    D V       D+   A+  A+ AD +V+ +GLD   E EE     
Sbjct: 399 EKTRVLYAQGCHLYKDQVEFLGEPKDRFKEALIAAERADVIVMCLGLDAGIEGEEGDAGN 458

Query: 506 ----LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
                D++ L LPG QQEL+  VA A  KP++L +L G  +D+++A+    I +IL   Y
Sbjct: 459 EYASGDKLGLKLPGLQQELLEAVA-AVGKPIVLTVLAGSALDLSWAQEHAQIRAILDCWY 517

Query: 562 PGEAGAVALAEVIFGDHNP 580
           PG  G  A+AE +FG+ +P
Sbjct: 518 PGARGGKAIAEALFGEFSP 536


>gi|121797681|sp|Q2TYT2.1|BXLB_ASPOR RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|83775471|dbj|BAE65591.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 797

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 232/569 (40%), Positives = 324/569 (56%), Gaps = 21/569 (3%)

Query: 17  FTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA 76
           F  + T  +S  P  S  P + +      C T+L    RA+ LV+ +TL+EKI+     +
Sbjct: 37  FNWVSTSTESIFPDCSTGPLSKNN----VCDTSLDPVSRAKSLVAAMTLEEKINNTKYDS 92

Query: 77  PAIPRLGIPAYEWWSEALHGVAGVGKGIFFN--GTIRGATSFPQVILTAASFDSYLWYRI 134
              PRLG+PAY WW+EALHGVA  G G+ F+  G    ATSFP  IL  A+FD  L  ++
Sbjct: 93  SGAPRLGLPAYNWWNEALHGVA-EGHGVSFSDSGNFSYATSFPMPILLGAAFDDDLVKQV 151

Query: 135 GQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
              I  EARA  N G A G+ +W PNIN FRDPRWGRGQETPGEDPL   +Y    V G+
Sbjct: 152 ATVISTEARAFANGGHA-GLDYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGL 210

Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
           Q      G    + +  A CKHF AYDL+NW+G  RY FDA V+ QDL++ Y P F++C 
Sbjct: 211 QD-----GIGPERPKVVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCT 265

Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH---GYITSDCDAVSIIYDAEGY 311
           +  +   +MC+YN +NGIP+CADR LL    R  WG+     ++T DC A+  IY    Y
Sbjct: 266 RDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHY 325

Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
                 A    L AG D++CGS   ++  +A++Q       ++ AL  L+S  ++LG F+
Sbjct: 326 VADGAHAAAAALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFD 385

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
                QP+  IG + V +PA + LA +A  +GIV+LKN  G LPL  + +V  A+IGP A
Sbjct: 386 -PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVMLKND-GTLPLKSNGTV--AIIGPFA 441

Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHV 491
           N+   L GNY GP     T + A  +      +  G D  + SSA   +A+  AK AD V
Sbjct: 442 NATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQGTDINSNSSAGFAEAISAAKEADTV 501

Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
           +   G+D T EKE  DR  +V PG Q +LI ++++  +KP+I+V   GG VD +    + 
Sbjct: 502 IYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD-LEKPLIVVQFGGGQVDDSSLLANA 560

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +G++LWAGYP +AG  A+ +++ G   P
Sbjct: 561 GVGALLWAGYPSQAGGAAVFDILTGKSAP 589


>gi|291537442|emb|CBL10554.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
           M50/1]
          Length = 710

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 222/553 (40%), Positives = 322/553 (58%), Gaps = 50/553 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           +RA +LV ++TL+EK++Q +  APA+ RL I AY WW+EALHGVA  G           A
Sbjct: 15  KRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT----------A 64

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
           T FPQ I  AA+FD  L  ++G A+  EARA +N  Q         G+TFWAPN+NIFRD
Sbjct: 65  TVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRD 124

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T +  V Y+ G+QG   N       L+A+AC KHF  +   +  
Sbjct: 125 PRWGRGHETFGEDPYLTSRLGVRYIEGLQGHDEN------YLKAAACAKHFAVH---SGP 175

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R++FDA VT QDL +TY P FE+CVK+G+   +M AYNR NG+P C ++ LL    R
Sbjct: 176 EAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLLIDILR 235

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
           ++WGF G++TSDC A+   ++      +  ++V   +  G D+NCG+      + AV+Q 
Sbjct: 236 KEWGFSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMNNGCDLNCGTLFGFLVQ-AVRQG 294

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            + E  +D A+ NLF  RM+LG+F+      P+ KI      S   + L    A+  +VL
Sbjct: 295 LVKEERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPYLAADSREMKKLNEAVARRTVVL 353

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
           LKN   +LPL K+K  ++ +IGPNA+S + L+GNY G + R IT L+ +++YV + V   
Sbjct: 354 LKNKEHILPLDKNKIKTIGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGDDVRVL 413

Query: 464 YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
           Y  GC         +A  +  + + + + K +D VV ++GLD   E EE          D
Sbjct: 414 YSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNEYGSGD 473

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           + DL LPG Q+E++   A +  KPVILVLL G  + + +A  D ++ +I+   YPG  G 
Sbjct: 474 KPDLNLPGLQEEIL-EAAVSCGKPVILVLLSGSALAVNWA--DEHVDAIVQGWYPGARGG 530

Query: 568 VALAEVIFGDHNP 580
            A+A+++FG+ NP
Sbjct: 531 AAIADILFGEANP 543


>gi|240146254|ref|ZP_04744855.1| beta-glucosidase [Roseburia intestinalis L1-82]
 gi|257201613|gb|EEU99897.1| beta-glucosidase [Roseburia intestinalis L1-82]
 gi|291539969|emb|CBL13080.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
           XB6B4]
          Length = 710

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 222/553 (40%), Positives = 322/553 (58%), Gaps = 50/553 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           +RA +LV ++TL+EK++Q +  APA+ RL I AY WW+EALHGVA  G           A
Sbjct: 15  KRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT----------A 64

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
           T FPQ I  AA+FD  L  ++G A+  EARA +N  Q         G+TFWAPN+NIFRD
Sbjct: 65  TVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRD 124

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T +  V Y+ G+QG   N       L+A+AC KHF  +   +  
Sbjct: 125 PRWGRGHETFGEDPYLTSRLGVRYIEGLQGHDEN------YLKAAACAKHFAVH---SGP 175

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R++FDA VT QDL +TY P FE+CVK+G+   +M AYNR NG+P C ++ LL    R
Sbjct: 176 EAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLLIDILR 235

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
           ++WGF G++TSDC A+   ++      +  ++V   +  G D+NCG+      + AV+Q 
Sbjct: 236 KEWGFSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMNNGCDLNCGTLFGFLVQ-AVRQG 294

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            + E  +D A+ NLF  RM+LG+F+      P+ KI      S   + L    A+  +VL
Sbjct: 295 LVKEERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPYLAADSREMKKLNEAVARRTVVL 353

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
           LKN   +LPL K+K  ++ +IGPNA+S + L+GNY G + R IT L+ +++YV + V   
Sbjct: 354 LKNKEHILPLDKNKIKTVGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGDDVRVL 413

Query: 464 YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
           Y  GC         +A  +  + + + + K +D VV ++GLD   E EE          D
Sbjct: 414 YSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNEYGSGD 473

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           + DL LPG Q+E++   A +  KPVILVLL G  + + +A  D ++ +I+   YPG  G 
Sbjct: 474 KPDLNLPGLQEEIL-EAAVSCGKPVILVLLSGSALAVNWA--DEHVDAIVQGWYPGARGG 530

Query: 568 VALAEVIFGDHNP 580
            A+A+++FG+ NP
Sbjct: 531 AAIADILFGEANP 543


>gi|156062754|ref|XP_001597299.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980]
 gi|154696829|gb|EDN96567.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 758

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 222/538 (41%), Positives = 308/538 (57%), Gaps = 13/538 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C TT     RA  LVS  TL EKI+   N++P +PR+G+PAY+WW+EALHG+A       
Sbjct: 34  CDTTADPYTRATALVSLFTLAEKINNTGNTSPGVPRIGLPAYQWWNEALHGIAYGTHFAA 93

Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
                  ATSFPQ IL  A+FD  L + +   I  EARA  NA +  G+ FW PNIN ++
Sbjct: 94  AGSNYSYATSFPQPILMGAAFDDALIHDVASQISTEARAFSNANR-YGLNFWTPNINPYK 152

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRGQETPGEDP     Y  + V G+QG   +    KG     A CKH+  YDL+N 
Sbjct: 153 DPRWGRGQETPGEDPFHVSSYVNALVTGLQGGLDDLPYKKGV----ATCKHYAGYDLENG 208

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
            G  RY FDA +  QDL D Y P F+ C +      IMC+YN VNG+P+CAD  LL    
Sbjct: 209 GGIQRYAFDAIINSQDLRDYYLPSFQQCARDSNVQSIMCSYNAVNGVPTCADDWLLQSLL 268

Query: 286 RRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
           R  WG+     ++TSDCDAV  I+D+  Y  +PE A  D L AG D++CG F   +  +A
Sbjct: 269 REHWGWVEEDQWVTSDCDAVQNIWDSHNYTSTPEQAAADALNAGTDLDCGGFWPTYLGSA 328

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
             Q     S +DR+L   ++  +RLG F+   ++QP+ ++G   V +P+ + LALQAA+D
Sbjct: 329 YNQSLYNISTLDRSLTRRYASLVRLGYFDP-ASIQPYRQLGWSDVSTPSAEQLALQAAED 387

Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
           GIVLLKN  G+LPLP S   ++ALIGP AN+   + GNY G +    +PL A QN   + 
Sbjct: 388 GIVLLKND-GILPLP-SNITNVALIGPWANATTQMQGNYYGQAPYLHSPLIAAQNAGFHV 445

Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
            Y  G D  + ++     A+  AK AD ++ + G+D + E E  DR  +  P  Q  L+ 
Sbjct: 446 TYVQGADIDSTNTTEFTAAIAAAKKADVIIYIGGIDNSIEAEAKDRKTIAWPSSQISLVN 505

Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           ++A  +  P+I+  + G  +D +    +R +  I+WAGYPG+ G  A+  ++ G   P
Sbjct: 506 QLANLS-IPLIISQM-GTMIDSSSLLTNRGVNGIIWAGYPGQDGGTAIFNILTGKTAP 561


>gi|388857998|emb|CCF48443.1| related to Beta-xylosidase [Ustilago hordei]
          Length = 782

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 223/566 (39%), Positives = 324/566 (57%), Gaps = 20/566 (3%)

Query: 32  SCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWS 91
           SC P +P ++    C  T+P   RA  LV++ T +E ++  +N AP +PRLGIP Y+WW+
Sbjct: 23  SCAPDSPLSKIPDICDPTIPFYTRATSLVNQFTTEELLNNTINYAPGVPRLGIPNYQWWT 82

Query: 92  EALHGVAGVGKGIFFN-----GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           EALHGVA    G+ F+          AT FPQ I   A+FD  L+ +I   I  E RA  
Sbjct: 83  EALHGVAK-SPGVNFDLSDPHAEFTSATQFPQTINLGATFDDDLYQQIASVIASEVRAYN 141

Query: 147 NAGQAIGMTFWAP-NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
           NAG+A G+  ++P NIN FRDPRWGRGQET GEDPL   ++AVS V G+QG         
Sbjct: 142 NAGKA-GLNLYSPLNINCFRDPRWGRGQETVGEDPLHMSRFAVSIVHGLQGPHAQNEAEG 200

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
            KL  +A CKHF AYDL+ +    RY+FDA V+ QDL+D + P F +CV+ G A+ +M +
Sbjct: 201 NKLTVAATCKHFLAYDLEQYDRGERYQFDAIVSKQDLSDFHLPQFRACVRDGGATTLMTS 260

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           YN VN +P  A +  L   AR+ WG    H Y+TSDCDAV+ +YD   YA++  +A    
Sbjct: 261 YNAVNNVPPSASKYYLQTLARQAWGLDKTHNYVTSDCDAVANVYDGHRYAQNYVEAAAKS 320

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           + AG D++CG+   ++  AA+KQK    + I RA+  +++  +RLG F+ +P  QP  ++
Sbjct: 321 INAGTDLDCGATYSENLGAALKQKLTDIATIRRAVIRMYASLVRLGYFD-DPASQPLRQL 379

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
               V SP+ Q LA  +A   I LLKN    LP+ K K   +A+IGP  N + +  GNYA
Sbjct: 380 TWKDVNSPSSQRLAYTSALSSITLLKNLDSTLPI-KQKPTKIAIIGPYTNVSTSFSGNYA 438

Query: 443 GPSCRSITPLQALQNYVENT--VYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQ 499
           GP+  ++T + A      +   V+  G D       S    AV +   AD VV   G+D 
Sbjct: 439 GPAAFNMTMVHAASQVFPDAKIVWVNGTDISGPYIPSDAQDAVKLTSDADSVVFAGGIDA 498

Query: 500 TQEKEELDRVDLVLPGRQQELITRVAEAAKK----PVILVLLCGGPVDITFAKYDRNIGS 555
           + E+E  DR D+  P  Q  LI  ++++ KK     +++V   GG +D    K D  +G+
Sbjct: 499 SIERESHDRKDIAWPPNQLRLIHELSQSRKKDKKSKLVVVQFGGGQLDGASLKSDDAVGA 558

Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPG 581
           ++WAGYPG++ ++A+ +++ G   P 
Sbjct: 559 LVWAGYPGQSASLAVWDILAGKAVPA 584


>gi|391864313|gb|EIT73609.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 797

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 232/569 (40%), Positives = 323/569 (56%), Gaps = 21/569 (3%)

Query: 17  FTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA 76
           F  + T  +S  P  S  P + +      C T+L    RA+ LV+ +TL+EKI+     +
Sbjct: 37  FNWVSTSTESIFPDCSTGPLSKNN----VCDTSLDPVSRAKSLVAAMTLEEKINNTKYDS 92

Query: 77  PAIPRLGIPAYEWWSEALHGVAGVGKGIFFN--GTIRGATSFPQVILTAASFDSYLWYRI 134
              PRLG+PAY WW+EALHGVA  G G+ F+  G    ATSFP  IL  A+FD  L  ++
Sbjct: 93  SGAPRLGLPAYNWWNEALHGVA-EGHGVSFSDSGNFSYATSFPMPILLGAAFDDDLVKQV 151

Query: 135 GQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
              I  EARA  N G A G+ +W PNIN FRDPRWGRGQETPGEDPL   +Y    V G+
Sbjct: 152 ATVISTEARAFANGGHA-GLDYWTPNINPFRDPRWGRGQETPGEDPLHLSRYVYHLVDGL 210

Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
           Q      G    + +  A CKHF AYDL+NW+G  RY FDA V+ QDL++ Y P F++C 
Sbjct: 211 QD-----GIGPERPKVVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCT 265

Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH---GYITSDCDAVSIIYDAEGY 311
           +  +   +MC+YN +NGIP+CADR LL    R  WG+     ++T DC A+  IY    Y
Sbjct: 266 RDAKVDAVMCSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHY 325

Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
                 A    L AG D++CGS   ++  +A++Q       +  AL  L+S  ++LG F+
Sbjct: 326 VADGAHAAAAALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLYNALIRLYSSLVKLGYFD 385

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
                QP+  IG + V +PA + LA +A  +GIV+LKN  G LPL  + +V  A+IGP A
Sbjct: 386 -PADDQPYRSIGWNEVFTPAAEELAHKATVEGIVMLKND-GTLPLKSNGTV--AIIGPFA 441

Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHV 491
           N+   L GNY GP     T + A  +      +  G D  + SSA   +A+  AK AD V
Sbjct: 442 NATTQLQGNYEGPPKYIRTLIWAAVHNGYKVKFSQGTDINSNSSAGFAEAISAAKEADTV 501

Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
           +   G+D T EKE  DR  +V PG Q +LI ++++  +KP+I+V   GG VD +    + 
Sbjct: 502 IYAGGIDNTIEKESQDRTTIVWPGNQLDLIEQLSD-LEKPLIVVQFGGGQVDDSSLLANA 560

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +G++LWAGYP +AG  A+ +++ G   P
Sbjct: 561 GVGALLWAGYPSQAGGAAVFDILTGKSAP 589


>gi|302683060|ref|XP_003031211.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
 gi|300104903|gb|EFI96308.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
          Length = 761

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 236/541 (43%), Positives = 318/541 (58%), Gaps = 17/541 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+L   +RAR LV  LT+ E I+  V++AP +PRLG+P Y WW+EALHGVA    G+ 
Sbjct: 35  CDTSLGHVERARALVEELTVAEMINNTVHTAPGVPRLGLPPYNWWNEALHGVA-ASPGVV 93

Query: 106 FNG---TIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           F         ATSFP  I   ++FD  L   +G     EARA  NAG A G+ +W PNIN
Sbjct: 94  FTSPGEEFSSATSFPMPINMGSAFDDALMLAVGNVTSTEARAFNNAGLA-GLDYWTPNIN 152

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
            F+DPRWGRG ETPGEDPL   +Y  + V G+QG     G     L+ +A CKH+ AYDL
Sbjct: 153 PFKDPRWGRGAETPGEDPLHAARYVRTLVEGLQG-----GIDPPSLKVAADCKHWAAYDL 207

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           ++W G  RY FDA VT QDLA+ Y PPF+SCV+  RA+ +MC+YN VNG+P+CA   LL 
Sbjct: 208 EDWGGVARYAFDAVVTPQDLAEYYSPPFKSCVRDARAASVMCSYNAVNGVPACASPYLLK 267

Query: 283 KTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
              R  WG     ++TSDCDAV  +YD  GY +   +     LKAG D++CG+   ++  
Sbjct: 268 TVLRDAWGLAEDRWVTSDCDAVGNVYDPHGYTEDFVNGSAVSLKAGSDLDCGTTYSQYLP 327

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A  +  + E ++  AL  L++  + LG F+  P  QP+ +I    V +PA Q LA  AA
Sbjct: 328 EAYDRGLIDEDDLKAALTRLYASLVWLGYFDA-PEDQPYRQISWADVNTPAAQALAYTAA 386

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN-SAKTLLGNYAGPSCRSITPLQALQNYV 459
            +  VLLKN  G LPL  S S+S+ALIGP AN SA  L GNY G    +I PLQ   +  
Sbjct: 387 IESFVLLKN-DGTLPLTDS-SLSIALIGPMANASAVQLQGNYNGIPPFAIAPLQGFLDAG 444

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
            N  Y  G +     +  ID AV  A+ AD V+ + G+D T E+E  DR ++  P  Q  
Sbjct: 445 FNVTYVLGTNVTGNDADDIDGAVAAAEAADVVIYVGGIDSTVEEEAKDRTEISWPDNQLA 504

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           L++ + EA  KP+++V + GG +D T  K    + +ILWAGYPG++G  A+A+ + G   
Sbjct: 505 LLSALEEAG-KPLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSGGTAIADTVMGKVA 563

Query: 580 P 580
           P
Sbjct: 564 P 564


>gi|403412992|emb|CCL99692.1| predicted protein [Fibroporia radiculosa]
          Length = 760

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 225/543 (41%), Positives = 319/543 (58%), Gaps = 22/543 (4%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+     RA  L+   TL+EKI+   N++P +PRLG+PAY+WW EALHGVA    G+ 
Sbjct: 34  CDTSASPVARATALIGLFTLEEKINNTGNTSPGVPRLGLPAYQWWQEALHGVA-ESPGVI 92

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F   G    ATSFPQ IL  A+FD  L  ++   +  EARA  NA ++ G+ FW PNIN 
Sbjct: 93  FAETGEYSYATSFPQPILMGAAFDDELINQVATIVSTEARAFNNANRS-GLDFWTPNINP 151

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRGQETPGEDP     Y  + + G+QG     G      +  A CKH+  YDL+
Sbjct: 152 FKDPRWGRGQETPGEDPFHLQSYVYNLITGLQG-----GLDPEYKRIVATCKHYAGYDLE 206

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           NW+G  RY FDA +++QDL++ Y   FE+C +       MC+YN VNG+PSCA+  LL  
Sbjct: 207 NWEGNVRYGFDALISIQDLSEFYTRSFETCARDANVGAFMCSYNAVNGVPSCANSYLLQD 266

Query: 284 TARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R  W +     +ITSDCDA+  IY+   YA + E  V D L AG D++CG++  ++  
Sbjct: 267 ILRGHWNWTSDDQWITSDCDAIQNIYEPHYYAPTRELTVADALNAGADLDCGTYYPENLG 326

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
           AA  +    ES +DRAL   ++  ++LG F+     QP+ +IG   V +P  + LA +AA
Sbjct: 327 AAYDEGLFAESTLDRALIRQYASLVKLGYFDPAEN-QPYRQIGWANVSTPEAEELAYRAA 385

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ--NY 458
            +GI L+KN  G LPL  S   SLALIGP AN+   + GNY G     I+PL A +  NY
Sbjct: 386 VEGITLIKND-GTLPLSPSIK-SLALIGPWANATTQMQGNYYGQPPYLISPLMAAEALNY 443

Query: 459 VENTVYY-PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
              TVYY PG      +++S   A   A+ AD ++ + G+D T E E +DR  L  PG Q
Sbjct: 444 ---TVYYSPGPGVDDPTTSSFPAAFAAAQAADAIIYIGGIDTTVEAEAMDRYTLDWPGVQ 500

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
            + I ++++   KP++++ + GG VD +    + N+ +++W GYPG++G  AL ++I G+
Sbjct: 501 PDFIDQLSQFG-KPLVVLQMGGGQVDDSCLLPNTNVNALIWGGYPGQSGGTALMDIIVGN 559

Query: 578 HNP 580
             P
Sbjct: 560 AAP 562


>gi|392570764|gb|EIW63936.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
           SS1]
          Length = 781

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 230/540 (42%), Positives = 320/540 (59%), Gaps = 15/540 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C  T     RA  L+S  T +E  +  VN++P +PRLG+PAY WWSE LHGVA    G+ 
Sbjct: 41  CDVTKDPITRATALISIWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVA-QSPGVT 99

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F  +G    ATSFPQ IL  A+FD  L   I   +  E RA  NAG+A G+ +W PNIN 
Sbjct: 100 FAPSGNFSYATSFPQPILMGAAFDDPLIQAIATIVSTEGRAFNNAGRA-GLDYWTPNINP 158

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRGQETPGEDP    +Y  + + G+QG    G   K   +  A CKHF AYD+D
Sbjct: 159 FKDPRWGRGQETPGEDPFHLSQYVYNLILGLQG----GLDPKPYFKVVADCKHFAAYDMD 214

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           NW+G  RY F+A V+ QDL++ Y PPF++CV+  + + +MC+YN VNGIPSCA+  LL  
Sbjct: 215 NWEGVVRYGFNAVVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQD 274

Query: 284 TARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R  WGF    ++TSDCDAV  I+    Y   P  A  D L AG D++CG+F   +   
Sbjct: 275 VLRDHWGFTDDRWVTSDCDAVQNIFTPHNYTTDPAQAAADALLAGTDIDCGTFSSTYLPE 334

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A+++  +  +++ RA    ++  +RLG F+ +P  QP+ ++G   V +   Q LA  AA 
Sbjct: 335 ALQRGLVNSTDLRRAAIRQYASLVRLGYFD-DPAAQPYRQLGWSDVNTLQAQQLAHTAAV 393

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           +G+VLLKN  GLLPL K +   LALIGP AN+ + L GNY G +   ++P+Q  Q     
Sbjct: 394 EGMVLLKND-GLLPLSK-RVRKLALIGPWANATRLLQGNYFGIAPYLVSPVQGAQQAGFE 451

Query: 462 TVYYPGCD-TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
             Y  G + T    ++    AV  AK AD VV   GLD+T E+EE+DR+++  PG Q +L
Sbjct: 452 VEYVFGTNVTTRNDTSGFAAAVAAAKRADAVVFAGGLDETVEREEIDRLNVTWPGNQLDL 511

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  + E   KP+I+    GG +D T  K  + + +I+W GYPG++G  AL +++ G   P
Sbjct: 512 VAEL-ERVGKPLIVAQFGGGQLDNTALKRSKAVNAIIWGGYPGQSGGTALFDILTGKAAP 570


>gi|409041356|gb|EKM50841.1| glycoside hydrolase family 3 protein [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 764

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 223/539 (41%), Positives = 316/539 (58%), Gaps = 14/539 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C  +   + RA  LV  LTL+E ++  VN++P +PRLG+P Y WWSEALHGVA +  G  
Sbjct: 38  CNPSADPTSRANALVDALTLEELVNNTVNASPGVPRLGLPPYNWWSEALHGVA-LSPGTN 96

Query: 106 FN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           F+        ATSFPQ I+  A+FD  L   I   I  EARA  NAG+A G+ F+ PNIN
Sbjct: 97  FSVPGSPFSSATSFPQPIILGATFDDDLVTSIATVISTEARAFNNAGRA-GLDFFTPNIN 155

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
            F+DPRWGRGQETPGEDP    +Y    V G+QG    G       +  A CKHF  YDL
Sbjct: 156 PFKDPRWGRGQETPGEDPFHIAQYVYQLVTGLQG----GLSPDPYYKVIADCKHFAGYDL 211

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           +NW+G +R  F+A ++ QDLA+ Y P F+SCV+      +MC+YN VNGIPSCA+  LL 
Sbjct: 212 ENWEGNSRMAFNAIISTQDLAEYYTPSFQSCVRDAHVGSVMCSYNAVNGIPSCANSYLLQ 271

Query: 283 KTARRQWGF-HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
              R  +G   G+ITSDCDAV+ I+    Y  +  +A    LKAG DV+CG+   +    
Sbjct: 272 DIIRGHFGLGDGWITSDCDAVANIFSPHQYTTTLVNASAVALKAGTDVDCGTTYSQTLVD 331

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           AV Q  + E +I  ++  L+   +RLG F+ +P  QPF ++G   V +P+ Q LAL AA+
Sbjct: 332 AVDQNLVTEDDIKNSMIRLYRSLVRLGYFD-SPAEQPFRQLGWSDVNTPSSQALALTAAE 390

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           +G+ LLKN  G LPL  S    +AL+GP AN+   + GNY G +   ++PLQALQ+    
Sbjct: 391 EGVTLLKN-DGTLPL-SSAIKRIALVGPWANATTQMQGNYQGIAPFLVSPLQALQDAGFQ 448

Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
             +  G    +   +    AV   + AD V+   G+D+T E E  DR  +  PG Q +L+
Sbjct: 449 VTFANGTAINSTDDSGFAAAVSAVQVADAVIYAGGIDETIESEGNDREIITWPGNQLDLV 508

Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +++A A  KP +++ + GG VD +  K ++ + +++W GYPG++G  A+  ++ G   P
Sbjct: 509 SQLA-AVGKPFVVLQMGGGQVDSSSLKSNKAVNALIWGGYPGQSGGAAIVNILTGKIAP 566


>gi|23304843|emb|CAD48309.1| beta-xylosidase B [Clostridium stercorarium]
          Length = 715

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 235/564 (41%), Positives = 333/564 (59%), Gaps = 55/564 (9%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +    +RA+DLVSR+T++EK+SQ++ ++PAI RLGIPAY WW+EALHGVA  G   
Sbjct: 7   YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT-- 64

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
                   AT FPQ I  AA+FD  L Y++   I  E RA Y+A       G   G+TFW
Sbjct: 65  --------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFW 116

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           +PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+          L+A   CK+ 
Sbjct: 117 SPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNH------PKYLKAGGMCKNI 170

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             + +     + R++F+A V+ +DL +TY P F++ V++ +   +M AYNR NG P C  
Sbjct: 171 LPFTV--VPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEPCCGS 228

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIY-DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
           + LLS   R +WGF G++ SDC A+   +      A +PE A + V + G D+NCG+   
Sbjct: 229 KTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESAALAV-RNGCDLNCGNMF- 286

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVL 395
            +   A+K+  + E EIDRA+  L   RM+LG+F  +P  Q P+  I + V C   H+ L
Sbjct: 287 GNLLIALKEGLITEEEIDRAVTRLMITRMKLGMF--DPEDQVPYASISSFVDCK-EHREL 343

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           AL  A+  IVLLKN  GLLPL + K  S+A+IGPNA+S + L+GNY G +   +T L  +
Sbjct: 344 ALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLDGI 402

Query: 456 QNYVENTV---YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
           +    + V   Y  GC       + +      I +AV  A+ AD V++ +GLD T E EE
Sbjct: 403 REMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLDSTIEGEE 462

Query: 506 L---------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
           +         D+ DL LPG+QQEL+  V  A  KP++LVLL G  + +T+A  D +I +I
Sbjct: 463 MHESNIYGSGDKPDLNLPGQQQELLEAVY-ATGKPIVLVLLTGSALAVTWA--DEHIPAI 519

Query: 557 LWAGYPGEAGAVALAEVIFGDHNP 580
           L A YPG  G  A+A V+FG+ NP
Sbjct: 520 LNAWYPGALGGRAIASVLFGETNP 543


>gi|413925161|gb|AFW65093.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 323

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 182/297 (61%), Positives = 226/297 (76%), Gaps = 6/297 (2%)

Query: 29  PPFSCDPSNP-STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
           PPFSC PS+  ++E   FC  TL  +QRA DLVSRLT  EKI+QL + AP +PRLG+P Y
Sbjct: 29  PPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLGVPGY 88

Query: 88  EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
           +WW+EALHG+A  GKG+ F+  +R ATSFPQV+LTAA+FD  LW RIGQAIG EARAL+N
Sbjct: 89  KWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREARALFN 148

Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
            GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V  +YAV++VRG+QG++ +       
Sbjct: 149 VGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSS-----SL 203

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           LQ SACCKH TAYDL++W G  RY F ARVT QDL DT+ PPF SCV + +AS +MCAY 
Sbjct: 204 LQTSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKASCVMCAYT 263

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
            +NG+P+CA+ +LL+ T R  WG  GY+ SDCDAV+I+ DA+ YA +PEDAV   LK
Sbjct: 264 AINGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRYAPTPEDAVAVSLK 320


>gi|119473971|ref|XP_001258861.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
 gi|292495290|sp|A1DJS5.1|XYND_NEOFI RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|119407014|gb|EAW16964.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
          Length = 771

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 229/540 (42%), Positives = 316/540 (58%), Gaps = 17/540 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+L ++ RAR LV+ +T +EK++    ++P +PRLG+PAY WWSEALHGVAG   G+ 
Sbjct: 43  CDTSLDVTTRARSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVAG-SPGVE 101

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F  +G    ATSFPQ IL  A+FD  L  ++   +  E RA  NAG+A G+ FW PNIN 
Sbjct: 102 FADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRA-GLDFWTPNINP 160

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           FRD RWGRGQETPGEDPL   +Y    V G+Q      G      +  A CKHF AYDL+
Sbjct: 161 FRDARWGRGQETPGEDPLHVSRYVYHLVDGLQN-----GIGPANPKVVATCKHFAAYDLE 215

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W G  R+ F+A V+ QDL++ Y PPF+SC +  +   +MC+YN +NG+P+CAD  LL  
Sbjct: 216 DWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDAKVDAVMCSYNALNGVPACADSYLLQT 275

Query: 284 TARRQWGFH---GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R  W +     +IT DC A+  IY+   Y K+P +A    L AG D++CG+   K+  
Sbjct: 276 ILREHWKWDEPGHWITGDCGAIDDIYNGHNYTKTPAEAAATALNAGTDLDCGTVFPKYLG 335

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A  +       +D+AL  L+S  ++LG F+     QP+  IG   V SPA + LA +AA
Sbjct: 336 QAADEGLYTNKTLDKALVRLYSSLVKLGYFDPAED-QPYRSIGWKDVDSPAAEALAHKAA 394

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
            +GIVLLKN    LPL K+K  +LALIGP AN+ K + GNY GP     T L A      
Sbjct: 395 VEGIVLLKNDK-TLPL-KAKG-TLALIGPYANATKQMQGNYEGPPKYIRTLLWAATQAGY 451

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
           +  Y  G    A S+A  D A+  AK AD VV   G+D T E E  DR  +V PG Q +L
Sbjct: 452 DVKYVAGTAINANSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGHDRTTIVWPGNQLDL 511

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I ++++   KP+++V   GG VD +    + ++ ++LW GYP + G  A+ +++ G   P
Sbjct: 512 IDQLSKIG-KPLVVVQFGGGQVDDSSLLSNPHVNALLWTGYPSQEGGSAIFDILTGKTAP 570


>gi|189203341|ref|XP_001938006.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187985105|gb|EDU50593.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 761

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 220/539 (40%), Positives = 315/539 (58%), Gaps = 17/539 (3%)

Query: 48  TTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN 107
           T+ P   RA+ LV+  TL+EKI+   + AP +PRLG+P Y+WWSE LHG+AG       +
Sbjct: 3   TSRPPLARAQSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWSEGLHGIAGPYTNFSDS 62

Query: 108 GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDP 167
           G    +TSFPQ IL  A+FD  L   + + I  EARA  NA +  G+ FW PNIN FRDP
Sbjct: 63  GEWSYSTSFPQPILMGAAFDDDLITDVAKVISTEARAFNNANR-TGLDFWTPNINPFRDP 121

Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           RWGRGQETPGED      Y  + + G+QG++ +  K     +  A CKHF  YD+++W G
Sbjct: 122 RWGRGQETPGEDAYHLSSYVQALIHGLQGESTDPYK-----RVVATCKHFAGYDVEDWNG 176

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             RY+ D ++T Q+L + Y  PF++CV Q      MC+YN VNG P CAD  LL    R 
Sbjct: 177 NLRYQNDVQITQQELVEYYLAPFQACV-QANVGAFMCSYNAVNGAPPCADPYLLQTILRE 235

Query: 288 QWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            WG+     ++T DCDAV  +Y    ++ +   A  D L AG DV CG+++Q+H  AA +
Sbjct: 236 HWGWTNEEQWVTGDCDAVQNVYLPHQWSPTRAGAAADSLVAGTDVTCGTYMQEHLPAAFQ 295

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           QK L ES +D+AL   +S  +RLG F+ +   QP+ ++G D V + A Q LA +AA +GI
Sbjct: 296 QKLLNESSLDQALIRQYSSLVRLGYFDASEN-QPYRQLGFDAVATNASQALARRAAAEGI 354

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
           VLLKN  G LPL    SV++ L G  AN+   LLGNYAG +    +PL AL+      + 
Sbjct: 355 VLLKND-GTLPLSLDSSVTVGLFGDWANATSQLLGNYAGVATYLHSPLYALEQ-TGVKIN 412

Query: 465 YPGCDTVACSSASIDKAVDIAKG---ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
           Y G +       + ++  ++      +D ++ + G+D + E+E  DR  L   G Q ++I
Sbjct: 413 YAGGNPGGQGDPTTNRWSNLYGAYSTSDVLIYVGGIDNSVEEEGRDRGYLTWTGAQLDVI 472

Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            ++A+   KPVI+V+  GG +D +    + NI +I+WAGYPG+ G  A+ ++I G   P
Sbjct: 473 GQLADTG-KPVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDIIGGKTAP 530


>gi|395334835|gb|EJF67211.1| beta-xylosidase [Dichomitus squalens LYAD-421 SS1]
          Length = 774

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 228/540 (42%), Positives = 315/540 (58%), Gaps = 15/540 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+     RA  L+   T +E  +  VN++P +PRLG+PAY WWSE LHGVA    G+ 
Sbjct: 41  CDTSKDPITRATALIDLWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVA-QSPGVT 99

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F  +G    ATSFPQ IL  A+FD  L   +   +  E RA  N G+A G+ +W PNIN 
Sbjct: 100 FAPSGNFSYATSFPQPILMGAAFDDPLIQAVASVVSTEGRAFNNVGRA-GLDYWTPNINP 158

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRGQETPGEDP     Y  + + G+QG    G       +  A CKHF AYD+D
Sbjct: 159 FKDPRWGRGQETPGEDPFHLQGYVYNLILGLQG----GLDPTPYFKVVADCKHFAAYDMD 214

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           NW+G  RY F+A VT QDL++ Y P F++CV+  + + +MC+YN VNGIPSCA+  LL  
Sbjct: 215 NWEGNVRYGFNAVVTQQDLSEYYLPSFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQD 274

Query: 284 TARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R  WGF    ++TSDCDAV  IY    Y  +P  A  D L AG D++CG+F   +   
Sbjct: 275 ILRDYWGFDDTRWVTSDCDAVQNIYTPHNYTDNPAQAAADALLAGTDIDCGTFSSTYLPD 334

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A+ Q  +  +++ RA    ++  +RLG F+  P  QP+ ++G   V +P  Q LA  AA 
Sbjct: 335 ALSQGLVNATDLKRAAIRQYASLVRLGYFD-PPESQPYRQLGWSDVNTPEAQQLAHTAAV 393

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           +G+VLLKN  G LPL K     LALIGP AN+   + GNYAG +   I+PL   Q    +
Sbjct: 394 EGMVLLKND-GTLPLSKHVR-KLALIGPWANATTLMQGNYAGIAPYLISPLLGAQQAGFD 451

Query: 462 TVYYPGCD-TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
             Y  G + T    ++    AV  AK AD V+   GLD+T E+EE+DR+++  PG Q +L
Sbjct: 452 VEYVFGTNVTTTNDTSGFAAAVAAAKRADAVIFAGGLDETVEREEVDRLNVTWPGNQLDL 511

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  +A    KP+I+    GG +D +  K  R++ +I+W GYPG++G  AL +++ G   P
Sbjct: 512 VAELASVG-KPLIVAQFGGGQLDDSALKSKRSVNAIIWGGYPGQSGGTALFDILTGKAAP 570


>gi|288870210|ref|ZP_06113312.2| beta-glucosidase [Clostridium hathewayi DSM 13479]
 gi|288868024|gb|EFD00323.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
          Length = 730

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 221/553 (39%), Positives = 314/553 (56%), Gaps = 49/553 (8%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           ++A  LV ++TL+EK+ Q +N APAI RLGI AY WW+E LHGVA  G           A
Sbjct: 23  EKAEYLVKQMTLEEKVFQTMNQAPAIERLGIKAYNWWNEGLHGVARAGV----------A 72

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNINIFRD 166
           T FPQ I  AA+FD  L   +G+A+  EARA Y+  Q  G       +T WAPNINIFRD
Sbjct: 73  TIFPQAIGLAATFDEDLIETVGEAVSTEARAKYHMQQRYGDTDIYKGLTLWAPNINIFRD 132

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T +  + Y+RG+QG        +  L+ +AC KHF  +   +  
Sbjct: 133 PRWGRGHETYGEDPWLTSRLGIRYIRGLQGSH------EKYLKTAACVKHFAVH---SGP 183

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R+ FDA V+ +DL +TY P FE+CVK G    +M AYNRVNG+P C +  LL    R
Sbjct: 184 EELRHSFDAEVSEKDLRETYLPAFEACVKDGDVEAVMGAYNRVNGVPCCGNEYLLETILR 243

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
           ++WGFHG++ SDC A+   ++  G   SP ++V   +  G D+NCG+    +   AVK+ 
Sbjct: 244 KEWGFHGHVVSDCWAIKDFHEGHGVTDSPVESVSMAMNHGCDLNCGNLFT-YLIQAVKEG 302

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
           K+ E  +D A+  LF+ R++LG         P+  I    V SPA + L   AA   +VL
Sbjct: 303 KVKEERLDEAVIRLFTTRLKLGALGKMEEDDPYAGISYLEVDSPAMKKLNRSAAGKSVVL 362

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTV 463
           LKN+ GLLP+   +  ++ +IGPNA+S + L+GNY G +   +T L+ ++   E     +
Sbjct: 363 LKNTEGLLPIDTKRYKTIGVIGPNADSRRALVGNYEGTASEYVTVLEGIREAAEPEARVL 422

Query: 464 YYPGCDTVACSSASIDKAVD-------IAKGADHVVLMMGLDQTQEKEE---------LD 507
           Y  GC     + + +    D       I + +D V+  MGLD T E E+          D
Sbjct: 423 YSEGCHLYKSNVSGLGARNDRLSEVKGICRESDIVIACMGLDSTLEGEQGDTGNIYAGGD 482

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           + DL+LPG QQ+++   A  + KPV+LVLL G  + +T+A  D ++ +IL A YPG  G 
Sbjct: 483 KPDLMLPGLQQKIL-ETAYDSGKPVVLVLLAGSAMAVTWA--DEHLPAILTAWYPGAEGG 539

Query: 568 VALAEVIFGDHNP 580
             +A+V+FG  NP
Sbjct: 540 RGVADVLFGTVNP 552


>gi|366163035|ref|ZP_09462790.1| glycoside hydrolase family 3 [Acetivibrio cellulolyticus CD2]
          Length = 705

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 224/553 (40%), Positives = 319/553 (57%), Gaps = 55/553 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           ++A +LV+++TL+EK SQL  ++PAI RLGIPAY WW+EALHGVA  G           A
Sbjct: 9   KKAEELVAQMTLEEKASQLTYNSPAIERLGIPAYNWWNEALHGVARAGT----------A 58

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
           T FPQ I  AA FD     +I  AI +EARA YN            G+T W+PNINIFRD
Sbjct: 59  TVFPQAIGLAAMFDDEFLMKIANAIAIEARAKYNESSKHGDRDIYKGLTIWSPNINIFRD 118

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP ++GK  V++++G+QGD       K  +  +AC KHF AY   +  
Sbjct: 119 PRWGRGHETYGEDPFLSGKLGVAFIKGLQGD-------KDVMMTAACVKHFAAY---SGP 168

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R+ F+A VT +DL +TY P FE+CVK  +   +M  YNR NG P C    LL    R
Sbjct: 169 EDLRHGFNAEVTKKDLWETYLPAFETCVKDAKVEAVMGGYNRTNGEPCCGSYTLLRDILR 228

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
            +WGF G++ SDC A+   +      K+PE++V   + AG D+NCG+ +      A+++ 
Sbjct: 229 EKWGFEGHVVSDCWAIKDFHTDHMVTKTPEESVALAIDAGCDLNCGN-MYLMLLIALQEG 287

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            + E  I RA   +F+ R +LGLF G+     F  I  +VV    H+ +A++AA+   VL
Sbjct: 288 LITEEHITRAAVRIFTTRFKLGLFEGS----EFDNIPYEVVECSEHKEMAIEAARKSAVL 343

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
           LKN  G+LP+ K    ++ +IGPNANS   L GNY G S R IT L+ +Q+ V + V   
Sbjct: 344 LKND-GILPINKGAIKTIGVIGPNANSRIALKGNYHGTSSRYITLLEGIQDEVGDEVRVL 402

Query: 464 YYPGCDTV-------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL---------D 507
           Y  GC+ V       A ++  + +AV +A+ +D VVL +GLD+T E E+          D
Sbjct: 403 YSNGCELVKDRTEVLAYANDRLAEAVTVAEHSDLVVLCLGLDETIEGEQSDEGNNGGSGD 462

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           + DL LP  Q+ L+ ++  A  KP +L L+ G  +++++A    N   IL   YPG  G 
Sbjct: 463 KKDLDLPEVQKSLLEKIV-ATGKPTVLCLMAGSAINLSYAHEHCN--GILLTWYPGARGG 519

Query: 568 VALAEVIFGDHNP 580
            A+A+++FG+ +P
Sbjct: 520 KAVADILFGNASP 532


>gi|297039776|gb|ADH95739.1| beta-xylosidase [Aspergillus fumigatus]
          Length = 771

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 225/540 (41%), Positives = 313/540 (57%), Gaps = 17/540 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+L ++ RA+ LV+ +T +EK++    ++P +PRLG+PAY WWSEALHGVAG   G+ 
Sbjct: 43  CDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVAG-SPGVE 101

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F  +G    ATSFPQ IL  A+FD  L  ++   +  E RA  NAG++ G+ FW PNIN 
Sbjct: 102 FADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRS-GLDFWTPNINP 160

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           FRD RWGRGQETPGEDPL   +Y    V G+Q      G      +  A CKHF AY L+
Sbjct: 161 FRDARWGRGQETPGEDPLHVSRYVYHLVDGLQN-----GIGPANPKVVATCKHFAAYGLE 215

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W G  R+ F+A V+ QDL++ Y PPF+SC +  R   +MC+YN +NG+P+CAD  LL  
Sbjct: 216 DWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNALNGVPACADSYLLQT 275

Query: 284 TARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R  W +     +ITSDC A+  IY+   +  +P +A    L AG D++CG+   K+  
Sbjct: 276 ILREHWKWDEPGRWITSDCGAIDDIYNGHNFTTTPAEAAATALNAGTDLDCGTVFPKYLG 335

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A  +       +DRAL  L+S  ++LG F+     QP+  IG   V +PA + LA +AA
Sbjct: 336 QAADEGLYSNQTLDRALVRLYSSFVKLGYFDPAED-QPYRSIGWTDVDTPAVEALAHKAA 394

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
            +GIVLLKN    LPL K+K  +LALIGP AN+ K + GNY GP+    T L A      
Sbjct: 395 GEGIVLLKNDK-TLPL-KAKG-TLALIGPYANATKQMQGNYEGPAKYIRTLLWAATQAGY 451

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
           +  Y  G      S+A  D A+  AK AD VV   G+D T E E  DR  +  PG Q  L
Sbjct: 452 DVKYAAGTAINTNSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGRDRTTIAWPGNQVNL 511

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I ++++   KP+++V   GG VD +    +  + ++LWAGYP + G  A+ +++ G   P
Sbjct: 512 IDQLSKIG-KPLVVVQFGGGQVDDSSLLSNPRVNALLWAGYPSQEGGSAIFDILTGKTAP 570


>gi|451992719|gb|EMD85198.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
           C5]
          Length = 781

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 218/544 (40%), Positives = 311/544 (57%), Gaps = 22/544 (4%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
            C  +     RA+ LV+  TL+EKI+   NSAP + RLG+P Y+WW+E LHG+AG     
Sbjct: 36  ICDPSASTLARAKSLVALYTLEEKINATSNSAPGVARLGVPPYQWWNEGLHGIAGPFTSF 95

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
              G    +TSFPQ IL  A+FD  L   + + I  EARA  NA +  G+ FW PNIN F
Sbjct: 96  AKQGDYSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANRT-GLDFWTPNINPF 154

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRGQETPGED      Y  + + G+QG+  +  +     +  A CKH+  YD++N
Sbjct: 155 RDPRWGRGQETPGEDSYHLSSYVKALIHGLQGNATDPYR-----RVVATCKHYAGYDIEN 209

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           W G  RY+ D +++ QDL + Y  PFE+CV Q      MC+YN VNG P CAD  LL   
Sbjct: 210 WNGNLRYQNDVQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYLLQTV 268

Query: 285 ARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
            R  WG+     ++TSDCDA+  +Y    ++ + E A  D L AG D++CG++LQ H   
Sbjct: 269 LREHWGWSSDDHWVTSDCDAIQNVYLPHQWSSTREGAAADSLNAGTDLDCGTYLQTHLPG 328

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           AVKQ    E+ +D+AL   +S  ++LG F+  P  QP+ ++G D V + A Q LAL+AA+
Sbjct: 329 AVKQGLTDETTLDKALIRQYSSLIKLGYFDA-PENQPYRQLGFDAVATSASQALALKAAE 387

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           +GIVLLKN  G+LP+    S  + + G  AN+   L GNY G +    +PL ALQN   +
Sbjct: 388 EGIVLLKND-GVLPI-NLGSKQVGIYGDWANATSQLQGNYFGVAKFLTSPLMALQNLGVD 445

Query: 462 TVY---YPG--CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
             Y    PG   D    + +S+   +     +D  + + G+D   E E+ DR  L L G 
Sbjct: 446 VKYAGNLPGGQGDPTTGAWSSLSGVITT---SDVHIWVGGIDNGVESEDRDRSWLTLTGG 502

Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           Q ++I ++A+   KPVI+V++ GG +D +    +  I ++LWAGYPG+ G  A+  ++ G
Sbjct: 503 QLDVIGQLADTG-KPVIVVIMGGGQIDTSPLIRNPKISAVLWAGYPGQDGGTAIVNILTG 561

Query: 577 DHNP 580
              P
Sbjct: 562 KAAP 565


>gi|410628680|ref|ZP_11339398.1| beta-glucosidase [Glaciecola mesophila KMM 241]
 gi|410151684|dbj|GAC26167.1| beta-glucosidase [Glaciecola mesophila KMM 241]
          Length = 732

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 224/550 (40%), Positives = 315/550 (57%), Gaps = 42/550 (7%)

Query: 50  LPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGT 109
           L    RA+ LV+ +T+DEKI+QL +S PAIPRL +P Y WW+EALHG+A  GK       
Sbjct: 35  LSFETRAQALVNAMTIDEKITQLSHSTPAIPRLEVPQYNWWNEALHGIARNGK------- 87

Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIGMTFWAPNIN 162
              AT FPQ I   A+FD  L   +  AI  EARA Y       N GQ  G+TFW PN+N
Sbjct: 88  ---ATIFPQAIGLGATFDPELAQEVANAISDEARAKYAIAQSIGNQGQYAGLTFWTPNVN 144

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
           IFRDPRWGRGQET GEDPL+T +   ++V+G+QGD          L+++   KHF  +  
Sbjct: 145 IFRDPRWGRGQETYGEDPLLTSQMGTAFVKGLQGDD------PKYLKSAGVAKHFAVH-- 196

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
            +   + R++FD   + +DL +TY P FE+ V Q + +G+MCAYN V G PSCA   LL 
Sbjct: 197 -SGPESLRHQFDVEPSKKDLYETYLPAFEALVTQAKVAGVMCAYNGVYGQPSCASEFLLG 255

Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
           +  +++W F+GY+ SDC A+   +       +  ++    L+AG+D+NCG   +K  KAA
Sbjct: 256 EMLKKKWQFNGYVVSDCGALHDFHSGHKVTHNRVESAALALRAGVDLNCGFTYEKSLKAA 315

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
            ++  + +S ID+ L NL  +R RLGLF+ +  + P   IG +V+ S  H  LA + A  
Sbjct: 316 FEEGLITQSLIDQRLKNLLMIRFRLGLFDPS-ELNPHNAIGQEVIHSLEHIELARKVAAK 374

Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--- 459
            IVLLKN   +LPL K   V   + GP A S+  L+GNY G S   +T L+ +   V   
Sbjct: 375 SIVLLKNEKQVLPLSKDIKVPY-VTGPFAASSDMLMGNYYGISDSLVTVLEGIAGKVSLG 433

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL---------DRVD 510
            +  Y  G      +   ++ A ++AK AD V+ ++G+    E EE+         DRV 
Sbjct: 434 SSLNYRAGALPFHSNINPLNWAPEVAKTADAVIAVVGISADMEGEEVDAIASADRGDRVA 493

Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
           + LP  Q + + ++AE  K P+ILV+  G PVDI+  + D    +ILW  YPGE G  A+
Sbjct: 494 ITLPQNQVDYVKQLAENKKGPLILVVAAGSPVDIS--ELDPLADAILWIWYPGEQGGNAV 551

Query: 571 AEVIFGDHNP 580
           A+VIFGD NP
Sbjct: 552 ADVIFGDTNP 561


>gi|330934749|ref|XP_003304687.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
 gi|311318569|gb|EFQ87188.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
          Length = 798

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 231/591 (39%), Positives = 327/591 (55%), Gaps = 35/591 (5%)

Query: 1   MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFS----CDPSNPSTETFPFCKTTLPISQRA 56
           M+  + SL   L      +  T  D T  P      CDPS              P++ RA
Sbjct: 1   MLTSRRSLPALLFATGAYAQFTFPDCTNGPLKNVTICDPS------------ASPLA-RA 47

Query: 57  RDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG-ATS 115
           + LV+  TL+EKI+   + AP +PRLG+P Y+WW+E LHG+AG       +G     +TS
Sbjct: 48  KSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWNEGLHGIAGPYTNFSHSGVEWSYSTS 107

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQET 175
           FPQ IL  A+FD  L   + + I  EARA  NA +  G+ FW PNIN FRDPRWGRGQET
Sbjct: 108 FPQPILMGAAFDDDLITEVAKVISTEARAFNNANR-TGLDFWTPNINPFRDPRWGRGQET 166

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
           PGED      Y  + + G+QG+  +  K     +  A CKHF  YD+++W G  RY+ D 
Sbjct: 167 PGEDAYHLSSYVQALIHGLQGEATDPYK-----RVVATCKHFAGYDVEDWNGNLRYQNDV 221

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG-- 293
           ++T QDL + Y  PF++CV Q      MC+YN VNG P CAD  LL    R  WG++   
Sbjct: 222 QITQQDLVEYYLAPFQACV-QANVGAFMCSYNAVNGAPPCADPYLLQTILREHWGWNKEE 280

Query: 294 -YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESE 352
            ++T DCDAV  +Y    ++ +   A  D L AG D+ CG+++Q+H  AA +QK L ES 
Sbjct: 281 QWVTGDCDAVQNVYFPHQWSSTRAGAAADSLVAGTDITCGTYMQEHLPAAFRQKLLNESS 340

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
           +D AL   +S  +RLG F+  P  QP+ ++G D V + A Q LA +AA +GIVLLKN  G
Sbjct: 341 LDLALIRQYSSLVRLGYFDA-PENQPYRQLGFDAVATNASQALARRAAAEGIVLLKND-G 398

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVA 472
            LPL    S+++ L G  AN+   LLGNYAG +    +PL AL+      + Y G     
Sbjct: 399 TLPLSLDSSMTVGLFGDWANATTQLLGNYAGVATYLHSPLYALKQ-TGVKINYAGGKPGG 457

Query: 473 CSSASIDKAVDIAKG---ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAK 529
               + ++  ++      +D ++ + G+D   E+E  DR  L   G Q ++I ++AE   
Sbjct: 458 QGDPTTNRWSNLYGAYSTSDVLIYVGGIDNGVEEEGHDRGYLTWTGPQLDVIGQLAETG- 516

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           KPVI+V+  GG +D +    + NI +I+WAGYPG+ G  A+ ++I G   P
Sbjct: 517 KPVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDIISGKTAP 567


>gi|261368518|ref|ZP_05981401.1| beta-glucosidase [Subdoligranulum variabile DSM 15176]
 gi|282569400|gb|EFB74935.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Subdoligranulum variabile DSM 15176]
          Length = 717

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 225/553 (40%), Positives = 316/553 (57%), Gaps = 50/553 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           +RAR LV+++TL EKISQ+++ APAIPRLGIPAY WW+E +HGV   G           A
Sbjct: 13  ERARALVAQMTLKEKISQMLSWAPAIPRLGIPAYNWWNEGIHGVGRAGT----------A 62

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
           T FPQ I  AASFD  L  ++G+A+G+EAR  YN  ++        G+T WAPN+NIFRD
Sbjct: 63  TVFPQAIGLAASFDEDLLGQVGEAVGVEARGKYNMYRSYQDRDIYKGLTIWAPNVNIFRD 122

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T +  V +V G+QGD  +       L+A+AC KHF  +   +  
Sbjct: 123 PRWGRGHETYGEDPYLTSRLGVRFVEGMQGDDPD------YLRAAACAKHFAVH---SGP 173

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R+ FDA+V+ QDL +TY P F + VK+     +M AYNR NG P C  + LL    R
Sbjct: 174 EDQRHYFDAKVSQQDLWETYLPAFRALVKEAGVEAVMGAYNRTNGEPCCGSKTLLVDILR 233

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
            +W F G++TSDC A+   ++       P D+V   +  G D+NCG  L  + + AV + 
Sbjct: 234 GKWNFQGHVTSDCWAIKDFHEGHMVTSGPVDSVALAVNNGCDLNCGD-LYAYLEEAVAEG 292

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
           K+ E  IDR+L  LF+ RM+LG+F+    + P+ KIG D V S   Q L L+ A+  +VL
Sbjct: 293 KVKEETIDRSLVRLFTTRMKLGMFDAEEKV-PYNKIGYDAVDSREMQALNLEVAEKILVL 351

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
           LKN +  LPL KSK   +A++GPNA++ K L+GNY G + R +T L  +Q Y+   V   
Sbjct: 352 LKNENHTLPLDKSKLHRVAVVGPNADNRKALVGNYEGTASRYVTVLDGIQEYLGEDVQVR 411

Query: 464 YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
           Y  GC         +A S+  I +   +    D V+  +GLD   E EE          D
Sbjct: 412 YSEGCHLYADKIQGLAKSNELISEVRGVCAECDVVICCLGLDAGLEGEEGDQGNQFASGD 471

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           +  L LPG Q+ ++    E+  KPV++V+L G  + +  A+      ++L A YPG  G 
Sbjct: 472 KQSLSLPGNQESVLKACIESG-KPVVVVVLSGSALALGTAQ--EGAAAVLQAWYPGAQGG 528

Query: 568 VALAEVIFGDHNP 580
            A+A  +FG+ NP
Sbjct: 529 RAVARALFGECNP 541


>gi|313202830|ref|YP_004041487.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312442146|gb|ADQ78502.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 742

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 233/557 (41%), Positives = 322/557 (57%), Gaps = 43/557 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +PF  T+  I +R +DLVSRLTLDEK  Q++++APAI RLGI  Y WW+EALHGVA  G+
Sbjct: 38  YPFQDTSKTIDERVKDLVSRLTLDEKAGQMLHNAPAIKRLGILPYSWWNEALHGVARTGR 97

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMT 155
                     AT FP+ +  AA+FD  L YRIGQAI  EA A YN        GQ  G+T
Sbjct: 98  ----------ATVFPENVGLAATFDEDLVYRIGQAISDEAWAKYNIAQRLENYGQYSGIT 147

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           F+APN+NIFRDPRWGRGQET GEDP +T +  V+YV+G+QG+          L+ +AC K
Sbjct: 148 FYAPNVNIFRDPRWGRGQETYGEDPFLTSRMGVAYVKGMQGND------PKYLKTAACAK 201

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           H+  +   +     R+ +DA   M+D  +TY P FE+ VK+G+   +MCAYNR  G P C
Sbjct: 202 HYVVH---SGPEALRHSYDAEPPMKDFMETYVPAFETLVKEGKVESVMCAYNRTFGKPCC 258

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
               LL    R +WGF GY+T+DC A+   Y   G AK   +A    +K+G+++NCG+  
Sbjct: 259 GSSFLLHDLLREKWGFTGYVTTDCWAIQNFYLHHGAAKDSLEACALAIKSGVNLNCGNEF 318

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
             +  AAV++  + E E+D AL  L   R RLGLF+ +P   P+ KI  +V+ S  +  L
Sbjct: 319 N-YLPAAVRKGLVTEKEVDEALSQLLRTRFRLGLFD-SPNENPYAKIKEEVIGSQQNIDL 376

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           A +AA   +VLL+N +  LPL K    SL ++GP A +   LLGNY G + R  T +QA+
Sbjct: 377 AYEAAAKSLVLLQNKNNTLPLKKDMK-SLYVVGPYAANQDILLGNYNGVNSRLTTIMQAI 435

Query: 456 QNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE------- 505
              V    +  Y  G +  A +  S++ ++  A  AD VV + G+    E EE       
Sbjct: 436 VGKVSAGTSVNYRIGVEPSAPNKNSMNYSIGEAADADAVVAVFGISGVFEGEEGESTAST 495

Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
              DR+DL LP  Q + +  + +  KKP+ILVL  G P  I   +    + +IL+  YPG
Sbjct: 496 SRGDRLDLNLPQNQLDYLRELKKKCKKPIILVLTGGSP--ICTPELADMVDAILFVWYPG 553

Query: 564 EAGAVALAEVIFGDHNP 580
           + G  A+A+VIFGD NP
Sbjct: 554 QEGGHAVADVIFGDVNP 570


>gi|373955483|ref|ZP_09615443.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373892083|gb|EHQ27980.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 738

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 224/574 (39%), Positives = 320/574 (55%), Gaps = 48/574 (8%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
            S  P N  T  +PF    L + +R  DLV R+TL+EK+SQ++NSAPAI RLG+PAY WW
Sbjct: 20  LSFKPQN-HTYKYPFNNPALSMDERVADLVGRMTLEEKVSQMLNSAPAIERLGVPAYNWW 78

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
           +E LHGVA          T    T +PQ I  AA++D    + +G     E RA+YN   
Sbjct: 79  NECLHGVA---------RTPFKVTVYPQAIAMAATWDKTSMHVMGDYTAEEGRAVYNESI 129

Query: 151 A-------IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
                   +G+T+W PNINIFRDPRWGRGQET GEDP +TG+   ++V+G+QGD      
Sbjct: 130 KNDKHDIYLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGEMGSAFVKGLQGDD----- 184

Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
               L+A+ C KH+  +   +     R+KF+  ++  DL DTY P F   V   + +G+M
Sbjct: 185 -PRYLKAAGCAKHYAVH---SGPEDLRHKFNTDISDYDLWDTYLPAFRKLVVDAKVTGVM 240

Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVD 321
           CAYN   G P C    L++     +W F GY+TSDC  +   Y    +   P  E A  D
Sbjct: 241 CAYNAFKGQPCCGSDLLMNSILHDKWKFTGYVTSDCGGIDDFYRENTHQTQPDAESAAAD 300

Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
            +  G DV CG+   K    AVK  KL E +ID++L  LFSVR +LG+F+    ++ + +
Sbjct: 301 AVLHGTDVECGNVTYKSLVKAVKDGKLSEKQIDQSLKRLFSVRFKLGMFDPADAVK-YNQ 359

Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
           IG D + +PAH   AL+ A   IVLLKN   LLPL K+    +A++GPNA++A ++LGNY
Sbjct: 360 IGKDALEAPAHGAQALKMAHQSIVLLKNEGNLLPLSKNLK-KIAVLGPNADNAVSVLGNY 418

Query: 442 AGPSCRSITPLQALQNYV---ENTVYYPGCDTVACSSASIDKAVDIA--KGADHVVLMMG 496
            G   R +T LQ ++N +      +Y    D VA S+A  + A   A  K AD ++ + G
Sbjct: 419 NGTPSRIVTALQGIKNKLPAGTEVIYDKAVDYVADSAARYNYAAMAAKVKDADAIIYIGG 478

Query: 497 LDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
           +    E EE+          DR  ++LPG Q EL+ +  +A  KPV+ V++ G  +   +
Sbjct: 479 ISPELEGEEMPVSKPGFHGGDRSTILLPGVQTELL-KALKATGKPVVFVMMTGSAIATPW 537

Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
                N+ +I+ A Y G+A   A+A+V+FGD+NP
Sbjct: 538 EA--ENLPAIVNAWYGGQAAGTAIADVLFGDYNP 569


>gi|242216161|ref|XP_002473890.1| beta-xylosidase [Postia placenta Mad-698-R]
 gi|220726990|gb|EED80923.1| beta-xylosidase [Postia placenta Mad-698-R]
          Length = 741

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 228/555 (41%), Positives = 321/555 (57%), Gaps = 22/555 (3%)

Query: 34  DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
           D +N    T   C T+    +RA  L+S  TL+EKI+   N+AP +PRLG+PAY+WW EA
Sbjct: 22  DCANGPLTTNTVCDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEA 81

Query: 94  LHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
           LHGVA    G+ F  +G    ATSFPQ IL  A+FD  L   +   +  EARA  NA ++
Sbjct: 82  LHGVA-ESPGVIFAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRS 140

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
            G+ FW PNIN F+DPRWGRGQETPGEDP     Y  + + G+QG     G      +  
Sbjct: 141 -GIDFWTPNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQG-----GLDPEYKRIV 194

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           A CKHF AYDL+NW+G  RY FDA V++QDL++ Y   F +C +       MC+YN VNG
Sbjct: 195 ATCKHFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNG 254

Query: 272 IPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           +PSCA+  LL    R  WG+     YITSDCDA+  IY+   Y  +  + V D L AG D
Sbjct: 255 VPSCANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVADALNAGTD 314

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           ++CG +  ++  AA  Q    ES ++RAL   ++  ++LG F+    +QP+ +IG   V 
Sbjct: 315 LDCGEYYPENLGAAYDQGLFTESTLNRALIRQYAALVKLGYFD-PADIQPYRQIGWANVS 373

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           +P  + LA  AA +GI LLKN  G LPL  S   ++ALIGP AN+   + GNY G +   
Sbjct: 374 TPEAEELAYTAAVEGITLLKND-GTLPLSPSIK-TIALIGPWANATTQMQGNYYGVAPYL 431

Query: 449 ITPLQALQNYVENTVYY---PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
           I+PL A +  +  TVYY   PG D    +++S   A   A+ AD ++   G+D T E E 
Sbjct: 432 ISPLMAAEE-LGFTVYYSAGPGVDD--PTTSSFPAAFAAAEAADAIIYAGGIDITVEAEA 488

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
           +DR  L  PG Q + I +++    KP+I++   GG +D +    +  + +++W GYPG++
Sbjct: 489 MDRYTLDWPGVQPDFIDQLS-LLGKPLIVLQFGGGQIDDSALLPNPGVNALVWGGYPGQS 547

Query: 566 GAVALAEVIFGDHNP 580
           G  A+ ++I G+  P
Sbjct: 548 GGKAIMDIIVGNAAP 562


>gi|398403795|ref|XP_003853364.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
 gi|339473246|gb|EGP88340.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
          Length = 785

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 227/544 (41%), Positives = 307/544 (56%), Gaps = 25/544 (4%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C  T     RA  L++  T++EKI+   ++AP +PRLG+PAY WW EALHGVA    G+ 
Sbjct: 39  CDFTADPLTRATALIAAFTIEEKINNTGSTAPGVPRLGLPAYTWWQEALHGVA-QSPGVN 97

Query: 106 FN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F+  G  R ATSFPQ IL  A+FD  L   +   I  EARA  N  ++ G+ +W PNIN 
Sbjct: 98  FSDSGDFRYATSFPQPILMGAAFDDDLIKDVATVISTEARAFNNDARS-GLDYWTPNINP 156

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+D RWGRGQETPGEDP     Y  S + G+QGD    GK K   +  A CKHF AYDL+
Sbjct: 157 FKDSRWGRGQETPGEDPYHLSSYVKSLIAGLQGD----GKYK---KVVATCKHFVAYDLE 209

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
            W G  RY+FD  V  Q+L + Y PPF++C +       MC+YN +NGIP+CAD  LL  
Sbjct: 210 TWNGNFRYQFDPHVGSQELVEYYMPPFQACARDANVGAFMCSYNSLNGIPTCADPYLLQT 269

Query: 284 TARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R  W +     ++TSDCD++  +Y    Y  + E+AV   LKAG DVNCG++ Q+   
Sbjct: 270 ILREHWNWTSEEQWVTSDCDSIQNVYLPHEYTSTREEAVAVSLKAGTDVNCGTYYQEFLP 329

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A+    + E +ID AL   +S  +RLG F+G  T   +  +    V +P  Q LAL+AA
Sbjct: 330 GALSLGLVTEKDIDMALIRQYSSLVRLGYFDG--TAVEYRSLSWKDVSTPYAQQLALKAA 387

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
            +GI LLKN  G+LPL  +K   +A+IG  AN+ + +LGNY G      +PL A Q    
Sbjct: 388 VEGITLLKND-GILPLAITKDTKIAVIGDWANATEQMLGNYDGIPPYLHSPLWAAQQTGA 446

Query: 461 NTVYY--PG--CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
           N  Y   PG   D    +   I  AVD    AD ++   G+D   E E +DRV +   G 
Sbjct: 447 NVTYSGNPGGQGDPTTNNWLHIWTAVD---EADVILFAGGIDNGVEAEGMDRVSIAWTGA 503

Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           Q ++I ++A    KPVI+  +    VD T    ++NI ++LW GYPG+ G VAL ++I G
Sbjct: 504 QLDVIGQLASRG-KPVIVAQMGTNGVDSTPLLNNQNISALLWGGYPGQDGGVALLDIIQG 562

Query: 577 DHNP 580
              P
Sbjct: 563 KSAP 566


>gi|392596548|gb|EIW85871.1| hypothetical protein CONPUDRAFT_80240 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 770

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 221/538 (41%), Positives = 301/538 (55%), Gaps = 12/538 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG-VGKGI 104
           C T+L  +QRA  L+   T+DE I   VN AP +PRLG+PAYEWWSE LHGVA   G   
Sbjct: 37  CDTSLNATQRAAALIDLFTVDELIVNTVNWAPGVPRLGLPAYEWWSEGLHGVANSAGVTW 96

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
              G    ATSFPQ IL +A+FD  L   +G  IG+E RA  N G A G+ FW PNIN F
Sbjct: 97  SITGPFSYATSFPQPILMSAAFDDALIKAVGGVIGMEGRAFNNYGHA-GLDFWTPNINPF 155

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           +DPRWGRGQETPGEDP    +Y  + ++G+QG    G   +   Q  A CKHF  YDL++
Sbjct: 156 KDPRWGRGQETPGEDPYHIAQYVYNLIQGLQG----GLDPEPYFQVVATCKHFAGYDLED 211

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           W    RY ++A ++ QDL++ Y P F+SC +   A   MC+YN +NGIP+CAD  LL   
Sbjct: 212 WDFNYRYGYNAIISTQDLSEYYLPSFQSCYRDAFAGASMCSYNAINGIPTCADTYLLQDI 271

Query: 285 ARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
            R  WGF    ++T DCD+V  IYD   Y   P+ A  D LKAG D++CG F       A
Sbjct: 272 LRGFWGFDQTRWVTGDCDSVEDIYDFHHYTALPQQAAADALKAGSDIDCGIFYTTWLPLA 331

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
             +  + E ++  AL   ++  +RLG F+   + QP+ +     V +   Q LA  AA +
Sbjct: 332 YTESLITEQDLRAALTRQYASLVRLGYFD-PASEQPYRQYNWSNVDTSYAQELAYTAAVE 390

Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
           GI LLKN  G LP   S   ++ALIGP   +   + GNY G +   I+P Q  Q    N 
Sbjct: 391 GITLLKND-GTLPF-SSAIKNIALIGPWTFATTQMQGNYYGNAPYLISPYQGAQLAGYNI 448

Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
            Y    +  + ++     A   A+GAD +V + G+D T E E +DR D+  P  Q  LI 
Sbjct: 449 SYVLETNVTSNTTDGYAAAFTAAQGADAIVFVGGIDNTVEAEAMDRNDITWPAFQLWLIG 508

Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + +   KP+++V   GG VD T    + ++ ++LW GYPG++G  AL ++I G   P
Sbjct: 509 ELGKLG-KPLVVVQFGGGQVDDTEINANPDVNALLWGGYPGQSGGQALFDIISGKVAP 565


>gi|452846807|gb|EME48739.1| glycoside hydrolase family 3 protein [Dothistroma septosporum
           NZE10]
          Length = 802

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 217/539 (40%), Positives = 308/539 (57%), Gaps = 21/539 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C TT     RA  L++  TL EK++   +++P +PRLG+PAY WW EALHGVA    G+ 
Sbjct: 39  CDTTADPLTRATALINAFTLQEKLNNTGSTSPGVPRLGLPAYTWWQEALHGVAS-SPGVN 97

Query: 106 FN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F+  G  R ATSFPQ IL  A+FD  L   +   I  EARA  N  +A G+ FW PNIN 
Sbjct: 98  FSDSGPFRYATSFPQPILMGAAFDDDLIRDVATVISTEARAFNNDKRA-GLDFWTPNINP 156

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+D RWGRGQETPGEDP     Y  + + G+QG   +  K     +  A CKHF AYD++
Sbjct: 157 FKDSRWGRGQETPGEDPYHLSSYVAALIEGLQGSPDDKYK-----RVVATCKHFVAYDME 211

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W G  RY+FDA+V+ QDL + Y PPF+ C +       MC+YN +NG+P+CAD  LL  
Sbjct: 212 SWNGNFRYQFDAQVSSQDLVEYYMPPFQQCARDSNVGAFMCSYNALNGVPTCADPWLLQT 271

Query: 284 TARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R +W +     ++TSDCDAV  ++    YA + E+A    LKAG D+NCG++ Q H  
Sbjct: 272 VLREKWNWTSEQQWVTSDCDAVQNVFLPHDYASTREEAAALSLKAGTDINCGTYYQDHLP 331

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
           AA  Q  +  +++D +L   +S  +RLG F+G     P+  +  + V +P  Q LA +AA
Sbjct: 332 AAYDQGLINTTDLDISLIRQYSSLVRLGYFDG--LAVPYRNLTWNDVSTPHAQQLAYKAA 389

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
            +GI LLKN  G+LPL  S   S+ALIG  AN+   +LGNY G      +PL A Q    
Sbjct: 390 AEGITLLKND-GVLPLTISNGTSIALIGDWANATDQMLGNYDGIPPFFHSPLYAAQQTGA 448

Query: 461 NTVYYPGCDTVACSSASIDKAVDI---AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
              +  G         + D  + +   A  +D ++   G+D + E E +DRV L   G Q
Sbjct: 449 TVNFATGPG--GQGDPTTDHWLPVWAAANKSDVIIYAGGIDNSVESEGMDRVSLTWTGAQ 506

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
            ++I ++A    KPVI++ + GG +D +    + N+ +++W GYPG+ G VAL ++I G
Sbjct: 507 LDMIGQLAMYG-KPVIVLQMGGGQIDSSPLVNNPNVSALIWGGYPGQDGGVALFDIIRG 564


>gi|410617070|ref|ZP_11328046.1| beta-glucosidase [Glaciecola polaris LMG 21857]
 gi|410163339|dbj|GAC32184.1| beta-glucosidase [Glaciecola polaris LMG 21857]
          Length = 731

 Score =  387 bits (993), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 219/550 (39%), Positives = 315/550 (57%), Gaps = 42/550 (7%)

Query: 50  LPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGT 109
           +  +QRA  LV+ +T+DEKI+QL ++ PAI RL +P Y WW+EALHG+A  GK       
Sbjct: 34  ISFAQRANLLVNAMTVDEKIAQLSHATPAIARLNVPQYNWWNEALHGIARNGK------- 86

Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIGMTFWAPNIN 162
              AT FPQ I  AA+FD  L +++  AI  EARA Y       N GQ  G+TFW PN+N
Sbjct: 87  ---ATIFPQAIGLAATFDPDLAHQVASAISDEARAKYAIAQSIGNQGQYAGLTFWTPNVN 143

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
           IFRDPRWGRGQET GEDP +T +   ++V+G+QGD          L+++   KHF  +  
Sbjct: 144 IFRDPRWGRGQETYGEDPFLTAQMGTAFVKGLQGDD------PKYLKSAGVAKHFAVH-- 195

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
            +   + R+ FD   + +DL +TY P FE+ V Q + +G+MCAYN VNG P+CA   LL 
Sbjct: 196 -SGPESLRHHFDVEPSQKDLYETYLPAFEALVTQAKVAGVMCAYNAVNGEPACASAQLLD 254

Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
              ++QWGFHGYI SDC A++         KS  ++    L++G+++NCGS  +   KAA
Sbjct: 255 GILKKQWGFHGYIVSDCGALNDFQAGHKVTKSGPESAALALQSGVNLNCGSTYEHFLKAA 314

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
           ++Q  +P   ID+ L  L  +R +LG F+    + P+ ++  DV+ SP H  L+   A+ 
Sbjct: 315 LEQNLVPLELIDQRLTQLLMIRFQLGFFDP-AGLNPYNEVTPDVIHSPEHINLSRDVARK 373

Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--- 459
            IVLLKN + +LPL K   V   + GP A S+  L+GNY G S   ++ L+ +   V   
Sbjct: 374 SIVLLKNDNHVLPLSKDIKVPY-VTGPFAASSDMLIGNYYGISDSLVSVLEGIAGKVSLG 432

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL---------DRVD 510
            +  Y  G      +   ++ A  +AK AD V+ ++G+    E EE+         DRV 
Sbjct: 433 SSLNYRSGSLPFHNNINPLNWAPQVAKTADAVIAVVGVSADMEGEEVDAIASADRGDRVA 492

Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
           + LP  Q + + ++A   K P+ILV+  G PVDI+    +    +ILW  YPGE G  A+
Sbjct: 493 ITLPQNQVDYVKQLAAHKKGPLILVVAAGSPVDIS--DLEPLADAILWIWYPGEQGGNAV 550

Query: 571 AEVIFGDHNP 580
           A+V+FGD NP
Sbjct: 551 ADVLFGDTNP 560


>gi|336425135|ref|ZP_08605165.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336013044|gb|EGN42933.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 705

 Score =  387 bits (993), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 226/557 (40%), Positives = 316/557 (56%), Gaps = 61/557 (10%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           ++A +LVS++TL+EK SQL   APAIPRLG+P Y WW+EALHGVA  G           A
Sbjct: 9   EKAHELVSQMTLEEKASQLRYDAPAIPRLGVPTYNWWNEALHGVARAGV----------A 58

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
           TSFPQ I  AA+FD  L   +G A+  E RA YN            G+TFW+PN+NIFRD
Sbjct: 59  TSFPQAIAMAAAFDDELLKTVGDAVAAEGRAKYNEYSRHDDRDIYKGLTFWSPNVNIFRD 118

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQG---DTFNGGKLKGKLQASACCKHFTAYDLD 223
           PRWGRG ET GEDP +T +  V+YV G+QG   D F        ++ +AC KHF  +   
Sbjct: 119 PRWGRGHETYGEDPYLTSRLGVAYVEGLQGSQDDDF--------MKTAACAKHFAVH--- 167

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +   + R++FDA+ + +D+ +TY P FE+CVK+     +M AYNR NG P C    L+  
Sbjct: 168 SGPESVRHEFDAQASKKDMYETYLPAFEACVKEAGVEAVMGAYNRTNGEPCCGSPTLIQN 227

Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAA 342
             R +W F G+  SDC A++  +      K+PE++    LK+G DVNCG ++L  H   A
Sbjct: 228 ILREEWDFQGHYVSDCWAIADFHMHHMVTKTPEESAALALKSGCDVNCGVTYL--HLLKA 285

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
            +Q  + E EI +A   LF+ R  LG F+ N     +  I  +VV    H  LA + A++
Sbjct: 286 YQQGLVTEEEITQAAERLFTTRFLLGCFDKN----EYDDIPYEVVECKEHLELAQKMAKE 341

Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
            +VLLKN  G+LPL K    ++ +IGPNA+S   L+GNY G S R IT L+ +Q++V   
Sbjct: 342 SMVLLKND-GILPLNKDGLKTIGVIGPNADSRTPLVGNYHGTSSRYITLLEGIQDFVGED 400

Query: 463 V---YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE------- 505
           V   Y  GC       + +      I +A+ +A+ +D VVL +GLD+  E EE       
Sbjct: 401 VRVYYSEGCHIYKDRVEGLGWKQDRISEALTVAEHSDVVVLCLGLDENLEGEEGDTGNSY 460

Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
              D+ DL LP  Q+EL+  VA    KPV+L ++ G  +D+ FA    ++ +IL   YPG
Sbjct: 461 ASGDKKDLELPESQRELLEAVA-GCGKPVVLCMMSGSAIDMQFAA--EHVNAILQVWYPG 517

Query: 564 EAGAVALAEVIFGDHNP 580
             G  A AE++FG  +P
Sbjct: 518 ARGGKAAAEILFGACSP 534


>gi|391865040|gb|EIT74331.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 822

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 223/548 (40%), Positives = 319/548 (58%), Gaps = 28/548 (5%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
           + P C T+L I++R   LV  LTL+EKI  LV+++    RLG+P+YEWWSEA HGV G  
Sbjct: 76  SHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGV-GSA 134

Query: 102 KGIFFN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWA 158
            G+ F         ATSFP  ILTAASFD  L  +I + IG E RA  N G + G  FWA
Sbjct: 135 PGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFS-GFDFWA 193

Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
           PNIN FRDPRWGRGQETPGEDPLV   Y  ++V G+QGD       K K Q  A CKH+ 
Sbjct: 194 PNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGD-----DPKNK-QVIATCKHYA 247

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
            YDL+    T RY  +   T QDL+D +  PF++CV+      IMC+YN V+GIP+CA+ 
Sbjct: 248 VYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANE 303

Query: 279 NLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SF 334
            LL +  R+ W F+    Y+ SDC AV+ I+    +  + E A    L AG+D+ CG S+
Sbjct: 304 YLLDEVLRKHWNFNSDYYYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSY 363

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
           L+ +   A  Q  +    +DR+L  L+S    +G F+G      + K+    V +P  Q 
Sbjct: 364 LKLNESLAANQTSV--KVMDRSLARLYSALFTVGFFDGG----KYDKLDFSDVSTPDAQA 417

Query: 395 LALQAAQDGIVLLKNSHGLLPLP-KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
           LA +AA +G+ LLKN   LLPL    K  S+A+IGP AN+   + G+Y+G +   I+PL+
Sbjct: 418 LAYEAAVEGMTLLKNDD-LLPLDFPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLE 476

Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           A  +      Y  G      +++  ++A+  A  +D ++ + G+D + E E LDR  L  
Sbjct: 477 AFGDSRWKVNYALGTAINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLAW 536

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PG Q +LIT +++ + KP+++V   GG VD +    +++I +++WAGYP ++G  AL +V
Sbjct: 537 PGNQLDLITSLSKLS-KPLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSGGTALLDV 595

Query: 574 IFGDHNPG 581
           + G  +P 
Sbjct: 596 LVGKRSPA 603


>gi|83774566|dbj|BAE64689.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 822

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 223/548 (40%), Positives = 320/548 (58%), Gaps = 28/548 (5%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
           + P C T+L I++R   LV  LTL+EKI  LV+++    RLG+P+YEWWSEA HGV G  
Sbjct: 76  SHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGV-GSA 134

Query: 102 KGIFFN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWA 158
            G+ F         ATSFP  ILTAASFD  L  +I + IG E RA  N G + G  FWA
Sbjct: 135 PGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFS-GFDFWA 193

Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
           PNIN FRDPRWGRGQETPGEDPLV   Y  ++V G+QGD       K K Q  A CKH+ 
Sbjct: 194 PNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGD-----DPKNK-QVIATCKHYA 247

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
            YDL+    T RY  +   T QDL+D +  PF++CV+      IMC+YN V+GIP+CA+ 
Sbjct: 248 VYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANE 303

Query: 279 NLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SF 334
            LLS+  R+ W F+    Y+ SDC AV+ I+    +  + E A    L AG+D+ CG S+
Sbjct: 304 YLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSY 363

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
           L+ +   A  Q  +    +D++L  L+S    +G F+G      + K+    V +P  Q 
Sbjct: 364 LKLNESLAANQTSV--KVMDQSLARLYSALFTVGFFDGG----KYDKLDFSDVSTPDAQA 417

Query: 395 LALQAAQDGIVLLKNSHGLLPLPK-SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
           LA +AA +G+ LLKN   LLPL    K  S+A+IGP AN+   + G+Y+G +   I+PL+
Sbjct: 418 LAYEAAVEGMTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLE 476

Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           A  +      Y  G      +++  ++A+  A  +D ++ + G+D + E E LDR  L  
Sbjct: 477 AFGDSRWKVNYALGTAMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLTW 536

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PG Q +LIT +++ + KP+++V   GG VD +    +++I +++WAGYP ++G  AL +V
Sbjct: 537 PGNQLDLITSLSKLS-KPLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSGGTALLDV 595

Query: 574 IFGDHNPG 581
           + G  +P 
Sbjct: 596 LVGKRSPA 603


>gi|413925165|gb|AFW65097.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 412

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 185/301 (61%), Positives = 224/301 (74%), Gaps = 4/301 (1%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           PPFSC    PS    PFC T LP +QRA DLVSR+T  EK SQL + A  +PRLG+P+Y+
Sbjct: 84  PPFSCG-GGPSLG-LPFCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLGVPSYK 141

Query: 89  WWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
           WW+EALHGVA  GKGI  + G +R ATSFPQV+LTAASF+  LW+RIGQA G EARA YN
Sbjct: 142 WWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEARAFYN 201

Query: 148 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
            GQA G+T W+PN+NIFRDPRWGRGQETPGEDP V  +YA ++VRG+QG + N   +   
Sbjct: 202 IGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQGSSSNTKSVPPV 261

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           L  SACCKH TAYDL++WKG TRY F A VT+QDLADT+ PPF SCV  G+AS +MCAY 
Sbjct: 262 LLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYT 321

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHG-YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
            VNG+PSCA+ +LL+KT R  WG  G Y+ +DCDAVSI+ +++ Y  + ED V   LKAG
Sbjct: 322 SVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAG 381

Query: 327 M 327
           M
Sbjct: 382 M 382


>gi|402225863|gb|EJU05924.1| hypothetical protein DACRYDRAFT_113532 [Dacryopinax sp. DJM-731
           SS1]
          Length = 778

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 221/546 (40%), Positives = 305/546 (55%), Gaps = 20/546 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C + L    RAR LV  LT+ EK +  VN++P +PRLG+P Y WWSE LHGVA    G+ 
Sbjct: 42  CDSALDPLTRARALVGMLTMAEKFNNTVNASPGVPRLGLPPYNWWSEGLHGVAS-SPGVT 100

Query: 106 F---NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           F         ATSFP+ IL  A+FD  L Y I   I  EARA  N   + G+ FW PNIN
Sbjct: 101 FAPAGQNFSYATSFPEPILMGAAFDDNLIYDIATIISTEARAFNNFNHS-GLDFWTPNIN 159

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
             RDPRWGR  ETPGEDP     Y    V G+Q   F G   K + +  A CKH+  YDL
Sbjct: 160 PVRDPRWGRSLETPGEDPFHLASYVAKLVTGLQ---FGGDDPKYQ-KLVATCKHYAGYDL 215

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           +NW G  RY FDA ++ QDL + + PPF++C +    + +MC+YN VNGIPSCA+  LL 
Sbjct: 216 ENWGGYARYGFDAVISNQDLVEYFLPPFQTCARDVNVTSVMCSYNAVNGIPSCANDYLLQ 275

Query: 283 KTARRQWGFH--------GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
              R  WG+          Y+TSDCDAVS IY    Y  +PE AV   LKAG D++CG+F
Sbjct: 276 SLLRTYWGWEPDSESLNAHYVTSDCDAVSNIYYPHNYTITPEQAVAVSLKAGTDLDCGTF 335

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             +   ++ +Q    +++IDRAL   ++    LG F+     Q + +     + +   Q 
Sbjct: 336 YAEWLPSSYEQGLFHQTDIDRALIRSYAALFLLGYFDP-AEGQIYRQYNWANINTDYAQQ 394

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LA  AA +GI LLKN   +LPLP S   ++ALIGP AN+   + GNY G +    +PL A
Sbjct: 395 LAYTAAWEGITLLKNIDDMLPLP-STMTNIALIGPWANATTQMQGNYQGIAPFLHSPLYA 453

Query: 455 LQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
           LQ    N  Y  G +  + S+A    A+  A+ AD  + + G+D T E E +DRV++  P
Sbjct: 454 LQQRGINVTYVLGTNITSNSTAGFAAALAAAQTADLTLYIGGIDITVEAEAMDRVNITWP 513

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           G Q +LI ++A  +   +I+  + GG +D T    +  +  +LW GYPG+ G  A+ +++
Sbjct: 514 GNQLDLIAQLANVSTH-LIVYQMGGGQIDDTVLLENPKVHGLLWGGYPGQDGGTAMIDIL 572

Query: 575 FGDHNP 580
           +G   P
Sbjct: 573 YGSRAP 578


>gi|389748262|gb|EIM89440.1| hypothetical protein STEHIDRAFT_182874, partial [Stereum hirsutum
           FP-91666 SS1]
          Length = 772

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 217/541 (40%), Positives = 301/541 (55%), Gaps = 15/541 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA---GVGK 102
           C TT     RA  L+    L + ++  VN +P + RLG+P Y+WW+EALHGV    GV  
Sbjct: 39  CNTTAHFVDRATSLIEEFNLTDLVNNTVNGSPGVDRLGLPPYQWWNEALHGVGSSPGVNW 98

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           G   +     ATSFP  IL  A+F+  L   I   I  EARA  N   A G+TF+ PNIN
Sbjct: 99  GSGPDANFTSATSFPAPILLGATFNDSLIASIADVISTEARAFNNFNYA-GLTFFTPNIN 157

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
            FRDPRWGRGQETPGEDP    +Y   YV G+QG    G       +  A CKH  AYD+
Sbjct: 158 PFRDPRWGRGQETPGEDPYHLSRYVYQYVVGLQG----GLSPDPYYKVLANCKHVLAYDV 213

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           +NW+G  R  F+A VT QDL++ Y P F+ C++  + +  MC+YN VNG+PSCA   +L 
Sbjct: 214 ENWEGNDRTGFNAVVTTQDLSEFYTPSFQGCLRDAQGASAMCSYNAVNGVPSCASSYILK 273

Query: 283 KTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
              R  WG     G+IT DC AV  IY   GY  +  +A    + AG D++CG     + 
Sbjct: 274 DLVRDFWGLGEREGWITGDCGAVQNIYQPHGYTDTLVNATAVAMDAGTDLDCGDVYSPNL 333

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
             AV +  +   +I  AL  L+   +RLG F+     QP+       V +P+ Q LA  A
Sbjct: 334 WTAVVEGLITAGQIQTALIRLYGSLIRLGYFD-PAEQQPYRSFDWSNVNTPSSQDLAYNA 392

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A  GIVLL+N  GLLPL  +   ++ALIGP AN+  +L GNYAG +   I+P QA +   
Sbjct: 393 AVQGIVLLEN-DGLLPL-STNVKNIALIGPMANATLSLQGNYAGIAPFVISPQQAFETAG 450

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
            N  +  G       ++   +A++ A+GAD VV + G+D + E E  DR  +  PG Q +
Sbjct: 451 YNVTFAFGTGISNSDNSGYSEALEAAQGADVVVFVGGIDNSIEAEGQDRTSIEWPGSQLD 510

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           LI ++ E   KP+++V + GG  D +  K +  + ++LWAGYPG++G  AL ++I G  +
Sbjct: 511 LIGQLGELG-KPLVVVRMGGGQCDDSTLKANATVNALLWAGYPGQSGGTALVDIISGKQS 569

Query: 580 P 580
           P
Sbjct: 570 P 570


>gi|347832625|emb|CCD48322.1| glycoside hydrolase family 3 protein [Botryotinia fuckeliana]
          Length = 772

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 220/541 (40%), Positives = 313/541 (57%), Gaps = 17/541 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+     RA  L+S  TL EK++   N++P +PR+G+P+YEWW+EALHG+A    G  
Sbjct: 34  CDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIA-RSPGTT 92

Query: 106 FNGT---IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           F  T      +TSFPQ IL  A+FD  L +++   +  EARA  N  +  G+ FW PNIN
Sbjct: 93  FAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNR-FGLNFWTPNIN 151

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
            ++DPRWGRGQETPGEDP  T  Y  + + G+QG   +    KG     A CKHF  YDL
Sbjct: 152 PYKDPRWGRGQETPGEDPFHTSSYVNALITGLQGGLDDLPYKKGV----ATCKHFAGYDL 207

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           ++  G  RY FDA +  QDL D Y PPF+ C +      +MC+YN +NG+P+CAD  LL 
Sbjct: 208 ESSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCADDWLLQ 267

Query: 283 KTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
              R  WG+     ++TSDCDAV  I+D   Y  +PE +  D L AG D++CG+F   + 
Sbjct: 268 TLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSAADALNAGTDLDCGTFWPTYL 327

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
            +A  Q     S +DR+L   ++  +RLG F+  P++QP+ ++  D V +PA Q LALQA
Sbjct: 328 GSAYDQGLYDISTLDRSLARRYASLVRLGYFDP-PSVQPYRQLNWDNVSTPAAQQLALQA 386

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A+DGIVLLKN  G+LPL  S   ++ALIGP AN+ K + GNY G +    +PL A QN  
Sbjct: 387 AEDGIVLLKND-GILPL-SSNITNVALIGPLANATKQMQGNYYGTAPYLRSPLIAAQNAG 444

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
               Y  G D  + ++     A+  A+ AD V+ + G+D + E EE+DR  +  P  Q  
Sbjct: 445 FKVTYVQGADIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAEEIDRTSISWPSSQLS 504

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           LI ++A  +   +I  + C   +D +    +  + ++LWAGYPG+ G  A+  ++ G   
Sbjct: 505 LINQLANLSTPLIISQMGCM--IDSSSLLSNTGVNALLWAGYPGQDGGTAIFNILTGKTA 562

Query: 580 P 580
           P
Sbjct: 563 P 563


>gi|392560759|gb|EIW53941.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
           SS1]
          Length = 783

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 227/540 (42%), Positives = 311/540 (57%), Gaps = 15/540 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C  T     RA  L+   T +E  S  VN++P +PRLG+PAY WWSE LHGVA    G+ 
Sbjct: 41  CDITKDPITRATALIGLWTDEELTSNTVNASPGVPRLGLPAYNWWSEGLHGVA-QSPGVT 99

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F  +G    ATSFPQ IL  A+FD  L   I   +  E RA  NAG+A G+ +W PNIN 
Sbjct: 100 FAPSGNFSHATSFPQPILMGAAFDDTLIQAIATIVSTEGRAFNNAGRA-GLDYWTPNINP 158

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRGQETPGEDP    +Y  + + G+QG    G   K   +  A CKHF AYDL+
Sbjct: 159 FKDPRWGRGQETPGEDPFHLSQYVYNLILGLQG----GLDPKPYFKVVADCKHFAAYDLE 214

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           NW+G  R  FDA V+ QDL++ Y PPF++CV+  + + +MC+YN VNGIPSCA+  LL  
Sbjct: 215 NWEGIVRNGFDAIVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQD 274

Query: 284 TARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R  WGF    ++TSDCDAV  I     Y   P  A  D L AG D++CG+F   +   
Sbjct: 275 VLRDHWGFTDDRWVTSDCDAVENILTPHKYTTDPAQAAADALLAGTDIDCGTFSSTYLPE 334

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A+++  +  +++ RA    ++  +RLG F+ +P  QP+ ++G   V +P  Q LA  AA 
Sbjct: 335 ALQRGLVNSTDLRRAAIRQYASLVRLGYFD-DPAAQPYRQLGWSDVNTPQAQQLAHTAAV 393

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           +GIVLLKN  G+LP  K     LALIGP AN+   L G+Y G +   ++PLQ  Q     
Sbjct: 394 EGIVLLKND-GVLPFSKHVR-KLALIGPWANATSLLQGSYIGVAPYLVSPLQGAQEAGFE 451

Query: 462 TVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
             Y  G +    +  S    AV   + AD VV   GLD+T E E  DR+++  PG Q +L
Sbjct: 452 VEYVLGTNVTTQNDMSGFAAAVAAVRRADAVVFAGGLDETVECEGTDRLNVTWPGNQLDL 511

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  + E   KP+I+    GG +D T  K+ + + +I+W GYPG++G  AL +++ G   P
Sbjct: 512 VAEL-ERVGKPLIVAQFGGGQLDDTALKHSKAVNAIIWGGYPGQSGGTALFDILTGKAAP 570


>gi|317156541|ref|XP_001825822.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
          Length = 882

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 223/547 (40%), Positives = 320/547 (58%), Gaps = 28/547 (5%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
           + P C T+L I++R   LV  LTL+EKI  LV+++    RLG+P+YEWWSEA HGV G  
Sbjct: 136 SHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGV-GSA 194

Query: 102 KGIFFN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWA 158
            G+ F         ATSFP  ILTAASFD  L  +I + IG E RA  N G + G  FWA
Sbjct: 195 PGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFS-GFDFWA 253

Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
           PNIN FRDPRWGRGQETPGEDPLV   Y  ++V G+QGD       K K Q  A CKH+ 
Sbjct: 254 PNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGD-----DPKNK-QVIATCKHYA 307

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
            YDL+    T RY  +   T QDL+D +  PF++CV+      IMC+YN V+GIP+CA+ 
Sbjct: 308 VYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANE 363

Query: 279 NLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SF 334
            LLS+  R+ W F+    Y+ SDC AV+ I+    +  + E A    L AG+D+ CG S+
Sbjct: 364 YLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSY 423

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
           L+ +   A  Q  +    +D++L  L+S    +G F+G      + K+    V +P  Q 
Sbjct: 424 LKLNESLAANQTSV--KVMDQSLARLYSALFTVGFFDGG----KYDKLDFSDVSTPDAQA 477

Query: 395 LALQAAQDGIVLLKNSHGLLPLPK-SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
           LA +AA +G+ LLKN   LLPL    K  S+A+IGP AN+   + G+Y+G +   I+PL+
Sbjct: 478 LAYEAAVEGMTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLE 536

Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           A  +      Y  G      +++  ++A+  A  +D ++ + G+D + E E LDR  L  
Sbjct: 537 AFGDSRWKVNYALGTAMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLTW 596

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PG Q +LIT +++ + KP+++V   GG VD +    +++I +++WAGYP ++G  AL +V
Sbjct: 597 PGNQLDLITSLSKLS-KPLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSGGTALLDV 655

Query: 574 IFGDHNP 580
           + G  +P
Sbjct: 656 LVGKRSP 662


>gi|70986056|ref|XP_748529.1| beta-xylosidase [Aspergillus fumigatus Af293]
 gi|74668295|sp|Q4WFI6.1|BXLB_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|296439536|sp|B0Y0I4.1|BXLB_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|66846158|gb|EAL86491.1| beta-xylosidase, putative [Aspergillus fumigatus Af293]
 gi|159128339|gb|EDP53454.1| beta-xylosidase [Aspergillus fumigatus A1163]
          Length = 771

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 226/540 (41%), Positives = 314/540 (58%), Gaps = 17/540 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+L ++ RA+ LV+ +T +EK++    ++P +PRLG+PAY WWSEALHGVAG   G+ 
Sbjct: 43  CDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVAG-SPGVE 101

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F  +G    ATSFPQ IL  A+FD  L  ++   +  E RA  NAG++ G+ FW PNIN 
Sbjct: 102 FADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRS-GLDFWTPNINP 160

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           FRD RWGRGQETPGEDPL   +Y    V G+Q      G      +  A CKHF AYDL+
Sbjct: 161 FRDARWGRGQETPGEDPLHVSRYVYHLVDGLQN-----GIGPANPKVVATCKHFAAYDLE 215

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W G  R+ F+A V+ QDL++ Y PPF+SC +  R   +MC+YN +NG+P+CAD  LL  
Sbjct: 216 DWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVMCSYNALNGVPACADSYLLQT 275

Query: 284 TARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R  W +     +ITSDC A+  IY+   +  +P +A    L AG D++CG+   K+  
Sbjct: 276 ILREHWKWDEPGRWITSDCGAIDDIYNGHNFTTTPAEAAATALNAGTDLDCGTVFPKYLG 335

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A  +       +DRAL  L+S  ++LG F+     QP+  IG   V +PA + LA +AA
Sbjct: 336 QAADEGLYSNQTLDRALVRLYSSLVKLGYFDPAED-QPYRSIGWTDVDTPAAEALAHKAA 394

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
            +GIVLLKN    LPL K+K  +LALIGP AN+ K + GNY GP+    T L A      
Sbjct: 395 GEGIVLLKNDK-TLPL-KAKG-TLALIGPYANATKQMQGNYEGPAKYIRTLLWAATQAGY 451

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
           +  Y  G      S+A  D A+  AK AD VV   G+D T E E  DR  +  PG Q  L
Sbjct: 452 DVKYAAGTAINTNSTAGFDAALSAAKQADVVVYAGGIDNTIEAEGRDRTTIAWPGNQVNL 511

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I ++++   KP+++V   GG VD +    +  + ++LWAGYP + G  A+ +++ G   P
Sbjct: 512 IDQLSKIG-KPLVVVQFGGGQVDDSSLLSNPRVNALLWAGYPSQEGGSAIFDILTGKTAP 570


>gi|426198365|gb|EKV48291.1| hypothetical protein AGABI2DRAFT_219902 [Agaricus bisporus var.
           bisporus H97]
          Length = 767

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 213/536 (39%), Positives = 313/536 (58%), Gaps = 15/536 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C  T   + RA+ L+   T +E +    N +P +PRLG+P+Y+WWSEALHGVAG   G+ 
Sbjct: 38  CDPTKAPAARAKTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVAG-SPGVS 96

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F  +G    ATSFPQ I+  ++FD  L   +   I  EARA  N  +A G+ ++ PNIN 
Sbjct: 97  FAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRA-GLDYFTPNINP 155

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRGQETPGEDP    +Y  S + G+QG    G   +   + +A CKH+ AYDLD
Sbjct: 156 FKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQG----GIDPRPYFKVAADCKHYAAYDLD 211

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W+G  R+ FDA+V++QDL++ Y P F+SCV+  + + +MC+YN VNGIP+CA+  LL  
Sbjct: 212 SWEGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGIPACANPYLLQD 271

Query: 284 TARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R  WGF    ++TSDCDA+  I+    +  +  +AV D LKAG DV+CG+    H   
Sbjct: 272 ILRDFWGFDDDRWVTSDCDAIGNIFTTHNFTDTFAEAVADALKAGTDVDCGTSYSTHLPD 331

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A+ Q  +   +++RAL   ++  MRLG F+  P  QP  ++    V  P  Q LA  AA 
Sbjct: 332 ALNQSLITRDDLERALTRQYTSLMRLGYFD-PPESQPLRQLAWSDVNKPDAQALAHTAAV 390

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ-ALQNYVE 460
           +G+VLLKN  G LP+  S   ++A+IGP AN+ K + GNY G +   +TP Q A+     
Sbjct: 391 EGLVLLKND-GFLPVSASGK-TIAIIGPYANATKDMQGNYFGTAPFIVTPFQGAVDAGFN 448

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
             V   G      S A    A+ +A  +D ++   G++ + E E  DR+ +   G Q  L
Sbjct: 449 EVVSAAGTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRLTIAWTGNQLSL 508

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           + ++A    KPV++V   GG +D +    +  + +++WAGYPG++G  A+ +VI G
Sbjct: 509 VKQLASLG-KPVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTAIFDVITG 563


>gi|451849522|gb|EMD62825.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
          Length = 849

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 211/532 (39%), Positives = 304/532 (57%), Gaps = 18/532 (3%)

Query: 55  RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
           RA+ LV+  TL+EKI+   NSAP + RLGIP Y+WW+E LHG+AG        G    +T
Sbjct: 114 RAKSLVALYTLEEKINATSNSAPGVARLGIPPYQWWNEGLHGIAGPFTSFAKQGDYSYST 173

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQE 174
           SFPQ IL  A+FD  L   +   I  EARA  N  +  G+ FW PNIN FRDPRWGRGQE
Sbjct: 174 SFPQPILMGAAFDDNLITEVANVISTEARAFNNVNRT-GLDFWTPNINPFRDPRWGRGQE 232

Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
           TPGED      Y  + + G+QG+  +  +     +  A CKH+  YD++NW G  RY+ D
Sbjct: 233 TPGEDSYHLSSYVKALIHGLQGNETDPYR-----RVVATCKHYAGYDIENWNGNLRYQND 287

Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG- 293
            +++ QDL + Y  PFE+CV Q      MC+YN VNG P CAD  +L    R  WG+   
Sbjct: 288 VQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYMLQTVLREHWGWSSD 346

Query: 294 --YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPES 351
             ++TSDCD++  +Y    ++ + E A  D L AG D++CG++LQ H   AVKQ    E+
Sbjct: 347 EHWVTSDCDSIQNVYLPHQWSSTREGAAADSLNAGTDLDCGTYLQSHLPGAVKQGLTNET 406

Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
            +D AL   +S  ++LG F+  P  QP+ ++G D V + A Q LAL+AA++GIVLLKN  
Sbjct: 407 TLDNALIRQYSSLIKLGYFD-IPENQPYRQLGFDAVATSASQALALKAAEEGIVLLKND- 464

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY---YPGC 468
           G+LP+    S ++ + G  AN+   L GNY G +    +P  AL+    N  Y    PG 
Sbjct: 465 GVLPI-NFGSKNVGIYGDWANATSQLQGNYFGVAKFLTSPYMALEKLGVNVRYAGNLPGG 523

Query: 469 DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA 528
                ++ S  +   +   +D  + + G+D   E E+ DR  L L G Q ++I ++A+  
Sbjct: 524 QG-DPTTGSWPRLSGVITTSDVHIWVGGMDNGIESEDRDRSWLTLTGSQLDVIGQLADTG 582

Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KPVI++++ GG +D +    +  I ++LWAGYPG+ G  A+  ++ G   P
Sbjct: 583 -KPVIVIIMGGGQIDTSPLIKNPKISAVLWAGYPGQDGGTAIVNILTGKAAP 633


>gi|409079872|gb|EKM80233.1| hypothetical protein AGABI1DRAFT_57801 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 767

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 213/536 (39%), Positives = 312/536 (58%), Gaps = 15/536 (2%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C  T   + RA  L+   T +E +    N +P +PRLG+P+Y+WWSEALHGVAG   G+ 
Sbjct: 38  CDPTKAPAARATTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVAG-SPGVS 96

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F  +G    ATSFPQ I+  ++FD  L   +   I  EARA  N  +A G+ ++ PNIN 
Sbjct: 97  FAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRA-GLDYFTPNINP 155

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRGQETPGEDP    +Y  S + G+QG    G   +   + +A CKH+ AYDLD
Sbjct: 156 FKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQG----GIDPRPYFKVAADCKHYAAYDLD 211

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W+G  R+ FDA+V++QDL++ Y P F+SCV+  + + +MC+YN VNGIP+CA+  LL  
Sbjct: 212 SWEGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGIPACANPYLLQD 271

Query: 284 TARRQWGFHG--YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R  WGF    ++TSDCDA+  I+    +  +  +AV D LKAG DV+CG+    H   
Sbjct: 272 ILRDFWGFDDDRWVTSDCDAIGNIFTTHNFTDTFAEAVADALKAGTDVDCGTSYSTHLPD 331

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A+ Q  +   +++RAL   ++  MRLG F+  P  QP  ++    V  P  Q LA  AA 
Sbjct: 332 ALNQSLITRDDLERALTRQYTSLMRLGYFD-PPESQPLRQLAWSDVNKPDAQALAHTAAV 390

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ-ALQNYVE 460
           +G+VLLKN  G LP+  S   ++A+IGP AN+ K + GNY G +   +TP Q A+     
Sbjct: 391 EGLVLLKND-GFLPVSASGK-TIAIIGPYANATKDMQGNYFGTAPFIVTPFQGAVDAGFN 448

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
             V   G      S A    A+ +A  +D ++   G++ + E E  DR+ +   G Q  L
Sbjct: 449 EVVSAAGTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRLTIAWTGNQLSL 508

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           + ++A    KPV++V   GG +D +    +  + +++WAGYPG++G  A+ +VI G
Sbjct: 509 VKQLASLG-KPVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTAIFDVITG 563


>gi|333381510|ref|ZP_08473192.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332830480|gb|EGK03108.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 738

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 222/560 (39%), Positives = 315/560 (56%), Gaps = 45/560 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +PF  T L   +R  DLVSRLTL+EK+ Q++N+ PAI RL IPAY WW+E LHG+     
Sbjct: 24  YPFRDTKLSTDKRVSDLVSRLTLEEKVLQMLNNTPAIERLNIPAYNWWNECLHGIGR--- 80

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MT 155
                 T    T FPQ I  AA++D+ L   +  AI  E RA+YN   A G       +T
Sbjct: 81  ------TEYKVTVFPQAIGMAAAWDARLLKDVANAISDEGRAIYNDASAKGNYSIYHGLT 134

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           +W PN+NIFRDPRWGRGQET GEDP +TG    S+V G+QGD          L+A+AC K
Sbjct: 135 YWTPNVNIFRDPRWGRGQETYGEDPYLTGALGKSFVAGLQGDD------SQYLKAAACAK 188

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           H   Y + +    TR+ F+  VT  DL DTY P F   V   + +G+MCAYN  +G P C
Sbjct: 189 H---YAVHSGPENTRHTFNTFVTTFDLWDTYLPAFRDLVVDAKVAGVMCAYNAFSGEPCC 245

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
            +  L+ +  R +WGF GY+TSDC A+   Y         + A  D + +G D++CG+  
Sbjct: 246 GNNLLMQEILRDKWGFTGYVTSDCGAIDDFYRHHKTHPDAKYAAADAVYSGTDIDCGNEA 305

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
            K    AVK   + E +ID +L  LF +R RLG+F+    ++ F KI   V+ S  H+ L
Sbjct: 306 YKALVDAVKTGLITEEQIDISLKRLFEIRFRLGMFDPAEDVK-FSKIPLSVLESQPHKDL 364

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           AL+  ++ IVLLKN +  LPL K K   +A+IGPNA++  ++LGNY G   + ITP +A+
Sbjct: 365 ALKITRESIVLLKNENNFLPLSK-KLKKVAVIGPNADNEVSVLGNYNGFPTQIITPYKAI 423

Query: 456 QNYVENT--VYYPGCDTVACSSASIDKAVDIA---KGADHVVLMMGLDQTQEKEEL---- 506
           +N ++NT  +Y  G D V  S  S ++   +A   KG D V+   G+    E EE+    
Sbjct: 424 KNKLKNTEVIYEKGIDFVKPSENSKEEIAALAKRLKGMDVVIFAGGISPELEGEEMPVKI 483

Query: 507 ------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
                 DR  + LP  Q EL+ +  +A + P + V++ G  +   +    +N+ +IL A 
Sbjct: 484 EGFTGGDRTSIKLPKIQTELM-QALKAERIPTVFVMMTGSAIAAEWES--QNVPAILNAW 540

Query: 561 YPGEAGAVALAEVIFGDHNP 580
           Y G+    A+A+V+FGD+NP
Sbjct: 541 YGGQDAGTAIADVLFGDYNP 560


>gi|449303062|gb|EMC99070.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 786

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 213/547 (38%), Positives = 318/547 (58%), Gaps = 17/547 (3%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
           N +  T P C  +L    RA  LV   TL+E  +   N+AP +PRLG+PAYE W+EALHG
Sbjct: 51  NSTLSTTPVCNRSLSAWDRAHALVQLFTLEELANNTGNTAPGVPRLGLPAYEVWNEALHG 110

Query: 97  VAGVGKGIF-FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT 155
           ++    G F  NGT   ATSFP  IL+ AS +  L  +IG  I  + RA  NAG+  G+ 
Sbjct: 111 IS---HGHFATNGTWSWATSFPSPILSMASMNRTLINQIGDIISTQGRAFSNAGR-YGLD 166

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
            +APNIN FR P WGRGQETPGED   ++  YA  Y+ G+QG     GK     +  A  
Sbjct: 167 SYAPNINGFRSPVWGRGQETPGEDAFFLSSLYAYEYITGMQG-----GKAPAVPKLVAVP 221

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KHF  YD++NW   +R   D  +T QDLA  Y P F S ++  +A G+MC+YN VNG+PS
Sbjct: 222 KHFAGYDIENWNNNSRLGLDVNITQQDLAGYYTPQFRSAIQNAKALGLMCSYNAVNGVPS 281

Query: 275 CADRNLLSKTARRQWGF-HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
           C++   L   AR  WGF +G+++SDCDAV  +Y+  GYA +   AV D L+AG D++CG+
Sbjct: 282 CSNSFFLQTLARDTWGFGNGFVSSDCDAVYNVYNPHGYAANTTGAVADSLRAGTDIDCGT 341

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
               +   A     +  ++I+ AL   +S  +  G F+GN ++  +  +G + V +    
Sbjct: 342 SYPFYLVPAFNAGLVSRNDIELALTRYYSGLVMQGYFDGNSSL--YRNLGWNDVLTTDAW 399

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            ++ +AA +GI LLKN  G LPL KS   S+ALIGP AN+   L GNY   +   I+PLQ
Sbjct: 400 NISYEAAVEGITLLKND-GTLPLSKSTR-SVALIGPWANATLQLQGNYYAAAPYLISPLQ 457

Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           A +       +  G    + +++   +A+ +A+ +D ++   G+D + E E LDR ++  
Sbjct: 458 AFRASGMTVNFVNGTTISSTNTSGFAEAITLAQQSDVIIYAGGIDNSIEAEGLDRQNITW 517

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PG Q +LI ++++   KP++++ + GG VD +  K +  + +++W GYPG++G  AL ++
Sbjct: 518 PGNQLDLIYQLSQVG-KPLVVLQMGGGQVDSSALKNNSKVNALVWGGYPGQSGGQALFDI 576

Query: 574 IFGDHNP 580
           I G+  P
Sbjct: 577 IMGNRAP 583


>gi|238492365|ref|XP_002377419.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|220695913|gb|EED52255.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 775

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 220/547 (40%), Positives = 318/547 (58%), Gaps = 28/547 (5%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
           + P C T+L I++R   LV  LTL+EKI  LV+++    RLG+P+YEWWSEA HGV G  
Sbjct: 29  SHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGV-GSA 87

Query: 102 KGIFFN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWA 158
            G+ F         ATSFP  ILTAASFD  L  +I + IG E R   N G + G  FWA
Sbjct: 88  PGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRVFGNNGFS-GFDFWA 146

Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
           PNIN FRDPRWGRGQETPGEDPLV   Y  ++V G+QGD       K K Q  A CKH+ 
Sbjct: 147 PNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGD-----DPKNK-QVIATCKHYA 200

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
            YDL+    T RY  +   T QDL++ +  PF++CV+      IMC+YN V+GIP+CA+ 
Sbjct: 201 VYDLE----TGRYGNNYNPTQQDLSEYFLAPFKTCVRDTDVGSIMCSYNSVSGIPACANE 256

Query: 279 NLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SF 334
            LL +  R+ W F+    Y+ SDC AV+ I+    +  + E A    L AG+D+ CG S+
Sbjct: 257 YLLDEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNFTDTEEAAASVALNAGVDLECGSSY 316

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
           L+ +   A  Q  +    +D++L  L+S    +G F+G      + K+    V +P  Q 
Sbjct: 317 LKLNESLAANQTSV--KVMDQSLARLYSALFTVGFFDGG----KYDKLDFSDVSTPDAQA 370

Query: 395 LALQAAQDGIVLLKNSHGLLPLPK-SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
           LA +AA +G+ LLKN   LLPL    K  S+A+IGP AN+   + G+Y+G +   I+PL+
Sbjct: 371 LAYEAAVEGMTLLKNDD-LLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISPLE 429

Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           A  +      Y  G      +++  ++A+  A  +D ++ + G+D + E E LDR  L  
Sbjct: 430 AFGDSRWKVNYALGTAINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTSLAW 489

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PG Q +LIT +++ + KP+++V   GG VD +    +++I +++WAGYP ++G  AL +V
Sbjct: 490 PGNQLDLITSLSKLS-KPLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSGGTALLDV 548

Query: 574 IFGDHNP 580
           + G  +P
Sbjct: 549 LVGKRSP 555


>gi|336435507|ref|ZP_08615222.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
           1_4_56FAA]
 gi|336000960|gb|EGN31106.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
           1_4_56FAA]
          Length = 717

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 226/554 (40%), Positives = 323/554 (58%), Gaps = 57/554 (10%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           ++A +LV ++TL EK SQL   APAIPRL IPAY WW+E+LHGVA  G           A
Sbjct: 12  KQAEELVDQMTLMEKASQLRYDAPAIPRLHIPAYNWWNESLHGVARGGT----------A 61

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
           T FPQ I  AASFD  +   IG+AI LE RA YNA   +       G+TFWAPN+NIFRD
Sbjct: 62  TVFPQAIGLAASFDREMLEEIGEAIALEGRAKYNAAVKLDDRDIYKGLTFWAPNVNIFRD 121

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP ++ +  VSY+RG+QGD          ++A+AC KHF  +   +  
Sbjct: 122 PRWGRGHETYGEDPYLSSRLGVSYIRGLQGDGET-------MKAAACAKHFAVH---SGP 171

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R++FDA V+ +DL +TY P F++CV++G    +M AYN VNG P C    LL K  R
Sbjct: 172 EALRHEFDAEVSEKDLRETYLPAFQACVQEGHVEAVMGAYNCVNGEPCCGSETLLKKILR 231

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
            +WGF G++ SDC A+   ++      +P  +    ++AG D+NCG ++L  H   A ++
Sbjct: 232 EEWGFDGHVVSDCWAIKDFHENHLVTGTPVQSAALAMEAGCDLNCGVTYL--HLVHACQE 289

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E++I  A   LF+ R  LG+F+G+     +  +   VV    H+ L+ +AA++ IV
Sbjct: 290 GLVTEAQITEAAIRLFTTRFLLGMFDGS----EYDSVPYTVVECKEHRDLSERAARESIV 345

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-- 463
           LLKN+ G+LPL + K  ++ +IGPNA+S K L+GNY G S   IT L+ ++  V + V  
Sbjct: 346 LLKNN-GILPLDREKLKTIGIIGPNADSRKALIGNYHGTSSEYITVLEGVRRLVGDEVRI 404

Query: 464 -YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
            Y  GC       + +A     + +A  +A+ +D V+L +GLD+T E EE          
Sbjct: 405 LYSDGCHLYENKTENLAREQDRLSEARIVARESDVVILCLGLDETLEGEEGDTGNSYASG 464

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+VDL LP  Q+ L+  VA   KKP +L L+ G  +D++FA+   +    LW  YPG  G
Sbjct: 465 DKVDLRLPKSQRMLMEAVA-MEKKPTVLCLMAGSDIDLSFAEKHFDAIVDLW--YPGAYG 521

Query: 567 AVALAEVIFGDHNP 580
             A A+++FG  +P
Sbjct: 522 GAAAADILFGKCSP 535


>gi|182415033|ref|YP_001820099.1| Beta-glucosidase [Opitutus terrae PB90-1]
 gi|177842247|gb|ACB76499.1| Beta-glucosidase [Opitutus terrae PB90-1]
          Length = 905

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 225/564 (39%), Positives = 325/564 (57%), Gaps = 57/564 (10%)

Query: 48  TTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN 107
           ++ P+  RA DL+ R++L EK+SQL N+AP IPRLG+PAY++W+EA HG+A  G      
Sbjct: 208 SSKPLRVRADDLIRRMSLAEKVSQLKNAAPGIPRLGLPAYDYWNEAAHGIANNGI----- 262

Query: 108 GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAIGMTFWA 158
                AT FPQ I  AA+++  L ++ G  IG+E RA +N         +    G+T+WA
Sbjct: 263 -----ATVFPQAIGAAAAWNPALLHQEGTVIGIEGRAKFNDYANRHNGDSKWWTGLTYWA 317

Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
           PNIN+FRDPRWGRGQET GEDP +T +  + +V+GVQGD          + A AC KH  
Sbjct: 318 PNINLFRDPRWGRGQETYGEDPFLTAEIGIEFVKGVQGDD------PRYMLAMACAKH-- 369

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
            Y + +    TR+ F+A +  +DL DTY P FE  V++G+ +G+M AYN VNG+P+ A+ 
Sbjct: 370 -YAVHSGPERTRHSFNAEIPERDLFDTYLPHFERVVREGKVAGVMSAYNAVNGVPASANS 428

Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAE--GYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
            LL++  R++WGF GY+ SDCDA+  IY  +   Y K+ E+A    +KAG ++ CG    
Sbjct: 429 FLLTELLRKRWGFEGYVPSDCDAIRDIYGEKQHHYVKTAEEAAALAVKAGCNLCCGGDYN 488

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVL 395
              + AV+Q  + E ++D AL++    R RLGLF  +P  Q PF          PAH  +
Sbjct: 489 ALVR-AVQQGLVTEKDLDGALYHTLWTRFRLGLF--DPAEQVPFSGYTLKDNDLPAHSQV 545

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           AL+ A+  IVLLKN  G LPL ++K   +A+IGPNA S   L GNY G + RSI+ L  +
Sbjct: 546 ALELARQAIVLLKND-GTLPLDRTKLKQIAVIGPNAASKSMLEGNYHGSASRSISILDDI 604

Query: 456 QNYVENTVYY-----------PGCDTVACSSASIDKAV--------DIAKGADHVVLMMG 496
           +N V + +             PG    +    + D+ V         +A  AD ++ + G
Sbjct: 605 RNLVGSEIKITHAMGSPVTTKPGTAPWSGQDNTTDRPVAELKAEALKLAAEADAIIYVGG 664

Query: 497 LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
           +   QE E  DR  + LP  Q++LI R   A  KPV++V   G  + +T+   D N+ +I
Sbjct: 665 ITPAQEGESFDRESIELPSEQEDLI-RALHATGKPVVMVNCSGSAMALTWQ--DENLPAI 721

Query: 557 LWAGYPGEAGAVALAEVIFGDHNP 580
           + A YPG+ G  A+AEV+FG+ NP
Sbjct: 722 VQAWYPGQEGGRAVAEVLFGETNP 745


>gi|266619450|ref|ZP_06112385.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
 gi|288869013|gb|EFD01312.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
          Length = 714

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 220/554 (39%), Positives = 315/554 (56%), Gaps = 54/554 (9%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            +R RDLVS++TL+EK+SQL   APA+ RLGIP+Y WW+EALHGVA  G           
Sbjct: 13  EERVRDLVSQMTLEEKVSQLRYDAPAVERLGIPSYNWWNEALHGVARAG----------A 62

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFR 165
           AT FPQ I  AA FD  L  +IG    LE RA Y+        G   G+TFW+PNINIFR
Sbjct: 63  ATVFPQAIGLAAMFDEALLEKIGDVTALEGRAKYHEAVRNGDRGLYKGITFWSPNINIFR 122

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +TG+   +Y++G+QG   NG +LK    A+AC KHF A+     
Sbjct: 123 DPRWGRGHETYGEDPCLTGRMGTAYIKGMQG---NGKRLK----AAACVKHFAAHSGPE- 174

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
           KG  R+ F++ V+ +DL +TY P FE CVK+    G+M  YNR+NG  +C   +L+++  
Sbjct: 175 KG--RHSFNSVVSKKDLTETYFPAFERCVKEAGVEGVMGGYNRLNGEAACGSHHLITEIL 232

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R +WGF GY  SDC A+   +   G   +P+++    LK+G D+NCG+ +  H  +A  Q
Sbjct: 233 REKWGFDGYYVSDCGAIKDFHMHHGLTDTPQESAALALKSGCDLNCGA-VYLHVMSAYNQ 291

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             +   +IDRA+ +L   RMRLG+F+ +     F +I  ++     H  LAL+AA++ +V
Sbjct: 292 GLVSAEDIDRAVTHLMMTRMRLGMFDQH---TEFDEIPYEINDCAEHHGLALKAAEESMV 348

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--- 462
           LLKN  G+LPL K+   ++A+IGPN +S + L GNY G +    T L+ ++  +      
Sbjct: 349 LLKND-GILPLDKTALKTVAVIGPNGDSEEILKGNYNGTATEKYTILEGIRAVLGKETRI 407

Query: 463 -------VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
                  +Y    + +A +   + +AV +A  +D V L +GL+ T E EE          
Sbjct: 408 FCSEGSHLYRDNVENLAEADDRLKEAVSMAVRSDVVFLCLGLNGTLEGEEGDANNSYAGA 467

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+ DL LP  Q  L+  V      PVIL+L  G  + I +A    +  +IL   YPG+ G
Sbjct: 468 DKADLNLPESQMRLLKAVC-GTGTPVILLLAAGSAMAINYAA--EHCSAILHIWYPGQMG 524

Query: 567 AVALAEVIFGDHNP 580
            +A A ++ G+  P
Sbjct: 525 GLAAARLLTGEAVP 538


>gi|242813865|ref|XP_002486253.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
 gi|218714592|gb|EED14015.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
          Length = 893

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 214/543 (39%), Positives = 307/543 (56%), Gaps = 17/543 (3%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P C T+L    RA+ LV  +T +EK+    N +P   RLG+PAY+WW+EALHGVAG   G
Sbjct: 163 PICDTSLDPLTRAKGLVDAMTFEEKVQNTQNGSPGAARLGLPAYQWWNEALHGVAG-SPG 221

Query: 104 IFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNI 161
           + F  +G    ATSFPQ IL +A+FD  L   +G  + +E RA  N G A G+ FW PNI
Sbjct: 222 VTFQPSGNFSYATSFPQPILMSAAFDDALIKEVGTVVSIEGRAFNNYGNA-GLDFWTPNI 280

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N FRDPRWGRGQETPGEDP    +Y  + V G+Q      G      +  A CKHF  YD
Sbjct: 281 NPFRDPRWGRGQETPGEDPYHIARYVYNLVDGLQN-----GIAPANPRVVATCKHFAGYD 335

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
           +++W+G +RY F+A ++ QDL++ Y PPF+SC +  +   IMC+YN VNGIP+CAD  LL
Sbjct: 336 IEDWEGNSRYGFNAIISTQDLSEYYLPPFKSCARDAQVDAIMCSYNAVNGIPTCADSYLL 395

Query: 282 SKTARRQWGFH---GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
               R  W ++    ++TSDCDAV  IY    Y  S   A  D L AG +++CG+ +  +
Sbjct: 396 DTILRDHWNWNQTGHWVTSDCDAVDNIYSDHRYTSSLAAAAADALNAGTNLDCGTTMSNN 455

Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
             AA  Q     + ++ AL  L+S  +RLG F+   +   +  +G   V + A Q LA +
Sbjct: 456 LAAAAAQDLFKNATLNSALVYLYSSLVRLGWFDSEDSQ--YSSLGWSDVGTTASQQLANR 513

Query: 399 AAQDGIVLLKNSH-GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           AA +GIVLLKN H  +LPL +    ++ALIGP AN+   L GNY G      T +   + 
Sbjct: 514 AAVEGIVLLKNDHKKVLPLSQHGQ-TIALIGPYANATTQLQGNYYGTPAYIRTLVWGAEQ 572

Query: 458 YVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
                 Y  G    +  ++    AV  AK AD V+   G+D + E E +DR  +   G Q
Sbjct: 573 MGYTVQYEAGTGINSTDTSGFAAAVAAAKTADIVIYAGGIDNSIEAEAMDRNTIAWTGNQ 632

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
            +LI ++++   KP++++   GG +D +    + N+ ++LW GYP + G  A+ +++ G 
Sbjct: 633 LQLIDQLSQVG-KPLVVLQFGGGQLDDSALLQNENVNALLWCGYPSQTGGQAVFDILTGQ 691

Query: 578 HNP 580
             P
Sbjct: 692 SAP 694


>gi|374372635|ref|ZP_09630297.1| Beta-glucosidase [Niabella soli DSM 19437]
 gi|373235166|gb|EHP54957.1| Beta-glucosidase [Niabella soli DSM 19437]
          Length = 734

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 217/565 (38%), Positives = 314/565 (55%), Gaps = 52/565 (9%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
            PF    L    R  DLVSRLTL+EK+ Q++N APAIPRLGIPAY+WWSE LHGVA    
Sbjct: 26  LPFWNYKLSFEARVNDLVSRLTLEEKVKQMLNHAPAIPRLGIPAYDWWSEVLHGVAR--- 82

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGM 154
                 T    T +PQ I  AA++D+   Y +      E RA++N          + +G+
Sbjct: 83  ------TPYHTTVYPQAIAMAATWDTVALYTMADQSAREGRAIHNKATEEGKNGDRYVGL 136

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           T+W PNINIFRDPRWGRGQET GEDP +T     ++VRG+QG+          L+A+AC 
Sbjct: 137 TYWTPNINIFRDPRWGRGQETYGEDPFLTAMLGRAFVRGLQGED------PKYLKAAACA 190

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KH   Y + +     R+ FD  V+  DL +TY P F+  V   + +G+MCAYN     P 
Sbjct: 191 KH---YAIHSGPEAVRHSFDVDVSDYDLWNTYLPAFKELVTHAKVAGVMCAYNAFRKKPC 247

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           C    L++   RRQWGF GY+TSDC A+   ++      + E A +D +  G DV CG+ 
Sbjct: 248 CGSDLLMTDILRRQWGFTGYVTSDCGAIDDFFNYHKTHPNAEAAAIDAVTNGTDVECGNR 307

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
                  AVK  ++ E EIDR++  LF +RMRLG+F+   +M  + +    V+ S  H+ 
Sbjct: 308 AYLTLTDAVKTGRIAEKEIDRSVKRLFMIRMRLGMFD-PVSMVSYAQTSPAVLESAPHKA 366

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
            AL+ AQ+ IVLLKN + LLPL KS    +A++GPNA+++  +LGNY G   + +T L  
Sbjct: 367 QALKMAQESIVLLKNENHLLPLSKSIK-KIAVVGPNADNSIAVLGNYNGTPSKIVTALDG 425

Query: 455 LQNYV--ENTVYYPGCDTVACSSASI-DKAVDIA------KGADHVVLMMGLDQTQEKEE 505
           ++  +    +V Y     V  ++A + +   D A      K AD ++ + G+    E EE
Sbjct: 426 IKAKLGTNGSVVYEKA--VNFTNAMLPEGKTDFAALTSRVKDADAIIFVGGISPQLEGEE 483

Query: 506 L----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
           +          DR  ++LP  Q E + +  +A  KPV+ V++ G  + I + +   NI +
Sbjct: 484 MKVNEPGFNSGDRTTILLPTVQTEAM-KALKATGKPVVFVMMTGSALAIPWEQ--ENIPA 540

Query: 556 ILWAGYPGEAGAVALAEVIFGDHNP 580
           I+ A Y G+A   A+A+V+FGD+NP
Sbjct: 541 IVNAWYGGQAAGTAIADVLFGDYNP 565


>gi|323447708|gb|EGB03620.1| hypothetical protein AURANDRAFT_72703 [Aureococcus anophagefferens]
          Length = 744

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 225/541 (41%), Positives = 294/541 (54%), Gaps = 77/541 (14%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
           N + E  PFC  TL I  RA D VSR+T+ EKI  L      I  LG+PAY WWSEA  G
Sbjct: 31  NATFEALPFCDATLAIDLRAADAVSRMTIPEKIDALDTKTGPIASLGLPAYNWWSEASSG 90

Query: 97  VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTF 156
           V G           R  T F   + TA SF+  LW   G AIG EARAL NAG A   T+
Sbjct: 91  VMGS----------RPTTKFAYPVTTAMSFNRTLWRATGAAIGREARALMNAGAAYS-TY 139

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG---DTFNGGKLKGKLQASAC 213
           WAP +N+ R+PRWGR  E PGEDP +TG+YA  +V G Q    D ++       LQASAC
Sbjct: 140 WAPVVNLAREPRWGRNIEVPGEDPYLTGEYATEFVGGFQAAPEDPYH-------LQASAC 192

Query: 214 CKHFTAYDLDNWKGTT-----RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           CKH+ A +L+N +        R   D+ VT +DL D+Y  PF++CV++G+ S +MC+YN 
Sbjct: 193 CKHYVANELENTRQPDGEQWDRQHVDSNVTQRDLVDSYMVPFQACVEKGKVSSLMCSYNA 252

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG+PSCA+  LL   AR  W F GYITSDCDA S +YDA  YA +PE+AV DVLKAG D
Sbjct: 253 VNGVPSCANDWLLRTVARDAWHFDGYITSDCDADSNVYDAHHYAATPEEAVADVLKAGTD 312

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG----LFNGNPTMQPFGKIGA 384
           V+C SF+ +H ++A+ +  + E+++D  L NLF VR+RLG     F+      P  +I A
Sbjct: 313 VDCQSFVGQHARSALDKGLITEADMDARLVNLFKVRLRLGHFDLSFDAAKPRGPLDEIDA 372

Query: 385 D-VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
           D VVCS AH   +++       LLKN  G LPL  S +   A++GPNA  +K   G Y G
Sbjct: 373 DAVVCSDAHLDASMEGLAQSATLLKND-GALPLKPSGTA--AVVGPNALLSKADAGYY-G 428

Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
           P+                                          AD VVL +G D T   
Sbjct: 429 PT----------------------------------------DAADAVVLAVGTDLTWAA 448

Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT--FAKYDRNIGSILWAGY 561
           E  D   +V    Q ELI  VA A+  PV++V+    P+D+T   A+ D  +G+++  G 
Sbjct: 449 EGKDATSIVFTAAQLELIDAVATASATPVVVVVFSATPLDLTPLLARSDGKVGAVVHVGQ 508

Query: 562 P 562
           P
Sbjct: 509 P 509


>gi|333379783|ref|ZP_08471502.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
           22836]
 gi|332884929|gb|EGK05184.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
           22836]
          Length = 737

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 213/562 (37%), Positives = 312/562 (55%), Gaps = 45/562 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           + +PF  T L I +R  DLVS+LTL+EK++Q++N+ PAI RL IPAY WW+E LHG+   
Sbjct: 23  QNYPFQNTNLSIDERVNDLVSKLTLEEKVAQMLNNTPAIERLNIPAYNWWNECLHGIGR- 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
                   T    T FPQ I  AA+++  L   +  AI  E RA+YN   +        G
Sbjct: 82  --------TDYKVTVFPQAIGMAAAWNKELMKEVASAISDEGRAIYNDATSKGNREIYYG 133

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +T+W PNINIFRDPRWGRGQET GEDP +TG    S+V G+QGD          L+A+AC
Sbjct: 134 LTYWTPNINIFRDPRWGRGQETYGEDPFLTGVLGKSFVAGLQGDD------TKYLKAAAC 187

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH   Y + +    TR+ F+  VT  DL DTY P F + V + + +G+MCAYN  NG P
Sbjct: 188 AKH---YAVHSGPENTRHTFNTFVTDYDLWDTYLPAFRNLVVEAKVAGVMCAYNAYNGEP 244

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
            C +  L+ +  R +W F GY+TSDC A+   Y         + A  D +  G D++CG+
Sbjct: 245 CCGNNFLMQEILREKWNFTGYVTSDCGAIDDFYQHHKTHPDAKYAAADAVYNGTDIDCGN 304

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
              K    AVK   + E +ID +L  LF++R RLG+F+    ++ + +I   V+ S  H+
Sbjct: 305 EAYKALVDAVKTGIITEKQIDISLKRLFTIRFRLGMFDPAENVK-YSQISTSVLESQKHK 363

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            LAL+  ++ IVLLKN +  LPL K K   +A++GPNAN+  ++LGNY G     +TP +
Sbjct: 364 DLALKITRESIVLLKNENNTLPLSK-KLKKVAVVGPNANNEVSVLGNYNGFPTEIVTPYE 422

Query: 454 ALQNYVE--NTVYYPGCDTVACSSASIDKA---VDIAKGADHVVLMMGLDQTQEKEEL-- 506
           A++  ++    +Y  G D V  S+ S ++    V   K  D V+ + G+    E EE+  
Sbjct: 423 AVKQKLKGAEVIYEKGIDFVTPSTNSKEEVSALVKRLKDVDVVIFVGGISPELEGEEMPV 482

Query: 507 --------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
                   DR  + LP  Q + +  +  A K P + V++ G  +   +    +NI +I+ 
Sbjct: 483 KIEGFTGGDRTSIKLPKIQTDFMKALV-AEKIPTVFVMMTGSAIATEWE--SQNIPAIVN 539

Query: 559 AGYPGEAGAVALAEVIFGDHNP 580
           A Y G+    A+A+V+FGD+NP
Sbjct: 540 AWYGGQDAGTAIADVLFGDYNP 561


>gi|322512556|gb|ADX05682.1| putative carbohydrate-active enzyme [uncultured organism]
          Length = 717

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 221/558 (39%), Positives = 322/558 (57%), Gaps = 50/558 (8%)

Query: 49  TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNG 108
           T    +RA+ LV  +TL+EK+ Q + +APAI RLG+PAY +W+EALHGVA  G       
Sbjct: 10  TKTFEERAQALVCEMTLEEKVFQTLFNAPAIERLGVPAYNYWNEALHGVARAGV------ 63

Query: 109 TIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNI 161
               AT FPQ I  AASFD  L  ++   I  EARA +N  Q  G       +TFW+PN+
Sbjct: 64  ----ATVFPQAIGLAASFDEELLGQVADTISTEARAKFNMQQKFGDRDIYKGLTFWSPNV 119

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           NIFRDPRWGRG ET GEDP ++G+  VS++RG+QGD       +  ++ +AC KHF  + 
Sbjct: 120 NIFRDPRWGRGHETFGEDPFLSGRLGVSFIRGMQGDD------ERYMKVAACAKHFAVH- 172

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             +     R+ F+A V+ QDL +TY P F +CV +     +M AYNR NG   C  + LL
Sbjct: 173 --SGPEDQRHSFNAVVSEQDLRETYLPAFHACVTEAGVEAVMGAYNRTNGEACCGSKKLL 230

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
               R +WGF G++TSDC A+   ++     K+ E+ V   + +G D+NCG+ L  H   
Sbjct: 231 VDILRGEWGFRGHVTSDCWALKDFHEFHMVTKNQEETVALAMNSGCDLNCGN-LYVHLLQ 289

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           AV+   + ES IDRA+  LF+ RM+LGLF+ +  + P+  IG D V + A++ L  +A++
Sbjct: 290 AVRDGLVEESVIDRAVTRLFTTRMKLGLFDRSEEV-PYNGIGYDRVDTEANRKLNREASR 348

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
             + LLKN+ GLLPL  SK  ++ ++GPNA++ K L+GNY G +   +T L  ++    +
Sbjct: 349 RTVCLLKNADGLLPLDISKLRTIGVVGPNADNRKALVGNYEGTASEYVTVLDGIRELAGD 408

Query: 462 ---TVYYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE------ 505
               VY  GC         +   +  I +A  +A+ +D V+ +MGLD   E EE      
Sbjct: 409 DVRVVYSEGCHLFRDRVQGLGQPNDRIAEARAVAELSDVVIAVMGLDPGLEGEEGDQGNE 468

Query: 506 ---LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
               D+ +L LPG Q E++  + E+  KPV+LVLL G  + I +A  + ++ +IL A YP
Sbjct: 469 FASGDKPNLELPGLQGEVLKALVESG-KPVVLVLLGGSALAIPWA--EEHVPAILDAWYP 525

Query: 563 GEAGAVALAEVIFGDHNP 580
           G  G  A+A+V+FG   P
Sbjct: 526 GAQGGRAVADVLFGRACP 543


>gi|255690205|ref|ZP_05413880.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
 gi|260624224|gb|EEX47095.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            finegoldii DSM 17565]
          Length = 1425

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 230/593 (38%), Positives = 317/593 (53%), Gaps = 48/593 (8%)

Query: 12   LLCLCFTSLLTRVDSTQPPFSCDPSNPST--ETFPFCKTTLPISQRARDLVSRLTLDEKI 69
            L  +C   L    D T        S P T  + +PF    L I QR  DLVSRLTL+EK+
Sbjct: 679  LRSICVDKLYHNPDGTIKLVKQTKSTPITVQKRYPFRNPQLSIEQRVDDLVSRLTLEEKV 738

Query: 70   SQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
             Q++N+APAI RLGIPAY WW+E LHGV           T    T FPQ I  AAS++  
Sbjct: 739  RQMLNNAPAIKRLGIPAYNWWNECLHGVGR---------TKYHVTVFPQAIGMAASWNDV 789

Query: 130  LWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNINIFRDPRWGRGQETPGEDPLV 182
            L   +  +I  E RA+YN  Q  G       +T+W PNINIFRDPRWGRGQET GEDP +
Sbjct: 790  LMKEVASSIADEGRAIYNDAQKRGDYSQYHALTYWTPNINIFRDPRWGRGQETYGEDPYL 849

Query: 183  TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            T K   ++V G+QGD          L+ASAC KH+  +         R+ F++ V+  DL
Sbjct: 850  TSKIGKAFVLGLQGDD------PRYLKASACAKHYAVHSGPE---KNRHSFNSDVSTYDL 900

Query: 243  ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
             DTY P F + V     SG+MCAYN   G P C +  L+    R +W F GY+TSDC A+
Sbjct: 901  WDTYLPAFRTLVVDANVSGVMCAYNAFKGQPCCGNDLLMQSILRDKWNFKGYVTSDCGAI 960

Query: 303  SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
              I++          A  D +  G D++CG         AVK   + E ++D ++  LF+
Sbjct: 961  DDIFNHHKAHPDAATAAADAVFHGTDLDCGQSAYLALVKAVKNGIITEKQLDVSVKRLFT 1020

Query: 363  VRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS 421
            +R RLGLF  +P  Q  +  I   V+    HQ LA Q A++ +VLLKN   LLPL K+K 
Sbjct: 1021 IRFRLGLF--DPAEQVDYAHIPISVLECKKHQDLAKQLARESMVLLKNDR-LLPLQKNKL 1077

Query: 422  VSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY---VENTVYYPGCDTV-ACSSAS 477
              + ++GPNA+    LLGNY G   R +TPLQA++     V   VY  G D +   S   
Sbjct: 1078 KKVVVMGPNADCKDALLGNYNGHPSRMLTPLQAIRERLKGVAEVVYVSGIDYINTVSEDE 1137

Query: 478  IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
            + + V+ AKGAD V+ + G+    E EE+          DR  + LP  Q +L+  +  A
Sbjct: 1138 LKRYVNQAKGADAVIFIGGISPRLEGEEMSVNKDGFDGGDRTSIALPTVQTQLMKALV-A 1196

Query: 528  AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             + P + V++ G  + I +    +++ +IL A Y G+ G  A+A+V+FGD+NP
Sbjct: 1197 GRIPTVFVMMTGSALAIPWEA--KHVPAILNAWYGGQYGGEAIADVLFGDYNP 1247


>gi|326791674|ref|YP_004309495.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
 gi|326542438|gb|ADZ84297.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
          Length = 696

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 222/554 (40%), Positives = 311/554 (56%), Gaps = 58/554 (10%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           ++A+ LV+ +TL+E+ SQL   +PAI RLG+PAY WW+EALHGVA  G           A
Sbjct: 8   KKAKALVAEMTLEERASQLKYDSPAIKRLGVPAYNWWNEALHGVARAGV----------A 57

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
           TSFPQ I  AA+FD  L  R+ + I  E RA YNA           G+TFW+PN+NIFRD
Sbjct: 58  TSFPQAIGMAATFDDELLKRVAEVIAEEGRAKYNAYSQEGDRDIYKGLTFWSPNVNIFRD 117

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T +  V++V+G+QG+          L+ +AC KHF  +   +  
Sbjct: 118 PRWGRGHETYGEDPYLTSRLGVAFVKGLQGEE--------GLKTAACAKHFAVH---SGP 166

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R+ FDARV+ +DL +TY P FE+ VK+     +M AYNR NG P C    L+    R
Sbjct: 167 EADRHHFDARVSQKDLWETYLPAFEALVKEAEVESVMGAYNRTNGEPCCGSPTLMKDILR 226

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
            +WGF G+  SDC A+   ++      + +++    LK+G D+NCG ++L  H   A + 
Sbjct: 227 EKWGFQGHYVSDCWAIKDFHEHHMVTSTAQESAALALKSGCDLNCGNTYL--HILMAYQN 284

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E EI  A   LF+ R  LGLF+G+     +  I  +VV S  H  +A +A    IV
Sbjct: 285 GLVTEEEITTAAERLFTTRYLLGLFDGS----TYDAIPYEVVESKPHLSVADEATAKSIV 340

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-- 463
           LLKN+ GLLPL K    ++ +IGPNANS K L+GNY G S + IT L+ LQ  V + V  
Sbjct: 341 LLKNN-GLLPLNKESIKTIGVIGPNANSRKALIGNYHGTSSQYITILEGLQKEVGDEVRI 399

Query: 464 --------YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
                   Y    + +A     + +A  +AK +D V++ +GLD+T E EE          
Sbjct: 400 LYSEGSHLYADRVEPLAYQRDRLSEAKIVAKHSDVVIVCVGLDETLEGEEGDTGNAYASG 459

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+ DL LP  QQEL+  +A+   KPVIL L  G  +D+ +A  D +  ++L A YPG  G
Sbjct: 460 DKRDLALPEPQQELVEAMAKMG-KPVILCLSAGSAIDLQYA--DAHYDAVLQAWYPGARG 516

Query: 567 AVALAEVIFGDHNP 580
              +A+ + G+  P
Sbjct: 517 GQVIAKALLGEIVP 530


>gi|367046937|ref|XP_003653848.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
 gi|347001111|gb|AEO67512.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
          Length = 923

 Score =  374 bits (959), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 230/548 (41%), Positives = 310/548 (56%), Gaps = 30/548 (5%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P C T+LPI+ R R LV +LTL EKI+ LV+ A    R+G+P YEWWSEALHGVA    G
Sbjct: 163 PACNTSLPIADRVRWLVGQLTLQEKITNLVDGASGSARVGLPPYEWWSEALHGVA-ASPG 221

Query: 104 IFF---NGT-IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAP 159
           + F   NGT    ATSFP  I  +A+FD  L  +I   +G E RA  N G + G  FW P
Sbjct: 222 VTFAGPNGTAFSYATSFPMPITISAAFDDDLVSQIAAVVGREGRAFANHGLS-GFDFWTP 280

Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFT 218
           NIN FRDPRWGRG ETPGED     +Y    + G+QG D  +        Q  A CKH+ 
Sbjct: 281 NINPFRDPRWGRGPETPGEDAFRIQQYIRHLIPGLQGSDPLDK-------QIIATCKHYA 333

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
            YD++    T RY++D      DLA+ Y  PF++CV+      +MC+YN V+GIP+CA  
Sbjct: 334 VYDVE----TGRYEYDYDPQPHDLAEYYLAPFKTCVRDVGIGSVMCSYNAVDGIPACASE 389

Query: 279 NLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-F 334
            LL    R  WGF   + Y+ SDCDAV  IY    +  SP  A    L AG D+ CGS +
Sbjct: 390 YLLQSVLRDHWGFTEPYQYVVSDCDAVRFIYSPHNFTDSPAAAAAVALNAGTDLECGSTY 449

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
           L  +   A       E+ +DRAL  L++    +G F+G+     +G +G D V +   QV
Sbjct: 450 LNLNQSLASNMTT--EAALDRALTRLYTALHTIGFFDGSAR---YGGLGWDAVGTGDAQV 504

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LA QAA DG VLLKN   LLPL   +   LA+IGP AN+   + GNY G +   ++PL A
Sbjct: 505 LAYQAAVDGAVLLKNEKSLLPLDSKRLRKLAVIGPWANATTQMQGNYFGQAAYLVSPLAA 564

Query: 455 LQNY--VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
            Q+    +N ++  G      S+A    A+  AK AD VV + G+D + E E LDR  + 
Sbjct: 565 FQSAWGADNVLFANGTGIAGNSTAGFAAALAAAKAADAVVFLGGVDNSVESESLDRTAIS 624

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
            PG Q +LI ++A A  KP+++V   GG +D +    +  +G++LWAGYPG+AG  A+A+
Sbjct: 625 WPGNQLDLIAQLA-AVGKPLVVVQCGGGQLDDSALLANPRVGALLWAGYPGQAGGAAIAD 683

Query: 573 VIFGDHNP 580
           ++ G   P
Sbjct: 684 LLTGKQAP 691


>gi|376259588|ref|YP_005146308.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
 gi|373943582|gb|AEY64503.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
          Length = 712

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 227/564 (40%), Positives = 317/564 (56%), Gaps = 56/564 (9%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +L   +RA DLVSR+TL+EK SQL   A  + RLGIP Y WW+EALHGVA  G   
Sbjct: 6   YLDKSLSFKERAADLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I  AA FD     +I   I  E RA YN     G       +TFW
Sbjct: 64  --------ATVFPQAIGMAAIFDDEFLEKIADVIATEGRAKYNENAKKGDRDIYKGITFW 115

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           +PN+NIFRDPRWGRG ET GEDP +T +  V++V+G+QGD    GK    L+ +AC KHF
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD----GKY---LKTAACAKHF 168

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +     R+ FDA V+ +DL +TY P FE+ VK+ +   +M AYNR NG P    
Sbjct: 169 AVH---SGPEDDRHHFDAVVSQKDLYETYLPAFEALVKEAKVESVMGAYNRTNGEPCNGS 225

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
           + LL    R  WGF G++ SDC A+   ++  G  K+P ++V   LK+G D+NCG+ +  
Sbjct: 226 KTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKSGCDLNCGN-MYL 284

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
               A+K+ ++ E +IDRA   L + RMRLG+F+ +     F KI  ++  S  H  L+L
Sbjct: 285 LILLALKEGRITEEDIDRAAIRLMTTRMRLGMFDDDCE---FDKIPYELNDSVEHNKLSL 341

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           +AA+  +VLLKN  GLLPL   K  ++A+IGPNA+S+  L  NY+G   ++IT L  ++ 
Sbjct: 342 EAAKKSMVLLKND-GLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSQNITILDGIRK 400

Query: 458 YV-ENT-VYYP--------GCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL- 506
            V E+T V+Y           + +A     + +AV +A+ +D VVL +GLD + E E+  
Sbjct: 401 RVSEDTRVWYSVGSHLFMNREEDLAQPDDRLKEAVSVAERSDVVVLCLGLDASVEGEQND 460

Query: 507 ----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
                     D+ DL LP  Q+ L+  V  A  KP I+ LL G  + I  A       +I
Sbjct: 461 QGTVILDAGGDKADLNLPESQRNLLNAVL-ATGKPTIVALLSGSALSIGDAA--DKAAAI 517

Query: 557 LWAGYPGEAGAVALAEVIFGDHNP 580
           +   YPG  G +A AE+IFGD++P
Sbjct: 518 VQCWYPGSRGGLAFAEMIFGDYSP 541


>gi|386347261|ref|YP_006045510.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6578]
 gi|339412228|gb|AEJ61793.1| glycoside hydrolase family 3 domain protein [Spirochaeta
           thermophila DSM 6578]
          Length = 693

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 220/547 (40%), Positives = 312/547 (57%), Gaps = 47/547 (8%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
           ++R   L+S+++++EK   +++ A  IPRLGIP Y WW+EALHGVA  G+          
Sbjct: 4   NERMTSLLSKMSIEEKAGLMLHRAKGIPRLGIPHYNWWNEALHGVANSGE---------- 53

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFR 165
           AT FPQ I  AA+FD  L  R+ +AI  EARA +NA           G+TFW+PNINI+R
Sbjct: 54  ATVFPQAIGLAATFDPDLVRRVAEAISTEARAKFNAIGKERAAEYERGLTFWSPNINIYR 113

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRGQET GEDP +T K  VS+V+G+QGD          ++ +AC KH+  +     
Sbjct: 114 DPRWGRGQETYGEDPFLTSKIGVSFVKGLQGDH------PYYMRVAACAKHYAVHSGPE- 166

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FDARV+ +DL +TY P FE+ VK G    +M AYNRVNG P+C  + LL +  
Sbjct: 167 --GLRHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGSKRLLDEIL 223

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R++WGF G++ SDC A++  +      K P +++   L+AG D+NCG+  + H   AVK 
Sbjct: 224 RKRWGFKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAGCDLNCGNTYE-HLLDAVKA 282

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E  +DR++  L S   RLGLF  +    P+ ++    +   AH+ LA +AA+  +V
Sbjct: 283 GVVSEELVDRSVARLLSTLDRLGLFTDD---HPYARLSLSDIDWEAHRALAREAAEKSVV 339

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NT 462
           LLKN+ G+LP  + K   + + GPNA +   LLGNYAG S R +T L+ +  Y       
Sbjct: 340 LLKNN-GILPFDRQKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITV 398

Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV---------DLVL 513
            Y  GC         ID A  +A+ AD  V +MG D T E EE D +         DL L
Sbjct: 399 TYKIGCPLQGNKINPIDWASGVARYADVTVAVMGRDSTVEGEEGDAIFSDNYGDLSDLDL 458

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           P  Q E + R+ E   KP+++VLL G PV     + +    +I++A YPGE G  A+A V
Sbjct: 459 PREQIEYLRRIKEIG-KPLVVVLLSGAPV--CSPELEELADAIVYAWYPGEEGGNAIARV 515

Query: 574 IFGDHNP 580
           +FG+ +P
Sbjct: 516 LFGEISP 522


>gi|348604625|dbj|BAK96214.1| beta-xylosidase [Acremonium cellulolyticus]
          Length = 797

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 208/543 (38%), Positives = 313/543 (57%), Gaps = 25/543 (4%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+     RA  L++  TL+E I+   N+AP +PRLG+P Y+ WSEALHG+        
Sbjct: 64  CDTSANYVDRAEGLIALFTLEELINNTQNTAPGVPRLGLPPYQVWSEALHGLDRA----- 118

Query: 106 FNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPN 160
            N    G     ATSFP  IL+ A+ +  L  +I   IG +ARA  NAG+  G+  +APN
Sbjct: 119 -NFATSGDEWTWATSFPMPILSMAALNRTLINQIAGIIGTQARAFNNAGR-YGLDAYAPN 176

Query: 161 INIFRDPRWGRGQETPGEDP-LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
           IN FR P WGRGQETPGED   ++  YA  Y+ G+QG     G     L+  A  KHF  
Sbjct: 177 INGFRSPLWGRGQETPGEDANFLSSSYAYEYITGLQG-----GVDPDHLKVVATAKHFAG 231

Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
           YDL+NW G +R  FDA +T QDLA+ Y P F +  +  +A   MC+YN VNG+PSC+   
Sbjct: 232 YDLENWGGNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSF 291

Query: 280 LLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
           LL    R  W F  +GY++SDCDAV  +++  GYA +   A  D L+AG D++CG     
Sbjct: 292 LLQTLLRDNWDFPEYGYVSSDCDAVYNVFNPHGYASNQSAAAADSLRAGTDIDCGQTYPW 351

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
           +   +  +  +   EI+R++  L+S  ++LG F+G+ +   + ++G + V +     ++ 
Sbjct: 352 NLNQSFIEGSVTRGEIERSIVRLYSNLVKLGYFDGDKS--EYRQLGWNDVVTTDAWNISY 409

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           +AA +GIVLLKN  G+LPL K    S+ALIGP AN+ + L GNY G +   ITPLQ   +
Sbjct: 410 EAAVEGIVLLKND-GILPLSKHVK-SIALIGPWANATEQLQGNYYGTAPYLITPLQGASD 467

Query: 458 YVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
                 Y  G + +  ++     A+  AK +D +V + G+D T E E  DR+++  PG Q
Sbjct: 468 AGYKVNYALGTNILGNTTEGFADALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWPGNQ 527

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
            +LI ++++   KP++++ + GG VD +  K +  + +++W GYPG++G  A+ +++ G 
Sbjct: 528 LDLIQQLSQTG-KPLVVLQMGGGQVDSSSIKANSKVNALVWGGYPGQSGGTAIFDILSGK 586

Query: 578 HNP 580
             P
Sbjct: 587 RVP 589


>gi|212531051|ref|XP_002145682.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
 gi|210071046|gb|EEA25135.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
          Length = 799

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 207/540 (38%), Positives = 309/540 (57%), Gaps = 19/540 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+     RA  L++  TL+E I+   NS P +PRLG+P YE WSE LHG+    +  F
Sbjct: 64  CDTSANYVDRAEGLIALFTLEELINNTQNSGPGVPRLGLPPYEVWSEGLHGL---DRAHF 120

Query: 106 FNG--TIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
                    ATSFP  IL+ A+ +  L  +I   I  +ARA  N G+  G+  +APNIN 
Sbjct: 121 VKSGDEWTWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGR-YGLDAYAPNING 179

Query: 164 FRDPRWGRGQETPGEDP-LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
           FR P WGRGQETPGED   +T  YA  Y+ G+QG     G     L+ +A  KHF  YDL
Sbjct: 180 FRSPLWGRGQETPGEDANFLTSSYAYEYITGLQG-----GIDPDNLKIAATAKHFAGYDL 234

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           +NW G +R  FDAR+T QDLA+ Y P F +  +  +A   MC+YN VN IPSC+   LL 
Sbjct: 235 ENWGGNSRLGFDARITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNAIPSCSSSFLLQ 294

Query: 283 KTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
              R QW F  +GY++SDCDAV  +++  GYA +   A  + L+AG D++CG     H  
Sbjct: 295 TLLREQWDFPEYGYVSSDCDAVYNVFNPHGYASNQSSAAAESLRAGTDIDCGQTYSWHLN 354

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            +  +  +   EI+R++  L+S  ++LG F+G+     + ++G + V +     ++ +AA
Sbjct: 355 QSFIEGSVTRGEIERSILRLYSNLVKLGYFDGD--KNEYRQLGWNDVVTTDAWNISYEAA 412

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
            +GIVLLKN  G+LPL K+   S+AL+GP AN+ K L GNY G +   ITPLQ   +   
Sbjct: 413 VEGIVLLKN-DGVLPLSKNVK-SVALVGPWANATKQLQGNYFGTAPYLITPLQGASDAGY 470

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
              Y  G +    ++     A+  AK +D +V + G+D T E E  DR+++  P  Q +L
Sbjct: 471 KVNYALGTNISGNTTDGFANALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWPRNQLDL 530

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I ++++   KP++++ + GG VD +  K +  + +++W GYPG++G  A+ +++ G   P
Sbjct: 531 IQQLSQTG-KPLVVLQMGGGQVDSSSIKSNSKVNALIWGGYPGQSGGKAIFDILKGKRAP 589


>gi|346225847|ref|ZP_08846989.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
 gi|346227016|ref|ZP_08848158.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
          Length = 718

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 222/569 (39%), Positives = 312/569 (54%), Gaps = 44/569 (7%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
           F+   S  + E   F    + + +RA  +V +LT++EKI+QL+N+APA+ RL IP Y+WW
Sbjct: 3   FTFSFSLKAQEDCSFRNPDISLDERAECIVKQLTVEEKINQLMNAAPAVDRLEIPEYDWW 62

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
           +E LHGVA  G+          AT FPQ I  AA++D+ L YR+G AI  EARA YN   
Sbjct: 63  NECLHGVARAGR----------ATVFPQAIGMAATWDTTLVYRVGDAISTEARAKYNVFS 112

Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
                GQ  G+TFW PN+NIFRDPRWGRGQET GEDP +T +  VS+V+G+QG+      
Sbjct: 113 KHGYRGQYKGLTFWTPNVNIFRDPRWGRGQETYGEDPFLTSRIGVSFVKGLQGNH----- 167

Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
               L+ +A  KH   Y + N     R++FDA+V+M+DL +TY P FE+ VK+    G+M
Sbjct: 168 -PKYLKVAALAKH---YAVHNGPEALRHEFDAKVSMKDLWETYLPAFEALVKEAGVEGVM 223

Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
            AYNR NG P CA   L+ +  R +WGF GY  SDC A+   Y       +PE+A    L
Sbjct: 224 GAYNRTNGDPCCAHPYLMQEVLREKWGFDGYYVSDCGAIMDFYTGHKIVDTPEEAAAMAL 283

Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
            AG ++NCG       K ++++    E EIDR++  LF  R+RLGLF     + P+  I 
Sbjct: 284 NAGCNLNCGDTYASLLK-SLEKGLTTEEEIDRSVKQLFKTRLRLGLFAPEGAV-PYDTIS 341

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
            DV+ S  HQ LAL+AA+  +VLLKN    LP+ +     + + GP A   + LL NY G
Sbjct: 342 TDVIRSKEHQKLALEAARKSVVLLKNEANTLPVARDVK-KVYVTGPTATHVQALLANYYG 400

Query: 444 PSCRSITPLQALQNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQT 500
            S    T L+ +   V    +  Y  G      +  ++D     A  AD  V  +G+ Q 
Sbjct: 401 VSEDMTTILEGIVGKVSPQTSVQYRQGALLYEANRNTMDWFSGAAASADVTVACLGISQL 460

Query: 501 QEKEE---------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
            E EE          DR    LP  Q + + R+  +AKK +++V+  G  + +    YD 
Sbjct: 461 IEGEEGEAIASEHRGDRERTRLPQNQIDFLKRIRASAKK-LVVVITSGSAISLP-EIYDM 518

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              ++L+  YPGE G  A+A+V+FGD  P
Sbjct: 519 -ADALLYVWYPGEQGGKAVADVLFGDAVP 546


>gi|429850127|gb|ELA25427.1| glycoside hydrolase family 3 protein [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 918

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 209/540 (38%), Positives = 314/540 (58%), Gaps = 17/540 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA-GVGKGI 104
           C  +L   QRA  LV+ LT+ EK+  LVN AP IPRL +P YEWWSE LHGVA   G   
Sbjct: 170 CDESLSDKQRAAALVAELTIWEKLDNLVNEAPGIPRLRVPPYEWWSEGLHGVARSPGTKF 229

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
              G    ATSFPQ IL  ++FD  L   +G+ +  EARA  NAG++ G+  ++PNIN F
Sbjct: 230 TSKGNFSYATSFPQPILLGSAFDDELVRAVGEVVSREARAFSNAGRS-GLDLYSPNINAF 288

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           +DPRWGRGQETPGED     KY  + + G++GD  +   +       A CKH+ A D +N
Sbjct: 289 KDPRWGRGQETPGEDTFHLQKYVSAMLSGLEGDDPDKKLI-------ATCKHYAANDFEN 341

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           +KG  R  F+A ++ QDL++ Y PPF++C  +      MC+YN +NG P CA+  L+   
Sbjct: 342 YKGVDRSGFNAVISTQDLSEYYLPPFKTCAVEKNVGSFMCSYNGINGTPLCANSYLIEDI 401

Query: 285 ARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL-QKHTK 340
            R+ WG++G   Y+++DCD V+++     YA     A    ++AG D+ C +F   +  +
Sbjct: 402 LRKHWGWNGDGQYVSTDCDCVALMVSYHHYAPDLGHAAAWSMQAGTDLECNAFPGSEALQ 461

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
           +A  Q  + E ++D+AL  +++  + +GLF+ +    P   +G D V +   Q LA +AA
Sbjct: 462 SAWNQSLISEKDVDKALTRMYTSLVSVGLFDLD-RKDPLRSLGWDEVNTKEAQDLAYRAA 520

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
            +G VL+KN  G+LPL    S   ALIGP  ++   + GNY GP+   I+P +A ++   
Sbjct: 521 VEGAVLMKND-GILPLSPDSSKKYALIGPWVSATTQMQGNYFGPAPYLISPRKAAKDLGL 579

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
           +  Y+ G  T   S +S  +A+  A+ AD V+ M G+D T E+E LDR  L  P  Q +L
Sbjct: 580 DFTYFLGSRTNK-SDSSFAQAIKAAQAADVVIFMGGVDNTLEQETLDRNTLAWPEPQLQL 638

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  ++E   KP++++   GG VD T    + ++ +ILW GYPG++G  A+ +++FG   P
Sbjct: 639 LRALSEVG-KPLVVLQFGGGQVDDTELLANDSVNAILWGGYPGQSGGKAILDIVFGRAAP 697


>gi|242786966|ref|XP_002480909.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
 gi|218721056|gb|EED20475.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
          Length = 757

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 215/532 (40%), Positives = 302/532 (56%), Gaps = 27/532 (5%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNG---T 109
           +QR + L+  LTL+EKI  LV+++    RLG+P+YEWW+EA HGV G   G+ F      
Sbjct: 23  TQRVKSLIDSLTLEEKILNLVDASAGSERLGLPSYEWWNEATHGV-GSAPGVQFTEKPVN 81

Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRW 169
              ATSFP  ILTAASFD  L   I   IG E RA  N G + G  FWAPNIN FRDPRW
Sbjct: 82  FSYATSFPAPILTAASFDDALVREIASVIGREGRAFGNNGFS-GFDFWAPNINPFRDPRW 140

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
           GRGQETPGED  V   Y  +++ G+QGD           Q  A CKH+ AYDL+    T 
Sbjct: 141 GRGQETPGEDSFVVQSYIRNFIPGLQGDD------PEDKQVIATCKHYAAYDLE----TG 190

Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
           RY  D   T QDLAD +  PF++CV+      IMCAYN V+GIP+CA   LL +  R+ W
Sbjct: 191 RYGNDYNPTQQDLADYFLAPFKTCVRDTGVGSIMCAYNAVDGIPTCASEYLLDQVLRKHW 250

Query: 290 GF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
            F   + Y+ SDC AV+ I+    +  + E A    L AG+D+ CG S+L+ +   A  Q
Sbjct: 251 NFTADYNYVVSDCGAVTDIWQYHNFTDTEEAAASVSLNAGVDLECGSSYLKLNESLAANQ 310

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             +    +D+AL  L+S    +G F+G      +  +G   V +P  Q LA +AA +G+ 
Sbjct: 311 TTV--QALDQALTRLYSALFTVGFFDGG----KYTALGFADVSTPEAQSLAYEAAVEGMT 364

Query: 406 LLKNSHGLLPLPKS-KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
           LLKN   LLP+  S K  S+ALIGP AN+   + G+Y+G     I+PL+A + +     Y
Sbjct: 365 LLKNDKRLLPIRSSHKYKSVALIGPFANATTQMQGDYSGIPPFLISPLEAFKGHDWEVNY 424

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
             G      ++     A+  A+ +D V+ + G+D + E E LDR  L  PG Q +L+T++
Sbjct: 425 AMGTGINNQTTTGFASALAAAEKSDLVIYLGGIDNSIEAETLDRTSLTWPGNQLDLVTQL 484

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           ++   KP+I+V   GG +D +    +  + +++WAGYP ++G  AL +V+ G
Sbjct: 485 SK-LHKPLIVVQFGGGQLDDSALLQNEGVQALVWAGYPSQSGGSALLDVLLG 535


>gi|402308386|ref|ZP_10827395.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400375830|gb|EJP28725.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 721

 Score =  370 bits (950), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 214/547 (39%), Positives = 310/547 (56%), Gaps = 44/547 (8%)

Query: 52  ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
           +S+ A+++++R+T+ EKISQL+N +PAI  LGI  Y+WWSE LHGV   G+         
Sbjct: 30  MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR--------- 80

Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIF 164
            AT FPQ I   A+FD  L   IG A+  E RA +N  Q +       G+TFW+PN+NIF
Sbjct: 81  -ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVAQKLKNYSRNAGLTFWSPNVNIF 139

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDPL++G    +YVRG+QGD          L+  AC KH   Y + +
Sbjct: 140 RDPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDD------AFYLKTGACAKH---YAVHS 190

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
               TR++ D   + +DL +TY P F+  V+QGR   +M AYNRV G P    + LL+  
Sbjct: 191 GPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDI 250

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R+ WGF+G+I SDCDA++  Y    Y K+PE+A    +KAG++V CG    K  + A+ 
Sbjct: 251 LRKSWGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECGHTF-KAMQGALD 309

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           Q  L E+++DRAL  L   R++LG+   +    P+       +CSPAH  LAL+AA + +
Sbjct: 310 QGLLAEADLDRALFPLVMTRLKLGILEPDSAC-PYNSYDESEICSPAHTALALRAADEAM 368

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV- 463
           VLLKN +G+LPL K+   +L + GP A+ A  L+GNY G S R  T LQ + + V +   
Sbjct: 369 VLLKN-NGILPLDKNIR-TLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSGTS 426

Query: 464 --YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDLV 512
             + P    +      ++ AV+ A  A+  +++MG +   E EE          DRV + 
Sbjct: 427 VNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGIG 486

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           LP  Q   + RV       +++VL  G P+D+   K  +   +++ A YPG+ G  AL +
Sbjct: 487 LPASQLNYLRRVKARKGGRIVVVLTGGSPIDLR--KISKLADAVVMAWYPGQEGGEALGD 544

Query: 573 VIFGDHN 579
           ++FGD N
Sbjct: 545 LLFGDKN 551


>gi|380696433|ref|ZP_09861292.1| glycoside hydrolase [Bacteroides faecis MAJ27]
          Length = 739

 Score =  370 bits (950), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 216/563 (38%), Positives = 312/563 (55%), Gaps = 44/563 (7%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           + E FPF    LP+ QR  DLVSRLTL+EK+ Q++NS P + RLGIPAY WW+E LHG+ 
Sbjct: 21  AQEKFPFRDPQLPVEQRVEDLVSRLTLEEKVKQMLNSTPPVERLGIPAYNWWNECLHGIG 80

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQA 151
                     T    T FPQ I  AA+++  L   +  +I  E RA+YN         Q 
Sbjct: 81  R---------TKYHVTVFPQAIGMAAAWNDALIKEVASSIADEGRAIYNDTQRKEDYSQY 131

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
             +T+W PNINIFRDPRWGRGQET GEDP +T +   ++V+G+QGD          L+AS
Sbjct: 132 HALTYWTPNINIFRDPRWGRGQETYGEDPYLTARIGEAFVQGLQGDN------PRYLKAS 185

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           AC KH+  +   +     R+ F++ V+  DL DTY P F + V   + SG+MCAYN   G
Sbjct: 186 ACAKHYAVH---SGPEKNRHSFNSDVSTYDLWDTYLPAFRTLVVDAKVSGVMCAYNAFQG 242

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
            P C +  L+    R +W F GY+TSDC A+  I++          A  D +  G D++C
Sbjct: 243 QPCCGNDLLMQSILRDKWNFTGYVTSDCGAIDDIFNHHKTHPDAATAAADAVFHGTDLDC 302

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G         AVK   + E ++D ++  LF++R RLGLF+    +  + +I   ++    
Sbjct: 303 GHSAYLALVKAVKDGIITEKQLDVSVKRLFTIRFRLGLFDPVELVD-YARIPISILECRK 361

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           HQ LA Q A++ +VLLKN   LLPL K+K   + ++GPNA+S ++LLGNY G   R +TP
Sbjct: 362 HQDLAKQLARESMVLLKNDQ-LLPLQKNKLKKVVVMGPNADSRESLLGNYNGNPSRMLTP 420

Query: 452 LQALQNYV---ENTVYYPGCDTVACSSA-SIDKAVDIAKGADHVVLMMGLDQTQEKEEL- 506
           LQA++  +       Y  G D V   SA  + + V+ AKGAD V+ + G+    E EE+ 
Sbjct: 421 LQAIRERLGGWTEVEYIEGVDHVNTISADDLKQYVNRAKGADAVIFIGGISPRLEGEEMP 480

Query: 507 ---------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
                    DR  + LP  Q +++ +   A   P + V++ G  + I +    +N+ +IL
Sbjct: 481 VSKDGFDGGDRTTIALPAVQTQMM-KAWVAEHIPTVFVMMTGSALAIPWEA--QNVPAIL 537

Query: 558 WAGYPGEAGAVALAEVIFGDHNP 580
            A Y G+ G  A+A+V+FGD+NP
Sbjct: 538 NAWYGGQYGGEAIADVLFGDYNP 560


>gi|373460527|ref|ZP_09552278.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
 gi|371955145|gb|EHO72949.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
          Length = 699

 Score =  370 bits (949), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 219/548 (39%), Positives = 312/548 (56%), Gaps = 47/548 (8%)

Query: 52  ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
           + Q+AR L++ +TLDEKISQ++N  P IPRLGI  Y+WW+E LHGV   G+         
Sbjct: 9   MQQKARRLINMMTLDEKISQMMNETPGIPRLGIKPYDWWNEGLHGVGRDGR--------- 59

Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIF 164
            AT FPQ I   A+F+  L  +IG AI  E RA YN  Q         G+TFW+PNINIF
Sbjct: 60  -ATVFPQPIGMGATFNPALIRQIGDAIATEGRAKYNVAQRNNNYARYTGLTFWSPNINIF 118

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFTAYDLD 223
           RDPRWGRG ET GEDP +TG   ++YV+G+QG D F        L+ +AC KH   Y + 
Sbjct: 119 RDPRWGRGMETYGEDPFLTGTLGIAYVQGMQGNDPF-------YLKVAACGKH---YAVH 168

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +    TR++ +   T +DL +TY P F+  V+QG    IM AYNRV G      + LL+ 
Sbjct: 169 SGPEATRHEANVSPTKRDLFETYLPAFKMLVQQGHVEAIMGAYNRVYGEACSGSKYLLTD 228

Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
             R+QWGF G+I SDCDAV+ I+      K+  +A    +KAG+++ CG   +   K AV
Sbjct: 229 VLRKQWGFRGHIVSDCDAVADIHAGHKIVKTEAEACAIAIKAGLNIECGHTFEA-MKQAV 287

Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
            QK L E EIDRAL  L   R++LG+   +    P+ ++    +CSP H  LA +AA + 
Sbjct: 288 AQKLLTEQEIDRALLPLMMTRLKLGILEYDAEC-PYNEVKETEICSPEHIALARKAATES 346

Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV 463
           +VLLKN +G+LPL K+   +L + GP A+ +  L+GNY G S R  T LQ + + V +  
Sbjct: 347 MVLLKN-NGILPLDKNLH-TLFIAGPGASDSFWLMGNYFGISNRYCTYLQGIADKVSSGT 404

Query: 464 ---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDL 511
              + P       +  +I+ A+D A  A+  +++MG +   E EE          DRV +
Sbjct: 405 AVNFRPAFGESTPTKNTINWALDEAIAAEKTIVVMGNNGNLEGEEGESIASETRGDRVSM 464

Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
            LP  Q + + R  +A K  +++VL  G P+D+   +  R   +++ A YPG+ G  ALA
Sbjct: 465 RLPASQMKFL-RDLKARKNGIVVVLTGGSPIDVR--EISRLADAVVMAWYPGQEGGYALA 521

Query: 572 EVIFGDHN 579
           +++FGD N
Sbjct: 522 DLLFGDEN 529


>gi|164429277|ref|XP_958209.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
 gi|16945419|emb|CAB91343.2| related to xylan 1, 4-beta-xylosidase [Neurospora crassa]
 gi|157073010|gb|EAA28973.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
          Length = 774

 Score =  370 bits (949), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 221/582 (37%), Positives = 317/582 (54%), Gaps = 33/582 (5%)

Query: 13  LCLCFTSLLTRVDSTQPPFSCDPS--NPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
           L  C ++L++ +D    PF   P   N    +   C  TL   QRA  LV+ +T +EK+ 
Sbjct: 10  LLSCTSALVSAIDL---PFQTYPDCVNGPLASLKVCDATLSPPQRAAALVAAMTTEEKLQ 66

Query: 71  QLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF---NGTIRGATSFPQVILTAASFD 127
            LV+ +   PR+G+PAY WWSEALHGVA    G  F   +G    +TSFP  +L AA+FD
Sbjct: 67  NLVSKSKGAPRIGLPAYNWWSEALHGVA-YAPGTQFRSGDGPFNSSTSFPMPLLMAATFD 125

Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
             L  ++G+ IG E RA  NAG + G  +W PN+N F+DPRWGRG ETPGED L   +YA
Sbjct: 126 DELIEKVGEVIGTEGRAFGNAGFS-GFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYA 184

Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
            S +RG+QG          + +  A CKH+ A D ++W G+TR+ FDA+VT+QDLA+ Y 
Sbjct: 185 ASMIRGLQGPL-------PERRVVATCKHYAANDFEDWNGSTRHDFDAKVTLQDLAEYYL 237

Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG---YITSDCDAVSI 304
            PF+ C +  +   IMC+YN VNG+P+CA+  L+    R  W +     YITSDC+AV  
Sbjct: 238 SPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILREHWNWTAPGNYITSDCEAVLD 297

Query: 305 IYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVR 364
           I+    YAK+  +      +AG D +C          A  Q  L +S +DRAL  L+   
Sbjct: 298 IFANHHYAKTNAEGTALAFEAGTDSSCEYESSSDIPGAWTQGLLEQSTVDRALTRLYEGL 357

Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH----GLLPLPKSK 420
           +R+G F+GN     +  +G   V SP  Q +ALQ A +GIVLLKN      GL   PKSK
Sbjct: 358 VRVGYFDGN--HSEYASLGWKDVNSPKSQEVALQTAVEGIVLLKNDQTLPLGLKTDPKSK 415

Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ--NYVENTVYYPGCDTVACSSASI 478
              LA+IG  AN  KTL G Y+G      +P+ A +   +   T   P       +    
Sbjct: 416 ---LAMIGFWANDPKTLSGGYSGKPAFEHSPVYAAEAMGFNVTTAGGPVLQNSTSNDTWT 472

Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
             A++ A+ A++++   GLD +   E  DR  +  P  Q +LI  + +   KP+++V + 
Sbjct: 473 QAALEAAQDANYILYFGGLDTSAAGETKDRTTINWPEAQLQLIKTLTKLG-KPLVVVQM- 530

Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           G  +D T     + + SILWA +PG+ G  A+ +++ G  +P
Sbjct: 531 GDQLDNTPLLATKTVNSILWANWPGQDGGTAVMQILTGLKSP 572


>gi|365120422|ref|ZP_09338009.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363647477|gb|EHL86692.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 735

 Score =  369 bits (947), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 221/569 (38%), Positives = 319/569 (56%), Gaps = 46/569 (8%)

Query: 33  CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
           C  +  +  TFPF    L   +R  DLVSRLTL+EKISQ++N APAI RLGIPAY+WW+E
Sbjct: 17  CAHTGKAQNTFPFQNPDLSFEKRVDDLVSRLTLEEKISQMLNKAPAIERLGIPAYDWWNE 76

Query: 93  ALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI 152
            LHGV           T    T FPQ I  AA++D  L+ ++  +I  E RA+Y+   + 
Sbjct: 77  CLHGVG---------RTPYKVTVFPQAIGMAATWDDALFQQVASSIADEGRAIYHDAISK 127

Query: 153 -------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
                  G+T+W PNINIFRDPRWGRGQET GEDP +TG    ++V G+QGD        
Sbjct: 128 GVHEIYHGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGTLGKAFVNGLQGDD------P 181

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
             L+ASAC KH+  +   +    +R+ F+  V+M DL DTY P F   V   + S +MCA
Sbjct: 182 KYLKASACAKHYAVH---SGPEISRHFFNTEVSMYDLWDTYLPAFRDLVVDAKVSSVMCA 238

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
           YN + G P C +  L+    R+QW F GY+TSDC A+      + +A +   +   VL  
Sbjct: 239 YNALAGQPCCGNDLLMQDILRKQWKFTGYVTSDCGAIDDFLKHKTHADAAHASADAVLH- 297

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           G D+ CG  +      AVKQ  + E++ID ++  LF  R RLGLF+    ++ +      
Sbjct: 298 GTDLECGQNIYVKLVDAVKQGLITEAQIDESVKRLFMTRFRLGLFDPADRVK-YADTPLS 356

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
           V+    H+ LAL+ +++ +VLLKN + +LPL K+    +A+IGPNA+ +  +LGNY G  
Sbjct: 357 VLECDEHKALALKMSRESVVLLKNDN-VLPLRKNLK-KIAVIGPNADDSTVVLGNYNGFP 414

Query: 446 CRSITPLQALQNYV---ENTVYYPGCDTVACS-SASIDKAVDIAKGADHVVLMMGLDQTQ 501
            + ITPL+A+++ V      +Y    D V  S   +++  ++  KG D V+ + G+    
Sbjct: 415 SKVITPLEAIRSKVGKRTQVIYDRAIDCVKPSDEKTLNALIERLKGVDQVIFVGGISPRL 474

Query: 502 EKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
           E EEL          DR  + LP  Q EL+ ++ EA   PVI V++ G  + I +    +
Sbjct: 475 EGEELPISVDGFRGGDRTTIALPEVQTELMKKMKEAG-LPVIFVMMTGSALGIEWES--Q 531

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           NI +IL A Y G+    A+A+V+FGD+NP
Sbjct: 532 NIPAILNAWYGGQFAGQAIADVLFGDYNP 560


>gi|220927661|ref|YP_002504570.1| glycoside hydrolase [Clostridium cellulolyticum H10]
 gi|219997989|gb|ACL74590.1| glycoside hydrolase family 3 domain protein [Clostridium
           cellulolyticum H10]
          Length = 712

 Score =  369 bits (947), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 225/564 (39%), Positives = 317/564 (56%), Gaps = 56/564 (9%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +L   +RA DLVSR+TL+EK SQL   A  + RLGIP Y WW+EALHGVA  G   
Sbjct: 6   YLDKSLSFKERAVDLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I  AA FD     +I   I  E RA YN     G       +TFW
Sbjct: 64  --------ATVFPQAIGLAAIFDDEFLEKIADVIATEGRAKYNESSKKGDRDIYKGITFW 115

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           +PN+NIFRDPRWGRG ET GEDP +T +  V++V+G+QGD    GK    L+++AC KHF
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD----GKY---LKSAACAKHF 168

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +     R+ F+A  + +D+ +TY P FE+ VK+ +   +M AYNR NG P    
Sbjct: 169 AVH---SGPEDDRHHFNAVASQKDMYETYLPAFEALVKEAKVESVMGAYNRTNGEPCNGS 225

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
           + LL    R  WGF G++ SDC A+   ++  G  K+P ++V   LK G D+NCG+ +  
Sbjct: 226 KTLLKDILRDDWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKNGCDLNCGN-MYL 284

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
               A+K+ K+ E +IDRA   L + RM+LG+F+ +     F KI  +V  S  H  L+L
Sbjct: 285 LILLALKEGKITEEDIDRAAIRLMTTRMKLGMFDDDCE---FDKIPYEVNDSIEHNKLSL 341

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           +AA+  +VLLKN +GLLPL   K  ++A+IGPNA+S+  L  NY+G    +IT L  +++
Sbjct: 342 EAARKSMVLLKN-NGLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSHNITILDGVRS 400

Query: 458 YV-ENT-VYYP--------GCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL- 506
            V E+T V+Y           + +A     + +AV +A+ +D VVL +GLD + E E+  
Sbjct: 401 RVSEDTRVWYSLGSHLFMNREEDLAQPDDRLKEAVSMAERSDVVVLCLGLDASVEGEQND 460

Query: 507 ----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
                     D+ DL LP  Q+ L+  V  A  KP I+ LL G  + I  A       +I
Sbjct: 461 QGTVILDAGGDKADLNLPESQRNLLNAVL-ATGKPTIVALLSGSALSIGDAA--DKAAAI 517

Query: 557 LWAGYPGEAGAVALAEVIFGDHNP 580
           +   YPG  G +A AE+IFGD++P
Sbjct: 518 VQCWYPGSKGGLAFAEMIFGDYSP 541


>gi|378730020|gb|EHY56479.1| beta-glucosidase, variant [Exophiala dermatitidis NIH/UT8656]
 gi|378730021|gb|EHY56480.1| beta-glucosidase [Exophiala dermatitidis NIH/UT8656]
          Length = 783

 Score =  369 bits (947), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 216/542 (39%), Positives = 308/542 (56%), Gaps = 19/542 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T   ++ RA+ LV+ LT +EK +   N++P +PRLG+ +Y+WW EALHGVA    G+ 
Sbjct: 35  CNTNASVADRAKALVAALTNEEKFNLTGNTSPGVPRLGLYSYQWWQEALHGVAS-SPGVN 93

Query: 106 FN--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F+  G    ATSFPQ IL +A+FD  L   +   +  EARA  N  ++ G+ FW PNIN 
Sbjct: 94  FSTSGDFSHATSFPQPILMSAAFDDALINAVATVVSTEARAFNNVNRS-GLDFWTPNINP 152

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           ++DPRWGRGQETPGED      Y  + + G+QG     G      +  A CKHF AYDL+
Sbjct: 153 YKDPRWGRGQETPGEDTFHLKSYVAALIDGLQG-----GLNPPIKKVIATCKHFVAYDLE 207

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W  T RY FDA V+ QDLA+ Y  PF++C +  R   IMC+YN +NG+P+CAD  +L  
Sbjct: 208 DWITTDRYNFDAIVSTQDLAEYYMQPFQTCARDARVGSIMCSYNAMNGVPTCADPYILQT 267

Query: 284 TARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R  W +     Y+TSDCDA+  IY    Y  + E AV D L AG D+NCG++ Q H  
Sbjct: 268 VLREHWNWTDDGQYVTSDCDAIQNIYAPHYYEPTREQAVADALTAGTDLNCGTYYQTHLP 327

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
           AA  +    ++ ID+ +  L+S  ++LG F+  P+  P+  +    V +PA + LAL+AA
Sbjct: 328 AAFSEGLFNQTVIDQTITRLYSALIKLGYFDP-PSATPYRSLNWSDVSTPAAEALALKAA 386

Query: 401 QDGIVLLKNSHGLLPL--PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
           ++GIVLLKN  GLLPL  P  K+ ++A+IG  AN+  T+ GNY G +    +PL ALQ  
Sbjct: 387 EEGIVLLKND-GLLPLSFPTDKNTTVAIIGGWANATTTMQGNYFGIAPYLHSPLYALQQL 445

Query: 459 VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
                 Y G   V  +    D+ +  A  AD +++  GL  + E E  DR  +       
Sbjct: 446 PNINAVYGGGFGVPTTDG-WDELLGAAGEADLIIIADGLTTSDESESNDRYTIGWQPAAI 504

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           ++I +++   K  V L +  G  +D T    + NI +++W GYPG AG  AL  ++ G  
Sbjct: 505 DIINQLSGMGKPTVFLQM--GDQLDNTPLLNNPNISALIWGGYPGMAGGDALINILTGKA 562

Query: 579 NP 580
            P
Sbjct: 563 AP 564


>gi|242771939|ref|XP_002477942.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
 gi|218721561|gb|EED20979.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
          Length = 797

 Score =  369 bits (946), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 211/541 (39%), Positives = 317/541 (58%), Gaps = 21/541 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGV--AGVGKG 103
           C T++   +RA  L+S  TL+E I+   NSAP +PRLG+P Y+ WSE LHG+  A   K 
Sbjct: 64  CNTSVNYVERAEGLISLFTLEELINNTQNSAPGVPRLGLPPYQVWSEGLHGLDRANWAKS 123

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
                  + ATSFP  IL+ A+ +  L  +I   I  +ARA  N G+  G+  +APNIN 
Sbjct: 124 ---GEEWKWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGR-YGLDAYAPNING 179

Query: 164 FRDPRWGRGQETPGEDP-LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
           FR P WGRGQETPGED   ++  YA  Y+ G+QG     G     L+  A  KHF  YDL
Sbjct: 180 FRSPLWGRGQETPGEDAGFLSSSYAYEYITGLQG-----GVDPEHLKIVATAKHFAGYDL 234

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           +NW   +R  FDA +T QDLA+ Y P F +  +  +A   MC+YN VNG+PSC+   LL 
Sbjct: 235 ENWNNNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFLLQ 294

Query: 283 KTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
              R  W F  +GY++SDCDA   +++  GYA +   A  D L+AG D++CG     +  
Sbjct: 295 TLLRENWDFPDYGYVSSDCDAAYNVFNPHGYAINISAAAADSLRAGTDIDCGQTYPWYLN 354

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG-ADVVCSPAHQVLALQA 399
            +  +  +   EI+R+L  L+S  ++LG F+GN     + ++G  DVV + A  + + +A
Sbjct: 355 QSFIEGSVTRGEIERSLIRLYSNLVKLGYFDGN--QSEYRQLGWNDVVATDAWNI-SYEA 411

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A +GIVLLKN  G+LPL + K  S+A+IGP AN+ + L GNY GP+   ITPLQA ++  
Sbjct: 412 AVEGIVLLKND-GVLPLSE-KLKSVAVIGPWANATQQLQGNYFGPAPYLITPLQAARDAG 469

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
               Y  G + +  ++     A+  AK +D ++ + G+D T E E  DR+++  PG Q +
Sbjct: 470 YKVNYAFGTNILGNTTDGFAAALSAAKKSDVIIYLGGIDNTIEAEGTDRMNVTWPGNQLD 529

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           LI ++++   KP++++ + GG VD +  K + N+ +++W GYPG++G  A+ +++ G   
Sbjct: 530 LIQQLSQTG-KPLVVLQMGGGQVDSSSLKSNNNVNALVWGGYPGQSGGKAIFDILSGKRA 588

Query: 580 P 580
           P
Sbjct: 589 P 589


>gi|302669556|ref|YP_003829516.1| beta-xylosidase [Butyrivibrio proteoclasticus B316]
 gi|302394029|gb|ADL32934.1| beta-xylosidase Xyl3A [Butyrivibrio proteoclasticus B316]
          Length = 709

 Score =  369 bits (946), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 218/555 (39%), Positives = 319/555 (57%), Gaps = 57/555 (10%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           +RA++LV+++T++EK SQL   APAI RLGIPAY WW+EALHGVA  G           A
Sbjct: 8   KRAKELVAKMTVEEKASQLRYDAPAIDRLGIPAYNWWNEALHGVARAGT----------A 57

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
           T FPQ I  AA+FD  L   +G+ I  EARA YN            G+TFWAPN+NIFRD
Sbjct: 58  TMFPQAIGLAAAFDEELMSEVGEVIAEEARAKYNEQSKREDRDIYKGLTFWAPNVNIFRD 117

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T + AV +V+ +QGD    G+    ++A+AC KHF  +   +  
Sbjct: 118 PRWGRGHETYGEDPFLTSRLAVPFVKAMQGD----GEY---MKAAACAKHFAVH---SGP 167

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R+ FDA+ + +DL +TY P FE+ VK+     +M AYNR NG P CA++ L+  T R
Sbjct: 168 EGERHFFDAKASKKDLEETYLPAFEALVKEAEVEAVMGAYNRTNGEPCCANKPLMVDTLR 227

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
            +WGF G+  SDC A+   ++      SPE++    L+ G D+NCG   Q      V+  
Sbjct: 228 GKWGFQGHFVSDCWAIKDFHENHKVTSSPEESAKLALEMGCDLNCGCTYQS-IMNGVRAG 286

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            + E  I  +   LF+ R  LG+F+       F +I  + V    H  +A +AA++ +VL
Sbjct: 287 LIDEKLITESCERLFTTRFLLGMFD----KTEFDEIPYEKVECKEHLAVAKRAARESVVL 342

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
           LKN  GLLPL K    ++ ++GPNANS  +L+GNY G S R IT L+ +Q+ V + V   
Sbjct: 343 LKND-GLLPLNKDSIKTIGVVGPNANSRLSLIGNYHGTSSRYITVLEGIQDKVGDDVRVL 401

Query: 464 YYPGCDTVACSSAS---------IDKAVDIAKGADHVVLMMGLDQTQEKEE--------- 505
           Y  GCD    + ++         + +A  +A  +D VV+++GLD+  E EE         
Sbjct: 402 YSEGCDIFQNNISNLADPNLPDRLSEAQAVADHSDVVVVVVGLDENLEGEEGDAGNQFAS 461

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            D+++L LP  Q++L+  V +   KP I++ + G  +D++ A+ + N  ++L A YPG  
Sbjct: 462 GDKINLNLPLSQRQLLNAVLDCG-KPTIVIDMAGSAIDLSKAQDEAN--AVLQAFYPGAR 518

Query: 566 GAVALAEVIFGDHNP 580
           G   +A+++FGD +P
Sbjct: 519 GGADVADILFGDVSP 533


>gi|67902828|ref|XP_681670.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
 gi|74592887|sp|Q5ATH9.1|BXLB_EMENI RecName: Full=Exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|40747867|gb|EAA67023.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
 gi|259484335|tpe|CBF80465.1| TPA: beta-1,4-xylosidase (Eurofung) [Aspergillus nidulans FGSC A4]
          Length = 763

 Score =  368 bits (945), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 229/579 (39%), Positives = 317/579 (54%), Gaps = 31/579 (5%)

Query: 7   SLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLD 66
           SL  P LC          D T  P S           P C T+L   +RA+ LVS LTL+
Sbjct: 14  SLFIPALCQS-----NYPDCTTGPLS---------ELPICDTSLSPLERAKSLVSALTLE 59

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF--NGTIRGATSFPQVILTAA 124
           EKI+   + A    RLG+PAY WW+EALHGVA    G+ F  +G    ATSFP  I+  A
Sbjct: 60  EKINNTGHEAAGSSRLGLPAYNWWNEALHGVA-EKHGVSFEESGDFSYATSFPAPIVLGA 118

Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
           +F+  L  R+ + I  EARA  N+  A G+ +W PN+N F+DPRWGRGQETPGEDPL   
Sbjct: 119 AFNDALIRRVAEIISTEARAFSNSDHA-GIDYWTPNVNPFKDPRWGRGQETPGEDPLHCS 177

Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
           +Y   +V G+QGD         K +  A CKH  AYDL+ W G +R++FDA+V+  DL +
Sbjct: 178 RYVKEFVGGLQGDD------PEKPKVVATCKHLAAYDLEEWGGVSRFEFDAKVSAVDLLE 231

Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG---YITSDCDA 301
            Y PPF++C         MC+YN +NG+P+CADR LL    R  WG+ G   ++T DC A
Sbjct: 232 YYLPPFKTCAVDASVGAFMCSYNALNGVPACADRYLLQTVLREHWGWEGPGHWVTGDCGA 291

Query: 302 VSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLF 361
           V  I     Y +S  +A    L AG+D++CG++L  +   A +Q  +    +D AL  L+
Sbjct: 292 VERIQTYHHYVESGPEAAAAALNAGVDLDCGTWLPSYLGEAERQGLISNETLDAALTRLY 351

Query: 362 SVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS 421
           +  ++LG F+     QP   +G D V +   + LA   A  G VLLKN    LPL  +  
Sbjct: 352 TSLVQLGYFD-PAEGQPLRSLGWDDVATSEAEELAKTVAIQGTVLLKNIDWTLPLKANG- 409

Query: 422 VSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKA 481
            +LALIGP  N    L  NYAGP+    T ++A +    N +  PG +  + S+   D A
Sbjct: 410 -TLALIGPFINFTTELQSNYAGPAKHIPTMIEAAERLGYNVLTAPGTEVNSTSTDGFDDA 468

Query: 482 VDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGP 541
           + IA  AD ++   G+D T E+E LDR  +  PG Q+ELI  +AE   +P+ +V   GG 
Sbjct: 469 LAIAAEADALIFFGGIDNTVEEESLDRTRIDWPGNQEELILELAELG-RPLTVVQFGGGQ 527

Query: 542 VDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           VD +       +G+I+WAGYP +AG   + +V+ G   P
Sbjct: 528 VDDSALLASAGVGAIVWAGYPSQAGGAGVFDVLTGKAAP 566


>gi|336471692|gb|EGO59853.1| hypothetical protein NEUTE1DRAFT_99999 [Neurospora tetrasperma FGSC
           2508]
 gi|350292807|gb|EGZ74002.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
          Length = 770

 Score =  368 bits (944), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 224/590 (37%), Positives = 322/590 (54%), Gaps = 33/590 (5%)

Query: 5   KLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPS--NPSTETFPFCKTTLPISQRARDLVSR 62
           K SLV   L  C + L++ +D    PF   P   N    +   C  TL   QRA  LV+ 
Sbjct: 2   KSSLVPYCLLSCSSVLVSAIDL---PFQTYPDCVNGPLASLKVCDVTLSPPQRAAALVAA 58

Query: 63  LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF--NGTIRGATSFPQVI 120
           +T +EK+  LV+ +   PR+G+PAY WWSEALHGVA      F+  +G    +TSFP  +
Sbjct: 59  MTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVAYAPGTQFWSGDGPFNASTSFPMPL 118

Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
           L AA+FD  L  ++G+ IG E RA  NAG + G  +W PN+N F+DPRWGRG ETPGED 
Sbjct: 119 LMAATFDDELIEKVGEVIGTEGRAFGNAGFS-GFDYWTPNVNPFKDPRWGRGSETPGEDI 177

Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
           L   +YA S +RG+QG          + +  A CKH+ A D ++W G+TR+ F+A+VT+Q
Sbjct: 178 LRIKRYAASMIRGLQGPAR-------ERRVVATCKHYAANDFEDWNGSTRHDFNAKVTLQ 230

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG---YITS 297
           DLA+ Y  PF+ C +  +   IMC+YN VNG+P+CA+  L+    R  W +     YITS
Sbjct: 231 DLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILREHWNWTAPGNYITS 290

Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRAL 357
           DC+AV  I     YA++  +      +AG+D +C          A  Q  L +S +DRAL
Sbjct: 291 DCEAVLDISANHHYAETNAEGTALAFEAGIDSSCEYESSSDIPGAWTQGLLEQSTVDRAL 350

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL- 416
             ++   +R+G F+GN     +  +G   V SP  Q +ALQAA +GIVLLKN    LPL 
Sbjct: 351 KRIYEGLVRVGYFDGN--HSEYASLGWKDVNSPKSQEVALQAAVEGIVLLKNDK-TLPLD 407

Query: 417 ----PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ--NYVENTVYYPGCDT 470
               PKSK   LA+IG  AN  KTL G Y+G      +P+ A Q   +   T   P    
Sbjct: 408 LRTDPKSK---LAMIGFWANDPKTLSGGYSGKPAFEHSPVYAAQAMGFSVTTAGGPVLQN 464

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKK 530
              +      A++ AK A++++   G D +   E  DR  +  P  Q +LIT +++   K
Sbjct: 465 STSNDTWTQAALEAAKDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLITTLSKLG-K 523

Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           P+++V + G  +D T     + + +ILWA + G+ G  A+ +++ G  NP
Sbjct: 524 PLVVVQM-GDQLDNTPLLAAKAVNAILWANWLGQDGGTAVMQILTGLKNP 572


>gi|160881137|ref|YP_001560105.1| glycoside hydrolase family 3 [Clostridium phytofermentans ISDg]
 gi|160429803|gb|ABX43366.1| glycoside hydrolase family 3 domain protein [Clostridium
           phytofermentans ISDg]
          Length = 717

 Score =  368 bits (944), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 216/560 (38%), Positives = 311/560 (55%), Gaps = 50/560 (8%)

Query: 47  KTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF 106
           KT     QRA +LV ++TL+EK+ Q ++SAP+IPRL I AY +W+EALHGVA  G     
Sbjct: 5   KTQQEFQQRATELVKKMTLEEKVFQTLHSAPSIPRLDIKAYNYWNEALHGVARAGV---- 60

Query: 107 NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAP 159
                 AT FPQ I  AA+FD  L   I   I  E R  +NA Q  G       +TFW+P
Sbjct: 61  ------ATVFPQAIGLAATFDEDLIEEIADTISTEGRGKFNAQQKYGDHDIYKGLTFWSP 114

Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
           N+NIFRDPRWGRG ET GEDP ++G     +V G+QG        +  L+A+AC KHF  
Sbjct: 115 NVNIFRDPRWGRGHETFGEDPFLSGTLGGRFVDGIQGHD------ETYLKAAACAKHFAV 168

Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
           +   +     R+ F+A V+ QDL +TY P F+  VK+ +   +M AYNR NG P C  + 
Sbjct: 169 H---SGPEDIRHSFNAEVSEQDLRETYLPAFKKLVKEHKVEAVMGAYNRTNGEPCCGSKT 225

Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
           LL    R +W F G++TSDC A+   ++      +  ++V   +  G D+NCG+ L  + 
Sbjct: 226 LLEDILRGEWEFVGHVTSDCWAIKDFHEHHMVTSNAVESVALAMNRGCDLNCGN-LYVNL 284

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
             AV+   + E  ID AL  LF+ RM+LGLF+   ++ PF  I  D V + + + L ++A
Sbjct: 285 LQAVRDGLVEEETIDTALIRLFTTRMKLGLFDKEESI-PFNTITYDQVDTKSSKELNIKA 343

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           ++  +VLLKN   +LPL   K  S+ +IGPNAN+   L+GNY G +   IT L+ ++  V
Sbjct: 344 SKKCVVLLKNEDNILPLNPKKITSVGVIGPNANNRNALVGNYEGTASEYITVLEGIKQVV 403

Query: 460 ENTV---YYPGCDTVACSSASIDKAVD-------IAKGADHVVLMMGLDQTQEKEELDRV 509
              V   +  GC       +++ +  D       + + +D V+  +GLD   E EE D+ 
Sbjct: 404 PEDVRVYFSEGCHLFKNKLSNLSQENDRIAEVRAVCEHSDVVIACLGLDPGLEGEEGDQG 463

Query: 510 D---------LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
           +         L LPG Q++++  + E   KPVIL+LL G  + + +A  D +I +IL   
Sbjct: 464 NQFASGDKKTLALPGIQEDVLKTIYECG-KPVILILLSGSALAVPWA--DEHIPAILQGW 520

Query: 561 YPGEAGAVALAEVIFGDHNP 580
           YPG  G  A+AE+IFGD NP
Sbjct: 521 YPGAQGGRAIAELIFGDGNP 540


>gi|291518645|emb|CBK73866.1| Beta-glucosidase-related glycosidases [Butyrivibrio fibrisolvens
           16/4]
          Length = 713

 Score =  368 bits (944), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 215/553 (38%), Positives = 310/553 (56%), Gaps = 52/553 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           +RA++LVS++T++EK SQ+++ A AI RLGIP Y WW+EALHGVA  G           A
Sbjct: 7   KRAKELVSQMTIEEKCSQMLHHAEAIDRLGIPKYCWWNEALHGVARAGD----------A 56

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
           T FPQ I   A+FD  L  ++      E RA YN            G+T+WAPN+NIFRD
Sbjct: 57  TVFPQAIGLGATFDEELVEKVADVTSTEGRAKYNEFTKHGDRDIYKGLTYWAPNVNIFRD 116

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +TG+  ++YVRG+QGD  +  K      ++AC KHF  +   +  
Sbjct: 117 PRWGRGHETYGEDPYLTGQLGMAYVRGLQGDDLDNPK------SAACAKHFAVH---SGP 167

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R+ FDA+V  QDL DTY   F+  VK  +   +M AYNRVNG P+C  + LL    R
Sbjct: 168 EAERHHFDAKVNDQDLYDTYLYAFKRLVKDAKVEAVMGAYNRVNGEPACGSKRLLKDILR 227

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
             WGF G++ SDC A+   ++         ++    +  G D+NCG   +K    A K  
Sbjct: 228 GDWGFEGHVVSDCWAIRDFHENHKVTGCEVESAALAVNNGCDLNCGCVYEK-LLYAYKAN 286

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            + E  I  ++  L  +R+RLG      +   +  I  +VV    H+ LA++AA+  +VL
Sbjct: 287 LVTEETITESVERLIELRLRLGTLPERRS--KYDDIPYEVVECKEHKELAIEAAKRSMVL 344

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYP 466
           LKN  GLLPL K +  ++ +IGPN+NS   L+GNY G S   IT L+ +Q YV + V   
Sbjct: 345 LKND-GLLPLKKDEIKTIGVIGPNSNSRMALVGNYEGISSEYITVLEGIQQYVGDDVRVF 403

Query: 467 GCD----------TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
             D           ++ +  +  +A+ +A+ +D VVL MGLD T E EE          D
Sbjct: 404 HSDGTPLWKDRMHVLSEARDTFAEAMAVAEHSDVVVLAMGLDSTIEGEEGDAGNEFGSGD 463

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           +  L LPG QQEL+ ++  A  KPV+L++L G  +D+++A  + N+ +I+   YPG  G 
Sbjct: 464 KKGLKLPGLQQELLEKIT-AIGKPVVLLVLAGSAMDLSWA--NENVNAIMHCWYPGARGG 520

Query: 568 VALAEVIFGDHNP 580
            A+A+V+FG+ +P
Sbjct: 521 KAIAQVLFGEDSP 533


>gi|288924872|ref|ZP_06418809.1| beta-glucosidase [Prevotella buccae D17]
 gi|288338659|gb|EFC77008.1| beta-glucosidase [Prevotella buccae D17]
          Length = 721

 Score =  368 bits (944), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 212/547 (38%), Positives = 310/547 (56%), Gaps = 44/547 (8%)

Query: 52  ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
           +S+ A+++++R+T+ EKISQL+N +PAI  LGI  Y+WWSE LHGV   G+         
Sbjct: 30  MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR--------- 80

Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIF 164
            AT FPQ I   A+FD  L   IG A+  E RA +N  + +       G+TFW+PN+NIF
Sbjct: 81  -ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIF 139

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDPL++G    +YVRG+QGD          L+  AC KH   Y + +
Sbjct: 140 RDPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDD------AFYLKTGACAKH---YAVHS 190

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
               TR++ D   + +DL +TY P F+  V+QGR   +M AYNRV G P    + LL+  
Sbjct: 191 GPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDI 250

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R+ WGF+G+I SDCDA++  Y    Y K+PE+A    +KAG++V CG    K  + A+ 
Sbjct: 251 LRKSWGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECGHTF-KAMQGALD 309

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           Q  L E+++DRAL  L   R++LG+   +    P+       +CSPAH  LAL+AA + +
Sbjct: 310 QGLLAEADLDRALFPLVMTRLKLGILEPDSAC-PYNSYDESEICSPAHTALALRAADEAM 368

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV- 463
           VLLKN +G+LPL K+   +L + GP A+ A  L+GNY G S R  T LQ + + V +   
Sbjct: 369 VLLKN-NGILPLDKNIR-TLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSGTS 426

Query: 464 --YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDLV 512
             + P    +      ++ AV+ A  A+  +++MG +   E EE          DRV + 
Sbjct: 427 VNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGIG 486

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           LP  Q   + RV       +++VL  G P+D+   +  +   +++ A YPG+ G  AL +
Sbjct: 487 LPASQMNYLRRVKARKGGRIVVVLTGGSPIDLR--EISKLADAVVMAWYPGQEGGEALGD 544

Query: 573 VIFGDHN 579
           ++FGD N
Sbjct: 545 LLFGDKN 551


>gi|280977785|gb|ACZ98610.1| glucosidase [Cellulosilyticum ruminicola]
          Length = 711

 Score =  367 bits (942), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 221/559 (39%), Positives = 312/559 (55%), Gaps = 60/559 (10%)

Query: 52  ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
               A++LV ++ L EK SQL   APAI RLGIP Y WW+EALHGVA  G          
Sbjct: 4   FKNEAKELVRQMDLLEKASQLRYDAPAIKRLGIPTYNWWNEALHGVARAGV--------- 54

Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIF 164
            AT FPQ I  AA FD      I   I +E RA YN            GMTFWAPNINIF
Sbjct: 55  -ATVFPQAIGLAAMFDEEKLGEIADIIAIEGRAKYNQFSQKEDRDIYKGMTFWAPNINIF 113

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +T +  V++++G+QGD     + +  L+A+AC KHF  +   +
Sbjct: 114 RDPRWGRGHETYGEDPYLTARLGVAFIKGLQGD-----ENEDYLKAAACAKHFAVH---S 165

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FDA V+ +DL +TY P FE+ VK+    G+M AYNRVNG P+C  + LL   
Sbjct: 166 GPEEDRHHFDAIVSKKDLYETYLPAFEAAVKEANVIGVMGAYNRVNGEPACGSKTLLVDI 225

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS----FLQKHTK 340
            ++ WGF GYI SDC A+   +       +  ++    +  G ++NCG+     L+ H +
Sbjct: 226 LKKDWGFDGYIVSDCWAIRDFHTEHMVTHTAAESAALAINNGCELNCGNTYLHMLEAHQE 285

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
             VK++ + E     A   L  +RM+LGLF+ N     + +I   V     H+ +AL+A+
Sbjct: 286 GLVKEEIITE-----AAEKLMRIRMQLGLFDKNC---KYNEIPYAVNDCKVHREVALEAS 337

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
           +  +V+LKN  G+LPL K K  S+ +IGP AN+   L GNY G + R  T ++ +Q+YV 
Sbjct: 338 RRSMVMLKND-GILPLNKDKLKSIGIIGPTANNRTVLEGNYNGTASRYTTFVEGIQDYVG 396

Query: 461 NTV---YYPGCDTVACSSASI-------DKAVDIAKGADHVVLMMGLDQTQEKEE----- 505
           + V   Y  GC   A   +++        +A+ +A+ +D VVL +GLD T E E+     
Sbjct: 397 DDVRVYYSEGCHLFANGMSNLAWENDREAEALIVAEQSDVVVLCLGLDSTIEGEQGDTGN 456

Query: 506 ----LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
                D++ L L GRQQ+L+ +V  A  KPVILVL  G  + I +A  D +  +I    Y
Sbjct: 457 AFAGGDKLSLNLIGRQQQLLEKVV-AVGKPVILVLSTGSAMAINYA--DEHCNAIFQTWY 513

Query: 562 PGEAGAVALAEVIFGDHNP 580
           PG  G  ALA+++FG+++P
Sbjct: 514 PGAQGGKALAQLLFGEYSP 532


>gi|255545654|ref|XP_002513887.1| hypothetical protein RCOM_1034150 [Ricinus communis]
 gi|223546973|gb|EEF48470.1| hypothetical protein RCOM_1034150 [Ricinus communis]
          Length = 246

 Score =  367 bits (942), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 174/245 (71%), Positives = 202/245 (82%), Gaps = 5/245 (2%)

Query: 1   MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
           M   K+SL+       F   +    + QPPFSCD SNPST+++ FC+T+LPISQRA+DLV
Sbjct: 1   MKLRKISLLTLFHLTSFLIFIFADSAPQPPFSCDYSNPSTKSYLFCQTSLPISQRAKDLV 60

Query: 61  SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV-----GKGIFFNGTIRGATS 115
           SRLTLDEKISQLV++AP+IPRLG+PAY WWSEALHGVA +      +GI FNGTIR ATS
Sbjct: 61  SRLTLDEKISQLVDTAPSIPRLGVPAYRWWSEALHGVAFLPDVSERQGIHFNGTIRSATS 120

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQET 175
           FPQVI+TAASFD+ LWYRIGQ  G EARA+YNAGQAIGMTFWAPNINIFRDPRWGRGQET
Sbjct: 121 FPQVIITAASFDTKLWYRIGQVTGKEARAIYNAGQAIGMTFWAPNINIFRDPRWGRGQET 180

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
           PGEDP+VTGKYAVS+VRGVQGD+F GG L  KLQASACCKHFTAYDL+ W GT R+ F+A
Sbjct: 181 PGEDPMVTGKYAVSFVRGVQGDSFEGGVLGEKLQASACCKHFTAYDLEKWNGTNRFIFNA 240

Query: 236 RVTMQ 240
           +V+ +
Sbjct: 241 QVSAR 245


>gi|145230215|ref|XP_001389416.1| exo-1,4-beta-xylosidase xlnD [Aspergillus niger CBS 513.88]
 gi|74626559|sp|O00089.2|XYND_ASPNG RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|292495287|sp|A2QA27.1|XYND_ASPNC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|2181180|emb|CAB06417.1| xylosidase [Aspergillus niger]
 gi|134055533|emb|CAK37179.1| xylosidase xlnD-Aspergillus niger
 gi|350638468|gb|EHA26824.1| hypothetical protein ASPNIDRAFT_205670 [Aspergillus niger ATCC
           1015]
          Length = 804

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 210/546 (38%), Positives = 305/546 (55%), Gaps = 19/546 (3%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
            C  T     RA  L+S  TLDE I+   N+   + RLG+PAY+ WSEALHG+    +  
Sbjct: 68  ICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGL---DRAN 124

Query: 105 FFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F + G    ATSFPQ ILT A+ +  L ++I   I  + RA  NAG+  G+  +APNIN 
Sbjct: 125 FSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGR-YGLDVYAPNINT 183

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           FR P WGRGQETPGED  +   YA  Y+ G+QG        +  L+ +A  KH+  YD++
Sbjct: 184 FRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGP-----DPESNLKLAATAKHYAGYDIE 238

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           NW   +R   D  +T QDL++ Y P F    +  +   +MCAYN VNG+P+CAD   L  
Sbjct: 239 NWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADSYFLQT 298

Query: 284 TARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R  +GF  HGY++SDCDA   IY+  GYA S   A  + + AG D++CG+  Q H   
Sbjct: 299 LLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNE 358

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM--QPFGKIGADVVCSPAHQVLALQA 399
           ++    L   +I++ +  L++  ++ G F+ N T    P+  +    V       ++ QA
Sbjct: 359 SIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWNISYQA 418

Query: 400 AQDGIVLLKNSHGLLPLPKS----KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           A  GIVLLKNS+ +LPL +      + ++ALIGP AN+   LLGNY G +   I+P  A 
Sbjct: 419 ATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAF 478

Query: 456 QNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPG 515
           +       +  G    + S++    A+  A+ AD ++   G+D T E E LDR  +  PG
Sbjct: 479 EEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPG 538

Query: 516 RQQELITRVAEAA-KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
            Q +LI ++A AA KKP+I++ + GG VD +  K + N+ ++LW GYPG++G  AL ++I
Sbjct: 539 NQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDII 598

Query: 575 FGDHNP 580
            G  NP
Sbjct: 599 TGKKNP 604


>gi|290889355|gb|ADD69953.1| xylosidase HistTag [synthetic construct]
          Length = 810

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 210/546 (38%), Positives = 305/546 (55%), Gaps = 19/546 (3%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
            C  T     RA  L+S  TLDE I+   N+   + RLG+PAY+ WSEALHG+    +  
Sbjct: 68  ICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGL---DRAN 124

Query: 105 FFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F + G    ATSFPQ ILT A+ +  L ++I   I  + RA  NAG+  G+  +APNIN 
Sbjct: 125 FSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGR-YGLDVYAPNINT 183

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           FR P WGRGQETPGED  +   YA  Y+ G+QG        +  L+ +A  KH+  YD++
Sbjct: 184 FRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGP-----DPESNLKLAATAKHYAGYDIE 238

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           NW   +R   D  +T QDL++ Y P F    +  +   +MCAYN VNG+P+CAD   L  
Sbjct: 239 NWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADSYFLQT 298

Query: 284 TARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R  +GF  HGY++SDCDA   IY+  GYA S   A  + + AG D++CG+  Q H   
Sbjct: 299 LLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNE 358

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM--QPFGKIGADVVCSPAHQVLALQA 399
           ++    L   +I++ +  L++  ++ G F+ N T    P+  +    V       ++ QA
Sbjct: 359 SIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWNISYQA 418

Query: 400 AQDGIVLLKNSHGLLPLPKS----KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           A  GIVLLKNS+ +LPL +      + ++ALIGP AN+   LLGNY G +   I+P  A 
Sbjct: 419 ATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAF 478

Query: 456 QNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPG 515
           +       +  G    + S++    A+  A+ AD ++   G+D T E E LDR  +  PG
Sbjct: 479 EEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPG 538

Query: 516 RQQELITRVAEAA-KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
            Q +LI ++A AA KKP+I++ + GG VD +  K + N+ ++LW GYPG++G  AL ++I
Sbjct: 539 NQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDII 598

Query: 575 FGDHNP 580
            G  NP
Sbjct: 599 TGKKNP 604


>gi|307719075|ref|YP_003874607.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6192]
 gi|306532800|gb|ADN02334.1| glycoside hydrolase family 3 [Spirochaeta thermophila DSM 6192]
          Length = 693

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 219/547 (40%), Positives = 310/547 (56%), Gaps = 47/547 (8%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
           ++R   L+SR++++EK   +V+ A  +PRLGIP Y WW+EALHGVA  G+          
Sbjct: 4   NERMTSLLSRMSIEEKAGLMVHRAKGVPRLGIPNYNWWNEALHGVANSGE---------- 53

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFR 165
           AT FPQ I  AA+FD  L  R+  AI  EARA +NA           G+TFW+PNINI+R
Sbjct: 54  ATVFPQAIGLAATFDPDLVRRVADAISREARAKFNAVGKERAAEYERGLTFWSPNINIYR 113

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRGQET GEDP +T K  V++V+G+QGD          L+ +AC KH+  +     
Sbjct: 114 DPRWGRGQETYGEDPFLTSKIGVAFVKGLQGDH------PYYLRVAACAKHYAVHSGPE- 166

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FDARV+ +DL +TY P FE+ VK G    +M AYNRVNG P+C  + LL +  
Sbjct: 167 --GLRHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGSKRLLEEIL 223

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R++WGF G++ SDC A++  +      K P +++   L+AG D+NCG+  + H   AVK 
Sbjct: 224 RKKWGFKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAGCDLNCGNTYE-HLLDAVKA 282

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E  +DR++  L S   RLGLF  +    P+ ++    +   AH+ LA +AA+  +V
Sbjct: 283 GAVSEELVDRSVARLLSTLDRLGLFTDD---HPYVRLSLADIDWEAHRALAREAAEKSVV 339

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NT 462
           LLKN +G+LPL + K   + + GPNA +   LLGNYAG S R +T L+ +  Y       
Sbjct: 340 LLKN-NGILPLDRRKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITV 398

Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV---------DLVL 513
            Y  GC         ID A  +A+ AD  V +MG D   E EE D +         DL L
Sbjct: 399 TYKIGCPLQGNKINPIDWASGVARYADVTVAVMGRDSAVEGEEGDAIFSDNYGDLSDLNL 458

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
              Q + + R+ E   KP+++VLL G PV     + +    +I++A YPGE G  A+A V
Sbjct: 459 SREQIDYLRRIKEIG-KPLVVVLLSGAPV--CSPELEELADAIVYAWYPGEEGGNAIARV 515

Query: 574 IFGDHNP 580
           +FG+ +P
Sbjct: 516 LFGEVSP 522


>gi|154313073|ref|XP_001555863.1| hypothetical protein BC1G_05538 [Botryotinia fuckeliana B05.10]
          Length = 755

 Score =  366 bits (939), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 215/541 (39%), Positives = 305/541 (56%), Gaps = 34/541 (6%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+     RA  L+S  TL EK++   N++P +PR+G+P+YEWW+EALHG+A    G  
Sbjct: 34  CDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIA-RSPGTT 92

Query: 106 FNGT---IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           F  T      +TSFPQ IL  A+FD  L +++   +  EARA  N  +  G+ FW PNIN
Sbjct: 93  FAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNR-FGLNFWTPNIN 151

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
            ++DPRWGRGQETPGEDP  T  Y  + + G+QG   +    KG     A CKHF  YDL
Sbjct: 152 PYKDPRWGRGQETPGEDPFHTSSYVNALITGLQGGLDDLPYKKGV----ATCKHFAGYDL 207

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           +N  G  RY FDA +  QDL D Y PPF+ C +      +MC+YN +NG+P+CAD  LL 
Sbjct: 208 ENSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCADDWLLQ 267

Query: 283 KTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
              R  WG+     ++TSDCDAV  I+D   Y  +PE +  D L AG D++CG+F   + 
Sbjct: 268 TLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSAADALNAGTDLDCGTFWPTYL 327

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
            +A  Q     S +DR+L   ++  +RLG F+  P++QP+ ++  D V +PA Q LALQA
Sbjct: 328 GSAYDQGLYDISTLDRSLARRYASLVRLGYFDP-PSVQPYRQLNWDNVSTPAAQQLALQA 386

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A+DGIVLLKN  G+LPL  S   ++ALIGP AN+ K + GNY G +    +PL A QN  
Sbjct: 387 AEDGIVLLKND-GILPL-SSNITNVALIGPLANATKQMQGNYYGTAPYLRSPLIAAQNAG 444

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
               Y  G D  + ++     A+  A+ AD V+ + G+D + E EE+   +L  P     
Sbjct: 445 FKVTYVQGADIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAEEI-LANLSTP----- 498

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
                        +++   G  +D +    +  + ++LWAGYPG+ G  A+  ++ G   
Sbjct: 499 -------------LIISQMGCMIDSSSLLSNTGVNALLWAGYPGQDGGTAIFNILTGKTA 545

Query: 580 P 580
           P
Sbjct: 546 P 546


>gi|291544853|emb|CBL17962.1| Beta-glucosidase-related glycosidases [Ruminococcus champanellensis
           18P13]
          Length = 697

 Score =  366 bits (939), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 224/561 (39%), Positives = 321/561 (57%), Gaps = 59/561 (10%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +L   +RA DL  RLT++E+ SQL   A  IPRLGIPAY WW+E LHGVA  G   
Sbjct: 9   YLNPSLTPDERAEDLADRLTVEEQASQLRYDALPIPRLGIPAYNWWNEGLHGVARAGT-- 66

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
                   AT FPQ I  AA+FD+ L ++IG+    EARA + A +         G+T W
Sbjct: 67  --------ATMFPQAIGMAATFDTALLHQIGEITATEARAKHMAAREHGDFDIYKGLTLW 118

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           APNIN+FRDPRWGRG ET GEDP +T +  V++V+G+QG+    GK+   L+A+AC KHF
Sbjct: 119 APNINLFRDPRWGRGHETYGEDPFLTARLGVAFVKGMQGE----GKV---LKAAACAKHF 171

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +     R+ FDA+V+ +DL ++Y P F + V + +  G+M AYNRVNG PSCA 
Sbjct: 172 AVH---SGPEALRHSFDAQVSPKDLEESYLPAFHALVAEAKVEGVMGAYNRVNGEPSCAS 228

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQ 336
             L+ K    QWGF GY  SDC A+   +   G  K+  ++    L+ G D+NCG ++L 
Sbjct: 229 PMLMDKL--HQWGFAGYFVSDCWAIQDFHKHHGVTKNVTESAALALRTGCDLNCGNTYL- 285

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
            +  AA+++  +  ++I RA   +   R+RLGLF+  P    F     D + SPAH+ ++
Sbjct: 286 -YVLAALEEGLIDAADIRRACIRVLRTRIRLGLFDPEPH---FAACTYDTIASPAHKAVS 341

Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           L  A+  +VLLKN  G+LPL  SK  ++A+IGPNA+S   L GNY G + R +T L+ +Q
Sbjct: 342 LSCAEKSMVLLKND-GILPLDLSKLHAIAVIGPNADSRAALEGNYCGTADRYVTFLEGIQ 400

Query: 457 NYVENTVYYP-GCDTVACSSASIDKAVD-------IAKGADHVVLMMGLDQTQEKEE--- 505
           +     V+Y  GC      ++++  A D        A+ +D V+L +GLD T E EE   
Sbjct: 401 DAFPGRVHYAQGCHLYKDRTSNLAMADDRYAEALAAAEASDVVILCLGLDATLEGEEGDT 460

Query: 506 ------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
                  D+ DL LP  Q +L+ ++  A  KPVILVL  G  ++   +       ++L A
Sbjct: 461 GNEFSSGDKADLRLPPPQCKLLEKL-HAVGKPVILVLAAGSALNPEIS-----CNAVLQA 514

Query: 560 GYPGEAGAVALAEVIFGDHNP 580
            YPG+ G  ALA ++FG  +P
Sbjct: 515 WYPGQCGGQALAHILFGKVSP 535


>gi|436410475|gb|AGB57183.1| beta-xylosidase [Aspergillus sp. BCC125]
          Length = 804

 Score =  365 bits (937), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 208/546 (38%), Positives = 305/546 (55%), Gaps = 19/546 (3%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
            C  +     RA  L+S  TLDE I+   N+   + RLG+PAY+ WSEALHG+    +  
Sbjct: 68  ICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGL---DRAN 124

Query: 105 FFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F + G+   ATSFPQ ILT A+ +  L ++I   I  + RA  NAG+  G+  +APNIN 
Sbjct: 125 FSDLGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGR-YGLDVYAPNINT 183

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           FR P WGRGQETPGED  +   YA  Y+ G+QG           L+ +A  KH+  YD++
Sbjct: 184 FRHPVWGRGQETPGEDVSLAAIYAYEYITGIQGP-----DPDSNLKLAATAKHYAGYDIE 238

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           NW   +R   D  +T QDL++ Y P F    +  +   +MCAYN VNG+P+CAD   L  
Sbjct: 239 NWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQT 298

Query: 284 TARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R  +GF  HGY++SDCDA   IY+  GYA S   A  + + AG D++CG+  Q H   
Sbjct: 299 LLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNE 358

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM--QPFGKIGADVVCSPAHQVLALQA 399
           ++    L   +I++ +  L++  ++ G F+ N T    P+  +    V       ++ QA
Sbjct: 359 SITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQA 418

Query: 400 AQDGIVLLKNSHGLLPLPKS----KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           A  GIVLLKNS+ +LPL +      + ++ALIGP AN+   LLGNY G +   I+P  A 
Sbjct: 419 ATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAF 478

Query: 456 QNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPG 515
           +    N  +  G    + S++    A+  A+ AD ++   G+D T E E LDR  +  PG
Sbjct: 479 EEAGYNVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPG 538

Query: 516 RQQELITRVAEAA-KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
            Q +LI ++A +A  KP+I++ + GG VD +  K + N+ ++LW GYPG++G  AL ++I
Sbjct: 539 NQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDII 598

Query: 575 FGDHNP 580
            G  NP
Sbjct: 599 TGKKNP 604


>gi|410098444|ref|ZP_11293422.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409222318|gb|EKN15263.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 738

 Score =  365 bits (937), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 214/567 (37%), Positives = 308/567 (54%), Gaps = 49/567 (8%)

Query: 40  TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
           T  +PF    LP+  R +D++SRLTL+EK+  + ++APA+PRLGIPAY WW+EALHGVA 
Sbjct: 23  TYDYPFRNPDLPLDVRVQDIISRLTLEEKVQLMKHAAPAVPRLGIPAYNWWNEALHGVAR 82

Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN----AGQA---- 151
             + +         T FPQ I  AA+FD+    ++G     E RAL+N    AG+     
Sbjct: 83  TKEKV---------TVFPQAIGMAATFDTEALQKMGDMTSSEGRALFNEDLKAGKTGEIY 133

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
            G+T+W PNINIFRDPRWGRGQET GEDP +T K   + V G++G+          L++ 
Sbjct: 134 RGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGSAIVHGLEGNN------PEYLKSV 187

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           AC KH+  +   +     R+ +DARV+M DL DTY P F   V + +  G+MCAYNR  G
Sbjct: 188 ACAKHYAVH---SGPEHNRHSYDARVSMYDLWDTYLPAFRELVTKAKVHGVMCAYNRFEG 244

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
            P C    LL    R QW F GY+TSDC AVS          +  +AV D +  G D+ C
Sbjct: 245 TPCCGHNELLQDILRNQWKFDGYVTSDCWAVSDFAKYHKTHSNDTEAVADAVLNGTDLEC 304

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G+  QK  +  V++  + E +I+ +L  LF ++ +LG+++    + P+  IG +V+   A
Sbjct: 305 GNLYQK-LQQGVEKGLISEKDINVSLARLFEIQFKLGMYDPADRV-PYASIGREVIECDA 362

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+  A + AQ  +VLLKN+  +LPL  SK   +ALIGPN ++  TLL NY G     ITP
Sbjct: 363 HKKHAYEMAQKSMVLLKNNKNILPLNASKIKRIALIGPNMDNGSTLLANYFGTPSEIITP 422

Query: 452 LQALQNYVENTVYYPGCDTVAC-----SSASIDKAVDIAKGADHVVLMMGLDQTQE---- 502
            ++LQ    N++       V        + S  +    AK AD ++ + G+    E    
Sbjct: 423 YKSLQKRFGNSIQIDTLTGVGIVQKLEGAPSFAQVAAQAKKADIIIFVGGISADYEGEAG 482

Query: 503 ---------KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
                        DR  + LP  Q EL+  + +   +P+ILV + G    ++F    RN 
Sbjct: 483 DAGAAGYGGFASGDRTTMKLPPVQTELMKELKKTG-RPLILVNMSGSV--MSFDWESRNA 539

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNP 580
            +IL A Y G+A   A+ +V+FGD+NP
Sbjct: 540 DAILQAWYGGQAAGDAITDVLFGDYNP 566


>gi|76160898|gb|ABA40420.1| Xld [Aspergillus fumigatus]
          Length = 792

 Score =  365 bits (937), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 211/564 (37%), Positives = 318/564 (56%), Gaps = 23/564 (4%)

Query: 32  SCDPSNPSTETFPF-----CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
           + D S P  E  P      C T+     RA  LVS  T +E ++   N++P +PRLG+P 
Sbjct: 44  TIDLSFPDCENGPLSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPP 103

Query: 87  YEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
           Y+ WSEALHG+    +  F + G    ATSFP  ILT ++ +  L  +I   I  + RA 
Sbjct: 104 YQVWSEALHGL---DRANFTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAF 160

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKL 204
            N G+  G+  +APNIN FR   WGRGQETPGED   +   YA  Y+ G+QG     G  
Sbjct: 161 NNVGR-YGLDVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG-----GVD 214

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
              L+  A  KH+  YDL+NW G +R   D  +T Q+L++ Y P F    +  +   +MC
Sbjct: 215 PEHLKLVATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMC 274

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           +YN VNG+PSCA+   L    R  +GF   GY++SDCD+   +++   +A +   A  D 
Sbjct: 275 SYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAADS 334

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           ++AG D++CG+  Q +   A  ++++  +EI+R +  L+S  +RLG F+GN ++      
Sbjct: 335 IRAGTDIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYR-DLT 393

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
             DVV + A  + + +AA +GIVLLKN  G LPL KS   S+ALIGP  N    L GNY 
Sbjct: 394 WNDVVTTDAWNI-SYEAAVEGIVLLKND-GTLPLAKSVR-SVALIGPWMNVTTQLQGNYF 450

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
           GP+   I+PL A QN   +  Y  G +  + S+    +A+  AK +D ++   G+D T E
Sbjct: 451 GPAPYLISPLNAFQNSDFDVNYAFGTNISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLE 510

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E +DR+++  PG Q +LI ++++   KP+I++ + GG VD +  K ++N+ S++W GYP
Sbjct: 511 AEAMDRMNITWPGNQLQLIDQLSQLG-KPLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYP 569

Query: 563 GEAGAVALAEVIFGDHNPGEFTVI 586
           G++G  AL ++I G   P    V+
Sbjct: 570 GQSGGQALLDIITGKRAPAGRLVV 593


>gi|70996610|ref|XP_753060.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
 gi|74672055|sp|Q4WRB0.1|XYND_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|66850695|gb|EAL91022.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
          Length = 792

 Score =  365 bits (937), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 211/564 (37%), Positives = 318/564 (56%), Gaps = 23/564 (4%)

Query: 32  SCDPSNPSTETFPF-----CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
           + D S P  E  P      C T+     RA  LVS  T +E ++   N++P +PRLG+P 
Sbjct: 44  TIDLSFPDCENGPLSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPP 103

Query: 87  YEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
           Y+ WSEALHG+    +  F + G    ATSFP  ILT ++ +  L  +I   I  + RA 
Sbjct: 104 YQVWSEALHGL---DRANFTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAF 160

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKL 204
            N G+  G+  +APNIN FR   WGRGQETPGED   +   YA  Y+ G+QG     G  
Sbjct: 161 NNVGR-YGLDVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG-----GVD 214

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
              L+  A  KH+  YDL+NW G +R   D  +T Q+L++ Y P F    +  +   +MC
Sbjct: 215 PEHLKLVATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMC 274

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           +YN VNG+PSCA+   L    R  +GF   GY++SDCD+   +++   +A +   A  D 
Sbjct: 275 SYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAADS 334

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           ++AG D++CG+  Q +   A  ++++  +EI+R +  L+S  +RLG F+GN ++      
Sbjct: 335 IRAGTDIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYR-DLT 393

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
             DVV + A  + + +AA +GIVLLKN  G LPL KS   S+ALIGP  N    L GNY 
Sbjct: 394 WNDVVTTDAWNI-SYEAAVEGIVLLKND-GTLPLAKSVR-SVALIGPWMNVTTQLQGNYF 450

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
           GP+   I+PL A QN   +  Y  G +  + S+    +A+  AK +D ++   G+D T E
Sbjct: 451 GPAPYLISPLNAFQNSDFDVNYAFGTNISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLE 510

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E +DR+++  PG Q +LI ++++   KP+I++ + GG VD +  K ++N+ S++W GYP
Sbjct: 511 AEAMDRMNITWPGNQLQLIDQLSQLG-KPLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYP 569

Query: 563 GEAGAVALAEVIFGDHNPGEFTVI 586
           G++G  AL ++I G   P    V+
Sbjct: 570 GQSGGQALLDIITGKRAPAGRLVV 593


>gi|292495282|sp|B0XP71.1|XYND_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|159131796|gb|EDP56909.1| beta-xylosidase XylA [Aspergillus fumigatus A1163]
          Length = 792

 Score =  365 bits (936), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 211/564 (37%), Positives = 318/564 (56%), Gaps = 23/564 (4%)

Query: 32  SCDPSNPSTETFPF-----CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
           + D S P  E  P      C T+     RA  LVS  T +E ++   N++P +PRLG+P 
Sbjct: 44  TIDLSFPDCENGPLSKTLVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPP 103

Query: 87  YEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
           Y+ WSEALHG+    +  F + G    ATSFP  ILT ++ +  L  +I   I  + RA 
Sbjct: 104 YQVWSEALHGL---DRANFTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAF 160

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKL 204
            N G+  G+  +APNIN FR   WGRGQETPGED   +   YA  Y+ G+QG     G  
Sbjct: 161 NNVGR-YGLDVYAPNINAFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG-----GVD 214

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
              L+  A  KH+  YDL+NW G +R   D  +T Q+L++ Y P F    +  +   +MC
Sbjct: 215 PEHLKLVATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMC 274

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           +YN VNG+PSCA+   L    R  +GF   GY++SDCD+   +++   +A +   A  D 
Sbjct: 275 SYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAADS 334

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           ++AG D++CG+  Q +   A  ++++  +EI+R +  L+S  +RLG F+GN ++      
Sbjct: 335 IRAGTDIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYR-DLT 393

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
             DVV + A  + + +AA +GIVLLKN  G LPL KS   S+ALIGP  N    L GNY 
Sbjct: 394 WNDVVTTDAWNI-SYEAAVEGIVLLKND-GTLPLAKSVR-SVALIGPWMNVTTQLQGNYF 450

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
           GP+   I+PL A QN   +  Y  G +  + S+    +A+  AK +D ++   G+D T E
Sbjct: 451 GPAPYLISPLNAFQNSDFDVNYAFGTNISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLE 510

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E +DR+++  PG Q +LI ++++   KP+I++ + GG VD +  K ++N+ S++W GYP
Sbjct: 511 AEAMDRMNITWPGNQLQLIDQLSQLG-KPLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYP 569

Query: 563 GEAGAVALAEVIFGDHNPGEFTVI 586
           G++G  AL ++I G   P    V+
Sbjct: 570 GQSGGQALLDIITGKRAPAGRLVV 593


>gi|333494646|gb|AEF56854.1| putative glycosyl hydrolase [synthetic construct]
          Length = 743

 Score =  365 bits (936), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 227/563 (40%), Positives = 317/563 (56%), Gaps = 54/563 (9%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +    L   +RARDLVSR+TL+EKI+Q+ + AP+I RLG+PAY WW+EALHGVA  G   
Sbjct: 30  YRDENLSFEERARDLVSRMTLEEKIAQMQHEAPSIERLGVPAYNWWNEALHGVARAGV-- 87

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   +T FPQ I  AA+FD+ L  +    I  E RA Y+  Q  G       +TFW
Sbjct: 88  --------STMFPQAIGMAATFDAELIEKTADVISTEGRARYHEFQRKGDRDIYKGLTFW 139

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK-LQASACCKH 216
           +P INI RDPRWGRGQET GEDP +T + AVS++RG+QG        +G+ L+A+AC KH
Sbjct: 140 SPTINIDRDPRWGRGQETYGEDPYLTSRLAVSFIRGIQG--------RGRYLKAAACAKH 191

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           F  +       + R++F+A V+ +DL +TY P FE+ VK+ + +G+M AYNRVNG P C 
Sbjct: 192 FAVHSGPE---SERHQFNAEVSQKDLWETYLPAFEASVKEAKVAGVMGAYNRVNGEPCCG 248

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
              LL    R +W F GY+TSDC A+  I +  G  K+ E++    +K+G D+NCG    
Sbjct: 249 SGTLLGDVLRGEWEFGGYVTSDCWAIKDINEGHGVTKTIEESSALAVKSGCDLNCGCAYA 308

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
              K A +   + E EID A+H L   RMRLG+F+  P   P+  I  +      H+  A
Sbjct: 309 SLVK-AYRAGLIGEKEIDTAVHRLMLTRMRLGMFDA-PEKVPYSSIPYEKNDCAEHRAFA 366

Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           L+ A+  +VLL+N  G LPL +S+  S+A+IGPNA+S   L GNY G +   +T L  ++
Sbjct: 367 LEVAEKSLVLLRNRSGFLPLDRSRIRSVAVIGPNADSRVALEGNYNGTASEYVTVLDGIR 426

Query: 457 NYVEN--TVYY-PGCDTVACSSASIDK-------AVDIAKGADHVVLMMGLDQTQEKEE- 505
             V +   VYY  G      S   + +       A   A+ AD  V+ +GL++  E EE 
Sbjct: 427 EAVGDRARVYYAEGSHLFRNSMGGLSQKNDRLAEAAAAAERADVAVVCLGLNRDIEGEEG 486

Query: 506 --------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
                    D+ DL LPG Q+EL+  V +A   PV+LVLL G  + + +A  D N  +++
Sbjct: 487 DPSNEYPAGDKRDLRLPGLQEELLETV-KATGTPVVLVLLSGSALAVNWA--DENADAVV 543

Query: 558 WAGYPGEAGAVALAEVIFGDHNP 580
            A YPG A A      +FG   P
Sbjct: 544 QAWYPG-AQAEGRRGALFGIIRP 565


>gi|292495285|sp|B6EY09.1|XYND_ASPJA RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|211970990|dbj|BAG82824.1| 1,4-beta-D-xylosidase [Aspergillus japonicus]
          Length = 804

 Score =  365 bits (936), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 207/544 (38%), Positives = 304/544 (55%), Gaps = 18/544 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C +T     RA  LVS  TL+E I+   N++P +PRLG+P Y+ WSEALHG+A       
Sbjct: 60  CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLARAN--FT 117

Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
            NG    ATSFP  IL+AA+F+  L  +I   I  + RA  NAG+  G+  ++PNIN FR
Sbjct: 118 DNGAYSWATSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGR-FGLDVYSPNINTFR 176

Query: 166 DPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
            P WGRGQETPGED   +T  YA  Y+ G+QG     G     L+ +A  KHF  YD++N
Sbjct: 177 HPVWGRGQETPGEDAYTLTAAYAYEYITGIQG-----GVNPEHLKLAATAKHFAGYDIEN 231

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           W   +R   D  +T QDLA+ Y P F    +       MC+YN VNG+PSC++   L   
Sbjct: 232 WDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCSNTFFLQTL 291

Query: 285 ARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
            R  + F  HGY++ DC AV  +++  GYA +   A  D + AG D++CG+  Q H   +
Sbjct: 292 LRDTFSFVDHGYVSGDCGAVYGVFNPHGYAANEPSAAADAILAGTDIDCGTSYQYHFNES 351

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGN-PTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           +    +   +I+R    L++  + LG F+GN  +  P+  +G   V       ++ +AA 
Sbjct: 352 ITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDAWNISYEAAV 411

Query: 402 DGIVLLKNSHGLLPL---PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
           +GIVLLKN  G LPL    + K+ S+ALIGP AN+   L GNY G +   I+P+ A    
Sbjct: 412 EGIVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLISPVDAFTAA 470

Query: 459 VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
                Y PG +    S+A+   A+  A+ AD +V + G+D T E E  DR  +  PG Q 
Sbjct: 471 GYTVHYAPGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDRSSIAWPGNQL 530

Query: 519 ELITRVA--EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           ELI+++A  ++  +P+++  + GG VD +  K +  + ++LW GYPG++G +AL +++ G
Sbjct: 531 ELISQLAAQKSDDQPLVVYQMGGGQVDSSALKSNAKVNALLWGGYPGQSGGLALRDILTG 590

Query: 577 DHNP 580
              P
Sbjct: 591 ARAP 594


>gi|85813774|emb|CAJ65923.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 704

 Score =  364 bits (935), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 176/295 (59%), Positives = 231/295 (78%), Gaps = 5/295 (1%)

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
           + +  H     DCDAV++++  + YAK+PEDAV D LK+G+     S+L+ +TK+AV++K
Sbjct: 218 KHYTAHDLDNWDCDAVNVLHVEQKYAKTPEDAVADALKSGI-----SYLRNYTKSAVEKK 272

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
           K+  SEIDRALHNLFS RMRLGLFNG+PT Q +  IG D VCS  HQ LAL+AA DGIVL
Sbjct: 273 KVTVSEIDRALHNLFSTRMRLGLFNGDPTKQLYSDIGPDQVCSQEHQALALEAALDGIVL 332

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYP 466
           LKN+  LLPL KS   SLA+IGPNA+++  LLGNY GP+C+++T L+ L+NYV +  Y  
Sbjct: 333 LKNADRLLPLSKSGISSLAVIGPNAHNSTNLLGNYFGPACKNVTILEGLRNYVSSASYEK 392

Query: 467 GCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAE 526
           GC+ V+C+SA+  K V++A+  D V+L+MGLDQ+QEKE LDR+DLVLPG+Q  LIT VA+
Sbjct: 393 GCNNVSCTSAAKKKPVEMAQTEDQVILVMGLDQSQEKERLDRMDLVLPGKQPTLITAVAK 452

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           AAK+P++LVLL G P+D+TFAK +R IGSILWAGYPG+AGA ALA++IFG+HNPG
Sbjct: 453 AAKRPIVLVLLGGSPMDVTFAKNNRKIGSILWAGYPGQAGATALAQIIFGEHNPG 507



 Score =  323 bits (829), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 151/208 (72%), Positives = 176/208 (84%), Gaps = 6/208 (2%)

Query: 23  RVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
           R  ++QPP+SCD S+PST+T+ FCKTTLPIS+RA DLVSRLT +EK +QLV+++PAIPRL
Sbjct: 22  RKQASQPPYSCDSSDPSTKTYDFCKTTLPISRRAEDLVSRLTFEEKATQLVDTSPAIPRL 81

Query: 83  GIPAYEWWSEALHGVA---GVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAI 138
           GIPAYEWWSE LHG+     V +GI FFN TI+ ATSFPQVILTAASFD+++WYRIGQ +
Sbjct: 82  GIPAYEWWSEGLHGIGFLTRVQQGISFFNRTIQHATSFPQVILTAASFDAHIWYRIGQ-V 140

Query: 139 GLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDT 198
           G EARALYNAGQ  G+ FWAPN+NIFRDPRWGRGQETPGEDPLV GKY  S+VRGVQGD+
Sbjct: 141 GKEARALYNAGQVTGLGFWAPNVNIFRDPRWGRGQETPGEDPLVVGKYGASFVRGVQGDS 200

Query: 199 FNG-GKLKGKLQASACCKHFTAYDLDNW 225
           F G   L   LQASACCKH+TA+DLDNW
Sbjct: 201 FEGESTLGDHLQASACCKHYTAHDLDNW 228


>gi|121809149|sp|Q4AEG8.1|XYND_ASPAW RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|73486695|dbj|BAE19756.1| beta-xylosidase [Aspergillus awamori]
          Length = 804

 Score =  364 bits (935), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 209/546 (38%), Positives = 304/546 (55%), Gaps = 19/546 (3%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
            C  T     RA  L+S  TLDE I+   N+   + RLG+PAY+ WSEALHG+    +  
Sbjct: 68  ICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGL---DRAN 124

Query: 105 FFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F + G    ATSFPQ ILT A+ +  L ++I   I  + RA  NAG+  G+  +APNIN 
Sbjct: 125 FSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGR-YGLDVYAPNINT 183

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           FR P WGRGQETPGED  +   YA  Y+ G+QG        +  L+ +A  KH+  YD++
Sbjct: 184 FRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGP-----DPESNLKLAATAKHYAGYDIE 238

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           NW   +R   D  +T QDL++ Y P F    +  +   +MCAYN VNG+P+CAD   L  
Sbjct: 239 NWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPACADSYFLQT 298

Query: 284 TARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R  +GF  HGY++SDCDA   IY+  GYA S   A  + + AG D++CG+  Q H   
Sbjct: 299 LLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNE 358

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM--QPFGKIGADVVCSPAHQVLALQA 399
           ++    L   +I++ +  L++  ++ G F+ N T    P+  +    V       ++ QA
Sbjct: 359 SITAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLETDAWNISYQA 418

Query: 400 AQDGIVLLKNSHGLLPLPKS----KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           A  GIVLLKNS+ +LPL +      + ++ALIGP AN+   LLGNY G +   I+P  A 
Sbjct: 419 ATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAF 478

Query: 456 QNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPG 515
           +       +  G    + S++    A+  A+ AD ++   G+D T E E LDR  +  PG
Sbjct: 479 EEAGYKVNFAEGTGISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPG 538

Query: 516 RQQELITRVAEAA-KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
            Q +LI ++A AA KKP+I++ + GG VD +  K +  + ++LW GYPG++G  AL ++I
Sbjct: 539 NQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTKVSALLWGGYPGQSGGFALRDII 598

Query: 575 FGDHNP 580
            G  NP
Sbjct: 599 TGKKNP 604


>gi|291167620|dbj|BAI82526.1| 1,4-beta-D-xylosidase [Aureobasidium pullulans var. melanogenum]
          Length = 805

 Score =  364 bits (935), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 210/542 (38%), Positives = 306/542 (56%), Gaps = 23/542 (4%)

Query: 47  KTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF 106
           K+  P++ RA+ LV+  T+ EK++   N++P +PRLG+P Y+WW EALHGVA    G+ F
Sbjct: 46  KSADPVA-RAKALVAAFTVAEKLNLTGNNSPGVPRLGLPVYQWWQEALHGVAS-SPGVTF 103

Query: 107 N--GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
           N  G    ATSFPQ IL  A+FD  L   + + +  EARA  N G+A G+ FW PNIN +
Sbjct: 104 NATGQFDSATSFPQPILMGAAFDDALIQSVAEVVSTEARAFNNYGRA-GLDFWTPNINPY 162

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRGQETPGEDP     Y  S + G+QG     G+     + +A CKHF  YD+++
Sbjct: 163 RDPRWGRGQETPGEDPYHLSSYVHSLIMGLQG-----GEDPEIRKITATCKHFAGYDIES 217

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           W G  RY+ D ++  +DL + Y P F SC +       MC Y+ +NG+P+CAD  LL+  
Sbjct: 218 WNGNLRYQNDVQIPQRDLVEYYLPSFRSCARDSNVGAFMCTYSALNGVPTCADPWLLNDV 277

Query: 285 ARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
            R  WG+     ++TSDCD++  I+    ++ + + A    L AG D++CG++ Q H   
Sbjct: 278 LREHWGWTNEEQWVTSDCDSIQNIFLPHNFSDTRQGAAAAALNAGTDLDCGTYYQHHLPL 337

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A  Q  + ++ +D+AL  L++  +R G F+G P         +DV  + A Q LALQAA+
Sbjct: 338 AYSQGLINQTTVDQALVRLYTSLVRTGYFDG-PNAMYRNLTWSDVGTTHAQQ-LALQAAE 395

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           +G+VLLKN  GLLPL  S    +ALIG  AN+   + GNY G      +PL A Q     
Sbjct: 396 EGMVLLKND-GLLPLSISNGTKIALIGSWANATTQMQGNYYGVPTYLHSPLYAAQQTGAQ 454

Query: 462 TVYYPGCDTVACSSASIDKAVDI---AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
             Y  G         + D  + +   A+ AD ++ + G+D + E E +DR D+   G Q 
Sbjct: 455 VFYAQGPG--GQGDPTTDHWLPVWTAAEKADIIIYIGGVDISVEAEGMDREDINWTGAQL 512

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           ++I  +A    KP++L  + G  +D T    + NI +++W GYPG+ G VAL  +I G  
Sbjct: 513 DIIGELAMYG-KPMVLAQM-GDQLDNTPIVNNANISALIWGGYPGQDGGVALFNIITGKT 570

Query: 579 NP 580
            P
Sbjct: 571 AP 572


>gi|291556907|emb|CBL34024.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a]
          Length = 691

 Score =  364 bits (934), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 221/556 (39%), Positives = 315/556 (56%), Gaps = 60/556 (10%)

Query: 50  LPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGT 109
           L   +RA  L   L+ +E+  QL   APAI + G+P+Y WW+E LHGVA  G        
Sbjct: 9   LSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT------- 61

Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNIN 162
              AT FPQ I  AA+FD  + YR+G+ I  EARA+YN+           G+T WAPNIN
Sbjct: 62  ---ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLWAPNIN 118

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
           IFRDPRWGRG ET GEDP +T +  VS+V+G+QG+       +  L+A+AC KHF  +  
Sbjct: 119 IFRDPRWGRGHETYGEDPYLTSRLGVSFVKGIQGE-------EEYLRAAACAKHFAVH-- 169

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
            +   + R++FDARV+ +D+ +TY P F++ VK+GR  G+M AYNRVNG PSCA   L+ 
Sbjct: 170 -SGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMG 228

Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKA 341
           K   R+WGF GY  SDC A+   +       +   +    LKAG DVNCG ++L  H  A
Sbjct: 229 KL--REWGFDGYFVSDCWAIRDFHTTHKITDTAPQSAAMALKAGCDVNCGNTYL--HILA 284

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A+++  + + +I  A  +    R+RLG  + N     F  +  D++    ++ L+L+AA+
Sbjct: 285 ALEEGLITKQDIRTACIHALRTRIRLGQLDDN----EFDDLPFDIIACDGNKALSLEAAE 340

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
             +VLL N  G+LPL KS+  S+A+IGPNA+S   LLGNY G   RS+T L+ +Q+  + 
Sbjct: 341 KSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQDAFDG 399

Query: 462 TVYY-PGCDT-------VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE-------- 505
            VYY  GC         +A       +AV   + AD  V+ +GLD T E EE        
Sbjct: 400 RVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVICVGLDATLEGEEGDTGNEFA 459

Query: 506 -LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
             D+ DL LP  Q+ L+  + +   KP+I+VL  G  V+           +++ A YPG+
Sbjct: 460 SGDKPDLRLPEVQRVLLQNLKDTG-KPLIIVLAAGSSVNTECEG-----NALINAWYPGQ 513

Query: 565 AGAVALAEVIFGDHNP 580
            G  ALAE++FG+ +P
Sbjct: 514 YGGKALAEILFGEVSP 529


>gi|292495634|sp|A1CND4.2|XYND_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
          Length = 792

 Score =  364 bits (934), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 207/558 (37%), Positives = 317/558 (56%), Gaps = 22/558 (3%)

Query: 33  CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
           CD + P ++T   C T      RA  L+S  TL+E ++   N++P +PRLG+P Y+ W+E
Sbjct: 52  CD-NGPLSKTI-VCDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNE 109

Query: 93  ALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
           ALHG+    +  F + G    +TSFP  ILT ++ +  L  ++   I  + RA  NAG+ 
Sbjct: 110 ALHGL---DRAYFTDEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGR- 165

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
            G+  ++PNIN FR P WGRGQETPGED   ++  YA  Y+ G+QG     G     L+ 
Sbjct: 166 YGLDVYSPNINSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQG-----GVDPKSLKL 220

Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
            A  KH+  YD++NW G +R   D  +T QDL++ Y P F    +  +   +MC+YN VN
Sbjct: 221 VATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNAVN 280

Query: 271 GIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           G+PSCA+   L    R  +GF   GYI+SDCD+   +++   YA +   A  D ++AG D
Sbjct: 281 GVPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSSAAADSIRAGTD 340

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           ++CG+  Q +   AV Q  L  ++I+R +  L+S  MRLG F+GN +         DVV 
Sbjct: 341 IDCGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDGNSSAYR-NLTWNDVVT 399

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           + +  +       +G VLLKN  G LPL +S   S+AL+GP  N +  L GNY GP+   
Sbjct: 400 TNSWNI---SYEVEGTVLLKND-GTLPLSESIR-SIALVGPWMNVSTQLQGNYFGPAPYL 454

Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           I+PL A ++   +  Y  G +  + S+    KA+  AK +D ++   G+D + E E LDR
Sbjct: 455 ISPLDAFRDSHLDVNYAFGTNISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLDR 514

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
           +++  PG+Q ELI ++++   KP+I++ + GG VD +  K ++N+ S++W GYPG++G  
Sbjct: 515 MNITWPGKQLELIDQLSQLG-KPLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQ 573

Query: 569 ALAEVIFGDHNPGEFTVI 586
           AL ++I G   P    V+
Sbjct: 574 ALLDIITGKRAPAGRLVV 591


>gi|194400335|gb|ACF61038.1| beta-xylosidase [Aspergillus awamori]
          Length = 804

 Score =  363 bits (932), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 213/577 (36%), Positives = 314/577 (54%), Gaps = 43/577 (7%)

Query: 36  SNPSTETFPFCKTTLPIS----------------------QRARDLVSRLTLDEKISQLV 73
           +NP  + +P C  T+P+S                       RA  L+S  TLDE I+   
Sbjct: 39  ANP--DLYPLCIETIPLSFPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTG 96

Query: 74  NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWY 132
           N+   + RLG+PAY+ WSEALHG+    +  F + G+   ATSFPQ ILT A+ +  L +
Sbjct: 97  NTGLGVSRLGLPAYQVWSEALHGL---DRANFSDSGSYNWATSFPQPILTTAALNRTLIH 153

Query: 133 RIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVR 192
           +I   I  + RA  NAG+  G+  +APNIN FR P WGRGQETPGED  +   YA  Y+ 
Sbjct: 154 QIASIISTQGRAFNNAGR-YGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYIT 212

Query: 193 GVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFES 252
           G+QG           L+ +A  KH+  YD++NW   +R   D  +T QDL++ Y P F  
Sbjct: 213 GIQGP-----DPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHV 267

Query: 253 CVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEG 310
             +  +   +MCAYN VNG+P+CAD   L    R  +GF  HGY++SDCDA   IY+  G
Sbjct: 268 AARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHG 327

Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
           YA S   A  + + AG D++CG+  Q H   ++    L   +I++ +  L++  ++ G F
Sbjct: 328 YASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYF 387

Query: 371 NGNPTM--QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS----KSVSL 424
           + N T    P+  +    V       ++ QAA  GIVLLKNS+ +LPL +      + ++
Sbjct: 388 DSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTV 447

Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI 484
           ALIGP AN+   LLGNY G +   I+P  A +       +  G    + S++    A+  
Sbjct: 448 ALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSA 507

Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA-KKPVILVLLCGGPVD 543
           A+ AD ++   G+D T E E LDR  +  PG Q +LI ++A +A  KP+I++ + GG VD
Sbjct: 508 ARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVD 567

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +  K + N+ ++LW GYPG++G  AL ++I G  NP
Sbjct: 568 SSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNP 604


>gi|372209074|ref|ZP_09496876.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
           S85]
          Length = 727

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 220/556 (39%), Positives = 308/556 (55%), Gaps = 43/556 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           F  T   I +RA  LVS++TL EKI+QL N+APAI RL +P Y+WW+EALHGVA  GK  
Sbjct: 21  FLDTDKSIEERAEILVSQMTLKEKIAQLKNTAPAISRLKVPDYDWWNEALHGVARNGK-- 78

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I   A+FD  L  R+  AI  EARA Y   Q +G       +TFW
Sbjct: 79  --------ATIFPQGIGIGATFDPDLALRVASAISTEARAKYTISQQMGNHSRYAGLTFW 130

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            PN+NIFRDPRWGRGQET GEDP +  +  V++V+G+QGD  N       L+++AC KH 
Sbjct: 131 TPNVNIFRDPRWGRGQETFGEDPYLMTQMGVAFVKGLQGDDPN------YLKSAACAKH- 183

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             Y + +   + R +F+A  T QDL +TY P FE+ VK     G+M A+N V G P  A+
Sbjct: 184 --YAVHSGPESLRLEFNAVPTQQDLYETYLPAFEALVKDANVEGVMPAHNAVFGAPMAAN 241

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
           + LL+   R +WGF GY+ +DC A+  I     Y  S   A    LKAG ++NCG+   K
Sbjct: 242 KFLLTDVLRDRWGFDGYVVTDCGAIKQIKVGHKYVDSEVAAAAVALKAGTNLNCGA-TYK 300

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
             K A+ Q  + E  +      LF  R RLG+F+ + +  P+ KIG +++ S  H  LA 
Sbjct: 301 ELKKAIDQGLVTEELVHERTKQLFKTRFRLGMFDKDLSKNPYSKIGPELIHSKEHIELAR 360

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           +AAQ  IV+LKN + LLPLP    V   + GP ANS+  L+G+Y G S   +T L  + +
Sbjct: 361 EAAQKSIVMLKNKNNLLPLPTDIKVPY-VTGPFANSSDMLMGSYYGVSPGVVTILAGITD 419

Query: 458 YVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL-------- 506
            V    +  Y  G      +    + A ++A  +D  + ++GL   +E E +        
Sbjct: 420 AVSLGTSLNYRSGALPFQKNINPKNWAPNVAGMSDVTICVVGLTADREGEGVDAIASNHK 479

Query: 507 -DRVDLVLPGRQQELITRVAEAAK-KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
            DR+DL LP  Q   + ++A   K KP++LV+  G PV  +    + +  +IL   YPGE
Sbjct: 480 GDRLDLKLPENQINYVKQLAAKKKDKPLVLVIASGSPV--SLEGIEEHCDAILQIWYPGE 537

Query: 565 AGAVALAEVIFGDHNP 580
            G  A+A+V+FG  +P
Sbjct: 538 QGGNAVADVLFGKVSP 553


>gi|326202986|ref|ZP_08192853.1| glycoside hydrolase family 3 domain protein [Clostridium
           papyrosolvens DSM 2782]
 gi|325987063|gb|EGD47892.1| glycoside hydrolase family 3 domain protein [Clostridium
           papyrosolvens DSM 2782]
          Length = 712

 Score =  363 bits (931), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 222/564 (39%), Positives = 315/564 (55%), Gaps = 56/564 (9%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +L   +RA DLVS++TL+EK SQL   A  + RLGIP Y WW+EALHGVA  G   
Sbjct: 6   YLDKSLSFKERAADLVSKMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I  AA FD     +I   I  E RA YN     G       +TFW
Sbjct: 64  --------ATVFPQAIGMAAMFDDEFLEKIADVIATEGRAKYNESAKKGDRDIYKGITFW 115

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           +PN+NIFRDPRWGRG ET GEDP +T +  V++V+G+QGD    GK    L+ +AC KH+
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD----GKY---LKTAACAKHY 168

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +     R+ FDA V+ +DL +TY P FE+ VK+ +   IM AYNR NG P    
Sbjct: 169 AVH---SGPEDDRHFFDAIVSQKDLYETYLPAFEALVKEAKVESIMGAYNRTNGEPCNGS 225

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
           + LL    R  WGF G++ SDC A+   ++  G  K+P ++V   LK+G D+NCG+ +  
Sbjct: 226 KTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPTESVALALKSGCDLNCGN-MYL 284

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
               A+K+  + E +IDRA   L + RM+LG+F+ +     F  I  ++  S  H  ++L
Sbjct: 285 LILLALKEGLITEEDIDRAAIRLMTTRMKLGMFDDDCE---FDNIPYELNDSAEHNKISL 341

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           +AA+  +VLLKN  GLLPL   K  ++A+IGPNA+S+  L  NY+G   +++T ++ ++ 
Sbjct: 342 EAAKKSMVLLKND-GLLPLDSKKIKNVAVIGPNADSSLALRANYSGTPSQNVTIIEGIRK 400

Query: 458 YV-ENT-VYYPGC--------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL- 506
            V ENT V+Y           + +A     + +AV  A+ +D VVL +GLD + E E+  
Sbjct: 401 RVSENTRVWYAMGSHLFLNRDEDLAQPDDRLKEAVSAAERSDVVVLCLGLDASVEGEQND 460

Query: 507 ----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
                     D+ DL LP  Q+ L+  V  A  KP I+ LL G  + I  A       +I
Sbjct: 461 QGTVILDAGGDKADLNLPESQRNLLNAVL-ATGKPTIVALLSGSALSIGDAA--DKAAAI 517

Query: 557 LWAGYPGEAGAVALAEVIFGDHNP 580
           +   YPG  G +A AE+IFGD++P
Sbjct: 518 VQCWYPGAIGGLAFAEMIFGDYSP 541


>gi|315607899|ref|ZP_07882892.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315250368|gb|EFU30364.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 721

 Score =  363 bits (931), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 211/547 (38%), Positives = 309/547 (56%), Gaps = 44/547 (8%)

Query: 52  ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
           +S+ A+++++R+T+ EKISQL+N +PAI  LGI  Y+WWSE LHGV   G+         
Sbjct: 30  MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR--------- 80

Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIF 164
            AT FPQ I   A+FD  L   IG A+  E RA +N  + +       G+TFW+PN+NIF
Sbjct: 81  -ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIF 139

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RD RWGRG ET GEDPL++G    +YVRG+QGD          L+  AC KH   Y + +
Sbjct: 140 RDLRWGRGMETYGEDPLLSGMLGTAYVRGLQGDD------AFYLKTGACAKH---YAVHS 190

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
               TR++ D   + +DL +TY P F+  V+QGR   +M AYNRV G P    + LL+  
Sbjct: 191 GPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLTDI 250

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R+ WGF+G+I SDCDA++  Y    Y K+PE+A    +KAG++V CG    K  + A+ 
Sbjct: 251 LRKSWGFNGHIVSDCDAINDFYGGHRYVKTPEEACAAAIKAGLNVECGHTF-KAMQGALD 309

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           Q  L E+++DRAL  L   R++LG+   +    P+       +CSPAH  LAL+AA + +
Sbjct: 310 QGLLAEADLDRALFPLVMTRLKLGILEPDSAC-PYNSYDESEICSPAHTALALRAADEAM 368

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV- 463
           VLLKN +G+LPL K+   +L + GP A+ A  L+GNY G S R  T LQ + + V +   
Sbjct: 369 VLLKN-NGILPLDKNIR-TLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSGTS 426

Query: 464 --YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDLV 512
             + P    +      ++ AV+ A  A+  +++MG +   E EE          DRV + 
Sbjct: 427 VNFRPAFMQITEELNDMNWAVNEACAAEVAIVVMGNNGNMEGEEGEAIASASRGDRVGIG 486

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           LP  Q   + RV       +++VL  G P+D+   +  +   +++ A YPG+ G  AL +
Sbjct: 487 LPASQLNYLRRVKARKGGRIVVVLTGGSPIDLR--EISKLADAVVMAWYPGQEGGEALGD 544

Query: 573 VIFGDHN 579
           ++FGD N
Sbjct: 545 LLFGDKN 551


>gi|253579611|ref|ZP_04856880.1| glycoside hydrolase, family 3 domain-containing protein
           [Ruminococcus sp. 5_1_39B_FAA]
 gi|251849112|gb|EES77073.1| glycoside hydrolase, family 3 domain-containing protein
           [Ruminococcus sp. 5_1_39BFAA]
          Length = 706

 Score =  363 bits (931), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 219/554 (39%), Positives = 310/554 (55%), Gaps = 57/554 (10%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           ++A  LVS++TL EK SQL   A  + RLG+PAY +W+EALHGVA  G           A
Sbjct: 13  KKAEKLVSQMTLLEKASQLKYDAAPVKRLGVPAYNYWNEALHGVARAGV----------A 62

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
           T FPQ I  AA FD     ++G  I  E RA YNA  A        G+TFW+PN+NIFRD
Sbjct: 63  TMFPQAIAMAAVFDDEEMKKVGDIIATEGRAKYNAYSAKEDRDIYKGLTFWSPNVNIFRD 122

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T +  V +V G+QGD    G +   ++A+AC KH+  +   +  
Sbjct: 123 PRWGRGHETYGEDPYLTSRLGVKFVEGIQGD----GPV---MKAAACAKHYAVH---SGP 172

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
            + R++FDA+ +M+D+ +TY P FE+ V +     +M AYNR NG P CA + L+    R
Sbjct: 173 ESLRHEFDAQASMKDMWETYLPAFEALVTEADVEAVMGAYNRTNGEPCCAHKYLMEDVLR 232

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
            +W F G+ TSDC A+   ++      +P  +    L AG D+NCG ++L  H   A + 
Sbjct: 233 GKWKFEGHYTSDCWAIRDFHEHHMVTSTPRQSAAMALNAGCDLNCGNTYL--HMMGAYQD 290

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E +I  +   L + R  LGLF+G+     + KI   VV    H   AL+ A+   V
Sbjct: 291 GLVTEEKITESAVRLLTTRYLLGLFDGS----EYDKIPYSVVECKEHIDEALKMARKSCV 346

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-- 463
           LLKN  G+LP+ K+K  ++ +IGPNA+S   L+GNY G S   IT L+ ++    + V  
Sbjct: 347 LLKND-GVLPIDKTKVNTIGVIGPNADSRAALIGNYHGTSSEYITVLEGIREEAGDDVRI 405

Query: 464 -YYPGCD-------TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
            Y  GCD        +A     I +AV  A+ +D V+L +GL++T E EE          
Sbjct: 406 LYSQGCDLYKDKVENLAWDQDRISEAVITAENSDVVILCVGLNETLEGEEGDTGNSDASG 465

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+VDL LP  Q+ELI +V  A  KP I+VL+ G  +D+ +A+   N   IL A YPG  G
Sbjct: 466 DKVDLHLPKVQEELIEKVT-AVGKPTIVVLMAGSAIDLNYAQ--DNCNGILLAWYPGARG 522

Query: 567 AVALAEVIFGDHNP 580
             A+A+++FG  +P
Sbjct: 523 GRAIADLLFGKESP 536


>gi|167751044|ref|ZP_02423171.1| hypothetical protein EUBSIR_02029 [Eubacterium siraeum DSM 15702]
 gi|167655962|gb|EDS00092.1| glycosyl hydrolase family 3 C-terminal domain protein [Eubacterium
           siraeum DSM 15702]
          Length = 691

 Score =  362 bits (930), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 220/556 (39%), Positives = 315/556 (56%), Gaps = 60/556 (10%)

Query: 50  LPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGT 109
           L   +RA  L   L+ +E+  QL   APAI + G+P+Y WW+E LHGVA  G        
Sbjct: 9   LSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT------- 61

Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNIN 162
              AT FPQ I  AA+FD  + YR+G+ I  EARA+YN+           G+T WAPNIN
Sbjct: 62  ---ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLWAPNIN 118

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
           IFRDPRWGRG ET GEDP +T +  V++V+G+QG+       +  L+A+AC KHF  +  
Sbjct: 119 IFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGE-------EEYLRAAACAKHFAVH-- 169

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
            +   + R++FDARV+ +D+ +TY P F++ VK+GR  G+M AYNRVNG PSCA   L+ 
Sbjct: 170 -SGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMG 228

Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKA 341
           K   R+WGF GY  SDC A+   +       +   +    LKAG DVNCG ++L  H  A
Sbjct: 229 KL--REWGFDGYFVSDCWAIRDFHTTHKITDTAPQSAAMALKAGCDVNCGNTYL--HILA 284

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A+++  + +  I  A  +    R+RLG  + N     F  +  D++    ++ L+L+AA+
Sbjct: 285 ALEEGLITKQNIRTACIHALRTRIRLGQLDDN----EFDDLPFDIIACDGNKALSLEAAE 340

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
             +VLL N  G+LPL KS+  S+A+IGPNA+S   LLGNY G   RS+T L+ +Q+  + 
Sbjct: 341 KSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQDAFDG 399

Query: 462 TVYY-PGCDT-------VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE-------- 505
            VYY  GC         +A       +AV   + AD  V+ +GLD T E EE        
Sbjct: 400 RVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDATLEGEEGDTGNEFA 459

Query: 506 -LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
             D+ DL LP  Q+ L+ ++ +   KP+I+VL  G  V+           +++ A YPG+
Sbjct: 460 SGDKPDLRLPEVQRVLLQKLKDTG-KPLIIVLAAGSSVNTECEG-----NALINAWYPGQ 513

Query: 565 AGAVALAEVIFGDHNP 580
            G  ALAE++FG+ +P
Sbjct: 514 YGGKALAEILFGEVSP 529


>gi|292495281|sp|C0STH4.1|XYND_ASPAC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|225878711|dbj|BAH30675.1| beta-xylosidase [Aspergillus aculeatus]
          Length = 805

 Score =  362 bits (929), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 207/546 (37%), Positives = 307/546 (56%), Gaps = 21/546 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C +T     RA  LVS  TL+E I+   N++P +PRLG+P Y+ WSEALHG+   G+  F
Sbjct: 60  CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGL---GRANF 116

Query: 106 F-NGTIR-GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
             NG +  G  SFP  IL+AA+F+  L  +I   I  + RA  NAG+  G+  ++PNIN 
Sbjct: 117 TDNGALHAGRPSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGR-FGLDVYSPNINT 175

Query: 164 FRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
           FR P WGRGQETPGED   +T  YA  Y+ G+QG     G     L+ +A  KHF  YD+
Sbjct: 176 FRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQG-----GVNPEHLKLAATAKHFAGYDI 230

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           +NW   +R   D  +T QDLA+ Y P F    +       MC+YN VNG+PSC++   L 
Sbjct: 231 ENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPSCSNTFFLQ 290

Query: 283 KTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
              R  + F  HGY++ DC AV  +++  GYA +   A  D + AG D++CG+  Q H  
Sbjct: 291 TLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAANEPSAAADAILAGTDIDCGTSYQYHFN 350

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN-PTMQPFGKIGADVVCSPAHQVLALQA 399
            ++    +   +I+R    L++  + LG F+GN  +  P+  +G   V       ++ +A
Sbjct: 351 ESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKTDAWNISYEA 410

Query: 400 AQDGIVLLKNSHGLLPL---PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           A +GIVLLKN  G LPL    + K+ S+ALIGP AN+   L GNY G +   I+P+ A  
Sbjct: 411 AVEGIVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPYLISPVDAFT 469

Query: 457 NYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
                  Y PG +    S+A+   A+  A+ AD +V + G+D T E E  DR  +  PG 
Sbjct: 470 AAGYTVHYAPGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQDRSSIAWPGN 529

Query: 517 QQELITRVA--EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           Q ELI+++A  ++  +P+++  + GG VD +  K++  + ++LW GYPG++G +AL +++
Sbjct: 530 QLELISQLAAQKSDDQPLVVYQMGGGQVDSSSLKFNAKVNALLWGGYPGQSGGLALRDIL 589

Query: 575 FGDHNP 580
            G   P
Sbjct: 590 TGARAP 595


>gi|2791278|emb|CAA93248.1| beta-xylosidase [Trichoderma reesei]
 gi|340519464|gb|EGR49702.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
          Length = 797

 Score =  362 bits (929), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 210/541 (38%), Positives = 313/541 (57%), Gaps = 22/541 (4%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG---K 102
           C ++    +RA+ L+S  TL+E I    NS P +PRLG+P Y+ W+EALHG+       K
Sbjct: 63  CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           G    G    ATSFP  ILT A+ +  L ++I   I  +ARA  N+G+  G+  +APN+N
Sbjct: 123 G----GQFEWATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGR-YGLDVYAPNVN 177

Query: 163 IFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
            FR P WGRGQETPGED   ++  Y   Y+ G+QG     G     L+ +A  KHF  YD
Sbjct: 178 GFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG-----GVDPEHLKVAATVKHFAGYD 232

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
           L+NW   +R  FDA +T QDL++ Y P F +  +  ++  +MCAYN VNG+PSCA+   L
Sbjct: 233 LENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCAYNSVNGVPSCANSFFL 292

Query: 282 SKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
               R  WGF   GY++SDCDAV  +++   YA +   A    L+AG D++CG     H 
Sbjct: 293 QTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHL 352

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
             +    ++   EI+R++  L++  +RLG F+     +  G    DVV + A  + + +A
Sbjct: 353 NESFVAGEVSRGEIERSVTRLYANLVRLGYFDKKNQYRSLG--WKDVVKTDAWNI-SYEA 409

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A +GIVLLKN  G LPL K K  S+ALIGP AN+   + GNY GP+   I+PL+A +   
Sbjct: 410 AVEGIVLLKND-GTLPLSK-KVRSIALIGPWANATTQMQGNYYGPAPYLISPLEAAKKAG 467

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
            +  +  G +    S+    KA+  AK +D ++ + G+D T E+E  DR D+  PG Q +
Sbjct: 468 YHVNFELGTEIAGNSTTGFAKAIAAAKKSDAIIYLGGIDNTIEQEGADRTDIAWPGNQLD 527

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           LI +++E   KP++++ + GG VD +  K ++ + S++W GYPG++G VAL +++ G   
Sbjct: 528 LIKQLSEVG-KPLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRA 586

Query: 580 P 580
           P
Sbjct: 587 P 587


>gi|291530120|emb|CBK95705.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum 70/3]
          Length = 689

 Score =  362 bits (929), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 219/554 (39%), Positives = 315/554 (56%), Gaps = 58/554 (10%)

Query: 50  LPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGT 109
           L   +RA  L   L+ +E+  QL   APAI + G+P+Y WW+E LHGVA  G        
Sbjct: 9   LSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT------- 61

Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNIN 162
              AT FPQ I  AA+FD  +  R+G+ +  EARA+YN+           G+T WAPNIN
Sbjct: 62  ---ATVFPQAIALAAAFDKDMMCRVGEVVSTEARAMYNSAAKHGDTDIYKGLTLWAPNIN 118

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
           IFRDPRWGRG ET GEDP +T +  V++V+G+QG+       +  L+A+AC KHF  +  
Sbjct: 119 IFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGE-------EKYLRAAACAKHFAVH-- 169

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
            +   + R++FDARV+ +DL +TY P F++ VK+GR  G+M AYNRVNG PSCA   L+ 
Sbjct: 170 -SGPESLRHEFDARVSEKDLEETYLPAFKALVKEGRVEGVMGAYNRVNGEPSCASEKLMG 228

Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKA 341
           K   R+WGF GY  SDC A+   +       +   +    LKAG DVNCG ++L  H  A
Sbjct: 229 KL--REWGFDGYFVSDCGAIRDFHTNHKITDTAPQSAAMALKAGCDVNCGNTYL--HILA 284

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A+++  + + +I  A  +    R+RLG  + N     F  +  D++    ++ L+L+AA+
Sbjct: 285 ALEEGLITKQDIRTACIHALRTRIRLGQLDDN----EFDDLPFDIIACDGNKALSLEAAE 340

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
             +VLL N  G+LPL KS+  S+A+IGPNA+S   LLGNY G   RS+T L+ +Q+  + 
Sbjct: 341 KSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYEGTPDRSVTFLEGIQDAFDG 399

Query: 462 TVYY-PGCDT-------VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE-------L 506
            VYY  GC         +A       +AV   + AD  V+ +GLD T E EE        
Sbjct: 400 RVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDSTLEGEEGDTENKSG 459

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+ DL LP  Q+ L+ ++ +   KP+I+VL  G  V+           +++ A YPG+ G
Sbjct: 460 DKPDLRLPEVQRVLLQKLKDTG-KPLIIVLAAGSSVNTECEG-----NALINAWYPGQYG 513

Query: 567 AVALAEVIFGDHNP 580
             ALAE++FG+ +P
Sbjct: 514 GKALAEILFGEVSP 527


>gi|222618262|gb|EEE54394.1| hypothetical protein OsJ_01415 [Oryza sativa Japonica Group]
          Length = 776

 Score =  362 bits (928), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 219/597 (36%), Positives = 309/597 (51%), Gaps = 123/597 (20%)

Query: 33  CDPSNPSTET-----FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 87
           CDP+  +        FP+C  +LP + R RDLV R+TL+EK++ L + A   PR+G+P Y
Sbjct: 47  CDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY 106

Query: 88  EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEAR---- 143
                        G G       R  T+ P    T+A  D  +W R  +   L AR    
Sbjct: 107 ------------CGGG-------RRCTACP----TSARRD-VVWRRRARRHQLPARHQQR 142

Query: 144 --------------------ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
                                +YN G A  +T+W+PNIN+ RDPRWGR  ETPGEDP V 
Sbjct: 143 RVVQRDAVARHRRRGVDGDQGMYNLGHA-ELTYWSPNINVVRDPRWGRASETPGEDPFVV 201

Query: 184 GKYAVSYVRGVQG---------DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
           G+YAV++VRG+Q                     ++ S+CCKH+ A               
Sbjct: 202 GRYAVNFVRGMQDIDGATTAASAAAATDAFSRPIKVSSCCKHYAA--------------- 246

Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
                             CV        MC+YNR+NG+P+CAD  LL++T RR W  HGY
Sbjct: 247 ------------------CV--------MCSYNRINGVPACADARLLTETVRRDWQLHGY 280

Query: 295 ITSDCDAVSI-IYDAEGYAKSPEDAVVDVLKAGMDVNCG-------SFLQKHTKAAVKQK 346
           I SDCD+V + + DA+    +  +A    +KAG+D++CG        F   +   AV+Q 
Sbjct: 281 IVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQG 340

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
           KL ES +D AL NL+   MRLG F+G P ++    +GA  VC+  H+ LA  AA+ G+VL
Sbjct: 341 KLKESAVDNALTNLYLTLMRLGFFDGIPELE---SLGAADVCTEEHKELAADAARQGMVL 397

Query: 407 LKNSHGLLPLPKSKSVSLALIGP--NANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
           LKN   LLPL   K  S+AL G   + N+   +LG+Y G  CR +TP   ++  V +T  
Sbjct: 398 LKNDAALLPLSPEKVNSVALFGQLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSV 457

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
           +      AC   S D A   AK  D  +++ GL+ + E+E  DR DL+LP  Q   I  V
Sbjct: 458 H------ACDKGSCDTAAAAAKTVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAV 511

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           AEA+  P++LV++  G VD++FA+ +  IG+++WAGYPGE G  A+A+V+FG +NPG
Sbjct: 512 AEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPG 568


>gi|150019484|ref|YP_001311738.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
           8052]
 gi|149905949|gb|ABR36782.1| glycoside hydrolase, family 3 domain protein [Clostridium
           beijerinckii NCIMB 8052]
          Length = 709

 Score =  361 bits (927), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 216/554 (38%), Positives = 312/554 (56%), Gaps = 56/554 (10%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           ++A++LV ++TL+EK  QL   + A+ RL +P Y WW+E LHGVA  G           A
Sbjct: 14  EKAKELVGKMTLEEKAEQLTYKSSAVKRLNVPRYNWWNEGLHGVARAGT----------A 63

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
           T FPQ I  AA FD  L   I + I  E RA YN            G+TFW+PN+NIFRD
Sbjct: 64  TVFPQAIGLAAMFDDELLNYIAKVISTEGRAKYNENSKKDDRDIYKGITFWSPNVNIFRD 123

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T +  V++V+G+QG+    GK    L+A+AC KHF  +   +  
Sbjct: 124 PRWGRGHETYGEDPYLTSRLGVAFVKGLQGE----GKY---LKAAACAKHFAVH---SGP 173

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R++FDA V+ +DL +TY P FE+CVK+G    +M AYNR NG P C  + LL    R
Sbjct: 174 EGLRHEFDAVVSKKDLYETYLPAFEACVKEGDVEAVMGAYNRTNGEPCCGSKTLLRDILR 233

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQ 345
            +W F G++ SDC A++  +       +  ++    +K G D+NCG+ +LQ     A K+
Sbjct: 234 GKWNFKGHVVSDCWAIADFHLHHRVTSTATESAALAMKNGCDLNCGNVYLQ--LLLAYKE 291

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E +I  A   L + R+RLG+F+       + KI  ++     H  L+L+AA++ +V
Sbjct: 292 GLVTEEDITTAAERLMATRIRLGMFDEEC---EYNKIPYELNDCKEHNELSLKAARNSMV 348

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-- 463
           LLKN +G+LPL K+   S+A+IGPNA+S   L GNY+G + R IT L+ +   V   V  
Sbjct: 349 LLKN-NGILPLNKNNLKSIAVIGPNADSQIMLKGNYSGTASRYITVLEGIHEAVGEDVRV 407

Query: 464 -YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
            Y  GC       + +A  +  + +A+ IA+ +D  +L +GLD T E E+          
Sbjct: 408 YYSEGCHLFRDRVEELAEPNDRLKEAISIAERSDVAILCLGLDSTIEGEQGDAGNSEGAG 467

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+  L LPGRQQEL+ ++ E    PVILV+  G    +TF   +    +IL A YPG  G
Sbjct: 468 DKASLNLPGRQQELLEKIIETG-TPVILVIGAGSA--LTFNNAEDKCSAILDAWYPGSRG 524

Query: 567 AVALAEVIFGDHNP 580
             A+A++IFG  +P
Sbjct: 525 GRAVADLIFGKCSP 538


>gi|171678585|ref|XP_001904242.1| hypothetical protein [Podospora anserina S mat+]
 gi|170937362|emb|CAP62020.1| unnamed protein product [Podospora anserina S mat+]
          Length = 800

 Score =  361 bits (927), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 212/547 (38%), Positives = 308/547 (56%), Gaps = 23/547 (4%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C TTL   +RA  LV+ LT +EK+  +V+ +   PR+G+PAY WWSEALHGVA      F
Sbjct: 40  CNTTLSPPERAAALVAALTPEEKLQNIVSKSLGAPRIGLPAYNWWSEALHGVAYAPGTQF 99

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           +  +G    +TSFP  +L AA+FD  L  +I + IG+E RA  NAG + G+ +W PN+N 
Sbjct: 100 WQGDGPFNSSTSFPMPLLMAATFDDELLEKIAEVIGIEGRAFGNAGFS-GLDYWTPNVNP 158

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRG ETPGED L+  +YA + ++G++G        + + +  A CKH+ A D +
Sbjct: 159 FKDPRWGRGSETPGEDVLLVKRYAAAMIKGLEGPV-----PEKERRVVATCKHYAANDFE 213

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W G TR+ F+A++++QD+A+ Y  PF+ CV+  R   IMCAYN VNG+PSCA   LL  
Sbjct: 214 DWNGATRHNFNAKISLQDMAEYYFMPFQQCVRDSRVGSIMCAYNAVNGVPSCASPYLLQT 273

Query: 284 TARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R  W +   + YITSDC+AV  +     YA +  +      +AGMD +C         
Sbjct: 274 ILREHWNWTEHNNYITSDCEAVLDVSLNHKYAATNAEGTAISFEAGMDTSCEYEGSSDIP 333

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A  Q  L ES +DRAL  L+   +R G F+G  ++  +  +G   V  P+ Q L+LQAA
Sbjct: 334 GAWSQGLLKESTVDRALLRLYEGIVRAGYFDGKQSL--YSSLGWADVNKPSAQKLSLQAA 391

Query: 401 QDGIVLLKNSHGLLPLP----KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
            DG VLLKN  G LPL     KS+   +A+IG  +++   L G Y+G +    TP  A  
Sbjct: 392 VDGTVLLKND-GTLPLSDLLDKSRPKKVAMIGFWSDAKDKLRGGYSGTAAYLHTPAYAAS 450

Query: 457 NY--VENTVYYPGCDT-VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
                 +T   P   + +A + +  D A+  AK AD+++   G+D +   E  DR DL  
Sbjct: 451 QLGIPFSTASGPILHSDLASNQSWTDNAMAAAKDADYILYFGGIDTSAAGETKDRYDLDW 510

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PG Q  LI  +   +K   ++VL  G  +D T    +  I +ILWA +PG+ G  A+ E+
Sbjct: 511 PGAQLSLINLLTTLSKP--LIVLQMGDQLDNTPLLSNPKINAILWANWPGQDGGTAVMEL 568

Query: 574 IFGDHNP 580
           + G  +P
Sbjct: 569 VTGLKSP 575


>gi|67523807|ref|XP_659963.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
 gi|74597492|sp|Q5BAS1.1|XYND_EMENI RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|40745314|gb|EAA64470.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
 gi|259487761|tpe|CBF86686.1| TPA: Beta-xylosidase (EC 3.2.1.37)
           [Source:UniProtKB/TrEMBL;Acc:O42810] [Aspergillus
           nidulans FGSC A4]
          Length = 803

 Score =  361 bits (927), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 200/541 (36%), Positives = 307/541 (56%), Gaps = 19/541 (3%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P C  +L    RA  LVS  T DE ++   N+   + RLG+P Y+ W EALHGV   G+ 
Sbjct: 59  PVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGV---GRA 115

Query: 104 IFF-NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
            F  +G    ATSFP  I   A+ +  L ++IG  +  + RA  NAG   G+  ++PNIN
Sbjct: 116 NFVESGNFSWATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLG-GVDVYSPNIN 174

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
            FR P WGRGQETPGED  +T  Y   Y+  +QG     G     L+  A  KH+  YD+
Sbjct: 175 TFRHPVWGRGQETPGEDAFLTSVYGYEYITALQG-----GVDPETLKIIATAKHYAGYDI 229

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           ++W   +R   D ++T Q+L++ Y PPF    +  +   +MC+YN VNG+PSCA++  L 
Sbjct: 230 ESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQ 289

Query: 283 KTARRQWGFH--GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
              R  + F   GY++ DC AV  +++  GYA +   A  D + AG D++CG+  Q H++
Sbjct: 290 TLLRDTFEFSEDGYVSGDCGAVYNVWNPHGYASNEAAASADSILAGTDIDCGTSYQWHSE 349

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A +   +  S+I+R +  L+S  ++ G F+G     P+  I  D V S     +A +AA
Sbjct: 350 DAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDA--PYRDITWDDVLSTDAWNIAYEAA 407

Query: 401 QDGIVLLKNSHGLLPLPKSKSV-SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
            +GIVLLKN      LP SK + S+A+IGP AN  + L GNY GP+   I+PL   ++  
Sbjct: 408 VEGIVLLKNDE---TLPLSKDIKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSG 464

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
            +  Y  G +  + S++  ++A+  AK AD ++   G+D T E E +DR ++  PG Q +
Sbjct: 465 LDVHYALGTNLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLD 524

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           LI++++E   KP++++ + GG VD +  K + N+ +++W GYPG++G  ALA++I G   
Sbjct: 525 LISKLSELG-KPLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRA 583

Query: 580 P 580
           P
Sbjct: 584 P 584


>gi|4235093|gb|AAD13106.1| beta-xylosidase [Aspergillus niger]
          Length = 804

 Score =  361 bits (927), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 212/577 (36%), Positives = 314/577 (54%), Gaps = 43/577 (7%)

Query: 36  SNPSTETFPFCKTTLPIS----------------------QRARDLVSRLTLDEKISQLV 73
           +NP  + +P C  T+P+S                       RA  L+S  TLDE I+   
Sbjct: 39  ANP--DLYPLCIETIPLSFPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTG 96

Query: 74  NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWY 132
           N+   + RLG+PAY+ WSEALHG+    +  F + G+   ATSFPQ ILT A+ +  L +
Sbjct: 97  NTGLGVSRLGLPAYQVWSEALHGL---DRANFSDSGSYNWATSFPQPILTTAALNRTLIH 153

Query: 133 RIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVR 192
           +I   I  + RA  NAG+  G+  +APNIN FR P WGRGQETPGED  +   YA  Y+ 
Sbjct: 154 QIASIISTQGRAFNNAGR-YGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYIT 212

Query: 193 GVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFES 252
           G+QG           L+ +A  KH+  YD++NW   +R   D  +T QDL++ Y P F  
Sbjct: 213 GIQGP-----DPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHV 267

Query: 253 CVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEG 310
             +  +   +MCAYN V+G+P+CAD   L    R  +GF  HGY++SDCDA   IY+  G
Sbjct: 268 AARDAKVHSVMCAYNAVDGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHG 327

Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
           YA S   A  + + AG D++CG+  Q H   ++    L   +I++ +  L++  ++ G F
Sbjct: 328 YASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYF 387

Query: 371 NGNPTM--QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS----KSVSL 424
           + N T    P+  +    V       ++ QAA  GIVLLKNS+ +LPL +      + ++
Sbjct: 388 DSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTV 447

Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI 484
           ALIGP AN+   LLGNY G +   I+P  A +       +  G    + S++    A+  
Sbjct: 448 ALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSA 507

Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA-KKPVILVLLCGGPVD 543
           A+ AD ++   G+D T E E LDR  +  PG Q +LI ++A +A  KP+I++ + GG VD
Sbjct: 508 ARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVD 567

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +  K + N+ ++LW GYPG++G  AL ++I G  NP
Sbjct: 568 SSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNP 604


>gi|60729621|pir||JC7966 xylan 1,4-beta-xylosidase (EC 3.2.1.37) - Talaromyces emersonii
 gi|21326570|gb|AAL32053.2|AF439746_1 beta-xylosidase [Rasamsonia emersonii]
          Length = 796

 Score =  361 bits (926), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 209/558 (37%), Positives = 316/558 (56%), Gaps = 23/558 (4%)

Query: 32  SCDPSNPSTETFPF-----CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
           + D S P  E  P      C T+     RA  LVS  TL+E I+   N+AP +PRLG+P 
Sbjct: 45  TIDLSFPDCENGPLSTNLVCNTSADPWARAEALVSLFTLEELINNTQNTAPGVPRLGLPQ 104

Query: 87  YEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
           Y+ W+EALHG+    +  F + G    ATSFP  IL+ ASF+  L  +I   I  +ARA 
Sbjct: 105 YQVWNEALHGL---DRANFSDSGEYSWATSFPMPILSMASFNRTLINQIASIIATQARAF 161

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKL 204
            NAG+  G+  +APNIN FR P WGRGQETPGED   ++  YA  Y+ G+QG     G  
Sbjct: 162 NNAGR-YGLDSYAPNINGFRSPLWGRGQETPGEDAFFLSSAYAYEYITGLQG-----GVD 215

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
              ++  A  KHF  YDL+NW   +R   +A +T QDL++ Y P F +  +  +   +MC
Sbjct: 216 PEHVKIVATAKHFAGYDLENWGNVSRLGSNAIITQQDLSEYYTPQFLASARYAKTRSLMC 275

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           +YN VNG+PSC++   L    R  + F   GY++SDCDAV  +++  GYA +   A  D 
Sbjct: 276 SYNAVNGVPSCSNSFFLQTLLRESFNFVDDGYVSSDCDAVYNVFNPHGYALNQSGAAADS 335

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           L AG D++CG  +  H   +  ++ +   +I+++L  L++  +RLG F+GN ++  +  +
Sbjct: 336 LLAGTDIDCGQTMPWHLNESFYERYVSRGDIEKSLTRLYANLVRLGYFDGNNSV--YRNL 393

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
             + V +     ++ +AA +GI LLKN  G LPL K K  S+ALIGP AN+   + GNY 
Sbjct: 394 NWNDVVTTDAWNISYEAAVEGITLLKND-GTLPLSK-KVRSIALIGPWANATVQMQGNYY 451

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
           G     I+PL+A +       Y  G +    S+    +A+  AK +D ++   G+D T E
Sbjct: 452 GTPPYLISPLEAAKASGFTVNYAFGTNISTDSTQWFAEAISAAKKSDVIIYAGGIDNTIE 511

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E  DR DL  PG Q +LI ++++   KP++++ + GG VD +  K ++N+ +++W GYP
Sbjct: 512 AEGQDRTDLKWPGNQLDLIEQLSKVG-KPLVVLQMGGGQVDSSSLKANKNVNALVWGGYP 570

Query: 563 GEAGAVALAEVIFGDHNP 580
           G++G  AL +++ G   P
Sbjct: 571 GQSGGAALFDILTGKRAP 588


>gi|354508473|gb|AER26905.1| beta-xylosidase 3 [synthetic construct]
          Length = 778

 Score =  361 bits (926), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 212/577 (36%), Positives = 314/577 (54%), Gaps = 43/577 (7%)

Query: 36  SNPSTETFPFCKTTLPIS----------------------QRARDLVSRLTLDEKISQLV 73
           +NP  + +P C  T+P+S                       RA  L+S  TLDE I+   
Sbjct: 13  ANP--DLYPLCIETIPLSFPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTG 70

Query: 74  NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWY 132
           N+   + RLG+PAY+ WSEALHG+    +  F + G+   ATSFPQ ILT A+ +  L +
Sbjct: 71  NTGLGVSRLGLPAYQVWSEALHGL---DRANFSDSGSYNWATSFPQPILTTAALNRTLIH 127

Query: 133 RIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVR 192
           +I   I  + RA  NAG+  G+  +APNIN FR P WGRGQETPGED  +   YA  Y+ 
Sbjct: 128 QIASIISTQGRAFNNAGR-YGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYIT 186

Query: 193 GVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFES 252
           G+QG           L+ +A  KH+  YD++NW   +R   D  +T QDL++ Y P F  
Sbjct: 187 GIQGP-----DPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHV 241

Query: 253 CVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEG 310
             +  +   +MCAYN V+G+P+CAD   L    R  +GF  HGY++SDCDA   IY+  G
Sbjct: 242 AARDAKVHSVMCAYNAVDGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHG 301

Query: 311 YAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
           YA S   A  + + AG D++CG+  Q H   ++    L   +I++ +  L++  ++ G F
Sbjct: 302 YASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYF 361

Query: 371 NGNPTM--QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS----KSVSL 424
           + N T    P+  +    V       ++ QAA  GIVLLKNS+ +LPL +      + ++
Sbjct: 362 DSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTV 421

Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI 484
           ALIGP AN+   LLGNY G +   I+P  A +       +  G    + S++    A+  
Sbjct: 422 ALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGTGISSTSTSGFAAALSA 481

Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA-KKPVILVLLCGGPVD 543
           A+ AD ++   G+D T E E LDR  +  PG Q +LI ++A +A  KP+I++ + GG VD
Sbjct: 482 ARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVD 541

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +  K + N+ ++LW GYPG++G  AL ++I G  NP
Sbjct: 542 SSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNP 578


>gi|375150455|ref|YP_005012896.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361064501|gb|AEW03493.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 711

 Score =  360 bits (925), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 220/563 (39%), Positives = 307/563 (54%), Gaps = 45/563 (7%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
           N   +T  F     P+  R  DL+ +LTL EKIS L   +  + RLGIPAY WW+EALHG
Sbjct: 9   NTQAQTSVFRNPQQPMEARVNDLLHQLTLPEKISLLGYRSKEVERLGIPAYNWWNEALHG 68

Query: 97  VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG------- 149
           VA  G           AT FPQ I  AA+F+  L       I  EARA YN         
Sbjct: 69  VARAGV----------ATVFPQAIGMAATFNDDLLKEAATVISTEARAKYNLSLAQGRHL 118

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
           Q +G+TFW+PNINIFRDPRWGRGQET GEDP +T     ++V+G+QG+          L+
Sbjct: 119 QYMGLTFWSPNINIFRDPRWGRGQETYGEDPFLTAHMGTAFVKGLQGND------PRYLK 172

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
           ASAC KHF  +   +     R+ F+A V  +DL +TY   F + V  G  S +MCAYNRV
Sbjct: 173 ASACAKHFAVH---SGPENGRHTFNAIVDEKDLRETYLYAFHALVDAGVES-VMCAYNRV 228

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           N  P C+   LL+   R +W F G++ +DC A+  I+       S  +     +KAG+++
Sbjct: 229 NDQPCCSGNFLLNSILRNEWKFKGHVVTDCGALDDIFMRHKVMPSGVEVAAAAIKAGVNL 288

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           +C + LQK  + AV+QK L E +ID +L +L   +++LG ++ +PT  PF K GAD V +
Sbjct: 289 DCSNVLQKDVEKAVEQKLLNEKDIDSSLAHLLRTQIKLGFYD-DPTANPFYKYGADSVAN 347

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
            AH  LA   AQ  +VLLKNS+ LLPL K K  ++ ++G N+ S   LLGNY G S R++
Sbjct: 348 TAHATLARAMAQQSMVLLKNSNQLLPLDKKKYPAIMVVGTNSASMDALLGNYHGVSNRAV 407

Query: 450 TPLQALQNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE- 505
           + ++ + N V+      Y  G D    +  +    +  A  AD  V ++GL    E EE 
Sbjct: 408 SFVEGITNAVDAGTRVEYDQGSD---YNDTTHFGGIWAAGNADITVAVIGLTPVYEGEEG 464

Query: 506 --------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
                    D+ D+ LP      +  + +A KKP+I V+  G  VDI+    +    +IL
Sbjct: 465 DAFLAAKGGDKPDMSLPAAHIAFMKALRKANKKPIIAVITAGSAVDIS--AIEPYADAIL 522

Query: 558 WAGYPGEAGAVALAEVIFGDHNP 580
            A YPGE G  ALA+++FG  +P
Sbjct: 523 LAWYPGEQGGNALADILFGKVSP 545


>gi|238483831|ref|XP_002373154.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
 gi|292495283|sp|B8MYV0.1|XYND_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|220701204|gb|EED57542.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
          Length = 797

 Score =  360 bits (925), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 208/552 (37%), Positives = 310/552 (56%), Gaps = 22/552 (3%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           S P ++T   C T+     RA  LVS LT +E ++   N+    PR+G+PAY+ W+EALH
Sbjct: 54  SGPLSKTL-VCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALH 112

Query: 96  GVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
           GVA      F + G    +TSFPQ I T A+ +  L ++I   I  + RA  NAG+  G+
Sbjct: 113 GVA---HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR-YGL 168

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
             ++PNIN FR P WGRGQETPGED   +   YA  Y+ G+QG     G     L+  A 
Sbjct: 169 DVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG-----GVDANPLKLIAT 223

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  YD++NW   +R   D ++T QDLA+ Y P F    +  +   +MC+YN VNG+P
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283

Query: 274 SCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
           SC++   L    R  + F   GY++ DC AV  +++  GYA +   A  D ++AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G    +H + +   +++   +++R +  L++  +R G F+G  +  P+  I    V S  
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSV-SLALIGPNANSAKTLLGNYAGPSCRSIT 450
            Q L+ +AA   IVLLKN  G+LPL  S S  ++ALIGP AN+   +LGNY GP+   I+
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTTSSSTKTIALIGPWANATTQMLGNYYGPAPYLIS 460

Query: 451 PLQALQNYVENTVYYPGCDTVAC--SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           PLQA Q+  E  + Y           S S   A+  AK AD ++   G+D T E E  DR
Sbjct: 461 PLQAFQD-SEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQDR 519

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
            ++  P  Q  LIT++A+   KP+I++ + GG VD +  K ++N+ +++W GYPG++G  
Sbjct: 520 SNITWPSNQLSLITKLADLG-KPLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGGQ 578

Query: 569 ALAEVIFGDHNP 580
           ALA++I G   P
Sbjct: 579 ALADIITGKRAP 590


>gi|358393086|gb|EHK42487.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
           206040]
          Length = 794

 Score =  360 bits (925), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 211/541 (39%), Positives = 309/541 (57%), Gaps = 22/541 (4%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG---K 102
           C +T    +RA+ L+S  TL+E I    NS P +PRLG+P Y+ W+EALHG+       K
Sbjct: 64  CDSTAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 123

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           G    G     TSFP  IL+ A+ +  L ++I   I  +ARA  N G+  G+  +APNIN
Sbjct: 124 G----GEFEWGTSFPMPILSMAALNRTLIHQIADIISTQARAFSNNGR-YGLDVYAPNIN 178

Query: 163 IFRDPRWGRGQETPGEDP-LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
            FR P WGRGQETPGED  ++T  Y   Y+ G+QG     G     L+ +A  KHF  YD
Sbjct: 179 GFRSPLWGRGQETPGEDANVLTSAYTYEYITGMQG-----GVDPENLKIAATAKHFAGYD 233

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
           L+N+   +R  FDA +T QDL++ Y P F +  +  ++   MCAYN VNG+PSC++   L
Sbjct: 234 LENYNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCSNSFFL 293

Query: 282 SKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
               R  WGF  +GY++SDCDA+  +++   YA S   A  D LKAG D++CG     H 
Sbjct: 294 QTLLRESWGFPEYGYVSSDCDAIYNVWNPHNYANSQSSAAADSLKAGTDIDCGQTYPWHL 353

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
             +     +   EI+R++  L++  +RLG F+     +  G    DVV + A  + + +A
Sbjct: 354 NESFVAGTVSRGEIERSVTRLYANLVRLGYFDKKNEYRSLG--WKDVVKTDAWNI-SYEA 410

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A +GIVLLKN  G LPL K K  S+ALIGP  N+ + L GNY G +   I+PLQA +   
Sbjct: 411 AVEGIVLLKND-GTLPLSK-KVRSIALIGPWVNATEQLQGNYFGTAPYLISPLQAAKKAG 468

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
               Y  G      ++A   KA+  AK +D ++ + G+D T E+E  DR D+  PG Q +
Sbjct: 469 YEVNYELGTGINNQTTAGFAKAIAAAKKSDAIIFIGGIDNTIEQEGADRTDIAWPGNQLD 528

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           LI +++E   KP++++ + GG VD +  K ++ + S++W GYPG++G  AL +++ G   
Sbjct: 529 LIKQLSEVG-KPLVVLQMGGGQVDSSSIKSNKKVNSLVWGGYPGQSGGYALFDILSGKRA 587

Query: 580 P 580
           P
Sbjct: 588 P 588


>gi|358365439|dbj|GAA82061.1| beta-xylosidase [Aspergillus kawachii IFO 4308]
          Length = 788

 Score =  360 bits (923), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 205/540 (37%), Positives = 305/540 (56%), Gaps = 20/540 (3%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN-GT 109
           P++++   L+S  TLDE I+   N+   + RLG+PAY+ WSEALHG+    +  F + G+
Sbjct: 59  PMTEQ-HSLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGL---DRANFSDSGS 114

Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRW 169
              ATSFPQ ILT A+ +  L ++I   I  + RA  NAG+  G+  +APNIN FR P W
Sbjct: 115 YNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGR-YGLDVYAPNINTFRHPVW 173

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
           GRGQETPGED  +   YA  Y+ G+QG           L+ +A  KH+  YD++NW   +
Sbjct: 174 GRGQETPGEDVSLAAVYAYEYITGIQGP-----DPDSNLKLAATAKHYAGYDIENWHNHS 228

Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
           R   D  +T QDL++ Y P F    +  +   +MCAYN VNG+P+CAD   L    R  +
Sbjct: 229 RLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTF 288

Query: 290 GF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
           GF  HGY++SDCDA   IY+  GYA S   A  + + AG D++CG+  Q H   ++    
Sbjct: 289 GFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGD 348

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTM--QPFGKIGADVVCSPAHQVLALQAAQDGIV 405
           L   +I++ +  L++  ++ G F+ N T    P+  +    V       ++ QAA  GIV
Sbjct: 349 LSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIV 408

Query: 406 LLKNSHGLLPLPKS----KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           LLKNS+ +LPL +      + ++ALIGP AN+   LLGNY G +   I+P  A +     
Sbjct: 409 LLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYK 468

Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
             +  G    + S++    A+  A+ AD ++   G+D T E E LDR  +  PG Q +LI
Sbjct: 469 VNFAEGTGISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLI 528

Query: 522 TRVAEAA-KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            ++A +A  KP+I++ + GG VD +  K + N+ ++LW GYPG++G  AL ++I G  NP
Sbjct: 529 QKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNP 588


>gi|402493386|ref|ZP_10840139.1| beta-glucosidase [Aquimarina agarilytica ZC1]
          Length = 734

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 237/598 (39%), Positives = 336/598 (56%), Gaps = 53/598 (8%)

Query: 4   HKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
           HKL     +L L F +LL +  + Q        N +T+ F +  T     +RA+ LV+ L
Sbjct: 7   HKLLPTKVVLMLIFLTLLPKHLAAQ-------KNENTKNFEWFDTNKSFEKRAKALVASL 59

Query: 64  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
           TL+EKIS +V+ +  I RL IP Y WW+E LHGVA  G+          AT FPQ I  A
Sbjct: 60  TLEEKISLMVDQSAPIDRLNIPEYNWWNECLHGVARNGR----------ATVFPQAIGLA 109

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNINIFRDPRWGRGQETP 176
           A+FD  L +++  AI  EARA +NA  AIG       +TFW PNINIFRDPRWGRGQET 
Sbjct: 110 ATFDQDLIFKVADAISTEARAKFNASIAIGNRGKYAGLTFWTPNINIFRDPRWGRGQETY 169

Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
           GEDP +T +  V++V+G+QG+          L+++AC KH   Y + +     R++FDA 
Sbjct: 170 GEDPYLTSQIGVNFVKGLQGNH------PKYLKSAACAKH---YAVHSGPEELRHEFDAI 220

Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
            + +D+A+TY P FE+ VK+ +  G+M AYNRVNG  +CA   LL K  +  WGF GYI 
Sbjct: 221 ASKKDMAETYLPAFEALVKEAKVEGVMGAYNRVNGEGACASPYLLEKLLKDTWGFKGYIV 280

Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
           SDC A+S ++      ++ E++    L  G++VNCG+ +      A+KQ    E ++D  
Sbjct: 281 SDCWALSDLHKFHKVTQTAEESAAAALNVGLNVNCGN-VYPALDGAIKQGLTSEKQLDNV 339

Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
           L +    R +LG F+ +    P+ KI  DVV S AH+ +AL+AAQ  IVLLKN++ LLPL
Sbjct: 340 LQHQLLTRFKLGFFDPS-NNNPYNKITTDVVDSEAHRAIALEAAQKSIVLLKNNNNLLPL 398

Query: 417 PKS-KSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCDTVA 472
            K  KSV +A  GPNA     LLGNY G + ++ T L  + + V    +  Y  G     
Sbjct: 399 KKDLKSVYVA--GPNAAREDVLLGNYYGVTSKTQTILDGIVSKVSAGTSINYKQGLLPFQ 456

Query: 473 CSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDLVLPGRQQELITR 523
            +   ID +      AD  +++MGL    E EE          DRVD+ LP  Q + I +
Sbjct: 457 KNVNPIDWSTGEISRADVGIIVMGLSGNYEGEEGEAIASESKGDRVDIRLPQNQIDYIKK 516

Query: 524 V-AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + A+    P++LVL  G P+ +    YD  + +I++A YPGE G  A+A+++FGD  P
Sbjct: 517 IKAKNTGNPLVLVLTGGSPIAMP-EVYDL-VDAIVFAWYPGEEGGQAVADILFGDVVP 572


>gi|320161274|ref|YP_004174498.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
 gi|319995127|dbj|BAJ63898.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
          Length = 712

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 218/550 (39%), Positives = 303/550 (55%), Gaps = 44/550 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SR+TL+EKISQ+ NS  AIPRLGIPAY++WSEALHGVA  GK        
Sbjct: 14  PLEERVNDLISRMTLEEKISQMCNSCAAIPRLGIPAYDYWSEALHGVARNGK-------- 65

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-----GQA---IGMTFWAPNIN 162
             AT FPQ I  AA++D+ L  R+  AI  EARA ++      G+     G+T W+PNIN
Sbjct: 66  --ATVFPQAIGMAATWDTELIERVADAIASEARAKFHETLRKFGKTDIYQGLTMWSPNIN 123

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
           IFRDPRWGRGQET GEDP +TG+   ++VRG+Q      GK    L+ +AC KH+T +  
Sbjct: 124 IFRDPRWGRGQETWGEDPYLTGEMGAAFVRGLQ------GKDPHYLKTAACAKHYTVH-- 175

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
            +     R+ F+A VT ++L DTY P F+  V + +   +M AYNR  G P C    LL 
Sbjct: 176 -SGPEKERHTFNAIVTRRELFDTYLPAFKKLVTEAKVEAVMGAYNRTLGEPCCGSPYLLK 234

Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKA 341
           +  R QWGF G++ SDC A++  +      K   ++    +K G D+ C  ++  ++   
Sbjct: 235 EILRNQWGFKGHVVSDCGAINDFHLHHQVTKDGAESAALGIKNGCDMACICTYSYENLTE 294

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A+ +  + E +ID AL N    R +LGLF+    + P+  I   VV   AH+ LA + A 
Sbjct: 295 ALNRGLITEEDIDHALRNTLRTRFKLGLFDPQEKV-PYAHISMSVVGCEAHRKLAYETAV 353

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
              VLLKN + +LP+ K    S+ ++GPNA +   LLGNY G S    T ++ L   +  
Sbjct: 354 KSAVLLKNHNHILPV-KPDVKSILIVGPNAGNVHVLLGNYYGLSDSMTTFMEGLVGRLPE 412

Query: 462 TV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE--------LDRVD 510
            V   + PG           D +V  A   D V+  MGL    E EE         DR D
Sbjct: 413 GVRMEFMPGSLLTDSKKIKNDWSVASAASFDLVIAFMGLSPLLEGEEGEAILSDNGDRED 472

Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
           + LP  QQE I  +A    K  I+++L GG   I     +  + +ILW GYPG+ G  A+
Sbjct: 473 IALPKAQQEYIRDLAATGAK--IVLVLTGGSA-IALNGIEDLVEAILWVGYPGQEGGRAI 529

Query: 571 AEVIFGDHNP 580
           A++IFGDH+P
Sbjct: 530 ADLIFGDHSP 539


>gi|255572557|ref|XP_002527212.1| beta-glucosidase, putative [Ricinus communis]
 gi|223533388|gb|EEF35138.1| beta-glucosidase, putative [Ricinus communis]
          Length = 349

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 176/348 (50%), Positives = 230/348 (66%), Gaps = 26/348 (7%)

Query: 1   MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
           M  +  SL F  + L F   ++  +S +  + C P  P   ++ FC  +L +  RA  L+
Sbjct: 1   MTLYLRSLFFSCIVL-FELSVSNSESHKLQYPCQP--PLHNSYTFCNQSLSVPTRAHSLI 57

Query: 61  SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQV 119
           S LTL+EKI QL ++A  IPR GIP YEWWSE+LHG+A  G G+ F  G +  AT FPQV
Sbjct: 58  SLLTLEEKIKQLSDNASGIPRFGIPPYEWWSESLHGIAINGPGVSFTIGPVSAATGFPQV 117

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
           I++AA+F+  LW+ IG AI +EARA++N GQ+ G+TFWAPN+NIFRDPRWGRGQETPGED
Sbjct: 118 IISAAAFNRTLWFLIGSAIAIEARAMHNVGQS-GLTFWAPNVNIFRDPRWGRGQETPGED 176

Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA--------------------SACCKHFTA 219
           P++T  YA+ +V+G QG  +  G                            SACCKH TA
Sbjct: 177 PMLTSAYAIEFVKGFQGGNWKSGVSGSGSGRYGFGEKRMLRDDDGDDGLMLSACCKHLTA 236

Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
           YDL+ W   +RY F+A VT QDL DTYQPPF SC+++G+AS +MC+YN VNG+P+CA  +
Sbjct: 237 YDLEKWGNFSRYSFNAVVTEQDLEDTYQPPFRSCIEEGKASCLMCSYNEVNGVPACARED 296

Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
           LL K AR +WGF GYI SDCDAV+ I++ + Y+KS EDAV   LKAGM
Sbjct: 297 LLQK-AREEWGFEGYIVSDCDAVATIFEYQNYSKSAEDAVAIALKAGM 343


>gi|310795958|gb|EFQ31419.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 824

 Score =  358 bits (920), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 225/617 (36%), Positives = 324/617 (52%), Gaps = 61/617 (9%)

Query: 13  LCLCFTSLLT--RVDSTQPPF---SCDPS------NPSTETFP-----------FCKTTL 50
           L  C ++L+     DS Q P    +C P+      N +  +FP            C  TL
Sbjct: 10  LLFCGSALVAGFDADSHQKPLVSRTCLPAVEHAANNNAVNSFPDCSRDPLCSNDVCDETL 69

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV-GKGIFFNGT 109
              +RA  LV+ LT+ EK+  LVN AP +PRL IP YEWWSE LHGVA   G     +G 
Sbjct: 70  SPKERAAALVAELTIWEKLDNLVNEAPGVPRLAIPPYEWWSEGLHGVASSPGTKFAKSGN 129

Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFW------------ 157
              ATSFPQ I+  ++FD  L   IG+ +  EARA  N G++ G+  +            
Sbjct: 130 FSYATSFPQPIVLGSAFDDDLVKAIGEVVSKEARAFSNRGRS-GLDLYVSSISRHIEPEV 188

Query: 158 ---------APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
                    +PNIN F+DPRWGRGQETPGEDP     Y  + + G++G     G    KL
Sbjct: 189 RDDMLTEPESPNINAFKDPRWGRGQETPGEDPFHLQNYVAAMLTGLEG-----GDPSKKL 243

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
            A+  CKH+ A D +N+KG  R  FDA +T QDL++ Y PPF++C    +    MC+YN 
Sbjct: 244 IAT--CKHYAANDFENYKGVDRAGFDANITTQDLSEYYLPPFKTCAVDKKVGSFMCSYNA 301

Query: 269 VNGIPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
           +NG P CA+  LL    R+ WG++G   Y+++DCD V+++     YA     A    +KA
Sbjct: 302 INGEPLCANPYLLEDILRQHWGWNGDGQYVSTDCDCVALMVSHHHYAPDLGHAAAWAMKA 361

Query: 326 GMDVNCGSFL-QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA 384
           G D+ C +F   +  + A  Q  + E E+D++L  +++  + +G F+ +   QP   +  
Sbjct: 362 GTDLECNAFPGSEALQLAWNQSLISEKEVDKSLTRMYTALVSVGQFD-SARGQPLRSLSW 420

Query: 385 DVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS-KSVSLALIGPNANSAKTLLGNYAG 443
           D V +   Q LA QA  +G VLLKN  G+LPL  + +    ALIGP  N+   + GNY G
Sbjct: 421 DDVNTKEAQKLAYQAVIEGAVLLKND-GILPLSAAWREKKYALIGPWINATTQMQGNYFG 479

Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
           P+   I+  QA + +  +  Y  G   +  +  S  +A+D A  A  +V   G+D T E 
Sbjct: 480 PAPYLISLYQAAKEFGLDFTYSLGS-RINSTDDSFKQALDSAHAAALIVFAGGVDNTLEA 538

Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
           E  DR  L  P  Q +L+ R   A  KPVI++   GG VD T    + +I ++LW GYPG
Sbjct: 539 ETRDRKTLAWPESQLDLL-RAVSALGKPVIVLQFGGGQVDDTELLANHSINALLWGGYPG 597

Query: 564 EAGAVALAEVIFGDHNP 580
           ++G  A+ +++FG   P
Sbjct: 598 QSGGKAVIDLLFGRAAP 614


>gi|2920706|emb|CAA73902.1| beta-xylosidase [Emericella nidulans]
          Length = 802

 Score =  358 bits (919), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 200/541 (36%), Positives = 308/541 (56%), Gaps = 19/541 (3%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P C  +L    RA  LVS  T DE ++   N+   + RLG+P Y+ W EALHGV   G+ 
Sbjct: 58  PVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGV---GRA 114

Query: 104 IFF-NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
            F  +G    ATSFP  I   A+ +  L ++IG  +  + RA  NAG   G+  ++PNIN
Sbjct: 115 NFVESGNFSWATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLG-GVDVYSPNIN 173

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
            FR P WGRGQETPGED  +T  Y   Y+  +QG          K+ A+A  KH+  YD+
Sbjct: 174 TFRHPVWGRGQETPGEDAFLTSVYGYEYITALQGAV---DPETSKIIATA--KHYAGYDI 228

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           ++W   +R   D ++T Q+L++ Y PPF    +  +   +MC+YN VNG+PSCA++  L 
Sbjct: 229 ESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQ 288

Query: 283 KTARRQWGFH--GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
              R  + F   GY++ DC AV  +++  GYA +   A  D + AG D++CG+  Q H++
Sbjct: 289 TLLRDTFEFSEDGYVSGDCGAVYNVWNPHGYASNEAAASADSILAGTDIDCGTSYQWHSE 348

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A +   +  S+I+R +  L+S  ++ G F+G     P+  I  D V S     +A +AA
Sbjct: 349 DAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDA--PYRDITWDDVLSTDAWNIAYEAA 406

Query: 401 QDGIVLLKNSHGLLPLPKSKSV-SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
            +GIVLLKN      LP SK + S+A+IGP AN  + L GNY GP+   I+PL   ++  
Sbjct: 407 VEGIVLLKNDE---TLPLSKDIKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSG 463

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
            +  Y  G +  + S++  ++A+  AK AD ++   G+D T E E +DR ++  PG Q +
Sbjct: 464 LDVHYALGTNLTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLD 523

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           LI++++E   KP++++ + GG VD +  K + N+ +++W GYPG++G  ALA++I G   
Sbjct: 524 LISKLSELG-KPLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRA 582

Query: 580 P 580
           P
Sbjct: 583 P 583


>gi|359409694|ref|ZP_09202159.1| Beta-glucosidase [Clostridium sp. DL-VIII]
 gi|357168578|gb|EHI96752.1| Beta-glucosidase [Clostridium sp. DL-VIII]
          Length = 723

 Score =  358 bits (918), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 210/551 (38%), Positives = 313/551 (56%), Gaps = 54/551 (9%)

Query: 56  ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
           A++LV+++TL EK  QL  ++PA+ RL IP Y WW+E LHGVA  G           AT 
Sbjct: 29  AKELVAKMTLQEKAEQLTYNSPAVKRLNIPEYNWWNEGLHGVARAGT----------ATV 78

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRDPR 168
           FPQ I  AA FD     ++   I  E RA YN            G+T+W+PN+NIFRDPR
Sbjct: 79  FPQAIGLAAMFDEEFLGKVAGIIATEGRAKYNENSKKEDRDIYKGLTYWSPNVNIFRDPR 138

Query: 169 WGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGT 228
           WGRG ET GEDP +T +  V++V+G+QGD    GK    L+ SAC KHF  +   +   +
Sbjct: 139 WGRGHETYGEDPYLTSRLGVAFVKGLQGD----GKY---LKLSACAKHFAVH---SGPES 188

Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
            R++F+A V+ +DL +TY P FE+CVK+     +M AYNR NG P C  + LL    R +
Sbjct: 189 LRHEFNAVVSQKDLHETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKALLKDILRGK 248

Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKL 348
           WGF G++ SDC A++  +       +  ++V   ++ G D+NCG+ +  +   A K+  +
Sbjct: 249 WGFKGHVVSDCWALADFHMHHKVTSTATESVALAIENGCDLNCGN-MYLNLLLAYKEGLV 307

Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
            E +I  A   L + R +LG+F+ +     + +I  +V     H  ++L+A++  +VLLK
Sbjct: 308 TEEQITTAAERLMTTRFKLGMFDEDC---EYNQIPYEVNDCKEHNQVSLEASRKSMVLLK 364

Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YY 465
           N+ G+LPL KSK  ++A+IGPNANS   L GNY+G + +  T L  + + +++ V   Y 
Sbjct: 365 NN-GILPLDKSKLKAVAVIGPNANSEIMLKGNYSGTASKYTTILDGIHDVLDDDVRVYYS 423

Query: 466 PGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRV 509
            GC       + +A     + +AV +A+ AD V+L +GLD T E E+          D++
Sbjct: 424 EGCHLYKEKVEDLARRDDRLAEAVSVAERADVVILCLGLDSTIEGEQGDAGNGYGAGDKL 483

Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
           DL LPG QQEL+ +V E   KPV++VL  G  + +  A  +    +IL A YPG  G  A
Sbjct: 484 DLNLPGIQQELLEKVLETG-KPVVVVLGTGSGLTLNGA--EERCAAILNAWYPGSHGGTA 540

Query: 570 LAEVIFGDHNP 580
            A+++FG  +P
Sbjct: 541 AADILFGKCSP 551


>gi|291548352|emb|CBL21460.1| Beta-glucosidase-related glycosidases [Ruminococcus sp. SR1/5]
          Length = 697

 Score =  358 bits (918), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 219/554 (39%), Positives = 309/554 (55%), Gaps = 58/554 (10%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           ++A  LV+R+TL+EK SQL   APAI RLGIPAY WW+E LHGVA  G+          A
Sbjct: 8   KKAEALVARMTLEEKASQLRYDAPAIKRLGIPAYNWWNEGLHGVARAGQ----------A 57

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
           T FPQ I  AA+FD      +   +  E RA YNA           G+TFW+PN+NIFRD
Sbjct: 58  TVFPQAIGMAAAFDRKSVAEMAGIVATEGRAKYNAYSVNGDRDIYKGLTFWSPNVNIFRD 117

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T +  VS+V+ +QG   NG  +K    A+AC KHF  +   +  
Sbjct: 118 PRWGRGHETYGEDPYLTKELGVSFVKALQG---NGDTMK----AAACAKHFAVH---SGP 167

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R++FDA  + +D+ +TY P FE  VK+ +   +M AYNR NG P C     L K  R
Sbjct: 168 EALRHEFDAEASAKDMEETYLPAFEGLVKEAKVEAVMGAYNRTNGEPCCGSPT-LQKKLR 226

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
            +W F G+  SDC A+   ++      +  ++    +  G D+NCG ++L  H   A ++
Sbjct: 227 GEWKFQGHFVSDCWAIRDFHEHHMVTDTAVESAALAINNGCDLNCGNTYL--HIMKAYEK 284

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E  I RA   LF+ R  LGLF+G+     +  +    V SP H   A +AA+   V
Sbjct: 285 GLVTEETITRAAVRLFTTRYLLGLFDGS----EYDNLSYMEVESPRHLDAAEKAAEKSFV 340

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYY 465
           LLKN +G+LPL K K  ++ +IGPNA+S + L+GNY G + R IT  + +Q+YV + V  
Sbjct: 341 LLKN-NGILPLDKEKLKTIGIIGPNADSRQALIGNYHGTASRYITIQEGIQDYVGDDVRI 399

Query: 466 ---PGCDT-------VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
               GCD        +A +   I +A  +A+ +D V+L MGLD+T E EE          
Sbjct: 400 LTSRGCDLFRDRTEHLAFTRDRIAEAKVVAENSDVVILCMGLDETLEGEEGDTGNSYVSG 459

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+ D+ LPG Q+EL+  +A+   KPV+  LL G  +D+ +A    +   +LW  YPG  G
Sbjct: 460 DKEDIELPGVQRELMEAIADTG-KPVVFCLLAGSDLDLKYAAEKFDAVMMLW--YPGCQG 516

Query: 567 AVALAEVIFGDHNP 580
             A A+V+FG+ +P
Sbjct: 517 GKAAAKVLFGEISP 530


>gi|367028614|ref|XP_003663591.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347010860|gb|AEO58346.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 760

 Score =  357 bits (917), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 222/550 (40%), Positives = 307/550 (55%), Gaps = 37/550 (6%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T+     RA  LVS +  +EK++ LVN++P + RLG+ AY+WW+EALHGVA   +GI 
Sbjct: 39  CDTSASPGARAAALVSVMNNNEKLANLVNNSPGVSRLGLSAYQWWNEALHGVAH-NRGIT 97

Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
           + G    AT FPQ I T+A+FD  L  +IG  I  EARA  N G+A  + FW PN+N FR
Sbjct: 98  WGGEFSAATQFPQAITTSATFDDALIEQIGTIISTEARAFANNGRA-HLDFWTPNVNPFR 156

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ETPGED     K+A ++V+G+QG            +  A CKH+ AYDL+N 
Sbjct: 157 DPRWGRGHETPGEDAFKNKKWAEAFVKGMQGP-------GPTHRVIATCKHYAAYDLENS 209

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
             TTR+ FDA+V+ QDLA+ Y PPF+ C +  +   IMC+YN VN IP+CA+  L+    
Sbjct: 210 GSTTRFNFDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNEIPACANPYLMDTIL 269

Query: 286 RRQWGF---HGYITSDCDAVSIIYDAEG---YAKSPEDAVVDVLKAGMDVNCGSF--LQK 337
           R+ W +   H YI SDCDAV  + +A G   Y  S   A+   L+AG D  C +      
Sbjct: 270 RKHWNWTDEHQYIVSDCDAVYYLGNANGGHRYKPSYAAAIGASLEAGCDNMCWATGGTAP 329

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
              +A    +  ++ +D A+       +  G F+G   M  +  +    V +   Q  AL
Sbjct: 330 DPASAFNSGQFSQTTLDTAILRQMQGLVLAGYFDGPGGM--YRNLSVADVNTQTAQDTAL 387

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKS-VSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           +AA+ GIVLLKN  G+LPL  + S   +A+IG  AN+A  +LG Y+G    +  P+ A +
Sbjct: 388 KAAEGGIVLLKND-GILPLSVNGSNFQVAMIGFWANAADKMLGGYSGSPPFNHDPVTAAR 446

Query: 457 ------NYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
                 NYV   +  P  DT A        A++ A+ ++ VV   G+D T EKE  DR  
Sbjct: 447 SMGITVNYVNGPLTQPNGDTSA--------ALNAAQKSNAVVFFGGIDNTVEKESQDRTS 498

Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
           +  P  Q  LI R+AE   KPVI+V L G  VD T      N+ +ILWAGYPG+ G  A+
Sbjct: 499 IEWPSGQLALIRRLAETG-KPVIVVRL-GTHVDDTPLLSIPNVRAILWAGYPGQDGGTAV 556

Query: 571 AEVIFGDHNP 580
            ++I G  +P
Sbjct: 557 VKIITGLASP 566


>gi|310797011|gb|EFQ32472.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 767

 Score =  357 bits (916), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 219/550 (39%), Positives = 302/550 (54%), Gaps = 34/550 (6%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C  TL   +RA  LV  LT++EK+  LV+ A   PR+G+PAY WWSEALHGVA      F
Sbjct: 43  CDRTLSPPERAAALVKALTVEEKLQNLVSKAQGAPRIGLPAYNWWSEALHGVAYAPGTYF 102

Query: 106 FNGTIR--GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
             G +    +TS+P  +L AA+FD  L  +IG AIG+EARA  NAG A G+ +W PN+N 
Sbjct: 103 PEGDVEFNSSTSYPMPLLMAAAFDDELIEQIGAAIGIEARAWGNAGWA-GLDYWTPNVNP 161

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRG ETPGED L   +YA    RG+ G      +     +  + CKH+   D +
Sbjct: 162 FKDPRWGRGSETPGEDVLRVKRYAEYITRGLDGPVPGEQR-----RVISTCKHYAGNDFE 216

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W GT+R+ FDA++T QDLA+ Y  PF+ C +  +   IMCAYN VNG+PSCA+  LL  
Sbjct: 217 DWNGTSRHDFDAKITAQDLAEYYLMPFQQCARDSKVGSIMCAYNAVNGVPSCANEYLLQN 276

Query: 284 TARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R  W +   + Y+TSDC+AV  +     YA +         +AGMD +C         
Sbjct: 277 ILREHWNWTEHNNYVTSDCEAVLDVSANHKYAPTNAAGTAICFEAGMDTSCEYTGSSDIP 336

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A  Q  L E  +DRAL  L+   +R G F+G+  +  + K+G   V S   Q LALQAA
Sbjct: 337 GAWSQGLLKEETVDRALLRLYEGLVRAGYFDGHEAI--YAKLGWKDVNSAEAQSLALQAA 394

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
            +GIVLLKN +G LPL    S  +A+IG  A++   L G Y+G +    TP  A +    
Sbjct: 395 VEGIVLLKN-NGTLPLDLKPSHKVAMIGFWADAPDKLQGGYSGRAAHLHTPAYAARQL-- 451

Query: 461 NTVYYPGCDTVACSSASIDK--AVD--------IAKGADHVVLMMGLDQTQEKEELDRVD 510
                 G D    S   + +  A D         A+GAD+++   GLD +   E LDR D
Sbjct: 452 ------GLDITLASGPVLQRNNASDNWTAAALEAAEGADYILYFGGLDTSAAGETLDRTD 505

Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
           L  P  Q  LI +++ A  KP+++ LL     D    + D  + SILWA +PG+ G VA+
Sbjct: 506 LEWPEAQLMLIKKLS-ALGKPLVVNLLGDQLDDTPLLQLDE-VSSILWANWPGQDGGVAI 563

Query: 571 AEVIFGDHNP 580
            ++I G+ +P
Sbjct: 564 MKLITGEKSP 573


>gi|380293100|gb|AFD50200.1| beta-xylosidase [Hypocrea orientalis]
          Length = 797

 Score =  357 bits (916), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 211/541 (39%), Positives = 315/541 (58%), Gaps = 22/541 (4%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG---K 102
           C ++    +RA+ L+S  TL+E I    NS P +PRLG+P Y+ W+EALHG+       K
Sbjct: 63  CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           G    G    ATSFP  ILT A+ +  L ++I   I  +ARA  N+G+  G+  +APN+N
Sbjct: 123 G----GQFEWATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGR-YGLDVYAPNVN 177

Query: 163 IFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
            FR P WGRGQETPGED   ++  Y   Y+ G+QG     G    +L+ +A  KHF  YD
Sbjct: 178 GFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG-----GVDPEQLKVAATVKHFAGYD 232

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
           L+NW   +R  FDA +T QDL++ Y P F +  +  ++  +MC+YN VNG+PSCA+   L
Sbjct: 233 LENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCSYNSVNGVPSCANSFFL 292

Query: 282 SKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
               R  WGF   GY++SDCDAV  +++   YA +   A    L+AG D++CG     H 
Sbjct: 293 QTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHL 352

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
             +    ++   EI+R++  L++  +RLG F+     +  G    DVV + A  + + +A
Sbjct: 353 NESFVAGEVTRGEIERSVTRLYANLVRLGYFDKKNQYRSLG--WKDVVKTDAWNI-SYEA 409

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A +GIVLLKN  G LPL K K  S+ALIGP AN+   + GNY GP+   I+PL+A +   
Sbjct: 410 AVEGIVLLKND-GTLPLSK-KVRSIALIGPWANATTQMQGNYFGPAPYLISPLEAAKKAG 467

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
            +  +  G +    S+A   KA+  AK +D +V + G+D T E+E  DR D+  PG Q +
Sbjct: 468 YHVNFELGTEIAGNSTAGFAKAIAAAKKSDAIVYLGGIDNTIEQEGADRTDIAWPGNQLD 527

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           LI +++E   KP++++ + GG VD +  K ++ + S++W GYPG++G VAL +++ G   
Sbjct: 528 LIKQLSEVG-KPLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRA 586

Query: 580 P 580
           P
Sbjct: 587 P 587


>gi|295134875|ref|YP_003585551.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294982890|gb|ADF53355.1| beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 735

 Score =  357 bits (915), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 218/599 (36%), Positives = 327/599 (54%), Gaps = 61/599 (10%)

Query: 5   KLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLT 64
           K   +F ++ + F + +     T+     D S      F F  T L + +R  DL+SRLT
Sbjct: 3   KYRQLFAIMLMLFLTAIMHAQQTK----IDKSE-----FDFYDTDLSMDERIDDLISRLT 53

Query: 65  LDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAA 124
           L+EK  Q++N++PAI RLGIPAY+WW+EALHG+   G           AT FPQ I   A
Sbjct: 54  LEEKAQQMLNASPAIERLGIPAYDWWNEALHGLGRSGV----------ATVFPQAIGMGA 103

Query: 125 SFDSYLWYRIGQAIGLEARALYNAG-------QAIGMTFWAPNINIFRDPRWGRGQETPG 177
           +FD  L  ++  AI  EARA +N         +  G+TFW PN+NIFRDPRWGRGQET G
Sbjct: 104 TFDDDLILKVSTAISDEARANFNNAVKHGYHRKYGGLTFWTPNVNIFRDPRWGRGQETYG 163

Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
           EDP +T K   ++V+G+QGD          L+ +A  KH+  +   +     R++F+A V
Sbjct: 164 EDPYLTSKLGEAFVKGLQGDN------DKYLKTAAAAKHYAVH---SGPEKLRHEFNADV 214

Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
           + +DL +TY P F++ V       IMCAYN  NG P CA+  L++   R +WGF+G++ S
Sbjct: 215 SEKDLWETYLPAFKTLV-DANVETIMCAYNSTNGEPCCANNRLINDILRDKWGFNGHVVS 273

Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG---SFLQKHTKAAVKQKKLPESEID 354
           DC A+          +SPE A    ++ G+++NCG   +FL K    AV+   + E  +D
Sbjct: 274 DCWALQDFVSGHDIVESPEAAAALAVEVGIELNCGDTYNFLAK----AVEDGLVSEELVD 329

Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
           + LH L   R +LGLF+   +  P+ KIG +V+ S  H+ LA + A+  IVLLKN  G+L
Sbjct: 330 KRLHKLLETRFKLGLFDPEES-NPYNKIGVEVMNSDEHRALARETARKSIVLLKND-GVL 387

Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCDTV 471
           PL K+      + GPNA + + LLGNY G +   +T L+ +   ++      Y  G    
Sbjct: 388 PL-KNNLSKYFITGPNATNIEVLLGNYHGVNPDMVTVLEGIAKAIKPESQLQYRMGTRLN 446

Query: 472 ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDLVLPGRQQELIT 522
             +    D A   A  +D   ++MG+    E EE          DR+D  LP  Q + + 
Sbjct: 447 LPNENPQDWASPNAGNSDATFVVMGISGLLEGEEGESIASPTFGDRMDYNLPQNQIDYLQ 506

Query: 523 RVAEAAK-KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V+EAA+ +PV+ ++  G P+++T  +  +   ++L   YPGE G  A+A++IFG ++P
Sbjct: 507 KVSEAAEDRPVVAIVTGGSPMNLT--EVHKLADAVLLVWYPGEEGGNAVADIIFGKNSP 563


>gi|358385386|gb|EHK22983.1| glycoside hydrolase family 3 protein [Trichoderma virens Gv29-8]
          Length = 795

 Score =  356 bits (913), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 212/541 (39%), Positives = 315/541 (58%), Gaps = 22/541 (4%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG---K 102
           C ++   ++RA+ L+S  TL+E I    NS P +PRLG+P Y+ W+EALHG+       K
Sbjct: 63  CDSSAGYAERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFATK 122

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           G    G  + ATSFP  IL+ A+ +  L ++I   I  +ARA  N+G+  G+  +APNIN
Sbjct: 123 G----GQFQWATSFPMPILSMAALNRTLIHQIADIISTQARAFSNSGR-YGLDVYAPNIN 177

Query: 163 IFRDPRWGRGQETPGEDP-LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
            FR P WGRGQETPGED  ++T  Y   Y+ G+QG     G     L+ +A  KHF  YD
Sbjct: 178 GFRSPLWGRGQETPGEDANVLTSAYTYEYITGMQG-----GVDPENLKIAATAKHFAGYD 232

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
           L+NW   +R  FDA +T QDL++ Y P F +  +  ++   MCAYN VNG+PSCA+   L
Sbjct: 233 LENWNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCANSFFL 292

Query: 282 SKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
               R  WGF   GY++SDCDAV  +++   YA +   A    L+AG D++CG     H 
Sbjct: 293 QTLLRESWGFPEWGYVSSDCDAVYNVWNPHDYASNQSSAAASSLRAGTDIDCGQTYPWHL 352

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
             +    ++   EI+R++  L++  +RLG F+     +  G    DVV + A  + + +A
Sbjct: 353 NESFVAGEVSRGEIERSVTRLYANLVRLGYFDKKNEYRSLG--WKDVVKTDAWNI-SYEA 409

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A +GIVLLKN  G LPL K K  S+ALIGP AN+   + GNY G +   I+PL+A +   
Sbjct: 410 AVEGIVLLKND-GTLPLSK-KVRSIALIGPWANATTQMQGNYFGAAPYLISPLEAAKKAG 467

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
               +  G +T + S+A   KA+  AK +D ++   G+D T E+E  DR D+  PG Q +
Sbjct: 468 YQVNFELGTETASTSTAGFAKAIAAAKKSDAIIFAGGIDNTVEQEGADRTDIAWPGNQLD 527

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           LI +++E   KP++++ + GG VD +  K ++ + S++W GYPG++G VAL +++ G   
Sbjct: 528 LIKQLSELG-KPLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKRA 586

Query: 580 P 580
           P
Sbjct: 587 P 587


>gi|115387056|ref|XP_001210069.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114191067|gb|EAU32767.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 908

 Score =  356 bits (913), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 221/557 (39%), Positives = 319/557 (57%), Gaps = 34/557 (6%)

Query: 38  PSTETFPFCK-----TTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
           P+ +T P C      T+L I++R   LV  LTL+EKI  LV++A    RLG+P YEWW+E
Sbjct: 150 PACQTQPLCSHRVCDTSLSIAERVNSLVKSLTLEEKILNLVDAAAGSTRLGLPFYEWWNE 209

Query: 93  ALHGVAGVGKGIFFN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           A HGV G   G+ F         ATSFP  IL AASFD+ L  +I + IG E RA  N G
Sbjct: 210 ATHGV-GSAPGVQFTSKPANFSYATSFPAPILIAASFDNALIRKIAEVIGKEGRAFANNG 268

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            + G  FWAPNIN FRDPRWGRGQETPGED  V   Y  +++ G+QGD       K K Q
Sbjct: 269 FS-GFDFWAPNINGFRDPRWGRGQETPGEDTFVAQNYIRNFIPGLQGD-----DPKNK-Q 321

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
             A CKH+  YDL+    T RY  +   T QDL+D +  PF++CV+      IMC+YN V
Sbjct: 322 VIATCKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSV 377

Query: 270 NGIPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
           +GIP+CA+  LL +  R+ WGF+    Y+ SDC+AV+ I+    +  + E A    L AG
Sbjct: 378 SGIPACANEYLLDEVLRKHWGFNADYHYVVSDCNAVTDIWQYHNFTDTEEAAAAVALNAG 437

Query: 327 MDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           +D+ CG S+L+ +   A  Q  +    +D++L  L+S    +G F+G      +  +   
Sbjct: 438 VDLECGSSYLKLNESLAANQTSV--KAMDQSLARLYSALFTIGFFDGG----KYDHLDFS 491

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIGPNANSAKTLLGNYAGP 444
            V  PA Q LA +AA +G+ LLKN  GLLPL  + K  S+A+IGP AN+   + G Y+G 
Sbjct: 492 DVSIPAAQALAYEAAVEGMTLLKND-GLLPLHSQHKYKSVAVIGPFANATTQMQGGYSGN 550

Query: 445 SCRSITPLQALQNYVENTVYYP-GCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
           +   I+PL A ++     V Y  G      ++   + ++  AK +D +V + G+D + E 
Sbjct: 551 APYLISPLVAFESDHRWKVNYAVGTAINDQNTTGFEASLAAAKKSDLIVYLGGIDNSIES 610

Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
           E +DR  L  PG Q +LI  ++  + KP+++V   GG VD +    +++I +++WAGYP 
Sbjct: 611 ETIDRTSLAWPGNQLDLIKSLSNLS-KPMVVVQFGGGQVDDSALLENKDIQALIWAGYPS 669

Query: 564 EAGAVALAEVIFGDHNP 580
           ++G  AL +++ G  +P
Sbjct: 670 QSGGTALLDILVGKRSP 686


>gi|340519849|gb|EGR50086.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
          Length = 796

 Score =  355 bits (912), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 218/566 (38%), Positives = 311/566 (54%), Gaps = 40/566 (7%)

Query: 35  PSNPSTETFP-----------FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
           P+ P T +FP            C TT  I++RA  +V  +TL+EK++ + +SA    RLG
Sbjct: 53  PNPPLTPSFPDCTRDPLCSNDVCDTTKSIAERAAAIVKPMTLNEKVANVGSSASGSARLG 112

Query: 84  IPAYEWWSEALHGVAGVGKGIFFN----GTIRGATSFPQVILTAASFDSYLWYRIGQAIG 139
           +PAY+W +EALHGVAG   G+ F          ATSFP  IL +A+FD  L   +  AI 
Sbjct: 113 LPAYQWQNEALHGVAG-STGVQFQSPLGANFSAATSFPMPILLSAAFDDALVKSVATAIS 171

Query: 140 LEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
            EARA  N G A G+ FW PNIN FRDPRWGRG ETPGED      Y ++ V G+QG   
Sbjct: 172 TEARAFANYGFA-GLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLALVDGLQG--- 227

Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
             G      +  + CKHF AYD++N     R   +   T QD+AD Y P FE+CV+  + 
Sbjct: 228 --GIDPDFYRTLSTCKHFAAYDIEN----GRTANNLSPTQQDMADYYLPMFETCVRDAKV 281

Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPE 316
           + IMCAYN V+G+P+CAD  LL    R  +GF     Y+ SDCDAV  ++D   YA +  
Sbjct: 282 ASIMCAYNAVDGVPACADSYLLQDVLRDTYGFTEDFNYVVSDCDAVENVFDPHHYAANLT 341

Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
            A    + AG D++CGS       A+V+     E+ +D++L  L+S  +++G F+     
Sbjct: 342 QAAAMSINAGTDLDCGSSYNV-LNASVQAGLTTEATLDKSLIRLYSALVKVGYFD----- 395

Query: 377 QP--FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
           QP  +  +G   V +   Q LA  AA +G+ LLKN  G LPL ++ S ++A+IGP AN  
Sbjct: 396 QPAEYNSLGWGNVNTTQSQALAHDAATEGMTLLKN-DGTLPLSRTLS-NVAVIGPWANVT 453

Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLM 494
             + GNYAG +   + PL   Q    N  Y  G    +  ++  + A+  A  +D +V +
Sbjct: 454 TQMQGNYAGTAPLLVNPLSVFQQKWRNVKYAQGTAINSQDTSGFNAALSAASSSDVIVYL 513

Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
            G+D + E E  DR  +  PG Q  LI+++A    KP+++V   GG +D +    +  + 
Sbjct: 514 GGIDISVENEGFDRSSITWPGNQLNLISQLANLG-KPLVIVQFGGGQIDDSALLSNSKVN 572

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNP 580
           SILWAGYPG+ G  A+ +V+ G + P
Sbjct: 573 SILWAGYPGQDGGNAIFDVLTGANPP 598


>gi|150019782|ref|YP_001312036.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
           8052]
 gi|149906247|gb|ABR37080.1| glycoside hydrolase, family 3 domain protein [Clostridium
           beijerinckii NCIMB 8052]
          Length = 709

 Score =  355 bits (912), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 213/551 (38%), Positives = 307/551 (55%), Gaps = 54/551 (9%)

Query: 56  ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
           A++LVS++TL EK  QL   +PAI  L +P Y WW+E LHGVA  G           AT 
Sbjct: 16  AKELVSKMTLQEKAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATV 65

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPR 168
           FPQ I  AA FD     ++   I  E RA YN        G   G+T+W+PNINIFRDPR
Sbjct: 66  FPQAIGLAAIFDDEFLGKVANIIATEGRAKYNEYSKKDDRGIYKGLTYWSPNINIFRDPR 125

Query: 169 WGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGT 228
           WGRG ET GEDP +T +  V++++G+QG+    GK    L+ +AC KHF  +   +    
Sbjct: 126 WGRGHETYGEDPYLTSRLGVAFIKGLQGE----GKY---LKLAACAKHFAVH---SGPEG 175

Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
            R++F+A V  +DL +TY P FE+CVK+     +M AYNR NG P C  + LL    R +
Sbjct: 176 LRHEFNAVVNKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILRGK 235

Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKL 348
           WGF G++ SDC A++  +       +  ++V   ++ G D+NCG+ +  +   A K+  +
Sbjct: 236 WGFKGHVVSDCWALADFHLHHMVTSTATESVALAIENGCDLNCGN-MYLNLLLAYKEGLV 294

Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
            E +I  A   L + R +LG+F+       + KI  +V  S  H  +AL A++  +VLLK
Sbjct: 295 TEEQITTAAERLMTTRFKLGMFDEEC---EYNKIPYEVNDSREHNEVALIASRKSMVLLK 351

Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YY 465
           N+ G LPL KS   S+A+IGPNANS   L GNY+G + +  T L+ + + V N V   Y 
Sbjct: 352 NN-GTLPLDKSNLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHDAVGNDVRVYYS 410

Query: 466 PGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRV 509
            GC       + +A     + +A+ +A+ +D VVL +GLD T E E+          D+ 
Sbjct: 411 EGCHLFKDKVEDLARPDDRLSEAISVAERSDVVVLCLGLDSTIEGEQGDAGNSYGAGDKE 470

Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
           +L LPGRQQ L+ +V E   KPVI+VL  G  + +  A  +    +IL A YPG  G  A
Sbjct: 471 NLNLPGRQQNLLEKVLEVG-KPVIVVLGAGSALTLNGA--EEKCAAILNAWYPGSHGGTA 527

Query: 570 LAEVIFGDHNP 580
           +A+++FG  +P
Sbjct: 528 VADILFGKCSP 538


>gi|345519864|ref|ZP_08799275.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
 gi|254836262|gb|EET16571.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
          Length = 736

 Score =  355 bits (911), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 223/604 (36%), Positives = 324/604 (53%), Gaps = 70/604 (11%)

Query: 5   KLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLT 64
           K+SL F    LCF S                +    E  PF    LP+  R +DLV+RLT
Sbjct: 3   KISLYF---VLCFWSF--------------GAQAQVENLPFRNADLPLEVRVKDLVARLT 45

Query: 65  LDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAA 124
           L+EK+  + + +PAIPRLGIPAY+WW+EALHGVA          T+   T FPQ I  AA
Sbjct: 46  LEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVAR---------TLEKVTVFPQAIGMAA 96

Query: 125 SFDSYLWYRIGQAIGLEARALYN----AGQA----IGMTFWAPNINIFRDPRWGRGQETP 176
           +FD+    ++G     E RAL+N    AG+      G+T+W PNINIFRDPRWGRGQET 
Sbjct: 97  TFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPNINIFRDPRWGRGQETY 156

Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
           GEDP +T K   + VRG++G+  +       L++ AC KH+  +    +    R+ FDAR
Sbjct: 157 GEDPYLTAKMGAAIVRGLEGEDPH------YLKSVACAKHYAVHSGPEY---NRHSFDAR 207

Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
            ++ DL DTY P F   V + +  G+MCAYNR+NG P C +  LL    R QW F GY+T
Sbjct: 208 PSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGNDPLLVDILRNQWHFDGYVT 267

Query: 297 SDCDAVSIIYDAEGYAKSPED--AVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEID 354
           SDC A+     AE +   PE   A+ D L AG D+ CG+ L       VK+    E +I+
Sbjct: 268 SDCWALKDF--AEFHKTHPEHTIAMSDALLAGTDLECGN-LYHLLAEGVKKGLHSERDIN 324

Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
            +L  LF++  ++G+F+    + P+  IG +V+   AH+  A + A++ IVLL+N + +L
Sbjct: 325 VSLSRLFTILFKIGMFDPAERV-PYSSIGREVLECEAHKQHAERMAKESIVLLENKNHIL 383

Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPGCDTV 471
           PL  SK  S+ALIGPNA++ +T L NY G     +TP  +L+  + + +   Y PG   V
Sbjct: 384 PLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMSLKRRLGDKIKINYLPGVGIV 443

Query: 472 --ACSSASIDKAVDIAKGADHVVLMMGLDQTQE-------------KEELDRVDLVLPGR 516
                + S  +    A  +D +V + G+    E                 DR  + LP  
Sbjct: 444 DKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAGDAGAAGYGGFASGDRTTMQLPLV 503

Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           Q EL+ ++ +   +P+I+V + G    ++F    +N  ++L A Y G+A   A+ +V+FG
Sbjct: 504 QIELLKKLKKTG-RPLIIVNMSGSV--MSFEWESQNADALLQAWYGGQAAGDAIVDVLFG 560

Query: 577 DHNP 580
             NP
Sbjct: 561 HCNP 564


>gi|391872736|gb|EIT81831.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 798

 Score =  355 bits (911), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 207/553 (37%), Positives = 310/553 (56%), Gaps = 23/553 (4%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           S P ++T   C T+     RA  LVS LT +E ++   N+    PR+G+PAY+ W+EALH
Sbjct: 54  SGPLSKTL-VCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALH 112

Query: 96  GVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
           GVA      F + G    +TSFPQ I T A+ +  L ++I   I  + RA  NAG+  G+
Sbjct: 113 GVA---HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR-YGL 168

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
             ++PNIN FR P WGRGQETPGED   +   YA  Y+ G+QG     G     L+  A 
Sbjct: 169 DVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG-----GVDANPLKLIAT 223

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  YD++NW   +R   D ++T QDLA+ Y P F    +  +   +MC+YN VNG+P
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283

Query: 274 SCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
           SC++   L    R  + F   GY++ DC AV  +++  GYA +   A  D ++AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G    +H + +   +++   +++R +  L++  +R G F+G  +  P+  I    V S  
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPL--PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
            Q L+ +AA   IVLLKN  G+LPL    S + ++ALIGP AN+   +LGNY GP+   I
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLI 460

Query: 450 TPLQALQNYVENTVYYPGCDTVAC--SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
           +PLQA Q+  E  + Y           S S   A+  AK AD ++   G+D T E E  D
Sbjct: 461 SPLQAFQD-SEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQD 519

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R ++  P  Q  LIT++A+   KP+I++ + GG VD +  K ++N+ +++W GYPG++G 
Sbjct: 520 RSNITWPSNQLSLITKLADLG-KPLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGG 578

Query: 568 VALAEVIFGDHNP 580
            ALA++I G   P
Sbjct: 579 QALADIITGKRAP 591


>gi|169767016|ref|XP_001817979.1| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
 gi|121805502|sp|Q2UR38.1|XYND_ASPOR RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|83765834|dbj|BAE55977.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 798

 Score =  355 bits (911), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 207/553 (37%), Positives = 310/553 (56%), Gaps = 23/553 (4%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           S P ++T   C T+     RA  LVS LT +E ++   N+    PR+G+PAY+ W+EALH
Sbjct: 54  SGPLSKTL-VCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALH 112

Query: 96  GVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
           GVA      F + G    +TSFPQ I T A+ +  L ++I   I  + RA  NAG+  G+
Sbjct: 113 GVA---HADFSDAGGFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR-YGL 168

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
             ++PNIN FR P WGRGQETPGED   +   YA  Y+ G+QG     G     L+  A 
Sbjct: 169 DVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG-----GVDANPLKLIAT 223

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  YD++NW   +R   D ++T QDLA+ Y P F    +  +   +MC+YN VNG+P
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283

Query: 274 SCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
           SC++   L    R  + F   GY++ DC AV  +++  GYA +   A  D ++AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G    +H + +   +++   +++R +  L++  +R G F+G  +  P+  I    V S  
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPL--PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
            Q L+ +AA   IVLLKN  G+LPL    S + ++ALIGP AN+   +LGNY GP+   I
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLI 460

Query: 450 TPLQALQNYVENTVYYPGCDTVAC--SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
           +PLQA Q+  E  + Y           S S   A+  AK AD ++   G+D T E E  D
Sbjct: 461 SPLQAFQD-SEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQD 519

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R ++  P  Q  LIT++A+   KP+I++ + GG VD +  K ++N+ +++W GYPG++G 
Sbjct: 520 RSNITWPSNQLSLITKLADLG-KPLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGG 578

Query: 568 VALAEVIFGDHNP 580
            ALA++I G   P
Sbjct: 579 QALADIITGKRAP 591


>gi|348684866|gb|EGZ24681.1| hypothetical protein PHYSODRAFT_325770 [Phytophthora sojae]
          Length = 805

 Score =  355 bits (911), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 236/597 (39%), Positives = 320/597 (53%), Gaps = 45/597 (7%)

Query: 8   LVFPLL-CLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLD 66
           +V PLL  +   SL+T   +T P      +N   +  PFC T+L  + R  DL+SRL L 
Sbjct: 2   IVLPLLLAIASASLVTPTAATIPRACVGKAN---QELPFCNTSLSTADRVEDLLSRLPLQ 58

Query: 67  EKISQLVNSAPAIPR-----LGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
           EK + L  +A A PR     +G+P Y W +  +HGV          GT    TSFP  + 
Sbjct: 59  EKATLL--TARASPRGNMSSIGLPEYNWGANCVHGVQSTC------GT-NCPTSFPNPVN 109

Query: 122 TAASFDSYLWYRIGQAIGLEARAL--------YNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
             A FD  + + + Q IG E RAL        Y  G  +G+  W+PNINI RDPRWGR  
Sbjct: 110 LGAIFDPQVVFDMAQVIGWELRALWLEGATENYKGGPHLGLDCWSPNININRDPRWGRNT 169

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           ETP EDPLV  KY V+Y RG+Q       +    LQA    KH+ AY  +N+ G  R +F
Sbjct: 170 ETPSEDPLVNSKYGVAYTRGLQEGKRQDPRF---LQAVVTLKHYAAYSYENYGGVNRMEF 226

Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
           DA V+  D ADTY P F S V  G A G+MC+YN VNGIP CA++ L+    R   GF G
Sbjct: 227 DAIVSPYDFADTYFPAFRSSVVDGNAKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDG 286

Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
           Y+TSD  AV  I D   YA S  +A    + AG D+N G   +   K  V   +L E  +
Sbjct: 287 YVTSDSGAVEAISDMHHYADSQCEAARLAILAGTDINSGKSYEACLKTLVDDNQLEEKAL 346

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
           D AL +   +R  LGLF+     QP+  +    V + A + L+L A +  +V+L+N+  +
Sbjct: 347 DDALRHTLKLRFELGLFDPIDD-QPYWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASV 405

Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS--------ITPLQALQ--NYVENTV 463
           LPL   K V LA++GP+A S + LLGNY G  C           TPL A++  N   NT 
Sbjct: 406 LPL--QKGVKLAVLGPHAKSKRGLLGNYLGQMCHGDYDEVGCVQTPLDAIRAANGASNTT 463

Query: 464 YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITR 523
           +  GC     S+A  +KAV  AK AD VVL +G+D++ E E  DR ++ LP  Q +L+ R
Sbjct: 464 FAEGCGISGNSTAGFEKAVAAAKEADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQR 523

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           V  A  +P ++VL+ GG   I   +      +++ A YPG  GA A+A+V+FGD NP
Sbjct: 524 V-HAVGRPTVVVLINGGV--IGAEEIIERTDALVEAFYPGFFGARAMADVLFGDTNP 577


>gi|398406144|ref|XP_003854538.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
 gi|339474421|gb|EGP89514.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
          Length = 884

 Score =  355 bits (911), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 222/572 (38%), Positives = 314/572 (54%), Gaps = 32/572 (5%)

Query: 28  QPPFSCDPSNPSTETFPFC-----KTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRL 82
            PP      +P   T PFC      T+L    R   L+S++T++EK + LV+ A  +PR+
Sbjct: 123 HPPPKGVKRSPVCLTDPFCANKACDTSLSQDDRIAALISQMTVEEKATNLVDGALGLPRI 182

Query: 83  GIPAYEWWSEALHGVAGVGKGIFF---NGT-IRGATSFPQVILTAASFDSYLWYRIGQAI 138
           G+P YEWW+EALHGVAG  +G+ F   NG+    ATSFP  IL  A+FD  L Y +   I
Sbjct: 183 GLPPYEWWNEALHGVAG-SRGVSFDSPNGSDFSYATSFPLPILMGAAFDDPLIYDVASII 241

Query: 139 GLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDT 198
           G EARA  N   + G  FW PN+N F DPRWGRG E P ED     +Y  S V G+QG  
Sbjct: 242 GKEARAFANYAHS-GYDFWTPNMNTFLDPRWGRGLEVPTEDSFHAQRYVASLVPGLQG-- 298

Query: 199 FNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
             G +     Q  A CKHF  YD++    T R+  +   T QDL + Y P F++CV+   
Sbjct: 299 --GKEKTDHKQIIATCKHFAVYDVE----TNRHAQNYEPTPQDLGEYYLPAFKTCVRDVN 352

Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSP 315
              IMC+YN V G+P+CA    L    R QW F   + Y+TSDC+AV  I+    +  + 
Sbjct: 353 VGSIMCSYNAVYGVPACASEYFLQDVLRDQWNFNEPYHYVTSDCEAVKDIWTPHNFTDTE 412

Query: 316 EDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
             A    L AG D NCG S+LQ +T  +V      E+++D +L  L++    +G F+G P
Sbjct: 413 PAAAAVALNAGTDTNCGTSYLQLNT--SVANNWTTEAQMDISLTRLYNALFTVGYFDGQP 470

Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
                G   AD V +P  Q  A +AA +GI LLKN  GLLPL KS + S+ALIGP AN+ 
Sbjct: 471 EYD--GLSFAD-VSTPFAQATAYRAASEGITLLKND-GLLPLKKSYN-SVALIGPWANAT 525

Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLM 494
             + G Y G +   ++PL A Q    +  +  G    + ++     A+  A+ AD ++  
Sbjct: 526 TQMQGIYQGIAPYLVSPLAAAQAQWGHISFTNGTAINSTNTTGFASALSAARDADVIIYA 585

Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
            G+D + EKE  DR  +  PG Q +L+ +++E   KP+++V   GG VD +    ++N+ 
Sbjct: 586 GGIDSSIEKESRDRTSISWPGNQLDLVQQLSELG-KPLVVVQFGGGQVDDSALLRNKNVN 644

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNP-GEFTV 585
           S++WAGYPG+ G  AL +V+ G  +P G  T+
Sbjct: 645 SLVWAGYPGQDGGSALIDVLVGKQSPAGRLTI 676


>gi|2723496|dbj|BAA24107.1| beta-1,4-xylosidase [Aspergillus oryzae]
          Length = 798

 Score =  355 bits (910), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 207/553 (37%), Positives = 310/553 (56%), Gaps = 23/553 (4%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           S P ++T   C T+     RA  LVS LT +E ++   N+    PR+G+PAY+ W+EALH
Sbjct: 54  SGPLSKTL-VCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALH 112

Query: 96  GVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
           GVA      F + G    +TSFPQ I T A+ +  L ++I   I  + RA  NAG+  G+
Sbjct: 113 GVA---HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR-YGL 168

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
             ++PNIN FR P WGRGQETPGED   +   YA  Y+ G+QG     G     L+  A 
Sbjct: 169 DVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG-----GVDANPLKLIAT 223

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  YD++NW   +R   D ++T QDLA+ Y P F    +  +   +MC+YN VNG+P
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283

Query: 274 SCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
           SC++   L    R  + F   GY++ DC AV  +++  GYA +   A  D ++AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G    +H + +   +++   +++R +  L++  +R G F+G  +  P+  I    V S  
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPL--PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
            Q L+ +AA   IVLLKN  G+LPL    S + ++ALIGP AN+   +LGNY GP+   I
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLI 460

Query: 450 TPLQALQNYVENTVYYPGCDTVAC--SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
           +PLQA Q+  E  + Y           S S   A+  AK AD ++   G+D T E E  D
Sbjct: 461 SPLQAFQD-SEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQD 519

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R ++  P  Q  LIT++A+   KP+I++ + GG VD +  K ++N+ +++W GYPG++G 
Sbjct: 520 RSNITWPSNQLSLITKLADLG-KPLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGG 578

Query: 568 VALAEVIFGDHNP 580
            ALA++I G   P
Sbjct: 579 QALADIITGKRAP 591


>gi|365135698|ref|ZP_09343911.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363612160|gb|EHL63713.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 643

 Score =  355 bits (910), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 216/551 (39%), Positives = 302/551 (54%), Gaps = 50/551 (9%)

Query: 52  ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
            +QRAR LV+++TL+EK+SQ+   APAI RLGIPAY WW+E LHGV   G          
Sbjct: 4   FAQRARALVAQMTLEEKVSQMRYDAPAIERLGIPAYNWWNECLHGVGRSGT--------- 54

Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTFWAPNINIF 164
            AT FPQ I  AASFD  L   + QAI  EARA YN        G   G+TFW+PNIN+F
Sbjct: 55  -ATVFPQPIGMAASFDESLLEHVAQAISDEARAKYNQYKTFGETGIYQGLTFWSPNINLF 113

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDPL+TG+   +++RG+Q      G+     +  A  KHF A+   +
Sbjct: 114 RDPRWGRGHETYGEDPLLTGRMGTAFIRGLQE-----GEDSQYRKLDATVKHFAAH---S 165

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ F+A V+ +D+AD+Y   F  C++  + + +M AYNR+NG P+CA    L   
Sbjct: 166 GPEAGRHSFNAEVSAEDMADSYLWAFRYCIEHAKPAAVMGAYNRINGEPACASSTYLKGV 225

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
              +W F GY+ SDC A+  I +     K+ +++    +  G  +NCG       KAAV+
Sbjct: 226 LYEEWKFDGYVVSDCGAIQDINENHHVTKNEKESAALAVNNGCQLNCGKAYH-WVKAAVE 284

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
              + E  +  A+  LF  R RLG+F+ +     +  I  +V+    H+ L  + AQ+ I
Sbjct: 285 DGLISEDTVTCAVERLFEARFRLGMFDSDCV---YDSIPMNVIECRKHRELNRKMAQESI 341

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY 464
           VLLKN+ G+LPL   K++  A+IGPNA+    LLGNY G      T L+ +Q+     VY
Sbjct: 342 VLLKNN-GILPLNPEKTI--AVIGPNADDKTVLLGNYNGTPSHWTTLLRGIQDQARGEVY 398

Query: 465 YPGCDTVACSSA------SIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRV 509
           Y     +    A       + +A+  AK AD VVL +GL    E EE          DR 
Sbjct: 399 YARGSVLVEKEALPWAEKPLHEAIYTAKAADVVVLCLGLSPLLEGEEGDAYNGADSGDRK 458

Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
           D+ LP  QQ+L+  + +  +KPV+LV + GG VD+  A  D    +IL   YPG  G  A
Sbjct: 459 DISLPDIQQQLLCAILD-TEKPVVLVNVSGGCVDLRQA--DERCAAILQCFYPGAEGGNA 515

Query: 570 LAEVIFGDHNP 580
           LA+++FG  +P
Sbjct: 516 LADILFGRVSP 526


>gi|3135209|dbj|BAA28267.1| beta-xylosidase A [Aspergillus oryzae]
          Length = 798

 Score =  355 bits (910), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 207/553 (37%), Positives = 310/553 (56%), Gaps = 23/553 (4%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           S P ++T   C T+     RA  LVS LT +E ++   N+    PR+G+PAY+ W+EALH
Sbjct: 54  SGPLSKTL-VCDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALH 112

Query: 96  GVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
           GVA      F + G    +TSFPQ I T A+ +  L ++I   I  + RA  NAG+  G+
Sbjct: 113 GVA---HADFSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGR-YGL 168

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
             ++PNIN FR P WGRGQETPGED   +   YA  Y+ G+QG     G     L+  A 
Sbjct: 169 DVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITGIQG-----GVDANPLKLIAT 223

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  YD++NW   +R   D ++T QDLA+ Y P F    +  +   +MC+YN VNG+P
Sbjct: 224 AKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSVMCSYNAVNGVP 283

Query: 274 SCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
           SC++   L    R  + F   GY++ DC AV  +++  GYA +   A  D ++AG D++C
Sbjct: 284 SCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAADSIRAGTDIDC 343

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G    +H + +   +++   +++R +  L++  +R G F+G  +  P+  I    V S  
Sbjct: 344 GVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDGKTS--PYRNITWSDVVSTN 401

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPL--PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
            Q L+ +AA   IVLLKN  G+LPL    S + ++ALIGP AN+   +LGNY GP+   I
Sbjct: 402 AQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQMLGNYYGPAPYLI 460

Query: 450 TPLQALQNYVENTVYYPGCDTVAC--SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
           +PLQA Q+  E  + Y           S S   A+  AK AD ++   G+D T E E  D
Sbjct: 461 SPLQAFQD-SEYKITYTIGTNTTTDPDSTSQSTALTTAKEADLIIFAGGIDNTLETEAQD 519

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R ++  P  Q  LIT++A+   KP+I++ + GG VD +  K ++N+ +++W GYPG++G 
Sbjct: 520 RSNITWPSNQLSLITKLADLG-KPLIVLQMGGGQVDSSALKNNKNVNALIWGGYPGQSGG 578

Query: 568 VALAEVIFGDHNP 580
            ALA++I G   P
Sbjct: 579 QALADIITGKRAP 591


>gi|348684872|gb|EGZ24687.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 805

 Score =  354 bits (908), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 229/574 (39%), Positives = 311/574 (54%), Gaps = 45/574 (7%)

Query: 30  PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR-----LGI 84
           P +C  S+   + FPFC  +L  S+R  DL+ RL LDEK++ L  +A A P+     +G+
Sbjct: 23  PRAC--SDTEHQKFPFCDASLSTSERVEDLLRRLPLDEKVTLL--TARASPKGNMSSIGL 78

Query: 85  PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
           P Y W +  +HGV          GT   ATSFP  +   A FD    + + Q IG E RA
Sbjct: 79  PEYNWGANCVHGVQSTC------GT-NCATSFPNPVNLGAIFDPQAVFDMAQVIGWELRA 131

Query: 145 L--------YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG 196
           L        Y AG  +G+  W+PNINI RDPRWGR  ETP EDPLV  KY V+Y RG+Q 
Sbjct: 132 LWLEGARENYAAGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTRGLQE 191

Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
                GK K  LQA    KH+ AY  +++ G  R  F+A+V+  D ADTY P F + V +
Sbjct: 192 -----GKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAQVSRYDFADTYLPAFHASVVE 246

Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
           G+A G+MC+YN VNG+P CA+  L +K  R   GF GYITSD  A+  IY    Y KS  
Sbjct: 247 GKAKGVMCSYNSVNGMPMCANEQLNTKLLREALGFDGYITSDSGAIEGIYRQRHYTKSLC 306

Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
           +A    + +G DVN GS  +K     V   +LPE  +D A+     +R  LGLF+     
Sbjct: 307 EAGRLAIMSGTDVNSGSVYKKCLADLVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDD- 365

Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
           QP+  +    V     + L+L+  +  IVLL+N   +LPL K K   LA+IGP+A + + 
Sbjct: 366 QPYWHVAPSEVGKTESKQLSLELTRKSIVLLQNHGNVLPLRKGK--KLAVIGPHAKAKRA 423

Query: 437 LLGNYAGPSCRS--------ITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAK 486
           LLGNY G  C           TPL+A+   N   NTVY  G      S+A  D A   A+
Sbjct: 424 LLGNYLGQMCHGDYLEVGCVQTPLEAITAANGASNTVYAKGSGINDTSTADFDAAEAAAR 483

Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
           GAD VVL +G+D + E+E  DR ++ +P  Q +L+ RV  A  KP ++VL  GG V    
Sbjct: 484 GADAVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAG-KPTVVVLFNGGVVGA-- 540

Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   +   +  A YPG  GA A+++++FGD  P
Sbjct: 541 EELILHTDGVAEAFYPGFFGAQAVSDILFGDAIP 574


>gi|326789672|ref|YP_004307493.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
 gi|326540436|gb|ADZ82295.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
          Length = 704

 Score =  354 bits (908), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 216/553 (39%), Positives = 308/553 (55%), Gaps = 54/553 (9%)

Query: 55  RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
           +A  LV+++ L EK S L   +PAI RLG+P Y WWSEALHGVA  G           AT
Sbjct: 8   KAGQLVAQMDLLEKASMLRYDSPAIKRLGVPTYNWWSEALHGVARAGV----------AT 57

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRDP 167
            FPQ I  AA FD    Y I   I  EARA YN            GMT WAPNINIFRDP
Sbjct: 58  VFPQAIGMAAMFDEEYLYEIADIIATEARAKYNEFAKKEDRDIYKGMTLWAPNINIFRDP 117

Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           RWGRG ET GEDP +T +  V+++ G+QGD  +        +A+AC KHF  +   +   
Sbjct: 118 RWGRGHETYGEDPYLTSRLGVAFIHGLQGDENHH-----YWKAAACAKHFAVH---SGPE 169

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R+ FDA V+ +DL +TY P FE+ V +G+ +G+M AYNRVNG P+C  + LL    + 
Sbjct: 170 EERHHFDAVVSKKDLYETYLPAFEAAVTKGKVAGMMGAYNRVNGEPACGSKVLLQDILKE 229

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQK 346
           +WGF GY+ SDC A+   +       +  ++    +  G  +NCG ++L  H   A K+ 
Sbjct: 230 EWGFDGYVVSDCWAIRDFHTEHMVTHTATESAALAINNGCQLNCGNTYL--HMLQAYKEG 287

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            + E  I ++   L ++RM+LGLF+ N     + KI  +V     H+ +AL  A+  +VL
Sbjct: 288 LVTEETITKSAQKLMAIRMKLGLFDKNC---EYNKIPYEVNDCKVHRDIALDVARRSMVL 344

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VY 464
           LKN+ G+LPL   ++ ++ +IGP ANS   L GNY G + R  T L+ +Q+YV +   VY
Sbjct: 345 LKNN-GILPLNLKQTKAIGVIGPTANSRTVLQGNYFGTASRYTTFLEGIQDYVGDAARVY 403

Query: 465 Y-PGCDTVACS-------SASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LD 507
           Y  GC     S       +  + +A+ +A+ +D V+L +GLD + E E+          D
Sbjct: 404 YAEGCHLFKNSISGLSWENDRLSEALIVAEQSDVVILCLGLDASIEGEQGDTGNAFAAGD 463

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           + DL L GRQQ L+  V +   KP IL+L  G  + I  A+      +IL   YPG++G 
Sbjct: 464 KSDLNLIGRQQLLLEEVLKIG-KPTILILSSGSAMAIHTAQ--EYCEAILETWYPGQSGG 520

Query: 568 VALAEVIFGDHNP 580
            ALA+++FG+++P
Sbjct: 521 KALAQLLFGEYSP 533


>gi|301090543|ref|XP_002895482.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
 gi|262098232|gb|EEY56284.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
          Length = 809

 Score =  354 bits (908), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 222/574 (38%), Positives = 313/574 (54%), Gaps = 45/574 (7%)

Query: 30  PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR-----LGI 84
           P +C  S+P  + F FC  +L  ++R  DL+ RL LDEK++ L  +A A P+     +G+
Sbjct: 24  PRAC--SSPEHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLL--TARASPKGNMSSIGL 79

Query: 85  PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
           P Y W +  +HGV          GT   ATSFP  +   A FD    + + Q +G E RA
Sbjct: 80  PEYNWGANCVHGVQSTC------GT-NCATSFPNPVNLGAIFDPRAVFDMAQVVGWELRA 132

Query: 145 L--------YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG 196
           L        Y  G  +G+  W+PNINI RDPRWGR  ETP EDPLV  KY V+Y +G+Q 
Sbjct: 133 LWLEGARENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQE 192

Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
                GK K  LQA    KH+ AY  +++ G  R  F+A V+  D ADTY P FE+ V  
Sbjct: 193 -----GKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVH 247

Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
           G+A G+MC+YN VNG+P CA+  L SK  R   GF GYITSD  A++ IY    Y K+  
Sbjct: 248 GKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKTLC 307

Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
           +A    + +G DVN GS  ++     V   +LPE  +D A+     +R  LGLF+     
Sbjct: 308 EAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDD- 366

Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
           QP+  +  + V +   + L+L  ++  IVLL+N   +LPL K K   LA+IGP+A + + 
Sbjct: 367 QPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKGK--KLAVIGPHAAAKRA 424

Query: 437 LLGNYAGPSCRS--------ITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAK 486
           LLGNY G  C           TPL+A+   N   NT+Y  G      S+A  D+A   A+
Sbjct: 425 LLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGINDTSTAGFDEAEAAAR 484

Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
            A+ VVL +G+D + E+E  DR ++ +P  Q +L+ RV  A  KP ++VL  GG V    
Sbjct: 485 KAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAG-KPTVVVLFNGGVVGA-- 541

Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   +   ++ A YPG  GA A+++++FGD  P
Sbjct: 542 EELILHTDGVVEAFYPGFFGAQAVSDILFGDAIP 575


>gi|358382857|gb|EHK20527.1| hypothetical protein TRIVIDRAFT_192759 [Trichoderma virens Gv29-8]
          Length = 860

 Score =  354 bits (908), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 220/566 (38%), Positives = 306/566 (54%), Gaps = 40/566 (7%)

Query: 35  PSNPSTETFP-----------FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLG 83
           P+ P T +FP            C TT  I+ RA  +V  +TL+EK++ + +SA    RLG
Sbjct: 117 PNPPLTPSFPDCTRDPLCSNDVCDTTKSIAARAAAIVKPMTLNEKVANVGSSASGSGRLG 176

Query: 84  IPAYEWWSEALHGVAGVGKGIFFN----GTIRGATSFPQVILTAASFDSYLWYRIGQAIG 139
           +PAY+W +EALHGVAG   G+ F          ATSFP  IL +A+FD  L   +  AI 
Sbjct: 177 LPAYQWQNEALHGVAG-STGVQFQSPLGANFSAATSFPMPILLSAAFDDALVQSVATAIS 235

Query: 140 LEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
            EARA  N G A G+ FW PNIN FRDPRWGRG ETPGED      Y +S + G+QG   
Sbjct: 236 TEARAFANYGFA-GLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLSLINGLQG--- 291

Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
             G      +  + CKHF AYD++N     R   +   T QD+AD Y P FE+CV+  + 
Sbjct: 292 --GIDPDFFRTISTCKHFAAYDIEN----GRTANNLSPTQQDMADYYLPMFETCVRDAKV 345

Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPE 316
             IMCAYN VNG+P+CAD  LL    R  +GF     Y+ SDCDAV  +YD   YA +  
Sbjct: 346 GSIMCAYNSVNGVPACADSYLLQSVLRDGYGFTEDFNYVVSDCDAVENVYDPHHYAANLT 405

Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
            A    L AG D++CGS       A+V+     E+ +D++L  L+S  +++G F+     
Sbjct: 406 QAAAMSLNAGTDLDCGSSYNV-LNASVQAGMTTEATLDKSLIRLYSALIKVGWFD----- 459

Query: 377 QP--FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
           QP  +  +G   V +   + LA  AA  G+ LLKN  G LPL  +   ++A+IGP  N+ 
Sbjct: 460 QPAKYSSLGWGNVNTTQTRALAHDAATGGMTLLKND-GTLPLSPTLQ-NVAVIGPWVNAT 517

Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLM 494
             L GNYAG +   + PL   Q    N  Y  G    +  ++  + A+  A  +D +V +
Sbjct: 518 TQLQGNYAGTAPVLVNPLTVFQQKWRNVKYAQGTAINSQDTSGFNAAISAASSSDVIVYL 577

Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
            G+D + E E  DR  +  PG Q  LI+++A    KP+++V   GG +D +    +  + 
Sbjct: 578 GGIDISVENEGFDRTAITWPGNQLSLISQLANLG-KPLVIVQFGGGQIDDSSLLSNSKVN 636

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNP 580
           SILWAGYPG+ G  AL +V+ G + P
Sbjct: 637 SILWAGYPGQEGGNALFDVLTGANPP 662


>gi|358397360|gb|EHK46735.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
           206040]
          Length = 865

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 221/569 (38%), Positives = 313/569 (55%), Gaps = 36/569 (6%)

Query: 26  STQPPFSCDPSNPSTETFPFCK-----TTLPISQRARDLVSRLTLDEKISQLVNSAPAIP 80
           S  PP +  PS P     P C      TTL +++RA  +V  +TLDEK++ + +SA    
Sbjct: 121 SANPPLT--PSFPDCTRDPLCSNAICDTTLSMAERAAAIVKPMTLDEKVANVGSSASGSA 178

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFN----GTIRGATSFPQVILTAASFDSYLWYRIGQ 136
           RLG+PAY+W +EALHGVAG   G+ F          ATSFP  IL +A+FD  L   +  
Sbjct: 179 RLGLPAYQWQNEALHGVAG-STGVQFQSPLGANFSAATSFPMPILLSAAFDDALVQNVAT 237

Query: 137 AIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG 196
           AI  EARA  N G A G+ FW PNIN FRDPRWGRG ETPGED      Y ++ + G+QG
Sbjct: 238 AISTEARAFANYGFA-GLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLALISGLQG 296

Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
                G      +  A CKHF AYD++N     R   +   T QD+AD Y P FE+CV+ 
Sbjct: 297 -----GINPDFFRIIATCKHFAAYDIEN----GRTGNNLNPTQQDMADYYLPMFETCVRD 347

Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAK 313
            +   +MCAYN V+GIP+CA   LL    R  +GF     Y+ SDCDAV  ++D   YA 
Sbjct: 348 AKVGSVMCAYNAVDGIPACASEYLLQDVLRDGFGFTEDFNYVVSDCDAVDNVFDPHHYAS 407

Query: 314 SPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
           +  +A    L AG D++CGS       A+V+     E+ ++++L  L+S  +++G F+  
Sbjct: 408 NLTEAAALSLNAGTDLDCGSSYNV-LNASVEAALTSEAALNQSLVRLYSALIKVGYFD-- 464

Query: 374 PTMQP--FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
              QP  +  +    V +  +Q LA  AA  G+ LLKN  G LPL ++ S ++A+IGP  
Sbjct: 465 ---QPSEYKSLSWANVNTTQNQALAHDAATGGMTLLKND-GTLPLSRTLS-NVAIIGPWV 519

Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHV 491
           N+   + GNYAG +   + PL   Q    N  Y  G    +  ++    A+  A  +D +
Sbjct: 520 NATTQMQGNYAGTAPFLVNPLDVFQQKWGNVKYAQGTAINSQDTSGFSAALSAASSSDVI 579

Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
           V + G+D T E E  DR  +V PG Q +LI+++A    KP+++V   GG +D +    + 
Sbjct: 580 VYLGGIDITVENEGFDRGSIVWPGNQLDLISQLANLG-KPLVIVQFGGGQIDDSSLLSNP 638

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           N+ SILWAGYPG+ G  A+ +V+ G + P
Sbjct: 639 NVRSILWAGYPGQDGGNAVFDVLTGANPP 667


>gi|329745495|gb|AEB98984.1| xylosidase precursor [synthetic construct]
          Length = 804

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 209/564 (37%), Positives = 308/564 (54%), Gaps = 19/564 (3%)

Query: 27  TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
           T P  S D  N    +   C  +     RA  L+S  TLDE I+   N+   + RLG+P 
Sbjct: 50  TIPLSSPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPV 109

Query: 87  YEWWSEALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
           Y+ WSEALHG+    +  F + G+   ATSFPQ ILT A+ +  L ++I   I  + RA 
Sbjct: 110 YQVWSEALHGL---DRANFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAF 166

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
            NAG+  G+  +APNIN FR P  GRGQETPGED  +   YA  Y+ G+QG         
Sbjct: 167 NNAGR-YGLDVYAPNINTFRHPVRGRGQETPGEDVSLAAVYAYEYITGIQGP-----DPD 220

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
             L+ +A  KH+  YD++NW   +R   D  +T QDL++ Y P F    +  +   +MCA
Sbjct: 221 SNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCA 280

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
           YN VNG+P+CAD   L    R  +GF  HGY++SDCDA   IY+  GYA S   A  + +
Sbjct: 281 YNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAI 340

Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM--QPFGK 381
            AG D++CG+  Q H   ++    L   +I++ +  L++  ++ G F+ N T    P+  
Sbjct: 341 LAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRD 400

Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS----KSVSLALIGPNANSAKTL 437
           +    V       ++ QAA  GIVLLKNS+ +LPL +      + ++ALIGP AN+   L
Sbjct: 401 LTWSDVLETDAWNISYQAATQGIVLLKNSNKVLPLTEKAYPPSNTTVALIGPWANATTQL 460

Query: 438 LGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGL 497
           LGNY G +   I+P  A +    N  +       + +++    A+  A+ AD ++   G+
Sbjct: 461 LGNYYGNAPYMISPRVAFEEAGYNVNFAERTGISSTNTSGFAAALSAAQSADVIIYAGGI 520

Query: 498 DQTQEKEELDRVDLVLPGRQQELITRVAEAA-KKPVILVLLCGGPVDITFAKYDRNIGSI 556
           D T E E LDR  +  PG Q +LI ++A +A  KP+I++ + GG VD +  K + N+ ++
Sbjct: 521 DNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSAL 580

Query: 557 LWAGYPGEAGAVALAEVIFGDHNP 580
           LW GYPG++G  AL ++I G  NP
Sbjct: 581 LWGGYPGQSGGFALRDIITGKKNP 604


>gi|372208556|ref|ZP_09496358.1| beta-glucosidase [Flavobacteriaceae bacterium S85]
          Length = 729

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 214/555 (38%), Positives = 303/555 (54%), Gaps = 42/555 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T+L   +R   LV  +TL EKI+QL + +P + RL IP Y WW+EALHGVA  GK  
Sbjct: 26  WLDTSLTFEERIHHLVKAMTLKEKIAQLDSGSPEVKRLDIPEYNWWNEALHGVARNGK-- 83

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   +T FPQ I  AA+FD  L  ++  AI  EARA +N  Q+IG       +TFW
Sbjct: 84  --------STVFPQAIGLAATFDPVLAKQVASAISDEARAKFNISQSIGNRGQYAGLTFW 135

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            PN+NIFRDPRWGRGQET GEDP +T +  V++V+G+QG+          L+++AC KHF
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTSQMGVAFVKGLQGNH------PKYLKSAACAKHF 189

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +     R+ F+A  + +DL +TY P FE+ VKQ    G+M AYN V G+P+ + 
Sbjct: 190 AVH---SGPEELRHHFNANPSKKDLYETYLPAFEALVKQANVEGVMSAYNAVYGVPAGSS 246

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
             LL +T R+ WGF GYI SDC A+  I+      K+  +A    LKAG+++NCG     
Sbjct: 247 EFLLKETLRKSWGFDGYIVSDCGALGDIFKGHKQVKTMPEAAAVALKAGVNLNCGYVYNG 306

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
             + AV+Q  + E  ID  L  L   R +LG F+      P+  I   V+ S  H  LA 
Sbjct: 307 ALEKAVQQGLVSEELIDTRLKQLLKTRFKLGFFDPKEA-NPYNAIPTSVIHSDDHIALAR 365

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           + AQ  IVLLKN +  LPL K+  V   + GP A+S+  LL NY G +   ++ L+ + +
Sbjct: 366 KTAQKSIVLLKNKNHTLPLDKNIKVPY-VTGPFASSSDVLLANYYGMTTNLVSVLEGIAD 424

Query: 458 YVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV----- 509
            V    +  Y  G      +    + A ++AK AD V+ ++GL    E EE+D +     
Sbjct: 425 KVSLGTSLNYRMGALPFNKNLNPKNWAPNVAKTADAVIAVVGLSADFEGEEVDAIASPNK 484

Query: 510 ----DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
               DL LP  Q + +  +A   K P+ILV+  G  V +    YD    +I+   YPGE 
Sbjct: 485 GDKKDLKLPQNQIDYVKEMAAKKKGPLILVVASGSAVALG-ELYDL-ADAIVLMWYPGEQ 542

Query: 566 GAVALAEVIFGDHNP 580
           G  A+A+V+FGD +P
Sbjct: 543 GGNAVADVLFGDVSP 557


>gi|451821678|ref|YP_007457879.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
 gi|451787657|gb|AGF58625.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
          Length = 710

 Score =  353 bits (906), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 214/555 (38%), Positives = 307/555 (55%), Gaps = 57/555 (10%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           ++A++LVS++TL E+  QL   APAI  L I  Y WW+E LHGVA  G           A
Sbjct: 14  EKAKELVSKMTLQERAEQLTYKAPAIKHLNISRYNWWNEGLHGVARAGT----------A 63

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
           T FPQ I  AA FD  L  +I   I  E RA YN            G+TFW+PN+NIFRD
Sbjct: 64  TVFPQAIGLAAIFDDELLEKIAGIIATEGRAKYNENSKKEDKDIYKGLTFWSPNVNIFRD 123

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T +  V++V+G+QGD       +  L+ +AC KHF  +   +  
Sbjct: 124 PRWGRGHETYGEDPYLTSRLGVAFVKGLQGD-------EKYLKIAACAKHFAVH---SGP 173

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R++F+A V+ +DL +TY P FE+CVK+     +M AYNR N  P C    LL    R
Sbjct: 174 EGLRHEFNAVVSKKDLYETYLPAFEACVKEADVEAVMGAYNRTNDEPCCGSSLLLKDILR 233

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQ 345
            +W F G++ SDC A++  +   G   +  ++    +K G D+NCG+ +LQ     A K+
Sbjct: 234 GKWQFKGHVVSDCWAIADFHLYHGVTSTATESAALAIKNGCDLNCGNVYLQ--MLLAYKE 291

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E +I RA   L + R+RLG+F+       F KI   +     H  ++L A++  IV
Sbjct: 292 GLVTEEDITRAAERLMATRIRLGMFDEEC---EFNKIPYTMNDCKEHHEVSLMASRKSIV 348

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--- 462
           +L+N +GLLPL KSK  S+ +IGPNA+S   L GNY G + + IT L+ +   V++    
Sbjct: 349 MLRN-NGLLPLDKSKLKSIGIIGPNADSELMLKGNYFGTASKYITVLEGIHEAVDSENIR 407

Query: 463 -VYYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE--------- 505
             Y  GC         +A     + +AV +A+ +D V+L +GLD + E E+         
Sbjct: 408 IFYSEGCHLYKDRVQDLAEPDDRMAEAVTVAEHSDVVILCLGLDSSIEGEQGDAGNSDGA 467

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            D+++L LPG+QQEL+ +V  A  KPVI+VL  G    +T    + N  +IL A YPG  
Sbjct: 468 GDKLNLNLPGKQQELLEKVI-ATGKPVIVVLGAGSA--LTLQGQEENCAAILNAWYPGSF 524

Query: 566 GAVALAEVIFGDHNP 580
           G  A+A++IFG  +P
Sbjct: 525 GGRAIADLIFGKCSP 539


>gi|255284060|ref|ZP_05348615.1| beta-glucosidase [Bryantella formatexigens DSM 14469]
 gi|255265405|gb|EET58610.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Marvinbryantia formatexigens DSM 14469]
          Length = 700

 Score =  353 bits (906), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 215/554 (38%), Positives = 310/554 (55%), Gaps = 58/554 (10%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           +RA  LV+++T++EK SQL   APAI RLGIPAY WW+EALHGVA  G+          A
Sbjct: 8   KRAEALVAQMTVEEKASQLKYDAPAIKRLGIPAYNWWNEALHGVARAGQ----------A 57

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRD 166
           T FPQ I   A+FD  L   I   I  E RA YNA  A        G+TFW+PN+NIFRD
Sbjct: 58  TVFPQAIGLGATFDEALLGEIADVIATEGRAKYNAYAAKEDRDIYKGLTFWSPNVNIFRD 117

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T +  V++V+G+QGD          ++A+AC KHF  +   +  
Sbjct: 118 PRWGRGHETYGEDPCLTSRLGVAFVKGLQGDGET-------MKAAACAKHFAVH---SGP 167

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R++F+A  + +D+ +TY P FE+ VK+     +M AYNR NG   CA   +L K  R
Sbjct: 168 EAVRHEFNAEASAKDMEETYLPAFEALVKEADVEAVMGAYNRTNGEACCAS-PVLQKILR 226

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
             WGF G+  SDC A+   ++      + +++    + +G D+NCG ++L  H   A + 
Sbjct: 227 EDWGFEGHFVSDCWAIRDFHEHHMLTATAKESAAMAINSGCDLNCGNTYL--HILHAYRD 284

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E  I  A   LF+ R  LGLF+G+     +  I   VV S  H  LA +AA +  V
Sbjct: 285 GLVSEETITEAAVRLFTTRFLLGLFDGS----EYDDIPYTVVESKEHLALAEKAALESAV 340

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYY 465
           LLKN+ G+LPL K +  ++ +IGPNA+S   L GNY G + R  T  Q LQ+Y+   V  
Sbjct: 341 LLKNN-GILPLKKERLRTVGVIGPNADSRAALAGNYHGTASRYETIQQGLQDYLGEDVRV 399

Query: 466 ---PGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
               GC       + +A +   + +A  +A+ +D V+L +GLD+T E EE          
Sbjct: 400 LTSVGCALSEDRTEKLALAGDRLAEAQIVAENSDVVILCLGLDETLEGEEGDTGNSYASG 459

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+  L+LP  Q++L+  VA A  KPV+L ++ G  +D+++A    +  +IL   YPG  G
Sbjct: 460 DKETLLLPEAQRDLMEAVA-ATGKPVVLCMMSGSDLDMSYAA--EHFDAILQLWYPGSQG 516

Query: 567 AVALAEVIFGDHNP 580
             A A+++FG+ +P
Sbjct: 517 GSAAAKLLFGEVSP 530


>gi|116621778|ref|YP_823934.1| glycoside hydrolase family 3 protein [Candidatus Solibacter
           usitatus Ellin6076]
 gi|116224940|gb|ABJ83649.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 850

 Score =  352 bits (904), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 204/456 (44%), Positives = 275/456 (60%), Gaps = 37/456 (8%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           +T   PF    L   +RA DLV+R+TLDEK+ Q+ NSAPAIPRLGIPAY+WW+EALHGVA
Sbjct: 20  TTSQLPFMDPDLSAERRAADLVARMTLDEKVLQMQNSAPAIPRLGIPAYDWWNEALHGVA 79

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI------ 152
             G           AT FPQ I  AA++D+ L +RI + I  EARA YN  +AI      
Sbjct: 80  RAGL----------ATVFPQAIGLAATWDATLMHRIAETISTEARAKYN--EAIRNDDHS 127

Query: 153 ---GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
              G+TFW+PNINIFRDPRWGRGQET GEDP +T + AV++++G+QG+  +  K+     
Sbjct: 128 RYRGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSRMAVAFIKGMQGEDPHYYKVI---- 183

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
             A  KH   Y + +   ++R++FD + + +DLADTY P F + + + RA  +MCAYNRV
Sbjct: 184 --ATAKH---YAVHSGPESSRHQFDVKPSPRDLADTYLPAFRASIVEARADSLMCAYNRV 238

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           +GIP+CA  +LL K  R +WGF G++ SDC AVS I+    Y      A    +KAG D+
Sbjct: 239 DGIPACASTDLLEKRLRGEWGFQGFVVSDCGAVSDIFRGHHYQPDAASASAVAVKAGTDL 298

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CG+  +     AVK   + E EI+R+L  LF  R +LG+F+  P   PF  I    V S
Sbjct: 299 TCGNEYRALVD-AVKTGLITEPEINRSLERLFVARFKLGMFD-PPERVPFSNIPYSEVDS 356

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             H+ +AL+AA+  IVLLKN  G LPL KS    +A+IGP A+ A+ LLGNY G S   +
Sbjct: 357 AGHRKIALEAARKSIVLLKND-GTLPL-KSSIKKIAVIGPAADDAEALLGNYNGFSSLQV 414

Query: 450 TPLQALQNYVENTV---YYPGCDTVACSSASIDKAV 482
           TPL  +++         Y  G +  A S A +  +V
Sbjct: 415 TPLAGIEHQWAGKAEVRYALGANYTAQSQAPLPASV 450



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 57/110 (51%), Gaps = 13/110 (11%)

Query: 481 AVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKK 530
           A++    AD  +  +GL+ + E EE+          DR +L LP  Q++LI   A A  K
Sbjct: 597 AIEAVSNADVTLAFVGLNPSLEGEEMPVSVPGFQGGDRTNLELPEPQEKLI-EAAIATGK 655

Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           PV++VL  G  V + FA   ++  ++L   Y GE    A+A+ + G +NP
Sbjct: 656 PVVVVLASGSAVAMNFAA--QHASALLETWYNGEETGTAIADTLAGINNP 703


>gi|301118693|ref|XP_002907074.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
 gi|262105586|gb|EEY63638.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
          Length = 809

 Score =  352 bits (903), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 221/574 (38%), Positives = 312/574 (54%), Gaps = 45/574 (7%)

Query: 30  PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR-----LGI 84
           P +C  S+P  + F FC  +L  ++R  DL+ RL LDEK++ L  +A A P+     +G+
Sbjct: 24  PRAC--SSPEHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLL--TARASPKGNMSSIGL 79

Query: 85  PAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA 144
           P Y W +  +HGV          GT   ATSFP  +   A FD    + + Q +G E RA
Sbjct: 80  PEYNWGANCVHGVQSTC------GT-NCATSFPNPVNLGAIFDPRAVFDMAQVVGWELRA 132

Query: 145 L--------YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG 196
           L        Y  G  +G+  W+PNINI RDPRWGR  ETP EDPLV  KY V+Y +G+Q 
Sbjct: 133 LWLEGARENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQE 192

Query: 197 DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
                GK K  LQA    KH+ AY  +++ G  R  F+A V+  D ADTY P FE+ V  
Sbjct: 193 -----GKDKRFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVH 247

Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
           G+A G+MC+YN VNG+P CA+  L SK  R   GF GYITSD  A++ IY    Y K+  
Sbjct: 248 GKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKTLC 307

Query: 317 DAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
           +A    + +G DVN GS  ++     V   +LPE  +D A+     +R  LGLF+     
Sbjct: 308 EAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDD- 366

Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
           QP+  +  + V +   + L+L  ++  IVLL+N   +LPL K K   LA+IGP+A + + 
Sbjct: 367 QPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKGK--KLAVIGPHAAAKRA 424

Query: 437 LLGNYAGPSCRS--------ITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAK 486
           LLGNY G  C           TPL+A+   N   NT+Y  G      S+   D+A   A+
Sbjct: 425 LLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGINDTSTGGFDEAEAAAR 484

Query: 487 GADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITF 546
            A+ VVL +G+D + E+E  DR ++ +P  Q +L+ RV  A  KP ++VL  GG V    
Sbjct: 485 KAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAG-KPTVVVLFNGGVVGA-- 541

Query: 547 AKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   +   ++ A YPG  GA A+++++FGD  P
Sbjct: 542 EELILHTDGVVEAFYPGFFGAQAVSDILFGDAIP 575


>gi|410723195|ref|ZP_11362440.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
           Maddingley MBC34-26]
 gi|410603399|gb|EKQ57833.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
           Maddingley MBC34-26]
          Length = 709

 Score =  352 bits (902), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 209/551 (37%), Positives = 304/551 (55%), Gaps = 54/551 (9%)

Query: 56  ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
           A++LVS++TL E+  QL   +PAI  L +P Y WW+E LHGVA  G           AT 
Sbjct: 16  AKELVSKMTLQERAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATV 65

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPR 168
           FPQ I  AA FD      I   I  E RA YN        G   G+T+W+PN+NIFRDPR
Sbjct: 66  FPQAIGLAAIFDEEFLGEIADIISTEGRAKYNEYSKKDDRGIYKGLTYWSPNVNIFRDPR 125

Query: 169 WGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGT 228
           WGRG ET GEDP +T +  V++++G+QG+    GK    L+ +AC KHF  +   +    
Sbjct: 126 WGRGHETYGEDPYLTSRLGVAFIKGLQGE----GKY---LKLAACAKHFAVH---SGPEG 175

Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
            R++F+A V  +DL +TY P FE+CVK+     +M AYNR NG P C  + LL    R +
Sbjct: 176 LRHEFNAVVEKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILRGK 235

Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKL 348
           WGF G++ SDC A++  +       +  ++V   ++ G D+NCG+ +  +   A K+  +
Sbjct: 236 WGFKGHVVSDCWALADFHLHHMITSTATESVALAIENGCDLNCGN-MYLNLLLAYKEGLV 294

Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
            E +I  A   L + R +LG+F+ +     + +I  +V     H  +AL A++  +VLLK
Sbjct: 295 TEEQITTAAERLMTTRFKLGMFDEDC---EYNRIPYEVNDCKEHNEIALIASRKSMVLLK 351

Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YY 465
           N  G LPL KS   S+A+IGPNANS   L GNY+G + +  T L+ + N V + +   Y 
Sbjct: 352 ND-GTLPLDKSSLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHNAVGDNIRVYYS 410

Query: 466 PGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRV 509
            GC       + +A     + +A+ +A+ +D V+L +GLD T E E+          D+ 
Sbjct: 411 EGCHLFKDKVEDLAGPDDRLSEAISVAERSDVVILCLGLDSTIEGEQGDAGNSYGAGDKE 470

Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
            L LPGRQQ L+ +V E   KPVI+VL  G    +TF   +    +IL A YPG  G  A
Sbjct: 471 SLNLPGRQQNLLEKVLEVG-KPVIVVLGAGSA--LTFNGAEEKCAAILNAWYPGSHGGTA 527

Query: 570 LAEVIFGDHNP 580
           +A+++FG  +P
Sbjct: 528 VADILFGKCSP 538


>gi|167519969|ref|XP_001744324.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777410|gb|EDQ91027.1| predicted protein [Monosiga brevicollis MX1]
          Length = 721

 Score =  351 bits (901), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 211/567 (37%), Positives = 313/567 (55%), Gaps = 35/567 (6%)

Query: 36  SNPSTE-TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS--------APAIPRLGIPA 86
           SNP+ +   PFC  +L    RA DL  RLTLDE ++Q +N+        AP +PRLG+  
Sbjct: 34  SNPNCQLDLPFCDLSLDFRDRAWDLAQRLTLDE-LAQQLNTYSFTPQAYAPGVPRLGLRN 92

Query: 87  YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           Y + +E LHG+         N     AT +PQV   AA+ ++ L + +   +G E RA+ 
Sbjct: 93  YSYHAEGLHGIRDA------NVVNYPATLYPQVTAMAATANASLIHEMSTIMGTELRAVN 146

Query: 147 NAGQAIGMTF--------WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDT 198
           N  Q +G  F        + P +NI RD RWGR QE+  EDP + G YAV++V G++   
Sbjct: 147 NRAQELGEIFGRGGALSIYGPTMNIIRDGRWGRSQESVSEDPWLNGLYAVNFVLGLEQR- 205

Query: 199 FNGGKLKGKLQASACCKHFTAYDLDNWKGT-TRYKFDARVTMQDLADTYQPPFESCVKQG 257
            N  K    LQA+  CKH  AY  + +  T TR+ F+A +   D+ DTY P F +CV+ G
Sbjct: 206 -NSSKY---LQAATSCKHLFAYSFEGYNNTLTRHSFNAVIDELDIHDTYLPAFRACVELG 261

Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED 317
               IMC+YN VNGIP+CA  ++ +   R+ WGF G I SDCDAV+ IY+   Y ++PED
Sbjct: 262 HVQQIMCSYNSVNGIPACARGDVQNDRVRKAWGFEGLIVSDCDAVADIYNTHNYTRTPED 321

Query: 318 AVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
           AV   L+ G D++CG F  +H  +AV+Q     + + +++  +  +R  LG F+ + T  
Sbjct: 322 AVTVALQGGCDLDCGDFYSQHLASAVQQNLTTLAALQQSMTRVLEMRFLLGEFDPD-TSV 380

Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
           P+ ++G + + +P  +  +L+A+++ +VLL+N   LLP+  S  + +ALIGP  N    +
Sbjct: 381 PYRQLGREAIDTPFARDSSLRASRESVVLLENRIKLLPVTLSADIKVALIGPYVNLTTIM 440

Query: 438 LGNYAGPSCRSITP-LQALQNY-VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMM 495
           +G     +   IT   Q  Q   + +    PGC+  A    ++DKAV IA  AD VVL +
Sbjct: 441 MGGKLDYTPSFITTYFQGFQAIGITHLTSSPGCNITAPLPGALDKAVQIATQADLVVLTL 500

Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEA-AKKPVILVLLCGGPVDITFAKYD-RNI 553
           GL    E E  DR  L LP  QQ+L   ++ A     +++VL+ GGPV +   KY     
Sbjct: 501 GLSSDIEHEGGDRETLGLPTPQQDLYDAISAAIPSSKLVVVLVNGGPVSVDRIKYGIART 560

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNP 580
            +I+ A Y G++   ALAE IFG +NP
Sbjct: 561 PTIIEAFYGGQSAGTALAETIFGQNNP 587


>gi|425780840|gb|EKV18836.1| Beta-xylosidase XylA [Penicillium digitatum PHI26]
 gi|425783077|gb|EKV20946.1| Beta-xylosidase XylA [Penicillium digitatum Pd1]
          Length = 792

 Score =  351 bits (901), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 212/573 (36%), Positives = 316/573 (55%), Gaps = 20/573 (3%)

Query: 16  CFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS 75
            ++  +  VD + P    D SN        C TT     RA  L S  TL+E ++   N 
Sbjct: 36  LYSETMAHVDFSFP----DCSNGPLSKTIVCDTTAKPHDRAAALTSMFTLEELVNSTGNV 91

Query: 76  APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIG 135
            PA+PRLG+P Y+ WSEALHG+      +  +G    ATSFP  IL  A+ +  L  +IG
Sbjct: 92  IPAVPRLGLPPYQVWSEALHGLDRAN--LTESGDYSWATSFPSPILIMAALNRTLINQIG 149

Query: 136 QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ 195
           + I  + RA  N G+  G+  +APNIN FR P WGRGQETPGED  +   Y V Y+ G+Q
Sbjct: 150 EIISTQGRAFNNGGR-YGLDVYAPNINSFRHPVWGRGQETPGEDVQLCSIYGVEYITGIQ 208

Query: 196 GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVK 255
           G     G     L+ +A  KHF  YDL+NW   +R   +  ++  DLA  Y P F + V+
Sbjct: 209 G-----GLNPRDLKLAATAKHFAGYDLENWGNHSRLGNNVAISSFDLASYYTPQFITAVR 263

Query: 256 QGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAK 313
             R   +M +YN VNG+PS A+  LL    R  W F   GY++SDCDAV  +++  GYA 
Sbjct: 264 DARVHSVMSSYNAVNGVPSSANSFLLQTLLRETWNFVEDGYVSSDCDAVFNVFNPHGYAS 323

Query: 314 SPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
           S   A    ++AG D++CG+  Q +   ++   ++  SEI+RA+   +S  + LG F+G+
Sbjct: 324 SASLAAAKSIQAGTDIDCGATYQLYLNESLSHDEISRSEIERAVTRFYSTLVSLGYFDGD 383

Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
            +         DVV + A  + + +AA +GIVLLKN  G LPL  + + S+ALIGP AN 
Sbjct: 384 NSKYRHLHW-PDVVATDAWNI-SYEAAVEGIVLLKND-GTLPL-SNNTRSVALIGPWANV 439

Query: 434 AKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVL 493
             TL GNY G +     PL ALQ    +  Y  G +  + S++  + A+  A  ++ ++ 
Sbjct: 440 TTTLQGNYYGAAPYLTGPLAALQASNLDVNYAFGTNISSDSTSGFEAALSAAGKSEVIIF 499

Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
             G+D T E E +DR  +  PG Q +LI ++++   KP++++ + GG VD +  K ++N+
Sbjct: 500 AGGIDNTVEAEGVDRESITWPGNQLQLIEQLSKLG-KPLVVLQMGGGQVDSSSLKANKNV 558

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNP-GEFTV 585
            S++W GYPG++G  A+ +++ G   P G  TV
Sbjct: 559 NSLVWGGYPGQSGGPAILDILTGKRAPAGRLTV 591


>gi|330947691|ref|XP_003306937.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
 gi|311315273|gb|EFQ84970.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
          Length = 756

 Score =  351 bits (901), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 201/517 (38%), Positives = 284/517 (54%), Gaps = 19/517 (3%)

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
           EK+  LV+ +  + RLG+PAY WW EALHGVAG   GI F G+ R ATSFP  +L +A+F
Sbjct: 56  EKLENLVSKSKGVARLGLPAYNWWGEALHGVAGA-PGINFTGSYRTATSFPMPLLMSAAF 114

Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
           D  L ++I   IG EARA  N G A  + FW P+IN FRDPRWGRG ETPGED L    Y
Sbjct: 115 DDDLIHQIAIVIGNEARAFGNGGIA-PVDFWTPDINPFRDPRWGRGSETPGEDILRIKGY 173

Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
             S + G++GD       K + +  A CKH+  YD++NW GT R+ FDA++T QDLA+ +
Sbjct: 174 TKSLLSGLEGD-------KAQRKIIATCKHYVGYDVENWNGTDRHHFDAKITTQDLAEYF 226

Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVS 303
            PPF+ C +  +    MC+YN VNG+P+CAD  +L    R+ W +   + YITSDC+AV 
Sbjct: 227 MPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDSNNYITSDCEAVK 286

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
            I     Y  + ++A       GMD++C          A  Q  L  S IDRAL   +  
Sbjct: 287 DISLRHKYVATLQEATAIAFNNGMDLSCEYSGTSDIPGAFSQGLLNVSVIDRALTRQYEG 346

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
            +  G F+G      +  +G   + +P  Q L LQ A +G+ LLKN    LPL       
Sbjct: 347 LVHAGYFDG--AAATYAHLGVQDINTPEAQKLVLQVAAEGLTLLKNDD-TLPLSLKSGSK 403

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPG--CDTVACSSASIDKA 481
           +A++G  AN+   L G Y+GP+    TP+ A      +     G    T   +      A
Sbjct: 404 VAMVGFWANTTSKLSGIYSGPAPYLHTPVYAGNKLGLDMAVATGPILQTSGAADNWTTTA 463

Query: 482 VDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGP 541
           ++ AK +D ++   GLD +   E  DR D+  P  Q +LIT++A A  KP++++ L G  
Sbjct: 464 LNAAKKSDFILYFGGLDPSAAAEGSDRTDISWPSAQIDLITKLA-ALGKPLVVIAL-GDM 521

Query: 542 VDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           VD T     + + S++WA +PG+ G  A+ +VI G+H
Sbjct: 522 VDHTPILKMKGVNSLIWANWPGQDGGTAVMQVITGEH 558


>gi|336261464|ref|XP_003345521.1| hypothetical protein SMAC_07509 [Sordaria macrospora k-hell]
 gi|380088197|emb|CCC13872.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 762

 Score =  350 bits (899), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 218/570 (38%), Positives = 311/570 (54%), Gaps = 28/570 (4%)

Query: 16  CFTSLLTRVDSTQPPFSCDPS--NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLV 73
           C  +L+  +D    PF   P   N    +   C  TL   QRA  LV+ +T +EK+  LV
Sbjct: 20  CSAALVYAIDL---PFQTYPDCVNGPLASLKVCDATLSPPQRAAALVAAMTTEEKLQNLV 76

Query: 74  NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF---NGTIRGATSFPQVILTAASFDSYL 130
           + +   PR+G+PAY WWSEALHGVA    G  F   NGT   +TSFP  +L AA+FD  L
Sbjct: 77  SKSKGAPRIGLPAYNWWSEALHGVA-YAPGTQFRSGNGTFNSSTSFPMPLLMAATFDDEL 135

Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
             R+G+ IG+E RA  NAG + G  +W PN+N F+DPRWGRG ETPGED L   +YA S 
Sbjct: 136 IERVGEVIGIEGRAFGNAGFS-GFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASM 194

Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
           +RG++G        + + +  A CKH+ A D ++W G+TR+ F+A+VT+QDLA+ Y  PF
Sbjct: 195 IRGLEGPV-----RERERRIVATCKHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPF 249

Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYD 307
           + C +  +   IMC+YN VNG+P+CA+  L+    R  W +     YITSDC+AV  I  
Sbjct: 250 QQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILRDHWNWTAPGNYITSDCEAVLDISA 309

Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRL 367
              YAK+  +      +AG+D +C          A  Q  L +S +DRAL  L+   +++
Sbjct: 310 NHHYAKTNAEGTALAFEAGIDSSCEYEGSSDILGAWTQGLLKQSTVDRALRRLYEGLVQV 369

Query: 368 GLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS---VSL 424
           G F+GN     +  +G + V  P  Q +ALQAA +GIVLLKN    LPL   K+   + L
Sbjct: 370 GYFDGN--RSEYASLGWNHVNRPKSQEVALQAAVEGIVLLKNDK-TLPLGVKKNGPKLKL 426

Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAV 482
           A+IG  AN  KTL G Y+G      +P+ A Q   +   T   P              A+
Sbjct: 427 AMIGFWANDPKTLSGGYSGTPAFEHSPVYATQAMGFKVTTAGGPVLQNSTSKDTWTQAAL 486

Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
             AK A++++   G D +   E  DR  +  P  Q +LIT +++   KP+++V + G  +
Sbjct: 487 AAAKDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLITDLSKLG-KPLVVVQM-GDQL 544

Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           D T     + I SILWA +P  AG + + +
Sbjct: 545 DNTPLLASKAINSILWANWPVPAGRLPVTQ 574


>gi|348684865|gb|EGZ24680.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 769

 Score =  350 bits (898), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 226/565 (40%), Positives = 306/565 (54%), Gaps = 41/565 (7%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR-----LGIPAYEWWSEA 93
           + +  PFC T+L  + R  DL+SRL L EK + L  +A A PR     +G+P Y W +  
Sbjct: 30  TNQELPFCNTSLSTADRVEDLLSRLPLQEKATLL--TARASPRGNMSSIGLPEYNWGANC 87

Query: 94  LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL-------- 145
           +HGV          GT    TSFP  +   A FD  + + + Q IG E RAL        
Sbjct: 88  VHGVQSTC------GT-NCPTSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEGATEN 140

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
           Y  G  +G+  W+PNINI RDPRWGR  ETP EDPLV  KY V+Y RG+Q       +  
Sbjct: 141 YKGGPHLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQEGKRQDPRF- 199

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
             LQA    KH+ AY  +N+ G  R +FDA V+  D ADTY P F S V  G A G+MC+
Sbjct: 200 --LQAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGNAKGVMCS 257

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
           YN VNGIP CA++ L+    R   GF GY+TSD  AV  I D   YA S  +A    + A
Sbjct: 258 YNSVNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYADSQCEAARLAILA 317

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           G D+N G   +   K  V   +L E  +D AL +   +R  LGLF+     QP+  +   
Sbjct: 318 GTDINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFDPIDD-QPYWNVTPS 376

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
            V + A + L+L A +  +V+L+N+  +LPL   K V LA++GP+A S + LLGNY G  
Sbjct: 377 EVNTAAAKALSLNATRKSLVMLQNNASVLPL--QKGVKLAVLGPHAKSKRGLLGNYLGQM 434

Query: 446 CRS--------ITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMM 495
           C           TPL A++  N   NT +  GC     S+A  +KAV  AK AD VVL +
Sbjct: 435 CHGDYDEVGCVQTPLDAIRAANGASNTTFAEGCGISGNSTAGFEKAVAAAKEADAVVLFL 494

Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
           G+D++ E E  DR ++ LP  Q +L+ RV  A  +P ++VL+ GG   I   +      +
Sbjct: 495 GIDKSIEGEVGDRNNIDLPNIQMQLLQRV-HAVGRPTVVVLINGGV--IGAEEIIERTDA 551

Query: 556 ILWAGYPGEAGAVALAEVIFGDHNP 580
           ++ A YPG  GA A+A+V+FGD NP
Sbjct: 552 LVEAFYPGFFGARAMADVLFGDTNP 576


>gi|291240563|ref|XP_002740191.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 747

 Score =  349 bits (896), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 218/579 (37%), Positives = 304/579 (52%), Gaps = 49/579 (8%)

Query: 11  PLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
           P  C+ FTS+L  + ST               FPF  T+LP S+R  DLV RLTL+E + 
Sbjct: 4   PGKCVVFTSVLFSLIST-----------ILGDFPFRNTSLPWSERVDDLVGRLTLEEIVL 52

Query: 71  QLVNS-------APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
           Q+          AP I RLGI  Y W +E LHG    G           ATSFPQ    A
Sbjct: 53  QMSRGGTGSNGPAPPIDRLGIGPYSWNTECLHGDVAAGP----------ATSFPQAFGLA 102

Query: 124 ASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQETP 176
           A+FD+ L  +I  A   E RA YN        G   G++ ++P INI R P WGR QET 
Sbjct: 103 ATFDAVLIEQIANATAYEVRAKYNNYAKHKEYGDHKGLSCFSPVINIARHPLWGRIQETY 162

Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
           GEDP ++G  A SYV G+QG+          + A+A CKHF AY       ++R  FDA+
Sbjct: 163 GEDPYLSGTLAASYVNGLQGNH------PRYVTANAGCKHFDAYAGPEDIPSSRSTFDAK 216

Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
           V+ +DL  T+ P F  C++ G  S +MC+YN +NG+P+CA++ LL+   R +W F GY+ 
Sbjct: 217 VSDRDLRMTFLPAFHECIQAGTHS-LMCSYNSINGVPACANKKLLTDILRTEWNFTGYVI 275

Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESE 352
           SD  AV  +YDA  Y K   D  +  + +G+++   S L+ +    T  AVKQ  +    
Sbjct: 276 SDQSAVEKVYDAHHYTKDMLDTAIACVNSGLNLELSSNLEDNVMMQTTKAVKQGNVTMKT 335

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
           +   +  LF  RMRLG F+  P M P+ K+   ++ S  HQ L+L+AA    VLLKN + 
Sbjct: 336 VKARVSPLFYTRMRLGEFDP-PEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENR 394

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNY-AGPSCRSITPLQALQNYVENTVYYPGCDTV 471
            LPL K K   LA++GP A++   L G+Y A P+  ++TP   L     NT Y  GCD  
Sbjct: 395 FLPL-KEKIDKLAVVGPLADNVDALYGDYSATPNNYTVTPRNGLARLAGNTSYASGCDNP 453

Query: 472 ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKP 531
            C      +      GAD VV+ +G     E E  DR +L LPG+Q  L+    +   KP
Sbjct: 454 KCRKYDSGQVKSAVSGADMVVVCVGTGTDIESEGNDRHELALPGKQLSLLQDAVKFGTKP 513

Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
           VIL+L   GP+D+++A  +  + +I+   +P +A   AL
Sbjct: 514 VILLLFNAGPLDVSWAVENPAVQTIVACFFPAQATGDAL 552


>gi|310792973|gb|EFQ28434.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 728

 Score =  348 bits (893), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 202/521 (38%), Positives = 288/521 (55%), Gaps = 26/521 (4%)

Query: 63  LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF---NGTIRGATSFPQV 119
           ++++EK+  LV+++  +  LG+P + WW+E LHGV G   G+ F   +     ATSFP  
Sbjct: 1   MSVEEKVRNLVDASAGVKSLGLPPHGWWNEGLHGV-GFSPGVLFAQDSEPFGYATSFPLP 59

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 179
           ILTAASFD  L+  IGQ IG E RA  N G A G  FW PN+N FRDPRWGRGQETPGED
Sbjct: 60  ILTAASFDDDLFNAIGQVIGREGRAFSNYGYA-GFNFWTPNMNAFRDPRWGRGQETPGED 118

Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
            LV   Y  SYV G+QG       +       A CKHF AYD++    T R   +   T 
Sbjct: 119 VLVVSNYVQSYVTGLQGSDPTDKVII------AACKHFAAYDIE----TARRANNYNPTQ 168

Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYIT 296
           QDL D Y P F  CV+      +MC+YN V+GIP+C+   LL +  R  WGF   + ++ 
Sbjct: 169 QDLQDYYLPAFRRCVRDSHVGTVMCSYNSVDGIPACSSEYLLKEVLRDTWGFTNDYQFVV 228

Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRA 356
           SDC AV+ ++    +  + +DA    + AG D+ CGS    H   ++  K++ +  +D A
Sbjct: 229 SDCGAVTDVWLLHNFTNTEQDAASVSMAAGTDLECGSSYL-HLNGSLADKQVTQERVDEA 287

Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
           L  L+     +G F+G+        +G   V +   Q +A +AA+ G+ LLKN  G+LPL
Sbjct: 288 LTRLYKALFTVGYFDGS----SHSSLGWSDVSTIDAQQIACEAARAGMTLLKND-GVLPL 342

Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVYYPGCDTVACSS 475
              K  S+ALIGP AN+   + GNY G +    +PL A  Q       Y  G D  + S 
Sbjct: 343 ADGKYKSVALIGPFANATTQMQGNYFGRAPFVRSPLWAFTQQSSLQVNYAAGTDINSTSD 402

Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
           +    A+  AK +D V+   G+D T E E LDRV +  PG Q +LI++++    KP+++ 
Sbjct: 403 SGFADALAAAKNSDIVIFCGGIDTTIEAETLDRVSITWPGNQLDLISQLSMLG-KPLVVA 461

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
              GG VD T    + N+ ++ WAG PG+AG +A+ +++ G
Sbjct: 462 QFGGGQVDDTALVDNANVNALFWAGLPGQAGGLAMYDLVVG 502


>gi|292495632|sp|Q0CMH8.2|XYND_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
          Length = 793

 Score =  348 bits (893), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 208/553 (37%), Positives = 311/553 (56%), Gaps = 21/553 (3%)

Query: 36  SNPSTETFPFCKT-----TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
           S P  E  P  KT     +     RA  LVS  TL+E ++   N+   +PRLG+P Y+ W
Sbjct: 48  SFPDCENGPLSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVW 107

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
           SE+LHGV          G    ATSFPQ ILT A+ +  L ++IG  +  +ARA  N G+
Sbjct: 108 SESLHGVYRANWAS--EGDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGR 165

Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
             G+  +APNIN FR P WGRGQETPGED   +   YA  Y+ G+QG     G     L+
Sbjct: 166 -YGLDTYAPNINSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQG-----GVDPETLK 219

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
             A  KH+  YD++NW G +R   D ++T QDL++ Y P F    +  +   +MC+YN V
Sbjct: 220 LVATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAV 279

Query: 270 NGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
           NG+PSC++   L    R  +GF   GY++ DC AV   ++   YA +   A  D ++AG 
Sbjct: 280 NGVPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAANESSASADSIRAGT 339

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D++CG+  Q H   A  + ++   +I+R +  L++  +RLG F+GN + Q      +DV 
Sbjct: 340 DIDCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSS-QYRDLTWSDVQ 398

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            + A  + + +AA +G VLLKN  G LPL  S   S+ALIGP AN+   + GNY GP+  
Sbjct: 399 TTDAWNI-SHEAAVEGTVLLKND-GTLPLADSIR-SVALIGPWANATTQMQGNYYGPAPY 455

Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
             +PL AL+    +  Y  G +  + ++A    A+  A+ AD ++   G+D T E E LD
Sbjct: 456 LTSPLAALEASDLDVHYAFGTNISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALD 515

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R+++  PG Q +LI +++ A  KP++++ + GG VD +  K++ N+ ++LW GYPG++G 
Sbjct: 516 RMNITWPGNQLDLINQLS-ALGKPLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGG 574

Query: 568 VALAEVIFGDHNP 580
            AL ++I G   P
Sbjct: 575 TALLDIIRGVRAP 587


>gi|115397385|ref|XP_001214284.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
 gi|114192475|gb|EAU34175.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
          Length = 776

 Score =  348 bits (892), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 208/553 (37%), Positives = 311/553 (56%), Gaps = 21/553 (3%)

Query: 36  SNPSTETFPFCKT-----TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
           S P  E  P  KT     +     RA  LVS  TL+E ++   N+   +PRLG+P Y+ W
Sbjct: 66  SFPDCENGPLSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVW 125

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
           SE+LHGV          G    ATSFPQ ILT A+ +  L ++IG  +  +ARA  N G+
Sbjct: 126 SESLHGVYRANWAS--EGDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGR 183

Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
             G+  +APNIN FR P WGRGQETPGED   +   YA  Y+ G+QG     G     L+
Sbjct: 184 -YGLDTYAPNINSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQG-----GVDPETLK 237

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
             A  KH+  YD++NW G +R   D ++T QDL++ Y P F    +  +   +MC+YN V
Sbjct: 238 LVATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAV 297

Query: 270 NGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
           NG+PSC++   L    R  +GF   GY++ DC AV   ++   YA +   A  D ++AG 
Sbjct: 298 NGVPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAANESSASADSIRAGT 357

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D++CG+  Q H   A  + ++   +I+R +  L++  +RLG F+GN + Q      +DV 
Sbjct: 358 DIDCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGN-SSQYRDLTWSDVQ 416

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            + A  + + +AA +G VLLKN  G LPL  S   S+ALIGP AN+   + GNY GP+  
Sbjct: 417 TTDAWNI-SHEAAVEGTVLLKND-GTLPLADSIR-SVALIGPWANATTQMQGNYYGPAPY 473

Query: 448 SITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
             +PL AL+    +  Y  G +  + ++A    A+  A+ AD ++   G+D T E E LD
Sbjct: 474 LTSPLAALEASDLDVHYAFGTNISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALD 533

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R+++  PG Q +LI +++ A  KP++++ + GG VD +  K++ N+ ++LW GYPG++G 
Sbjct: 534 RMNITWPGNQLDLINQLS-ALGKPLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGG 592

Query: 568 VALAEVIFGDHNP 580
            AL ++I G   P
Sbjct: 593 TALLDIIRGVRAP 605


>gi|189201569|ref|XP_001937121.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187984220|gb|EDU49708.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 756

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 200/517 (38%), Positives = 282/517 (54%), Gaps = 19/517 (3%)

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
           EK+  LV+ +  + RLG+PAY WW EALHGVAG   GI F G  R ATSFP  +L +A+F
Sbjct: 56  EKLDNLVSKSKGVARLGLPAYNWWGEALHGVAGA-PGINFTGPYRTATSFPMPLLMSAAF 114

Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
           D  L ++I   IG EARA  N G A  + FW P+IN FRDPRWGRG ETPGED L    Y
Sbjct: 115 DDDLIHQIAIVIGNEARAFGNGGIA-PVDFWTPDINPFRDPRWGRGSETPGEDILRIKGY 173

Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
             S + G++GD       K + +  A CKH+  YD+++W GT R+ FDA++T QDLA+ +
Sbjct: 174 TKSLLSGLEGD-------KAQRKIIATCKHYVGYDMEDWNGTDRHSFDAKITTQDLAEYF 226

Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVS 303
            PPF+ C +  +    MC+YN VNG+P+CAD  +L    R+ W +   + YITSDC+AV 
Sbjct: 227 MPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDSNNYITSDCEAVK 286

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
            I     Y  + ++A       GMD++C          A  Q  L  S IDRAL   +  
Sbjct: 287 DISLRHKYVATLQEATAIAFNNGMDLSCEYSGSSDIPGAFSQGLLNVSVIDRALTRQYEG 346

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
            +  G F+G      +  +G   + +P  Q L LQ A +G+ LLKN    LPL       
Sbjct: 347 LVHAGYFDG--AAATYANLGVQDINTPEAQKLVLQVAAEGLTLLKNDD-TLPLSLKSGSK 403

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS--IDKA 481
           +A++G  AN +  L G Y+GP+     P+ A      +     G       +A     KA
Sbjct: 404 VAMVGFWANDSSKLSGIYSGPAPYLHNPVYAGNKLGLDMAVATGPILQKSGAADNWTTKA 463

Query: 482 VDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGP 541
           +D AK +D ++   GLD +   E  DR D+  P  Q +LIT++A A  KP++++ L G  
Sbjct: 464 LDAAKKSDTILYFGGLDPSAAAEGSDRTDISWPSAQIDLITKLA-ALGKPLVVIAL-GDM 521

Query: 542 VDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           VD       + + S++WA +PG+ G  A+ +VI G+H
Sbjct: 522 VDHMPILNMKGVNSLIWANWPGQDGGTAVMQVITGEH 558


>gi|268610157|ref|ZP_06143884.1| glycoside hydrolase family 3 protein [Ruminococcus flavefaciens
           FD-1]
          Length = 690

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 210/557 (37%), Positives = 314/557 (56%), Gaps = 59/557 (10%)

Query: 49  TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNG 108
           +L   +RA DL +RLTL+E+ SQL   APA+ RL IPAY WWSE LHGVA  G       
Sbjct: 8   SLSAQERAEDLTNRLTLEEQASQLKYDAPAVDRLDIPAYNWWSEGLHGVARAGT------ 61

Query: 109 TIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNI 161
               AT FPQ I  AA FD     ++G  IG EARA YN   A        G+  W+PN+
Sbjct: 62  ----ATMFPQAIGLAAMFDEEAMNKVGSIIGDEARAKYNEYSAHGDHDIYKGLCLWSPNV 117

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           NIFRDPRWGRGQET GEDP +T +  V++ +G+QG+    G++   L+ +AC KH   + 
Sbjct: 118 NIFRDPRWGRGQETYGEDPYLTTRLGVAFAKGLQGE----GEV---LKTAACAKHLAVH- 169

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             +     R++FDA  + +D+ +TY P FE+ VK+ +  G+M AYNRVNG P+CA + L+
Sbjct: 170 --SGPEAIRHEFDAVASPKDMEETYLPAFEALVKEAKVEGVMGAYNRVNGEPACASKFLM 227

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
            K    +WGF GY  SDC A+   +      K+  ++    LK G D+NCG ++L  H  
Sbjct: 228 GKL--DEWGFDGYFVSDCWAIRDFHTNHMVTKTAPESAAMALKLGCDLNCGNTYL--HLL 283

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A  +  + + +I +A  +L   R+RLG+F+       + K+   +V +  ++  A + +
Sbjct: 284 HAYNEGLINDEDIKKACTHLMRTRVRLGMFDDE---TEYDKLDYSIVANEENKAYARKCS 340

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN--- 457
           +  +V+LKN +G+LPL  SK  ++ +IGPNA+S   L GNY G + R IT L+ +Q+   
Sbjct: 341 ERSMVMLKN-NGILPLDPSKIKTIGVIGPNADSRPALEGNYNGRADRYITFLEGIQDAFG 399

Query: 458 ----YVENT-VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE------- 505
               Y E + +Y   C  +A +   + +A  + + +D VVL +GLD T E EE       
Sbjct: 400 GRVLYSEGSHLYKDRCMGLAVADDRLSEAEIVTEHSDVVVLCVGLDATIEGEEGDTGNEF 459

Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
              D+ DL LP  Q++L+  V     KPVI+V   G  +++     + +  +++ A YPG
Sbjct: 460 SSGDKNDLRLPEAQRKLVETVMRKG-KPVIIVTAAGSAINV-----EADCDALIHAWYPG 513

Query: 564 EAGAVALAEVIFGDHNP 580
           + G  ALA+++FG  +P
Sbjct: 514 QFGGTALADILFGKISP 530


>gi|225878709|dbj|BAH30674.1| beta-xylosidase [Aspergillus aculeatus]
          Length = 785

 Score =  346 bits (888), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 203/538 (37%), Positives = 297/538 (55%), Gaps = 18/538 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C  T     RA  L S +TL+E ++   N  PAIPRLG+P Y+ W+EALHG+        
Sbjct: 62  CDRTASAHDRAAALTSMMTLEELMNSTGNRIPAIPRLGLPPYQIWNEALHGL--YLANFT 119

Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
            +G    +TSFP  ILT A+ +  L ++I Q I  + RA  NAG+  G+  ++PNIN FR
Sbjct: 120 ESGPFSWSTSFPSPILTMATLNRTLIHQIAQIIATQGRAFNNAGR-YGLNAFSPNINAFR 178

Query: 166 DPRWGRGQETPGEDP-LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
            P WGRGQETPGED   +   YA  Y+ G+QG+  N   +       A  KH+  YD++N
Sbjct: 179 HPVWGRGQETPGEDANCLCSAYAYEYITGLQGNATNPKII-------ATAKHYAGYDIEN 231

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           W+  +R+  D  +T QDLA+ + P F   V+  +   +M +YN VNG+PS A+  LL   
Sbjct: 232 WRQRSRFGNDLNITQQDLAEYFTPQFVVAVRDAQVRSVMPSYNAVNGVPSSANTFLLQTL 291

Query: 285 ARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
            R  WGF   GY+ SDCDAV  +++  GYA +   A    L+AG D++CG         +
Sbjct: 292 VRDSWGFIQDGYMASDCDAVYNVFNPHGYAANLSSASAMSLRAGTDIDCGISYLTTLNES 351

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
           + Q ++  SEI+RA+   +S  +  G F+G     P+  +    V       +A +AA  
Sbjct: 352 LTQGQISRSEIERAVTRFYSNLVSAGYFDGPDA--PYRDLSWSDVVRTNRWNVAYEAAVA 409

Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
           G+VLLKN  G+LPL KS    +ALIGP AN+ + + GNY G +    +PL A+Q      
Sbjct: 410 GVVLLKND-GVLPLSKSVQ-RVALIGPWANATEQMQGNYHGVAPYLTSPLAAVQASGLEV 467

Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
            Y  G +  +  +     A+  A+ +D ++   G+D T E EELDR ++  PG Q ELI 
Sbjct: 468 NYAFGTNITSNVTNCFAAALAAAEKSDIIIFAGGIDNTLEAEELDRANITWPGNQLELIH 527

Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           R+ E   KP++++ + GG VD +  K    +G++LW GYPG+AG  AL +++ G   P
Sbjct: 528 RLGELG-KPLVVLQMGGGQVDSSALKASEKVGALLWGGYPGQAGGQALWDILTGQRAP 584


>gi|255957137|ref|XP_002569321.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591032|emb|CAP97251.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 791

 Score =  346 bits (887), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 202/544 (37%), Positives = 307/544 (56%), Gaps = 18/544 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C TT     RA  L++  T +E ++   N  PAIPRLG+P Y+ W+EALHG+       F
Sbjct: 61  CDTTAKPHDRAAALIAMFTFEELVNSTGNVMPAIPRLGLPPYQVWNEALHGLDRANLTEF 120

Query: 106 FNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR 165
             G    ATSFP  ILT A+ +  L  +IG  +  + RA  N G+  G+  ++PNIN FR
Sbjct: 121 --GDYSWATSFPSPILTMAALNRTLINQIGGIVSTQGRAFNNGGR-YGLDVYSPNINSFR 177

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
            P WGRGQETPGED  +   Y + Y+ G+QG     G    +L+ +A  KHF  YD++NW
Sbjct: 178 HPVWGRGQETPGEDIQLCSVYGLEYITGLQG-----GLDPKELKLAATAKHFAGYDIENW 232

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
              +R   D  ++  D A  Y P F + V+  R   +M +YN VNG+P+ A+  LL    
Sbjct: 233 GNHSRLGNDMSISAFDFASYYAPQFVTAVRDARVHSVMASYNAVNGVPASANSFLLQTLL 292

Query: 286 RRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
           R  W F   GY++SDCD+V  +++  GYA S   A    ++AG D++CG+  Q +   + 
Sbjct: 293 RDTWNFVEDGYVSSDCDSVYNVFNPHGYASSASLAAAKSIQAGTDIDCGATYQLYLNQSF 352

Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG-ADVVCSPAHQVLALQAAQD 402
            Q ++  SEI+RA    +S  + LG F+G+ +   +  +  +DVV + A  + + +AA +
Sbjct: 353 TQGEISRSEIERAATRFYSNLVSLGYFDGDNSK--YRDLDWSDVVATDAWNI-SYEAAVE 409

Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
           GIVLLKN  G LPL K    S+ALIGP AN   T+ GNY G +     PL ALQ    + 
Sbjct: 410 GIVLLKND-GTLPLSKDTH-SVALIGPWANVTTTMQGNYYGAAPYLTGPLAALQASDLDV 467

Query: 463 VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT 522
            Y  G +  + +++  + A+  A+ +D V+   G+D + E E +DR  +  PG Q +LI 
Sbjct: 468 NYAFGTNISSETTSGFEAALSAARKSDVVIFAGGIDNSVEAEGVDRETITWPGNQLQLIE 527

Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP-G 581
           +++E   KP++++ + GG VD +  K ++N+ S++W GYPG++G  A+ +++ G   P G
Sbjct: 528 QLSELG-KPLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPAILDILTGKRAPAG 586

Query: 582 EFTV 585
             TV
Sbjct: 587 RLTV 590


>gi|373952439|ref|ZP_09612399.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373889039|gb|EHQ24936.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 721

 Score =  346 bits (887), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 211/550 (38%), Positives = 298/550 (54%), Gaps = 49/550 (8%)

Query: 52  ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
           +S R +DL+SRLTL EK+S L   + A+PRL IPAY WW+E LHGVA  G+         
Sbjct: 37  LSTRVQDLISRLTLAEKVSLLGYRSQAVPRLNIPAYNWWNEGLHGVARAGE--------- 87

Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG-------QAIGMTFWAPNINIF 164
            AT FPQ I  AA+FD  L  ++   +  EARA YN         Q +G+TFW+PNINIF
Sbjct: 88  -ATIFPQAIAMAATFDDNLVKQVANVVSTEARAKYNLSTAMGRHLQYMGLTFWSPNINIF 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFTAYDLD 223
           RDPRWGRGQET GEDP +T K   +YV G+QG D  +       L+ SA  KHF A+   
Sbjct: 147 RDPRWGRGQETYGEDPFLTSKMGNAYVHGLQGTDPLH-------LKTSATAKHFVAHS-- 197

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
             +G   Y FDA V  +DL DTY   F+S V  G  S IM AYNRVNG+P+  ++ L++ 
Sbjct: 198 GPEGERDY-FDALVDEKDLRDTYLYAFKSLVDGGVES-IMTAYNRVNGVPNSINKTLVND 255

Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
              ++WGF G++ +DC A+  +Y       +  +     +KAG+D++C S  Q     A+
Sbjct: 256 IVIKEWGFKGHVVTDCGALDDVYKTHKVLPNRMEVAAAAIKAGVDLDCSSIFQTDIINAI 315

Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
             K L E ++D AL  + S + +LG F+  P+  PF   GAD + + +H +LA Q AQ  
Sbjct: 316 NNKLLTEKQVDAALAAVLSTQFKLGFFDA-PSSSPFYSFGADSIHNDSHVMLARQMAQKS 374

Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV 463
           +VLLKN   +LPL      S+ ++GPNA S   L+ +Y G S +++  ++ +   V+   
Sbjct: 375 MVLLKNDKQILPLKMQNYSSIMVVGPNAASLDALVASYHGVSSKAVNFVEGITAAVDKGT 434

Query: 464 ---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDL 511
              Y  G D    +       +  A  AD  V ++GL    E E           D+ DL
Sbjct: 435 RVEYDLGADYRDTTHFG---GIWGAGNADVTVAVIGLTPVLEGEAGDAFLSQTGGDKKDL 491

Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDI-TFAKYDRNIGSILWAGYPGEAGAVAL 570
            LP      +  + ++ KKP+I V+  G  VDI   A Y     +++ A YPGE G  AL
Sbjct: 492 SLPAGDIAFMKALRKSVKKPIIAVVTSGSDVDIAAIAPY---ADAVILAWYPGEQGGNAL 548

Query: 571 AEVIFGDHNP 580
           A+++FG  +P
Sbjct: 549 ADILFGKISP 558


>gi|339499234|ref|YP_004697269.1| beta-glucosidase [Spirochaeta caldaria DSM 7334]
 gi|338833583|gb|AEJ18761.1| Beta-glucosidase [Spirochaeta caldaria DSM 7334]
          Length = 699

 Score =  345 bits (885), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 216/542 (39%), Positives = 305/542 (56%), Gaps = 49/542 (9%)

Query: 59  LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
           L+S ++L+EKI  +++ A  IPRLGIP Y WW+EALHGVA  G+          AT FPQ
Sbjct: 15  LISNMSLEEKIGLMIHRAKGIPRLGIPDYNWWNEALHGVANNGE----------ATVFPQ 64

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNA-GQAI------GMTFWAPNINIFRDPRWGR 171
            I   A+FD  L +R+ +AI +EARA +NA G+        G+TFWAPNINIFRDPRWGR
Sbjct: 65  AIALGATFDEDLVHRVAEAISIEARAKFNAVGKEKAEQYHRGLTFWAPNINIFRDPRWGR 124

Query: 172 GQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
           GQET GEDP++T +   +YVRG+QG D +        L+A+AC KHF  +   +     R
Sbjct: 125 GQETYGEDPVLTSRLGTAYVRGLQGSDPY-------YLRAAACAKHFAVH---SGPEGLR 174

Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           + F+A V+ +DL +TY P F++ VK G  S +M AYNRVNG P+C    LL +  R +W 
Sbjct: 175 HTFNAEVSQKDLEETYLPAFKALVKSGVES-VMGAYNRVNGEPACGSTYLLKQKLREEWQ 233

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
           F G++ SDC A+   +          +++   L++G D+NCG     +   AV +  + E
Sbjct: 234 FQGHVVSDCWAICDFHKNHKVTNDILESIALALRSGCDLNCGDAYN-YLAEAVLKGYVTE 292

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
            +I+RA+  L     +LGL + +    P+  I    +    H  LAL+AA+  IVLLKN+
Sbjct: 293 DDINRAVVRLLITLDKLGLIHDD---GPYQGITIHQIDWKKHDSLALEAAEKSIVLLKNN 349

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYY-PG 467
            G+LPL K K   + + GPNA ++  LLGNYAG S R +T L+A+      E TV Y  G
Sbjct: 350 -GVLPLKKDKISYIYVTGPNATNSDALLGNYAGVSSRLLTVLEAIVEEAGPEITVTYKKG 408

Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV---------DLVLPGRQQ 518
           C          D A  + K AD  + +MG D + E EE D +         DL L   Q 
Sbjct: 409 CPLAERRVNPNDWASGVTKYADVTIAVMGRDTSVEGEEGDAILSSTYGDFEDLNLNDEQL 468

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
             + ++ E+  KP+I+VL+ G P  I   +      +IL A YPG+AG  A++ ++FG  
Sbjct: 469 SYLHKLKESG-KPLIVVLMGGAP--ICSPELHEIADAILVAWYPGQAGGTAVSNIVFGKT 525

Query: 579 NP 580
           NP
Sbjct: 526 NP 527


>gi|319641744|ref|ZP_07996426.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
 gi|317386631|gb|EFV67528.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
          Length = 702

 Score =  345 bits (884), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 210/554 (37%), Positives = 308/554 (55%), Gaps = 53/554 (9%)

Query: 55  RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
           R +DLV+RLTL+EK+  + + +PAIPRLGIPAY+WW+EALHGVA          T+   T
Sbjct: 2   RVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVAR---------TLEKVT 52

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYN----AGQA----IGMTFWAPNINIFRD 166
            FPQ I  AA+FD+    ++G     E RAL+N    AG+      G+T+W PNINIFRD
Sbjct: 53  VFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPNINIFRD 112

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRGQET GEDP +T K   + VRG++G+  +       L++ AC KH+  +    + 
Sbjct: 113 PRWGRGQETYGEDPYLTAKMGAAIVRGLEGEDPH------YLKSVACAKHYAVHSGPEY- 165

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R+ FDAR ++ DL DTY P F   V + +  G+MCAYNR+NG P C +  LL    R
Sbjct: 166 --NRHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGNDPLLVDILR 223

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPED--AVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            QW F GY+TSDC A+     AE +   PE   A+ D L AG D+ CG+ L       VK
Sbjct: 224 NQWHFDGYVTSDCWALKDF--AEFHKTHPEHTIAMSDALLAGTDLECGN-LYHLLAEGVK 280

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           +    E +I+ +L  LF++  ++G+F+    + P+  IG +V+   AH+  A + A++ I
Sbjct: 281 KGLHSERDINVSLSRLFTILFKIGMFDPAERV-PYSSIGREVLECEAHKQHAERMAKESI 339

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV- 463
           VLL+N + +LPL  SK  S+ALIGPNA++ +T L NY G     +TP  +L+  + + + 
Sbjct: 340 VLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMSLKRRLGDKIK 399

Query: 464 --YYPGCDTV--ACSSASIDKAVDIAKGADHVVLMMGLDQTQE-------------KEEL 506
             Y PG   V     + S  +    A  +D +V + G+    E                 
Sbjct: 400 INYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAGDAGAAGYGGFASG 459

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  + LP  Q EL+ ++ +   +P+I+V + G    ++F    +N  ++L A Y G+A 
Sbjct: 460 DRTTMQLPLVQIELLKKLKKTG-RPLIIVNMSGSV--MSFEWESQNADALLQAWYGGQAA 516

Query: 567 AVALAEVIFGDHNP 580
             A+ +V+FG  NP
Sbjct: 517 GDAIVDVLFGHCNP 530


>gi|325192664|emb|CCA27085.1| unnamed protein product [Albugo laibachii Nc14]
          Length = 2278

 Score =  345 bits (884), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 220/560 (39%), Positives = 303/560 (54%), Gaps = 43/560 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAP---AIPRLGIPAYEWWSEALHGVAG 99
           FPFC ++L +  R  DL+ RL LDEK+  L   A    +IPRLG+P Y W +  +HGV  
Sbjct: 34  FPFCNSSLSLDLRVEDLLQRLQLDEKVRMLTARASTHGSIPRLGVPEYNWGANCVHGVQS 93

Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL--------YNAGQA 151
                   GT   ATSFP  +   A FD    Y++ Q IG E RAL        Y  G  
Sbjct: 94  TC------GT-HCATSFPNPVNLGAIFDPNEIYKMAQVIGKELRALRLEGARENYARGPH 146

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
           IG+  W+PNINI RDPRWGR  ETP EDP V  KY V+Y +G+Q      G+    LQA 
Sbjct: 147 IGLDCWSPNININRDPRWGRAMETPSEDPYVNAKYGVAYTKGLQE-----GQDSRFLQAV 201

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
              KH+ AY  +N+ GT R +FDA V+  D ADTY P FE+ V  G+A GIMC+YN +NG
Sbjct: 202 VTLKHYLAYSYENYGGTDRTQFDAIVSAYDFADTYFPAFEASVVDGKAKGIMCSYNSLNG 261

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
           IP+CA++  L++  R    F GYITSD  A+  I+D   Y K+  +A    +++G+D+  
Sbjct: 262 IPTCANK-WLNQLLRDDLEFDGYITSDTGAIQGIFDGHKYTKTLCEATKIAMESGVDICS 320

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G+      K          S ID A+     +R +LGLF+       FG    DV  + +
Sbjct: 321 GNAYWNCLKQLANSTNFSAS-IDEAIRRTLKLRFQLGLFDAIGDQPHFGP--EDVRTAKS 377

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC----- 446
            Q L+L  A+  IVLL+N    LPL     + +A+IGP++ + + ++GNY G  C     
Sbjct: 378 LQ-LSLDLARKSIVLLQNHGNTLPL--RLGLRIAVIGPHSMTRRGIMGNYYGQLCHGDYD 434

Query: 447 --RSI-TPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQ 501
             R I +PL+A+Q  N   NT +  GC     S+A  D A+   + AD  VL +G+D + 
Sbjct: 435 EVRCIQSPLEAIQSVNGRNNTHHVNGCGINDTSTAEFDDALQAVRTADVAVLFLGIDISI 494

Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
           E+E  DR ++ +P  Q EL+ +    A KP ++VL  GG + I   K      S+L A Y
Sbjct: 495 ERESKDRDNIDVPHIQLELL-KAIRVAGKPTVVVLFNGGILGI--EKLILYADSVLEAFY 551

Query: 562 PGEAGAVALAEVIFGDHNPG 581
           PG  GA A+AE++FG  NP 
Sbjct: 552 PGFFGAQAIAEILFGSINPS 571


>gi|374313710|ref|YP_005060140.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358755720|gb|AEU39110.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 883

 Score =  344 bits (883), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 187/419 (44%), Positives = 254/419 (60%), Gaps = 31/419 (7%)

Query: 32  SCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWS 91
           S  P+   T   P+  TTLP  QRA DLV RLTLDEK +QLV SAP IPRLG+PAY++WS
Sbjct: 24  SPSPAGTRTPLLPYQDTTLPAEQRAADLVGRLTLDEKAAQLVTSAPGIPRLGVPAYDFWS 83

Query: 92  EALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
           E LHG+A  G           AT FPQ +  AA+FD  L ++IG+ I  EARA YN   A
Sbjct: 84  EGLHGIARSGY----------ATLFPQAVGMAATFDEPLLHQIGEVISTEARAKYNDAVA 133

Query: 152 -------IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
                   G+T W+PNINIFRDPRWGRGQET GEDP +T +   ++V G+QGD  N    
Sbjct: 134 HDLRSIFYGLTIWSPNINIFRDPRWGRGQETYGEDPFLTARLGTAFVEGLQGDDPN---- 189

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
               +A    KHF  +   +   + R++F+A  +  DL DTY P F + + +G+A  IMC
Sbjct: 190 --YYRAIGTPKHFAVH---SGPESERHRFNADPSPHDLWDTYLPAFRATIVEGKAGSIMC 244

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG--YAKSPEDAVVDV 322
           AYN + G P+CA   LL +  R+ W F G++TSDC A+   ++ +G  Y+K  E A VD 
Sbjct: 245 AYNAIEGKPACASDLLLDEVLRKDWAFKGFVTSDCGAIDNFFEKDGHHYSKDAEQASVDG 304

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           ++AG D NCG    ++  +AV++  + ESE+D  L  LF  R +LGLF+  P+   +  +
Sbjct: 305 IRAGTDTNCGG-TYRNLASAVRKGMIQESELDVPLRRLFLARFKLGLFD-PPSQVKYASM 362

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
                 S +H  LALQAA++ +VLLKN H  LPL  ++  ++A+IGPNA+S  +L GNY
Sbjct: 363 PITENMSSSHTELALQAAREAVVLLKNEHHTLPL-DARVKTIAVIGPNASSLISLEGNY 420



 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 45/111 (40%), Positives = 67/111 (60%), Gaps = 13/111 (11%)

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELD----------RVDLVLPGRQQELITRVAEAAK 529
           +A++  K AD VV  +GL    E EE+D          R DLVLP  QQ+L+   A+A+ 
Sbjct: 605 QAMEAVKQADAVVAFVGLSPELEGEEMDVHIPGFSGGDRTDLVLPAAQQQLL-EAAKASG 663

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           KP+++VLL G  + + +A+   +  +IL A YPG+AGA A+AE + G +NP
Sbjct: 664 KPLVVVLLNGSALAVNWAQ--EHADAILEAWYPGQAGAQAIAETLSGKNNP 712


>gi|402074909|gb|EJT70380.1| hypothetical protein GGTG_11406 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 793

 Score =  344 bits (883), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 217/558 (38%), Positives = 313/558 (56%), Gaps = 35/558 (6%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C  +L  S+RA  LV+ L + EK++ LV++A    R+G+P Y WWSEALHGVA    G  
Sbjct: 44  CDRSLSPSERAAALVAALNVTEKMANLVSNANGSARIGLPKYNWWSEALHGVA-YAPGTQ 102

Query: 106 FN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           F    G    +TSFP  +L AASFD  L  +IG  IG E+RA  N G+  G+ +W PN+N
Sbjct: 103 FRRGPGDFNSSTSFPMPLLLAASFDDSLIEKIGDVIGTESRAFGN-GRWSGLDYWTPNVN 161

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
            F+DPRWGRG ETPGED L   +YA S ++G++G        + + +  + CKH+ A D 
Sbjct: 162 PFKDPRWGRGSETPGEDILRIKRYAASMIKGLEGP-----HPEKERRVVSTCKHYAANDF 216

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
           ++W GT+R+ FDAR++ QDLA+ Y  PF+ C +  R   IMCAYN VNG+PSCA+  LL 
Sbjct: 217 EDWNGTSRHDFDARISAQDLAEYYLMPFQQCARDSRVGSIMCAYNAVNGVPSCANSYLLD 276

Query: 283 KTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
              R+ WG+ G   Y+TSDC+AV  +     YA++  +      +AG D +C        
Sbjct: 277 TVLRKHWGWTGHNNYVTSDCEAVLDVSAGHKYARTNAEGTAMCFEAGTDTSCEYTPSSDI 336

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
           + A  Q  L E  +DRAL  L+   +R+G F+GN +   F  I    V +PA Q L+LQ+
Sbjct: 337 RGAYAQGLLREETMDRALLRLYEGLVRVGYFDGNSSA--FSDISWADVNAPAAQDLSLQS 394

Query: 400 AQDGIVLLKNSHGLLPLPKSKSV-------------SLALIGPNANSAKTLLGNYAGPSC 446
           A +GIV+LKN  G LPLP                   LA+IG  A++ + L G Y+G + 
Sbjct: 395 AVEGIVMLKND-GTLPLPLGAKCSSKSKKRSSSGGPKLAMIGFWADAPEKLRGGYSGTAA 453

Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSASIDK----AVDIAKGADHVVLMMGLDQTQE 502
              TP  A +    + V   G      ++A+ D     A+  A+GAD++V   GLD+T  
Sbjct: 454 YLRTPAYAARQMGLDVVTAGGPVLQGAAAAAADNWTAPALAAAEGADYIVYFGGLDETAA 513

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E  DR D+  PG Q  L+ R+A A  KP+++V + G  +D T    +  +G++LWA +P
Sbjct: 514 GENKDRWDVEWPGAQLALVKRLA-ALGKPLVVVQM-GDQLDGTPLLANAGVGAVLWASWP 571

Query: 563 GEAGAVALAEVIFGDHNP 580
           G+ G  A+  ++ G  +P
Sbjct: 572 GQDGGPAVMRLLSGAASP 589


>gi|429738050|ref|ZP_19271875.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
           F0055]
 gi|429161155|gb|EKY03583.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
           F0055]
          Length = 722

 Score =  344 bits (882), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 206/545 (37%), Positives = 306/545 (56%), Gaps = 44/545 (8%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           Q+A+ ++S+LTLDEKISQL   A  I RLGI  Y W +EALHGV   G+          A
Sbjct: 33  QKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------A 82

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIGMTFWAPNINIFRD 166
           T FPQ I   A+FD  + ++IG AI  E RA +       N     G+TFWAPN+NIFRD
Sbjct: 83  TVFPQPINLGATFDPKIVHQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRD 142

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +TG    ++V+G+QGD          L+A+AC KHF  +   +  
Sbjct: 143 PRWGRGMETYGEDPFLTGTLGTAFVKGMQGDD------PFYLKAAACGKHFAVH---SGP 193

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
             TR+  +   T +DL +TY P F+  V++G+   IM AY R+ G      + LL+   R
Sbjct: 194 ERTRHTANVEPTKRDLYETYLPAFKMLVQKGKVESIMGAYQRLYGESCSGSKYLLTDILR 253

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
           + WGF G++ SDC AV+ +Y+     KS  +AV   +KAG+++ CG+ + +  K A++QK
Sbjct: 254 KDWGFKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFAIKAGLNLECGNSM-RTMKDAIQQK 312

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            + E ++D+AL  L   R++LG+   +    P+ +    V+ S A++ +A QAA++ +VL
Sbjct: 313 LITEKDLDKALLPLMMTRLKLGILQPDAAC-PYNEFPESVIGSEANRKIAEQAAEESMVL 371

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
           LKN +G+LP+ K    +L + GP A  A  L+GNY G S R  T L+ +   V N     
Sbjct: 372 LKN-NGVLPIAKDIR-TLFVTGPGATDAYYLMGNYFGLSNRYSTYLEGIVGKVSNGTSVN 429

Query: 464 YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDLVLP 514
           Y  G   V  +   ++ +V  ++GA+  +L+MG     E EE          DRV+L LP
Sbjct: 430 YKQGFMQVFKNLNDVNWSVSESRGAEVSILIMGNSGNTEGEEGDAIASAERGDRVNLRLP 489

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
             Q E +  V++     +++VL  G P+D+   +      +++ A YPG+ G VALA ++
Sbjct: 490 DSQMEYLREVSKDRTNKLVVVLTGGSPIDV--KEITELADAVVMAWYPGQEGGVALANLL 547

Query: 575 FGDHN 579
           FGD N
Sbjct: 548 FGDAN 552


>gi|449531013|ref|XP_004172482.1| PREDICTED: beta-D-xylosidase 1-like, partial [Cucumis sativus]
          Length = 534

 Score =  343 bits (881), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 165/341 (48%), Positives = 239/341 (70%), Gaps = 2/341 (0%)

Query: 242 LADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDA 301
           L DTY  PF++CV +G+ + +MC+YN+VNG P+CAD +LL  T R  WG  GYI SDCD+
Sbjct: 1   LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDS 60

Query: 302 VSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLF 361
           V ++YD++ +  +PE+A    +KAG+D++CG FL  HT  AV +  L E +++ AL NL 
Sbjct: 61  VGVLYDSQHFTPTPEEAAASTIKAGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANLL 120

Query: 362 SVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS 421
           SV+MRLG+F+G P  QP+G +G   VC+PAH+ LAL+AA+ GIVLL+N  G LPL  ++ 
Sbjct: 121 SVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTRH 180

Query: 422 VSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDK 480
            ++A+IGPN+++  T++GNYAG +C   TP+Q +  YV+ T++  GC  VAC     I +
Sbjct: 181 RTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYVK-TIHAKGCANVACVGDQLIGE 239

Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
           A   A+ AD  V+++GLDQ+ E E  DR  ++LPG+Q+EL+ R+  A K P ++VL+ GG
Sbjct: 240 AEAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSGG 299

Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           P+D++FAK D  I  ILW GYPG+AG  A+A+V+FG  NPG
Sbjct: 300 PIDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPG 340


>gi|224068504|ref|XP_002302759.1| predicted protein [Populus trichocarpa]
 gi|222844485|gb|EEE82032.1| predicted protein [Populus trichocarpa]
          Length = 273

 Score =  342 bits (876), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 155/255 (60%), Positives = 194/255 (76%), Gaps = 7/255 (2%)

Query: 30  PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           PF+CDP + ++  FPFC+  LPI  R  DL+ R+TL EK+  LVN A A+PRLGI  YEW
Sbjct: 26  PFACDPEDGTSRNFPFCQVKLPIQSRVSDLIGRMTLQEKVGLLVNDAAAVPRLGIKGYEW 85

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSEALHGV+ VG G  F G   GATSFPQVI TAASF++ LW  IG+ +  EARA++N G
Sbjct: 86  WSEALHGVSNVGPGTQFGGAFPGATSFPQVITTAASFNATLWEAIGRVVSDEARAMFNGG 145

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            A G+T+W+PN+NIFRDPRWGRGQETPGEDP+V GKYA SYVRG+QG+  +G +LK    
Sbjct: 146 VA-GLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGN--DGDRLK---- 198

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
            +ACCKHFTAYDLDNW G  R+ F+A+V+ QD+ DT+  PF  CVK+G+ + +MC+YN+V
Sbjct: 199 VAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDMEDTFDVPFRMCVKEGKVASVMCSYNQV 258

Query: 270 NGIPSCADRNLLSKT 284
           NGIP+CAD  LL KT
Sbjct: 259 NGIPTCADPKLLKKT 273


>gi|373852136|ref|ZP_09594936.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
 gi|372474365|gb|EHP34375.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
          Length = 740

 Score =  342 bits (876), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 217/564 (38%), Positives = 303/564 (53%), Gaps = 52/564 (9%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           PF    L +  R RDLVSRLTL EK+SQ+ ++A AIPRLGIPAY +W+E LHGVA  G+ 
Sbjct: 22  PFRDPDLALDHRVRDLVSRLTLAEKVSQMEHAAAAIPRLGIPAYNYWNECLHGVARNGR- 80

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG----------QAIG 153
                    AT FPQ+I  AA++D+ L YR+  AI  EARA ++A           Q  G
Sbjct: 81  ---------ATVFPQIIGLAATWDTDLVYRVATAISDEARAKHHAALARQGFAQTQQYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PNIN+FRDPRWGRGQET GEDP +T + A ++VRG+QGDT +       L+ +AC
Sbjct: 132 LTFWTPNINLFRDPRWGRGQETWGEDPHLTARLAAAFVRGLQGDTPDT-----HLKLAAC 186

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH   Y + +     R+ F+ARVT  DL D+Y P FE  V+  R   +M AYNR    P
Sbjct: 187 AKH---YAVHSGPENERHTFNARVTPHDLWDSYLPAFEHLVRHARVESVMGAYNRTLDEP 243

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
            CA + LL    R +WGF G++ SDC A+  I++       P ++    L  G D+ CG+
Sbjct: 244 CCASQFLLLDILRERWGFEGHVVSDCWALRDIHETHRITTDPVESAALALTKGCDLACGT 303

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF-----NGNPTMQPFGKIGADVVC 388
             +     AV++  + E++IDRAL      R +LG+F     N NP   P     A V C
Sbjct: 304 TFEL-LGEAVQRGLITEADIDRALSRHLRARFKLGMFDPADDNRNPWSNPPAP-EAIVTC 361

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           + AH  LA +AA    VLL+N + +LPL +    S+ + GP A +   LLGNY G   R+
Sbjct: 362 A-AHTALACEAAVASCVLLQNHNHILPL-RPDVRSIYITGPLAATQDALLGNYYGLPPRA 419

Query: 449 ITPLQALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
           IT L  L   +   +   Y PG         +++ A       D  +  +GL    E EE
Sbjct: 420 ITLLDGLAAALPEGIRADYRPGALLSTPKQNALEWAEFDCASCDVTIACLGLTALLEGEE 479

Query: 506 ---------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
                     DR D+ LP  Q+  +  + +   +  ++V+L GG   ++       + +I
Sbjct: 480 GEAIASSLHGDRDDISLPPPQRLFLESLIQRGAR--VIVILFGGSA-LSLGPLADKVEAI 536

Query: 557 LWAGYPGEAGAVALAEVIFGDHNP 580
           LWAGYPG+ G  ALA+++ G  +P
Sbjct: 537 LWAGYPGQEGGRALADILLGRASP 560


>gi|333995841|ref|YP_004528454.1| beta-glucosidase [Treponema azotonutricium ZAS-9]
 gi|333737309|gb|AEF83258.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Treponema
           azotonutricium ZAS-9]
          Length = 706

 Score =  342 bits (876), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 211/557 (37%), Positives = 297/557 (53%), Gaps = 59/557 (10%)

Query: 55  RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
           R ++++S++TL+EK+SQL   APA+   GIP Y WW+E LHGVA  G           AT
Sbjct: 6   RIKEMISKMTLEEKVSQLSYDAPAVESAGIPKYNWWNECLHGVARAGL----------AT 55

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG-------QAIGMTFWAPNINIFRDP 167
            FPQ I  AA+FD      +  AI  E RA YN         Q  G+TFW PN+NIFRDP
Sbjct: 56  VFPQAIALAATFDEAFIRSVADAISDEGRAKYNEAVKRGNRSQYYGLTFWTPNVNIFRDP 115

Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           RWGRGQET GEDP +TG+  +++++G+QGD          L+ +AC KH+  +   +   
Sbjct: 116 RWGRGQETYGEDPYLTGRIGLAFMKGLQGDDTE------HLKVAACAKHYAVH---SGPE 166

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R+ FDA V+ +DL +TY P F+  V+ G    +M AYNR  G P      LL +  R 
Sbjct: 167 KLRHTFDAVVSKKDLFETYLPAFKLLVENG-VEAVMGAYNRTLGEPCGGSTYLLKEILRG 225

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
           +WGF G++TSDC A+   ++     KSPE++    L AG D+NCG     +   + K+  
Sbjct: 226 RWGFKGHVTSDCWAIRDFHENHKVTKSPEESAAMALNAGCDLNCGC-TYPYLTVSHKKGL 284

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
           + +  ID AL  L   R +LGLF+  P   P+  +G D+V    H+ LAL+AAQ  IVLL
Sbjct: 285 VTDETIDTALTRLLRTRFKLGLFD-PPEQDPYRNLGNDIVGCEKHRNLALEAAQKSIVLL 343

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE------- 460
           KN   +LPL  S +  + L+GP A +  TLL NY G S R +T L+ L   ++       
Sbjct: 344 KNDSNILPLDDS-ARKILLMGPGAANILTLLANYYGMSSRLVTILEGLAEKIKTKTAISF 402

Query: 461 -----NTVYYPG-CDTVACSSASIDKAVDI--AKGADHVVLMMGLDQTQEKEE------- 505
                + +Y P     V   S  +D    I      D V+ + GLD + E EE       
Sbjct: 403 EYRQGSLMYEPNHLSNVPFGSTGVDAEAPIYGLDEIDLVIAVYGLDGSMEGEEGDSIASD 462

Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
              DR  + LP  Q   + R+ +A KK V+L+L  G P+       D    ++L+A YPG
Sbjct: 463 ANGDRDTIELPSWQLNFLRRIRKAGKK-VVLILTGGSPIAFPEDLAD----AVLFAWYPG 517

Query: 564 EAGAVALAEVIFGDHNP 580
           E G  A+A+++FGD +P
Sbjct: 518 EQGGNAVADILFGDVSP 534


>gi|389736853|ref|ZP_10190363.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
 gi|388438821|gb|EIL95541.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
          Length = 868

 Score =  342 bits (876), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 192/413 (46%), Positives = 252/413 (61%), Gaps = 38/413 (9%)

Query: 55  RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
           RA  LV+++TL EK++Q+ N APAIPRLG+PAY+WWSE LHG+A  G           AT
Sbjct: 32  RAVALVAKMTLPEKVAQMQNDAPAIPRLGVPAYDWWSEGLHGIARNGY----------AT 81

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDP 167
            FPQ I  AAS+D+ L + +G  I  EARA +NA       G   G+T W+PNINIFRDP
Sbjct: 82  VFPQAIGLAASWDTSLLHAVGTVISTEARAKFNASGSGRAHGLFQGLTLWSPNINIFRDP 141

Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           RWGRGQET GEDP +TG+ AV++VRG+QGD           +A A  KHF A+   +   
Sbjct: 142 RWGRGQETYGEDPYLTGQLAVAFVRGIQGDDPQ------HPRAIATPKHFVAH---SGPE 192

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R  FD  V+  DL DTY P F + V  G A  +MCAYN ++G P+CA+  LL    R+
Sbjct: 193 AGRDSFDVDVSPHDLEDTYLPAFRTAVVDGHAGSVMCAYNALHGTPACANAGLLDTRLRK 252

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA----AV 343
            WGF GY+ SDCDAV  I     +      A V  ++AG D++CG     HT A    AV
Sbjct: 253 DWGFAGYVVSDCDAVGDIASYHYFKPDDVQASVAAVQAGTDLDCG-----HTYASLAQAV 307

Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
           +Q  + ES +D +L  LF+ R RLG   G+    P+ +IGAD + SPAH+ LALQAA + 
Sbjct: 308 RQGDIAESALDASLVRLFTARYRLGEL-GSRGNDPYARIGADQIDSPAHRKLALQAALES 366

Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           +VLLKN+H  LPL     + LA+IGP+A++ +TL  NY G +   +TPLQ L+
Sbjct: 367 LVLLKNAHSTLPL--HAGMRLAVIGPDADALETLEANYHGTARHPVTPLQGLR 417



 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 44/104 (42%), Positives = 63/104 (60%), Gaps = 13/104 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           AD VV  +GL    E E+L          DR D+ LP  Q+ L+ R A A+ KP+I+VLL
Sbjct: 598 ADAVVAFIGLSPDVEGEQLRIDVPGFDGGDRTDIGLPAPQRALLER-ARASGKPLIVVLL 656

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            G  V + +A+  ++  +IL A YPG+AG  A+A+V+ GD+NPG
Sbjct: 657 SGSAVALDWAQ--QHADAILAAWYPGQAGGTAIAQVLAGDYNPG 698


>gi|90021134|ref|YP_526961.1| Beta-glucosidase [Saccharophagus degradans 2-40]
 gi|89950734|gb|ABD80749.1| b-xylosidase-like protein [Saccharophagus degradans 2-40]
          Length = 893

 Score =  341 bits (875), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 193/470 (41%), Positives = 270/470 (57%), Gaps = 41/470 (8%)

Query: 21  LTRVDSTQPPFSCDPSN--PSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPA 78
           LT    +  P +  P +   ST T+PF   +L +  R  DLVSRLT  EKI+Q+ N  PA
Sbjct: 19  LTACSKSAAPNANQPQDTAASTATYPFRDASLSVDARVDDLVSRLTTTEKIAQMFNDTPA 78

Query: 79  IPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAI 138
           I RLGIPAY WW+E+LHGVA  GK          AT +PQ I  A++FD  L  R+  +I
Sbjct: 79  IERLGIPAYNWWNESLHGVARAGK----------ATVYPQAIGLASTFDEDLMLRVATSI 128

Query: 139 GLEARALYNAGQAI-------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
             E RA Y+   +        G+TFW+PNINIFRDPRWGRGQET GEDP +TG+ A+++V
Sbjct: 129 SDEGRAKYHDFLSKDVRTIYGGLTFWSPNINIFRDPRWGRGQETYGEDPFLTGRMAINFV 188

Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFE 251
           +G+QG+  N   LK    A A  KH+  +   +    TR+  D   T +DL +TY P F 
Sbjct: 189 KGIQGENDNSDYLK----AVATIKHYAVH---SGPEKTRHSDDYHPTRKDLFETYLPAFR 241

Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
             + +     +MCAYNRV+G P+C +  L+ +  R   GF+GY+ SDC A++  Y++  +
Sbjct: 242 MAIAETNVQSLMCAYNRVDGAPACGNNELMQEILRGDMGFNGYVVSDCGAIADFYESRSH 301

Query: 312 --AKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDRALHNLFSVR 364
               SP +A    +K+G D+NCG     ++   H   A++Q  + E  ID A+  LF  R
Sbjct: 302 HVVDSPAEAAAWAVKSGTDLNCGDSHGNTYTNLHY--ALQQGLITEDYIDIAVKRLFKAR 359

Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSL 424
           ++LG+F+    + P+ +IG DVV SP H  L  +AA+  IVLLKN+ G+LPL     V +
Sbjct: 360 IKLGMFDEQDRV-PYSEIGMDVVGSPKHLALTQEAAEKSIVLLKNN-GVLPL--KAGVKV 415

Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCDTVA 472
           A+IGPNA     L+GNY G   + + PL+ + N V   N  Y PG   +A
Sbjct: 416 AVIGPNAVDEDVLVGNYHGVPVKPVLPLEGIVNRVGEANVFYAPGSAQIA 465



 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 60/112 (53%), Gaps = 13/112 (11%)

Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAA 528
           ++A+  A+ AD ++ M G+D   E EE+          DR  + LP  Q  L+ ++ +A 
Sbjct: 618 EEALAAARKADVIIFMGGIDAHLEGEEMPLELDGFTHGDRTHINLPKVQTNLLKQL-KAT 676

Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KPV++V   G  + + +      + +IL A YPGEA   ALA +++GD +P
Sbjct: 677 GKPVVMVNFSGSAMALNWES--EKLDAILQAFYPGEATGTALANILWGDVSP 726


>gi|62321271|dbj|BAD94481.1| beta-xylosidase [Arabidopsis thaliana]
          Length = 523

 Score =  341 bits (874), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 171/329 (51%), Positives = 225/329 (68%), Gaps = 2/329 (0%)

Query: 252 SCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY 311
           S V  G  + +MC+YN+VNG P+CAD +LLS   R +W  +GYI SDCD+V ++Y  + Y
Sbjct: 1   SYVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHY 60

Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
            K+P +A    + AG+D+NCGSFL +HT+ AVK   + E+ ID+A+ N F   MRLG F+
Sbjct: 61  TKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFD 120

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNA 431
           GNP  Q +G +G   VC+ A+Q LA  AA+ GIVLLKN+ G LPL      +LA+IGPNA
Sbjct: 121 GNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKNT-GCLPLSPKSIKTLAVIGPNA 179

Query: 432 NSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHV 491
           N  KT++GNY G  C+  TPLQ L   V +T Y PGC  VAC+ A +  A  +A  AD  
Sbjct: 180 NVTKTMIGNYEGTPCKYTTPLQGLAGTV-STTYLPGCSNVACAVADVAGATKLAATADVS 238

Query: 492 VLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
           VL++G DQ+ E E  DRVDL LPG+QQEL+ +VA+AAK PV+LV++ GG  DITFAK D 
Sbjct: 239 VLVIGADQSIEAESRDRVDLRLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDP 298

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            I  ILW GYPGEAG +A+A++IFG +NP
Sbjct: 299 KIAGILWVGYPGEAGGIAIADIIFGRYNP 327


>gi|317057539|ref|YP_004106006.1| glycoside hydrolase family protein [Ruminococcus albus 7]
 gi|315449808|gb|ADU23372.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7]
          Length = 691

 Score =  340 bits (873), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 216/560 (38%), Positives = 307/560 (54%), Gaps = 57/560 (10%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   TL   +RA  L   +T +E+ SQL   APA+ RLGIPAY WW+E +HG+A  G   
Sbjct: 4   YLDETLSAQERAEALTDEMTTEEQASQLRYDAPAVERLGIPAYNWWNEGIHGLARSGV-- 61

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
                   AT FPQ I  AA FD  L  +  +    EARA YNA           G+T W
Sbjct: 62  --------ATMFPQAIGLAAMFDDELTKKTAEVTSEEARAKYNAYSGEEDRDIYKGLTLW 113

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           APNINIFRDPRWGRG ET GEDP +T K  ++ VRG+QGD    GK+   ++A+AC KHF
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTTKNGMAVVRGLQGD----GKV---IKAAACAKHF 166

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +     R+ FDA+   +D+ +TY P FE+ VK+ +   +M AYNRVNG P+CA 
Sbjct: 167 AVH---SGPEAIRHSFDAKANAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACAS 223

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
             L+ K   ++W F GY  SDC A+   ++      +  ++    LKAG DVNCG   Q 
Sbjct: 224 NFLMDKL--KEWEFDGYFVSDCWAIRDFHENHMVTANAIESTAMALKAGCDVNCGCTYQ- 280

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
           +   A+++  + + +I  A  +L   R+RLG+F+       +  I  D V    H+ ++L
Sbjct: 281 NLLVALEKGAVTKEDIRTACVHLMRTRIRLGMFDKK---TEYDDIPYDKVACKEHKAISL 337

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           + A+  +V+L+N +G+LP+  SK  ++A+IGPNA+S   L GNY G S R  T L  +Q+
Sbjct: 338 ECAEKSLVMLEN-NGILPVDTSKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQD 396

Query: 458 YVENTVYY-PGCDTVACSSASIDKAVD-------IAKGADHVVLMMGLDQTQEKEE---- 505
             +  V +  GC       +++ +A D        AK AD  +L +GLD T E EE    
Sbjct: 397 RFDGRVIFAEGCHLYKDRVSNLAQAGDRYAEAVAAAKFADMTILCLGLDATIEGEEGDTG 456

Query: 506 -----LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
                 D+  L LP  Q+EL+ ++  A  KPV+ V +C G    T +K D    +++ A 
Sbjct: 457 NEFSSGDKNGLTLPPPQRELVKKIM-AVGKPVVTV-VCAGSAINTESKPD----ALIHAF 510

Query: 561 YPGEAGAVALAEVIFGDHNP 580
           YPG  G  ALAEV+FGD +P
Sbjct: 511 YPGAEGGKALAEVLFGDVSP 530


>gi|289668505|ref|ZP_06489580.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 902

 Score =  340 bits (872), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 192/440 (43%), Positives = 257/440 (58%), Gaps = 28/440 (6%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+  T     QRA DLVSR+TL+EK +Q+ N+APAIPRLG+ AY+WW+EALHGVA  G  
Sbjct: 34  PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
                   GAT FPQ I  AA+FD  L + +  AI  EARA ++         +  G+TF
Sbjct: 92  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 143

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG---DTFNGGKLKGKLQASAC 213
           W+PNINIFRDPRWGRGQET GEDP +T +  V++VRG+QG   D     + +   +  A 
Sbjct: 144 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRKLDAT 203

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KHF  +   +     R+ FDAR + +DL +TY P FE+ VK G+   +M AYNRV G  
Sbjct: 204 AKHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 260

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
           + A + LL    R+QWGF GY+ SDC A+  I+       + E A    +K G ++ CG 
Sbjct: 261 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGE 320

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
                  AAV Q  + E++ID +L  L + RMRLG+F+  P   P+ KI A V  SPAH 
Sbjct: 321 EYST-LPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPAHD 378

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            LA + A++ +VLLKN  GLLPL ++K   +A+IGP A+    LLGNY G     +T LQ
Sbjct: 379 ALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 437

Query: 454 ALQNYVEN--TVYYPGCDTV 471
            ++    N   +Y  G D V
Sbjct: 438 GIRAAAPNAQVLYARGADLV 457



 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 78/148 (52%), Gaps = 21/148 (14%)

Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
           LQA + Y     Y+ G    A   A         + +A+D+A+ A+ VV + GL    E 
Sbjct: 592 LQAGRTYDLRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSAEVVVFVGGLTGDVEG 651

Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           EE+          DR DL LP  Q+EL+  + +A  KPV+ VL  G  + I +A+  +++
Sbjct: 652 EEMKVNYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVAVLTAGSALAIDWAQ--QHV 708

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 709 PAILLAWYPGQRGGTAVADTLFGDANPG 736


>gi|367053033|ref|XP_003656895.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
 gi|347004160|gb|AEO70559.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
          Length = 758

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 219/555 (39%), Positives = 305/555 (54%), Gaps = 36/555 (6%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLV------NSAPAIPRLGIPAYEWWSEALHGVAG 99
           C TT    +RA  LV  + + EK++ LV      +S+   PRLG+P YEWWSEALHGVA 
Sbjct: 11  CDTTASPPKRAAALVEAMNITEKLANLVEYVMARSSSKGAPRLGLPPYEWWSEALHGVA- 69

Query: 100 VGKGIFFN---GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTF 156
              G+ FN   G    ATSF   I  +A+FD  L  ++   I  EARA  NAG A G+ F
Sbjct: 70  ASPGVSFNWSGGPFSYATSFANPITLSAAFDDELVQKVADVISTEARAFANAGSA-GLDF 128

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W PNIN +RDPRWGRG ETPGEDP+    Y  S +RG++G+       +   +  A CKH
Sbjct: 129 WTPNINPWRDPRWGRGSETPGEDPVRIKGYVRSLLRGLEGE-------ESIKKVIATCKH 181

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           + AYDL+ W   TRY+FDA V++QDL++ Y PPF+ C +  +   IMC+YN +NG P+CA
Sbjct: 182 YAAYDLERWHNITRYEFDAIVSLQDLSEYYLPPFQQCARDSKVGSIMCSYNSLNGTPACA 241

Query: 277 DRNLLSKTARRQWGF---HGYITSDCDAV-SIIYDAEGYAKSPEDAVVDVLKAGMDVNC- 331
           +  L+    R+ W +   + YITSDC+A+   + D   + ++  +A      AG D  C 
Sbjct: 242 NTYLMDDILRKHWRWTEDNNYITSDCNAIKDFLPDEHNFTQTAAEAAAAAYTAGTDTVCE 301

Query: 332 --GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
             GS        A  QK L E  IDRAL  L+   +R G F+   +  P+  IG   V +
Sbjct: 302 VAGSPPYTDVVGAYDQKLLSEEVIDRALRRLYEGLVRAGYFDPA-SASPYRDIGWSDVNT 360

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
              Q LALQ+A DG+VLLKN  G LP+ K +  ++ALIG  A+  +++LG Y+G      
Sbjct: 361 AEAQALALQSASDGLVLLKND-GTLPI-KLEGKTVALIGHWASGTRSMLGGYSGIPPYYH 418

Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASID----KAVDIAKGADHVVLMMGLDQTQEKEE 505
           +P+ A      N  Y      VA +SA+ D     A+  A  +D ++   GLDQ+   E+
Sbjct: 419 SPVYAAGQL--NLTYKYASGPVAPASAARDTWTADALSAANKSDVILYFGGLDQSVASED 476

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            DR  +  P  Q  LI  +A    KP++++ L G  VD T    + N+ +ILWAGYPG++
Sbjct: 477 KDRDSIAWPPAQLTLIQTLA-GLGKPLVVIQL-GDQVDDTPLLTNPNVSAILWAGYPGQS 534

Query: 566 GAVALAEVIFGDHNP 580
           G  A+   I G   P
Sbjct: 535 GGTAVLNAITGVSPP 549


>gi|440472411|gb|ELQ41274.1| beta-xylosidase [Magnaporthe oryzae Y34]
 gi|440484691|gb|ELQ64724.1| beta-xylosidase [Magnaporthe oryzae P131]
          Length = 792

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 212/571 (37%), Positives = 307/571 (53%), Gaps = 37/571 (6%)

Query: 34  DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
           D  N    T   C      ++RA  LV  + LDEK+  LVN +P  PR+G+PAYEWWSEA
Sbjct: 29  DCENGPLSTNIVCDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEA 88

Query: 94  LHGVAGVGKGIFFNGT----IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           LHGVA    G+ FN +       ATSF   I+ +A+FD  L   +   I  EARA  NAG
Sbjct: 89  LHGVAK-SPGVTFNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAG 147

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            A G+ +W PNIN ++DPRWGRG ETPGED L   KY  + +RG++G      K+     
Sbjct: 148 LA-GLDWWTPNINPYKDPRWGRGMETPGEDALRISKYVKALLRGLEGSDPTTRKM----- 201

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
             A CKH+ A DL+ W G TRY FDA VT+QDL++ Y P F+ C +       MCAYN +
Sbjct: 202 -VANCKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAM 260

Query: 270 ---------NGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPED 317
                    NG P CA + L++   R  WG+   + +ITSDC+AV  +++   ++ + E+
Sbjct: 261 SIKGKDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHWSDTREE 320

Query: 318 AVVDVLKAGMDVNC--GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
           A      AG D  C   ++ +   K A  +  L E  +DRAL  L+   +R G F+G   
Sbjct: 321 AAGSAYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDA 380

Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP----KSKSVSLALIGPNA 431
             P+  I    V +P  + LA ++A +G+VL KN +G+LP+     + K  ++ALIG   
Sbjct: 381 --PYRNITWADVNTPEARKLAHRSAVEGMVLTKN-NGVLPIKLEELQKKGKTVALIGNWV 437

Query: 432 NSAKTLLGNYAGPSCRSITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
           ++ + +LG Y+G +    TPL A +  N    T   P   +     +    A++ A  AD
Sbjct: 438 DNGEQMLGTYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQAD 497

Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
            V+   G+D + E E+ DR  L  P  Q +L++ ++ A  KP ++V L G  +D T    
Sbjct: 498 VVLYFGGIDLSVEAEDRDRYSLAWPSAQAKLLSDIS-ALGKPTVVVQL-GTMLDDTALLD 555

Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           ++NI +I+WAGYPG+ G  A  ++I G   P
Sbjct: 556 NKNISAIIWAGYPGQDGGTAAFDIITGKTAP 586


>gi|289666226|ref|ZP_06487807.1| beta-glucosidase precursor [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 902

 Score =  339 bits (870), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 192/440 (43%), Positives = 257/440 (58%), Gaps = 28/440 (6%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+  T     QRA DLVSR+TL+EK +Q+ N+APAIPRLG+ AY+WW+EALHGVA  G  
Sbjct: 34  PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
                   GAT FPQ I  AA+FD  L + +  AI  EARA ++         +  G+TF
Sbjct: 92  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 143

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG---DTFNGGKLKGKLQASAC 213
           W+PNINIFRDPRWGRGQET GEDP +T +  V++VRG+QG   D     + +   +  A 
Sbjct: 144 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRKLDAT 203

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KHF  +   +     R+ FDAR + +DL +TY P FE+ VK G+   +M AYNRV G  
Sbjct: 204 AKHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 260

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
           + A + LL    R+QWGF GY+ SDC A+  I+       + E A    +K G ++ CG 
Sbjct: 261 ASASKFLLQDLLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGE 320

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
                  AAV Q  + E++ID +L  L + RMRLG+F+  P   P+ KI A V  SPAH 
Sbjct: 321 EYST-LPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPAHD 378

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            LA + A++ +VLLKN  GLLPL ++K   +A+IGP A+    LLGNY G     +T LQ
Sbjct: 379 ALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 437

Query: 454 ALQNYVEN--TVYYPGCDTV 471
            ++    N   +Y  G D V
Sbjct: 438 GIRAAAPNAQVLYARGADLV 457



 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 78/148 (52%), Gaps = 21/148 (14%)

Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
           LQA + Y     Y+ G    A   A         + +A+D+A+ A+ VV + GL    E 
Sbjct: 592 LQAGRTYDLRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSAEVVVFVGGLTGDVEG 651

Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           EE+          DR DL LP  Q+EL+  + +A  KPV+ VL  G  + I +A+  +++
Sbjct: 652 EEMKVNYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVAVLTAGSALAIDWAQ--QHV 708

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 709 PAILLAWYPGQRGGTAVADTLFGDANPG 736


>gi|58584046|ref|YP_203062.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84625823|ref|YP_453195.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|58428640|gb|AAW77677.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84369763|dbj|BAE70921.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 904

 Score =  339 bits (870), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 193/442 (43%), Positives = 261/442 (59%), Gaps = 32/442 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+ +T     QRA DLVSR+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA  G  
Sbjct: 36  PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
                   GAT FPQ I  AA+FD  L + +  AI  EARA ++         +  G+TF
Sbjct: 94  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTF 145

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQAS 211
           W+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+  +      G+   KL A+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDAT 205

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           A  KHF  +   +     R+ FDAR + +DL +TY P FE+ VK G+   +M AYNRV G
Sbjct: 206 A--KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYG 260

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
             + A + LL    R+QWGF GY+ SDC A+  ++       + E A    +  G ++ C
Sbjct: 261 ESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELEC 320

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G        AAV Q  + E++ID AL  L + RMRLG+F+  P   P+ KI A V  SPA
Sbjct: 321 GEEYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPA 378

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H  LA + A++ +VLLKN  GLLPL ++    +A+IGP A+    LLGNY G     +T 
Sbjct: 379 HDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTV 437

Query: 452 LQALQNYVEN--TVYYPGCDTV 471
           LQ ++    N   +Y  G D V
Sbjct: 438 LQGIRAAAPNAQVLYARGADLV 459



 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 79/148 (53%), Gaps = 21/148 (14%)

Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
           LQA ++Y     Y+ G    A   A         + +A+D+A+ AD VV + GL    E 
Sbjct: 594 LQAGRSYDLRLDYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 653

Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           EE+          DR DL LP  Q+EL+  + +A  KPV+ VL  G  + + +A+  +++
Sbjct: 654 EEMKVSYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVAVLTAGSALAVDWAQ--QHV 710

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 711 PAILLAWYPGQRGGTAVADTLFGDANPG 738


>gi|188574621|ref|YP_001911550.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|188519073|gb|ACD57018.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 904

 Score =  339 bits (869), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 193/442 (43%), Positives = 261/442 (59%), Gaps = 32/442 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+ +T     QRA DLVSR+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA  G  
Sbjct: 36  PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
                   GAT FPQ I  AA+FD  L + +  AI  EARA ++         +  G+TF
Sbjct: 94  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTF 145

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQAS 211
           W+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+  +      G+   KL A+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDAT 205

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           A  KHF  +   +     R+ FDAR + +DL +TY P FE+ VK G+   +M AYNRV G
Sbjct: 206 A--KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYG 260

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
             + A + LL    R+QWGF GY+ SDC A+  ++       + E A    +  G ++ C
Sbjct: 261 ESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELEC 320

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G        AAV Q  + E++ID AL  L + RMRLG+F+  P   P+ KI A V  SPA
Sbjct: 321 GEEYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPA 378

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H  LA + A++ +VLLKN  GLLPL ++    +A+IGP A+    LLGNY G     +T 
Sbjct: 379 HDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTV 437

Query: 452 LQALQNYVEN--TVYYPGCDTV 471
           LQ ++    N   +Y  G D V
Sbjct: 438 LQGIRAAAPNAQVLYARGADLV 459



 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 53/148 (35%), Positives = 79/148 (53%), Gaps = 21/148 (14%)

Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
           LQA ++Y     Y+ G    A   A         + +A+D+A+ AD VV + GL    E 
Sbjct: 594 LQAGRSYDLRLDYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 653

Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           EE+          DR DL LP  Q+EL+  + +A  KPV+ VL  G  + I +A+  +++
Sbjct: 654 EEMKVSYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVAVLTAGSALAIDWAQ--QHV 710

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 711 PAILLAWYPGQRGGTAVADTLFGDANPG 738


>gi|358061481|ref|ZP_09148135.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
           WAL-18680]
 gi|356700240|gb|EHI61746.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
           WAL-18680]
          Length = 695

 Score =  339 bits (869), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 210/554 (37%), Positives = 307/554 (55%), Gaps = 57/554 (10%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           ++A  LV ++TL+E+ SQ+   APA+PRLGIPAY WW E LHGVA  G           A
Sbjct: 8   RKAVRLVEQMTLEERASQMRYDAPAVPRLGIPAYNWWGEGLHGVARAGT----------A 57

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-----AGQAI--GMTFWAPNINIFRD 166
           T FPQ I  AA FD  L   I   +  E RA YN       + I  G+TFW+PN+NIFRD
Sbjct: 58  TMFPQAIAMAAMFDVELTEEIANVVSTEGRAKYNQFCEEGDRDIYKGLTFWSPNVNIFRD 117

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T +   ++VRG+QGD  +       L+ +AC KHF  +   +  
Sbjct: 118 PRWGRGHETYGEDPYLTSRLGTAFVRGLQGDGEH-------LKIAACAKHFAVH---SGP 167

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R++F A  + +DL +TY P FE+CVK+     +M AYN  +G P CA+  L+ +  R
Sbjct: 168 EALRHEFWADTSKKDLWETYLPAFEACVKEAHVESVMGAYNSYHGEPCCANTLLMEEILR 227

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
            QWGF G+  SDC A+   +       +  ++    +K G D+NCG ++LQ     A ++
Sbjct: 228 GQWGFEGHFVSDCWAIRDFHMNYMVTDTAMESAALAVKKGCDLNCGNTYLQ--VLKACEE 285

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             L ++ +  A+  LF+ R  LG+  G  T   +  I  +VV    H+ LA++AA+  +V
Sbjct: 286 GLLDDACVTEAVVRLFTTRYLLGM--GEET--EYDDIPYEVVECKEHRELAVEAARRSMV 341

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-- 463
           LLKN  GLLPL   K  ++A+IGPNA++   L+GNY G S    T L+ +Q+ V   V  
Sbjct: 342 LLKND-GLLPLHAEKLNTIAVIGPNADNRTALIGNYHGTSSCYTTILEGIQDAVGEDVRV 400

Query: 464 -YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
            Y  GC       + +A +   + +A  +AK +D VVL +GLD+T E EE          
Sbjct: 401 LYAEGCHLFKDRVEHLAVAGDRLSEARIVAKHSDVVVLCVGLDETLEGEEGDTGNSHASG 460

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+ DL+LP  Q+ L+  +     KPV++  + G  +D++ A+     G+++   YPG  G
Sbjct: 461 DKKDLLLPESQRRLMEEILNLG-KPVVVCNMSGSAIDLSLAQ--EKAGAVIQVWYPGAEG 517

Query: 567 AVALAEVIFGDHNP 580
             ALA+++FG  +P
Sbjct: 518 GRALADLLFGKASP 531


>gi|323344407|ref|ZP_08084632.1| beta-glucosidase [Prevotella oralis ATCC 33269]
 gi|323094534|gb|EFZ37110.1| beta-glucosidase [Prevotella oralis ATCC 33269]
          Length = 722

 Score =  339 bits (869), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 210/560 (37%), Positives = 308/560 (55%), Gaps = 46/560 (8%)

Query: 40  TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
           T TF   K    + Q+A+ ++S+LTLDEKISQL   A  I RLGI  Y W +EALHGV  
Sbjct: 19  TFTFAQSKKEKEMIQKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGR 78

Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAI 152
            G+          AT FPQ I   A+FD  +  +IG AI  E RA +       N     
Sbjct: 79  DGR----------ATVFPQPISLGATFDPEIVQQIGDAIATEGRAKFIVAQRQKNYSMYA 128

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQAS 211
           G+TFWAPN+NIFRDPRWGRG ET GEDP +TG    ++V+G+QG D F        L+A+
Sbjct: 129 GLTFWAPNVNIFRDPRWGRGMETYGEDPFLTGVLGTAFVKGMQGNDPF-------YLKAA 181

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           AC KHF  +   +    TR+  +   T  DL +TY P F+  V+QG+   IM AY R+ G
Sbjct: 182 ACGKHFAVH---SGPERTRHTANVEPTKHDLYETYLPAFKMLVQQGKVESIMGAYQRLYG 238

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
                 + LL+   R+ WGF G++ SDC AV+ +Y+     KS  +AV   +KAG+++ C
Sbjct: 239 ESCSGSKYLLTDILRKDWGFKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFAIKAGLNLEC 298

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G+ + +  K A+KQK + E ++D+AL  L   R++LG+   +    P+ +    V+ S  
Sbjct: 299 GNSM-RTMKDALKQKLITEKDLDKALLPLMMTRLKLGILQPDVAC-PYNEFPESVIGSID 356

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           ++ +A +AA++ +VLLKN  G+LP+ K    +L + GP A  A  L+GNY G S R  T 
Sbjct: 357 NRNIAQRAAEESMVLLKND-GVLPIAKDIR-TLFVTGPGATDAYYLMGNYFGLSDRYSTY 414

Query: 452 LQALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE--- 505
           L+ +   V N     Y  G   V  +   ++ +V  ++GA+  +++MG     E EE   
Sbjct: 415 LEGIVGKVSNGTSVNYKQGFMQVFKNLNDVNWSVSESRGAEVSIIIMGNSGNTEGEEGDA 474

Query: 506 ------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
                  DRVDL LP  Q + +  V++     +++VL  G P+D+   +      +++ A
Sbjct: 475 IASSERGDRVDLRLPEPQMQYLREVSKDRTNKLVVVLTGGSPIDV--KEITELADAVVMA 532

Query: 560 GYPGEAGAVALAEVIFGDHN 579
            YPG+ G VALA ++FGD N
Sbjct: 533 WYPGQEGGVALANLLFGDAN 552


>gi|389632743|ref|XP_003714024.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|351646357|gb|EHA54217.1| beta-xylosidase [Magnaporthe oryzae 70-15]
          Length = 847

 Score =  339 bits (869), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 212/571 (37%), Positives = 307/571 (53%), Gaps = 37/571 (6%)

Query: 34  DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
           D  N    T   C      ++RA  LV  + LDEK+  LVN +P  PR+G+PAYEWWSEA
Sbjct: 84  DCENGPLSTNIVCDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEA 143

Query: 94  LHGVAGVGKGIFFNGT----IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           LHGVA    G+ FN +       ATSF   I+ +A+FD  L   +   I  EARA  NAG
Sbjct: 144 LHGVAK-SPGVTFNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAG 202

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            A G+ +W PNIN ++DPRWGRG ETPGED L   KY  + +RG++G      K+     
Sbjct: 203 LA-GLDWWTPNINPYKDPRWGRGMETPGEDALRISKYVKALLRGLEGSDPTTRKM----- 256

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
             A CKH+ A DL+ W G TRY FDA VT+QDL++ Y P F+ C +       MCAYN +
Sbjct: 257 -VANCKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAM 315

Query: 270 ---------NGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPED 317
                    NG P CA + L++   R  WG+   + +ITSDC+AV  +++   ++ + E+
Sbjct: 316 SIKGKDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHWSDTREE 375

Query: 318 AVVDVLKAGMDVNC--GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
           A      AG D  C   ++ +   K A  +  L E  +DRAL  L+   +R G F+G   
Sbjct: 376 AAGSAYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDA 435

Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP----KSKSVSLALIGPNA 431
             P+  I    V +P  + LA ++A +G+VL KN +G+LP+     + K  ++ALIG   
Sbjct: 436 --PYRNITWADVNTPEARKLAHRSAVEGMVLTKN-NGVLPIKLEELQKKGKTVALIGNWV 492

Query: 432 NSAKTLLGNYAGPSCRSITPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
           ++ + +LG Y+G +    TPL A +  N    T   P   +     +    A++ A  AD
Sbjct: 493 DNGEQMLGTYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQAD 552

Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
            V+   G+D + E E+ DR  L  P  Q +L++ ++ A  KP ++V L G  +D T    
Sbjct: 553 VVLYFGGIDLSVEAEDRDRYSLAWPSAQAKLLSDIS-ALGKPTVVVQL-GTMLDDTALLD 610

Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           ++NI +I+WAGYPG+ G  A  ++I G   P
Sbjct: 611 NKNISAIIWAGYPGQDGGTAAFDIITGKTAP 641


>gi|325970053|ref|YP_004246244.1| beta-glucosidase [Sphaerochaeta globus str. Buddy]
 gi|324025291|gb|ADY12050.1| Beta-glucosidase [Sphaerochaeta globus str. Buddy]
          Length = 698

 Score =  338 bits (868), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 210/549 (38%), Positives = 306/549 (55%), Gaps = 53/549 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           QRA++LV R+ L + +SQL + APAI  LGIPAY WW+E LHG A  G           A
Sbjct: 5   QRAQELVERMNLPQMMSQLRHDAPAIESLGIPAYNWWNEGLHGSARSGT----------A 54

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
           T FPQ I  A+ FD    Y +   +  E RA YN            G+T W+PN+NIFRD
Sbjct: 55  TVFPQAIGLASLFDPDFLYAVASVVSTEQRAKYNLFTHENDRDIYKGLTVWSPNVNIFRD 114

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRGQET GEDP +T + AV+++RG+QG+    G +   L+ ++C KHF A+   +  
Sbjct: 115 PRWGRGQETFGEDPYLTARLAVAFIRGLQGE----GPV---LKTASCVKHFAAH---SGP 164

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R+ F+A V  +DL +TY P F S VK+ +A  +M AY+ +N  P CA   L+ +T R
Sbjct: 165 EPLRHGFNAVVGKKDLEETYLPAFASAVKEAKADAVMGAYSALNDEPCCASSFLMEETLR 224

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
            +WGF G   SDC A+   +      K+ E++    LK G D+ CG   Q   K A ++ 
Sbjct: 225 LRWGFEGMYISDCWAIRDFHLNHKVTKNEEESAALALKRGCDLACGCEYQSLEK-AFQKG 283

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            +   +I +A   + + R +LG F+       +  +G + + S  H  LA +A+   +VL
Sbjct: 284 LITREQIKKAAIRVMTTRFKLGQFDQGTA---YDTLGLESLDSDEHAALAFEASCRSLVL 340

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT---V 463
           LKN   LLPL K     LA+IGPNA+S + L GNY G S R +T L+ L++YV ++   +
Sbjct: 341 LKND-ALLPLKKEAVSCLAVIGPNADSRQALWGNYHGTSSRYVTILEGLRDYVGSSTRIL 399

Query: 464 YYPG-------CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL---------D 507
           Y  G        + +A     + +AV +AK +D VVL +GL++T E E           D
Sbjct: 400 YSEGSNLTKNKVERLAKDDDRLSEAVFMAKASDVVVLCLGLNETVEGEMHDDGNGGWAGD 459

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           + DL LP  Q++L+  VAE   KP+I+VLL GG +D    +Y  N+ +++ A YPG+ G 
Sbjct: 460 KDDLRLPLCQRKLLKAVAETG-KPIIVVLLSGGSLDPEIEQY-ANVKALIQAWYPGQEGG 517

Query: 568 VALAEVIFG 576
            A+A +++G
Sbjct: 518 KAIAHLLYG 526


>gi|374316077|ref|YP_005062505.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
           str. Grapes]
 gi|359351721|gb|AEV29495.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
           str. Grapes]
          Length = 701

 Score =  338 bits (867), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 207/553 (37%), Positives = 300/553 (54%), Gaps = 53/553 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           ++A+ LV+ ++L E  SQL++ APAIPRLG+P Y WW+EALHG A  G           A
Sbjct: 8   EQAKQLVAHMSLKEMFSQLLHEAPAIPRLGLPRYNWWNEALHGAARSGT----------A 57

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNINIFRD 166
           T FPQ I  AA FD      I   I  E RA YN   A+G       +T W+PN+NIFRD
Sbjct: 58  TVFPQAIGLAAMFDDVFLKEIATVISTEQRAKYNTFSALGDRGIYKGLTLWSPNVNIFRD 117

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRGQET GEDP +  +  VS+++G+QGD          L+ +AC KHF  +   +  
Sbjct: 118 PRWGRGQETYGEDPYLASQLGVSFIQGLQGD-------GPYLKTAACVKHFAVH---SGP 167

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R+ F+A V+ +DL +TY P FE+CVK+G  + +M AY+ VNG P C    L++   R
Sbjct: 168 EPLRHDFNAIVSRKDLYETYLPAFEACVKEGEVNAVMGAYSAVNGEPCCGSPFLITDILR 227

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
             WGF G   SDC A+   +      K+  D+V   L AG D+NCG       K A +Q 
Sbjct: 228 NDWGFEGMYISDCWAIRDFHLNHAVTKNQVDSVALALNAGCDLNCGCEYLSLEK-AYQQG 286

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            +    I +A   + + R  LGLF+ + T   +  IG +   +  H+ +A +A+ + +VL
Sbjct: 287 LIDRKTITQACIRVMTTRFALGLFSEDCT---YSNIGYEQNDTEEHRKVAFKASCNSLVL 343

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
           LKN  G+LPL      ++A+IGPNA+S + L GNY G S    T L+  +  +  +V   
Sbjct: 344 LKND-GMLPLDSRSLHAIAIIGPNADSREALWGNYHGTSSTYTTVLEGFRKTLGESVKVK 402

Query: 464 YYPGC-------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL---------D 507
           Y  G        + +A  +  I +A+ +A  +D ++L +G D+T E E           D
Sbjct: 403 YSQGSAIQKEKLERLAEPNDRIAEAIAVATVSDTIILCLGYDETVEGEMHDDGNGGWAGD 462

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           + DL LP  Q+ L+  VA    KP++LVLL GG +D    ++  N+ ++L   YPG+ G 
Sbjct: 463 KQDLRLPPCQRALLKAVASTG-KPIVLVLLSGGAIDPEIERFP-NVKALLQGWYPGQEGG 520

Query: 568 VALAEVIFGDHNP 580
           +A+A  I G +NP
Sbjct: 521 LAIAHTILGLNNP 533


>gi|418518550|ref|ZP_13084692.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|418522850|ref|ZP_13088880.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410700720|gb|EKQ59264.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410703176|gb|EKQ61671.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 901

 Score =  338 bits (867), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 196/455 (43%), Positives = 262/455 (57%), Gaps = 32/455 (7%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
             C      T T P+  T      RA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW
Sbjct: 20  LGCATGAAQTATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
           +EALHGVA  G          GAT FPQ I  AA+FD  L + +  AI  EARA ++   
Sbjct: 80  NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129

Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-- 201
                 +  G+TFW+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+  +   
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPK 189

Query: 202 ---GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
              G+   KL A+A  KHF  +   +     R+ FDAR + +DL +TY P FE+ VK G+
Sbjct: 190 NAQGERYRKLDATA--KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGK 244

Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
              +M AYNRV G  + A + LL    R QWGF GY+ SDC A+  I+       + E A
Sbjct: 245 VDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQA 304

Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
               +K G ++ CG        AAV+Q  + E++ID AL  L + RMRLG+F+  P   P
Sbjct: 305 AALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLP 362

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
           +  I A V  SPAH  LA + A++ +VLLKN  GLLPL ++K   +A+IGP A+    LL
Sbjct: 363 WSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALL 421

Query: 439 GNYAGPSCRSITPLQALQNYVEN--TVYYPGCDTV 471
           GNY G     +T LQ ++    N   +Y  G D V
Sbjct: 422 GNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLV 456



 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 78/148 (52%), Gaps = 21/148 (14%)

Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
           LQA + Y     Y+ G    A   A         + +A+D+A+ AD VV + GL    E 
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 650

Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           EE+          DR DL LP  Q++L+  + +A  KPV+ VL  G  + I +A+  +++
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGKPVVAVLTAGSALAIDWAQ--QHL 707

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPG 735


>gi|381170979|ref|ZP_09880130.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
 gi|380688543|emb|CCG36617.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
          Length = 901

 Score =  338 bits (867), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 196/455 (43%), Positives = 262/455 (57%), Gaps = 32/455 (7%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
             C        T P+  T     QRA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW
Sbjct: 20  LGCATGAAQAATPPYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
           +EALHGVA  G          GAT FPQ I  AA+FD  L + +  AI  EARA ++   
Sbjct: 80  NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129

Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-- 201
                 +  G+TFW+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+      
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPK 189

Query: 202 ---GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
              G+   KL A+A  KHF  +   +     R+ FDAR + +DL +TY P FE+ VK+G+
Sbjct: 190 NAQGEPYRKLDATA--KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKEGK 244

Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
              +M AYNRV G  + A + LL    R QWGF GY+ SDC A+  I+       + E A
Sbjct: 245 VDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQA 304

Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
               +K G ++ CG        AAV+Q  + E++ID AL  L + RMRLG+F+  P   P
Sbjct: 305 AALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLP 362

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
           +  I A V  SPAH  LA + A++ +VLLKN  GLLPL ++K   +A+IGP A+    LL
Sbjct: 363 WSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALL 421

Query: 439 GNYAGPSCRSITPLQALQNYVEN--TVYYPGCDTV 471
           GNY G     +T LQ ++    N   +Y  G D V
Sbjct: 422 GNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLV 456



 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 51/148 (34%), Positives = 78/148 (52%), Gaps = 21/148 (14%)

Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
           LQA + Y     Y+ G    A   A         + +A+D+A+ AD VV + GL    E 
Sbjct: 591 LQAGRAYDVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 650

Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           EE+          DR DL LP  Q++L+  + +A  +PV+ VL  G  + I +A+  +++
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGRPVVAVLTTGSALAIDWAQ--QHL 707

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPG 735


>gi|346726970|ref|YP_004853639.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346651717|gb|AEO44341.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 902

 Score =  338 bits (866), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 194/441 (43%), Positives = 260/441 (58%), Gaps = 32/441 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T     QRA DLVSR+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA  G   
Sbjct: 35  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
                  GAT FPQ I  AA+FD  L + +  AI  EARA ++         +  G+TFW
Sbjct: 92  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQASA 212
           +PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+  +      G+   KL A+A
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRKLDATA 204

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
             KHF  +   +     R+ FDAR + +DL +TY P FE+ VK G+   +M AYNRV G 
Sbjct: 205 --KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGE 259

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
            + A + LL    R+QWGF GY+ SDC A+  I+       + E A    +K G ++ CG
Sbjct: 260 SASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECG 319

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
                   AAV Q  + E++ID AL  L + RMRLG+F+  P   P+  I A V  SPAH
Sbjct: 320 EEYST-LPAAVHQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAH 377

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
             LA + A++ +VLLKN  GLLPL ++K   +A+IGP A+    LLGNY G     +T L
Sbjct: 378 DALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVL 436

Query: 453 QALQNYVEN--TVYYPGCDTV 471
           Q ++    N   +Y  G D V
Sbjct: 437 QGIRAAAPNAQVLYARGADLV 457



 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 68/114 (59%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           + +A+D+A+ AD VV + GL    E EE+          DR DL LP  Q++L+  + +A
Sbjct: 626 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QA 684

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             KPV+ VL  G  + I +A+  +++ +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 685 TGKPVVAVLTTGSALAIDWAQ--QHLPAILLAWYPGQRGGTAVADTLFGDANPG 736


>gi|325679939|ref|ZP_08159508.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
           albus 8]
 gi|324108377|gb|EGC02624.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
           albus 8]
          Length = 691

 Score =  337 bits (863), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 216/562 (38%), Positives = 306/562 (54%), Gaps = 61/562 (10%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +L   +RA  L   +T +E+ SQL   APAI RLGIPAY WW+E +HG+A  G   
Sbjct: 4   YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
                   AT FPQ I  AA FD  L  R  +    EARA YNA           G+T W
Sbjct: 62  --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           APNINIFRDPRWGRG ET GEDP +T +   + VRG+QGD    GK+   ++A+AC KHF
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTAQNGKAVVRGLQGD----GKV---MKAAACAKHF 166

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +     R+ FDA+   +D+ +TY P FE+ VK+ +   +M AYNRVNG P+CA 
Sbjct: 167 AVH---SGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACAS 223

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
             L+ K   ++W F GY  SDC A+   ++      +  ++    LKAG DVNCG   Q 
Sbjct: 224 DYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESAAMALKAGCDVNCGCTYQ- 280

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ--PFGKIGADVVCSPAHQVL 395
           +  AA+ +  + + +I  A  +L   R+RLG+F+ +      P+ K    V C+  H+ +
Sbjct: 281 NLLAALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSK----VACAE-HKAV 335

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           +L+ A+  +VLLKN +G+LPL   K  ++A+IGPNA+S   L GNY G S R  T L  +
Sbjct: 336 SLECAEKSLVLLKN-NGILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGI 394

Query: 456 QNYVENTVYY-PGCDTVACSSASIDKAVD-------IAKGADHVVLMMGLDQTQEKEE-- 505
           Q+  E  V +  GC     S + + +A D        AK AD V++ +GLD T E EE  
Sbjct: 395 QDRFEGRVIFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDATIEGEEGD 454

Query: 506 -------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
                   D+  L LP  Q+ L+ ++     KPV+ V+  G  ++      +    +++ 
Sbjct: 455 TGNEFSSGDKNGLTLPPPQKILVEKIMSVG-KPVVTVVCAGSAINT-----ESQPDALIH 508

Query: 559 AGYPGEAGAVALAEVIFGDHNP 580
           A YPG  G+ ALAEV+FGD +P
Sbjct: 509 AFYPGAEGSKALAEVLFGDVSP 530


>gi|78049893|ref|YP_366068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
 gi|78038323|emb|CAJ26068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 902

 Score =  337 bits (863), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 193/441 (43%), Positives = 261/441 (59%), Gaps = 32/441 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T     QRA DLVSR+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA  G   
Sbjct: 35  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
                  GAT FPQ I  AA+FD  L + +  AI  EARA ++         +  G+TFW
Sbjct: 92  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQASA 212
           +PNINIFRDPRWGRGQET GEDP +T +  V++V+G++G+  +      G+   KL A+A
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLRGEGADAPKNAQGEPYRKLDATA 204

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
             KHF  +   +     R+ FDAR + +DL +TY P FE+ VK G+   +M AYNRV G 
Sbjct: 205 --KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGE 259

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
            + A + LL    R+QWGF GY+ SDC A+  I+       + E A    +K G ++ CG
Sbjct: 260 SASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECG 319

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
                   AAV+Q  + E++ID AL  L + RMRLG+F+  P   P+  I A V  SPAH
Sbjct: 320 EEYST-LPAAVRQGLIDEAQIDTALTTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAH 377

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
             LA + A++ +VLLKN  GLLPL ++K   +A+IGP A+    LLGNY G     +T L
Sbjct: 378 DALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVL 436

Query: 453 QALQNYVEN--TVYYPGCDTV 471
           Q ++    N   +Y  G D V
Sbjct: 437 QGIRAAAPNAQVLYARGADLV 457



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 67/114 (58%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           + +A+D+A  AD VV + GL    E EE+          DR DL LP  Q++L+  + +A
Sbjct: 626 LQEALDVASSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QA 684

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             KPV+ VL  G  + I +A+  +++ +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 685 TGKPVVAVLTTGSALAIDWAQ--QHLPAILLAWYPGQRGGTAVADTLFGDANPG 736


>gi|226491558|ref|NP_001146416.1| uncharacterized protein LOC100279996 [Zea mays]
 gi|223975771|gb|ACN32073.1| unknown [Zea mays]
          Length = 507

 Score =  337 bits (863), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 168/320 (52%), Positives = 225/320 (70%), Gaps = 4/320 (1%)

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           MC+YN+VNG P+CAD++LLS   R  W  +GYI+SDCD+V ++Y+ + Y K+PEDA    
Sbjct: 1   MCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAIS 60

Query: 323 LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           +KAG+D+NCG+FL +HT AAV+  KL ES++DRA+ N     MRLG F+G+P   PFG +
Sbjct: 61  IKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNL 120

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
           G   VC+P++Q LA +AA+ GIVLLKN+ G LPL  +   S+A+IGPNAN++ T++GNY 
Sbjct: 121 GPSDVCTPSNQELAREAARQGIVLLKNT-GKLPLSATSIKSMAVIGPNANASFTMIGNYE 179

Query: 443 GPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS--IDKAVDIAKGADHVVLMMGLDQT 500
           G  C+  TPLQ L   V  TVY PGC  V CS  S  +D A   A  AD  VL++G DQ+
Sbjct: 180 GTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQS 238

Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
            E+E LDR  L+LPG+Q +L++ VA A+  P ILV++ GGP DI+FAK    I +ILW G
Sbjct: 239 IERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVG 298

Query: 561 YPGEAGAVALAEVIFGDHNP 580
           YPGEAG  A+A+V+FG HNP
Sbjct: 299 YPGEAGGAAIADVLFGYHNP 318


>gi|116197206|ref|XP_001224415.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
 gi|88181114|gb|EAQ88582.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
          Length = 735

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 203/522 (38%), Positives = 295/522 (56%), Gaps = 37/522 (7%)

Query: 74  NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYR 133
           +++P + RLG+ AY+WW+EALHGVA   +GI + G    AT FPQ I ++A+FD +L  R
Sbjct: 43  SNSPGVSRLGLSAYQWWNEALHGVAH-NRGITWGGQFSAATQFPQAITSSAAFDDHLIER 101

Query: 134 IGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRG 193
           IG  I  EARA  N G+A  + FW PN+N FRDPRWGRG ETPGED     K+A ++V+G
Sbjct: 102 IGVIISTEARAFANNGRA-HLDFWTPNVNPFRDPRWGRGHETPGEDAFRNKKWAEAFVQG 160

Query: 194 VQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESC 253
           +QG        +   +  A CKH+ AYDL+N   TTR+ FDA+V+ QDLA+ Y PPF+ C
Sbjct: 161 MQG-------TESTHRVIATCKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQC 213

Query: 254 VKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEG 310
            +  +   IMC+YN VNG+P+CA   L+    R+ W +   + Y+ SDCDAV  + +A G
Sbjct: 214 ARDSKVGSIMCSYNAVNGVPACASPYLMDTILRKHWNWTDQNQYVVSDCDAVYYLGNANG 273

Query: 311 ---YAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAAVKQKKLPESEIDRALHNLFSVRM 365
              Y  S   A+   L+AG D  C +         +A   ++  ++ +D+A+       +
Sbjct: 274 GHRYKSSYAAAIGASLEAGCDNMCWATGGTTPDPASAFNSRQFTQATLDKAMLRQMQGLV 333

Query: 366 RLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS-VSL 424
           + G F+G  ++  +  + A  V +   +  AL+AA++GIVLLKN + +LPL    S   +
Sbjct: 334 KAGYFDGPNSL--YRNLTAADVNTQVARDTALKAAEEGIVLLKNDN-ILPLTLGGSNTQV 390

Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQ------NYVENTVYYPGCDTVACSSASI 478
           A+IG  AN+A  +LG Y+G    S  P+ A +      NYV   +     DT A      
Sbjct: 391 AMIGFWANAADKMLGGYSGSPPFSHDPVTAARSMGITVNYVNGPLTQTNADTSA------ 444

Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
             AV+ A+ +  V+   G+D T EKE  DR  +  P  Q  +I R+A+   KPVI+V + 
Sbjct: 445 --AVNAAQKSSVVIFFGGIDNTVEKESQDRTSIAWPSGQLTMIQRLAQTG-KPVIVVRM- 500

Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           G  VD T      N+ +ILWAGYPG+ G  A+  +I G  +P
Sbjct: 501 GTHVDDTPLLSIPNVKAILWAGYPGQDGGTAVMNLITGLASP 542


>gi|373954937|ref|ZP_09614897.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373891537|gb|EHQ27434.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 723

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 203/549 (36%), Positives = 291/549 (53%), Gaps = 45/549 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P   R RDL+S+LTL+EK+ Q+++ +P++PRL +P Y WW+EALHGVA  G         
Sbjct: 30  PTDVRVRDLISKLTLEEKVHQMMDVSPSVPRLNLPKYNWWNEALHGVARSGV-------- 81

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFWAPNINI 163
             AT FPQ I   A+FD  L  R   AI  EARA+YNA    G       +TFW PNINI
Sbjct: 82  --ATIFPQAIALGATFDQDLAKRESTAISDEARAMYNAAMVNGYNEKYGGLTFWTPNINI 139

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           FRDPRWGRGQET GEDP +T +  V++++G+QGD          L+ +AC KHF  +   
Sbjct: 140 FRDPRWGRGQETYGEDPFLTSQIGVAFIQGLQGDD------PEHLKVAACAKHFAVH--- 190

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +     R+ F+A  + +DL +TY P F++ V   R   +MCAYNR N    C    LL +
Sbjct: 191 SGPERLRHSFNAIASPKDLRETYLPAFKALV-NARVEAVMCAYNRTNSEVCCGSNLLLDQ 249

Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
             R +W F G++ SDC A+   Y          +AV   +K G+D+NCG       + AV
Sbjct: 250 ILRDEWHFTGHVVSDCGAIVDFYMGHKVVPGQPEAVALAVKHGVDLNCGDEYPALIE-AV 308

Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
           K+  + E EID+AL  L   R +LGLF+      P+  I   V+ S  H+ LA + A   
Sbjct: 309 KRGLITEKEIDKALATLLKTRFKLGLFDPKQN-SPYNNIPVSVINSTDHRALAKEVALKS 367

Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE--- 460
           IVLLKN    LPL K+      + GPNA S   L+GNY G +    T L+ +   ++   
Sbjct: 368 IVLLKNEK-CLPL-KNNLSKYYITGPNAASVDALMGNYYGVNPHMSTILEGIAGAIQPGS 425

Query: 461 NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDRVDL 511
              Y PG      ++  ID     AK +D   ++MG+    E EE          DR+D 
Sbjct: 426 QMQYKPGILLDRDNNNPIDWTTGDAKASDVTFVVMGITGLLEGEEGEAIASPNYGDRLDY 485

Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
            LP  Q + + ++ +  K  V+ ++  G P++++  +      ++L A YPGE G  A+A
Sbjct: 486 NLPKNQIDFLRKIRKGNKNKVVAIITGGSPMNLS--EVHELADAVLLAWYPGEEGGNAVA 543

Query: 572 EVIFGDHNP 580
           +++FG  +P
Sbjct: 544 DILFGKVSP 552


>gi|389737578|ref|ZP_10190998.1| beta-glucosidase [Rhodanobacter sp. 115]
 gi|388434298|gb|EIL91245.1| beta-glucosidase [Rhodanobacter sp. 115]
          Length = 898

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 192/436 (44%), Positives = 252/436 (57%), Gaps = 32/436 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T     +RA DLVSR+TL EK++Q+ NSAPAIPRLG+PAY+WW+EALHGVA  G+  
Sbjct: 43  YLDTAHSFQERAADLVSRMTLAEKVAQMQNSAPAIPRLGVPAYDWWNEALHGVARAGE-- 100

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTFW 157
                   AT FPQ I  AA+FD  L +    AI  EARA YN        G+  G+TFW
Sbjct: 101 --------ATVFPQAIGLAATFDPALLHHEATAISDEARAKYNDFQRRGMRGRYEGLTFW 152

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           +PN NIFRDPRWGRGQET GEDP +T +  V++VRG++GD     KL       A  KHF
Sbjct: 153 SPNTNIFRDPRWGRGQETYGEDPYLTSRMGVAFVRGLEGDDPTYQKL------DATAKHF 206

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +   + R++FD   + +DL +TY P F++ V+QG    +M AYNRV+G+P+ A 
Sbjct: 207 AVH---SGPESERHRFDVHPSERDLHETYLPAFQALVQQGGVDAVMGAYNRVDGVPATAS 263

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
             LL    RR WGF GY+ SDCDAV+ IY       + E A    +  G D+NCG+    
Sbjct: 264 HRLLQDILRRDWGFKGYVVSDCDAVADIYQFHKVVPTAEQAAALAVNNGDDLNCGTTYAT 323

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
             K AV    + E  ID A+  L   R RLG+F+  P   P+  +   VV SP H  LAL
Sbjct: 324 LVK-AVHDGLVNEHTIDTAVTRLMLARFRLGMFD-PPGRVPWSTLPMSVVQSPQHDALAL 381

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           + AQ+ +VLLKN  GLLPL  +    +A+IGP A++   LLGNY G     +T LQ ++ 
Sbjct: 382 RTAQESMVLLKND-GLLPLSHNVR-RIAVIGPTADNVTALLGNYHGTPKAPVTILQGIRE 439

Query: 458 YVEN--TVYYPGCDTV 471
            V N    Y  G + V
Sbjct: 440 AVPNAQVTYVQGTELV 455



 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 65/117 (55%), Gaps = 13/117 (11%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
           + +  + A+D A+ AD V+   GL    E EE+          DR  L LP  Q++L+ +
Sbjct: 619 AKSPFEAALDAARHADVVIFAGGLSSDLEGEEMPVDYPGFAGGDRTTLALPATQRKLL-Q 677

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             +   KPV+LVL  G  + I +AK  +++ +IL A YPG+ G  A+A+ +FG+ +P
Sbjct: 678 ALQVTGKPVVLVLTTGSALAIDWAK--QHLPAILLAWYPGQDGGHAVADALFGNVDP 732


>gi|443692971|gb|ELT94448.1| hypothetical protein CAPTEDRAFT_221920 [Capitella teleta]
          Length = 757

 Score =  335 bits (860), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 216/546 (39%), Positives = 290/546 (53%), Gaps = 38/546 (6%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNS-----APAIPRLGIPAYEWWSEALHGV 97
           FPF   +L    RA DLV+RLTL+E   Q   S      PAI RLGI  Y W +E L G 
Sbjct: 20  FPFQDPSLSWDDRADDLVARLTLEEIAPQTQASYGGQHTPAIERLGIKPYVWITECLAGQ 79

Query: 98  AGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG---- 153
                          AT++PQ I  AASF   L + + + I  E RA +NA +A+G    
Sbjct: 80  VNTN-----------ATAYPQPIGMAASFSEELLFNVSRDISYEVRAHWNANRAVGKYST 128

Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
              ++ ++P INI R P WGR QET GEDPL++G  A S+VRG+QGD          L+A
Sbjct: 129 KVGLSCFSPVINIMRHPLWGRNQETYGEDPLLSGTLAQSFVRGLQGDD------PRYLRA 182

Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
           +A CKHF  +        +R+ FDA+V M+D   T+ P F+ CV  G  S +MC+YNR+N
Sbjct: 183 NAGCKHFDVHGGPEDIPVSRFSFDAKVNMRDWRMTFLPQFKMCVDAGSYS-LMCSYNRIN 241

Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
           GIP+CA++ LL+   R +WGFHGYI SD  A+S I +   Y  S    VV  +KAG ++ 
Sbjct: 242 GIPACANKQLLTDITRDEWGFHGYIVSDSGAISNIKEQHHYTNSTVATVVAAIKAGTNLE 301

Query: 331 CG---SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
            G   +        A+KQ  L E EI   +  L   R+RLG F+    M  + KIG DV+
Sbjct: 302 LGGGSNMYYPKQLDAMKQGLLTEKEIRDNVRPLLYTRLRLGEFDPE-AMVDYNKIGVDVI 360

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            SP H+  A++AA  G VLLKN + LLP+ K  S  LA++GP  N+   L G Y+     
Sbjct: 361 QSPEHREQAVKAAYMGFVLLKNHNNLLPIKKQYS-KLAIVGPFTNATSELFGTYSSEVNL 419

Query: 448 SITP--LQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
             T    + L     +T    GC   ACS    D       GAD V++ +G  Q  E E 
Sbjct: 420 KFTSTIFEGLSPLGGSTRSANGCTNSACSGYVRDDVETAVAGADLVIVALGSGQRFESEG 479

Query: 506 LDRVDLVLPGRQQELITR-VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
            DR  L L G Q +++   V  +   PVILVL+  GP+DIT+AK D  + +IL  GYP +
Sbjct: 480 NDRAYLDLHGHQLDILKDAVFFSNGAPVILVLINAGPLDITWAKLDPGVTAILSCGYPAQ 539

Query: 565 AGAVAL 570
           +   AL
Sbjct: 540 STGEAL 545


>gi|449299051|gb|EMC95065.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 849

 Score =  335 bits (859), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 204/542 (37%), Positives = 292/542 (53%), Gaps = 17/542 (3%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C T     QRA  +++ + + EK++ L++ +    RLG+P YEWWSEALHGVAG   G+ 
Sbjct: 43  CDTNATPYQRASAIINAMNITEKLANLLDVSYGSARLGLPPYEWWSEALHGVAG-SPGVN 101

Query: 106 F--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F  +G    ATSFP  I  +++FD      I   I  EARA  NA +  G+ ++ PNIN 
Sbjct: 102 FTSSGNYSYATSFPMPITFSSAFDDPSVQNIASVISTEARAYSNAARG-GLDYFTPNINP 160

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGKLKGKLQASACCKHFTAYD 221
           F+DPRWGRG ETPGEDPL    Y  + + G++G  D +      G  +  A CKHF  YD
Sbjct: 161 FKDPRWGRGSETPGEDPLRIQGYVKNLLIGLEGTDDGYFNTSHSGYKKMIATCKHFAGYD 220

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
           L++W G  RY +DA +T QDLA+ Y PPF++C +    + IMC+YN VN +P+CA+  L 
Sbjct: 221 LEDWDGYIRYGYDAEITTQDLAEYYLPPFQTCARDQNVASIMCSYNSVNSVPACANSYLQ 280

Query: 282 SKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC---GSFL 335
               R  WG+   + YITSDC+A+S IY    Y+ +   A    L  GMD  C    + +
Sbjct: 281 ETILREHWGWTIDNNYITSDCNAISDIYYNHNYSVNNAAAAGLSLSNGMDTACIVANTGV 340

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
                 +     + E+ I  AL   +   +  G F+   +  P+  IG   V +PA Q L
Sbjct: 341 MTDVNGSYYGGYVTEATITTALIRQYEALVIAGYFDP-ASSNPYRSIGWSSVNTPAAQTL 399

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           A QAA +G  LLKN+ GLLP   +    +A+IG  AN    + G Y+GP+    +PL A 
Sbjct: 400 ARQAATEGTTLLKNT-GLLPYKFTSQTKVAMIGMWANGTSQMQGGYSGPAPYLHSPLYAA 458

Query: 456 QNYVENTVYYPG-CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
                +  Y  G  +    +S     A   A+ AD ++   G+D + E E +DR  +  P
Sbjct: 459 SQLGLSYNYANGPINQTTLTSNYSQNATAAAQNADVILFFGGIDWSVEAEAMDRYQIAWP 518

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           G QQ LI ++A A  KP+I VL  G  +D T    + NI +++W GYPG+ G VA  +++
Sbjct: 519 GAQQALIAQLA-ALGKPMI-VLQMGSMLDATPILSNNNISALVWVGYPGQDGGVAAFDIL 576

Query: 575 FG 576
            G
Sbjct: 577 TG 578


>gi|384421334|ref|YP_005630694.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353464247|gb|AEQ98526.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 904

 Score =  335 bits (858), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 192/442 (43%), Positives = 258/442 (58%), Gaps = 32/442 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+  T     QRA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW+EALHGVA  G  
Sbjct: 36  PYLDTARSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 93

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
                   GAT FPQ I  AA+FD  L + +  AI  EARA ++         +  G+TF
Sbjct: 94  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTF 145

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQAS 211
           W+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+         G+   KL A+
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRKLDAT 205

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           A  KHF  +   +     R+ FDAR + +DL +TY P FE+ VK G+   +M AYNRV G
Sbjct: 206 A--KHFAVH---SGPEAERHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYG 260

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
             + A + LL    R+QWGF GY+ SDC A+  ++       + E A    +  G ++ C
Sbjct: 261 ESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATREQAAALAVTHGTELEC 320

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G        AAV Q  + E++ID AL  L + RMRLG+F+  P   P+ KI A V  SPA
Sbjct: 321 GEEYST-LPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPGQLPWSKIPASVNQSPA 378

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H  LA + A++ +VLLKN  GLLPL ++    +A+IGP A+    LLGNY G     +T 
Sbjct: 379 HDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTV 437

Query: 452 LQALQNYVEN--TVYYPGCDTV 471
           LQ ++    N   +Y  G D V
Sbjct: 438 LQGIRAAAPNAQVLYARGADLV 459



 Score = 75.5 bits (184), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 53/148 (35%), Positives = 79/148 (53%), Gaps = 21/148 (14%)

Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
           LQA ++Y     Y+ G    A   A         + +A+D+A+ AD VV + GL    E 
Sbjct: 594 LQAGRSYDLRLDYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 653

Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           EE+          DR DL LP  Q+EL+  + +A  KPV+ VL  G  + I +A+  +++
Sbjct: 654 EEMKVNYPGFAGGDRTDLRLPKPQRELLEAL-QATGKPVVAVLTAGSALAIDWAQ--QHV 710

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 711 PAILLAWYPGQRGGTAVADTLFGDANPG 738


>gi|443695317|gb|ELT96258.1| hypothetical protein CAPTEDRAFT_179825 [Capitella teleta]
          Length = 750

 Score =  335 bits (858), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 203/581 (34%), Positives = 302/581 (51%), Gaps = 35/581 (6%)

Query: 14  CLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLV 73
            +C T + T   S        PS+ + ++FPF   +LPI  R  DL+SRLT+++ I+Q V
Sbjct: 8   VVCLTLISTASFSQDNVLRFAPSSHALDSFPFRNVSLPIETRLNDLISRLTIEDAINQTV 67

Query: 74  ----NSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSY 129
                  P I RLGI   E+ +E L GV               AT FPQ +  AASF   
Sbjct: 68  ARYGKFTPGIERLGIKPIEYITECLRGVRR-----------ENATGFPQALGLAASFSRD 116

Query: 130 LWYRIGQAIGLEARALYNAG------QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
           L  R+  A+ +E RA YN         A G+T ++P INI R P WGR QET GEDP ++
Sbjct: 117 LMQRVATAVSVEVRAFYNHDIQRETYGAHGITCFSPVINILRHPLWGRNQETYGEDPYLS 176

Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
           G+ A  YV G+QGD          L+ SA CKHF A+   +     ++ FDA++  +DL 
Sbjct: 177 GELASQYVSGLQGDD------PRYLRVSAGCKHFDAHGGPDTIPVRKFGFDAKIEERDLQ 230

Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
            T+ P F+ C+   +   +MC++N +NG+PSCA++ LL+   R QWG+ G++ SD  AV 
Sbjct: 231 MTFLPAFKKCIA-AKPYNVMCSFNSINGVPSCANKRLLTDVLRAQWGYEGFVVSDDAAVE 289

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTK--AAVKQKKLPESEIDRALHNL 360
            I+    Y  S E A V+ +K+G ++   G F   + +   A+ +  + + E+   +  +
Sbjct: 290 YIFTEHHYNSSFETAAVEAIKSGCNMELVGKFDPSYWQLTKALNEHLITKDELMENVRPV 349

Query: 361 FSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
           F  R  LG F+  P + PF +I  DVV S  HQ LAL+AA    VLLKN    LPL K+ 
Sbjct: 350 FLTRFLLGEFDP-PALNPFNQITKDVVLSAEHQRLALEAAVKSFVLLKNDRNFLPLLKNS 408

Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRS--ITPLQALQNYVENTVYYPGCDTVACSSASI 478
             ++A++GP +N    L+G+Y+  +  S  +TPL  ++    N  +  GC    C+    
Sbjct: 409 LKTVAVVGPMSNYTDGLIGDYSTDTDPSLILTPLHGIKKLAPNVQFASGCSNSTCTDYRA 468

Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITR-VAEAAKKPVILVLL 537
                   GA  V + +G     E E  DR D+VLPG Q +L+   V  A  +PV+L+L 
Sbjct: 469 TDVAAAVDGAQVVFVALGTGFIVEAENNDRSDIVLPGAQLQLLKDAVYHANGRPVVLLLF 528

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
            GGP+D+TFA+    I SI+   +P      A+  ++  + 
Sbjct: 529 NGGPLDVTFAQLTSGIVSIVECFFPAMMTGEAIYRMLINNE 569


>gi|389636381|ref|XP_003715843.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|351648176|gb|EHA56036.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|440480767|gb|ELQ61414.1| beta-xylosidase [Magnaporthe oryzae P131]
          Length = 517

 Score =  334 bits (856), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 200/499 (40%), Positives = 272/499 (54%), Gaps = 18/499 (3%)

Query: 34  DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
           D  N    T   C  TL   +RA  LV  L+++EK+  LV+ +   PR+G+PAY WWSEA
Sbjct: 29  DCVNGPLSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQGAPRIGLPAYNWWSEA 88

Query: 94  LHGVAGVGKGIFF---NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
           LHGVA    G +F   N     +TS+P  +L AA FD  L  +IG AIG+EARA  N+G 
Sbjct: 89  LHGVA-YAPGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKIGTAIGIEARAWGNSGW 147

Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
           A G  +W PN+N F+DPRWGRG ETPGED L   +YA    RG+ G   N  +     + 
Sbjct: 148 A-GFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQR-----RI 201

Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
            + CKH+   D ++W GTTR+ F+A++TMQDLA+ Y  PF+ C +  +   IMCAYN VN
Sbjct: 202 ISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNAVN 261

Query: 271 GIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
           G+PSCA++ LL    R  W +   + Y+TSDC+AV  +     YA +         +AGM
Sbjct: 262 GVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHYAPTNAAGTAICFEAGM 321

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D +C          A  Q  L E  +DRAL  L+   +R G F+G   M  +  +    V
Sbjct: 322 DTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAM--YADLDWQHV 379

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            S   Q LALQAA +G+VLLKN +G LPL    S  +A+IG  A++ + L G Y+G +  
Sbjct: 380 NSAEAQSLALQAAVEGMVLLKN-NGTLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAHH 438

Query: 448 SITP-LQALQNYVENTVYY-PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
             +P   A Q  ++ TV   P       S      A++ A GAD+++   GLD +   E 
Sbjct: 439 LYSPAFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASGADYILYFGGLDTSAAGET 498

Query: 506 LDRVDLVLPGRQQELITRV 524
           LDR DL  P  Q  L+  V
Sbjct: 499 LDRTDLDWPEAQLTLVKVV 517


>gi|332377068|gb|AEE64772.1| Xyl3A [Ruminococcus albus 8]
          Length = 691

 Score =  334 bits (856), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 215/562 (38%), Positives = 304/562 (54%), Gaps = 61/562 (10%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +L   +RA  L   +T +E+ SQL   APAI RLGIPAY WW+E +HG+A  G   
Sbjct: 4   YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
                   AT FPQ I  AA FD  L  R  +    EARA YNA           G+T W
Sbjct: 62  --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           APNINIFRDPRWGR  ET GEDP +T +   + VRG+QGD    GK+   ++A+AC KHF
Sbjct: 114 APNINIFRDPRWGRSHETFGEDPYLTAQNGKAVVRGLQGD----GKV---MKAAACAKHF 166

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +     R+ FDA+   +D+ +TY P FE+ VK+ +   +M AYNRVNG P+CA 
Sbjct: 167 AVH---SGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPACAS 223

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
             L+ K   ++W F GY  SDC A+   ++      +  ++    LKAG DVNCG   Q 
Sbjct: 224 DYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESAAMALKAGCDVNCGCTYQ- 280

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ--PFGKIGADVVCSPAHQVL 395
           +  AA+ +  + + +I  A  +L   R+RLG+F+ +      P+ K    V C+  H+ +
Sbjct: 281 NLLAALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSK----VACAE-HKAV 335

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           +L+ A+  +VLLKN +G+LPL   K  ++A+IGPNA+S   L GNY G S R  T L  +
Sbjct: 336 SLECAEKSLVLLKN-NGILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGI 394

Query: 456 QNYVENTVYY-PGCDTVACSSASIDKAVD-------IAKGADHVVLMMGLDQTQEKEE-- 505
           Q+  E  V +  GC     S + + +A D        AK AD V++ +GLD T E EE  
Sbjct: 395 QDRFEGRVIFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDATIEGEEGD 454

Query: 506 -------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
                   D+  L LP  Q+ L+ ++     KPV+ V+  G  ++      +    +++ 
Sbjct: 455 TGNEFSSGDKNGLTLPPPQKILVEKIMSVG-KPVVTVVCAGSAINT-----ESQPDALIH 508

Query: 559 AGYPGEAGAVALAEVIFGDHNP 580
           A YPG  G  ALAEV+FGD +P
Sbjct: 509 AFYPGAEGGKALAEVLFGDVSP 530


>gi|440476402|gb|ELQ45004.1| beta-xylosidase, partial [Magnaporthe oryzae Y34]
          Length = 515

 Score =  334 bits (856), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 199/496 (40%), Positives = 271/496 (54%), Gaps = 18/496 (3%)

Query: 34  DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
           D  N    T   C  TL   +RA  LV  L+++EK+  LV+ +   PR+G+PAY WWSEA
Sbjct: 29  DCVNGPLSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQGAPRIGLPAYNWWSEA 88

Query: 94  LHGVAGVGKGIFF---NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
           LHGVA    G +F   N     +TS+P  +L AA FD  L  +IG AIG+EARA  N+G 
Sbjct: 89  LHGVA-YAPGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKIGTAIGIEARAWGNSGW 147

Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
           A G  +W PN+N F+DPRWGRG ETPGED L   +YA    RG+ G   N  +     + 
Sbjct: 148 A-GFDYWTPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQR-----RI 201

Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
            + CKH+   D ++W GTTR+ F+A++TMQDLA+ Y  PF+ C +  +   IMCAYN VN
Sbjct: 202 ISTCKHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNAVN 261

Query: 271 GIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
           G+PSCA++ LL    R  W +   + Y+TSDC+AV  +     YA +         +AGM
Sbjct: 262 GVPSCANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHYAPTNAAGTAICFEAGM 321

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D +C          A  Q  L E  +DRAL  L+   +R G F+G   M  +  +    V
Sbjct: 322 DTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAM--YADLDWQHV 379

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            S   Q LALQAA +G+VLLKN +G LPL    S  +A+IG  A++ + L G Y+G +  
Sbjct: 380 NSAEAQSLALQAAVEGMVLLKN-NGTLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAHH 438

Query: 448 SITP-LQALQNYVENTVYY-PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
             +P   A Q  ++ TV   P       S      A++ A GAD+++   GLD +   E 
Sbjct: 439 LYSPAFAARQLGLDITVASGPVLQDNNASDNWTTNALEAASGADYILYFGGLDTSAAGET 498

Query: 506 LDRVDLVLPGRQQELI 521
           LDR DL  P  Q  L+
Sbjct: 499 LDRTDLDWPEAQLTLV 514


>gi|294667502|ref|ZP_06732718.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292602731|gb|EFF46166.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 901

 Score =  333 bits (855), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 193/455 (42%), Positives = 261/455 (57%), Gaps = 32/455 (7%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
             C        T P+  T      RA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW
Sbjct: 20  LGCATGAAQAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
           +EALHGVA  G          GAT FPQ I  AA+FD  L + +  AI  EARA ++   
Sbjct: 80  NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129

Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-- 201
                 +  G+TFW+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+  +   
Sbjct: 130 RQNQHERYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGGDAPK 189

Query: 202 ---GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
              G+   KL A+A  KHF  +   +     R+ FDA  + +DL +TY P FE+ VK G+
Sbjct: 190 NAQGERYRKLDATA--KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKDGK 244

Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
              +M AYNRV G  + A + LL    R QWGF GY+ SDC A+  I+       + E A
Sbjct: 245 VDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQA 304

Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
               +K G ++ CG        AAV+Q  + E++ID AL  L + RMRLG+F+  P   P
Sbjct: 305 AALAVKHGTELECGEEYST-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLP 362

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
           + +I A V  SPAH  LA + A++ +VLLKN  GLLPL +++   +A+IGP A+    LL
Sbjct: 363 WSQIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRARLKRIAVIGPTADDTMALL 421

Query: 439 GNYAGPSCRSITPLQALQNYVEN--TVYYPGCDTV 471
           GNY G     +T LQ ++    N   +Y  G D V
Sbjct: 422 GNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLV 456



 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 51/148 (34%), Positives = 77/148 (52%), Gaps = 21/148 (14%)

Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
           LQA + Y     Y+ G    A   A         + +A+D+A+ A+ VV + GL    E 
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSAEVVVFVGGLTGDVEG 650

Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           EE+          DR DL LP  Q++L+  +  A  KPV+ VL  G  + I +A+  +++
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-HATGKPVVAVLTTGSALAIDWAQ--QHL 707

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPG 735


>gi|21244948|ref|NP_644530.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21110666|gb|AAM39066.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 901

 Score =  333 bits (853), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 194/455 (42%), Positives = 260/455 (57%), Gaps = 32/455 (7%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
             C        T P+  T      RA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW
Sbjct: 20  LGCATGAAHAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
           +EALHGVA  G          GAT FPQ I  AA+FD  L + +  AI  EARA ++   
Sbjct: 80  NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129

Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-- 201
                 +  G+TFW+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+      
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPK 189

Query: 202 ---GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
              G+   KL A+A  KH   +   +     R+ FDAR + +DL +TY P FE+ VK+G+
Sbjct: 190 NAQGEPYRKLDATA--KHLAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKEGK 244

Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
              +M AYNRV G  + A + LL    R QWGF GY+ SDC A+  I+       + E A
Sbjct: 245 VDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQA 304

Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
               +K G ++ CG        AAV+Q  + E++ID AL  L + RMRLG+F+  P   P
Sbjct: 305 AALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLP 362

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
           +  I A V  SPAH  LA + A++ +VLLKN  GLLPL ++K   +A+IGP A+    LL
Sbjct: 363 WSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKFKRIAVIGPTADDTMALL 421

Query: 439 GNYAGPSCRSITPLQALQNYVEN--TVYYPGCDTV 471
           GNY G     +T LQ ++    N   +Y  G D V
Sbjct: 422 GNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLV 456



 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 51/148 (34%), Positives = 78/148 (52%), Gaps = 21/148 (14%)

Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
           LQA + Y     Y+ G    A   A         + +A+D+A+ AD VV + GL    E 
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGARPPLQEALDVARSADVVVFVGGLTGDVEG 650

Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           EE+          DR DL LP  Q++L+  + +A  +PV+ VL  G  + I +A+  +++
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGRPVVAVLTTGSALAIDWAQ--QHL 707

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPG 735


>gi|319788503|ref|YP_004147978.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
 gi|317467015|gb|ADV28747.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
           suwonensis 11-1]
          Length = 916

 Score =  333 bits (853), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 196/455 (43%), Positives = 259/455 (56%), Gaps = 37/455 (8%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+  T+L   +RA  LVSR+TL+EK +Q+ N +PAI RLG+PAY+WW+EALHGVA  G  
Sbjct: 49  PWLDTSLSFEERAAALVSRMTLEEKAAQMQNDSPAIERLGLPAYDWWNEALHGVARAG-- 106

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTF 156
                   GAT FPQ I  AASFD  L  ++  AI  EARA ++        G+  G+TF
Sbjct: 107 --------GATVFPQAIGMAASFDVPLMDQVSAAISDEARAKHHDFLRKGEHGRYQGLTF 158

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQA-SACC 214
           W+PNINIFRDPRWGRGQET GEDP +T +  VS+VRG+QG D   G  L  K +   A  
Sbjct: 159 WSPNINIFRDPRWGRGQETYGEDPFLTTRMGVSFVRGLQGMDPQTGQPLDPKYRKLDATA 218

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KHF  +   +     R+ FD   + QDL DTY P FES VK+     +M AYNRV G  +
Sbjct: 219 KHFAVH---SGPEADRHTFDVHPSKQDLYDTYLPAFESLVKEADVYAVMGAYNRVYGESA 275

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
              + LL  T RR WGF GY+ SDC A+  I+      ++PE+A    +K G ++NCGS 
Sbjct: 276 SGSKFLLLDTLRRDWGFDGYVMSDCWAIVDIWKNHKIVETPEEAAALAVKNGTELNCGST 335

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
              H   AVK+  + E+E+D AL  LF  RM LG+F+  P    + ++   V  S  H  
Sbjct: 336 YADHLPVAVKKGLISEAELDDALTRLFVARMELGMFD-PPEQVRWAQVPYSVNQSAEHDA 394

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LA + AQ+ +VLLKN  G+LPL K     LA++GP A+    LLGNY G     +T L+ 
Sbjct: 395 LARKMAQESLVLLKND-GVLPLSKDIR-RLAVVGPTADDTMALLGNYYGTPADPVTILRG 452

Query: 455 LQNYVENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
           ++         PG D V        + VD+ +G D
Sbjct: 453 IREAA------PGVDVVYA------RGVDLVEGRD 475



 Score = 75.9 bits (185), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 46/114 (40%), Positives = 64/114 (56%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
            + A++ A  AD VV + GL    E EE+          DR D+ LP  QQ+L+  V  A
Sbjct: 640 FETALEAANSADAVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDIRLPATQQKLLEAV-HA 698

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             KPV++VL  G  + I +A+  RN+  IL A YPG+ G  A+ E +FGD+NPG
Sbjct: 699 TGKPVVMVLTTGSALGIDWAR--RNVPGILVAWYPGQRGGTAVGEALFGDYNPG 750


>gi|109897152|ref|YP_660407.1| beta-glucosidase [Pseudoalteromonas atlantica T6c]
 gi|109699433|gb|ABG39353.1| Beta-glucosidase [Pseudoalteromonas atlantica T6c]
          Length = 733

 Score =  332 bits (852), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 207/557 (37%), Positives = 300/557 (53%), Gaps = 44/557 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+  T LP ++R   L+  +TL EK SQLVN   AI RLG+P Y++W+EALHGVA  G+ 
Sbjct: 25  PWFDTQLPTNERIESLIDAMTLKEKASQLVNGNVAIERLGLPEYDFWNEALHGVARNGR- 83

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTF 156
                    AT FPQ I  AA+FD  L  +    I  EARA +N    IG       +TF
Sbjct: 84  ---------ATVFPQAIGMAATFDQDLLLQAATVISDEARAKFNVSSEIGNRSKYSGLTF 134

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W PNINIFRDPRWGRGQET GEDP +T +   + V G+QGD          L+ +A  KH
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDH------PKYLKTAAAAKH 188

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           F  +   +     R++FDA  + +D+ +TY P FE+ V +     +M AYNRVNG P+  
Sbjct: 189 FAVH---SGPEALRHEFDAIASEKDMYETYFPAFEALVTEADVETVMAAYNRVNGHPAGG 245

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
              LL+   R +WGF G+I SDC  ++  ++      +  ++    +  G D+NCGS + 
Sbjct: 246 SDFLLNTVLRDKWGFSGHIVSDCWGLADFHEYHKVTANAVESAALAINTGTDLNCGS-VY 304

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
                AV+   + E  ID  LH + + + +LG F+      P+  I ADVV S AH  +A
Sbjct: 305 TALPDAVEAGLVDEKTIDTRLHKVLATKFKLGFFDPKDD-NPYNSISADVVNSDAHADVA 363

Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
            + A   IVLL+N + +LPL K+   ++ + GP A+S++ LLGNY G S ++   L  + 
Sbjct: 364 YEMAVKSIVLLQNENQVLPLDKNIR-NVYVTGPFASSSEVLLGNYYGLSGKTTNILDGIT 422

Query: 457 NYVE--NTVYYP-GCDTVACSSASIDKAVDIAKG-ADHVVLMMGLDQTQEKEE------- 505
             V    T+ Y  G      +   ID     AK   D ++ +MGL    E EE       
Sbjct: 423 ANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIASP 482

Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
              DR+ L LP  Q E + ++ +   KPVI+VL  G PV++T  +  +   +I++A YPG
Sbjct: 483 HKGDRLSLDLPEHQIEFLRKLRKDNDKPVIVVLTAGTPVNVT--EIAQLADAIVFAWYPG 540

Query: 564 EAGAVALAEVIFGDHNP 580
           + G  A+A+++FG+ +P
Sbjct: 541 QEGGKAVADILFGERSP 557


>gi|285016879|ref|YP_003374590.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
 gi|283472097|emb|CBA14604.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
           PC73]
          Length = 914

 Score =  332 bits (851), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 188/447 (42%), Positives = 266/447 (59%), Gaps = 36/447 (8%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  +    +QRA DLV+R+TL+EK++Q+ N+APAIPRLG+PAY+WW+E LHGVA  G   
Sbjct: 34  YLDSQRTFAQRADDLVARMTLEEKVAQMQNAAPAIPRLGVPAYDWWNEGLHGVARAG--- 90

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
                  GAT FPQ I  AA+FD  L + +  AI  EARA ++        G+  G+TFW
Sbjct: 91  -------GATVFPQAIGLAATFDLPLMHEVSTAISDEARAKHHEALRRGEHGRYQGLTFW 143

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQASA 212
           +PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+  +      G+   KL A+A
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGMQGEGADAPKNAQGETYRKLDATA 203

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
             KHF  +   +   + R+ FDAR + +DL +TY P FE+ VK+G+   +M AYNR+ G 
Sbjct: 204 --KHFAVH---SGPESERHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRLFGE 258

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
            + A + LL    R +WGFHGY+ SDC A+  I+       + E A    +K G  + CG
Sbjct: 259 SASASKFLLRDVLRERWGFHGYVVSDCWAIVDIWKNHKIVATREQAAALAVKNGTQLECG 318

Query: 333 SFLQKHTK--AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
              Q++    AAV+Q  + E++ID AL  L + RMRLG+F+  P    + ++   V  SP
Sbjct: 319 ---QEYATLPAAVQQGLIGETDIDAALRTLMTARMRLGMFD-PPGQLRWAQLPISVNQSP 374

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            H  LA + A++ +VLLKN  GLLPL ++K   +A+IGP A+    LLGNY G     +T
Sbjct: 375 EHDALARRTARESLVLLKND-GLLPLSRAKHKRIAVIGPTADDTMALLGNYYGTPATPVT 433

Query: 451 PLQALQNYV--ENTVYYPGCDTVACSS 475
            LQ ++      + +Y  G D V   S
Sbjct: 434 ILQGIRAAAPDADVLYARGADLVEGRS 460



 Score = 72.0 bits (175), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 47/121 (38%), Positives = 69/121 (57%), Gaps = 13/121 (10%)

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQEL 520
           V  +   + +A+D A+ AD VV + GL    E EE+          DR DL LP  Q+EL
Sbjct: 618 VPGAKPPLQEALDTARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQREL 677

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  ++ A  KPV+ VL  G  + I +A+   ++ +IL A YPG+ G  A+A+V+FGD NP
Sbjct: 678 LQALS-ATGKPVVAVLTTGSALAIDWAQ--EHVPAILLAWYPGQRGGSAVADVLFGDTNP 734

Query: 581 G 581
           G
Sbjct: 735 G 735


>gi|390991557|ref|ZP_10261819.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372553724|emb|CCF68794.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 901

 Score =  332 bits (851), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 194/455 (42%), Positives = 259/455 (56%), Gaps = 32/455 (7%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
             C        T P+  T      RA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW
Sbjct: 20  LGCATGAAHAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWW 79

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
           +EALHGVA  G          GAT FPQ I  AA+FD  L + +  AI  EARA ++   
Sbjct: 80  NEALHGVARAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFL 129

Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-- 201
                 +  G+TFW+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+      
Sbjct: 130 RQNQHARYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPK 189

Query: 202 ---GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
              G+   KL A+A  KHF  +   +     R+ FDAR + +DL +TY P FE+ VK G+
Sbjct: 190 NAQGEPYRKLDATA--KHFAVH---SGPEADRHHFDARPSQRDLYETYLPAFEALVKDGK 244

Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
              +M AYNRV G  + A + LL    R QWGF GY+ SDC A+  I+       + E A
Sbjct: 245 VDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATREQA 304

Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
               +K G ++ CG        AAV+Q  + E++ID AL  L + RMRLG+F+  P   P
Sbjct: 305 AALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQLP 362

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
           +  I A V  SPAH  LA + A++ +VLLKN  GLLPL ++K   +A+IGP A+    LL
Sbjct: 363 WSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALL 421

Query: 439 GNYAGPSCRSITPLQALQNYV--ENTVYYPGCDTV 471
           GNY G     +T LQ ++        +Y  G D V
Sbjct: 422 GNYYGTPAAPVTVLQGIRAAAPKAQVLYARGADLV 456



 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 51/148 (34%), Positives = 78/148 (52%), Gaps = 21/148 (14%)

Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
           LQA + Y     Y+ G    A   A         + +A+D+A+ AD VV + GL    E 
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 650

Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           EE+          DR DL LP  Q++L+  + +A  +PV+ VL  G  + I +A+  +++
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QATGRPVVAVLTTGSALAIDWAQ--QHL 707

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPG 735


>gi|451996250|gb|EMD88717.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
           C5]
          Length = 763

 Score =  332 bits (851), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 195/552 (35%), Positives = 287/552 (51%), Gaps = 19/552 (3%)

Query: 34  DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
           D +N    T   C    P  +RA  LV+ +   EK+  LV+ +  + RLG+PAY WW EA
Sbjct: 25  DCTNGPLSTNAICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEA 84

Query: 94  LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG 153
           LHGVAG   GI F    + ATSFP  IL +A+FD  L ++I   IG EARA  N G A  
Sbjct: 85  LHGVAGA-PGIKFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVA-P 142

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           M +W P+IN  RD RWGR  E+PGED      Y  + + G++GD       + + +  A 
Sbjct: 143 MDYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKALLAGLEGD-------QAQRKIIAT 195

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           CKH+  YD++ W G  R+ F A++TMQDLA+ Y PPF+ C +  +    MC+YN VNG+P
Sbjct: 196 CKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGVP 255

Query: 274 SCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
           +CAD  +L    R  W +   + YITSDC+AV+ I +   Y ++           GMD++
Sbjct: 256 TCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYVETLAQGTALAFAKGMDLS 315

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
           C          A  Q  L  S ID+AL   +   +  G F+G      +  +  + + +P
Sbjct: 316 CEYSGSSDIPGAWSQGLLNLSVIDKALTRQYEGLVHAGYFDG--AKATYANLSYNDINTP 373

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
             + L+LQ   +G+V+LKN H  LPLP +K   +A+IG  AN +  L G Y+GP     +
Sbjct: 374 EARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRHS 432

Query: 451 PLQALQNYVENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           P+ A +    +    + P     +        A+D A+ +D+++   G D T  +E  DR
Sbjct: 433 PVFAGEQMGLDMAIAWGPMIQNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQEGYDR 492

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
             +  P  Q +L+ ++A+  K  V++ L  G   D +       I SI+WA +PG+ G  
Sbjct: 493 TTISFPQVQIDLLAKLAKLGKPLVVITL--GDMTDHSPLLSMEGINSIIWANWPGQDGGP 550

Query: 569 ALAEVIFGDHNP 580
           A+  VI G H P
Sbjct: 551 AILNVISGVHAP 562


>gi|367032987|ref|XP_003665776.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347013048|gb|AEO60531.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 835

 Score =  331 bits (849), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 215/582 (36%), Positives = 302/582 (51%), Gaps = 37/582 (6%)

Query: 22  TRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPR 81
           T  D T+PP S             C  TLP ++RA  LV+ LT +EK+  LV+ AP  PR
Sbjct: 14  TYPDCTKPPLS---------DIKVCDRTLPEAERAAALVAALTDEEKLQNLVSKAPGAPR 64

Query: 82  LGIPAYEWWSEALHGVAGVGKGIFFNG--TIRGATSFPQVILTAASFDSYLWYRIGQAIG 139
           +G+PAY WWSEALHGVA      F +G      +TSFP  +L AA+FD  L   +G  IG
Sbjct: 65  IGLPAYNWWSEALHGVAHAPGTQFRDGPGDFNSSTSFPMPLLMAAAFDDELIEAVGDVIG 124

Query: 140 LEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
            EARA  NAG + G+ +W PN+N FRDPRWGRG ETPGED +   +YA S +RG++G + 
Sbjct: 125 TEARAFGNAGWS-GLDYWTPNVNPFRDPRWGRGSETPGEDVVRLKRYAASMIRGLEGRSS 183

Query: 200 NGGKLKGKLQAS-----ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
           +                + CKH+   D ++W GTTR+ FDA ++ QDLA+ Y  PF+ C 
Sbjct: 184 SSSSCSFGSGGEPPRVISTCKHYAGNDFEDWNGTTRHDFDAVISAQDLAEYYLAPFQQCA 243

Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGY 311
           +  R   +MCAYN VNG+PSCA+  L++   R  W +     Y+TSDC+AV  +     Y
Sbjct: 244 RDSRVGSVMCAYNAVNGVPSCANSYLMNTILRGHWNWTEHDNYVTSDCEAVLDVSAHHHY 303

Query: 312 AKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN 371
           A +  +      +AGMD +C          A     L    +DRAL  L+   +R+G F+
Sbjct: 304 ADTNAEGTGLCFEAGMDTSCEYEGSSDIPGASAGGFLTWPAVDRALTRLYRSLVRVGYFD 363

Query: 372 GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV--------- 422
           G     P   +G   V  P  Q LAL+AA +GIVLLKN +  LPLP    V         
Sbjct: 364 G--PESPHASLGWADVNRPEAQELALRAAVEGIVLLKNDNDTLPLPLPDDVVVTADGGRR 421

Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPG----CDTVACSSASI 478
            +A+IG  A++   L G Y+G    + +P  A +    N     G     D+        
Sbjct: 422 RVAMIGFWADAPDKLFGGYSGAPPFARSPASAARQLGWNVTVAGGPVLEGDSDEEEDTWT 481

Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
             AV+ A  AD++V   GLD +   E  DR+ +  P  Q  LI+ +A    KPV++V + 
Sbjct: 482 APAVEAAADADYIVYFGGLDTSAAGETKDRMTIGWPAAQLALISELARLG-KPVVVVQMG 540

Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
               D    + D  +G++LWA +PG+ G  A+  ++ G  +P
Sbjct: 541 DQLDDTPLFELD-GVGAVLWANWPGQDGGTAVVRLLSGAESP 581


>gi|451851086|gb|EMD64387.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
          Length = 763

 Score =  331 bits (848), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 194/552 (35%), Positives = 287/552 (51%), Gaps = 19/552 (3%)

Query: 34  DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
           D +N    T   C    P  +RA  LV+ +   EK+  LV+ +  + RLG+PAY WW EA
Sbjct: 25  DCTNGPLSTNAICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEA 84

Query: 94  LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG 153
           LHGVAG   GI F    + ATSFP  IL +A+FD  L ++I   IG EARA  N G A  
Sbjct: 85  LHGVAGA-PGIKFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVA-P 142

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           + +W P+IN  RD RWGR  E+PGED      Y  + + G++GD       + + +  A 
Sbjct: 143 VDYWTPDINPVRDIRWGRASESPGEDIRRIKGYTKALLAGLEGD-------QAQRKIIAT 195

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           CKH+  YD++ W G  R+ F A++TMQDLA+ Y PPF+ C +  +    MC+YN VNGIP
Sbjct: 196 CKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGIP 255

Query: 274 SCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
           +CAD  +L    R  W +   + YITSDC+AV+ I +   Y ++           GMD++
Sbjct: 256 TCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYVETLAQGTALAFAKGMDLS 315

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
           C          A  Q  L  S ID+AL   +   +  G F+G      +  +    + +P
Sbjct: 316 CEYTGSSDIPGAWAQGLLNISVIDKALTRQYEGLVHAGYFDG--AKATYANLSYKDINTP 373

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
             + L+LQ   +G+V+LKN H  LPLP +K   +A+IG  AN +  L G Y+GP     +
Sbjct: 374 EARQLSLQVTSEGLVMLKNDH-TLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRHS 432

Query: 451 PLQALQNYVENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           P+ A +    +    + P     +        A+D A+ +D+++   G D T  +E  DR
Sbjct: 433 PVFAGEQMGLDMAIAWGPMIQNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQEGYDR 492

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
             +  P  Q +L+T++A+  K  V++ L  G   D +       + SI+WA +PG+ G  
Sbjct: 493 TTISFPQVQIDLLTKLAKLGKPLVVITL--GDMTDHSPLLSMEGVNSIIWANWPGQDGGP 550

Query: 569 ALAEVIFGDHNP 580
           A+  V+ G H P
Sbjct: 551 AILNVVSGAHAP 562


>gi|386819249|ref|ZP_10106465.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
 gi|386424355|gb|EIJ38185.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
          Length = 878

 Score =  331 bits (848), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 185/427 (43%), Positives = 257/427 (60%), Gaps = 30/427 (7%)

Query: 40  TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
           +E +PF  T LP  +R  DL++RLT+DEKI+QL+  +PAI RLGIPAY WW+E+LHGVA 
Sbjct: 21  SEKYPFQNTELPEDERVNDLINRLTVDEKIAQLLYQSPAIERLGIPAYNWWNESLHGVAR 80

Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA----GQA---I 152
            G           AT FPQ I  AAS+D  L   +   I  EARA ++     GQ     
Sbjct: 81  AGY----------ATVFPQSITIAASWDDELVAEVANVISDEARAKHHEYLRRGQHDIYQ 130

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
           G+TFW+PNINIFRDPRWGRG ET GEDP +TG     YV+G+QG   N  K    L+  A
Sbjct: 131 GLTFWSPNINIFRDPRWGRGHETYGEDPYLTGVLGTEYVKGLQG---NNAKY---LKVVA 184

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
             KHF  +         R++FD   + +DL +TY P F + VK G    IM AYNR+ G 
Sbjct: 185 TAKHFAVHSGPE---PLRHEFDVAPSQRDLWETYLPAFRTLVKDGNVYSIMTAYNRIYGE 241

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
            + A  +L S   R +WGF+GY+ SDC A++ ++     AK   +A    +K G D+NCG
Sbjct: 242 AASASNSLYS-ILRDKWGFNGYVVSDCGAIADMWKTHHVAKDAAEASAMAVKEGCDLNCG 300

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
           +  +K T  A++   + E+++D ALH L   R +LG+F+ +  + P+ KI   V  +P H
Sbjct: 301 NSYEKLTD-ALQDGLITEADLDVALHRLMRARFKLGMFDSDEKV-PYAKIPFSVNNNPKH 358

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           +VLAL+AAQ  IVLLKN + +LPL K+   ++A+IGPNA++ ++L GNY G     +T L
Sbjct: 359 KVLALKAAQKSIVLLKNENAILPLSKNLK-NIAVIGPNADNIQSLWGNYNGMPKNPVTVL 417

Query: 453 QALQNYV 459
           + ++N V
Sbjct: 418 EGIKNKV 424



 Score = 82.0 bits (201), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 50/113 (44%), Positives = 68/113 (60%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           ++KAV  A  +D VVL +GL++  E EE+          DR  L LP +Q EL+  V  A
Sbjct: 589 LEKAVLAANKSDVVVLALGLNERLEGEEMKVEVEGFADGDRTSLNLPKKQVELMKEVV-A 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             KPV+LVLL G  + I +A    NI +I+ AGYPG+ G  A+A V+FGD+NP
Sbjct: 648 TGKPVVLVLLNGSALSINWAS--ENIPAIISAGYPGQEGGNAIANVLFGDYNP 698


>gi|194700280|gb|ACF84224.1| unknown [Zea mays]
          Length = 452

 Score =  331 bits (848), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 161/257 (62%), Positives = 198/257 (77%), Gaps = 2/257 (0%)

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
           MDVNCGS++Q H  +A++Q K+ E +I+RALHNLF+VRMRLGLFNG+P    +G IG D 
Sbjct: 1   MDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQ 60

Query: 387 VCSPAHQVLALQAAQDGIVLLKNS--HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
           VC+  HQ LAL+AAQDGIVLLKN    G LPL K    SLA+IG NAN A  L GNY GP
Sbjct: 61  VCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGP 120

Query: 445 SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
            C ++TPLQ LQ YV++T +  GC++ AC+  +I +AV  A  AD VVL MGLDQ QE+E
Sbjct: 121 PCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTTIPEAVQAASSADSVVLFMGLDQDQERE 180

Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
           E+DR+DL LPG+QQ LI  VA AAKKPVILVLLCGGPVD++FAK +  IG+ILWAGYPGE
Sbjct: 181 EVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPGE 240

Query: 565 AGAVALAEVIFGDHNPG 581
           AG +A+A+V+FG+HNPG
Sbjct: 241 AGGIAIAQVLFGEHNPG 257


>gi|169611757|ref|XP_001799296.1| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
 gi|160702362|gb|EAT83185.2| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
          Length = 755

 Score =  331 bits (848), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 198/539 (36%), Positives = 285/539 (52%), Gaps = 24/539 (4%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
            C  T   ++RA  LV  +  +EK+  L+     + RLG+P Y WW EALHGVAG   GI
Sbjct: 33  ICDVTAAPAERAAALVEAMQTNEKLDNLMR---GVTRLGLPKYNWWGEALHGVAGA-PGI 88

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
            F G  + ATSFP  +L +A+FD  L ++I   IG EARA  N G A  + FW P+IN F
Sbjct: 89  NFTGAYKTATSFPMPLLMSAAFDDDLIFKIANIIGNEARAFGNGGVA-PVDFWTPDINPF 147

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ETPGED +    Y    + G++GD       K + +  A CKH+  YD++ 
Sbjct: 148 RDPRWGRGSETPGEDIVRIKGYTKHLLAGLEGD-------KPQRKIIATCKHYVGYDMEA 200

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           W G  R+ F+A++ MQDLA+ Y PPF+ C +  +    MC+YN VNG+P+CAD  +L   
Sbjct: 201 WGGIDRHSFNAKINMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYVLQTI 260

Query: 285 ARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
            R  W +   + YITSDC+AV  I     YAK+  +       AGMD +C          
Sbjct: 261 LRDHWNWTESNNYITSDCEAVKDISLKHKYAKTNAEGTGLAFTAGMDNSCEYTGSSDIPG 320

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           A  Q  L    IDRAL   +   +R G F+G      +  +G   + +P  Q L+LQ A 
Sbjct: 321 AFNQSYLSIPTIDRALKRQYEGLVRAGYFDG--AAATYANLGVKDINTPEAQQLSLQVAS 378

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           +G+VLLKN    LPL  +    +A++G  AN    L G Y+GP+    +P+ A Q    +
Sbjct: 379 EGLVLLKNDD-TLPLSLTNGSKVAMLGFWANDTSKLSGIYSGPAPYLRSPVWAGQKLGLD 437

Query: 462 TVYYPGCDTVACSSASID----KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
                G      +S++ D     A+  A+ +D+++   GLD +   E  DR  +  P  Q
Sbjct: 438 MAIASGPILQQSNSSTRDNWTTNALAAAEKSDYILYFGGLDPSAAAEGFDRNSIAWPTAQ 497

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
            +LI ++A   K  V+LVL  G  +D +       + S++WA +PG+ G  A+ +V+ G
Sbjct: 498 VDLIKKLAAIGKPLVVLVL--GDLMDNSPLLELDGVNSVIWANWPGQDGGSAVMQVVTG 554


>gi|225873993|ref|YP_002755452.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
 gi|225791521|gb|ACO31611.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
          Length = 894

 Score =  330 bits (845), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 200/504 (39%), Positives = 269/504 (53%), Gaps = 45/504 (8%)

Query: 1   MIFHKLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLV 60
           M  HKL  V   L LC   LL    +     +  PS P+     +   +LP   RARDLV
Sbjct: 1   MRQHKLH-VAAGLALCAGMLLMLPSAFAQSQTQSPSTPA-----YLNPSLPPVVRARDLV 54

Query: 61  SRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVI 120
           SR+TL EK SQLVN+A AIPRL +PAY WWSEALHGVA           + G T FP+ I
Sbjct: 55  SRMTLKEKASQLVNAARAIPRLKVPAYNWWSEALHGVA-----------VNGTTEFPEPI 103

Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRDPRWGRGQ 173
              A+FD    + +   IG E R +Y   +         G+ FWAPN+NIFRDPRWGRGQ
Sbjct: 104 GLGATFDVPAIHEMAVDIGTEGRVVYEENEKDGSSKIFHGLDFWAPNLNIFRDPRWGRGQ 163

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           ET GEDP +TGK  V++V G+QGD           +  A  KHF   D+ +    TR+  
Sbjct: 164 ETYGEDPFLTGKMGVAFVSGMQGDN------PKYYRVIATPKHF---DVHSGPEPTRHFA 214

Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
           D  V++ D  DTY+P F + + QG A  +MC+YN +NG P+CA++  L    R  WGF G
Sbjct: 215 DVDVSLHDQLDTYEPAFRAAIMQGHADSVMCSYNAINGQPACANQFTLQHQLRGAWGFKG 274

Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ-------KHTKAAVKQK 346
           Y+ SDCDAV  IY    Y  +   A    ++ GMD +C  F Q       K    AV+Q 
Sbjct: 275 YVVSDCDAVHDIYSGHKYRPTLAQAAAISMERGMDNDCADFAQPKGDDDYKAYIDAVQQG 334

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            L +  +D AL  LF+ R++LGLF+    M P+       + SPAH+  A + A + +VL
Sbjct: 335 YLSQQAMDTALVRLFTARIKLGLFDPK-GMDPYADTPHSELNSPAHRAYARKLADESMVL 393

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VY 464
           LKN  G LPL      S+A++GP A+    LLGNY G    +++ L+ L+    NT   Y
Sbjct: 394 LKND-GTLPLKPGSVHSIAVVGPLADQTAVLLGNYNGVPTHTVSFLEGLRAEYPNTKITY 452

Query: 465 YPGCDTVACSSASI-DKAVDIAKG 487
            PG   ++ ++  + D A+    G
Sbjct: 453 VPGTQFLSDTANPVPDSALTTPDG 476



 Score = 44.3 bits (103), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 52/99 (52%), Gaps = 13/99 (13%)

Query: 492 VLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGP 541
           + ++G+    E EE+          DR +L +P  ++ L+  VA+  K PV++VL+ G  
Sbjct: 627 IAVVGITSKLEGEEMPVDQPGFLGGDRTNLQMPEPEEALVEAVAKTGK-PVVVVLMNGSA 685

Query: 542 VDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + + +     N  ++L A Y GE G  A+A+ + G ++P
Sbjct: 686 LAVNWISQHAN--AVLEAWYSGEEGGAAIADTLSGKNDP 722


>gi|317474362|ref|ZP_07933636.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909043|gb|EFV30723.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 723

 Score =  330 bits (845), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 209/571 (36%), Positives = 301/571 (52%), Gaps = 66/571 (11%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+   +L  ++RA DLVSRLTL+EKI+ + N++ A+ RLGI  YEWW+EALHGVA  G  
Sbjct: 24  PYQNKSLSPTERAADLVSRLTLEEKITLMQNNSSAVKRLGIKPYEWWNEALHGVARNGL- 82

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIGMTF 156
                    AT +PQ I   ASF+  L Y++  +I  EAR  Y       N  +  G+TF
Sbjct: 83  ---------ATVYPQAIGMGASFNDTLLYQVFTSISDEARVKYRQAREAGNYKRYTGLTF 133

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG---DTFNGGKLKGKLQASAC 213
           W PNINIFRDPRWGRGQET GEDP +T +  +S V G+QG     +N        +  AC
Sbjct: 134 WTPNINIFRDPRWGRGQETYGEDPYLTSRMGLSVVNGLQGPQNTKYN--------KTHAC 185

Query: 214 CKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
            KH+  +    W    R+ F+A  +  +DL +TY P F+  V QG    +MCAYNR  G 
Sbjct: 186 AKHYAVHSGPEW---NRHSFNAENINPRDLWETYLPAFQDLVIQGNVKEVMCAYNRFEGD 242

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYA--KSPEDAVVDVLKAGMDVN 330
           P C    LL    R +W + G + SDC A+   Y    +   K+  DA    + +G D+ 
Sbjct: 243 PCCGSDRLLINILRNEWNYKGLVVSDCGAIDNFYFKGRHETHKNKADASAAAVLSGTDLE 302

Query: 331 CGSFLQKHTK--AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           CG   + +T   +AVK+  + ES ID++L  L   R  LG  +      P+ ++   ++ 
Sbjct: 303 CG---RSYTGLISAVKEGLINESAIDQSLCRLMKARFELGEMD---DTTPWDQLPDSLLS 356

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
             AHQ LALQ A++ + LL+N   +LPL   K +++ALIGPNAN +     NY G    +
Sbjct: 357 CHAHQQLALQMARESMTLLQNHKNILPL--DKEMTVALIGPNANDSVMQWANYNGFPVHT 414

Query: 449 ITPLQALQNYV--ENTVYYPGCDT-------VACSSASIDKAVDIAKGADHVVLMMGLDQ 499
           IT L+ L  Y+  E  +Y P  +        V      I   ++ A  AD ++   G+  
Sbjct: 415 ITLLEGLTQYLPQERLIYIPQKNIEVQKYPWVNYYPNDIQAVINQAAKADVIIYAGGISA 474

Query: 500 TQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKY 549
           + E EE+          DR  + LP  Q++L+ +  +A  KP++ V   G  + +     
Sbjct: 475 SLEGEEMDVDAEGFRGGDRTTIELPNVQRKLV-KALKATGKPIVFVNFSGCAMGL--QPE 531

Query: 550 DRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   +IL A YPG+AG  A+AEV+FGD+NP
Sbjct: 532 SQICDAILQAWYPGQAGGTAIAEVLFGDYNP 562


>gi|397690575|ref|YP_006527829.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
 gi|395812067|gb|AFN74816.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
          Length = 860

 Score =  329 bits (843), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 185/435 (42%), Positives = 256/435 (58%), Gaps = 33/435 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +    LP  +RA DL+ RL+LDEKIS +V+ +PAI RLGIP Y WW+EALHGVA  G+  
Sbjct: 23  YLNVNLPFEERAEDLLQRLSLDEKISLMVHQSPAIERLGIPEYNWWNEALHGVARNGR-- 80

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
                   AT FP  I  AA++D  L YRI   I  EARA YN+       G   G++ W
Sbjct: 81  --------ATVFPMPIGLAATWDRDLIYRIADVISNEARAKYNSALKKNQRGIYQGISLW 132

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           APNINIFRDPRWGRG ET GEDP +TG+ AVS+++G+QG        K  L+  A  KH 
Sbjct: 133 APNINIFRDPRWGRGMETYGEDPYLTGELAVSFIKGLQGQD------KKYLKTIATPKHL 186

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +     R+ F+A V+  DL +TY P F+  + +G+A  +MCAYNR+ G   C  
Sbjct: 187 AVH---SGPEPERHHFNALVSNYDLNETYLPHFKKSIMKGKAYSVMCAYNRLRGKACCGH 243

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQ 336
             LL+   R +WGF G + SDC AV  I+++     SPE A    + +G D+ CG +FL 
Sbjct: 244 DTLLTDILRNKWGFEGIVVSDCWAVYDIFNSHKIVDSPEKAAALAVSSGTDLECGNTFLS 303

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
              K A +   + E EID AL  +   R +LG+F+  P +  + +I    + +  ++ +A
Sbjct: 304 --LKNAYRDGLITEKEIDSALRRVLLARFKLGMFDP-PEIVSYSQIDESYLDNSYNREIA 360

Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           L+AA+  IVLLKN + LLPL  S +  +A+IGPNA++ ++LLGNY G     ITPLQA++
Sbjct: 361 LEAARKSIVLLKNDNKLLPLDSSIN-KIAVIGPNADNLESLLGNYHGFPSEYITPLQAIR 419

Query: 457 NYVEN--TVYYPGCD 469
             ++N    Y  GCD
Sbjct: 420 RVLKNGEVFYEKGCD 434



 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/112 (41%), Positives = 63/112 (56%), Gaps = 13/112 (11%)

Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAA 528
           D+A   A  +D V++ MGL    E E L          DR+ L LP  Q +LI ++  + 
Sbjct: 589 DRAYKTALKSDAVIMFMGLCPRMEGEALKIKLDGFKGGDRLKLSLPANQLKLIKKI-HST 647

Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KPVILVLL GGP+   +     NI +IL A YPG+AG  A+ +VI+G +NP
Sbjct: 648 GKPVILVLLNGGPISTVWES--ENIPAILEAWYPGQAGGRAITDVIWGKYNP 697


>gi|332638085|ref|ZP_08416948.1| glycoside hydrolase family 3 protein [Weissella cibaria KACC 11862]
          Length = 713

 Score =  329 bits (843), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 199/554 (35%), Positives = 305/554 (55%), Gaps = 53/554 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           ++A+ +V ++T+DEKI Q+   APAI RL IP Y +W+EALHGVA  G           A
Sbjct: 12  RQAKVIVDQMTIDEKIGQIKYEAPAIERLNIPEYNYWNEALHGVARAGV----------A 61

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
           T FPQ I  AA+FD  L   I   IG E RA YN            G+TFW+PN+NIFRD
Sbjct: 62  TVFPQAIGLAATFDDQLINDIADVIGTEGRAKYNEFTKHEDRDIYKGLTFWSPNVNIFRD 121

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +T K+ V++++G+QG           L+ +A  KHF  +   +  
Sbjct: 122 PRWGRGHETYGEDPFLTSKFGVAFIKGLQGQA-------KYLKLAATAKHFAVH---SGP 171

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R+ FDA V+ +DL +TY P F++ V++     IM AYN V+G+P+     LL     
Sbjct: 172 EGLRHGFDAVVSDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSEMLLRDILH 231

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
            +W F G++ SD  A   +++   Y K   + +   +KAG+++  G  +++    A+ + 
Sbjct: 232 DKWSFEGHVVSDYMAPEDVHENHKYTKDAAETMGLAIKAGLNLVAGH-IEQSLHEALNRG 290

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            + E EI  A+ +L++ R+RLG+F    T   +  I  +   + AH  L+  AA+   VL
Sbjct: 291 LVTEEEITNAVISLYATRVRLGMF---ATDNEYDAIPYEANDTKAHNNLSEIAAEKSFVL 347

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VY 464
           LKN  G+LPL K    ++A++GPNA+S   LLGNY G   RS T L+ +Q  + +   V+
Sbjct: 348 LKND-GVLPLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGIQERLGDDVRVH 406

Query: 465 YP---------GCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
           Y            + +A +     +A+  A+ +D +V ++GLD T E EE          
Sbjct: 407 YSIGSGVFQDHAAEPLAKADERESEAIIAAEHSDVIVAVLGLDSTIEGEEGDAGNSQGAG 466

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+ +L LPGRQ++L+ R+  A  KPV+++L  G  + +   +   N+ +I+   YPG  G
Sbjct: 467 DKPNLSLPGRQRQLLERLL-AVGKPVVVLLASGSSLQLDGLENHPNLRAIMQIWYPGARG 525

Query: 567 AVALAEVIFGDHNP 580
            +A+A+V+FG  +P
Sbjct: 526 GLAVADVLFGTVSP 539


>gi|125534110|gb|EAY80658.1| hypothetical protein OsI_35835 [Oryza sativa Indica Group]
          Length = 511

 Score =  328 bits (841), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 169/289 (58%), Positives = 212/289 (73%), Gaps = 2/289 (0%)

Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
           Y+ SDCDAV+ I DA  Y  SPED V   +KAGMDVNCG++ Q H  AAV++  L E +I
Sbjct: 16  YVASDCDAVATIRDAHHYTLSPEDTVAVSIKAGMDVNCGNYTQVHAMAAVQKGNLTEKDI 75

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQP-FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
           DRAL NLF+VRMRLG F+G+P     +G +GA  VCSPAH+ LAL+AAQDGIVLLKN  G
Sbjct: 76  DRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAG 135

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV-ENTVYYPGCDTV 471
            LPL  S   SLA+IGPNA++   L GNY GP C + TPLQ ++ Y+ +   +  GCD+ 
Sbjct: 136 ALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSP 195

Query: 472 ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKP 531
           AC+ A+ ++A  +A  +DHVVL MGL Q QE+E LDR  L+LPG QQ LIT VA AA++P
Sbjct: 196 ACAVAATNEAAALASSSDHVVLFMGLSQKQEQEGLDRTSLLLPGEQQGLITAVANAARRP 255

Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           VILVLL GGPVD+TFAK +  IG+IL AGYPG+AG +A+A+V+FGDHNP
Sbjct: 256 VILVLLTGGPVDVTFAKDNPKIGAILLAGYPGQAGGLAIAKVLFGDHNP 304


>gi|361127339|gb|EHK99311.1| putative exo-1,4-beta-xylosidase bxlB [Glarea lozoyensis 74030]
          Length = 569

 Score =  328 bits (840), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 197/542 (36%), Positives = 283/542 (52%), Gaps = 55/542 (10%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C TT P + RA  LV  +   EK+  +++ +  + RLG+P Y WWSEALHGVAG   GI 
Sbjct: 65  CDTTAPPADRAAALVKAMQSSEKLQNIISKSAGVSRLGLPPYNWWSEALHGVAGA-PGIQ 123

Query: 106 FNGT--IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
           F+ +     ATS P  IL AA+FD  L  ++G  IG EARA  N G   G+ FW PNIN 
Sbjct: 124 FSSSSPWNYATSLPMPILMAAAFDDDLIEKVGTLIGTEARAFGN-GNHSGIDFWTPNINP 182

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           F+DPRWGRG ETPGED L    Y  + +RG++G+       K + +  A CKH+ A DL+
Sbjct: 183 FKDPRWGRGSETPGEDTLRLKGYVAALLRGLEGN-------KAQRRIIATCKHYAANDLE 235

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +W G TR+ FDA+++MQDLA+ Y  PF+ C +  +    MC+YN VNG+P+CA++ LL  
Sbjct: 236 SWNGVTRHDFDAKISMQDLAEYYLQPFQQCARDSKVGSFMCSYNSVNGVPACANKYLLQT 295

Query: 284 TARRQWGF---HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
             R  W +   + Y+TSDC+AV  I     YA +          AG D +C         
Sbjct: 296 ILRDHWNWTSENQYVTSDCEAVQDISLNHHYASTNAAGTALAFNAGTDSSC--------- 346

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
                                      G F+G+  +  +  +G   V +P  Q LALQA 
Sbjct: 347 -------------------------EAGYFDGSKAL--YSSLGWSDVNTPQAQQLALQAT 379

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN--Y 458
            DGIV+LKN  G LPL       +A+IG  A+ +  L G Y+G +    TP+ A Q   +
Sbjct: 380 VDGIVMLKND-GTLPLKLDSKSKVAMIGFWASDSSKLQGGYSGKAPYLRTPVYAAQQLGF 438

Query: 459 VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
             N    P   + + +      A+  A  +D+++   GLD +   E +DR  L  P  Q 
Sbjct: 439 TPNVATGPVQQSASATDNWTTNALAAASKSDYILYFGGLDTSAAAEGVDRTSLEWPSAQL 498

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
            LI +++ A  KP+I++   G  +D T    ++ + SILWA +PG+ G  A+ ++I G  
Sbjct: 499 ALIKKLS-ALGKPLIIIQE-GDQMDNTPLLTNKGVSSILWASWPGQDGGPAVMQIISGAK 556

Query: 579 NP 580
           +P
Sbjct: 557 SP 558


>gi|189467437|ref|ZP_03016222.1| hypothetical protein BACINT_03826 [Bacteroides intestinalis DSM
           17393]
 gi|189435701|gb|EDV04686.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 863

 Score =  328 bits (840), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 181/429 (42%), Positives = 244/429 (56%), Gaps = 26/429 (6%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
           T  F    LPI +R  DLV RLTL+EKISQ++N+APAI RLGIPAY WW+E LHGVA   
Sbjct: 23  TCKFLNPELPIVERVNDLVGRLTLEEKISQMLNNAPAIDRLGIPAYNWWNECLHGVAR-- 80

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGM 154
                  +    TSFPQ I  AA++D+   +++      E RA+Y+        G   G+
Sbjct: 81  -------SPYPVTSFPQAIAMAATWDTESVHQMAVYASDEGRAIYHDATRKGTPGIFRGL 133

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           T+W+PNINIFRDPRWGRGQET GEDP +T    VS+V+G+QGD          L++SAC 
Sbjct: 134 TYWSPNINIFRDPRWGRGQETYGEDPFLTASIGVSFVKGLQGDD------PVYLKSSACA 187

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KH+  +    W    R+ +DA+V   DL DTY P F+  V +G+ +G+MCAYN   G P 
Sbjct: 188 KHYAVHSGPEW---NRHTYDAKVNNHDLWDTYLPAFKELVVEGKVTGVMCAYNSFFGQPC 244

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           C +  L+    R  W F GY+TSDC AV   Y+     +    A  D +  G D  CG+ 
Sbjct: 245 CGNDLLMMDILRNHWKFGGYVTSDCGAVEDFYNTHKTHQDAAAASADAVLHGTDCECGNG 304

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             +    AV +  + E +ID +L  LF +R RLG+F+ +  + P+  I   V+   AH+ 
Sbjct: 305 AYRALADAVLRGLITEKQIDESLKKLFEIRFRLGMFDPDDRV-PYSNIPLSVLECDAHKA 363

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
            AL+ A+  IVLLKN   LLPL K+K   +A++GPNA+    LL NY G      T L+ 
Sbjct: 364 HALKIARQSIVLLKNQDQLLPLNKNKIKKIAVVGPNADDKSVLLANYYGYPSHITTALEG 423

Query: 455 LQNYVENTV 463
           +Q  V N V
Sbjct: 424 IQKKVGNQV 432



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 57/105 (54%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD ++ + GL    E EE+          DR  + +P  QQ L+  +  A  KPV+ V
Sbjct: 597 KDADVIIFVGGLSAKVEGEEMGVEIEGFKRGDRTSISIPSVQQNLLKELY-ATGKPVVFV 655

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           ++ G  + + +     ++ +IL A Y G+AG  A+A+V+FGD+NP
Sbjct: 656 MMTGSALGLEWES--AHLPAILNAWYGGQAGGQAIADVLFGDYNP 698


>gi|254786805|ref|YP_003074234.1| glycoside hydrolase family 3 domain-containing protein
           [Teredinibacter turnerae T7901]
 gi|237686035|gb|ACR13299.1| glycoside hydrolase family 3 domain protein [Teredinibacter
           turnerae T7901]
          Length = 888

 Score =  328 bits (840), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 194/477 (40%), Positives = 271/477 (56%), Gaps = 44/477 (9%)

Query: 11  PLLCLCFTSLL---TRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
           P+L L   SLL      D+   P      + + E   +  TTL I  R  DLVSR+ L E
Sbjct: 10  PILGLTLASLLFTGCSPDNNPVPKPVSERSTANEQPAYMDTTLDIDTRVDDLVSRMDLAE 69

Query: 68  KISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFD 127
           KISQ+ N +PAI  LGI  Y+WW+EALHGVA  GK          AT FPQ I  AA +D
Sbjct: 70  KISQMYNESPAIEHLGIAEYDWWNEALHGVARAGK----------ATVFPQAIGMAAMWD 119

Query: 128 SYLWYRIGQAIGLEARALYN-------AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
               + I +A+  EARA ++         +  G+TFW+PNINIFRDPRWGRGQET GEDP
Sbjct: 120 RETMFDIAEAVSDEARAKHHYFVENGVHFRYTGLTFWSPNINIFRDPRWGRGQETYGEDP 179

Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
            +TG+ A+ Y+ G+QG+          L+ +A  KHF  +   +    +R+  +   + +
Sbjct: 180 YLTGELALPYISGLQGEN------PKYLKTAAMAKHFAVH---SGPEKSRHSDNYIASPK 230

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           DL +TY P FE  V +G    +MCAYNRVN  P+C +  LL +T R +WGF G++ SDC 
Sbjct: 231 DLNETYLPAFEKAVVEGDVESVMCAYNRVNDEPACGNDMLLKETLRGKWGFKGHVVSDCG 290

Query: 301 AVSIIYDAEGY--AKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEI 353
           A++  Y  E +    +P  A    +++G D+NCG     +F   H   A++++ + + EI
Sbjct: 291 AIADFYAPEAHHVVMAPAAAAAWAVRSGTDLNCGTDRLSTFANLHF--ALQREMITQDEI 348

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
           D+++  L   R +LG+F+ +  + P+ KI  DVV S AH  L  +AA+   VLLKNS G+
Sbjct: 349 DQSVKRLMKTRFKLGMFDPDDQV-PYSKIPMDVVGSQAHLALTQKAAEKSFVLLKNS-GI 406

Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGC 468
           LPL KS  V  A+IGPNA +   L+GNY G   + +TPL  +Q Y+  EN  Y PG 
Sbjct: 407 LPLKKSSKV--AIIGPNATNPTVLVGNYFGDPIKPVTPLDGIQQYLGEENVFYAPGS 461



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 53/100 (53%), Gaps = 13/100 (13%)

Query: 491 VVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
           ++   G+    E EE+          DR D+ LP  Q++L+  + +   KP++LV   G 
Sbjct: 624 IIFTGGISADLEGEEMSVEIEGFDHGDRTDIRLPEPQRKLLATLKKL-NKPIVLVNFSGS 682

Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + + +A  + N+ +IL   YPGEA   ALA +++G+ +P
Sbjct: 683 AIALNWA--NNNVDAILQGFYPGEATGTALARILWGEVSP 720


>gi|5690010|emb|CAB51937.1| Family 3 Glycoside Hydrolase [Ruminococcus flavefaciens]
          Length = 690

 Score =  327 bits (839), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 205/552 (37%), Positives = 297/552 (53%), Gaps = 59/552 (10%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           +RA D+  RL+ +EK  Q    APA  RLG  AY WWSE LHGVA  G           A
Sbjct: 15  ERAEDITDRLSTEEKAEQQKYDAPAEERLGKDAYNWWSEGLHGVARAGT----------A 64

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
           T FPQ I  AA FD    +R G+    EARA YN   A        G+T W+PN+NIFRD
Sbjct: 65  TMFPQTIGMAAMFDDEAVHRAGETTSREARAKYNEYSAHDDRDIYKGLTLWSPNVNIFRD 124

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRGQET GEDP +T    V+Y +G+QGD    GK+   L+ +AC KHF  +   +  
Sbjct: 125 PRWGRGQETYGEDPYLTSCLGVAYAKGLQGD----GKV---LRTAACAKHFAVH---SGP 174

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
             TR++FDA+  M+D+ +TY   FE+ VK  +   +M AYNRVNG P+CA   +++K   
Sbjct: 175 EATRHEFDAKANMKDMTETYIAAFEALVKDAKVESVMGAYNRVNGEPACASDFVMNKL-- 232

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQ 345
            +WGF G+  SDC A+   +   G  K+  ++    LK G D+NCG ++L  H  AA  +
Sbjct: 233 EEWGFDGHFVSDCWAIRDFHTNHGVTKTAPESAALALKKGCDLNCGNTYL--HLLAAFNE 290

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E ++ R+   L   R+RLG+F+ +     +  +  D+V    H+  +L+ ++  +V
Sbjct: 291 GLINEEDLRRSCIKLMRTRVRLGMFDKST---EYDGLDYDIVACDEHKEFSLRCSERSMV 347

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN-------Y 458
           LLKN +G+LPL  SK  ++ +IGPNA+S   L GNY G +   IT L  ++        Y
Sbjct: 348 LLKN-NGILPLDGSKYKTIGVIGPNADSVPALEGNYNGKADEYITFLSGIREAHDGRVLY 406

Query: 459 VENT-VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE---------LDR 508
            E + +Y   C  +A     + +A  I +       +  LD T E EE          D+
Sbjct: 407 TEGSHLYKDRCMGLALPDDRLSEAEIITRTLRCSGSLCWLDATIEGEEGDTGNEFSSGDK 466

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
            DL LP  Q++L+  V  A  KPVI+V   G  +++     + +  +++ A YPG+ G  
Sbjct: 467 NDLRLPESQRKLVKTVM-AKGKPVIIVTAAGSAINV-----EADCDALIQAWYPGQLGGR 520

Query: 569 ALAEVIFGDHNP 580
           ALA ++FG  +P
Sbjct: 521 ALANILFGKVSP 532


>gi|371776901|ref|ZP_09483223.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 720

 Score =  327 bits (838), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 210/559 (37%), Positives = 302/559 (54%), Gaps = 53/559 (9%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           F    L I  RA+ L+S LTL EKIS L  + P + RL IPAY WW+EALHGVA  G+  
Sbjct: 29  FRDEALDIETRAKALLSELTLKEKISLLGYNNPPVERLQIPAYNWWNEALHGVARAGE-- 86

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTFW 157
                   AT FPQ I  AA+FD+ L YRI  AI  EAR+ YN         Q +G+TFW
Sbjct: 87  --------ATVFPQAIALAATFDTTLVYRIADAISTEARSKYNINRSKGFQNQYLGITFW 138

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            PNINIFRDPRWGRGQET GEDP +T     ++V+G+QG      + + +L+ +A  KHF
Sbjct: 139 TPNINIFRDPRWGRGQETYGEDPFLTASMGKAFVKGLQG-----SEPERRLKTAAGAKHF 193

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +     R+ F+A V  +DL +TY P F++ V+ G  + IMCAYNRVNG P C  
Sbjct: 194 AVH---SGPEADRHHFNAVVDEKDLRETYLPAFKALVENG-VTTIMCAYNRVNGEPCCTG 249

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
           + LL    R +WGF G + +DC A+  I+       +  +     +KAG++++C + LQ+
Sbjct: 250 KTLLQDILRDEWGFKGQVVTDCWALDDIWLRHKTIPTRVEVAAAAVKAGVNLDCANILQE 309

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
             + A++++ L   ++D AL      +++LG ++ +P+  P+   G D V +  H  LA 
Sbjct: 310 DVQDAIEKRLLTLEQVDSALLPTLQTQLKLGFYD-DPSHSPYRHYGIDSVNNSYHISLAK 368

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           +AA+  +VLLKN  G+LPL K    S+ ++G NA S   L GNY G S   +T       
Sbjct: 369 EAAEKSMVLLKND-GILPLKKDTISSIMVVGENAASISALTGNYHGLSGNMVT------- 420

Query: 458 YVENTVYY--PGCDT---VACSSASIDKAVDI--AKGADHVVLMMGLDQTQEKEE----- 505
           +VE  V    PG        CS A       I  A   D  + ++GL    E E      
Sbjct: 421 FVEGLVKAGGPGMSVQYDYGCSFADTSHFGGIWAAGFTDVTIAVIGLSPLLEGEHGDAFL 480

Query: 506 ----LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
                D+ DL +P   +  + ++ E+   PVI V+  G  +DI+    +    +I++A Y
Sbjct: 481 SNWGGDKKDLRMPRSHEIYLKKLRESHNHPVIAVVTGGSALDIS--AIEPYADAIIYAWY 538

Query: 562 PGEAGAVALAEVIFGDHNP 580
           PGE G  ALA++IFG+ +P
Sbjct: 539 PGEQGGTALADLIFGEVSP 557


>gi|410639677|ref|ZP_11350222.1| beta-glucosidase [Glaciecola chathamensis S18K6]
 gi|410140558|dbj|GAC08409.1| beta-glucosidase [Glaciecola chathamensis S18K6]
          Length = 733

 Score =  326 bits (836), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 203/557 (36%), Positives = 297/557 (53%), Gaps = 44/557 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+  T LP  +R   L+  +TL EK SQLVN   AI RLG+P Y++W+EALHGVA  G+ 
Sbjct: 25  PWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARNGR- 83

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTF 156
                    AT FPQ I  AA+FD +L  +    I  EARA +N    IG       +TF
Sbjct: 84  ---------ATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGNRSKYSGLTF 134

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W PNINIFRDPRWGRGQET GEDP +T +   + V G+QGD          L+ +A  KH
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDH------PKYLKTAAAAKH 188

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           F  +   +     R++FDA  + +D+ +TY P FE+ V +     +M AYNRVNG P+  
Sbjct: 189 FAVH---SGPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETVMAAYNRVNGHPAGG 245

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
              LL+   R +WGF G++ SDC  ++  +       +  ++    +  G D+NCG+ + 
Sbjct: 246 SDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESAALAINTGTDLNCGA-VY 304

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
                AV+   + E  ID+ L  + + + +LG F+      P+  I ADVV S AH  +A
Sbjct: 305 NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDD-NPYNNISADVVNSEAHAQVA 363

Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
            + A   IVLL+N + +LPL ++   +L + GP A+S++ LLGNY G S ++   L  + 
Sbjct: 364 YEMAVKSIVLLQNKNNILPLDRNIR-NLYVTGPFASSSEVLLGNYYGLSGKTTNILDGIT 422

Query: 457 NYVE--NTVYYP-GCDTVACSSASIDKAVDIAKG-ADHVVLMMGLDQTQEKEE------- 505
             V    T+ Y  G      +   ID     AK   D ++ +MGL    E EE       
Sbjct: 423 ANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIASP 482

Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
              DR+ L LP  Q   + ++ +   KPVI+VL  G PV++T  +      +I++A YPG
Sbjct: 483 HKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTAGTPVNLT--EIAELADAIVFAWYPG 540

Query: 564 EAGAVALAEVIFGDHNP 580
           + G  A+A+++FG+ +P
Sbjct: 541 QEGGKAVADILFGERSP 557


>gi|332307852|ref|YP_004435703.1| glycoside hydrolase family protein [Glaciecola sp. 4H-3-7+YE-5]
 gi|332175181|gb|AEE24435.1| glycoside hydrolase family 3 domain protein [Glaciecola sp.
           4H-3-7+YE-5]
          Length = 733

 Score =  326 bits (836), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 202/557 (36%), Positives = 297/557 (53%), Gaps = 44/557 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+  T LP  +R   L+  +TL EK SQLVN   AI RLG+P Y++W+EALHGVA  G+ 
Sbjct: 25  PWFDTQLPTQERIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARNGR- 83

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTF 156
                    AT FPQ I  AA+FD +L  +    I  EARA +N    IG       +TF
Sbjct: 84  ---------ATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGNRSKYSGLTF 134

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W PNINIFRDPRWGRGQET GEDP +T +   + V G+QGD          L+ +A  KH
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDH------PKYLKTAAAAKH 188

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           F  +   +     R++FDA  + +D+ +TY P FE+ + +     +M AYNRVNG P+  
Sbjct: 189 FAVH---SGPEALRHEFDAIASPKDMYETYFPAFEALITEANVETVMAAYNRVNGHPAGG 245

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
              LL+   R +WGF G++ SDC  ++  +       +  ++    +  G D+NCG+ + 
Sbjct: 246 SDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESAALAINTGTDLNCGA-VY 304

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
                AV+   + E  ID+ L  + + + +LG F+      P+  I ADVV S AH  +A
Sbjct: 305 NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDD-NPYNNISADVVNSEAHAQVA 363

Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
            + A   IVLL+N + +LPL ++   +L + GP A+S++ LLGNY G S ++   L  + 
Sbjct: 364 YEMAVKSIVLLQNKNNILPLDRNIR-NLYVTGPFASSSEVLLGNYYGLSGKTTNILDGIT 422

Query: 457 NYVE--NTVYYP-GCDTVACSSASIDKAVDIAKG-ADHVVLMMGLDQTQEKEE------- 505
             V    T+ Y  G      +   ID     AK   D ++ +MGL    E EE       
Sbjct: 423 ANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIASP 482

Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
              DR+ L LP  Q   + ++ +   KPVI+VL  G PV++T  +      +I++A YPG
Sbjct: 483 HKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTAGTPVNLT--EIAELADAIVFAWYPG 540

Query: 564 EAGAVALAEVIFGDHNP 580
           + G  A+A+++FG+ +P
Sbjct: 541 QEGGKAVADILFGERSP 557


>gi|410648100|ref|ZP_11358515.1| beta-glucosidase [Glaciecola agarilytica NO2]
 gi|410132388|dbj|GAC06914.1| beta-glucosidase [Glaciecola agarilytica NO2]
          Length = 733

 Score =  326 bits (836), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 203/557 (36%), Positives = 297/557 (53%), Gaps = 44/557 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+  T LP  +R   L+  +TL EK SQLVN   AI RLG+P Y++W+EALHGVA  G+ 
Sbjct: 25  PWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARNGR- 83

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTF 156
                    AT FPQ I  AA+FD +L  +    I  EARA +N    IG       +TF
Sbjct: 84  ---------ATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGNRSKYSGLTF 134

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W PNINIFRDPRWGRGQET GEDP +T +   + V G+QGD          L+ +A  KH
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDH------PKYLKTAAAAKH 188

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           F  +   +     R++FDA  + +D+ +TY P FE+ V +     +M AYNRVNG P+  
Sbjct: 189 FAVH---SGPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETVMAAYNRVNGHPAGG 245

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
              LL+   R +WGF G++ SDC  ++  +       +  ++    +  G D+NCG+ + 
Sbjct: 246 SDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESAALAINTGTDLNCGA-VY 304

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
                AV+   + E  ID+ L  + + + +LG F+      P+  I ADVV S AH  +A
Sbjct: 305 NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDD-NPYNNISADVVNSEAHAQVA 363

Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
            + A   IVLL+N + +LPL ++   +L + GP A+S++ LLGNY G S ++   L  + 
Sbjct: 364 YEMAVKSIVLLQNKNNILPLDRNIR-NLYVTGPFASSSEVLLGNYYGLSGKTTNILDGIT 422

Query: 457 NYVE--NTVYYP-GCDTVACSSASIDKAVDIAKG-ADHVVLMMGLDQTQEKEE------- 505
             V    T+ Y  G      +   ID     AK   D ++ +MGL    E EE       
Sbjct: 423 ANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIASP 482

Query: 506 --LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
              DR+ L LP  Q   + ++ +   KPVI+VL  G PV++T  +      +I++A YPG
Sbjct: 483 HKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTAGTPVNLT--EIAELADAIVFAWYPG 540

Query: 564 EAGAVALAEVIFGDHNP 580
           + G  A+A+++FG+ +P
Sbjct: 541 QEGGKAVADILFGERSP 557


>gi|167537541|ref|XP_001750439.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771117|gb|EDQ84789.1| predicted protein [Monosiga brevicollis MX1]
          Length = 834

 Score =  326 bits (835), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 205/550 (37%), Positives = 296/550 (53%), Gaps = 21/550 (3%)

Query: 35  PSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL-VNSAPAIPRLGIPAYEWWSEA 93
           P   ++ ++PFC T L +  R +DLVSR++  +  +QL    +  I  +G+PAY W + A
Sbjct: 99  PCMGASSSYPFCDTKLSVDDRLKDLVSRVSTADAATQLRARESAQIDNIGLPAYYWGTNA 158

Query: 94  LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG 153
           +HG+         +G  +  TSFP     +A+F+  L   +G+ IG E RA YN     G
Sbjct: 159 IHGMQNTA--CLADG--QCPTSFPAPNGLSATFNYSLVKDMGRIIGRELRAYYNTKFHNG 214

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +  W+P IN  RDPRWGR  E+PGE P V G+Y  +Y  G+Q      G  K   QA   
Sbjct: 215 LDTWSPTINPSRDPRWGRNVESPGESPFVCGQYGAAYTEGLQ-----NGDDKDYTQAVVT 269

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+ AY ++++   TRY+++A V+  DL DTY P +E  VK  +  G+MC+YN +NG+P
Sbjct: 270 LKHWVAYSVEDYDNVTRYEYNAIVSEYDLMDTYFPGWEYVVKNAKPLGVMCSYNSLNGVP 329

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
           +C +   L+   R  WGF GYITSD D++  I+    Y  +   A  D L  G D++ G 
Sbjct: 330 TCGNP-ALTAYLREDWGFEGYITSDSDSIHCIWADHHYESNAVLATRDGLLGGCDIDSGD 388

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
               + +AAV Q  +  S +D AL N + +R  LGLF+ N T   + +I AD V   + Q
Sbjct: 389 TYADNLEAAVNQSLVNRSAVDAALTNSYRMRFNLGLFDPNVT-NAYDRISADEVGMSSSQ 447

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS--ITP 451
             +L AA+  + LLKN    LP    K V  A+IG ++NSA+ +LGNY GP C S     
Sbjct: 448 ETSLLAARKSMTLLKNDGQTLPFATGKKV--AVIGKSSNSAEDILGNYVGPICPSGAFDC 505

Query: 452 LQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
           +Q L   V       G  T++   A I+ A+ +A  AD VVL +  +  Q  E  DR  +
Sbjct: 506 VQTLYQGVA-AANQGGATTLSDDVADINTAIQLAMDADQVVLTIS-NYGQAGEGKDRTYI 563

Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
            L   QQEL+  V +   KP  +V+L GG + + + K +    +IL A  PG  G  A+A
Sbjct: 564 GLDTDQQELVAAVLKVG-KPTAIVMLNGGLISLDWIKDEAQ--AILVAFAPGVHGGQAVA 620

Query: 572 EVIFGDHNPG 581
           E IFG +NPG
Sbjct: 621 ETIFGANNPG 630


>gi|197106390|ref|YP_002131767.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
 gi|196479810|gb|ACG79338.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
          Length = 888

 Score =  325 bits (834), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 187/462 (40%), Positives = 262/462 (56%), Gaps = 40/462 (8%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T LP  +RA DLV+R+TL+EK  Q+ ++APAIPRLG+PAY WW+E LHGVA  G   
Sbjct: 38  YRDTRLPAERRAADLVARMTLEEKSRQIGHTAPAIPRLGVPAYNWWNEGLHGVARAGI-- 95

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY--------NAGQAIGMTF 156
                   AT FPQ I  AA++D          IG E RA Y        +     G+T 
Sbjct: 96  --------ATVFPQAIGMAATWDVDRMRGTADVIGTEFRAKYAERVHPDGSTDWYRGLTV 147

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W+PNINIFRDPRWGRGQET GEDP +TG+  V+++RG+QG   N        +  A  KH
Sbjct: 148 WSPNINIFRDPRWGRGQETYGEDPYLTGRMGVAFIRGLQGQDPN------FFKTIATAKH 201

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           +  +       + R++ D   +  DL DTY P F + V +G+   +MCAYN V+G+P+CA
Sbjct: 202 YAVHSGPE---SNRHREDVHPSAYDLEDTYLPAFRAAVTEGKVQAVMCAYNAVDGVPACA 258

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
             +L+ +  RR WGF G++ SDC A + IY  D+  Y K+PE+ +   L AGMD+ CG +
Sbjct: 259 SEDLMDQRLRRDWGFSGHVVSDCGAAANIYREDSLAYVKTPEEGITRALNAGMDLVCGDY 318

Query: 335 L------QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
                   + T +AV++  L E+ +D AL  LF+ R+RLGLF+  P   PF KI A    
Sbjct: 319 RADWNTEAEATVSAVRKGMLDETVLDGALVRLFADRIRLGLFD-PPAEVPFSKITAAQND 377

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           +P H+ ++L+ A+  + LLKN  G+LPL K +   +A++GPNA+S   L+GNY G     
Sbjct: 378 TPEHRAMSLEMAKASMTLLKND-GVLPL-KGEPRRIAVVGPNADSVDALIGNYYGTPSNP 435

Query: 449 ITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGA 488
           +T L  ++        VY  G   V  +S  +  AV  A  A
Sbjct: 436 VTVLAGIRARFPKAEVVYAEGTGLVGPASLPVPDAVLCADAA 477



 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 58/102 (56%), Gaps = 13/102 (12%)

Query: 489 DHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
           D VV + GL    E EE+          DR  L LP  QQ+L+ R+  A  KPV+LVL+ 
Sbjct: 613 DLVVFVGGLTARVEGEEMKLQVPGFAGGDRTSLDLPAPQQDLLRRL-HATGKPVVLVLMN 671

Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           G  + + +A  D N+ +I+ A YPG  G  A+A+++ GD++P
Sbjct: 672 GSALSVNWA--DANLPAIVEAWYPGGEGGHAVAQLLAGDYSP 711


>gi|359450637|ref|ZP_09240068.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
 gi|358043611|dbj|GAA76317.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
          Length = 468

 Score =  325 bits (834), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 180/437 (41%), Positives = 250/437 (57%), Gaps = 36/437 (8%)

Query: 33  CDPSNPSTETFP-FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWS 91
           C     ST+T P +   +  I +R  DLV+RLTL+EK++QL + +PAI RL +P Y WW+
Sbjct: 21  CAKDEQSTQTPPLYLNKSASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNMPEYNWWN 80

Query: 92  EALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA--- 148
           EALHGVA  GK          AT FPQ I  AA+FD  L  R+G AI  E RA ++A   
Sbjct: 81  EALHGVARAGK----------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLE 130

Query: 149 ----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
                   G+T+W+PNINIFRDPRWGRGQET GEDP +T + AV+++ G+QGD       
Sbjct: 131 ENNRSMYTGLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDN------ 184

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
              L++ A  KH+  +   +    +R+  D   + +DLA+TY P F+  + Q + + +MC
Sbjct: 185 AEYLKSVATLKHYAVH---SGPEVSRHSDDYTASEKDLAETYLPAFKDVIAQTKVASVMC 241

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY--AKSPEDAVVDV 322
           AYN VNG P+C +  L+    R ++ F GYI SDC A++  YD + +    +   A    
Sbjct: 242 AYNSVNGTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTGAKAAAMA 301

Query: 323 LKAGMDVNCGSF---LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
           LK G D+NCG        +   AVK+  + E ++D+AL  L   R +LG+F+ NP   P+
Sbjct: 302 LKTGTDLNCGDHHGNTYSYLTQAVKEGLVEEKDVDKALKRLMYARFKLGMFD-NPENVPY 360

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
                DVV S  H  L  +AAQ  +VLLKN   +LPL  ++ +  ALIGPNA++   LLG
Sbjct: 361 SDTSIDVVGSNKHLALTQEAAQKSLVLLKNEQ-VLPLKGNEKI--ALIGPNADNEAILLG 417

Query: 440 NYAGPSCRSITPLQALQ 456
           NY G     ITP  AL+
Sbjct: 418 NYNGMPIVPITPKLALE 434


>gi|322437617|ref|YP_004219707.1| glycoside hydrolase family protein [Granulicella tundricola
           MP5ACTX9]
 gi|321165510|gb|ADW71213.1| glycoside hydrolase family 3 domain protein [Granulicella
           tundricola MP5ACTX9]
          Length = 892

 Score =  325 bits (832), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 193/445 (43%), Positives = 257/445 (57%), Gaps = 40/445 (8%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           S+   +  P+    L   QR  DLVSR+TL+EK+SQ +NSAPAI RL +P Y++WSE LH
Sbjct: 25  SHQDAKPLPYMDPALTTQQRVDDLVSRMTLEEKVSQTINSAPAISRLNVPEYDYWSEGLH 84

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--- 152
           G+A  G           AT FPQ I  AA++D+ L  +IG  I +EARA +N  +AI   
Sbjct: 85  GIARSGY----------ATMFPQAIGMAATWDAPLLQQIGDVISIEARAKFN--EAIRHN 132

Query: 153 ------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
                 G+T W+PNINIFRDPRWGRGQET GEDP +TG+  V++V+G+QG   N      
Sbjct: 133 IHSIYYGLTIWSPNINIFRDPRWGRGQETYGEDPFLTGRLGVAFVKGIQGPDPN------ 186

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
             +A A  KHF  +   +   +TR+  +   T  DL DTY P F + + +  A  IMCAY
Sbjct: 187 YFRAIATPKHFAVH---SGPESTRHSANIEPTPHDLHDTYLPAFRATITEAHADSIMCAY 243

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSP--EDAVVDV 322
           N V G P+CA + LL  T RR WGF G++TSDC A+   Y  D   +  SP  E A    
Sbjct: 244 NAVEGSPACASKLLLQDTLRRDWGFKGFVTSDCGAIDDFYATDYPSHHTSPDKEAAAAAG 303

Query: 323 LKAGMDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
           +KAG D NCG ++L     +AVK+  + E+EID AL +LF+ R +LGLF+    +  F  
Sbjct: 304 IKAGTDSNCGQTYLT--LGSAVKKGLVTEAEIDTALKHLFTARFQLGLFDPAAKVA-FNA 360

Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
           I    V SPAHQ LAL+AA++ IVLLKN    LP  K    ++A+IGP+A +   L GNY
Sbjct: 361 IPFSEVNSPAHQALALKAAEESIVLLKNDAHTLPF-KPSVRTIAVIGPSAATLNNLEGNY 419

Query: 442 AGPSCRSITPLQA-LQNYVENTVYY 465
                  + PL   L  +  + V Y
Sbjct: 420 NAIPLHPVLPLDGILTQFKSSKVLY 444



 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 55/98 (56%), Gaps = 13/98 (13%)

Query: 494 MMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
            +GL    E EE+          DR D+ LP  QQ+++  VA A  KP+++VLL G  + 
Sbjct: 629 FLGLSPELEGEEMPIHIEGFAGGDRTDIKLPAAQQQMLEAVA-ATGKPLVVVLLNGSALA 687

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           + +A  + +  +IL A YPG+AG  A+AE + G +NP 
Sbjct: 688 VNWA--NDHAAAILEAWYPGQAGGTAIAETLAGKNNPA 723


>gi|323451996|gb|EGB07871.1| hypothetical protein AURANDRAFT_71699 [Aureococcus anophagefferens]
          Length = 1202

 Score =  325 bits (832), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 230/604 (38%), Positives = 314/604 (51%), Gaps = 71/604 (11%)

Query: 29  PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYE 88
           PP   D  +PS   +P+C   LPI  R  DL +R T++E ISQ+   A A+PRLG+PA  
Sbjct: 327 PPKLGDVESPSAAAYPYCDRALPIRARVADLAARFTVNETISQMGTMAAAVPRLGLPALN 386

Query: 89  WWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           +  EALHGV             R  T FP      ASFD  LW  +G A GLEARAL+  
Sbjct: 387 YGGEALHGVWST------CAAGRCPTQFPAPHAMGASFDRDLWRAVGAASGLEARALFRW 440

Query: 149 GQ-------------AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ 195
            Q              +G+TF+APN+N+ RDPRWGR +E P EDPL+ G Y   +VRG Q
Sbjct: 441 NQRHNASDCARSLEGCLGLTFYAPNVNLARDPRWGRIEEVPSEDPLLNGVYGAEFVRGFQ 500

Query: 196 GDTFNGGKLKGKLQASACCKHFTAYDLD---------NWKGTT-------RYKFDARVTM 239
           GD    G  +    A+A  KHF  Y+L+         +W G+        R+ FDARV+ 
Sbjct: 501 GD----GAYR---VANAVVKHFAVYNLEVDVEDTPPADWCGSAACAPPNDRHSFDARVSP 553

Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
           +D  +TY  PF + V    A+  MC+YN VNG P+C D  LL    R    F G + +DC
Sbjct: 554 RDFEETYVGPFVAPVAA-GAAAAMCSYNAVNGEPACTDGALLRGALRGALNFTGVLATDC 612

Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHN 359
            A+           +  +A    + AG+D NCG  L      A+    +    +   L  
Sbjct: 613 GALEDAVARHKRYATEAEAAAAAIAAGVDSNCGKVLTSALPEALAAGLVRPDALRPPLER 672

Query: 360 LFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKS 419
           L   R+RLGL +      P  +   D V SPAH+ LAL+AA++G+VLL+N + +LPL   
Sbjct: 673 LLEARLRLGLLDDWDADAPVPRPDVDAVDSPAHRALALRAAREGLVLLQNPNQILPLDGR 732

Query: 420 KSVSLALIGPNANSAKTLLGNYAG--PSCRSITPLQALQNYVE--NTVYYPGCDTVACSS 475
              +LA+IGPNAN++  LL  Y G  P     +PLQ L+        VY  GC+    ++
Sbjct: 733 G--TLAVIGPNANASMNLLSGYHGTPPPDLLRSPLQELEARWRGGKVVYAVGCNASGAAT 790

Query: 476 ASIDKAVDIAKGADHVVLMMGL------------DQT----QEKEELDRVDLVLPGRQQE 519
           A++D+AVD+AK AD VVL +GL            D T     E E +DR  L LPG Q+ 
Sbjct: 791 AALDEAVDLAKTADVVVLGLGLCGDNYGGGPPKEDATCFSIDEAESVDRTSLKLPGAQEA 850

Query: 520 LITRVAEAAKKPVILV--LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
           L +++  A  KPV +   L+  G VD +FAK   +  ++L AGY GE G VA+A+ + G 
Sbjct: 851 LFSKI-WALGKPVAVAVFLVSAGAVDASFAK---DKAALLLAGYGGEFGGVAVADALLGA 906

Query: 578 HNPG 581
           +NPG
Sbjct: 907 YNPG 910


>gi|121700633|ref|XP_001268581.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
 gi|119396724|gb|EAW07155.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
          Length = 743

 Score =  325 bits (832), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 189/558 (33%), Positives = 291/558 (52%), Gaps = 71/558 (12%)

Query: 33  CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
           CD + P ++T   C T      RA  L+S  TL+E ++   N++P +PRLG+P Y+ W+E
Sbjct: 52  CD-NGPLSKTI-VCDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNE 109

Query: 93  ALHGVAGVGKGIFFN-GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
           ALHG+    +  F + G    +TSFP  ILT ++ +  L  ++   I  + RA  NAG+ 
Sbjct: 110 ALHGL---DRAYFTDEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGR- 165

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPL-VTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
            G+  ++PNIN FR P WGRGQETPGED   ++  YA  Y+ G+QG     G     L+ 
Sbjct: 166 YGLDVYSPNINSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQG-----GVDPKSLKL 220

Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
            A  KH+  YD++NW G +R   D  +T QDL++ Y P F    +  +   +MC+YN VN
Sbjct: 221 VATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNAVN 280

Query: 271 GIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           G+PSCA+   L    R  +GF   GYI+SDCD+   +++   YA +   A  D ++AG D
Sbjct: 281 GVPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSSAAADSIRAGTD 340

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           ++CG+  Q +   AV Q  L  ++I+R +  L+S  MRLG F+  P M            
Sbjct: 341 IDCGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDVGPWM------------ 388

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
                                           +VS  L G           NY GP+   
Sbjct: 389 --------------------------------NVSTQLQG-----------NYFGPAPYL 405

Query: 449 ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           I+PL A ++   +  Y  G +  + S+    KA+  AK +D ++   G+D + E E LDR
Sbjct: 406 ISPLDAFRDSHLDVNYAFGTNISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLDR 465

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
           +++  PG+Q ELI ++++   KP+I++ + GG VD +  K ++N+ S++W GYPG++G  
Sbjct: 466 MNITWPGKQLELIDQLSQLG-KPLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQ 524

Query: 569 ALAEVIFGDHNPGEFTVI 586
           AL ++I G   P    V+
Sbjct: 525 ALLDIITGKRAPAGRLVV 542


>gi|334187562|ref|NP_196532.2| Glycosyl hydrolase family protein [Arabidopsis thaliana]
 gi|332004052|gb|AED91435.1| Glycosyl hydrolase family protein [Arabidopsis thaliana]
          Length = 526

 Score =  324 bits (831), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 155/287 (54%), Positives = 207/287 (72%)

Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
           YI SDCD++ I+Y ++ Y K+PE+A    + AG+D+NCGSFL  HT+ AVK+  + E+ I
Sbjct: 45  YIVSDCDSLGILYGSQHYTKTPEEAAAKSILAGLDLNCGSFLGNHTENAVKKGLIDEAAI 104

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
           ++A+ N F+  MRLG F+GNP  QP+G +G   VC+  ++ LA++ A+ GIVLLKNS G 
Sbjct: 105 NKAISNNFATLMRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGS 164

Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVAC 473
           LPL  S   +LA+IGPNAN  KT++GNY G +C+  TPLQ L+  V  T Y+ GC  V C
Sbjct: 165 LPLSPSAIKTLAVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTC 224

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
           + A +D A  +A  AD  VL+MG DQT EKE LDR+DL LPG+QQEL+T+VA+AA+ PV+
Sbjct: 225 TEADLDSAKTLAASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVV 284

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           LV++ GG  DITFAK D  I SI+W GYPGEAG +A+A+VIFG HNP
Sbjct: 285 LVIMSGGGFDITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNP 331


>gi|392537607|ref|ZP_10284744.1| Beta-glucosidase [Pseudoalteromonas marina mano4]
          Length = 870

 Score =  324 bits (831), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 180/437 (41%), Positives = 250/437 (57%), Gaps = 36/437 (8%)

Query: 33  CDPSNPSTETFP-FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWS 91
           CD    S +  P +   +  I +R  DLV+RLTL+EK++QL + +PAI RL IP Y WW+
Sbjct: 21  CDNDEQSMQPPPLYLNESASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNIPEYNWWN 80

Query: 92  EALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
           EALHGVA  GK          AT FPQ I  AA+FD  L  R+G AI  E RA ++A  A
Sbjct: 81  EALHGVARAGK----------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLA 130

Query: 152 -------IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
                   G+T+W+PNINIFRDPRWGRGQET GEDP +T + AV+++ G+QGD       
Sbjct: 131 ENNRSMYTGLTYWSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDN------ 184

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
              L++ A  KH+  +   +    +R+  D   + +DLA+TY P F+  + Q + + +MC
Sbjct: 185 TEYLKSVATLKHYAVH---SGPEVSRHSDDYTASKKDLAETYLPAFKDVIAQTKVASVMC 241

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY--AKSPEDAVVDV 322
           AYN VNG P+C +  L+    R ++ F GYI SDC A++  YD + +    +   A    
Sbjct: 242 AYNSVNGTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTEAKAAAMA 301

Query: 323 LKAGMDVNCGSF---LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
           LK G D+NCG        +   AVK+  + E ++D+AL  L   R +LG+F+ NP   P+
Sbjct: 302 LKTGTDLNCGDHHGNTYSYLSQAVKEGLVEEKDVDKALKRLMYARFKLGMFD-NPENVPY 360

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
                D+V S  H  L  +AA+  +VLLKN   +LPL  ++ V  ALIGPNA++   LLG
Sbjct: 361 SDTSIDIVGSNKHLALTQEAAKKSLVLLKNEQ-VLPLKGNEKV--ALIGPNADNEAILLG 417

Query: 440 NYAGPSCRSITPLQALQ 456
           NY G     ITP  AL+
Sbjct: 418 NYNGMPIVPITPKLALE 434



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/175 (25%), Positives = 80/175 (45%), Gaps = 19/175 (10%)

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSS-- 475
           KS +V   L G + N A TL  N        +       ++  + V  P        +  
Sbjct: 538 KSNNVDFTLNGQSVNGAITLEKN----QSYQLNAESKFNHFWHSNVINPTASLTWLKNPQ 593

Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVA 525
           +   +A++ A  AD +V + G+    E EE+          DR ++ LP  Q  L+ ++ 
Sbjct: 594 SLTQQALNNANEADVIVFVGGISANLEGEEMPLQIDGFSHGDRTNINLPKSQLNLLKKLK 653

Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  K P++LV + G  + + +   + NI +I+   YPGEA   AL  +++G+++P
Sbjct: 654 QTGK-PIVLVNMSGSAMALNWE--NENIDAIIQGFYPGEAAGSALVSLLYGEYSP 705


>gi|409385818|ref|ZP_11238358.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
 gi|399206850|emb|CCK19273.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
          Length = 695

 Score =  324 bits (830), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 201/554 (36%), Positives = 302/554 (54%), Gaps = 53/554 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           + A  +VS++TL EKISQ+   A AI RL IP Y +W+E LHGVA  G           A
Sbjct: 10  EEAIKIVSQMTLAEKISQIDFDASAIERLNIPHYNYWNEGLHGVARAGV----------A 59

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFRD 166
           T FPQ I  AA+FD+ L   I + I +E RA YNA           G+TFW+PNIN+FRD
Sbjct: 60  TVFPQAIGLAATFDTELVKHIAEVISIEGRAKYNAYTKHGDRDIYKGLTFWSPNINLFRD 119

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRGQET GEDP +T +  V++++G+QG+    GK    L+ +AC KHF  +   +  
Sbjct: 120 PRWGRGQETYGEDPFLTAQIGVAFIKGLQGE----GKY---LRLAACTKHFAVH---SGP 169

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
              R+ FDA V  +DL + Y P F++ +++      M AYN +NG P+C +  L++KT  
Sbjct: 170 EADRHYFDAVVNPKDLNEFYLPQFKAAIEEADVESFMGAYNAINGQPACVNEELIAKTLL 229

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
            +WGF G++ SD  A+  +++   Y ++  + +   +K G ++ C   +      AV + 
Sbjct: 230 GKWGFEGHVVSDYAALEDVHENHHYTQTAAETMALAMKIGTNL-CAGKISDALFEAVGKG 288

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            + E+EI  ++  L++  +RLG+F  +     +  I  +V  S  H++L+L+AA+  +VL
Sbjct: 289 LVTETEITASVVKLYTTHVRLGMFAED---NDYDTIPYEVNASAEHEMLSLKAAEKSMVL 345

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN---TV 463
           LKN +  LPL +S+  S+A+IGP A +   L GNYAG +    T +  +Q  + N     
Sbjct: 346 LKNDN-FLPLSQSEIKSVAVIGPTARNIGALEGNYAGTANHYETFVSGIQQALSNQARVT 404

Query: 464 YYPGCDTVACSSASI--------DKAVDIAKGADHVVLMMGLDQTQEKEE---------L 506
           Y  GC   A  + S          +A+  A+ AD  VL +GLD T E E+          
Sbjct: 405 YALGCHLYADHAESSLSRANERESEAIIAAEHADIAVLCVGLDPTIEGEQGDAGNVYGSG 464

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+  L LPG+Q+ LI +V E   K VILVL  G  + +   +    + +I+ A YPG  G
Sbjct: 465 DKPSLSLPGQQKRLIEKVLETG-KTVILVLTSGSALSLEGLEKHTGVKAIIQAWYPGAHG 523

Query: 567 AVALAEVIFGDHNP 580
             ALA ++ G  +P
Sbjct: 524 GTALANILLGKVSP 537


>gi|380509734|ref|ZP_09853141.1| beta-glucosidase-related glycosidase [Xanthomonas sacchari NCPPB
           4393]
          Length = 883

 Score =  324 bits (830), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 192/439 (43%), Positives = 263/439 (59%), Gaps = 36/439 (8%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+  T+     RA  LV+++TL+EK +Q+ N+APAI RLG+PAY+WW+EALHGVA  G+ 
Sbjct: 23  PWQDTSASFEARAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEALHGVARAGQ- 81

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
                    AT FPQ I  AA+FD  L  ++   I  EARA ++        G+  G+TF
Sbjct: 82  ---------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLREGAHGRYQGLTF 132

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QGD      +  KL A+A  KH
Sbjct: 133 WSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVQGLQGDD----PVYRKLDATA--KH 186

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           F  +   +     R+ FDAR + +DL DTY P FE+ VK+G+   +M AYNRV G  + A
Sbjct: 187 FAVH---SGPEADRHHFDARPSKRDLYDTYLPAFEALVKEGKVDAVMGAYNRVYGESASA 243

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
            + LL    RR WGF GY+ SDC A+  I+     A S E A    +K G ++ CG   Q
Sbjct: 244 SQFLLRDVLRRDWGFTGYVVSDCWAIVDIWKHHHLAPSREAAAALAVKNGTELECG---Q 300

Query: 337 KHTK--AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
           ++    AAV+Q  + E+EID A+  LF+ RMRLG+F+  P    + +I A V   PAH  
Sbjct: 301 EYATLPAAVRQGLIGEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQVPAHDA 359

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LALQAAQ+ +VLLKN  G+LPL ++    +A++GP A+    LLGNY G     +T LQ 
Sbjct: 360 LALQAAQESLVLLKND-GVLPLSRTLK-RIAVVGPTADDTMALLGNYFGTPAAPVTILQG 417

Query: 455 LQNYVE--NTVYYPGCDTV 471
           +++  +     Y  G D V
Sbjct: 418 IRDAAKGIEVRYARGVDLV 436



 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 40/113 (35%), Positives = 63/113 (55%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
            D+A+D A+ AD VV + GL    E EE+          DR DL LP  Q+ L+  +  A
Sbjct: 605 FDEALDAARNADVVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDLRLPAPQRALLEAL-HA 663

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             KPV++VL  G  + + +A+   ++ +IL + YPG+ G  A+ + +FG+ NP
Sbjct: 664 TGKPVVMVLTGGSALAVDWAQA--HLPAILMSWYPGQRGGTAVGQALFGEVNP 714


>gi|320105647|ref|YP_004181237.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
 gi|319924168|gb|ADV81243.1| glycoside hydrolase family 3 domain protein [Terriglobus saanensis
           SP1PR4]
          Length = 885

 Score =  323 bits (829), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 199/467 (42%), Positives = 268/467 (57%), Gaps = 51/467 (10%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   TL    RARDLV R+TL+EK +Q++N+APAI RLG+PAY++WSE LHGVA  G   
Sbjct: 30  YLDPTLSPPARARDLVHRMTLEEKTAQMINTAPAIDRLGVPAYDFWSEGLHGVARSGY-- 87

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
                   AT FPQ I  AA++D  L + IG  +  EARA YN            G+T W
Sbjct: 88  --------ATLFPQAIGMAATWDEPLMHEIGTVVSTEARAKYNDAVQHGVHSIYFGLTIW 139

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           +PNINIFRDPRWGRGQET GEDP +T +   ++VRG+QGD  N        +  A  KHF
Sbjct: 140 SPNINIFRDPRWGRGQETYGEDPFLTARMGTAFVRGIQGDDPN------YFRTIATPKHF 193

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +   +TR+ F+  V+  DL DTY P F S + +G+A  IMCAYNR++G P+CA 
Sbjct: 194 AVH---SGPESTRHTFNVDVSQHDLWDTYLPAFRSTIIEGKADSIMCAYNRIDGQPACAS 250

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG--YAKSPEDAVVDVLKAGMDVNCGSFL 335
             LL +  R  WGF G++TSDC A+   Y   G  ++K  EDA    +KAG D  CG   
Sbjct: 251 DLLLKQILRGDWGFRGFVTSDCGAIDDFYTKIGHHFSKEKEDASAAGVKAGTDTACGKTY 310

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
              T +AVK   + E E+D +L  LF  R+RLGLF+ +P   P+ ++    V SPAH+ L
Sbjct: 311 LGLT-SAVKSGLITEHEMDISLERLFEARIRLGLFD-DPARMPYARLTMAEVNSPAHRAL 368

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           AL+AA++ IVLLKN++ LLPL   K++  A+IGPNA S   L GNY   +     P+  +
Sbjct: 369 ALRAARESIVLLKNANNLLPLHGVKNI--AVIGPNAASLDALEGNYNAIARDPAMPVDGI 426

Query: 456 QNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVV-LMMGLDQTQ 501
                    +PG   V             A+GA +V  LM+ + +TQ
Sbjct: 427 ------AAAFPGAKVV------------YAQGAPYVEGLMLPIPRTQ 455



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 55/102 (53%), Gaps = 15/102 (14%)

Query: 489 DHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
           D VV  +GL    E EE+          DR D+ LP  Q EL+ R  +A  KP+I+VL+ 
Sbjct: 620 DVVVAFVGLSPELEGEEMPIKVKGFAGGDRTDIELPQTQLELL-RAVKATGKPLIVVLMN 678

Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           G  +    A  D    ++L A YPGEAGA A+AE + G +NP
Sbjct: 679 GSAI----ALKDSETDALLEAWYPGEAGAQAIAETLAGKNNP 716


>gi|154493680|ref|ZP_02033000.1| hypothetical protein PARMER_03021 [Parabacteroides merdae ATCC
           43184]
 gi|423723902|ref|ZP_17698051.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
           CL09T00C40]
 gi|154086890|gb|EDN85935.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Parabacteroides merdae ATCC 43184]
 gi|409240709|gb|EKN33484.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
           CL09T00C40]
          Length = 868

 Score =  323 bits (827), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 181/449 (40%), Positives = 250/449 (55%), Gaps = 41/449 (9%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           E +PF    LPI +R  DL+ RLT +EKI Q++N+ PAI RLGIP Y+WW+EALHGVA  
Sbjct: 24  EDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARA 83

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
           GK          AT FPQ I  AA+FD    Y     +  EARA Y+  Q         G
Sbjct: 84  GK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKG 133

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PNINIFRDPRWGRG ET GEDP +T +  V+ V+G+QGD           +  AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDD------PKYFKTHAC 187

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  +    W    R++FD  VT +DL  TY P FE+ VK+G    +MCAYNR  G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNRYQGKP 244

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--------EDAVVDVLKA 325
            C+   LL    R  WG+   I SDC A++  +  +   ++P        E A  D +  
Sbjct: 245 CCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRD--ERTPRHETHPDAESASADAVLN 302

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           G D+ CG+  +   K A+K+ K+ E+++D +L  L   R  LG+F+ +  + P+ +I  +
Sbjct: 303 GTDLECGNSYKALIK-ALKEGKISENDLDVSLRRLLKGRFELGMFDPDERV-PYAQIPYN 360

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
           VV SP H   AL+ A   +VLLKN +  LPL K+    +A++GPNA  +  L  NY G  
Sbjct: 361 VVESPEHVAQALEMAHKSMVLLKNKNNTLPLSKTIR-KIAVVGPNAADSTMLWANYNGFP 419

Query: 446 CRSITPLQALQNYVENT--VYYPGCDTVA 472
             ++T L+ ++N V +T  +Y  GC+  A
Sbjct: 420 THTVTILEGIRNKVPDTEVIYELGCNHAA 448



 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD +V + G+    E EE+          DR ++ +P  QQE++ +  +A  KPV+ V
Sbjct: 603 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMV-KALKATGKPVVYV 661

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           L  G  + + +   D NI +IL A Y G+    A+A+++FGD+NP
Sbjct: 662 LCTGSALALNWE--DANIDAILNAWYGGQEAGTAVADILFGDYNP 704


>gi|218262493|ref|ZP_03476939.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
           DSM 18315]
 gi|218223341|gb|EEC95991.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
           DSM 18315]
          Length = 868

 Score =  323 bits (827), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 180/449 (40%), Positives = 250/449 (55%), Gaps = 41/449 (9%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           E +PF    LPI +R  DL+ RLT +EK+ Q++N+ PAI RLGIP Y+WW+EALHGVA  
Sbjct: 24  EDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARA 83

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
           GK          AT FPQ I  AA+FD    Y     +  EARA Y+  Q         G
Sbjct: 84  GK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKG 133

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PNINIFRDPRWGRG ET GEDP +T +  V+ V+G+QGD           +  AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDD------PKYFKTHAC 187

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  +    W    R++FD  VT +DL  TY P FE+ VK+G    +MCAYNR  G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNRYQGKP 244

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--------EDAVVDVLKA 325
            C+   LL    R  WG+   I SDC A++  ++ +   ++P        E A  D +  
Sbjct: 245 CCSSDKLLIDILRNSWGYENIILSDCGAINDFWERD--ERTPRHETHPDAESASADAVLN 302

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           G D+ CG+  +   K A+K  K+ E+++D +L  L   R  LG+F+ +  + P+ +I  +
Sbjct: 303 GTDLECGNSYRALVK-ALKDGKISENDLDVSLRRLLKGRFELGMFDPDEQV-PYAQIPYN 360

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
           VV SP H   AL+ A   +VLLKN +  LPL K+    +A++GPNA  +  L  NY G  
Sbjct: 361 VVESPEHVAQALEMAHKSMVLLKNKNNTLPLSKTIR-KIAVVGPNAADSTMLWANYNGFP 419

Query: 446 CRSITPLQALQNYVENT--VYYPGCDTVA 472
             ++T L+ ++N V +T  +Y  GC+  A
Sbjct: 420 THTVTILEGIRNKVPDTEVIYELGCNHAA 448



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD +V + G+    E EE+          DR ++ LP  QQE++ +  +A  KPV+ V
Sbjct: 603 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMV-KALKATGKPVVYV 661

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           L  G  + + +   + NI +IL A Y G+    A+A+++FGD+NP
Sbjct: 662 LCTGSALALNWE--EANIDAILNAWYGGQEAGTAVADILFGDYNP 704


>gi|371777036|ref|ZP_09483358.1| glycoside hydrolase [Anaerophaga sp. HS1]
          Length = 890

 Score =  323 bits (827), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 190/438 (43%), Positives = 249/438 (56%), Gaps = 40/438 (9%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   TLP  +RA DLVS++TL+EK+SQ+ ++APAI RLGIP Y WW+E LHGV   G   
Sbjct: 40  YLDPTLPFEERAADLVSKMTLEEKVSQMQHAAPAIERLGIPEYNWWNECLHGVGRAGI-- 97

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTFW 157
                   AT FPQ I  AA +D    YRI  A+  EARA ++        G   G+TFW
Sbjct: 98  --------ATVFPQAIGMAAMWDDEEMYRIATAVSDEARAKHHDFARRGKRGIYQGLTFW 149

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            PNINIFRDPRWGRG ET GEDP +TG+ AV Y++G+QGD          L+  A  KHF
Sbjct: 150 TPNINIFRDPRWGRGMETYGEDPFLTGELAVDYIKGLQGDD------DRYLKLVATSKHF 203

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +     R+ FDAR + +D   TY P F+  +++     +MCAYNR NG+P C  
Sbjct: 204 LVH---SGPEPDRHHFDARTSARDSLMTYTPHFKKTIQEAGVYSVMCAYNRYNGLPCCGS 260

Query: 278 R---NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCG 332
           +   NLL    R +WGF GYI SDC AV+  Y    +   P  E+A    +KAG D+NCG
Sbjct: 261 KPVENLL----RNEWGFKGYIVSDCWAVADFYKKGHHEVVPTVEEAAAMAVKAGTDLNCG 316

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
           +        AVKQ  + E EID  +  L   R+RLG+F+  P M P+  I   VV S  H
Sbjct: 317 NSYPALVD-AVKQGLVSEEEIDVLVKRLMEARLRLGMFDP-PEMVPYTNIPYSVVDSKEH 374

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + LAL AA+  +VLLKN +  LPL K+   ++A+IGPNAN+   LL NY G     +TPL
Sbjct: 375 RELALIAARKSMVLLKNDNNTLPLDKNVK-NVAVIGPNANNLDVLLANYNGYPSNPVTPL 433

Query: 453 QALQNYV--ENTVYYPGC 468
             ++  +   N  Y  GC
Sbjct: 434 DGIRQKLPNANVQYALGC 451



 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 51/140 (36%), Positives = 75/140 (53%), Gaps = 15/140 (10%)

Query: 453 QALQNYVENTVYYPGCDTV-ACSSASIDK-AVDIAKGADHVVLMMGLDQTQEKEEL---- 506
           + L  Y +N   YP    +      ++ K A+ IA  +D V++ MGL    E EE+    
Sbjct: 580 KVLIEYTQNNTEYPVMKFLWDVPGRNLKKEAIQIAAASDVVLMFMGLSPNLEGEEMPVNV 639

Query: 507 ------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
                 DRVD+ LP  Q +L+  +    K PV+LVLL G  + I +     N+ +IL A 
Sbjct: 640 PGFSGGDRVDIKLPQIQTDLVKAIMSLGK-PVVLVLLNGSALAINWEA--ENVPAILEAW 696

Query: 561 YPGEAGAVALAEVIFGDHNP 580
           YPG+AG  A+A+V+FGD+NP
Sbjct: 697 YPGQAGGTAIADVLFGDYNP 716


>gi|315500297|ref|YP_004089100.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
           excentricus CB 48]
 gi|315418309|gb|ADU14949.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
           excentricus CB 48]
          Length = 882

 Score =  323 bits (827), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 183/437 (41%), Positives = 256/437 (58%), Gaps = 35/437 (8%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   + P   RA DLVSR+TL+EK +QL+N APAIPRL +  Y WW+E LHGVA  G   
Sbjct: 35  YQDASKPPEARAADLVSRMTLEEKTAQLINDAPAIPRLNVREYNWWNEGLHGVAAAGY-- 92

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY--------NAGQAIGMTF 156
                   AT FPQ +  AA++D  L +R+ + I +E RA Y         +    G+T 
Sbjct: 93  --------ATVFPQAVGLAATWDEPLIHRVAETISVEFRAKYLKERHRFGGSDWFGGLTV 144

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W+PNINIFRDPRWGRGQET GEDP +T +  V++VRG+QGD           +  A  KH
Sbjct: 145 WSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDD------PVYYRTVATPKH 198

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
              Y + +     R++ +   +  DLADTY P F + + +G+A  IMCAYN +NG P+CA
Sbjct: 199 ---YAVHSGPEAGRHRDNVNPSPYDLADTYLPAFRATITEGQAGSIMCAYNAINGQPACA 255

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           + +LL K  R+ WGF GY+ SDCDAV  IY   +  Y  +PE+ V    + G D+ CG+ 
Sbjct: 256 NEDLLVKYLRKDWGFKGYVVSDCDAVGDIYYKTSHAYRPTPEEGVTAAYQVGTDLICGNA 315

Query: 335 LQ-KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
            +  H   AV+Q  LPE  +D AL  LF+ R +LG F  +P  + F KI A+   +PA++
Sbjct: 316 NEADHLTRAVRQGLLPEKTLDTALIRLFTARFKLGQF--DPPAKVFPKITAEDYDTPANR 373

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
             + + A+  +VLLKN + LLPL K +   +A+IGPNA+S  +L+GNY G     +T L 
Sbjct: 374 DFSQKVAESAMVLLKNENNLLPL-KGEPRQIAVIGPNADSMDSLVGNYNGDPSHPVTVLS 432

Query: 454 ALQ-NYVENTVYY-PGC 468
            ++  + + TV Y PG 
Sbjct: 433 GIRARFPKATVTYAPGS 449



 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 48/117 (41%), Positives = 67/117 (57%), Gaps = 13/117 (11%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
           S      AV  AK AD VV + GL Q  E EE+          DR  L LP  QQ+++ +
Sbjct: 592 SDTGAQSAVAAAKEADLVVFVAGLSQRVEGEEMRVETEGFSGGDRTTLNLPPAQQKVLEQ 651

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           V+ AA KPV+LVL+ G  + I +A  D+N+ +I+ A YPG  G  A+A +I GD++P
Sbjct: 652 VS-AAGKPVVLVLINGSALGINWA--DKNVPAIIEAWYPGGQGGAAVARLIAGDYSP 705


>gi|423342048|ref|ZP_17319763.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409219455|gb|EKN12417.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 868

 Score =  322 bits (826), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 180/449 (40%), Positives = 250/449 (55%), Gaps = 41/449 (9%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           E +PF    LPI +R  DL+ RLT +EK+ Q++N+ PAI RLGIP Y+WW+EALHGVA  
Sbjct: 24  EDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARA 83

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
           GK          AT FPQ I  AA+FD    Y     +  EARA Y+  Q         G
Sbjct: 84  GK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKG 133

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PNINIFRDPRWGRG ET GEDP +T +  V+ V+G+QGD           +  AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDD------PKYFKTHAC 187

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  +    W    R++FD  VT +DL  TY P FE+ VK+G    +MCAYNR  G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNRYQGKP 244

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--------EDAVVDVLKA 325
            C+   LL    R  WG+   I SDC A++  ++ +   ++P        E A  D +  
Sbjct: 245 CCSSDKLLIDILRNSWGYENIILSDCGAINDFWERD--ERTPRHETHPDAESASADAVLN 302

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           G D+ CG+  +   K A+K  K+ E+++D +L  L   R  LG+F+ +  + P+ +I  +
Sbjct: 303 GTDLECGNSYRALVK-ALKDGKISENDLDVSLRRLLKGRFELGMFDPDERV-PYAQIPYN 360

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
           VV SP H   AL+ A   +VLLKN +  LPL K+    +A++GPNA  +  L  NY G  
Sbjct: 361 VVESPEHVAQALEMAHKSMVLLKNKNNTLPLSKTIR-KIAVVGPNAADSTMLWANYNGFP 419

Query: 446 CRSITPLQALQNYVENT--VYYPGCDTVA 472
             ++T L+ ++N V +T  +Y  GC+  A
Sbjct: 420 THTVTILEGIRNKVPDTEVIYELGCNHAA 448



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD +V + G+    E EE+          DR ++ LP  QQE++ +  +A  KPV+ V
Sbjct: 603 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMV-KALKATGKPVVYV 661

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           L  G  + + +   + NI +IL A Y G+    A+A+++FGD+NP
Sbjct: 662 LCTGSALALNWE--EANIDAILNAWYGGQEAGTAVADILFGDYNP 704


>gi|363742357|ref|XP_003642627.1| PREDICTED: probable beta-D-xylosidase 5-like [Gallus gallus]
          Length = 748

 Score =  322 bits (826), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 200/559 (35%), Positives = 295/559 (52%), Gaps = 38/559 (6%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEA 93
           + FPF   TLP  +R  DL+ RLT  E + Q+          AP IPRLGI  Y W +E 
Sbjct: 25  QPFPFRDPTLPWHRRLEDLLGRLTPAEMVLQMARGGALGNGPAPPIPRLGIAPYNWNTEC 84

Query: 94  LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-- 151
           L G A              AT+FPQ +  AA+F   L YR+  A   E RA +N+  A  
Sbjct: 85  LRGDAEAPGW---------ATAFPQALGLAAAFSPELVYRVANATATEVRAKHNSFVAAG 135

Query: 152 -----IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
                 G++ ++P +NI R P WGR QET GEDP +T + A S+V+G+QG      +   
Sbjct: 136 RYDDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPYLTAELATSFVQGLQG------QHPR 189

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            ++ASA CKHF+ +        +R  FDA+V  +D   T+ P F++CV+ G  S  MC+Y
Sbjct: 190 YIKASAGCKHFSVHGGPENIPVSRLSFDAKVLERDWHTTFLPQFQACVRAGSYS-FMCSY 248

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
           NR+NG+P+CA++ LL+   R +WGF GY+ SD  AV +I     Y  +  +  +  + AG
Sbjct: 249 NRINGVPACANKKLLTDILRGEWGFEGYVVSDEGAVELILLGHRYTHTFLETAIASVNAG 308

Query: 327 MDVNCGSFLQK----HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           +++     ++     H   A+    +    +   +  LF  R+RLG F+  P M P+  +
Sbjct: 309 LNLELSYGMRNNVFMHIPKALAMGNITLEMLRDRVRPLFYTRLRLGEFD-PPAMNPYNAL 367

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
              VV S  H+ L+L+AA    VLLKN    LPL +     LA++GP A++ + L G+YA
Sbjct: 368 ELSVVQSSEHRNLSLEAAIKSFVLLKNQRDTLPLRELHGKRLAVVGPFADNPRVLFGDYA 427

Query: 443 G-PSCRSI-TPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQT 500
             P  + I TP + LQ    N  +  GC    C   S D+  +  +GAD V++ +G    
Sbjct: 428 PVPEPQYIYTPRRGLQTLPANVSFAAGCREPRCWVYSRDEVENAVRGADVVLVCLGTGID 487

Query: 501 QEKEELDRVDLVLPGRQQELIT-RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
            E E  DR DL LPG Q +L+   V  AA  PVIL+L   GP+D+++A+    +G+IL  
Sbjct: 488 VEMEARDRKDLSLPGHQLQLLQDAVRAAAGHPVILLLFNAGPLDVSWAQLHDGVGAILAC 547

Query: 560 GYPGEAGAVALAEVIFGDH 578
            +P +A  +A+A V+ G  
Sbjct: 548 FFPAQATGLAIASVLLGKQ 566


>gi|424661938|ref|ZP_18098975.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
           616]
 gi|404578249|gb|EKA82984.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
           616]
          Length = 722

 Score =  322 bits (825), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 188/531 (35%), Positives = 296/531 (55%), Gaps = 33/531 (6%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           PI+ R + L+ ++TL EK SQLV+ + +IPRL +PAY +W+E LHGVA  G+        
Sbjct: 57  PIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  R+  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
           GR +ET GEDP +T +  V++V+G+QGD          L+  A  KHF A + +N     
Sbjct: 165 GRNEETYGEDPYLTSRLGVAFVKGLQGDH------PAYLKTVATIKHFVANNEEN----N 214

Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
           R+   +++  + L + Y P +E+CVK+     +M AYN  NG+P    R LL +  R++W
Sbjct: 215 RFSSSSQIPTKQLYEYYFPAYEACVKEADVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEW 274

Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLP 349
           GF G++ SDC A+ ++        S E+A    + +G D+ CG+  ++    AVKQ  + 
Sbjct: 275 GFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLIS 334

Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN 409
           E+ ID+AL  + + R +LG F+    + P+      ++       LA +AA   +VLLKN
Sbjct: 335 EATIDQALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKN 393

Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCD 469
            + LLPL K K+ S+A++GP A+     LG Y+G    SIT L+ +++ +         +
Sbjct: 394 EN-LLPLSKEKTKSVAVVGPFAD--HNYLGGYSGQPPYSITLLKGVKDLMGKRGKVNYLN 450

Query: 470 TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAK 529
            +  S  SI  AV   KG D V++ +G D+   +E  D   + LP  Q++L+  + +   
Sbjct: 451 GIGASRDSIVAAV---KGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQVNP 507

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + ++LV   G P+   +A  D +I +I+ A YPG+    ALA ++FG+ NP
Sbjct: 508 R-IVLVFHSGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENP 555


>gi|146301622|ref|YP_001196213.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
           UW101]
 gi|146156040|gb|ABQ06894.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 875

 Score =  322 bits (825), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 173/425 (40%), Positives = 244/425 (57%), Gaps = 28/425 (6%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           F F   +L   QR  DLVSRLTL+EK+SQ++NS+P I RLGIPAY+WW+E LHGVA    
Sbjct: 27  FQFQNPSLSFEQRVDDLVSRLTLEEKVSQMLNSSPEIARLGIPAYDWWNETLHGVAR--- 83

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGM 154
                 T    T +PQ I  AA+FD    + +     LE RA+YN          + +G+
Sbjct: 84  ------TPFKTTVYPQAIGMAATFDKNSLFTMADYSALEGRAIYNKAVELKRTNERYLGL 137

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           T+W PNINIFRDPRWGRGQET GEDP +T     ++V+G+QGD          L+A+AC 
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDD------PKYLKAAACA 191

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KH   Y + +   + R+ FD  VT  +L DTY P F   + +   +G+MCAYN     P 
Sbjct: 192 KH---YAVHSGPESLRHTFDVDVTPYELWDTYLPAFRKLITESNVAGVMCAYNAFRTQPC 248

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           CA   L++   R++W F GY+TSDC A+   +         E A  D +  G D++CG+ 
Sbjct: 249 CASDILMNDILRKEWKFDGYVTSDCWAIDDFFKNHKTHPDAESAAADAVFHGTDIDCGTD 308

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             K    AVK  K+ E +ID ++  LF +R RLG+F+   +M  + +  + V+ S  HQ+
Sbjct: 309 AYKALVQAVKNGKISEKQIDISVKRLFMIRFRLGMFDP-VSMVKYAQTPSSVLESKEHQL 367

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
            AL+ A+  IVLLKN   +LPL K+    + ++GPNA++A ++LGNY G   +  T LQ 
Sbjct: 368 HALKMARQSIVLLKNEKNILPLNKNLK-KIVVLGPNADNAISILGNYNGTPSKLTTVLQG 426

Query: 455 LQNYV 459
           ++  V
Sbjct: 427 IKEKV 431



 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 55/110 (50%), Gaps = 13/110 (11%)

Query: 482 VDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKP 531
           ++  K AD  +   G+    E EE+          DR  ++ P  Q +L+ +  +++ KP
Sbjct: 600 IEHHKNADAFIFAGGISPQLEGEEMPVDFPGFKGGDRTSILFPEVQTKLL-KALQSSGKP 658

Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           V+  ++ G  + I +     NI +IL   Y G++   A A+VIFGD+NP 
Sbjct: 659 VVFAMMTGSAIAIPWEA--ENIPAILNIWYGGQSAGTAAADVIFGDYNPA 706


>gi|390340546|ref|XP_001186857.2| PREDICTED: probable beta-D-xylosidase 2-like [Strongylocentrotus
           purpuratus]
          Length = 623

 Score =  322 bits (825), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 211/563 (37%), Positives = 304/563 (53%), Gaps = 44/563 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEALH 95
            PF   +LP  QR  DL+SRL +D+   QL          APAI RL I  Y W +E L 
Sbjct: 30  LPFWNQSLPWDQRLDDLLSRLKVDDMTYQLARGGADPNGPAPAIGRLQIGKYVWNTECLR 89

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA------- 148
           G A  G           AT+FPQ +  +A+F   L + +  A G E RA YN        
Sbjct: 90  GDAQAGN----------ATAFPQALGLSAAFSRDLLFEVANATGYEVRAKYNYYLQKGDF 139

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
               G+  ++P INI R P WGR QET GEDP +TG+ A S+V G+QG+          L
Sbjct: 140 NNHQGLNCFSPVINIMRHPYWGRNQETYGEDPYLTGELAKSFVWGLQGNH------PRYL 193

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
             +A CKHF AY       ++R+ FDA+V+ +DL  T+ P F+ C+K G  S +MC+YN 
Sbjct: 194 LTNAGCKHFAAYSGPENYPSSRFSFDAKVSDKDLQVTFFPAFKECIKAGTYS-VMCSYNS 252

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNGIP+CA+  LL+   R +WGF GY+ SD  A+ +   A  Y  S  D  +  LKAG +
Sbjct: 253 VNGIPACANSYLLNDVLRTEWGFKGYVVSDQRALELEELAHNYTTSYLDTAIKSLKAGCN 312

Query: 329 VNCGSF---LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           ++ G+    +  +   AV+   L   ++  ++  LF  R+RLG F+  P   P+ K+  D
Sbjct: 313 LDLGTTKPAVYDYLAEAVELGMLTAQDLRDSIAPLFYTRLRLGEFDP-PDHNPYVKLNVD 371

Query: 386 -VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-G 443
            VV SP HQ +AL+AA    VL+KN    LP+  +   +LA++GP AN++K L G+YA  
Sbjct: 372 QVVESPEHQEIALKAALKSFVLVKNDGSTLPIEGTIH-TLAVVGPFANNSKLLFGDYAPN 430

Query: 444 PSCRSITP-LQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
           P  R +T  L+ L      T +  GC +  C +      ++   GAD VV+ +G     E
Sbjct: 431 PDPRFVTTVLEGLSPMATKTRHASGCPSPKCVTYDQQGVLNAVTGADVVVVCLGTGIELE 490

Query: 503 KEELDRVDLVLPGRQQELITRVAE-AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
            E  DR D++LPG+Q++L+   A  AA KPVIL+L   GP++IT+A    ++ +I+   +
Sbjct: 491 SEGNDRRDMLLPGKQEQLLQDAARYAAGKPVILLLFNAGPLNITWALSSPSVQAIVECFF 550

Query: 562 PGEAGAVALAEVIFGD---HNPG 581
           P +A  VAL  ++F +    NPG
Sbjct: 551 PAQATGVAL-RMMFQNAPGANPG 572


>gi|325929067|ref|ZP_08190221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
 gi|325540562|gb|EGD12150.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
          Length = 850

 Score =  322 bits (825), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 185/423 (43%), Positives = 251/423 (59%), Gaps = 32/423 (7%)

Query: 63  LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
           +TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA  G          GAT FPQ I  
Sbjct: 1   MTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GATVFPQAIGM 50

Query: 123 AASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQET 175
           AA+FD  L + +  AI  EARA ++         +  G+TFW+PNINIFRDPRWGRGQET
Sbjct: 51  AATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRDPRWGRGQET 110

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQASACCKHFTAYDLDNWKGTTR 230
            GEDP +T +  V++V+G+QG+  +      G+   KL A+A  KHF  +   +     R
Sbjct: 111 YGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRKLDATA--KHFAVH---SGPEADR 165

Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           + FDAR + +DL +TY P FE+ VK G+   +M AYNRV G  + A + LL    R+QWG
Sbjct: 166 HHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWG 225

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
           F GY+ SDC A+  I+       + E A    +K G ++ CG        AAV+Q  + E
Sbjct: 226 FKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGEEYST-LPAAVRQGLIDE 284

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
           ++ID AL  L + RMRLG+F+  P   P+  I A V  SPAH  LA + A++ +VLLKN 
Sbjct: 285 AQIDTALTTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAHDALARRTARESLVLLKND 343

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN--TVYYPGC 468
            GLLPL ++K   +A+IGP A+    LLGNY G     +T LQ ++    N   +Y  G 
Sbjct: 344 -GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGA 402

Query: 469 DTV 471
           D V
Sbjct: 403 DLV 405



 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 68/114 (59%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           + +A+D+A+ AD VV + GL    E EE+          DR DL LP  Q++L+  + +A
Sbjct: 574 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEAL-QA 632

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             KPV+ VL  G  + I +A+  +++ +IL A YPG+ G  A+A+ +FGD NPG
Sbjct: 633 TGKPVVAVLTTGSALAIDWAQ--QHLPAILLAWYPGQRGGTAVADTLFGDANPG 684


>gi|440731995|ref|ZP_20911965.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
 gi|440370332|gb|ELQ07251.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
          Length = 913

 Score =  322 bits (825), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 181/441 (41%), Positives = 259/441 (58%), Gaps = 32/441 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T     QRA DLV+R+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA  G   
Sbjct: 37  YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
                  GAT FPQ I  AA+FD  L + +  AI  EARA ++         +  G+TFW
Sbjct: 94  -------GATVFPQAIGMAATFDVPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQASA 212
           +PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+  +      G+   KL A+A
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEAYRKLDATA 206

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
             KHF  +   +     R+ FDA  + +DL +TY P FE+ VK+G+   +M AYNRV G 
Sbjct: 207 --KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGE 261

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
            + A + LL    R +WGF GY+ SDC A+  I+       + E+A    +K G ++ CG
Sbjct: 262 SASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATREEAAALAVKHGTELECG 321

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
           +       +AV++  + E+++D+AL  L   RMRLG+F+  P    + +I      SP H
Sbjct: 322 AEYST-LPSAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQSPEH 379

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
             LA + A++ +VLLKN  G+LPL ++K   +A++GP A+    LLGNY G     +T L
Sbjct: 380 DALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTVL 438

Query: 453 QALQNYVEN--TVYYPGCDTV 471
           Q ++    +   +Y  G D V
Sbjct: 439 QGIREAAPDAEVLYARGADLV 459



 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 64/114 (56%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           +  A+D A+ AD VV + GL    E EE+          DR DL LP  Q+ L+  +   
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEAL-HG 686

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             KPV+ VL  G  + I +A+  +++ +IL A YPG+ G  A+A+V+FGD NPG
Sbjct: 687 TGKPVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGSAVADVLFGDANPG 738


>gi|329850151|ref|ZP_08264997.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
 gi|328842062|gb|EGF91632.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
          Length = 877

 Score =  322 bits (824), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 186/441 (42%), Positives = 252/441 (57%), Gaps = 38/441 (8%)

Query: 32  SCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWS 91
           + DP         +  T L    RA DLVSR+TL+EK +QL ++APAIPRLG+P Y WW+
Sbjct: 10  ALDPVPADVAAMAYRDTALDPKARAADLVSRMTLEEKAAQLGHTAPAIPRLGVPKYNWWN 69

Query: 92  EALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY----- 146
           E LHGVA  G           AT FPQ I  AA++D  +   +G  +  E RA Y     
Sbjct: 70  EGLHGVARAGV----------ATVFPQAIGMAATWDEPMMTTVGDVVSTEFRAKYVERVH 119

Query: 147 -NAGQAI--GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
            + G     G+T W+PNINIFRDPRWGRGQET GEDP +T +  + Y+ G+QG   N  K
Sbjct: 120 PDGGTDWYRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTSRIGIGYIHGLQG---NDPK 176

Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
                +  A  KHF  +       + R+K D   +  DL DTY P F + V +G+A  +M
Sbjct: 177 F---FKTVATSKHFAVHSGPE---SNRHKEDVYPSKFDLEDTYLPAFRATVTEGKAYSVM 230

Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVD 321
           C YN V G+P CA   L+ +  R+ WGF G++ SDC A + I+  DA  Y K+ E+ V  
Sbjct: 231 CVYNAVYGVPGCASDFLMEEKLRQNWGFPGFVVSDCGAAANIFREDALHYTKTAEEGVAV 290

Query: 322 VLKAGMDVNCGSFLQKHTKA------AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
            LKAGMD+ CG +  K +        AVK  +LP + +D+AL  LF  R+RLG+F+  P 
Sbjct: 291 GLKAGMDLICGDYRNKMSTEVQPIINAVKAGQLPIAVVDQALVRLFEGRIRLGMFD-PPA 349

Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
             PF  I AD   +PAH  +AL  A+  +VLLKN  GLLPL K++  ++A+IGPNA+S  
Sbjct: 350 SLPFAHITADDSDTPAHHAVALDMAKKSMVLLKND-GLLPL-KAEPKTIAVIGPNADSLD 407

Query: 436 TLLGNYAGPSCRSITPLQALQ 456
            L+GNY G   + +T L  ++
Sbjct: 408 ALVGNYYGKPSKPVTVLDGIR 428



 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 42/111 (37%), Positives = 65/111 (58%), Gaps = 13/111 (11%)

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAK 529
           +AVD+AK AD VV + GL    E EE+          DR  + LP  QQ+L+ +V     
Sbjct: 590 QAVDVAKTADFVVFVGGLSARVEGEEMKVEAEGFAGGDRTSIDLPKPQQQLLEKVI-GTG 648

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           KP +LVL+ G  + + +A  D+++ +I+ A YPG  G  A+A++I GD++P
Sbjct: 649 KPTVLVLMSGSALGVNWA--DKHVPAIIEAWYPGGEGGHAVAQLIAGDYSP 697


>gi|423344787|ref|ZP_17322476.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
           CL03T12C32]
 gi|409224378|gb|EKN17311.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
           CL03T12C32]
          Length = 866

 Score =  322 bits (824), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 180/449 (40%), Positives = 250/449 (55%), Gaps = 41/449 (9%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           E +PF    LPI +R  DL+ RLT +EKI Q++N+ PAI RLGIP Y+WW+EALHGVA  
Sbjct: 22  EDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
           GK          AT FPQ I  AA+FD    Y     +  EARA Y+  Q         G
Sbjct: 82  GK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PNINIFRDPRWGRG ET GEDP +T +  ++ V+G+QGD           +  AC
Sbjct: 132 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGLAVVKGLQGDD------PKYFKTHAC 185

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  +    W    R++FD  VT +DL  TY P FE+ VK+G    +MCAYNR  G P
Sbjct: 186 AKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNRYQGKP 242

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--------EDAVVDVLKA 325
            C+   LL    R  WG+   I SDC A++  +  +   ++P        E A  D +  
Sbjct: 243 CCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRD--ERTPRHETHPDAESASADAVLN 300

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           G D+ CG+  +   K A+K+ K+ E+++D +L  L   R  LG+F+ +  + P+ +I  +
Sbjct: 301 GTDLECGNSYKALIK-ALKEGKISENDLDVSLRRLLKGRFELGMFDPDERV-PYAQIPYN 358

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
           VV SP H   AL+ A   +VLLKN +  LPL K+    +A++GPNA  +  L  NY G  
Sbjct: 359 VVESPEHVAQALEMAHKSMVLLKNKNNTLPLSKTIR-KIAVVGPNAADSTMLWANYNGFP 417

Query: 446 CRSITPLQALQNYVENT--VYYPGCDTVA 472
             ++T L+ ++N V +T  +Y  GC+  A
Sbjct: 418 THTVTILEGIRNKVPDTEVIYELGCNHAA 446



 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD +V + G+    E EE+          DR ++ +P  QQE++ +  +A  KPV+ V
Sbjct: 601 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMV-KALKATGKPVVYV 659

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           L  G  + + +   D NI +IL A Y G+    A+A+++FGD+NP
Sbjct: 660 LCTGSALALNWE--DANIDAILNAWYGGQEAGTAVADILFGDYNP 702


>gi|350295750|gb|EGZ76727.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
          Length = 839

 Score =  321 bits (823), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 215/575 (37%), Positives = 299/575 (52%), Gaps = 54/575 (9%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C  T    +RA  LV +LT+DEK+  LV+ A    R+G+P Y WWSE LHGVAG   G+ 
Sbjct: 37  CDVTGTAPERAASLVDQLTIDEKLVNLVDQALGASRIGLPKYAWWSEGLHGVAG-SPGVT 95

Query: 106 FNGT---IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           FN T      ATSF   I   ASFD  L Y +G AI  EARA  N G   G+ +W PN+N
Sbjct: 96  FNTTGYPFSYATSFANAINLGASFDDDLVYEVGTAISTEARAFANFGFG-GLDYWTPNVN 154

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
            ++DPRWGRG ETPGEDPL    Y  + + G++G+       +   +  A CKH+ AYDL
Sbjct: 155 PYKDPRWGRGAETPGEDPLHIKGYVKAMLAGLEGN-------ETVRKVIATCKHYAAYDL 207

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV------------- 269
           + W G TRY+F+A VT+QDL++ Y PPF+ C +  +   IMC+YN +             
Sbjct: 208 ERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDMAGGSKPDE 267

Query: 270 -----NGIPSCADRNLLSKTARRQWGF---HGYITSDCDAV-SIIYDAEGYAKSPEDAVV 320
                   P+CA+  L++   R  W +   + YITSDC+A+   + D   ++++P +A  
Sbjct: 268 IINLTTAQPACANTYLMT-ILRDHWNWTEHNNYITSDCNAILDFLPDNHNFSQTPAEAAA 326

Query: 321 DVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFN------ 371
              KAG D  C   GS L      A  Q  LPE+ ID AL  L+   +R G  +      
Sbjct: 327 AAYKAGTDTVCEVSGSPLTD-VVGAYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGRSAV 385

Query: 372 -----GNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
                G+ +   +  +  + V +P+ Q LAL++A +GIVLLKNS  LLPL  S    +AL
Sbjct: 386 AGGDGGSFSSPAYDALNWNDVNTPSTQELALRSATEGIVLLKNSGSLLPLDFSGK-KVAL 444

Query: 427 IGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAK 486
           IG  AN+  T+ G Y+G       PL A Q    +  Y  G    A    +       A 
Sbjct: 445 IGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSLSYANGPVVNASDPDTWTAPALAAA 504

Query: 487 GADHVVLMM-GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
               VVL   G D T   E+LDR  +  P  Q +L++ +A    KPV+++ L G  VD +
Sbjct: 505 EGADVVLYFGGTDTTVASEDLDRESIAWPEAQMKLLSELA-GLGKPVVVIQL-GDQVDDS 562

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
               + N+ SILW GYPG++G  A+ +V+ G   P
Sbjct: 563 SLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAP 597


>gi|423279990|ref|ZP_17258903.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
           610]
 gi|404584326|gb|EKA88991.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
           610]
          Length = 722

 Score =  321 bits (823), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 187/531 (35%), Positives = 297/531 (55%), Gaps = 33/531 (6%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           PI+ R + L+ ++TL EK SQLV+ + +IPRL +PAY +W+E LHGVA  G+        
Sbjct: 57  PIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  R+  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
           GR +ET GEDP +T +  V++V+G+QGD          L+  A  KHF A + +N     
Sbjct: 165 GRNEETYGEDPYLTSRLGVAFVKGLQGDH------PAYLKTVATIKHFVANNEEN----N 214

Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
           R+   +++  + L + Y P +E+CVK+     +M AYN  NG+P    R LL +  R++W
Sbjct: 215 RFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEW 274

Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLP 349
           GF G++ SDC A+ ++        S E+A    + +G D+ CG+  ++    AVKQ  + 
Sbjct: 275 GFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLIS 334

Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN 409
           E+ ID+AL  + + R +LG F+    + P+      ++       LA +AA   +VLLKN
Sbjct: 335 EATIDQALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKN 393

Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCD 469
            + LLPL K K+ S+A++GP A+     LG Y+G    S+T L+ +++ +         +
Sbjct: 394 EN-LLPLSKEKTKSVAVVGPFADH--NYLGGYSGQPPYSVTLLKGVKDLMGKRGKVNYLN 450

Query: 470 TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAK 529
            +  S  SI  AV   KG D V++ +G D+   +E  D   + LP  Q++L+  + +   
Sbjct: 451 GIGASRDSIVAAV---KGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQVNP 507

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + ++LV   G P+   +A  D +I +I+ A YPG+    ALA+++FG+ NP
Sbjct: 508 R-IVLVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLLFGNENP 555


>gi|313145345|ref|ZP_07807538.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
 gi|313134112|gb|EFR51472.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
          Length = 722

 Score =  321 bits (823), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 187/531 (35%), Positives = 297/531 (55%), Gaps = 33/531 (6%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           PI+ R + L+ ++TL EK SQLV+ + +IPRL +PAY +W+E LHGVA  G+        
Sbjct: 57  PIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  R+  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
           GR +ET GEDP +T +  V++V+G+QGD          L+  A  KHF A + +N     
Sbjct: 165 GRNEETYGEDPYLTSRLGVAFVKGLQGDH------PAYLKTVATIKHFVANNEEN----N 214

Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
           R+   +++  + L + Y P +E+CVK+     +M AYN  NG+P    R LL +  R++W
Sbjct: 215 RFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLLGEVLRKEW 274

Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLP 349
           GF G++ SDC A+ ++        S E+A    + +G D+ CG+  ++    AVKQ  + 
Sbjct: 275 GFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGLIS 334

Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN 409
           E+ ID+AL  + + R +LG F+    + P+      ++       LA +AA   +VLLKN
Sbjct: 335 EATIDQALTRVLTARFKLGEFD-PMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKN 393

Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCD 469
            + LLPL K K+ S+A++GP A+     LG Y+G    S+T L+ +++ +         +
Sbjct: 394 EN-LLPLSKEKTKSVAVVGPFAD--HNYLGGYSGQPPYSVTLLKGVKDLMGKRGKVNYLN 450

Query: 470 TVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAK 529
            +  S  SI  AV   KG D V++ +G D+   +E  D   + LP  Q++L+  + +   
Sbjct: 451 GIGASRDSIVAAV---KGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQVNP 507

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + ++LV   G P+   +A  D +I +I+ A YPG+    ALA+++FG+ NP
Sbjct: 508 R-IVLVFHSGNPLTSEWA--DVHIPAIMQAWYPGQEAGRALADLLFGNENP 555


>gi|291240561|ref|XP_002740190.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 763

 Score =  321 bits (822), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 211/567 (37%), Positives = 297/567 (52%), Gaps = 46/567 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIP--RLGIPAYEWWSEALHGVAGV 100
           +PF  T+L   +R  DLVSRLTLDE + Q+  ++PA P  RLGI  Y W SE LHGV   
Sbjct: 26  YPFQNTSLSWEERVDDLVSRLTLDEMVLQMARTSPAPPIDRLGIKPYVWNSECLHGVVP- 84

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIG 153
             G+        AT+FPQ I  AASF   L   + +AIGLE RA +N         +  G
Sbjct: 85  PDGL--------ATAFPQSIGLAASFSPDLLSDVAKAIGLEVRAKHNDYVQRGVYQEHTG 136

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           ++ ++P INI R P WGR QET GEDP + G+   +YVRG+QGD          + A+A 
Sbjct: 137 LSCFSPVINIARHPLWGRNQETYGEDPFLIGELGSAYVRGLQGDH------PRYVLANAG 190

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           CKHF  +        +R+ FDA+V  +D   T+ P F  CVK G  S +MC+YNR+N +P
Sbjct: 191 CKHFDVHGGPEDIPVSRFSFDAKVFERDWQMTFLPAFHECVKAGVYS-VMCSYNRINEVP 249

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN--- 330
           +CA+  LL+   R++WGF GY+ SD  AV  I  +  Y  S  D V   + AG +++   
Sbjct: 250 ACANTRLLTDILRKEWGFDGYVVSDEGAVEFIMTSHHYTDSIVDTVASAVNAGCNLDLAF 309

Query: 331 -CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
             G  +      AV   K+ E  +   +  LF  RMRLG F+  P + P+  +   VV S
Sbjct: 310 PVGDGMYIKIGDAVTAGKIKEKTVVERVKPLFYTRMRLGEFDP-PELNPYANLNLSVVQS 368

Query: 390 PAHQVLALQAAQD-----GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP 444
             H+ LA++AA         VLLK    +LPL  +    LA+IGP A++   L G+Y+  
Sbjct: 369 EEHRELAVKAALQSFVLLNFVLLKREGRVLPL-DTLVNKLAVIGPFADNPSYLFGDYSPN 427

Query: 445 SCRS--ITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQE 502
             +   +TP + L N   +T   PGC T  C++   +       GAD +V+ +G     E
Sbjct: 428 PDKEFVVTPCKGLSNAARDTRCTPGCLTAPCTTYFSEMVKAAVTGADLIVVCLGTGVKIE 487

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAK-KPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
            E +DR DL LPG+Q +L+  V + A  KP+IL+L   GP+DI +A  +  I  I+   +
Sbjct: 488 AEFVDRSDLSLPGKQFQLLQDVVKYANGKPIILLLFNAGPLDIVWAVENPAIQVIVACFF 547

Query: 562 PGEAGAVALAEVIFGDH-------NPG 581
           P +A   AL  +    H       NPG
Sbjct: 548 PSQATGDALYRMFMNTHGVDTGNGNPG 574


>gi|433677589|ref|ZP_20509555.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
 gi|430817300|emb|CCP39963.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
          Length = 913

 Score =  320 bits (821), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 181/441 (41%), Positives = 257/441 (58%), Gaps = 32/441 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T     QRA DLV+R+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA  G   
Sbjct: 37  YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
                  GAT FPQ I  AA+FD  L + +  AI  EARA ++         +  G+TFW
Sbjct: 94  -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN-----GGKLKGKLQASA 212
           +PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+  +      G+   KL A+A
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEDVDVPKNAQGEAYRKLDATA 206

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
             KHF  +   +     R+ FDA  + +DL +TY P FE+ VK+G+   +M AYNRV G 
Sbjct: 207 --KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGE 261

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
            + A + LL    R +WGF GY+ SDC A+  I+       + E+A    +K G ++ CG
Sbjct: 262 SASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATREEAAALAVKHGTELECG 321

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
           +        AV++  + E+++D AL  L   RMRLG+F+  P    + +I      SP H
Sbjct: 322 AEYST-LPTAVRKGLISEADVDNALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQSPEH 379

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
             LA + A++ +VLLKN  G+LPL ++K   +A++GP A+    LLGNY G     +T L
Sbjct: 380 DALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTVL 438

Query: 453 QALQNYVEN--TVYYPGCDTV 471
           Q ++    +   +Y  G D V
Sbjct: 439 QGIREAAPDAEVLYARGADLV 459



 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 64/114 (56%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           +  A+D A+ AD VV + GL    E EE+          DR DL LP  Q+ L+  +   
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEAL-HG 686

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             KPV+ VL  G  + I +A+  +++ +IL A YPG+ G  A+A+V+FGD NPG
Sbjct: 687 TGKPVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGSAVADVLFGDANPG 738


>gi|329851587|ref|ZP_08266344.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
 gi|328840433|gb|EGF90005.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
          Length = 883

 Score =  320 bits (820), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 185/437 (42%), Positives = 259/437 (59%), Gaps = 35/437 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           E+  +  TT    QRA DLVSR++L+EK +QL+N APAIPRLG+  Y WW+E LHGVA  
Sbjct: 33  ESPAYQDTTKTAEQRAADLVSRMSLEEKAAQLINDAPAIPRLGVREYNWWNEGLHGVAAH 92

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------- 152
           G           AT FPQ +  AA+FD  L +R+   I +E RA Y A +          
Sbjct: 93  GY----------ATVFPQAVGMAATFDEPLIHRVADTISVEFRAKYVASRHRFGGSDWFR 142

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
           G+T W+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+           +  A
Sbjct: 143 GLTVWSPNINIFRDPRWGRGQETYGEDPYLTARIGVAFVKGLQGED------PVYYRTIA 196

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
             KH   Y + +    +R++ +   +  DL DTY P F + + +G+A  IMCAYN ++G 
Sbjct: 197 TPKH---YAVHSGPEASRHRDNINPSRYDLEDTYLPAFRATIVEGKAVSIMCAYNAIDGQ 253

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVN 330
           P+CA+ +LL K  R+ WGF G++ SDCDAV  IY   +  Y  +PE+ V    +AG D+ 
Sbjct: 254 PACANDDLLVKHLRQDWGFKGFVVSDCDAVGDIYYKTSHHYRPTPEEGVTVAYQAGTDLI 313

Query: 331 CGSFLQ-KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           CG+  +  H  +AV++  LPES +D AL  LFS R +LG F  +P  Q F  I AD   +
Sbjct: 314 CGNANEADHVASAVRKGILPESLVDTALVRLFSARFKLGQF--DPPAQVFPAITADDYDT 371

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
            A++  +   A+  +VLLKN  GLLPL KS+  ++A+IGPNA++  +L+GNY G     +
Sbjct: 372 QANRDFSQHVAESAMVLLKND-GLLPL-KSEPRTIAVIGPNADTMDSLVGNYNGDPSHPV 429

Query: 450 TPLQALQNYVEN-TVYY 465
           T L  ++    N TV Y
Sbjct: 430 TVLAGIKARFPNATVRY 446



 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 13/117 (11%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
           S     +AV  AK +D V+ + GL Q  E EE+          DR  L LP  QQ+++ +
Sbjct: 593 SDTGAQEAVAAAKESDLVIFVAGLSQRVEGEEMRVETPGFSGGDRTSLDLPPVQQKVLEQ 652

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           V+ A  KPV+LVL+ G  + + +A  D+N+ +I+ A YPG  G  A+A +I GD +P
Sbjct: 653 VS-ATGKPVVLVLINGSALSVNWA--DKNVPAIVEAWYPGGQGGAAVARLIAGDFSP 706


>gi|265765457|ref|ZP_06093732.1| beta-xylosidase [Bacteroides sp. 2_1_16]
 gi|263254841|gb|EEZ26275.1| beta-xylosidase [Bacteroides sp. 2_1_16]
          Length = 722

 Score =  320 bits (819), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 187/536 (34%), Positives = 296/536 (55%), Gaps = 43/536 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA  G+        
Sbjct: 57  PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  R+  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           GR +ET GEDP +T +  V++V+G+QGD  T+        L+  A  KHF A + +N   
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R+   +++  + L + Y P +E+CVK+  A  +M AYN  NG+P      LL    R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
           +WGF G++ SDC A+ ++        S E+A    + +G D+ CG+  ++    AV+Q  
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
           + E+ IDRAL  + + R +LG F+    + P+      ++       LA +AA   +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
           KN   LLPL K K  S+A++GP A+     LG Y+G    S++ L+ ++  +       Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
             G  T      S D    + KGAD V++ +G D+   +E  D   + LP  Q++L+ ++
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKI 502

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   + ++LV   G P+   +A  D +I +I+ A YPG+    ALA ++FG+ NP
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENP 555


>gi|375357164|ref|YP_005109936.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
 gi|301161845|emb|CBW21389.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
          Length = 722

 Score =  319 bits (818), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 187/536 (34%), Positives = 296/536 (55%), Gaps = 43/536 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA  G+        
Sbjct: 57  PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  R+  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           GR +ET GEDP +T +  V++V+G+QGD  T+        L+  A  KHF A + +N   
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R+   +++  + L + Y P +E+CVK+  A  +M AYN  NG+P      LL    R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
           +WGF G++ SDC A+ ++        S E+A    + +G D+ CG+  ++    AV+Q  
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
           + E+ IDRAL  + + R +LG F+    + P+      ++       LA +AA   +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
           KN   LLPL K K  S+A++GP A+     LG Y+G    S++ L+ ++  +       Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
             G  T      S D    + KGAD V++ +G D+   +E  D   + LP  Q++L+ ++
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKI 502

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   + ++LV   G P+   +A  D +I +I+ A YPG+    ALA ++FG+ NP
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENP 555


>gi|424796589|ref|ZP_18222299.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422794891|gb|EKU23686.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 913

 Score =  319 bits (818), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 183/441 (41%), Positives = 255/441 (57%), Gaps = 32/441 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T     QRA DLVSR+TL+EK +Q+ N+APAIPRLG+PAY+WW+EALHGVA  G   
Sbjct: 37  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFW 157
                  GAT FPQ I  AA+FD  L + +  AI  EARA ++         +  G+TFW
Sbjct: 94  -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG-----GKLKGKLQASA 212
           +PNINIFRDPRWGRGQET GEDP +T +  V++V+G+QG+  +      G    KL A+A
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGDAYRKLDATA 206

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
             KHF  +   +     R+ FDA  + +DL +TY P FE+ VK+G+   +M AYNRV G 
Sbjct: 207 --KHFAVH---SGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGE 261

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
            + A + LL    R  WGF GY+ SDC A+  I+       + E A    +  G ++ CG
Sbjct: 262 SASASKFLLRDVLRDTWGFDGYVVSDCWAIVDIWKNHKIVATREQAAALAVNNGTELECG 321

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
                   AAV++  + E+++D+AL  L   RMRLG+F+   T++ + +I      SP H
Sbjct: 322 EEYST-LPAAVRKGLISEADVDKALQKLMYSRMRLGMFDPPDTLR-WAQIPLSANQSPEH 379

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
             LA + A++ +VLLKN  G+LPL + K   +A+IGP A+    LLGNY G     +T L
Sbjct: 380 DALARRTARESLVLLKND-GVLPLSRGKIKRIAVIGPTADDTMALLGNYYGTPAAPVTVL 438

Query: 453 QALQNYVEN--TVYYPGCDTV 471
           Q ++    +   +Y  G D V
Sbjct: 439 QGIREAAPDAEVLYARGADLV 459



 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 76/148 (51%), Gaps = 21/148 (14%)

Query: 452 LQALQNYVENTVYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEK 503
           L+A ++Y     YY G        A         +  A+D A+ AD VV + GL    E 
Sbjct: 594 LEAGRSYALRLEYYEGERDAGIRLAWRQPGAKPPLQDALDAARRADVVVFVGGLTGDVEG 653

Query: 504 EEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
           EE+          DR DL LP  Q+EL+  + +   KPV+ VL  G  + I +A+  +++
Sbjct: 654 EEMKVNYPGFAGGDRTDLRLPKPQRELLEAL-QGTGKPVVAVLTTGSALAIDWAQ--QHV 710

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +IL A YPG+ G  A+A+V+FGD NPG
Sbjct: 711 PAILLAWYPGQRGGSAVADVLFGDANPG 738


>gi|358380569|gb|EHK18247.1| glycoside hydrolase family 3 protein, partial [Trichoderma virens
           Gv29-8]
          Length = 722

 Score =  319 bits (818), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 198/532 (37%), Positives = 292/532 (54%), Gaps = 31/532 (5%)

Query: 63  LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI-------RGATS 115
           LTLDEK + LVN+AP + RLG+P YEW +EALHG+AGV  G   N T          +T 
Sbjct: 12  LTLDEKAANLVNNAPGVKRLGLPPYEWRNEALHGLAGVSPGQGINSTFTQGNVAFNSSTQ 71

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQET 175
           FP  I+  A+FD +L + I  A+  EARA  N  +A G+ +WAPNIN +RDPRWGRGQET
Sbjct: 72  FPSPIVLGAAFDDHLVHDIATAVSTEARAFSNHLKA-GLDYWAPNINPYRDPRWGRGQET 130

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
           PGEDP    +YA +YV G++G     G    K +  + CKHF  YD+++  G  R  ++A
Sbjct: 131 PGEDPYHVAQYAYNYVVGLKG-----GVGPAKSKVVSTCKHFAGYDIEDSDGVVRGSYNA 185

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG-- 293
            ++ QDLA+ Y P F SC +  +   +MC+YN VNG PSCA+  +L    R  WG+    
Sbjct: 186 IISTQDLAEYYLPSFRSCFRDAKTGAVMCSYNAVNGHPSCANSYMLDTVLRDHWGWGSSA 245

Query: 294 -YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESE 352
            ++T DC AV  +++     +S    V   +  G D++CG+    +  +AV+     E++
Sbjct: 246 HWVTGDCGAVDGVFNQHHVGQSAAQGVAFAINNGTDLDCGTAYASNIASAVQNNYTTEAQ 305

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
           +D+AL  L+S  + LG F+  P  Q +  +G   V +P+ Q LA  A  +GI        
Sbjct: 306 LDQALSRLYSSLIVLGYFD-PPEGQEYRTLGVSDVNTPSTQKLAYTALVEGI-------N 357

Query: 413 LLPLPKSKSVSLALIGPNANSAK-TLLGNYAGPSCRSITPLQALQN--YVENTVYYPGCD 469
           +LP+ +    ++  +GP AN+A  ++ GNY G +     P+    +  Y  N  Y  G  
Sbjct: 358 ILPI-RPMGQTVLFVGPWANNASVSMFGNYNGVAPYKTIPVPTANSSAYNWNVTYSQGLQ 416

Query: 470 TVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA 528
            V  +  S    AV  A+ AD VV + G+D+  E E  DR  +  PG Q  LI ++  AA
Sbjct: 417 YVLSNDTSQFAAAVSAAQEADVVVYIGGIDEQVEAEAHDRTSIDWPGAQLNLIKQL--AA 474

Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KPV++V + GG VD +    ++N+  +LW GYPG+     L +++ G   P
Sbjct: 475 VKPVVVVQVGGGQVDDSSLLQNKNVKGLLWMGYPGQEFGSGLIDILSGASAP 526


>gi|399030621|ref|ZP_10730998.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398071229|gb|EJL62496.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 876

 Score =  319 bits (817), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 174/426 (40%), Positives = 242/426 (56%), Gaps = 30/426 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           F F    L   +R  DLV+RLTL+EK+SQ++NS+PAIPRL IPAY+WW+E LHGVA    
Sbjct: 29  FLFQNPDLSFEKRVDDLVNRLTLEEKVSQMLNSSPAIPRLDIPAYDWWNETLHGVAR--- 85

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGM 154
                 T    T +PQ I  AA+FD    Y++     LE RA+YN          + +G+
Sbjct: 86  ------TPFKVTVYPQAIAMAATFDKNSLYKMADFSALEGRAIYNKAVESGRTNERYLGL 139

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           T+W PNINIFRDPRWGRGQET GEDP +TG    S+V+G+QGD          L+A+AC 
Sbjct: 140 TYWTPNINIFRDPRWGRGQETYGEDPYLTGVLGDSFVKGLQGDD------PKYLKAAACA 193

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KH   Y + +     R+ FD  VT  +L DTY P F+  V + + +G+MCAYN     P 
Sbjct: 194 KH---YAVHSGPEPLRHTFDVDVTPYELWDTYLPAFQKLVTESKVAGVMCAYNAFRTQPC 250

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           CA   L++   R QW F GY+TSDC A+   +         E A  D +  G D++CG+ 
Sbjct: 251 CASDILMTDILRNQWKFEGYVTSDCWAIDDFFKNHKTHPDAESASADAVFHGTDIDCGTD 310

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT-MQPFGKIGADVVCSPAHQ 393
             K    AVK  K+ E +ID ++  LF +R RLG+F  +P  M  + +    V+ +  H+
Sbjct: 311 AYKALVQAVKDGKISEKQIDISVKRLFMIRFRLGMF--DPVEMVKYAQTPTSVLENDEHK 368

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
             AL+ A+  IVLL+N +  LPL K K   + ++GPN ++A  +LGNY G   +  T L+
Sbjct: 369 AHALKMARQSIVLLRNENKTLPLSK-KLKKIVVLGPNVDNAIAILGNYNGTPSKLTTVLE 427

Query: 454 ALQNYV 459
            ++  V
Sbjct: 428 GIKEKV 433



 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 73/149 (48%), Gaps = 22/149 (14%)

Query: 452 LQALQNYVENTV--YY--PGCDTVACSSASIDKA-----VDIAKGADHVVLMMGLDQTQE 502
           L+  +N V N V  Y+   G   VA  + + +K      V+  K AD  V + G+    E
Sbjct: 563 LKTKRNTVYNLVVEYWQGEGKANVALHTGNYEKTDFNTLVNRVKDADAFVFVGGISPQLE 622

Query: 503 KEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
            EE+          DR  ++LP  Q +L+ +  +   KP++ V++ G  + I +     N
Sbjct: 623 GEEMKVNFPGFKGGDRTSILLPKIQTDLM-KALKTTGKPIVFVMMTGSAIAIPWEA--EN 679

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           I +I  A Y G+A   A+A+V+FG++NP 
Sbjct: 680 IPAIANAWYGGQAAGTAVADVLFGNYNPA 708


>gi|336408348|ref|ZP_08588841.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
 gi|423248801|ref|ZP_17229817.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
           CL03T00C08]
 gi|423253750|ref|ZP_17234681.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
           CL03T12C07]
 gi|335937826|gb|EGM99722.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
 gi|392655379|gb|EIY49022.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
           CL03T12C07]
 gi|392657742|gb|EIY51373.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
           CL03T00C08]
          Length = 722

 Score =  319 bits (817), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 187/536 (34%), Positives = 295/536 (55%), Gaps = 43/536 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA  G+        
Sbjct: 57  PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  R+  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           GR +ET GEDP +T +  V++V+G+QGD  T+        L+  A  KHF A + +N   
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R+   +++  + L + Y P +E+CVK+  A  +M AYN  NG+P      LL    R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
           +WGF G++ SDC A+ ++        S E+A    + +G D+ CG+  ++    AV+Q  
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
           + E+ IDRAL  + + R +LG F+    + P+      ++       LA +AA   +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
           KN   LLPL K K  S+A++GP A+     LG Y+G    S++ L+ ++  +       Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
             G  T      S D    + KGAD V++ +G D+   +E  D   + LP  Q++L+  +
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEI 502

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   + ++LV   G P+   +A  D +I +I+ A YPG+    ALA ++FG+ NP
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENP 555


>gi|423258868|ref|ZP_17239791.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
           CL07T00C01]
 gi|423264161|ref|ZP_17243164.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
           CL07T12C05]
 gi|387776448|gb|EIK38548.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
           CL07T00C01]
 gi|392706427|gb|EIY99550.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
           CL07T12C05]
          Length = 722

 Score =  319 bits (817), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 187/536 (34%), Positives = 295/536 (55%), Gaps = 43/536 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA  G+        
Sbjct: 57  PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  R+  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           GR +ET GEDP +T +  V++V+G+QGD  T+        L+  A  KHF A + +N   
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R+   +++  + L + Y P +E+CVK+  A  +M AYN  NG+P      LL    R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
           +WGF G++ SDC A+ ++        S E+A    + +G D+ CG+  ++    AV+Q  
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
           + E+ IDRAL  + + R +LG F+    + P+      ++       LA +AA   +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
           KN   LLPL K K  S+A++GP A+     LG Y+G    S++ L+ ++  +       Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
             G  T      S D    + KGAD V++ +G D+   +E  D   + LP  Q++L+  +
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEI 502

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   + ++LV   G P+   +A  D +I +I+ A YPG+    ALA ++FG+ NP
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENP 555


>gi|383117083|ref|ZP_09937830.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
 gi|251947612|gb|EES87894.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
          Length = 722

 Score =  318 bits (816), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 187/536 (34%), Positives = 295/536 (55%), Gaps = 43/536 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA  G+        
Sbjct: 57  PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  R+  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           GR +ET GEDP +T +  V++V+G+QGD  T+        L+  A  KHF A + +N   
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R+   +++  + L + Y P +E+CVK+  A  +M AYN  NG+P      LL    R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
           +WGF G++ SDC A+ ++        S E+A    + +G D+ CG+  ++    AV+Q  
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
           + E  IDRAL  + + R +LG F+    + P+      ++       LA +AA   +VLL
Sbjct: 333 ISEVAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
           KN   LLPL K K  S+A++GP A+     LG Y+G    S++ L+ ++  +       Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
             G  T      S D    + KGAD V++ +G D+   +E  D   + LP  Q++L+ ++
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKKI 502

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   + ++LV   G P+   +A  D +I +I+ A YPG+    ALA ++FG+ NP
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENP 555


>gi|386718620|ref|YP_006184946.1| glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
 gi|384078182|emb|CCH12773.1| Glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
          Length = 897

 Score =  318 bits (815), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 186/439 (42%), Positives = 257/439 (58%), Gaps = 36/439 (8%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+   +    QRA  LV+++TLDEK +Q+ N+APAI RLG+PAY+WW+E LHGVA  G+ 
Sbjct: 37  PWLDVSASFEQRAASLVAQMTLDEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ- 95

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
                    AT FPQ I  AA+FD  L  ++   I  EARA ++        G+  G+TF
Sbjct: 96  ---------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLRQGAHGRYQGLTF 146

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W+PN+NIFRDPRWGRGQET GEDP +T +  V++VRG+QGD      +  KL A+A  KH
Sbjct: 147 WSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDD----PVYRKLDATA--KH 200

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
              +   +     R+ FDAR + +DL DTY P FE+ VK+G    +M AYNRV G  + A
Sbjct: 201 LAVH---SGPEADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESASA 257

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
            R LL    RR WGF GY+ SDC A+  I+       + E A    ++ G ++ CG   Q
Sbjct: 258 SRFLLRDVLRRDWGFKGYVVSDCWAIVDIWKHHHIVTTREAAAALAVRNGTELECG---Q 314

Query: 337 KHTK--AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
           ++    +AV+Q  + E+EID A+  LF+ RMRLG+F+  P    + +I A V  +P+H  
Sbjct: 315 EYATLPSAVRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPSHDA 373

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LAL+AAQ  +VLLKN  G+LPL +     +A++GP A+    LLGNY G     +T LQ 
Sbjct: 374 LALKAAQASLVLLKND-GILPLSRDIK-RIAVVGPTADDTMALLGNYFGTPAAPVTILQG 431

Query: 455 LQNYVE--NTVYYPGCDTV 471
           ++   +     Y  G D V
Sbjct: 432 IREAAKGVEVRYARGVDLV 450



 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 65/117 (55%), Gaps = 13/117 (11%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
           + A  D+A+D A+ AD VV + GL    E EE+          DR DL LP  Q+ L+  
Sbjct: 615 AKAPFDEALDAAREADVVVFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEA 674

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  A  KPV++VL  G  + + +A+   ++ +IL + YPG+ G  A+ + +FGD NP
Sbjct: 675 L-HATGKPVVMVLTGGSAIAVDWAQ--SHLPAILMSWYPGQRGGTAVGQALFGDVNP 728


>gi|60680313|ref|YP_210457.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
 gi|60491747|emb|CAH06504.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
          Length = 722

 Score =  318 bits (815), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 186/536 (34%), Positives = 295/536 (55%), Gaps = 43/536 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA  G+        
Sbjct: 57  PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  R+  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           GR +ET GEDP +T +  V++V+G+QGD  T+        L+  A  KHF A + +N   
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R+   +++  + L + Y P +E+CVK+  A  +M AYN  NG+P      LL    R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
           +WGF G++ SDC A+ ++        S E+A    + +G D+ CG+  ++    AV+Q  
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
           + E+ IDRAL  + + R +LG F+    + P+      ++       LA +AA   +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
           KN   LLPL K K  S+A++GP A+     LG Y+G    S++ L+ ++  +       Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
             G  T      S D    + KGAD V++ +G D+   +E  D   + LP  Q++ + ++
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKFLKKI 502

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   + ++LV   G P+   +A  D +I +I+ A YPG+    ALA ++FG+ NP
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHILAIMQAWYPGQEAGRALANLLFGNENP 555


>gi|325916103|ref|ZP_08178390.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325537647|gb|EGD09356.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 896

 Score =  318 bits (814), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 187/437 (42%), Positives = 251/437 (57%), Gaps = 33/437 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+  T LP   RA DLVSR+TL+EK +Q+ N+APAIPRL +PAY+WW+EALHGVA  G  
Sbjct: 39  PYLDTQLPFETRAADLVSRMTLEEKAAQMQNAAPAIPRLRVPAYDWWNEALHGVARAG-- 96

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTF 156
                   GAT FPQ I  AA+FD+ L   +  AI  EARA ++A  A        G+TF
Sbjct: 97  --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARDEHKRYQGLTF 148

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+Q       KL       A  KH
Sbjct: 149 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKL------DATAKH 202

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           +  +   +     R+ FD   + +DL +TY P F++ V++G  + +M AYNRVNG  + A
Sbjct: 203 YAVH---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGHVAAVMGAYNRVNGESASA 259

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
               L    RR WGF GYI SDC A+  I+       +PE A    +K G D++CG    
Sbjct: 260 S-TRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYA 318

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
              K AV+   + E+ ID +L  L + RMRLG+F+  P    + +I A V  SP H  LA
Sbjct: 319 ALPK-AVRAGLIDEATIDTSLKRLMTTRMRLGMFD-PPAKVAWAQIPASVNQSPQHDALA 376

Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
            + A++ +VLLKN  GLLPL K     +A++GP A+   +LLGNY G     +T LQ ++
Sbjct: 377 RRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQGIR 434

Query: 457 NYV--ENTVYYPGCDTV 471
           +       VY  G D V
Sbjct: 435 DAAPQAEVVYARGSDLV 451



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 64/114 (56%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELD----------RVDLVLPGRQQELITRVAEA 527
           + +AVD A+ A+ VV + GL    E EE+D          R D  LP  Q+EL+ +  +A
Sbjct: 620 LQEAVDAARNAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQA 678

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
              PV+ VL  G  + + +A+  +++ +IL A YPG+ G  A+ +V+FG  +PG
Sbjct: 679 TGTPVVAVLTTGSALAVDWAQ--QHVPAILLAWYPGQRGGSAVGDVLFGQASPG 730


>gi|53712125|ref|YP_098117.1| beta-xylosidase [Bacteroides fragilis YCH46]
 gi|52214990|dbj|BAD47583.1| beta-xylosidase [Bacteroides fragilis YCH46]
          Length = 722

 Score =  318 bits (814), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 187/536 (34%), Positives = 295/536 (55%), Gaps = 43/536 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA  G+        
Sbjct: 57  PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  R+  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           GR +ET GEDP +T +  V++V+G+QGD  T+        L+  A  KHF A + +N   
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R+   +++  + L + Y P +E+CVK+  A  +M AYN  NG+P      LL    R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
           +WGF G++ SDC A+ ++        S E+A    + +G D+ CG+  ++    AV+Q  
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
           + E+ IDRAL  + + R +LG F+    + P+      ++       LA +AA   +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
           KN   LLPL K K  S+A++GP A+     LG Y+G    S++ L+ ++  +       Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
             G  T      S D    + KGAD V++ +G D+   +E  D   + LP  Q++L+  +
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEGQEKLLKEI 502

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   + ++LV   G P+   +A  D +I +I+ A YPG+    ALA ++FG+ NP
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENP 555


>gi|408824590|ref|ZP_11209480.1| Glucan 1,4-beta-glucosidase [Pseudomonas geniculata N1]
          Length = 897

 Score =  318 bits (814), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 186/439 (42%), Positives = 259/439 (58%), Gaps = 36/439 (8%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+   +    QRA  LV+++TL+EK +Q+ N+APAI RLG+PAY+WW+E LHGVA  G+ 
Sbjct: 37  PWLDVSASFEQRAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ- 95

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTF 156
                    AT FPQ I  AA+FD  L  ++   I  EARA ++        G+  G+TF
Sbjct: 96  ---------ATVFPQAIGLAATFDVPLMGQVAATISDEARAKHHQFLREGAHGRYQGLTF 146

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W+PN+NIFRDPRWGRGQET GEDP +T +  V++VRG+QGD      +  KL A+A  KH
Sbjct: 147 WSPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQGDD----PVYRKLDATA--KH 200

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
              +   +     R+ FDAR + +DL DTY P FE+ VK+G    +M AYNRV G  + A
Sbjct: 201 LAVH---SGPEADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESASA 257

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
            R LL    RR WGF GY+ SDC A+  I+       + E A    ++ G ++ CG   Q
Sbjct: 258 SRFLLRDVLRRDWGFKGYVVSDCWAIVDIWKHHRIVTTREAAAALAVRNGTELECG---Q 314

Query: 337 KHTK--AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
           ++    +AV+Q  + E+EID A+  LF+ RMRLG+F+  P    + +I A V  +PAH  
Sbjct: 315 EYATLPSAVRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPAHDA 373

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LAL+AAQ  +VLLKN  G+LPL ++ +  +A++GP A+    LLGNY G     +T LQ 
Sbjct: 374 LALKAAQASLVLLKND-GILPLSRN-TRRIAVVGPTADDTMALLGNYFGTPAAPVTILQG 431

Query: 455 LQNYVE--NTVYYPGCDTV 471
           ++   +     Y  G D V
Sbjct: 432 IREAAKGVEVRYARGVDLV 450



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 53/100 (53%), Gaps = 13/100 (13%)

Query: 491 VVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
           VV + GL    E EE+          DR DL LP  Q+ L+  +     KPV++VL  G 
Sbjct: 632 VVFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEAL-HGTGKPVVMVLTGGS 690

Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + + +A+   ++ +IL + YPG+ G  A+ + +FGD NP
Sbjct: 691 AIAVDWAQA--HLPAILMSWYPGQRGGTAVGQALFGDVNP 728


>gi|291240559|ref|XP_002740189.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 745

 Score =  318 bits (814), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 213/594 (35%), Positives = 299/594 (50%), Gaps = 53/594 (8%)

Query: 11  PLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKIS 70
           P  C+ FTS+L  + ST               FPF  T+LP ++R  DLV RL L+E + 
Sbjct: 4   PGKCVVFTSVLFSLIST-----------ILSDFPFRNTSLPWNKRVEDLVGRLKLEEIVL 52

Query: 71  QLVNS-------APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
           Q+          AP I RL I  Y W +E L G    G           ATSFPQ    A
Sbjct: 53  QMSRGGRYSNGPAPPIDRLNIGPYSWNTECLRGDLSAGP----------ATSFPQAFGLA 102

Query: 124 ASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQETP 176
           A+FD+ L  +I  A   E RA YN        G   G++ ++P INI R P WGR QET 
Sbjct: 103 ATFDAVLIKQIANATAYEVRAKYNNYTKHKEYGDHKGLSCFSPVINIARHPLWGRIQETY 162

Query: 177 GEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR 236
           GEDP ++G  A S+V G+QG+          + A+A CKHF AY       ++R  FDA+
Sbjct: 163 GEDPYLSGTLAASFVTGLQGNH------PRYVTANAGCKHFDAYAGPENIPSSRSTFDAK 216

Query: 237 VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYIT 296
           V+ +DL  T+ P F  C++ G  S +MC+YN +NG+P+CA++ LL+   R +W F GY+ 
Sbjct: 217 VSDRDLRMTFLPAFHECIQAGTYS-LMCSYNSINGVPACANKKLLTDILRTEWNFTGYVI 275

Query: 297 SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESE 352
           SD  AV  +YDA  Y K   D  +  + +G+++   S L  +    T  AVKQ  +    
Sbjct: 276 SDQSAVEKVYDAHHYTKDMLDTAIACVNSGLNLELSSNLTDNVMMQTTKAVKQGNVTMKT 335

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
           +   +  LF  RMRLG F+  P M P+ K+   ++ S  HQ L+L+AA    VLLKN + 
Sbjct: 336 VKARVSPLFYTRMRLGEFDP-PEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENR 394

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYA-GPSCRSITPLQALQNYVE-NTVYYPGCDT 470
            LPL K K   LA++GP  ++   + G+ +   S  ++TP   L       T +  GC +
Sbjct: 395 FLPL-KEKIDKLAVVGPFGDNPIEIYGSKSPDVSNLTVTPRYGLSKIARLATTFASGCLS 453

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITR-VAEAAK 529
            AC+              D VV+ +G     E E  DR +L LPG+Q  L+   V  AA 
Sbjct: 454 PACTEYDPKSTKQAIDRVDMVVVCLGTGNEVENEAHDRSELTLPGQQLRLLQDAVTFAAD 513

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD--HNPG 581
           KPVIL+L   GP+DIT+A  +  I  I+   +P +    AL  +       NPG
Sbjct: 514 KPVILLLFNAGPLDITWAVSNPAIPVIVECFFPAQTTGTALYHLFVNSPGSNPG 567


>gi|340368019|ref|XP_003382550.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 742

 Score =  318 bits (814), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 212/569 (37%), Positives = 297/569 (52%), Gaps = 50/569 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEALH 95
           FPF  T+L I  R +D+V  LTL+E + Q+ +        AP IPRL I  Y+W +E L 
Sbjct: 24  FPFQNTSLSIEDRVKDIVDNLTLEELVEQMAHGGATLNGPAPGIPRLHINPYQWGTECLS 83

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA---- 151
           G    G           ATSFP  I  AASF+  L  R+  A   E RA + A       
Sbjct: 84  GNVSAGD----------ATSFPMPIGMAASFNYDLLKRVTNATAYEVRAKHAAAVKDGSY 133

Query: 152 ---IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
               G++ W+P +NI RDPRWGR QET GEDP ++G    +YV G+QG+          +
Sbjct: 134 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAYVNGLQGNN------SRYI 187

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
            A+A CKHF  +       T+R+ FDA+V+M+D   T+ P F++CV+ G  S +MC+YNR
Sbjct: 188 IANAGCKHFDVHGGPENIPTSRFSFDAKVSMRDWRMTFLPQFKACVEAGALS-LMCSYNR 246

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           +NG+P+CA++ LL+   R +W F GY+ SD  A+  I     YA     A  D   AG  
Sbjct: 247 INGVPACANKALLTDILRNEWDFKGYVVSDQGALEFIVIEHHYAPDFMKAAADAANAGTC 306

Query: 329 VNCGSFLQK------HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           +  G+  +K      H   AVK   +    +  A+  LF VRM+LG F+  P   P+  I
Sbjct: 307 LEDGNIGRKFFNVFEHLVDAVKNNLVSVDTLKNAVSRLFYVRMKLGEFDP-PDNNPYANI 365

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLP--LPKSKSVSLA-LIGPNANSAKTLLG 439
              V+ S AH  L+LQAA + IVL+KN  G     LP +  V  A ++GP ++  + L G
Sbjct: 366 PLSVIQSDAHINLSLQAAMESIVLMKNDDGFRSPFLPITNEVKKACMVGPFSDDPEVLFG 425

Query: 440 NYAGPSCRS--ITPLQALQNY---VENTVYYPGC-DTVACSSASIDKAVDIAKGADHVVL 493
           +Y+    R   IT L  L+N     +   Y  GC D  AC +    K      G + +++
Sbjct: 426 DYSPTLMRDYVITSLAGLKNANIGTDTLNYAVGCEDGPACRNYDSAKVRSACDGVELIIV 485

Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKK--PVILVLLCGGPVDITFAKYDR 551
             GL +  E E  D  D+ LPG Q +L+ + AEAA K   VIL+L    P+DI +AK D 
Sbjct: 486 TAGLSKHLESEGKDLSDINLPGHQLDLM-QDAEAASKNASVILILFNASPLDIRYAKTDP 544

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            I  IL A YPG+    A+A V+ G++NP
Sbjct: 545 RIVGILEAYYPGQTAGKAIANVLTGEYNP 573


>gi|395803818|ref|ZP_10483061.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
 gi|395434089|gb|EJG00040.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
          Length = 875

 Score =  317 bits (813), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 175/433 (40%), Positives = 248/433 (57%), Gaps = 30/433 (6%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           FPF  T L   +R  +LVS+LTL+EK++Q++N+APAIPRLGIPAY+WW+E LHGVA    
Sbjct: 27  FPFQNTDLTFEERVENLVSQLTLEEKVAQMLNAAPAIPRLGIPAYDWWNETLHGVAR--- 83

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGM 154
                 T    T FPQ I  AA+FD    +++     LE RA+YN          + +G+
Sbjct: 84  ------TPFKTTVFPQAIAMAATFDKNSLFKMADYSALEGRAIYNKAVELNRTKERYLGL 137

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           T+W PNINIFRDPRWGRGQET GEDP +T     ++V+G+QGD          L+A+AC 
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDD------PKYLKAAACA 191

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KH   Y + +   + R+ FD  VT  +L DTY P F+  V   + +G+MCAYN     P 
Sbjct: 192 KH---YAVHSGPESLRHTFDVDVTPYELWDTYLPAFKKLVTNSKVAGVMCAYNAFRTQPC 248

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           CA   L++   R QW F GY+TSDC A+   +           A  D +  G D++CG+ 
Sbjct: 249 CASDILMNDILRNQWKFTGYVTSDCWAIDDFFKNHKTHPDAASASADAVLHGTDIDCGTD 308

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             K    AVK  ++ E +ID ++  LF +R RLG+F+   +M  + +  + V+ S  H+ 
Sbjct: 309 AYKSLVQAVKNGQITEKQIDVSVKRLFMIRFRLGMFDP-VSMVKYAQTPSSVLESEEHKE 367

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
            AL+ A+  IVLLKN    LPL K K   + ++GPNA+++ ++LGNY G   +  T LQ 
Sbjct: 368 HALKMARQSIVLLKNEKNTLPLSK-KLKKIVVLGPNADNSISILGNYNGTPSKLTTVLQG 426

Query: 455 LQNYV--ENTVYY 465
           ++  +  E  V Y
Sbjct: 427 IKEKISPETEVVY 439



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 32/110 (29%), Positives = 56/110 (50%), Gaps = 13/110 (11%)

Query: 482 VDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKP 531
           ++  K AD  +   G+    E EE+          DR  ++LP  Q  L+ +  +++ KP
Sbjct: 600 IERHKNADAFIFAGGISPQLEGEEMPVDAPGFNGGDRTSILLPEVQTRLL-KALQSSGKP 658

Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           V+ +++ G  + + +     NI +IL   Y G++   A A+VIFGD+NP 
Sbjct: 659 VVFLIMTGSAIAVPWEA--ENIPAILNIWYGGQSAGTASADVIFGDYNPA 706


>gi|188993706|ref|YP_001905716.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
 gi|167735466|emb|CAP53681.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
          Length = 896

 Score =  317 bits (813), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 186/445 (41%), Positives = 255/445 (57%), Gaps = 33/445 (7%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           ++ +T   P+   T P+  RA DLVSR+TL+EK +Q+ N+APAIPRL +P Y+WW+EALH
Sbjct: 31  THAATPPPPYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALH 90

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA---- 151
           GVA  G          GAT FPQ I  AA+FD+ L   +  AI  EARA ++A  A    
Sbjct: 91  GVARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEH 140

Query: 152 ---IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
               G+TFW+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+Q       KL    
Sbjct: 141 KRYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKL---- 196

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
              A  KH+  +   +     R+ FD   + +DL +TY P F++ V++G  + +M AYNR
Sbjct: 197 --DATAKHYAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNR 251

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG  + A    L    RR WGF GYI SDC A+  I+       +PE A    +K G D
Sbjct: 252 VNGESASAS-TRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTD 310

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           ++CG        AAV+   + E+ IDR+L  L + R+RLG+F+  P   P+ +I A    
Sbjct: 311 LDCGDTYAA-LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQIPASANQ 368

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           SP H  LA + A++ +VLLKN  GLLPL K     +A++GP A+   +LLGNY G     
Sbjct: 369 SPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMSLLGNYYGTPAAP 426

Query: 449 ITPLQALQNYV--ENTVYYPGCDTV 471
           +T LQ +++       VY  G D V
Sbjct: 427 VTILQGIRDAAPQAEVVYARGSDLV 451



 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 90/197 (45%), Gaps = 29/197 (14%)

Query: 411 HGLLPLPKSKSVSLALIGPNAN----SAKTLLGNYA-GPSCRSITP---LQALQNYVENT 462
           HG L  P S +  L +   +        K L+  ++  P  RS T    LQA + Y    
Sbjct: 537 HGQLLPPVSGNYELQIAADDGVRLYLDGKPLIDQWSDAPRMRSSTATVALQAGKAYDLRV 596

Query: 463 VYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEKEELD------- 507
            YY          A         + +AVD A+ AD VV + GL    E EE+D       
Sbjct: 597 EYYEATRDAGVRLAWRMPGAKPPLQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFA 656

Query: 508 ---RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
              R D  LP  Q+EL+ +  +A   PV+ VL  G  + I +A+  +++ +IL A YPG+
Sbjct: 657 GGDRTDTRLPKPQRELL-QALQATGTPVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQ 713

Query: 565 AGAVALAEVIFGDHNPG 581
            G  A+ +V+FG  +PG
Sbjct: 714 RGGTAVGDVLFGQASPG 730


>gi|423281966|ref|ZP_17260851.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
           615]
 gi|404582453|gb|EKA87147.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
           615]
          Length = 722

 Score =  317 bits (812), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 187/536 (34%), Positives = 294/536 (54%), Gaps = 43/536 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA  G+        
Sbjct: 57  PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  R+  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           GR +ET GEDP +T +  V++V+G+QGD  T+        L+  A  KHF A + +N   
Sbjct: 165 GRNEETYGEDPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R+   +++  + L + Y P +E+CVK+  A  +M AYN  NG+P      LL    R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
           +WGF G++ SDC A+ ++        S E+A    + +G D+ CG+  ++    AV+Q  
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
           + E+ IDRAL  + + R +LG F+    + P+      ++       LA +AA   +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
           KN   LLPL K K  S+A++GP A+     LG Y+G    S++ L+ ++  +       Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
             G  T      S D    + KGAD V++ +G D+   +E  D   + LP  Q++L+  +
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEEQEKLLKEI 502

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   + + LV   G P+   +A  D +I +I+ A YPG+    ALA ++FG+ NP
Sbjct: 503 YQVNPR-IALVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENP 555


>gi|325919363|ref|ZP_08181395.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
 gi|325550152|gb|EGD20974.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
          Length = 876

 Score =  317 bits (812), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 185/437 (42%), Positives = 250/437 (57%), Gaps = 33/437 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+  T  P   RA DLV+R+TL+EK +Q+ N+APAIPRL +P Y+WW+EALHGVA  G  
Sbjct: 19  PYLDTQRPFDARAADLVARMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 76

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTF 156
                   GAT FPQ I  AA+FD+ L   +  AI  EARA ++A  A        G+TF
Sbjct: 77  --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEYKRYQGLTF 128

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+Q       KL       A  KH
Sbjct: 129 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKL------DATAKH 182

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           F  +   +     R+ FD   + +DL +TY P F++ V++G+ + +M AYNRVNG  S +
Sbjct: 183 FAVH---SGPEADRHHFDVHPSERDLHETYLPAFQALVQEGKVAAVMGAYNRVNG-ESAS 238

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ 336
               L    RR WGF GYI SDC A+  I+       +PE A    +K G D++CG    
Sbjct: 239 ASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTDLDCGDTYA 298

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
               AAV+   + E+ ID AL  L + RMRLG+F+  P   P+ +I A    SP H  LA
Sbjct: 299 A-LPAAVRAGLIDEATIDTALKRLMTTRMRLGMFD-PPAKVPWAQIPASANQSPQHDALA 356

Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
            + A++ +VLLKN  G+LPL K     +A+IGP A+   +LLGNY G     +T LQ ++
Sbjct: 357 RRTARESLVLLKND-GVLPL-KPTLKRIAVIGPTADDPMSLLGNYYGTPAAPVTILQGIR 414

Query: 457 NYV--ENTVYYPGCDTV 471
           +       +Y  G D V
Sbjct: 415 DAAPQAQVIYARGSDLV 431



 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/114 (35%), Positives = 64/114 (56%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELD----------RVDLVLPGRQQELITRVAEA 527
           + +AVD A+ A+ VV + GL    E EE+D          R D  LP  Q+EL+ +  +A
Sbjct: 600 LQEAVDAARDAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQA 658

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
              PV+ VL  G  + I +A+  +++ +IL A YPG+ G  A+ +V+FG  +PG
Sbjct: 659 TGTPVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGSAVGDVLFGQASPG 710


>gi|157676888|emb|CAP07659.1| beta-xylosidase [uncultured rumen bacterium]
          Length = 761

 Score =  317 bits (811), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 220/636 (34%), Positives = 309/636 (48%), Gaps = 101/636 (15%)

Query: 13  LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
           L LCF + L          SC P   S     +   + P   RA+ L+ +L+L+EK   +
Sbjct: 4   LRLCFRAALAAACMMAAFASCAPQEIS-----YTDKSQPAELRAKALLPKLSLEEKAGLV 58

Query: 73  VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
             ++PA+ RLGI AY WWSEALHGVA  G           AT FPQ I  AASFD     
Sbjct: 59  QYNSPAVERLGIKAYNWWSEALHGVARNGS----------ATVFPQPIGMAASFDVEKIE 108

Query: 133 RIGQAIGLEARALYNAG-------QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
            +  A+  EAR             Q  G++FW PNINIFRDPRWGRG ET GEDP + G+
Sbjct: 109 TVFTAVSDEARVKNRIAAEDGRVYQYAGLSFWTPNINIFRDPRWGRGMETYGEDPYLMGQ 168

Query: 186 YAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADT 245
             ++ VRG+QGD          L+  AC KH+  +   +   + R++FDA+V+ +DL +T
Sbjct: 169 LGMAVVRGLQGDPD-----ADVLKTHACAKHYAVH---SGLESNRHRFDAQVSERDLRET 220

Query: 246 YQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSII 305
           Y P F+  V +     +M AYNR  G P  A   L+ K  R +WG+ G + SDC A+   
Sbjct: 221 YLPAFKDLVTKAGVKEVMTAYNRFRGYPCAASEYLVQKILREEWGYKGLVVSDCWAIPDF 280

Query: 306 YD--AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
           ++    G+  + E+A    +  G+DV CGS   K   AA+ Q  L E ++DR L  + + 
Sbjct: 281 FEPGRHGFVATGEEAAALAVANGLDVECGSTFSK-IPAAIDQGLLKEEDLDRNLLRVLTE 339

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
           R RLG  +G     P+  +   +V  P H+ L+L  A++ +VLL+N +G+LPL   + + 
Sbjct: 340 RFRLGEMDGE---SPWDDLDPAIVEGPEHRALSLDIARETMVLLRN-NGVLPLKAGEKI- 394

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC--------------- 468
            ALIGPNA+ A+   GNY      +IT LQA+Q  V   VY   C               
Sbjct: 395 -ALIGPNADDAQMQWGNYNPVPKSTITLLQAMQARVPGLVYDRACGILDAEYAPQGSAYA 453

Query: 469 DTVACSSASIDKAV--------DIA--------------------------KGADHVVLM 494
           + +  S A ++ A         DI                           +G D VV  
Sbjct: 454 NLIGASEAQLEAAARRYAVSVNDIKNYIRRDEEQRRSFMPALDEAAVLKKLEGVDVVVFA 513

Query: 495 MGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
            G+    E EE+          DR D+ LPG Q+ L+  + +A KK V+LV   G    I
Sbjct: 514 GGISPRLEGEEMRVQVPGFSGGDRTDIELPGVQRRLLKALHDAGKK-VVLVNFSG--CAI 570

Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
                  +  +IL A YPG+ G  A+A+V+FGD NP
Sbjct: 571 GLVPETESCDAILQAWYPGQEGGTAIADVLFGDVNP 606


>gi|255013451|ref|ZP_05285577.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
 gi|410103695|ref|ZP_11298616.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
 gi|409236424|gb|EKN29231.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
          Length = 868

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 179/447 (40%), Positives = 244/447 (54%), Gaps = 37/447 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           + +PF    LP+ +R  DL+SRLT +EKI Q++N  PAI RLGIP Y+WW+EALHGVA  
Sbjct: 24  QDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARA 83

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
           G+          AT FPQ I  AA+FD    +     +  EARA Y+  Q         G
Sbjct: 84  GR----------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKG 133

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PNINIFRDPRWGRG ET GEDP +T K  V+  RG+QGD  N        +  AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPN------YYKTHAC 187

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  +    W    R++FD   T +DL +TY P FE+ VK+G    +MCAYNR  G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKP 244

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG----YAKSP--EDAVVDVLKAGM 327
            C+   LL    R  WG+   I SDC A+   +  +     +   P  E A  D +  G 
Sbjct: 245 CCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGT 304

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D+ CG   +   K A+   K+ E ++D +L  L   R  LG+F+ +  + P+ KI   VV
Sbjct: 305 DLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVV 362

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            SP H   AL  A+  IVLLKN + +LPL K+    +A++GPNA  +  L  NY G   +
Sbjct: 363 ESPEHIAKALDMARKSIVLLKNKNNMLPLDKNIK-KIAVVGPNAADSTMLWANYNGFPTK 421

Query: 448 SITPLQALQNYVEN--TVYYPGCDTVA 472
           ++T ++ ++N V N   +Y  GC+  A
Sbjct: 422 TVTIVEGIRNKVPNAEVIYELGCNHTA 448



 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD +V + G+    E EE+          DR ++ +P  Q+E++  +  A  KPV+ V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  G  + + +   + ++ +IL A Y G+ G  A+A+V+FGD+NP
Sbjct: 662 VCTGSALALNWE--NDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704


>gi|423269271|ref|ZP_17248243.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
           CL05T00C42]
 gi|423273165|ref|ZP_17252112.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
           CL05T12C13]
 gi|392701693|gb|EIY94850.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
           CL05T00C42]
 gi|392708197|gb|EIZ01305.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
           CL05T12C13]
          Length = 722

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 186/536 (34%), Positives = 295/536 (55%), Gaps = 43/536 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P++ R R L+ ++TL EK++QLV+ + +IPRL +PAY +W+E LHGVA  G+        
Sbjct: 57  PVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGE-------- 108

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  R+  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 109 --VTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 164

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           GR +ET GE+P +T +  V++V+G+QGD  T+        L+  A  KHF A + +N   
Sbjct: 165 GRNEETYGEEPHLTSRLGVAFVKGLQGDHPTY--------LKTVATIKHFVANNEEN--- 213

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R+   +++  + L + Y P +E+CVK+  A  +M AYN  NG+P      LL    R+
Sbjct: 214 -NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHWLLDDVLRK 272

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
           +WGF G++ SDC A+ ++        S E+A    + +G D+ CG+  ++    AV+Q  
Sbjct: 273 EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVEQGL 332

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
           + E+ IDRAL  + + R +LG F+    + P+      ++       LA +AA   +VLL
Sbjct: 333 ISEAAIDRALTRVLTARFKLGEFDPM-ELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLL 391

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVY 464
           KN   LLPL K K  S+A++GP A+     LG Y+G    S++ L+ ++  +       Y
Sbjct: 392 KND-ALLPLNKEKIKSVAVVGPFADY--NYLGGYSGQPPYSVSLLKGVKELIGKKGKVTY 448

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
             G  T      S D    + KGAD V++ +G D+   +E  D   + LP  Q++L+  +
Sbjct: 449 LNGMGT------SADSIAQVVKGADIVLVALGSDEKMARENHDMPSIYLPEGQEKLLKEI 502

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +   + ++LV   G P+   +A  D +I +I+ A YPG+    ALA ++FG+ NP
Sbjct: 503 YQVNPR-IVLVFHTGNPLTSEWA--DTHIPAIMQAWYPGQEAGRALANLLFGNENP 555


>gi|423331656|ref|ZP_17309440.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
           CL03T12C09]
 gi|409230226|gb|EKN23094.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
           CL03T12C09]
          Length = 868

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 179/447 (40%), Positives = 244/447 (54%), Gaps = 37/447 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           + +PF    LP+ +R  DL+SRLT +EKI Q++N  PAI RLGIP Y+WW+EALHGVA  
Sbjct: 24  QDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARA 83

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
           G+          AT FPQ I  AA+FD    +     +  EARA Y+  Q         G
Sbjct: 84  GR----------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKG 133

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PNINIFRDPRWGRG ET GEDP +T K  V+  RG+QGD  N        +  AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPN------YYKTHAC 187

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  +    W    R++FD   T +DL +TY P FE+ VK+G    +MCAYNR  G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKP 244

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG----YAKSP--EDAVVDVLKAGM 327
            C+   LL    R  WG+   I SDC A+   +  +     +   P  E A  D +  G 
Sbjct: 245 CCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGT 304

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D+ CG   +   K A+   K+ E ++D +L  L   R  LG+F+ +  + P+ KI   VV
Sbjct: 305 DLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVV 362

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            SP H   AL  A+  IVLLKN + +LPL K+    +A++GPNA  +  L  NY G   +
Sbjct: 363 ESPEHIAKALDMARKSIVLLKNKNNMLPLDKNIK-KIAVVGPNAADSTMLWANYNGFPTK 421

Query: 448 SITPLQALQNYVEN--TVYYPGCDTVA 472
           ++T ++ ++N V N   +Y  GC+  A
Sbjct: 422 TVTIVEGIRNKVPNAEVIYELGCNHTA 448



 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD +V + G+    E EE+          DR ++ +P  Q+E++  +  A  KPV+ V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  G  + + +   + ++ +IL A Y G+ G  A+A+V+FGD+NP
Sbjct: 662 VCTGSALALNWE--NDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704


>gi|150007848|ref|YP_001302591.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
           8503]
 gi|301310124|ref|ZP_07216063.1| beta-glucosidase [Bacteroides sp. 20_3]
 gi|423336365|ref|ZP_17314112.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
           CL09T03C24]
 gi|149936272|gb|ABR42969.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Parabacteroides distasonis ATCC 8503]
 gi|300831698|gb|EFK62329.1| beta-glucosidase [Bacteroides sp. 20_3]
 gi|409240840|gb|EKN33614.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
           CL09T03C24]
          Length = 868

 Score =  316 bits (809), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 179/447 (40%), Positives = 245/447 (54%), Gaps = 37/447 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           + +PF    LP+ +R  DL+SRLT +EKI Q++N  PAI RLGIP Y+WW+EALHGVA  
Sbjct: 24  QDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARA 83

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
           G+          AT FPQ I  AA+FD    +     +  EARA Y+  Q         G
Sbjct: 84  GR----------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKG 133

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PNINIFRDPRWGRG ET GEDP +T K  V+  RG+QGD  N        +  AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPN------YYKTHAC 187

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  +    W    R++F+A  T +DL +TY P FE+ VK+G    +MCAYNR  G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKP 244

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG----YAKSP--EDAVVDVLKAGM 327
            C+   LL    R  WG+   I SDC A+   +  +     +   P  E A  D +  G 
Sbjct: 245 CCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGT 304

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D+ CG   +   K A+   K+ E ++D +L  L   R  LG+F+ +  + P+ KI   VV
Sbjct: 305 DLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVV 362

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            SP H   AL  A+  IVLLKN + +LPL K+    +A++GPNA  +  L  NY G   +
Sbjct: 363 ESPEHIAKALDMARKSIVLLKNKNNMLPLDKNIK-KIAVVGPNAADSTMLWANYNGFPTK 421

Query: 448 SITPLQALQNYVEN--TVYYPGCDTVA 472
           ++T ++ ++N V N   +Y  GC+  A
Sbjct: 422 TVTIVEGIRNKVPNAEVIYELGCNHTA 448



 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD +V + G+    E EE+          DR ++ +P  Q+E++  +  A  KPV+ V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  G  + + +   + ++ +IL A Y G+ G  A+A+V+FGD+NP
Sbjct: 662 VCTGSALALNWE--NDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704


>gi|375149998|ref|YP_005012439.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361064044|gb|AEW03036.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 875

 Score =  316 bits (809), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 175/422 (41%), Positives = 238/422 (56%), Gaps = 28/422 (6%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           FPF    L    R  DLVSRLTL+EK++Q++N+AP IPRL IPAY+WW+E LHGVA    
Sbjct: 28  FPFQNYRLSFEDRVNDLVSRLTLEEKVAQMLNAAPGIPRLDIPAYDWWNETLHGVAR--- 84

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA--------IGM 154
                 T    T FPQ I  AA++D+   YR+     LE R ++N   A        +G+
Sbjct: 85  ------TPYNVTVFPQAIAMAATWDTAALYRMADCSALEGRVIHNKAIAAGKEKDRYLGL 138

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           T+W PNINIFRDPRWGRGQET GEDP +T   A ++VRG+QG+          L+A+AC 
Sbjct: 139 TYWTPNINIFRDPRWGRGQETYGEDPYLTAALADAFVRGLQGND------PKYLKAAACA 192

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KH   Y + +    +R+ FD  VT  DL DTY P F+  V     +G+MCAYN     P 
Sbjct: 193 KH---YAVHSGPEPSRHVFDVDVTPYDLWDTYLPSFKKLVTVSNVAGVMCAYNAFRKQPC 249

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           CA   L++   R QW F GY+TSDC A+   Y           A  D +  G D++CG+ 
Sbjct: 250 CASDVLMTDILRNQWSFKGYVTSDCGAIDDFYRNHKTHPDAAAASADAVFHGTDIDCGNE 309

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             +    AVK+ K+ E +ID ++  LF +R RLG+F+  P+M  + +  A  + S AH  
Sbjct: 310 AYRALVQAVKENKITEKQIDISVKRLFMIRFRLGMFDP-PSMVKYAQTPATELESAAHAK 368

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
            AL  A + IVLLKN++  LPL K     + ++GPNA +    LGNY+G   + IT  Q 
Sbjct: 369 HALLMAHESIVLLKNANNTLPLKKGLK-KIVVLGPNATNVIAPLGNYSGTPSKLITLFQG 427

Query: 455 LQ 456
           ++
Sbjct: 428 IK 429



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 13/103 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           AD  +   G+    E EE+          DR  ++LP  Q EL+ +  +A+ KPV+ V++
Sbjct: 607 ADAFIFAGGISPQLEGEEMKVSDPGFKGGDRTTILLPAIQTELM-KALQASGKPVVFVMM 665

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            G  +   +     NI +I+ A Y G+A   ALA+V+FGD+NP
Sbjct: 666 TGSALATPWES--ENIPAIVNAWYGGQAAGTALADVLFGDYNP 706


>gi|256840106|ref|ZP_05545615.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
 gi|256739036|gb|EEU52361.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
          Length = 868

 Score =  316 bits (809), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 179/447 (40%), Positives = 245/447 (54%), Gaps = 37/447 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           + +PF    LP+ +R  DL+SRLT +EKI Q++N  PAI RLGIP Y+WW+EALHGVA  
Sbjct: 24  QDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARA 83

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
           G+          AT FPQ I  AA+FD    +     +  EARA Y+  Q         G
Sbjct: 84  GR----------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKG 133

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PNINIFRDPRWGRG ET GEDP +T K  V+  RG+QGD  N        +  AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPN------YYKTHAC 187

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  +    W    R++F+A  T +DL +TY P FE+ VK+G    +MCAYNR  G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKP 244

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG----YAKSP--EDAVVDVLKAGM 327
            C+   LL    R  WG+   I SDC A+   +  +     +   P  E A  D +  G 
Sbjct: 245 CCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGT 304

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D+ CG   +   K A+   K+ E ++D +L  L   R  LG+F+ +  + P+ KI   VV
Sbjct: 305 DLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVV 362

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            SP H   AL  A+  IVLLKN + +LPL K+    +A++GPNA  +  L  NY G   +
Sbjct: 363 ESPEHIAKALDMARKSIVLLKNKNNMLPLDKNIK-KIAVVGPNAADSTMLWANYNGFPSK 421

Query: 448 SITPLQALQNYVEN--TVYYPGCDTVA 472
           ++T ++ ++N V N   +Y  GC+  A
Sbjct: 422 TVTIVEGIRNKVPNAEVIYELGCNHTA 448



 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD +V + G+    E EE+          DR ++ +P  Q+E++  +  A  KPV+ V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  G  + + +   + ++ +IL A Y G+ G  A+A+V+FGD+NP
Sbjct: 662 VCTGSALALNWE--NDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704


>gi|298376791|ref|ZP_06986746.1| beta-glucosidase [Bacteroides sp. 3_1_19]
 gi|298266669|gb|EFI08327.1| beta-glucosidase [Bacteroides sp. 3_1_19]
          Length = 868

 Score =  316 bits (809), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 179/447 (40%), Positives = 245/447 (54%), Gaps = 37/447 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           + +PF    LP+ +R  DL+SRLT +EKI Q++N  PAI RLGIP Y+WW+EALHGVA  
Sbjct: 24  QDYPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARA 83

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
           G+          AT FPQ I  AA+FD    +     +  EARA Y+  Q         G
Sbjct: 84  GR----------ATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKG 133

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PNINIFRDPRWGRG ET GEDP +T K  V+  RG+QGD  N        +  AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPN------YYKTHAC 187

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  +    W    R++F+A  T +DL +TY P FE+ VK+G    +MCAYNR  G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKP 244

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG----YAKSP--EDAVVDVLKAGM 327
            C+   LL    R  WG+   I SDC A+   +  +     +   P  E A  D +  G 
Sbjct: 245 CCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGT 304

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D+ CG   +   K A+   K+ E ++D +L  L   R  LG+F+ +  + P+ KI   VV
Sbjct: 305 DLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVV 362

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            SP H   AL  A+  IVLLKN + +LPL K+    +A++GPNA  +  L  NY G   +
Sbjct: 363 ESPEHIAKALDMARKSIVLLKNKNNMLPLDKNIK-KIAVVGPNAADSTMLWANYNGFPSK 421

Query: 448 SITPLQALQNYVEN--TVYYPGCDTVA 472
           ++T ++ ++N V N   +Y  GC+  A
Sbjct: 422 TVTIVEGIRNKVPNAEVIYELGCNHTA 448



 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD +V + G+    E EE+          DR ++ +P  Q+E++  +  A  KPV+ V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  G  + + +   + ++ +IL A Y G+ G  A+A+V+FGD+NP
Sbjct: 662 VCTGSALALNWE--NDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704


>gi|373951852|ref|ZP_09611812.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373888452|gb|EHQ24349.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 871

 Score =  315 bits (808), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 169/425 (39%), Positives = 243/425 (57%), Gaps = 28/425 (6%)

Query: 40  TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
           T  +P+    L  + R  DLV R+TL+EK+SQ++NS+PAIPRL IPAY+WW+E LHGVA 
Sbjct: 23  TSDYPYQNYHLDFTTRVNDLVKRMTLEEKVSQMLNSSPAIPRLKIPAYDWWNEVLHGVAR 82

Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QA 151
                    T    T +PQ I  AA+FD     ++     LE RA++N          + 
Sbjct: 83  ---------TPFKVTVYPQAIAMAATFDRQSLNQMADYAALEGRAVHNKALQMRKPGEKY 133

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
           +G+T+W PNINIFRDPRWGRGQET GEDP +TG    ++V G+QG   N  K    L+A+
Sbjct: 134 LGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGAMGSAFVSGLQG---NDPKY---LKAA 187

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           AC KH   Y + +     R+ F+A ++  DL DTY P F+  V   + +G+MCAYN    
Sbjct: 188 ACAKH---YAVHSGPEPLRHVFNADISTYDLWDTYLPAFKKLVVDDKVAGVMCAYNAFKT 244

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
            P C    L+    R QW F GY+TSDC  +   +       + EDA  D +  G D+ C
Sbjct: 245 QPCCGSDLLMVDILRNQWKFSGYVTSDCGGIDDFFKNHKTHATAEDASTDAVLHGTDIEC 304

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G+   K   AAVK+ K+ E++ID ++  LF +R RLG+F+ +  ++ + +    V+ SP 
Sbjct: 305 GTDAYKSLVAAVKEGKISETQIDISVKRLFMIRFRLGMFDPSDVVK-YAQTPVSVLESPE 363

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           HQ  AL+ A+  +VLLKN++  LPL K+    + ++GPNA++   +LGNY G      T 
Sbjct: 364 HQAHALKMARQSVVLLKNANHTLPLSKTIR-KIVVLGPNADNPIAILGNYNGTPSNLTTV 422

Query: 452 LQALQ 456
            Q ++
Sbjct: 423 YQGIR 427



 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 34/103 (33%), Positives = 54/103 (52%), Gaps = 13/103 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           AD +V + G+    E EE+          DR  + LP  Q  L+ +  +A  KPV+ V++
Sbjct: 603 ADAIVYVGGISPQLEGEEMQVNYPGFNGGDRTSIQLPAAQTNLM-KTLQATGKPVVFVMM 661

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            G  +   +     NI +I+ A Y G+A   A+A+V+FGD+NP
Sbjct: 662 TGSALATPWEA--ENIPAIVNAWYGGQAAGTAVADVLFGDYNP 702


>gi|262381651|ref|ZP_06074789.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
 gi|262296828|gb|EEY84758.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
          Length = 868

 Score =  315 bits (808), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 179/447 (40%), Positives = 245/447 (54%), Gaps = 37/447 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           + +PF    LP+ +R  DL+SRLT +EKI Q++N  PAI RLGIP Y+WW+EALHGVA  
Sbjct: 24  QDYPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARA 83

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IG 153
           G+          AT FPQ I  AA+FD    +     +  EARA Y+  Q         G
Sbjct: 84  GR----------ATVFPQAIAMAATFDDNAVHETFTIVSDEARAKYHQYQKDKEYDRYKG 133

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PNINIFRDPRWGRG ET GEDP +T K  V+  RG+QGD  N        +  AC
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQGDDPN------YYKTHAC 187

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  +    W    R++F+A  T +DL +TY P FE+ VK+G    +MCAYNR  G P
Sbjct: 188 AKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNRFEGKP 244

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG----YAKSP--EDAVVDVLKAGM 327
            C+   LL    R  WG+   I SDC A+   +  +     +   P  E A  D +  G 
Sbjct: 245 CCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDAESASADAVLNGT 304

Query: 328 DVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           D+ CG   +   K A+   K+ E ++D +L  L   R  LG+F+ +  + P+ KI   VV
Sbjct: 305 DLECGGSYRALNK-ALADGKISEKDLDVSLRRLLKGRFELGMFDPDERV-PYSKIPYSVV 362

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            SP H   AL  A+  IVLLKN + +LPL K+    +A++GPNA  +  L  NY G   +
Sbjct: 363 ESPEHIAKALDMARKSIVLLKNKNNMLPLDKNIK-KIAVVGPNAADSTMLWANYNGFPSK 421

Query: 448 SITPLQALQNYVEN--TVYYPGCDTVA 472
           ++T ++ ++N V N   +Y  GC+  A
Sbjct: 422 TVTIVEGIRNKVPNAEVIYELGCNHTA 448



 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 58/105 (55%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD +V + G+    E EE+          DR ++ +P  Q+E++  +  A  KPV+ V
Sbjct: 603 KDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEMVKALV-ATGKPVVYV 661

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  G  + + +   + ++ +IL A Y G+ G  A+A+V+FGD+NP
Sbjct: 662 VCTGSALALNWE--NDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704


>gi|225873995|ref|YP_002755454.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
 gi|225792796|gb|ACO32886.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
          Length = 896

 Score =  315 bits (808), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 178/450 (39%), Positives = 252/450 (56%), Gaps = 39/450 (8%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+     PI +R  +LVS++TL E+ +Q++N+APAIPRLG+PAY WWSE LHG+A  G  
Sbjct: 38  PWDNPNQPIQKRVHELVSQMTLQEEAAQMMNTAPAIPRLGVPAYNWWSEGLHGIARSGY- 96

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTF 156
                    AT FPQ I  +A+FD    +++G  +  EARA YN            G+T 
Sbjct: 97  ---------ATVFPQAIGMSATFDPAAIHQMGTTVSTEARAKYNWAIRHDIHSIYFGLTL 147

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           WAPNINI RDPRWGRGQET GEDP +TG  A  YV G+QG+          L+  A  KH
Sbjct: 148 WAPNINIVRDPRWGRGQETYGEDPFLTGTMAAEYVSGLQGNN------PKYLKTVATPKH 201

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           F+ Y   N   + R+K +A  +  D+ DTY   F   + +G A  +MC+YN V G+PSCA
Sbjct: 202 FSVY---NGPESMRHKINANPSAHDMQDTYLAAFRMAITKGHADSMMCSYNAVYGVPSCA 258

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           ++ LL+   R +WGF GYITSDC A+S  Y   A GY+     A    + AG D +CG+ 
Sbjct: 259 NK-LLADVVRGKWGFDGYITSDCGAISDFYRPGAHGYSPDAVHAAASAVLAGTDTDCGTG 317

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             K    +V+Q  + ++ IDRA+  LF+ R RLG+F+    + P+  I   VV S AH+ 
Sbjct: 318 Y-KVLPQSVQQGLISKAAIDRAVERLFTARFRLGMFDPKADV-PYNSIPYSVVDSAAHRA 375

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
            AL+ A   +VLLKN  G+LPL  ++++  A++GPNA +  ++ GNY         P+  
Sbjct: 376 QALEDASKSMVLLKNEGGILPLRNARTI--AVVGPNAANLNSIEGNYNAIPSHPSLPVDG 433

Query: 455 LQNYVENTVYYPGCDTVACSSASIDKAVDI 484
           ++        +P    V    +S D  + +
Sbjct: 434 IE------AAFPQAHVVYAQGSSFDAVLPV 457



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/97 (41%), Positives = 54/97 (55%), Gaps = 13/97 (13%)

Query: 495 MGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
           MGL  T   EE+          DR  L LP  QQ+L+  +  A  KPV+LVLL G  + I
Sbjct: 633 MGLSPTFVSEEMPIKIPGFDGGDRTRLSLPQTQQDLLHALV-ATGKPVVLVLLNGSALSI 691

Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +AK  +++  IL A YPGEAG  A+ E + G ++PG
Sbjct: 692 DWAK--QHVQGILEAWYPGEAGGEAIGETLSGQNDPG 726


>gi|374312362|ref|YP_005058792.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358754372|gb|AEU37762.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 874

 Score =  315 bits (807), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 177/441 (40%), Positives = 253/441 (57%), Gaps = 31/441 (7%)

Query: 47  KTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF 106
           +T   I  R  +L++++T+ E+I+QL + APAI RLG+P+Y WW+E LHG+A  G     
Sbjct: 30  QTPQQIDARIDELIAKMTVSERIAQLQDRAPAIERLGLPSYNWWNEGLHGLARDGY---- 85

Query: 107 NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY------NAGQAIGMTFWAPN 160
                 AT FPQ I  AA++D+ L + +G  +  EARA +      N  +  G+T W+PN
Sbjct: 86  ------ATVFPQAIGLAATWDAPLLHEVGDVVSTEARAKFYSHGGENTPRFGGLTVWSPN 139

Query: 161 INIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFTA 219
           INIFRDPRWGRGQET GEDP +T      +V GVQG D F        L+A A  KHF A
Sbjct: 140 INIFRDPRWGRGQETYGEDPFLTATLGTQFVEGVQGNDPF-------YLKADATPKHFAA 192

Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
           +   +     R  F+A V+  DLADTY P F +      A+ +MC+YN ++G PSCA  N
Sbjct: 193 H---SGPEEGRDSFNAVVSPHDLADTYLPAFHALTTNAHAAALMCSYNEIDGTPSCASGN 249

Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
            L    R +WGF GY+ SDCDAV  I     +A        D L AG+D++CG+     +
Sbjct: 250 NLQDLVRERWGFKGYVVSDCDAVGNIAGYHHFATDNAHGAADALNAGVDLDCGNTYAALS 309

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
           K ++ Q    E+++++ALH L   R+RLG+ +   +  P+  IGA+ + SPAH  LAL+A
Sbjct: 310 K-SLDQNLTTEAKLNQALHRLLLARVRLGMLD-PLSCSPYRDIGAEELDSPAHHTLALRA 367

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A++ IVLLKN  G+LPL ++ +  +++IGP A+  K L  NY G +   ITPL   ++  
Sbjct: 368 AEESIVLLKND-GVLPL-QASTQKVSVIGPTADMVKVLEANYHGTALHPITPLDGFRSRF 425

Query: 460 ENTVYYPGCDTVACSSASIDK 480
            +  Y  G       SA + +
Sbjct: 426 HDVSYAQGSLLAEGVSAPVPR 446



 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 61/117 (52%), Gaps = 13/117 (11%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
           ++A +D+AV  A  +D +V  +GL    E E L          DR  L LP  Q+ L++R
Sbjct: 591 AAALLDQAVQTAAKSDVIVAFVGLSPDLEGEALQLRLKGFNGGDRTSLDLPEAQRTLLSR 650

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + +   KPVI+VL  G    +      ++   +L A YPGEAG  ALA ++ G+ NP
Sbjct: 651 LTQL-HKPVIIVLTSGS--GVALGPEAKDAAGVLEAWYPGEAGGEALAGILAGNVNP 704


>gi|21233528|ref|NP_639445.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66770493|ref|YP_245255.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|21115383|gb|AAM43327.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66575825|gb|AAY51235.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 896

 Score =  315 bits (807), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 185/445 (41%), Positives = 254/445 (57%), Gaps = 33/445 (7%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           ++ +T   P+   T P+  RA DLVSR+TL+EK +Q+ N+APAIPRL +P Y+WW+EALH
Sbjct: 31  THAATPPPPYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALH 90

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA---- 151
           GVA  G          GAT FPQ I  AA+FD+ L   +  AI  EARA ++A  A    
Sbjct: 91  GVARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEH 140

Query: 152 ---IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
               G+TFW+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+Q       KL    
Sbjct: 141 KRYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKL---- 196

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
              A  KH+  +   +     R+ FD   + +DL +TY P F++ V++G  + +M AYNR
Sbjct: 197 --DATAKHYAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNR 251

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG  + A    L    RR WGF GYI SDC A+  I+       +PE A    +K G D
Sbjct: 252 VNGESASAS-TRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTD 310

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           ++CG        AAV+   + E+ IDR+L  L + R+RLG+F+  P   P+ +  A    
Sbjct: 311 LDCGDTYAA-LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQ 368

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           SP H  LA + A++ +VLLKN  GLLPL K     +A++GP A+   +LLGNY G     
Sbjct: 369 SPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMSLLGNYYGTPAAP 426

Query: 449 ITPLQALQNYV--ENTVYYPGCDTV 471
           +T LQ +++       VY  G D V
Sbjct: 427 VTILQGIRDAAPQAEVVYARGSDLV 451



 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 90/197 (45%), Gaps = 29/197 (14%)

Query: 411 HGLLPLPKSKSVSLALIGPNAN----SAKTLLGNYA-GPSCRSITP---LQALQNYVENT 462
           HG L  P S +  L +   +        K L+  ++  P  RS T    LQA + Y    
Sbjct: 537 HGQLLPPVSGNYELQIAADDGVRLYLDGKPLIDQWSDAPRMRSSTATVALQAGKAYDLRV 596

Query: 463 VYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEKEELD------- 507
            YY          A         + +AVD A+ AD VV + GL    E EE+D       
Sbjct: 597 EYYEATRDAGVRLAWRMPGAKPPLQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFA 656

Query: 508 ---RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
              R D  LP  Q+EL+ +  +A   PV+ VL  G  + I +A+  +++ +IL A YPG+
Sbjct: 657 GGDRTDTRLPKPQRELL-QALQATGTPVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQ 713

Query: 565 AGAVALAEVIFGDHNPG 581
            G  A+ +V+FG  +PG
Sbjct: 714 RGGTAVGDVLFGQASPG 730


>gi|389794400|ref|ZP_10197553.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
 gi|388432423|gb|EIL89432.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
          Length = 902

 Score =  315 bits (807), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 185/448 (41%), Positives = 250/448 (55%), Gaps = 33/448 (7%)

Query: 38  PSTETFPFCKT-TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
           PS  + P  +  +     RA DLV+ +TL+EK +Q+ N+APAIPRLG+ AY+WW+E LHG
Sbjct: 39  PSAASEPVYRDLSRSFHDRAADLVAHMTLEEKAAQMQNTAPAIPRLGVAAYDWWNEGLHG 98

Query: 97  VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------G 149
           VA  G+          AT FPQ I  AA+FD  L + +  AI  EARA YN        G
Sbjct: 99  VARAGQ----------ATVFPQAIGLAATFDVPLMHEVATAISDEARAKYNEFQRKGSHG 148

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
           +  G+T+W+PNINIFRDPRWGRGQET GEDP +T +  V++V G+QGD     KL     
Sbjct: 149 RYEGLTYWSPNINIFRDPRWGRGQETYGEDPYLTERMGVAFVTGLQGDNPTYRKL----- 203

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
             A  KHF  +   +     R+ FD   + +DL +TY P F++ V++     +M AYNRV
Sbjct: 204 -DATAKHFAVH---SGPEADRHHFDVHPSERDLYETYLPAFQTLVQEADVDAVMSAYNRV 259

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV 329
           NG P+     LL +  R+ WGF GY+ SDC AV  IY       + E A    +K G+D+
Sbjct: 260 NGEPATGSPRLLGQILRKDWGFKGYVVSDCGAVEDIYKHHKVVDTVEAASALAVKNGVDL 319

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           +CG+      K AV    + ESEID AL  L   RMRLG+F+    + P+  +   V  S
Sbjct: 320 DCGTEYAALVK-AVHDGLIKESEIDAALTRLMQARMRLGMFDPASKV-PWSDVPYSVNQS 377

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
           P H  LA +AA++ +VLLKN  G+LPL K     +A+IGP A+    L+GNY G     +
Sbjct: 378 PQHDALARRAARESMVLLKND-GVLPLSKDIK-HIAVIGPTADDVMALVGNYHGTPADPV 435

Query: 450 TPLQALQNYV--ENTVYYPGCDTVACSS 475
           T L+ ++        VY  G D V   S
Sbjct: 436 TILRGIREAAPQAKVVYARGVDLVEGRS 463



 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 57/100 (57%), Gaps = 13/100 (13%)

Query: 491 VVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
           VV   GL    E EE+          DR DL LP  Q++L+  + +A  KPV+LVL  G 
Sbjct: 641 VVFAGGLTSDVEGEEMKVNYPGFAGGDRTDLRLPATQRKLLEAL-QATGKPVVLVLTSGS 699

Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + + +A  ++++ ++L A YPG+ G  A+A+V+FG  +P
Sbjct: 700 ALAVDWA--NQHLPAVLLAWYPGQRGGNAVADVLFGKADP 737


>gi|384430040|ref|YP_005639401.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
           756C]
 gi|341939144|gb|AEL09283.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
           756C]
          Length = 896

 Score =  315 bits (807), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 185/445 (41%), Positives = 254/445 (57%), Gaps = 33/445 (7%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           ++ +T   P+   T P+  RA DLVSR+TL+EK +Q+ N+APAIPRL +P Y+WW+EALH
Sbjct: 31  THAATPPPPYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALH 90

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--- 152
           GVA  G          GAT FPQ I  AA+FD+ L   +  AI  EARA ++A  A    
Sbjct: 91  GVARAG----------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEH 140

Query: 153 ----GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
               G+TFW+PNINIFRDPRWGRGQET GEDP +T +  V++V+G+Q       KL    
Sbjct: 141 KRYQGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYRKL---- 196

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
              A  KH+  +   +     R+ FD   + +DL +TY P F++ V++G  + +M AYNR
Sbjct: 197 --DATAKHYAVH---SGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNR 251

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           VNG  + A    L    RR WGF GYI SDC A+  I+       +PE A    +K G D
Sbjct: 252 VNGESASAS-TRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPEAAAALGVKHGTD 310

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           ++CG        AAV+   + E+ IDR+L  L + R+RLG+F+  P   P+ +  A    
Sbjct: 311 LDCGDTYAA-LPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQ 368

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           SP H  LA + A++ +VLLKN  GLLPL K     +A++GP A+   +LLGNY G     
Sbjct: 369 SPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMSLLGNYYGTPAAP 426

Query: 449 ITPLQALQNYV--ENTVYYPGCDTV 471
           +T LQ +++       VY  G D V
Sbjct: 427 VTILQGIRDAAPQAEVVYARGSDLV 451



 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 90/197 (45%), Gaps = 29/197 (14%)

Query: 411 HGLLPLPKSKSVSLALIGPNAN----SAKTLLGNYA-GPSCRSITP---LQALQNYVENT 462
           HG L  P S +  L +   +        K L+  ++  P  RS T    LQA + Y    
Sbjct: 537 HGQLLPPVSGNYELQIAADDGVRLYLDGKPLIDQWSDAPRMRSGTATVALQAGKAYDLRV 596

Query: 463 VYYPGCDTVACSSA--------SIDKAVDIAKGADHVVLMMGLDQTQEKEELD------- 507
            YY          A         + +AVD A+ AD VV + GL    E EE+D       
Sbjct: 597 EYYEATRDAGVRLAWRMPGAKPPLQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFA 656

Query: 508 ---RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
              R D  LP  Q+EL+ +  +A   PV+ VL  G  + I +A+  +++ +IL A YPG+
Sbjct: 657 GGDRTDTRLPKPQRELL-QALQATGTPVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQ 713

Query: 565 AGAVALAEVIFGDHNPG 581
            G  A+ +V+FG  +PG
Sbjct: 714 RGGTAVGDVLFGQASPG 730


>gi|171695518|ref|XP_001912683.1| hypothetical protein [Podospora anserina S mat+]
 gi|170948001|emb|CAP60165.1| unnamed protein product [Podospora anserina S mat+]
          Length = 805

 Score =  315 bits (807), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 213/589 (36%), Positives = 297/589 (50%), Gaps = 69/589 (11%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVN-----SAP---------------AIPRLGIP 85
           C TT     RA  LV  L + EK+  LV       AP                  R+G+P
Sbjct: 36  CDTTASPPARAAALVQALNITEKLVNLVEYVKSREAPLGISIQLITPHSMSLGAERIGLP 95

Query: 86  AYEWWSEALHGVAGVGKGIFFNGT---IRGATSFPQVILTAASFDSYLWYRIGQAIGLEA 142
           AY WW+EALHGVA    G+ FN        ATSF   I  AA+FD+ L Y +   I  EA
Sbjct: 96  AYAWWNEALHGVAA-SPGVSFNQAGQEFSHATSFANTITLAAAFDNDLVYEVADTISTEA 154

Query: 143 RALYNAGQAIGMTFWAPNINIFRDPRWGRG------------------QETPGEDPLVTG 184
           RA  NA  A G+ +W PNIN ++DPRWGRG                  Q+TPGEDP+   
Sbjct: 155 RAFSNAELA-GLDYWTPNINPYKDPRWGRGHEVCYLSLLFRAVQLLRTQKTPGEDPVHIK 213

Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
            Y  + + G++G      K++   +  A CKHF AYDL+ W+G  RY+F+A VT QDL++
Sbjct: 214 GYVQALLEGLEGR----DKIR---KVIATCKHFAAYDLERWQGALRYRFNAVVTSQDLSE 266

Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDA 301
            Y  PF+ C +  +    MC+YN +NG P+CA   L+    R+ W +   + YITSDC+A
Sbjct: 267 YYLQPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDILRKHWNWTEHNNYITSDCNA 326

Query: 302 VS-IIYDAEGYAKSPEDAVVDVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEIDRAL 357
           +   + +   ++++P  A  D   AG D  C   G         A  Q  L E  IDRAL
Sbjct: 327 IQDFLPNFHNFSQTPAQAAADAYNAGTDTVCEVPGYPPLTDVIGAYNQSLLSEEIIDRAL 386

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
             L+   +R G  + + +  P+ KI    V +P  Q LALQ+A DGIVLLKN +GLLPL 
Sbjct: 387 RRLYEGLIRAGYLD-SASPHPYTKISWSQVNTPKAQALALQSATDGIVLLKN-NGLLPLD 444

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS 477
            +   ++ALIG  AN+ + +LG Y+G       P+ A      N  ++     V  SS S
Sbjct: 445 LTNK-TIALIGHWANATRQMLGGYSGIPPYYANPIYAATQL--NVTFHHAPGPVNQSSPS 501

Query: 478 IDK-----AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPV 532
            +      A+  A  +D ++ + G D +   E+ DR  +  P  Q  L+T +A+  K  +
Sbjct: 502 TNDTWTSPALSAASKSDIILYLGGTDLSIAAEDRDRDSIAWPSAQLSLLTSLAQMGKPTI 561

Query: 533 ILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +  L  G  VD T    + NI SILW GYPG++G  AL  +I G  +P 
Sbjct: 562 VARL--GDQVDDTPLLSNPNISSILWVGYPGQSGGTALLNIITGVSSPA 608


>gi|443717728|gb|ELU08656.1| hypothetical protein CAPTEDRAFT_228276 [Capitella teleta]
          Length = 731

 Score =  314 bits (805), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 204/555 (36%), Positives = 302/555 (54%), Gaps = 48/555 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDE----KISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           FPF   TL   +R  DLV RLT++E     ++Q   S   + RLG+  Y++ +E + GV 
Sbjct: 20  FPFEDVTLSWDKRVDDLVQRLTIEEVVNISVAQYGKSTIPVDRLGVKPYQFINECITGV- 78

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI------ 152
                         +T+FPQ I   ASF   L + + QAI  E R  YN           
Sbjct: 79  ----------RWENSTAFPQAIGLGASFSPDLAFNMSQAIARELRGFYNTEVKSQIYGHR 128

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
           G+  + P INI R P WGR QET GEDP ++G+ +V +V+G+QGD          +QAS 
Sbjct: 129 GVNCFTPVINIMRHPLWGRNQETYGEDPWLSGQLSVGFVKGLQGDH------PRYIQASG 182

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
            CKHF  ++       +R+ FDA+V+ +D   T+ P F++CV+ G  + IMC+YNR+NG+
Sbjct: 183 GCKHFDVHNGPENIPVSRFGFDAKVSERDWRMTFLPQFKTCVEAGSIN-IMCSYNRINGV 241

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN-- 330
           P+CA++ LL+   R++WGF+GY+ SD  A+  I     Y K+  +A  D +KAG +V   
Sbjct: 242 PACANKKLLTDILRKEWGFNGYVISDSGAIENIVYHHKYTKTLAEAAADSVKAGCNVELT 301

Query: 331 --CGSFLQK-HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT-MQPFGKIGADV 386
              GS +   +   AVKQ  + E E+   L      RMR G F  +P  M PF KI   V
Sbjct: 302 GATGSGVAYFNLLNAVKQNLISEEELRENLKKPMYSRMRQGEF--DPVDMNPFTKIDMSV 359

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA---G 443
           V S  HQ LA++A+    VL+KN + +LPL K +   LA+IGP A++A+TL G+Y     
Sbjct: 360 VLSQEHQDLAVKASAMSFVLMKNLNRVLPL-KKRFDRLAIIGPFADNAETLFGDYIPNWD 418

Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSS---ASIDKAVDIAKGADHVVLMMGLDQT 500
           P   S TP + L++  ++  Y  GCD  +C++    +I+KAV   KGA  V + +G+   
Sbjct: 419 PKFVS-TPYEGLKSLGDDVRYASGCDDPSCTNYDPKAIEKAV---KGAQFVFVCLGVGSN 474

Query: 501 QEKEELDRVDLVLPGRQQELITRVAEAAKK-PVILVLLCGGPVDITFAKYDRNIGSILWA 559
            E+E  DR DL LPG Q +++      +++ P++LVL   GPVD+T+ K    +  I+  
Sbjct: 475 LEREGHDRADLDLPGYQLQILKDAEFFSREAPLVLVLFNAGPVDLTWPKLSPEVDGIIEC 534

Query: 560 GYPGEAGAVALAEVI 574
            YP      AL +V+
Sbjct: 535 FYPAMGTGKALYQVV 549


>gi|427386425|ref|ZP_18882622.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726465|gb|EKU89330.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
           12058]
          Length = 864

 Score =  313 bits (803), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 173/430 (40%), Positives = 245/430 (56%), Gaps = 28/430 (6%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
            + F    LP+ +R  DLV  LTL+EKISQ++N+APAI RLGIPAY WW+E LHGVA   
Sbjct: 23  VYKFQNPDLPVEERVNDLVGHLTLEEKISQMMNNAPAIERLGIPAYNWWNECLHGVAR-- 80

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGM 154
                  +    TSFPQ I  AA++D+   Y++ +    E RA+Y+        G   G+
Sbjct: 81  -------SPYPVTSFPQAIAMAATWDTKSVYQMAEYASDEGRAIYHDAARKGTPGIFRGL 133

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           T+W+PNINIFRDPRWGRGQET GEDP +T    V++V+G+QGD          L++SAC 
Sbjct: 134 TYWSPNINIFRDPRWGRGQETYGEDPYLTAAIGVAFVKGLQGDD------PVYLKSSACA 187

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KH+  +    W    R+ ++A V+  DL DTY P F   V   + +G+MCAYN     P 
Sbjct: 188 KHYAVHSGPEW---NRHTYNAEVSNHDLWDTYLPAFRELVVDAKVTGVMCAYNSFFEQPC 244

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           C +  L+    R QW F GY+TSDC A+   Y+     +   +A  D +  G D  CG+ 
Sbjct: 245 CGNDLLMMDILRNQWKFDGYVTSDCGAIEDFYNTHNTHEDAAEASADAVLHGTDCECGNG 304

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             +    A+ +  + E ++D +L  LF +R RLG+F+ +  + P+  I   V+   AH+ 
Sbjct: 305 AYRALADAIVRGLITEEQVDVSLKKLFEIRFRLGMFDPDDRV-PYSDIPISVLECDAHKA 363

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
            AL+ A+  IVLLKN   LLPL  +K   +A++GPNA+    LL NY G PSC + T L+
Sbjct: 364 HALKMARQSIVLLKNEKQLLPLDMNKIKKIAVVGPNADDKSVLLANYYGYPSCVT-TVLE 422

Query: 454 ALQNYVENTV 463
            ++  V + V
Sbjct: 423 GIKGKVGDQV 432



 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 57/105 (54%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD VV + GL    E EE+          DR  + +P  QQ L+  +  A  KPVI +
Sbjct: 597 KDADVVVFVGGLSAKVEGEEMKVEIDGFKRGDRTSISIPVVQQNLLKELY-ATGKPVIFI 655

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           L+ G  V + +     ++ +IL A Y G+AG  A+A+V+FGD+NP
Sbjct: 656 LMTGSAVGLEWES--EHLPAILNAWYGGQAGGQAIADVLFGDYNP 698


>gi|399029285|ref|ZP_10730258.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398072895|gb|EJL64089.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 871

 Score =  313 bits (803), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 182/441 (41%), Positives = 249/441 (56%), Gaps = 33/441 (7%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           + E F F    L   QR  DLVSR+++DEKISQL++S+PAI RLG+P Y WW+E+LHGVA
Sbjct: 20  AQENFAFKNPNLTTEQRVDDLVSRMSIDEKISQLMDSSPAIERLGVPEYNWWNESLHGVA 79

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQA 151
             G           AT FPQ I  A+S+D  L + +   I  EARA ++        G  
Sbjct: 80  RAGY----------ATVFPQSISIASSWDRQLIFDVANVISDEARAKHHEYLRRGQHGMY 129

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
            G+TFW+PN+NIFRDPRWGRG ET GEDP +TG+  + YV G+QG        +  L+  
Sbjct: 130 QGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTGQLGLKYVNGLQGTN------EKYLKVI 183

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           A  KH+  +        +R+ F+A  +  DL +TY P F + VK+G    +M AYNR  G
Sbjct: 184 ATAKHYAVHSGPE---PSRHLFNAETSDIDLYETYLPAFRTLVKEGHVYSVMGAYNRFRG 240

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
             SC+    L    R  WGF GYI SDC AV+ I+           A    LK G+D+ C
Sbjct: 241 -ESCSASPFLFNILRNVWGFDGYIVSDCGAVTDIWKYHKITGDAATASALALKDGLDLEC 299

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           GS   K  K A+ +K + E++ID A+  LF+ R +LG+F+    +  + +I   V  + A
Sbjct: 300 GSSF-KSLKEAIDRKLISEADIDIAVKRLFTARFKLGMFDPEEIVS-YAQIPYSVNNNSA 357

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H  LA  A+Q  IVLLKN +  LPL +    ++A+IGPNAN  ++L GNY+G     IT 
Sbjct: 358 HDWLARVASQKSIVLLKNQNNTLPLSRDIK-TVAVIGPNANDVQSLWGNYSGVPSNPITV 416

Query: 452 LQALQNYVE-NT--VYYPGCD 469
           L+ +QN +E NT  +Y  G D
Sbjct: 417 LKGIQNKLEPNTKVLYAKGTD 437



 Score = 82.4 bits (202), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 71/113 (62%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           + +AV +A  AD +VL++GL++  E EE+          DR  L LP  Q+EL+ +   A
Sbjct: 589 LQEAVQVAGQADAIVLVLGLNERLEGEEMKVEADGFEGGDRTSLDLPSNQEELM-KAMTA 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             KPVILVL+ G  + I +A  + ++ +IL AGYPG+ G  A+A+V+FGD+NP
Sbjct: 648 TGKPVILVLINGSALSINWA--NDHVPAILTAGYPGQQGGNAIADVLFGDYNP 698


>gi|308208211|gb|ADO20356.1| putative beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen
           bacterium]
          Length = 780

 Score =  313 bits (802), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 215/608 (35%), Positives = 298/608 (49%), Gaps = 98/608 (16%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+   +LP  +RA+DLVSRLTL+EK S  ++ +  +  LGI AY WWSEALHGVA  G  
Sbjct: 45  PYKDRSLPPEERAKDLVSRLTLEEKASLSMHPSAPVEALGIKAYNWWSEALHGVARNG-- 102

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIGMTF 156
                    AT FPQ I  AASFD  L Y +  A+  EAR  Y       + GQ  G+TF
Sbjct: 103 --------AATVFPQPIGMAASFDEPLLYEVFTAVSDEARVKYKIAKESGHIGQYQGVTF 154

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W PNINIFRDPRWGRG ET GEDP +TG+  ++ VRG+QG + +       L+A AC KH
Sbjct: 155 WTPNINIFRDPRWGRGMETYGEDPYLTGQMGMAVVRGLQGPSDS-----PVLKAHACAKH 209

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           +  +    W    R+ +DA V+ +DL +TY P F+  V +     +M AYNR  G P  A
Sbjct: 210 YAVHSGPEW---NRHSYDAEVSERDLRETYLPAFKDLVTKANVQEVMTAYNRFRGEPCGA 266

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
              L++   R +WG+ G ITSDC AV   Y     GY+     A    + AG+D  CG  
Sbjct: 267 SDYLINTILRGEWGYKGLITSDCWAVEDFYVQGRHGYSPDVASAAAAAVHAGVDTECGQ- 325

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             +H   AV++  L E ++DR L  LF+ R +LG  +    +  +  + A ++  P H  
Sbjct: 326 AYRHIPEAVERGLLDEKDLDRNLIRLFTARYQLGEMD---DISLWDDLPASILEGPEHLA 382

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           L+ + AQ+ +VLL+N  G+LPL  +  V +AL+GPN +  +   GNY     R++T   A
Sbjct: 383 LSRKMAQESMVLLQNKGGILPL--APDVRVALVGPNGDDREMQWGNYNPVPGRTVTLYDA 440

Query: 455 LQNYVENTVYYPGCDTVACS---------------------------------------- 474
           L+       Y  GC  V                                           
Sbjct: 441 LKERFPGIKYVRGCGIVGAEFAPKPDPNNPLSQALGKSREEMEAIARQYAIGVQDILNYV 500

Query: 475 ------SASIDKAVDIA------KGADHVVLMMGLDQTQEKEEL----------DRVDLV 512
                  AS    +D+       +G D V+   G+    E EE+          DR D+ 
Sbjct: 501 RRQERMQASFLPELDVQSVLKELEGIDVVIFAGGISPRFEGEEMPVNLPGFKGGDRTDIQ 560

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           LP  Q++L+  + +A KK VILV   G    I       +  +IL A YPGE G +A+ +
Sbjct: 561 LPQVQRDLMKALHDAGKK-VILVNFSG--CAIGLVPETESCDAILQAWYPGEEGGLAITD 617

Query: 573 VIFGDHNP 580
           V+FGD NP
Sbjct: 618 VLFGDVNP 625


>gi|332665860|ref|YP_004448648.1| beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
 gi|332334674|gb|AEE51775.1| Beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
          Length = 887

 Score =  313 bits (802), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 172/427 (40%), Positives = 241/427 (56%), Gaps = 28/427 (6%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           S + FP   T L    R +DLVSRLTL+EK+ Q++N+APAIPRLGIPAY+WW+E LHGVA
Sbjct: 36  SFKGFPMWDTNLSFEVRVKDLVSRLTLEEKVGQMLNAAPAIPRLGIPAYDWWNEVLHGVA 95

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG----- 153
                     T    T +PQ I  AA +DS     +     LE RA++N   A+G     
Sbjct: 96  R---------TPFHVTVYPQAIGMAAGWDSTSLAMMAHYSALEGRAVFNKATALGRNNER 146

Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
              +T+W PNINIFRDPRWGRGQET GEDP +T     ++VRG+QGD          L+A
Sbjct: 147 YLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTSMLGRAFVRGLQGDD------PKYLKA 200

Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
           +AC KHF  +        +R+  +   +  DL DTY P F+  V + +  G+MCAYN  +
Sbjct: 201 AACAKHFAVHSGPE---PSRHSDNFSPSNYDLWDTYLPAFKELVTKAKVEGVMCAYNAFH 257

Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
           G P C    L++   R+QW F GY+TSDC A+   +           A VD +  G DV 
Sbjct: 258 GQPCCGSDVLMNDILRKQWQFKGYVTSDCWAIDDFFKFHKTHPDATSASVDAVLHGTDVE 317

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
           CG+ + K     VK+  + E+++D +L  LF+ R RLG+F+   +M  + +    ++ + 
Sbjct: 318 CGTDVYKSLLDGVKKGMIAEAQLDISLIRLFTTRYRLGMFD-PVSMVKYAQTPESILETA 376

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            H+  +L+ AQ  IVLLKN    LPL K+    +A++GPNA++   +LGNY G     IT
Sbjct: 377 EHKAHSLKMAQQSIVLLKNEGNTLPLSKNIK-KIAVLGPNADNRIVVLGNYNGQPSEIIT 435

Query: 451 PLQALQN 457
            LQ ++N
Sbjct: 436 ALQGIKN 442



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 65/115 (56%), Gaps = 13/115 (11%)

Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVA 525
           +++   V+  K AD +V + G+    E EE+          DR  ++LP  Q EL+ ++ 
Sbjct: 607 SNLSAIVNRVKDADAIVYVGGISPQLEGEEMRVDFPGFNGGDRTSILLPAVQTELL-KML 665

Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +   KP++ V++ G  + + +   D+NI +I+ A Y G++   A+A+V+FGD+NP
Sbjct: 666 KGTGKPLVFVVMTGSAIALPYE--DQNIPAIVNAWYGGQSAGTAIADVLFGDYNP 718


>gi|94970273|ref|YP_592321.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
 gi|94552323|gb|ABF42247.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
          Length = 881

 Score =  312 bits (800), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 174/426 (40%), Positives = 242/426 (56%), Gaps = 37/426 (8%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +L   +RA DLV R+T++EK+SQL N + A+PRL +P Y+WWSEALHGVA  G   
Sbjct: 30  YLNPSLAPEKRAADLVHRMTVEEKVSQLTNDSRAVPRLNVPDYDWWSEALHGVAQPG--- 86

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
                    T +PQ +  AA+FD+    R+ + IG+E R  +  G          G+ FW
Sbjct: 87  --------VTEYPQPVALAATFDNDKVQRMARFIGIEGRIKHEEGMKDGHSDIFQGLDFW 138

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           APNINIFRDPRWGRGQET GEDP +T +  V+YV+G+QGD            A +  KH+
Sbjct: 139 APNINIFRDPRWGRGQETYGEDPFLTARMGVAYVKGLQGDD------PKYYLAISTPKHY 192

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +   TTR+  D +V+  D  DTY P F + V + +A  +MCAYN +NG P+C +
Sbjct: 193 AVH---SGPETTRHFADVKVSKHDELDTYLPAFRATVTEAKAGSVMCAYNSINGQPACVN 249

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQK 337
             LL    R +W F GY+ SDC+A+  IY    + K+  +A    ++ GMD  C  F ++
Sbjct: 250 EFLLQDQLRGKWNFQGYVVSDCEAIINIYRDHKFTKTQAEASALAVQRGMDNECVDFGKQ 309

Query: 338 HTKA-------AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
                      A KQ  L ESEID AL  LF+ RM+LG+F+  P M P+ KI    + S 
Sbjct: 310 KDDHDYRPYFDAYKQGILKESEIDTALVRLFTARMKLGMFD-PPEMVPYSKIDPKELESA 368

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            H+ LA   A + +VLLKN  G LPL KS  + +A+IGP A   + LLGNY G    +++
Sbjct: 369 EHRELARTLANESMVLLKND-GTLPLKKS-GLKIAVIGPLAEQTRYLLGNYNGTPSHTVS 426

Query: 451 PLQALQ 456
            L+ L+
Sbjct: 427 VLEGLR 432



 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 61/110 (55%), Gaps = 13/110 (11%)

Query: 481 AVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKK 530
           AV  AK AD V+ ++G+    E EE+          DR  L LP  +Q+L+  ++ AA K
Sbjct: 603 AVTAAKNADVVIAVLGITSDLEGEEMPVSEEGFNGGDRTSLDLPKPEQQLLESIS-AAGK 661

Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           PV+LVL  G  + + +A+   N  +IL   YPGE G  A+A+ + G +NP
Sbjct: 662 PVVLVLSNGSALSVNWAQQHAN--AILEGWYPGEEGGTAIAQTLSGKNNP 709


>gi|238578959|ref|XP_002388893.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
 gi|215450599|gb|EEB89823.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
          Length = 658

 Score =  312 bits (800), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 189/499 (37%), Positives = 272/499 (54%), Gaps = 44/499 (8%)

Query: 87  YEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY 146
           Y WWSEAL+                 ATSFP  I   A+FD  L + I   I  EARA  
Sbjct: 1   YNWWSEALN--------------FSSATSFPAPITMGATFDDGLIHAIATVISTEARAFN 46

Query: 147 NAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
           N  +  G+ F+ PNIN F+DPRWGRGQETPGEDP    +Y    V G+QG     G    
Sbjct: 47  NVNRG-GLDFFTPNINPFKDPRWGRGQETPGEDPFHISQYVYQLVTGLQG-----GVGPT 100

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            L+ +A CKH+ AYDL+N  G +R++FDA+VTMQDLA+ Y P F+SC++  + + IMC+Y
Sbjct: 101 NLKIAADCKHWAAYDLEN-LGVSRFEFDAKVTMQDLAEFYSPSFQSCIRDAKVASIMCSY 159

Query: 267 NRVNGIPSCADRNLLSKTARRQWGF--HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
           N VNGIPSCA+R LL   AR  WG     +IT DC AV  I+    Y   P +     L 
Sbjct: 160 NAVNGIPSCANRYLLQTLARDFWGLGEEQWITGDCGAVGNIFARHHYTDDPANGTAVALN 219

Query: 325 AGMDVNCGSFLQKHTK---AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
           AG D++C S    +++    A+ +  + E ++  A+   ++  +RL              
Sbjct: 220 AGTDIDCDSGAAAYSQNLGQALNRSLVSEDQLRTAVTRQYNSLVRLSW------------ 267

Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
              DV   PA Q LA QAA +GIVLLKN  G+LPL  S    +A++GP AN+   +  NY
Sbjct: 268 --DDVNTEPAQQ-LAYQAAVEGIVLLKN-DGILPLASSVK-KVAVVGPMANATTQMQSNY 322

Query: 442 AGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQ 501
            G +   ++P QA +N   N  +  G    +  ++    A+  A  AD V  + G+D T 
Sbjct: 323 NGIAPFLVSPQQAFRNAGFNVTFANGTGLNSSDTSGFSAAIAAADDADVVFYVGGIDTTI 382

Query: 502 EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
           E+E+ DR ++   G Q  L+ ++A    KP+I++ + GG VD +  + + ++ +++W GY
Sbjct: 383 EREDRDRPEISWTGNQLALVQQLASLG-KPLIVLQMGGGQVDSSSLRDNTSVNALIWGGY 441

Query: 562 PGEAGAVALAEVIFGDHNP 580
           PG++G  AL ++I G   P
Sbjct: 442 PGQSGGTALVDLITGKQAP 460


>gi|423221630|ref|ZP_17208100.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392645869|gb|EIY39591.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 864

 Score =  312 bits (799), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 183/445 (41%), Positives = 247/445 (55%), Gaps = 39/445 (8%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
           N   E FP+  T+L   +RA DL+ RLTL+EK S ++N +PAIPRL I AY WW+EALHG
Sbjct: 21  NKLPEKFPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHG 80

Query: 97  VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAG 149
           +A  G           AT FPQ I   ASFD  L Y +  A+  EARA         N  
Sbjct: 81  LARTGL----------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLT 130

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKL 208
           +   +T W PN+NIFRDPRWGRGQET GEDP +T +  V+ V G+QG DT    KL    
Sbjct: 131 RYQALTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYNKLH--- 187

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
              AC KH+  +    W    R+ F+A  ++ +DL +TY P F++ V++ +   +MCAYN
Sbjct: 188 ---ACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYN 241

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA-- 325
           R  G P C    LL++  R +WGF G + SDC AVS  +    +   P+ A         
Sbjct: 242 RFEGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLN 301

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           G DV CG+   K    AVK   + E++ID ++  L   R  LG  + N     +  I +D
Sbjct: 302 GTDVECGNSY-KSLPDAVKAGLITENQIDISVKRLLKARFELGEMDEN----VWTGISSD 356

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
           VV SP H+ LALQ A++ + LL+N++ +LPL  SK   +ALIGPNAN +    GNY G  
Sbjct: 357 VVDSPKHRQLALQMARETMTLLQNNNNILPL--SKQAKIALIGPNANDSVMQWGNYNGLP 414

Query: 446 CRSITPLQALQNYV--ENTVYYPGC 468
             +IT L+ +Q Y+   N +Y P C
Sbjct: 415 SHTITLLEGMQRYLPTSNLIYEPVC 439



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 52/105 (49%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K  D V+   G+    E EE+          DR ++ LP  Q+ ++  +  A K+ ++ V
Sbjct: 599 KDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAGKR-IVFV 657

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              G    I       N  +IL A YPG+AG  A+AEV+FGD+NP
Sbjct: 658 NFSGAA--IALEPESLNCEAILQAWYPGQAGGQAVAEVLFGDYNP 700


>gi|224538282|ref|ZP_03678821.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520107|gb|EEF89212.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 864

 Score =  311 bits (798), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 183/445 (41%), Positives = 247/445 (55%), Gaps = 39/445 (8%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
           N   E FP+  T+L   +RA DL+ RLTL+EK S ++N +PAIPRL I AY WW+EALHG
Sbjct: 21  NKLPEKFPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHG 80

Query: 97  VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAG 149
           +A  G           AT FPQ I   ASFD  L Y +  A+  EARA         N  
Sbjct: 81  LARTGL----------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLT 130

Query: 150 QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKL 208
           +   +T W PN+NIFRDPRWGRGQET GEDP +T +  V+ V G+QG DT    KL    
Sbjct: 131 RYQALTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPDTARYNKLH--- 187

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
              AC KH+  +    W    R+ F+A  ++ +DL +TY P F++ V++ +   +MCAYN
Sbjct: 188 ---ACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYN 241

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA-- 325
           R  G P C    LL++  R +WGF G + SDC AVS  +    +   P+ A         
Sbjct: 242 RFEGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLN 301

Query: 326 GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           G DV CG+   K    AVK   + E++ID ++  L   R  LG  + N     +  I +D
Sbjct: 302 GTDVECGNSY-KSLPDAVKAGLITENQIDISVKRLLKARFELGEMDEN----VWTGISSD 356

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
           VV SP H+ LALQ A++ + LL+N++ +LPL  SK   +ALIGPNAN +    GNY G  
Sbjct: 357 VVDSPKHRQLALQMARETMTLLQNNNNILPL--SKQAKIALIGPNANDSVMQWGNYNGLP 414

Query: 446 CRSITPLQALQNYV--ENTVYYPGC 468
             +IT L+ +Q Y+   N +Y P C
Sbjct: 415 SHTITLLEGMQRYLPTSNLIYEPVC 439



 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 53/105 (50%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K  D V+   G+    E EE+          DR ++ LP  Q+ ++  +  A K+ ++ V
Sbjct: 599 KDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAGKR-IVFV 657

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              G    I      +N  +IL A YPG+AG  A+AEV+FGD+NP
Sbjct: 658 NFSGAA--IALEPESQNCEAILQAWYPGQAGGQAVAEVLFGDYNP 700


>gi|289670678|ref|ZP_06491753.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 886

 Score =  311 bits (798), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 175/422 (41%), Positives = 242/422 (57%), Gaps = 32/422 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            QRA  LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           
Sbjct: 35  EQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGH---------- 84

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIF
Sbjct: 85  ATVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIF 144

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +TG+ AV ++RG+QG+  N        +  A  KH   +   +
Sbjct: 145 RDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHLAVH---S 195

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+  D+  TY P F + + QG+A  +MCAYN ++G P+CA   LL+  
Sbjct: 196 GPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACAADWLLNGR 255

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A++
Sbjct: 256 VRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY-RELGTAIE 314

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           +  + E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + I
Sbjct: 315 RGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESI 373

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTV 463
           VLLKN    LPL  +    LA+IGPNA++   L  NY G S   +TPL  L Q +    V
Sbjct: 374 VLLKNDANTLPL--NAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQV 431

Query: 464 YY 465
            Y
Sbjct: 432 RY 433



 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 61/104 (58%), Gaps = 13/104 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           +D VV  +GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+
Sbjct: 615 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLM 673

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            G  V + +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 674 SGSAVALNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|423215029|ref|ZP_17201557.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692292|gb|EIY85530.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 861

 Score =  311 bits (797), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 179/444 (40%), Positives = 247/444 (55%), Gaps = 38/444 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+  T+L   QRA DL+ RLTL+EK+S + N++PAIPRLGI  YEWW+EALHGV   
Sbjct: 22  QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
           G           AT FPQ I   ASF+  L Y +  A   EAR    ++     +    G
Sbjct: 82  GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           +TFW PN+NIFRDPRWGRGQET GEDP +TG+  ++ VRG+QG +     KL       A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KHF  +    W    R+ FDA  +  +DL +TY P F+  V++     +MCAYNR  G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
            P C    LL +  R +WG+ G + SDC A+S  Y    +   P  E A  D ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDL 302

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CGS        AVK   + E EID +L  L + R  LG  +  P    + +I   V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNS 358

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             HQ LAL+ A++ +VLL+N + +LPL  +  + +A++GPNAN +    GNY G    ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416

Query: 450 TPLQALQNYVEN--TVYYPGCDTV 471
           T L+A++  +     +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440



 Score = 39.7 bits (91), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 57/114 (50%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           ++ AV     AD ++   G+  + E EE+          DR D+ LP  Q++L+  + +A
Sbjct: 588 LNFAVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KK V+ +   G    I          +IL A YPG+AG  A+ + ++G++NPG
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|298481648|ref|ZP_06999839.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298272189|gb|EFI13759.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 861

 Score =  311 bits (797), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 179/444 (40%), Positives = 247/444 (55%), Gaps = 38/444 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+  T+L   QRA DL+ RLTL+EK+S + N++PAIPRLGI  YEWW+EALHGV   
Sbjct: 22  QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
           G           AT FPQ I   ASF+  L Y +  A   EAR    ++     +    G
Sbjct: 82  GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           +TFW PN+NIFRDPRWGRGQET GEDP +TG+  ++ VRG+QG +     KL       A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KHF  +    W    R+ FDA  +  +DL +TY P F+  V++     +MCAYNR  G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
            P C    LL +  R +WG+ G + SDC A+S  Y    +   P  E A  D ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDL 302

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CGS        AVK   + E EID +L  L + R  LG  +  P    + +I   V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNS 358

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             HQ LAL+ A++ +VLL+N + +LPL  +  + +A++GPNAN +    GNY G    ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416

Query: 450 TPLQALQNYVEN--TVYYPGCDTV 471
           T L+A++  +     +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440



 Score = 38.9 bits (89), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 56/114 (49%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           ++ AV     AD ++   G+  + E EE+          DR D+ LP  Q+ L+  + +A
Sbjct: 588 LNLAVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKA 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KK V+ +   G    I          +IL A YPG+AG  A+ + ++G++NPG
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|408369545|ref|ZP_11167326.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
 gi|407745291|gb|EKF56857.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
          Length = 881

 Score =  311 bits (797), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 185/434 (42%), Positives = 250/434 (57%), Gaps = 32/434 (7%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           + +PF    L  S R  DL+ RLT++EKI QL+ ++PAI RLGIP Y WW+E+LHGVA  
Sbjct: 25  QQYPFQNPELDDSARVADLLERLTVEEKIDQLLYTSPAIERLGIPEYNWWNESLHGVARA 84

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIG 153
           G           AT FPQ I  AA++DS L   +  AI  EARA ++        G   G
Sbjct: 85  GY----------ATVFPQSITIAAAWDSDLLKEVADAISDEARAKHHEYIRRGQRGIYQG 134

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW+PNINIFRDPRWGRG ET GEDP +TG+  ++YV+G+QG+  N  KL       A 
Sbjct: 135 LTFWSPNINIFRDPRWGRGHETYGEDPYLTGQLGIAYVKGLQGNDPNYLKL------VAT 188

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KHF  +   +     R++FD   + +DL +TY P F   VKQG    +M AYNRV G  
Sbjct: 189 AKHFAVH---SGPEPLRHEFDVSPSKRDLWETYLPAFRYLVKQGDVKSVMTAYNRVYGEA 245

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS 333
           + A   L +   R  W F GY+ SDC A+S I+     AK   +A    +  G D+NCG 
Sbjct: 246 ASASDTLFT-ILRDYWDFDGYVVSDCFAISDIWKYHKIAKDAAEASAMAVIEGCDLNCGD 304

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
             +K  + A +Q  + E +ID AL  L   R++LG+F+    + P+ +I  +V  S  H 
Sbjct: 305 SYEKLNQ-AYQQGMVTEKDIDIALSRLMEARIKLGMFDPE-QLVPYAQIPFNVNTSEKHN 362

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            LAL+AA++ IVLLKN   LLPL K    S+A+IGPNA++ ++L GNY G     IT LQ
Sbjct: 363 QLALKAAKESIVLLKNQGDLLPLSKDLK-SVAVIGPNADNIQSLWGNYNGNPKDPITVLQ 421

Query: 454 ALQNYV--ENTVYY 465
            +QN +  + TV Y
Sbjct: 422 GIQNALGPQTTVVY 435



 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 40/98 (40%), Positives = 59/98 (60%), Gaps = 13/98 (13%)

Query: 493 LMMGLDQTQEKEELD----------RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
           +++GL++  E EE+D          R  L LP  Q+ L+  VA+  K P++LVLL G  +
Sbjct: 607 MVLGLNERLEGEEMDVVVEGFAGGDRTALDLPASQRTLLKEVAKTGK-PIVLVLLNGSAL 665

Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            I +A    NI +I+ AGY G+ G  A+AEV+FGD+NP
Sbjct: 666 SINWAA--ENIPAIMTAGYAGQQGGNAVAEVLFGDYNP 701


>gi|299147288|ref|ZP_07040353.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298514566|gb|EFI38450.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 861

 Score =  311 bits (797), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 179/444 (40%), Positives = 247/444 (55%), Gaps = 38/444 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+  T+L   QRA DL+ RLTL+EK+S + N++PAIPRLGI  YEWW+EALHGV   
Sbjct: 22  QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
           G           AT FPQ I   ASF+  L Y +  A   EAR    ++     +    G
Sbjct: 82  GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           +TFW PN+NIFRDPRWGRGQET GEDP +TG+  ++ VRG+QG +     KL       A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KHF  +    W    R+ FDA  +  +DL +TY P F+  V++     +MCAYNR  G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
            P C    LL +  R +WG+ G + SDC A+S  Y    +   P  E A  D ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASADAVRAGTDL 302

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CGS        AVK   + E EID +L  L + R  LG  +  P    + +I   V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPTSVLNS 358

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             HQ LAL+ A++ +VLL+N + +LPL  +  + +A++GPNAN +    GNY G    ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416

Query: 450 TPLQALQNYVEN--TVYYPGCDTV 471
           T L+A++  +     +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440



 Score = 38.9 bits (89), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 56/114 (49%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           ++ AV     AD ++   G+  + E EE+          DR D+ LP  Q+ L+  + +A
Sbjct: 588 LNLAVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKA 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KK V+ +   G    I          +IL A YPG+AG  A+ + ++G++NPG
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|289664871|ref|ZP_06486452.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 886

 Score =  311 bits (796), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 175/422 (41%), Positives = 241/422 (57%), Gaps = 32/422 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            QRA  LV+ ++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           
Sbjct: 35  EQRAAALVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGH---------- 84

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIF
Sbjct: 85  ATVFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIF 144

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +TG+ AV ++RG+QG+  N        +  A  KH   +   +
Sbjct: 145 RDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHLAVH---S 195

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+  D+  TY P F + + QG+A  +MCAYN ++G P+CA   LL+  
Sbjct: 196 GPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACAADWLLNGR 255

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A++
Sbjct: 256 VRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY-RELGTAIE 314

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           +  + E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + I
Sbjct: 315 RGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESI 373

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTV 463
           VLLKN    LPL  +    LA+IGPNA++   L  NY G S   +TPL  L Q +    V
Sbjct: 374 VLLKNDANTLPL--NAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQV 431

Query: 464 YY 465
            Y
Sbjct: 432 RY 433



 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 61/104 (58%), Gaps = 13/104 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           +D VV  +GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+
Sbjct: 615 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLM 673

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            G  V + +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 674 SGSAVALNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|423290405|ref|ZP_17269254.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
           CL02T12C04]
 gi|392665792|gb|EIY59315.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
           CL02T12C04]
          Length = 861

 Score =  311 bits (796), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 179/444 (40%), Positives = 247/444 (55%), Gaps = 38/444 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+  T+L   QRA DL+ RLTL+EK+S + N++PAIPRLGI  YEWW+EALHGV   
Sbjct: 22  QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
           G           AT FPQ I   ASF+  L Y +  A   EAR    ++     +    G
Sbjct: 82  GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           +TFW PN+NIFRDPRWGRGQET GEDP +TG+  ++ VRG+QG +     KL       A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KHF  +    W    R+ FDA  +  +DL +TY P F+  V++     +MCAYNR  G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
            P C    LL +  R +WG+ G + SDC A+S  Y    +   P  E A  D ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASADAVRAGTDL 302

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CGS        AVK   + E EID +L  L + R  LG  +  P    + +I   V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WAEIPTSVLNS 358

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             HQ LAL+ A++ +VLL+N + +LPL  +  + +A++GPNAN +    GNY G    ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416

Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
           T L+A++  +     +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440



 Score = 39.3 bits (90), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 57/114 (50%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           ++ AV     AD ++   G+  + E EE+          DR D+ LP  Q++L+  + +A
Sbjct: 588 LNFAVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KK V+ +   G    I          +IL A YPG+AG  A+ + ++G++NPG
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|427385138|ref|ZP_18881643.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727306|gb|EKU90166.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
           12058]
          Length = 863

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 188/469 (40%), Positives = 257/469 (54%), Gaps = 54/469 (11%)

Query: 13  LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
           LCLCF S       +QPP+     NP+          L   +RA DLV RLTL+EK S +
Sbjct: 14  LCLCFLS------CSQPPYK----NPA----------LTPEERAADLVGRLTLEEKASLM 53

Query: 73  VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
            N++PAIPRLGI AY+WW+EALHGV   G           AT FPQ I   ASF++ L Y
Sbjct: 54  QNTSPAIPRLGIKAYDWWNEALHGVGRAGL----------ATVFPQAIGMGASFNNDLLY 103

Query: 133 RIGQAIGLEARA----LYNAG---QAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
            +  A+  EARA        G   +  G+T W PN+NIFRDPRWGRGQET GEDP +TG+
Sbjct: 104 DVFTAVSDEARAKTAEFSKEGGLKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLTGQ 163

Query: 186 YAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLAD 244
             ++ VRG+QG    GGK     +  AC KHF  +    W    R+ FDA  V  +DL +
Sbjct: 164 MGMAVVRGLQGP--EGGKYD---KLHACAKHFAVHSGPEW---NRHSFDAENVDPRDLWE 215

Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
           TY P F+  V++     +MCAYNR  G P C    LL +  R +W + G I SDC A++ 
Sbjct: 216 TYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNRLLVQILRDEWAYDGIIVSDCWAIND 275

Query: 305 IYDAEGYAKSP--EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
            ++   +   P  E A    +  G DV CG       + AVK   + E +ID +L  L  
Sbjct: 276 FFNKGAHETEPDKEHASAKAVLTGTDVECGESYASLPQ-AVKAGLIDEKKIDISLKRLMK 334

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
            R  LG  + NP +  + +I   VV S  H+ LAL+ A++ +VLL+N+  +LPL  +KS+
Sbjct: 335 ARFELGEMD-NPELVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQNVLPL--NKSL 391

Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCD 469
            +A++GPNAN +    GNY G    ++T L+ ++ Y+     +Y PGCD
Sbjct: 392 KVAVVGPNANDSVMQWGNYNGFPGHTVTLLEGIRQYLPEAQLIYEPGCD 440



 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 57/113 (50%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           ++K +   K AD +V   G+    E EE+          DR  + LP  Q  L+  + +A
Sbjct: 590 LNKTIQRVKDADIIVFAGGISPAVEGEEMRVTIPGFKGGDRETIELPSIQSRLLAELKKA 649

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KK V+ V   G  + +T     +   +IL A YPG+AG  A+A V+FGD+NP
Sbjct: 650 GKK-VVFVNFSGSAIALT--PETKTCDAILQAWYPGQAGGTAIANVLFGDYNP 699


>gi|384420163|ref|YP_005629523.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353463076|gb|AEQ97355.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 889

 Score =  310 bits (794), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 174/421 (41%), Positives = 244/421 (57%), Gaps = 32/421 (7%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           QRA DLV+ ++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           A
Sbjct: 39  QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 88

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-AGQA-------IGMTFWAPNINIFR 165
           T FPQ I  AAS++++L  ++G  +  EARA +N AG+         G+T W+PNINIFR
Sbjct: 89  TVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFR 148

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +TG+ AV ++ G+QGD  +        +  A  KH   +   + 
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDLD------HPRTIATPKHLAVH---SG 199

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FD  V+ +D+  TY P F + + +G+A  +MCAYN ++G P+CA   L++   
Sbjct: 200 PEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLINGRV 259

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A+ +
Sbjct: 260 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 318

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V +  H+ LALQAA + IV
Sbjct: 319 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRALALQAAAESIV 377

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
           LLKN+   LPL       LA+IGPNA++   L  NY G S   +TPL  L Q +    V 
Sbjct: 378 LLKNNANTLPL--KAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 435

Query: 465 Y 465
           Y
Sbjct: 436 Y 436



 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 56/96 (58%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 718


>gi|84623339|ref|YP_450711.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188577358|ref|YP_001914287.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|84367279|dbj|BAE68437.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188521810|gb|ACD59755.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 889

 Score =  310 bits (794), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 174/421 (41%), Positives = 245/421 (58%), Gaps = 32/421 (7%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           QRA DLV+ ++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           A
Sbjct: 39  QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 88

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-AGQA-------IGMTFWAPNINIFR 165
           T FPQ I  AAS++++L  ++G  +  EARA +N AG+         G+T W+PNINIFR
Sbjct: 89  TVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFR 148

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +TG+ AV ++ G+QGD  +        +  A  KH   +   + 
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDLD------HPRTIATPKHLAVH---SG 199

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FD  V+ +D+  TY P F + + +G+A  +MCAYN ++G P+CA   L++   
Sbjct: 200 PEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLINGRV 259

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A+ +
Sbjct: 260 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 318

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V +  H+ LALQAA + IV
Sbjct: 319 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRALALQAAAESIV 377

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
           LLKN+   LPL  +    LA+IGPNA++   L  NY G S   +TPL  L Q +    V 
Sbjct: 378 LLKNNANTLPL--NAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 435

Query: 465 Y 465
           Y
Sbjct: 436 Y 436



 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 56/96 (58%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 718


>gi|336404627|ref|ZP_08585320.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
 gi|335941531|gb|EGN03384.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
          Length = 861

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 176/443 (39%), Positives = 246/443 (55%), Gaps = 36/443 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+  T+L   QRA DL+ RLTL+EK+S + N++PAIPRLGI  YEWW+EALHGV   
Sbjct: 22  QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
           G           AT FPQ I   ASF+  L Y +  A   EAR    ++     +    G
Sbjct: 82  GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PN+NIFRDPRWGRGQET GEDP +TG+  ++ VRG+QG      +  G  +  AC
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGP-----EDAGYDKLHAC 186

Query: 214 CKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
            KHF  +    W    R+ FDA  +  +DL +TY P F+  V++     +MCAYNR  G 
Sbjct: 187 AKHFAVHSGPEW---NRHSFDAENIAPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGE 243

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVN 330
           P C    LL +  R +WG+ G + SDC A+S  Y    +   P  E A    ++ G D+ 
Sbjct: 244 PCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASAAAVRTGTDLE 303

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
           CGS        AVK   + E EID +L  L + R  LG  +  P    + +I   V+ S 
Sbjct: 304 CGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WAEIPTSVLNSK 359

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            HQ LAL+ A++ +VLL+N + +LPL  + ++ +A++GPNAN +    GNY G    ++T
Sbjct: 360 EHQALALRMARESLVLLQNKNNILPL--NTNLKIAVMGPNANDSVMQWGNYNGIPAHTVT 417

Query: 451 PLQALQNYVEN--TVYYPGCDTV 471
            L+A++  +     +Y PGCD V
Sbjct: 418 LLEAVRAKLPEGQIIYEPGCDRV 440



 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 57/114 (50%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           ++ AV     AD ++   G+  + E EE+          DR D+ LP  Q++L+  + +A
Sbjct: 588 LNLAVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KK V+ +   G    I          +IL A YPG+AG  A+ + ++G++NPG
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|325925754|ref|ZP_08187127.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
 gi|325543811|gb|EGD15221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
          Length = 874

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 174/422 (41%), Positives = 242/422 (57%), Gaps = 32/422 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            QRA  LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           
Sbjct: 23  EQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY---------- 72

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIF
Sbjct: 73  ATVFPQAIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIF 132

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +TG+ AV ++RG+QG+  N        +  A  KH   +   +
Sbjct: 133 RDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---S 183

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+ +D+  TY P F + + +G+A  +MCAYN ++G P+CA   LL+  
Sbjct: 184 GPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNGR 243

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A+ 
Sbjct: 244 VRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIA 302

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           + ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + I
Sbjct: 303 RGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESI 361

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTV 463
           VLLKN    LPL       LA+IGPNA++   L  NY G S   +TPL  L Q +    V
Sbjct: 362 VLLKNDANTLPL--KAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQV 419

Query: 464 YY 465
            Y
Sbjct: 420 SY 421



 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 56/96 (58%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 611 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 669

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 670 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 703


>gi|423294294|ref|ZP_17272421.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
           CL03T12C18]
 gi|392675485|gb|EIY68926.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
           CL03T12C18]
          Length = 861

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 179/444 (40%), Positives = 247/444 (55%), Gaps = 38/444 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+  T+L   QRA DL+ RLTL+EK+S + N++PAIPRLGI  YEWW+EALHGV   
Sbjct: 22  QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
           G           AT FPQ I   ASF+  L Y +  A   EAR    ++    A+    G
Sbjct: 82  GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGALKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           +TFW PN+NIFRDPRWGRGQET GEDP +TG+  ++ VRG+QG +     KL       A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDTKYDKLH------A 185

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KHF  +    W    R+ FDA  +  +DL +TY P F+  V++     +MCAYNR  G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
            P C    LL +  R +WG+ G + SDC A+S  Y    +   P  E A    ++ G D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAAAVRTGTDL 302

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CGS        AVK   + E EID +L  L + R  LG  +  P    + +I A V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMDEQPA---WSEIPASVLNS 358

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             HQ LAL+ A++ +VLL+N + +LPL  +  + +A++GPNAN +    GNY G    ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416

Query: 450 TPLQALQNYVEN--TVYYPGCDTV 471
           T L+A++  +     +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440



 Score = 38.9 bits (89), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 57/114 (50%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           ++ AV     AD ++   G+  + E EE+          DR D+ LP  Q+ L+  + +A
Sbjct: 588 LNFAVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKA 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KK V+ +   G  + +          +IL A YPG+AG  A+ + ++G++NPG
Sbjct: 648 GKK-VVFINYSGSAIGLV--PETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|325914134|ref|ZP_08176487.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325539637|gb|EGD11280.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 874

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 173/414 (41%), Positives = 240/414 (57%), Gaps = 31/414 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            QRA  LV++++ DEK++Q +N APAIPRL IPAYEWWSE LHG+A  G           
Sbjct: 23  EQRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNGY---------- 72

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIF
Sbjct: 73  ATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIF 132

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +TG+ AV ++RG+QGD  N        +  A  KH   +   +
Sbjct: 133 RDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLN------HPRTIATPKHIAVH---S 183

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+ +D+  TY P F + +  G+A  +MCAYN ++G P+CA   LL+  
Sbjct: 184 GPEPGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPACAADWLLNGR 243

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A++
Sbjct: 244 VRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY-RELGTAIE 302

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           + ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + I
Sbjct: 303 RGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESI 361

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
           VLLKN+   LPL       LA+IGPNA++   L  NY G S   ITPL  L+ +
Sbjct: 362 VLLKNTATTLPL--KAGTRLAVIGPNADALAALEANYQGTSATPITPLLGLRQH 413



 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 42/104 (40%), Positives = 60/104 (57%), Gaps = 13/104 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           +D VV  +GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+
Sbjct: 603 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLM 661

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            G  V + +AK   N  +I+ A YPG++G  A+A  + GD NPG
Sbjct: 662 SGSAVALNWAKA--NADAIVAAWYPGQSGGTAIARALAGDDNPG 703


>gi|346725879|ref|YP_004852548.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346650626|gb|AEO43250.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 889

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 174/421 (41%), Positives = 242/421 (57%), Gaps = 32/421 (7%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           QRA  LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           A
Sbjct: 39  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 88

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
           T FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIFR
Sbjct: 89  TVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 148

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +TG+ AV ++RG+QG+  N        +  A  KH   +   + 
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---SG 199

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FD  V+ +D+  TY P F + + +G+A  +MCAYN ++G P+CA   LL+   
Sbjct: 200 PEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 259

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A+ +
Sbjct: 260 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 318

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + IV
Sbjct: 319 GEVDEALLDQSLVRLFATRYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 377

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
           LLKN    LPL       LA+IGPNA++   L  NY G S   +TPL  L Q +    V 
Sbjct: 378 LLKNDANTLPL--KAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 435

Query: 465 Y 465
           Y
Sbjct: 436 Y 436



 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 56/96 (58%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 718


>gi|282877070|ref|ZP_06285912.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
 gi|281300752|gb|EFA93079.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
          Length = 721

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 196/565 (34%), Positives = 288/565 (50%), Gaps = 48/565 (8%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
           T+PF    L   QRA DL  RLTL+EK   + N++  +PRLGI  ++WW EALHG A  G
Sbjct: 23  TYPFQDARLSFEQRADDLCKRLTLEEKAGLMQNNSKPVPRLGIKQFQWWGEALHGSARTG 82

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------M 154
                      AT FPQ I  AASFD  L  ++      EARA YN     G       +
Sbjct: 83  L----------ATVFPQTIGMAASFDDELLLQVFNIASTEARAKYNVAAKKGYFDTSWSV 132

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           + W PN+NIFRDPRWGRGQET GEDP +T +   + V G+QG    G       +A AC 
Sbjct: 133 SLWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVEGLQG----GKGPHKYYKAFACA 188

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KHF  +    W   +    D  V+ +D  +TY P F+  V+ G    +MCAYN ++G P 
Sbjct: 189 KHFAVHSGPEWNRHSISIDD--VSPRDFHETYLPAFKHLVQVGGVKEVMCAYNSIDGEPC 246

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCG 332
           C+D+ LL +  R +WGF G + SDC A+  I+    +   P+   A    +K G D++CG
Sbjct: 247 CSDQRLLEQLLRDEWGFKGIVVSDCGAIDDIWRKGFHEVEPDAAHASARAVKGGTDMSCG 306

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
                    AV+  K+ E  ID++L  L   RM+LG F+ + ++  +  I    V +PA 
Sbjct: 307 QTYGS-LPEAVRLGKVTEERIDKSLKRLIVGRMQLGEFDPD-SITRWNAISMKDVSTPAS 364

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + +AL+ A++ + LL N    LPL K     + ++GPNAN +  + GNY G    ++T L
Sbjct: 365 REVALKMARETMTLLHNPMHALPLSKQLK-QVVVMGPNANDSVMMWGNYNGTPHHTVTIL 423

Query: 453 QALQNYV--ENTVYYPGCDTV-----ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
             ++  +  +   +  GC  V        + +  + V+       V+ + G+    E E+
Sbjct: 424 DGIRRKIGAQRVKFIEGCGLVEPHRRGNQALTTQQLVEEVGDNKTVIFVGGISPQLEGEQ 483

Query: 506 L----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
           L          DRV + LP  Q+E+I  +  A K+  ++++ C G   I       +  +
Sbjct: 484 LEVEAKGFKGGDRVTIELPQVQREMIAALHAAGKQ--VIMVNCSGSA-IGLVPEVTHTDA 540

Query: 556 ILWAGYPGEAGAVALAEVIFGDHNP 580
           IL A YPGE G  A+A+V+FGD+NP
Sbjct: 541 ILQAWYPGERGGEAVADVLFGDYNP 565


>gi|405968899|gb|EKC33925.1| Putative beta-D-xylosidase 5 [Crassostrea gigas]
          Length = 748

 Score =  309 bits (792), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 200/573 (34%), Positives = 291/573 (50%), Gaps = 52/573 (9%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS--------APAIPRLGIPAYEWW 90
           ++  FPF   +L  S+R  DLV RLTLD+ + QL           APAI  LGI  Y+W 
Sbjct: 20  ASSNFPFQNVSLSWSERVDDLVGRLTLDQIVQQLARGGAGLNGGPAPAIENLGIGPYQWN 79

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
           +E L G    G           ATSFPQ I  AA+F   L + + +A   E RA +N   
Sbjct: 80  TECLRGDVEAGN----------ATSFPQAIGLAAAFSKDLIFNVSKAAATEVRAKHNDFV 129

Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
                    G++ ++P +NI R P WGR QET GEDP ++G YA  +V+G+QGD      
Sbjct: 130 KRGIFTDHTGLSCFSPVVNIMRHPLWGRNQETYGEDPYLSGTYASYFVQGLQGDH----- 184

Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
               +QA+A CKHF A+        +R  FDA+V+M+DL  T+ P F+ CV+ G A  +M
Sbjct: 185 -DRYIQANAGCKHFDAHGGPEDIPESRMGFDAKVSMRDLRLTFLPAFQKCVQAG-AYSLM 242

Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
           C+YN +NG+P+C+++ L+    R +W F GY+ SD  A+        Y  + EDA    +
Sbjct: 243 CSYNSINGVPACSNKLLMMDILRGEWNFTGYVVSDEGAIENQISFHHYYNNSEDAAAGSV 302

Query: 324 KAGMDVNCGSFLQK----HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
            AG ++     L +        AVK  KL ES +   +  LF  RMRLG F+  P M P+
Sbjct: 303 NAGCNLELSGNLTEPVFMKIGDAVKSGKLEESVVRNRVKPLFYTRMRLGEFDP-PEMNPY 361

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLK------NSHGLLPLPKSKSVSLALIGPNANS 433
             +   V+ S  H+ L+L AA   +VLLK        H +   P  +   +A+IGP AN+
Sbjct: 362 SSVNLSVIQSEEHRNLSLTAAAKSLVLLKRPSKFSKRHLIGGFPSER---MAVIGPMANN 418

Query: 434 AKTLLGNYAGPSCRSI--TPLQALQNYVENTVYYPGC-DTVACSSASIDKAVDIAKGADH 490
              + G+Y+  +      TPL+ L     +  Y  GC D   C + S D       GAD 
Sbjct: 419 TDQIFGDYSPTTDPRFVKTPLKGLTELNFSMNYAAGCVDGTRCLNYSQDDVKTALVGADL 478

Query: 491 VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
           VV+ +G  +  E E +DR D++LPG+Q +L+  V     K V L++   GPV+IT+A+  
Sbjct: 479 VVVCLGTGKDLESENVDRKDMMLPGKQLQLLQDVVSMTNKAVYLLVFSAGPVNITWAQES 538

Query: 551 RNIGSILWAGYPGEAGAVALAEVIF---GDHNP 580
             +  IL   YP ++   A+ + +    G  NP
Sbjct: 539 ERVLIILQCFYPAQSAGDAITQALIMRDGRFNP 571


>gi|78048767|ref|YP_364942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
 gi|78037197|emb|CAJ24942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 889

 Score =  309 bits (792), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 174/421 (41%), Positives = 242/421 (57%), Gaps = 32/421 (7%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           QRA  LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           A
Sbjct: 39  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 88

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
           T FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIFR
Sbjct: 89  TVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 148

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +TG+ AV ++RG+QG+  N        +  A  KH   +   + 
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---SG 199

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FD  V+ +D+  TY P F + + +G+A  +MCAYN ++G P+CA   LL+   
Sbjct: 200 PEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 259

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A+ +
Sbjct: 260 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 318

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + IV
Sbjct: 319 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 377

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
           LLKN    LPL       LA+IGPNA++   L  NY G S   +TPL  L Q +    V 
Sbjct: 378 LLKNDANTLPL--KAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 435

Query: 465 Y 465
           Y
Sbjct: 436 Y 436



 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 56/96 (58%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 718


>gi|423226625|ref|ZP_17213090.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392628884|gb|EIY22909.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 863

 Score =  309 bits (791), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 184/469 (39%), Positives = 259/469 (55%), Gaps = 54/469 (11%)

Query: 13  LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
           LCLCF S       +QPP+     NP+          L   +RA DLV RLTL+EK + +
Sbjct: 14  LCLCFLSC------SQPPYK----NPA----------LSPEERANDLVGRLTLEEKAALM 53

Query: 73  VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
            N++PAIPRLGI AY+WW+EALHGV   G           AT FPQ I   ASF++ L Y
Sbjct: 54  QNTSPAIPRLGIKAYDWWNEALHGVGRAGL----------ATVFPQAIGMGASFNNELLY 103

Query: 133 RIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
            +  A+  EARA            +  G+T W PNINIFRDPRWGRGQET GEDP +TG+
Sbjct: 104 DVFTAVSDEARAKNTEFSKEGGLKRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQ 163

Query: 186 YAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLAD 244
             ++ VRG+QG     G+   KL   AC KH+  +    W    R+ F+A  +  +DL +
Sbjct: 164 MGMAVVRGLQGPE---GEKYDKLH--ACAKHYAVHSGPEW---NRHSFNAENIDPRDLWE 215

Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
           TY P F+  V++     +MCAYNR  G P C    LL +  R +WG+   + SDC A+S 
Sbjct: 216 TYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISD 275

Query: 305 IYDAEGYAKSP--EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
            Y+ + +   P  + A    + +G DV CG       + AVK+  + E +ID +L  L  
Sbjct: 276 FYNKDAHETDPDKQHASAKAVLSGTDVECGDSYASLPE-AVKEGLIDEKQIDISLKRLMK 334

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
            R  LG  +  P+   + +I   VV S  H+ LAL+ A++ +VLL+N+  LLPL  +K++
Sbjct: 335 ARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQSLLPL--NKNL 391

Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCD 469
            +A++GPNAN +    GNY G    +IT L+ ++ Y+  +  +Y PGCD
Sbjct: 392 KVAVVGPNANDSVMQWGNYNGFPSHTITLLEGIREYLPESQIIYEPGCD 440



 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 57/113 (50%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           + + VD  K AD ++   G+    E EE+          DR  + LP  Q  L+  + +A
Sbjct: 590 LKQTVDKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 649

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KK ++ V   G  + +T     +   +IL A YPG+AG  A+A V+FGD+NP
Sbjct: 650 GKK-IVFVNFSGSAIALT--PESKTCDAILQAWYPGQAGGTAIANVLFGDYNP 699


>gi|189468358|ref|ZP_03017143.1| hypothetical protein BACINT_04755 [Bacteroides intestinalis DSM
           17393]
 gi|189436622|gb|EDV05607.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 865

 Score =  308 bits (790), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 177/444 (39%), Positives = 255/444 (57%), Gaps = 33/444 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+    L  S+RA DL+ R+TL+EKISQ+ N +PAI RLGIPAY+WW+EALHGVA  GK 
Sbjct: 24  PYRNPNLSPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYDWWNEALHGVARAGK- 82

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTF 156
                    AT FPQ I  AA+FD+   Y     +  EARA Y+        G   G+TF
Sbjct: 83  ---------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERGGYKGLTF 133

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W PNINI+RDPRWGRG ET GEDP +T    ++ V+G+QG+    GK     +A AC KH
Sbjct: 134 WTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGN--GAGKYD---KAHACAKH 188

Query: 217 FTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           +  +    W    R+ FD++ ++ +DL +TY P F++ V +G+   +MCAYNR  G P C
Sbjct: 189 YAVHSGPEW---NRHSFDSKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRFEGEPCC 245

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGS 333
           +++ LL +  R  WG+   + SDC A+   Y    +   P  E A  D + +G D+ CG 
Sbjct: 246 SNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPSAEAASADAVVSGTDLECGG 305

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
                 + AVK+  + E +I+ ++  L   R +LG+F+ + T+  + +I   VV S  H 
Sbjct: 306 SYSSLNE-AVKKGLITEDKINESVFRLLRARFQLGMFD-DDTLVSWSEIPYSVVESKEHV 363

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
             AL+ A+  +VLL N +  LPL KS    +A++GPNAN +  L  NY G   +S+T L+
Sbjct: 364 DKALEMARKSMVLLTNKNNSLPLSKSIR-KVAVLGPNANDSVMLWANYNGFPTKSVTILE 422

Query: 454 ALQNYV-ENTVYYP-GCDTVACSS 475
            +++ + E  VYY  GCD V+  +
Sbjct: 423 GIRSKLPEGAVYYEKGCDFVSTQT 446



 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 55/100 (55%), Gaps = 13/100 (13%)

Query: 491 VVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
           ++ + GL    E EE+          DR ++ LP  Q+E++  + +  K PVI V+  G 
Sbjct: 604 IIFVGGLSSALEGEEMPVDLPGFKKGDRTNIDLPRVQEEMLKALKKTGK-PVIFVVCSGS 662

Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + + +     N+ ++L A YPG+ G  A+A+V+FGD+NP
Sbjct: 663 TLALPWEA--ENLDAMLEAWYPGQQGGTAVADVLFGDYNP 700


>gi|431798021|ref|YP_007224925.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
 gi|430788786|gb|AGA78915.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
          Length = 906

 Score =  308 bits (790), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 178/428 (41%), Positives = 243/428 (56%), Gaps = 30/428 (7%)

Query: 40  TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAG 99
           T  F F        +R   LV +++L+EK+SQ++N++PAIPRL +P Y WW+E LHGVA 
Sbjct: 47  TLDFSFLDMEKNFEERVDILVDQMSLEEKVSQMMNASPAIPRLKVPEYNWWNECLHGVAR 106

Query: 100 VGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAI 152
            G           AT FPQ I  AASFD  L   IG  I  EARA ++        G   
Sbjct: 107 AGY----------ATVFPQSISVAASFDKNLMKDIGSVISDEARAKHHEFIRNGKRGIYT 156

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
           G+ FW+PNINIFRDPRWGRG ET GEDP +TG+ A  ++ G+Q    + GK    L+  A
Sbjct: 157 GLDFWSPNINIFRDPRWGRGHETYGEDPYLTGELASQFIEGLQD---SDGKY---LKTIA 210

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
             KHF  +   +     R+ FD  V+ +DL +TY P F   VK+ +   IM AYNR  G 
Sbjct: 211 TSKHFAVH---SGPEPLRHTFDVDVSDRDLYETYLPAFRKTVKEAKVYSIMGAYNRFRG- 266

Query: 273 PSCADRN-LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
            SC+  + LL++  R QWGF GY+ SDC A+  I+     A +  +A    +  G D+NC
Sbjct: 267 ESCSGHDFLLNQLLREQWGFEGYVVSDCGAIQDIHTGHKIASTAAEAAAIGVSGGCDLNC 326

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G++   H   AV +  + E EID A+  LF  R RLG+F+    +  + +I   +VCS A
Sbjct: 327 GNYY-THLTEAVAEGLISEEEIDIAVKRLFLARFRLGMFDPEEAVS-YAQIPFGIVCSEA 384

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H  LA QAAQ  +VLLKN   LLPL   K   +A+IGPNA++ ++LLGNY G   + +T 
Sbjct: 385 HNTLARQAAQKSMVLLKNQKNLLPLSVDKIKRIAVIGPNADNVESLLGNYHGIPKKPVTF 444

Query: 452 LQALQNYV 459
           L  +++ V
Sbjct: 445 LDGIKHKV 452



 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 51/115 (44%), Positives = 70/115 (60%), Gaps = 13/115 (11%)

Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEELD----------RVDLVLPGRQQELITRVA 525
           + ID+AV +AK AD  V+++GL Q  E E +D          R  + LP +Q+ L+  V 
Sbjct: 615 SKIDEAVAMAKSADLAVVVLGLSQRLEGESMDVVTPGFDRGDRTAITLPAQQEALLKAVK 674

Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           E  K PVILVL  G  + I +AK   N+ +I+ AGYPGE G  ALA+V+FGD+NP
Sbjct: 675 ETGK-PVILVLNAGSAMAINWAK--ENVDAIISAGYPGEEGGNALADVVFGDYNP 726


>gi|198425898|ref|XP_002119549.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 754

 Score =  308 bits (790), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 210/598 (35%), Positives = 311/598 (52%), Gaps = 59/598 (9%)

Query: 12  LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
           ++ LC   LL+        F    S  ++E FPF   +LPI +R  DLV+RLT++E I Q
Sbjct: 1   MIRLCLFLLLSY-------FHFASSKVTSEEFPFRNFSLPIEERLEDLVNRLTIEEVILQ 53

Query: 72  LVNS-------APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAA 124
           L          APAI RLGI  Y+W +E L G A  G           AT FPQ I  AA
Sbjct: 54  LSRGGVRDNGPAPAITRLGIGPYQWNTECLRGYAMNGD----------ATCFPQPIGLAA 103

Query: 125 SFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQETPG 177
           +FD  L Y++ + + LEARA +N        G   G++ ++P INI R P WGR QET G
Sbjct: 104 TFDQGLIYKMAKTVALEARAKHNNFTKNGNFGDHTGLSCFSPVINILRHPLWGRNQETYG 163

Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
           EDP++T   A +YV G+QGD          L A+A CKHF AY       TTR+ F A V
Sbjct: 164 EDPVLTSLMARAYVTGLQGDEI-------YLPATAVCKHFVAYGGPENIPTTRFSFSANV 216

Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
           +  D+  T+ P F  CV  G A G+MC+YN +NG+PSCA+  +L  T R+++ F GY+ S
Sbjct: 217 SDHDIGTTFYPAFRECVHAG-AQGVMCSYNAINGVPSCANP-MLETTLRKKFHFDGYVVS 274

Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA----AVKQKKLPESEI 353
           D +A+  I     + KS  +     L AG+D+    F + +  +    AV+Q  + E+ +
Sbjct: 275 DENALENIDLYFNFTKSKLETAAVALNAGVDLELTGFGKTNRYSLLNQAVEQGLVTEAAL 334

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
            R+   LF  RM LG F+  P    +  +  DVV S AH+  A++ A    VLLKN  G+
Sbjct: 335 RRSAKRLFRTRMALGEFDP-PEFNHWLNVPIDVVQSLAHRKQAVEVAAKSFVLLKND-GI 392

Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI--TPLQALQNYVENTV--YYPGC- 468
           LPL K     ++++GP  N+++ L G+Y          +PL A  +   + V  +  GC 
Sbjct: 393 LPL-KQLYDKVSIVGPFINNSEALTGDYPAEFNLKYFSSPLFAANSLSSSGVARFTTGCV 451

Query: 469 -----DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITR 523
                +   C++ +     ++  G+D V++ +G  +  E E  DR D+ LPG+Q +LI  
Sbjct: 452 GTNNQNLPICATYNSTNVKEVVTGSDIVLVTLGTGRGVEAESNDRRDINLPGKQLQLIQD 511

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           V + A  PVI+VL   GP+D+++     N  +++   +  +    A+ EV+ G  NP 
Sbjct: 512 VVKYANGPVIVVLFNAGPLDVSWVM--GNTAAVIACHFSAQMTGEAMLEVLTGVVNPA 567


>gi|58581402|ref|YP_200418.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|58425996|gb|AAW75033.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
          Length = 889

 Score =  308 bits (789), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 173/421 (41%), Positives = 245/421 (58%), Gaps = 32/421 (7%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           QRA DLV+ ++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           A
Sbjct: 39  QRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 88

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-AGQA-------IGMTFWAPNINIFR 165
           T FPQ I  AAS++++L  ++G  +  EARA +N AG+         G+T W+PNINIFR
Sbjct: 89  TVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGNDHKRYAGLTIWSPNINIFR 148

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +TG+ AV ++ G+QG+  +        +  A  KH   +   + 
Sbjct: 149 DPRWGRGMETYGEDPFLTGQMAVGFIHGLQGEDLD------HPRTIATPKHLAVH---SG 199

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FD  V+ +D+  TY P F + + +G+A  +MCAYN ++G P+CA   L++   
Sbjct: 200 PEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLINGRV 259

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A+ +
Sbjct: 260 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 318

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V +  H+ LALQAA + IV
Sbjct: 319 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAQHRALALQAAAESIV 377

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
           LLKN+   LPL  +    LA+IGPNA++   L  NY G S   +TPL  L Q +    V 
Sbjct: 378 LLKNNANTLPL--NAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 435

Query: 465 Y 465
           Y
Sbjct: 436 Y 436



 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 56/96 (58%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 718


>gi|15837447|ref|NP_298135.1| family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
 gi|9105751|gb|AAF83655.1|AE003924_1 family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
          Length = 882

 Score =  308 bits (788), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 174/412 (42%), Positives = 240/412 (58%), Gaps = 31/412 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            Q A  LV+++TL EKI+Q +N+APAIPRLGIPAY+WWSE LHG+A  G           
Sbjct: 31  EQHAAALVAKMTLQEKITQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY---------- 80

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-AG-------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L   +G     EARA +N AG       +  G+T W+PNINIF
Sbjct: 81  ATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLAGGPGKDHPRYAGLTLWSPNINIF 140

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +TG+ AVS++RG+QG+      +    +  A  KHF  +   +
Sbjct: 141 RDPRWGRGMETYGEDPYLTGQLAVSFIRGLQGN------IPDHPRTIATPKHFAVH---S 191

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+  DL  TY P F + +  G A  +MCAYN ++G P+CA   LL+  
Sbjct: 192 GPEPGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTR 251

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF+G++ SDCDA+  +     + +    A    LK+G D+NCG+   +    A+ 
Sbjct: 252 LRNDWGFNGFVVSDCDAIDDMTRFHFFRQDNASASAAALKSGNDLNCGN-TYRDLNQAIA 310

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           +  + E+ +D+AL  LF+ R RLG         P+  IG   + +PAH+ LALQAA   +
Sbjct: 311 RGDIDEALLDQALIRLFAARQRLGTLQPR-EHDPYATIGIKHIDTPAHRALALQAAVQSL 369

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           VLLKNS   LPL  +   +LA++GP+A+S   L  NY G S   +TPL  L+
Sbjct: 370 VLLKNSGNTLPL--TPGTTLAVLGPDADSLTALEANYQGTSSTPVTPLTGLR 419



 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/121 (33%), Positives = 65/121 (53%), Gaps = 13/121 (10%)

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQEL 520
           +A ++  + +A      AD +V  +GL    E EEL          DR  + LP  Q+ L
Sbjct: 594 LAPAAPQLAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETL 653

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  V +   KP+I+VL+ G  V + +A++  N  +IL A YPG++G  A+A+ + GD NP
Sbjct: 654 LQHV-KTTGKPLIVVLMSGSAVALNWAQHHAN--AILAAWYPGQSGGTAIAQALAGDVNP 710

Query: 581 G 581
           G
Sbjct: 711 G 711


>gi|224537384|ref|ZP_03677923.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521009|gb|EEF90114.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 863

 Score =  308 bits (788), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 184/469 (39%), Positives = 259/469 (55%), Gaps = 54/469 (11%)

Query: 13  LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
           LCLCF S       +QPP+     NP+          L   +RA DLV RLTL+EK + +
Sbjct: 14  LCLCFLSC------SQPPYK----NPA----------LSPEERANDLVGRLTLEEKAALM 53

Query: 73  VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
            N++PAIPRLGI AY+WW+EALHGV   G           AT FPQ I   ASF++ L Y
Sbjct: 54  QNTSPAIPRLGIKAYDWWNEALHGVGRAGL----------ATVFPQAIGMGASFNNELLY 103

Query: 133 RIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
            +  A+  EARA            +  G+T W PNINIFRDPRWGRGQET GEDP +TG+
Sbjct: 104 DVFTAVSDEARAKNTEFSKEGGLKRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQ 163

Query: 186 YAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLAD 244
             ++ VRG+QG     G+   KL   AC KH+  +    W    R+ F+A  +  +DL +
Sbjct: 164 MGMAVVRGLQGPE---GEKYDKLH--ACAKHYAVHSGPEW---NRHSFNAENIDPRDLWE 215

Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
           TY P F++ V++     +MCAYNR  G P C    LL +  R +WG+   + SDC A+S 
Sbjct: 216 TYLPAFKNLVQKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISD 275

Query: 305 IYDAEGYAKSP--EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
            Y+   +   P  + A    + +G DV CG       + AVK+  + E +ID +L  L  
Sbjct: 276 FYNKGAHETDPDKQHASAKAVLSGTDVECGDSYASLPE-AVKEGLIDEKQIDISLKRLMK 334

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
            R  LG  +  P+   + +I   VV S  H+ LAL+ A++ +VLL+N+  LLPL  +K++
Sbjct: 335 ARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQSLLPL--NKNL 391

Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCD 469
            +A++GPNAN +    GNY G    +IT L+ ++ Y+  +  +Y PGCD
Sbjct: 392 KVAVVGPNANDSVMQWGNYNGFPSHTITLLEGIREYLPESQIIYEPGCD 440



 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 57/113 (50%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           + + VD  K AD ++   G+    E EE+          DR  + LP  Q  L+  + +A
Sbjct: 590 LKQTVDKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 649

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KK ++ V   G  + +T     +   +IL A YPG+AG  A+A V+FGD+NP
Sbjct: 650 GKK-IVFVNFSGSAIALT--PESKTCDAILQAWYPGQAGGTAIANVLFGDYNP 699


>gi|418518029|ref|ZP_13084183.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|410705279|gb|EKQ63755.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 886

 Score =  307 bits (786), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 174/422 (41%), Positives = 241/422 (57%), Gaps = 32/422 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            QRA  LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           
Sbjct: 35  EQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY---------- 84

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIF
Sbjct: 85  ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIF 144

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +TG+ AV ++RG+QG+  N        +  A  KH   +   +
Sbjct: 145 RDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---S 195

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+  D+  TY P F + + +G+A  +MCAYN ++G P CA   LL+  
Sbjct: 196 GPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGR 255

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF G++ SDCDA+  +     +      + V  LKAG D+NCG    +    A+ 
Sbjct: 256 VRGDWGFKGFVVSDCDAIDDMTQFHYFRPDNAGSSVAALKAGHDLNCGHAY-RELGTAIA 314

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           + ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + I
Sbjct: 315 RGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNVAHRALALQAAAESI 373

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTV 463
           VLLKN    LPL       LA+IGPNA++   L  NY G S   +TPL  L Q +    V
Sbjct: 374 VLLKNDANTLPL--RAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQV 431

Query: 464 YY 465
            Y
Sbjct: 432 SY 433



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 56/96 (58%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 681

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 682 WAKTHAD--AIMAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|21243803|ref|NP_643385.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21109396|gb|AAM37921.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 886

 Score =  307 bits (786), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 174/421 (41%), Positives = 240/421 (57%), Gaps = 32/421 (7%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           QRA  LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           A
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
           T FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIFR
Sbjct: 86  TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +TG+ AV ++RG+QG+  N        +  A  KH   +   + 
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---SG 196

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FD  V+  D+  TY P F + + +G+A  +MCAYN ++G P CA   LL+   
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRV 256

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A+ +
Sbjct: 257 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 315

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + IV
Sbjct: 316 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 374

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
           LLKN    LPL       LA+IGPNA++   L  NY G S   +TPL  L Q +    V 
Sbjct: 375 LLKNDANTLPL--RAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 432

Query: 465 Y 465
           Y
Sbjct: 433 Y 433



 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/98 (38%), Positives = 57/98 (58%), Gaps = 13/98 (13%)

Query: 494 MMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
            +GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V 
Sbjct: 621 FVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVA 679

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           + +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 680 LNWAKMHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|418519424|ref|ZP_13085476.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410704868|gb|EKQ63347.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
          Length = 886

 Score =  306 bits (785), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 174/421 (41%), Positives = 240/421 (57%), Gaps = 32/421 (7%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           QRA  LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           A
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
           T FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIFR
Sbjct: 86  TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +TG+ AV ++RG+QG+  N        +  A  KH   +   + 
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---SG 196

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FD  V+  D+  TY P F + + +G+A  +MCAYN ++G P CA   LL+   
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRV 256

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A+ +
Sbjct: 257 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIAR 315

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + IV
Sbjct: 316 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 374

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
           LLKN    LPL       LA+IGPNA++   L  NY G S   +TPL  L Q +    V 
Sbjct: 375 LLKNDANTLPL--RAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 432

Query: 465 Y 465
           Y
Sbjct: 433 Y 433



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/98 (38%), Positives = 57/98 (58%), Gaps = 13/98 (13%)

Query: 494 MMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
            +GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V 
Sbjct: 621 FVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQTLLER-AKASGKPLVVVLMSGSAVA 679

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           + +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 680 LNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|381169747|ref|ZP_09878910.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
 gi|380689765|emb|CCG35397.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
          Length = 874

 Score =  306 bits (785), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 176/431 (40%), Positives = 242/431 (56%), Gaps = 32/431 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P  K      QRA  LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G  
Sbjct: 14  PPAKPAGSPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY- 72

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMT 155
                    AT FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T
Sbjct: 73  ---------ATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLT 123

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
            W+PNINIFRDPRWGRG ET GEDP +TG+ AV ++RG+QG+  N        +  A  K
Sbjct: 124 IWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPK 177

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           H   +   +     R+ FD  V+  D+  TY P F + + +G+A  +MCAYN ++G P C
Sbjct: 178 HIAVH---SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVC 234

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
           A   LL+   R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG   
Sbjct: 235 AADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY 294

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
            +    A+ + ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ L
Sbjct: 295 -RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRAL 352

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           ALQAA + IVLLKN    LPL       LA+IGPNA++   L  NY G S   +TPL  L
Sbjct: 353 ALQAAAESIVLLKNDANTLPL--RAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGL 410

Query: 456 -QNYVENTVYY 465
            Q +    V Y
Sbjct: 411 RQRFGAQQVSY 421



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/98 (38%), Positives = 57/98 (58%), Gaps = 13/98 (13%)

Query: 494 MMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
            +GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V 
Sbjct: 609 FVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVA 667

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           + +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 668 LNWAKMHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 703


>gi|390992294|ref|ZP_10262532.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372552957|emb|CCF69507.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 886

 Score =  306 bits (784), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 174/421 (41%), Positives = 240/421 (57%), Gaps = 32/421 (7%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           QRA  LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           A
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
           T FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIFR
Sbjct: 86  TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +TG+ AV ++RG+QG+  N        +  A  KH   +   + 
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLN------HPRTIATPKHIAVH---SG 196

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FD  V+  D+  TY P F + + +G+A  +MCAYN ++G P CA   LL+   
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRV 256

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A+ +
Sbjct: 257 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGHAY-RELGTAIAR 315

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + IV
Sbjct: 316 GEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 374

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
           LLKN    LPL       LA+IGPNA++   L  NY G S   +TPL  L Q +    V 
Sbjct: 375 LLKNDANTLPL--RAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 432

Query: 465 Y 465
           Y
Sbjct: 433 Y 433



 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/98 (38%), Positives = 57/98 (58%), Gaps = 13/98 (13%)

Query: 494 MMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
            +GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V 
Sbjct: 621 FVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQTLLER-AKASGKPLVVVLMSGSAVA 679

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           + +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 680 LNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|348688508|gb|EGZ28322.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 701

 Score =  306 bits (784), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 211/594 (35%), Positives = 300/594 (50%), Gaps = 82/594 (13%)

Query: 8   LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
           +V PLL L  + +   VD+ + P +C   +   +   FC T+LP+S R  DL++RL LDE
Sbjct: 1   MVLPLLLLVASVVAASVDA-EIPRACVGEH---QKLQFCNTSLPVSARVEDLLARLPLDE 56

Query: 68  KISQLVNSAPAIPR-----LGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
           K   L  +A A PR     +G+P Y W +  +HGV          GT    TSFP  +  
Sbjct: 57  KAILL--TARASPRGNMSSIGLPEYNWGANCVHGVRSTC------GT-NCPTSFPNPV-- 105

Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFR--DPRWGRGQETPGEDP 180
                                                N++I R  DPRWGR  ETP EDP
Sbjct: 106 -------------------------------------NLSIHRRRDPRWGRNTETPSEDP 128

Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
           LV  KY V+Y +G+Q       +    LQA    KH+ AY  +N+ G  R  F+A V+  
Sbjct: 129 LVNSKYGVAYTKGLQEGKHEDPRY---LQAVVTLKHYVAYSYENYGGGNRKTFNAIVSPY 185

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           D ADTY P F S +  G A G+MC+YN VNG+P+CA+  L +K  R   GF GYITSD  
Sbjct: 186 DFADTYFPAFRSSIVDGNAKGVMCSYNSVNGVPACANNELENKLLRGMLGFDGYITSDSG 245

Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT--KAAVKQKKLPESEIDRALH 358
           A+  I D   Y  +  +A    + AG DVN G         K  V+  +L    +D  L 
Sbjct: 246 AIEAISDWLHYVPTRCEAARLAILAGTDVNSGRGFGYMACLKELVESNQLDVKVVDDVLR 305

Query: 359 NLFSVRMRLGLFNGNPTM-QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
           +   +R  LGLF  +P   QP+ K+  + V + A + L+L  A+  IVLL+N+  +LPL 
Sbjct: 306 HTLKLRFELGLF--DPIEDQPYWKVTPNDVNTDAAKKLSLDLARKSIVLLQNNQPVLPL- 362

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRS--------ITPLQAL--QNYVENTVYYPG 467
             + V LA++GP+A + + LLGNY G  C           TP +A+   N   +T Y  G
Sbjct: 363 -RRGVKLAVVGPHAQAKRALLGNYLGQMCHGDYNEVGCIKTPFEAVSASNGDSSTTYALG 421

Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEA 527
           C+    S+A   +AV   +GA+ VVL +G+D++ E E  DR ++ LP  Q +L+ RV  A
Sbjct: 422 CNVTGNSTAGFVEAVKAVQGAEAVVLFLGIDKSVEAEVRDRNNIDLPAIQVQLLQRV-RA 480

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             KP ++VL+ GG   +T         +++ A YPG  GA A+ +++FGD NPG
Sbjct: 481 VGKPTVVVLMNGGV--LTAEDIIGQTDALVEAFYPGFFGAQAMTDILFGDANPG 532


>gi|94969405|ref|YP_591453.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
 gi|94551455|gb|ABF41379.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
          Length = 902

 Score =  306 bits (784), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 182/464 (39%), Positives = 256/464 (55%), Gaps = 40/464 (8%)

Query: 30  PFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEW 89
           P      +P+T  +     T P ++RA DLV R+TLDEK +QL + A AIPRLG+P Y+ 
Sbjct: 25  PIQLPAQSPATPVY--RDATRPANERAHDLVQRMTLDEKAAQLEDWATAIPRLGVPDYQT 82

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG 149
           WSEALHGVA  G           AT FPQ I  AA++D+ +  ++G  I  EAR  YN  
Sbjct: 83  WSEALHGVARAGH----------ATVFPQAIGMAATWDTEMVKQMGDVISTEARGKYNEA 132

Query: 150 QA-------IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
           Q         G+TFW+PNINIFRDPRWGRGQET GEDP +TGK  ++++ GVQG      
Sbjct: 133 QREGNHRIFWGLTFWSPNINIFRDPRWGRGQETYGEDPFLTGKMGIAFIDGVQGPD---- 188

Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
                 +A A  KHF  +       + R+ FD +V+ +DL +TY   F + V  G    +
Sbjct: 189 --AAHPKAVATSKHFAVHSGPE---SLRHGFDVKVSPRDLEETYLAAFRATVTDGHVKSV 243

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVV 320
           MCAYN V+G+ +CA++ LL +  ++ WGF G++ SDC A+  +   +G+  +P+   A  
Sbjct: 244 MCAYNAVDGMGACANKMLLEEHLKQAWGFKGFVVSDCGAIMDV--TQGHKNAPDIVHAAA 301

Query: 321 DVLKAGMDVNCGSFLQKHTKA--AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
             L AG D++C  +         AV++  + E  + RA   L++ R  LG+F+  P   P
Sbjct: 302 ISLAAGTDLSCSIWEPGFNTLADAVRKGLVTEDMVTRAAERLYAARFELGMFD-EPGSNP 360

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
             KI    V S  H+  AL+AA++ IVLLKN  GLLPL  +K++  A+IGP A    +L 
Sbjct: 361 NDKIDMSQVASEEHRAEALKAAEESIVLLKND-GLLPLKNAKTI--AVIGPTAELLASLE 417

Query: 439 GNYAGPSCRSITPLQAL--QNYVENTVYYPGCDTVACSSASIDK 480
           GNY G   R +TPL  +  Q   EN  Y  G    A +   + +
Sbjct: 418 GNYNGQPVRPVTPLDGIVKQFGAENVRYAQGSSLAAGAPVPVPR 461



 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 52/95 (54%), Gaps = 13/95 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EE+          DR  + LP  Q++L+  +  AA KPV++V L G  V + 
Sbjct: 640 GLSPNLEGEEMPIKIEGFSGGDRTSIDLPATQEKLLEALG-AAGKPVVVVNLSGSAVALN 698

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +A  +++ G+IL A YPG  G  A+A+ + G+ NP
Sbjct: 699 WA--NQHAGAILQAWYPGVEGGTAIAKTLAGESNP 731


>gi|336415363|ref|ZP_08595703.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940959|gb|EGN02821.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
           3_8_47FAA]
          Length = 861

 Score =  306 bits (783), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 177/444 (39%), Positives = 245/444 (55%), Gaps = 38/444 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+  T+L   QRA DL+ RLTL+EK+S + N++PAIPRLGI  YEWW+EALHGV   
Sbjct: 22  QLLPYQDTSLAAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
           G           AT FPQ I   ASF+  L Y +  A   EAR    ++     +    G
Sbjct: 82  GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           +TFW PN+NIFRDPRWGRGQET GEDP +TG+  ++ VRG+QG +     KL       A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KHF  +    W    R+ FDA  +  +DL +TY P F+  V++     +MCAYNR  G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
            P C    LL +  R +WG+ G + SDC A+S  Y    +   P  E A    ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDL 302

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CGS        AVK   + E EID +L  L + R  LG  +       + +I   V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNS 358

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             HQ LAL+ A++ +VLL+N + +LPL  +  + +A++GPNAN +    GNY G    ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416

Query: 450 TPLQALQNYVE--NTVYYPGCDTV 471
           T L+A++  +     +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440



 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 52/104 (50%), Gaps = 13/104 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           AD ++   G+  + E EE+          DR D+ LP  Q++L+  + +  KK V+ +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKVGKK-VVFINY 656

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            G    I          +IL A YPG+AG  A+ + ++G++NPG
Sbjct: 657 SGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|294665226|ref|ZP_06730524.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292605014|gb|EFF48367.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 886

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 173/421 (41%), Positives = 241/421 (57%), Gaps = 32/421 (7%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           QRA  LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           A
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
           T FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIFR
Sbjct: 86  TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +TG+ AV ++RG+QG+  +        +  A  KH   +   + 
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLD------HPRTIATPKHIAVH---SG 196

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FD  V+  D+  TY P F + + +G+A  +MCAYN ++G P+CA   LL+   
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 256

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A+++
Sbjct: 257 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRDLGTAIER 315

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + IV
Sbjct: 316 GDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 374

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
           LLKN    LPL       LA+IGPNA++   L  NY G S   +TPL  L Q +    V 
Sbjct: 375 LLKNDANTLPL--KAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 432

Query: 465 Y 465
           Y
Sbjct: 433 Y 433



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/97 (39%), Positives = 57/97 (58%), Gaps = 13/97 (13%)

Query: 495 MGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
           +GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V +
Sbjct: 622 VGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVAL 680

Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 681 NWAKTHAD--AIVAAWYPGQSGGTAMARMLAGDDNPG 715


>gi|294627323|ref|ZP_06705909.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292598405|gb|EFF42556.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 886

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 173/421 (41%), Positives = 241/421 (57%), Gaps = 32/421 (7%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           QRA  LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           A
Sbjct: 36  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 85

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
           T FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIFR
Sbjct: 86  TVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFR 145

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +TG+ AV ++RG+QG+  +        +  A  KH   +   + 
Sbjct: 146 DPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDLD------HPRTIATPKHIAVH---SG 196

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FD  V+  D+  TY P F + + +G+A  +MCAYN ++G P+CA   LL+   
Sbjct: 197 PEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGRV 256

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A+++
Sbjct: 257 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRDLGTAIER 315

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + IV
Sbjct: 316 GDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESIV 374

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVY 464
           LLKN    LPL       LA+IGPNA++   L  NY G S   +TPL  L Q +    V 
Sbjct: 375 LLKNDANTLPL--KAGTRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVS 432

Query: 465 Y 465
           Y
Sbjct: 433 Y 433



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/98 (38%), Positives = 57/98 (58%), Gaps = 13/98 (13%)

Query: 494 MMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
            +GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V 
Sbjct: 621 FVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVA 679

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           + +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 680 LNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 715


>gi|293370605|ref|ZP_06617157.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292634339|gb|EFF52876.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 861

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 175/443 (39%), Positives = 244/443 (55%), Gaps = 36/443 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+  T+L   QRA DL+ RLTL+EK+S + N++PAIPRLGI  YEWW+EALHGV   
Sbjct: 22  QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
           G           AT FPQ I   ASF+  L Y +  A   EAR    ++     +    G
Sbjct: 82  GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PN+NIFRDPRWGRGQET GEDP +TG+  ++ VRG+QG      +  G  +  AC
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGP-----EDAGYDKLHAC 186

Query: 214 CKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
            KHF  +    W    R+ FDA  +  +DL +TY P F+  V++     +MCAYNR  G 
Sbjct: 187 AKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGE 243

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVN 330
           P C    LL +  R +WG+ G + SDC A+S  Y    +   P  E A    ++ G D+ 
Sbjct: 244 PCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAAAVRTGTDLE 303

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
           CGS        AVK   + E EID +L  L + R  LG  +       + +I   V+ S 
Sbjct: 304 CGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNSK 359

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            HQ LAL+ A++ +VLL+N + +LPL  +  + +A++GPNAN +    GNY G    ++T
Sbjct: 360 EHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTVT 417

Query: 451 PLQALQNYVEN--TVYYPGCDTV 471
            L+A++  +     +Y PGCD V
Sbjct: 418 LLEAVRAKLPEGQIIYEPGCDRV 440



 Score = 39.7 bits (91), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 57/114 (50%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           ++ AV     AD ++   G+  + E EE+          DR D+ LP  Q++L+  + +A
Sbjct: 588 LNLAVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KK V+ +   G    I          +IL A YPG+AG  A+ + ++G++NPG
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|365118446|ref|ZP_09337032.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363649697|gb|EHL88801.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1283

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 195/559 (34%), Positives = 296/559 (52%), Gaps = 56/559 (10%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS--APAIPRLGIPAYEWWSEALHG 96
           S ET  +    +PI +R  DL+ RLTL+EK+ QL +S  +  I RL IPA    +E LHG
Sbjct: 66  SKETPKYLNPNIPIEERIDDLLPRLTLEEKVIQLSDSWGSKGIARLKIPAM-LKTEGLHG 124

Query: 97  VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTF 156
                          G+T FP  I   ++FD+ L   +G+A  +EA+A      A     
Sbjct: 125 ----------QSYATGSTIFPHGINMGSTFDTELIQEVGKATAIEAKA------ANLRVS 168

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W+P +++ RD RWGR +ET GEDP + G+  V++++G QG+              AC KH
Sbjct: 169 WSPVLDVARDARWGRVEETYGEDPYLVGRIGVAWIKGFQGE-----------HMFACPKH 217

Query: 217 FTAYDLD-NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           F  +      + +  Y    RV    + + +  PF   +K+  A G+M AY   NG+P  
Sbjct: 218 FAGHGQPVGGRDSHDYGLSDRV----MRNIHLAPFRDVIKEANAFGVMAAYGLWNGVPDN 273

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
             + LL K  R +WGF G++ SDC     I   +    + E+A    ++AG+D+ CGS  
Sbjct: 274 GSKELLQKILREEWGFEGFVVSDCSGPENIQRKQSVVGTMEEAAAMAVRAGVDIECGSAY 333

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
           +K   +AVK+  + ESE+D  L  +F  +MRLGLF+  P+++           +P H+ L
Sbjct: 334 KKALASAVKKGIIKESELDANLRRVFRAKMRLGLFD-RPSIENMVWNKLPEYDTPEHRAL 392

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC--RSITPLQ 453
           A + A    VLLKN + LLPL K+   ++A+IGPNA+  +T  G+Y+      + I+ L+
Sbjct: 393 ARKVAVKSTVLLKNENNLLPLDKNIK-TIAVIGPNADQGQT--GDYSAKYAPGQIISVLE 449

Query: 454 ALQNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK------- 503
            ++N+V      +Y  GC  +   +    +AV+IAK AD V+L++G +  + +       
Sbjct: 450 GVKNHVSPSTKVLYAQGCTQLDMDTTGFAEAVNIAKQADAVILVVGDNSNRHENGNKKST 509

Query: 504 --EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGY 561
             E +D   L +PG Q++LI +  EA  KPV+LVL+ G P  +T+   D NI SIL   Y
Sbjct: 510 TGENVDGATLEIPGVQRQLI-KAVEATGKPVVLVLVNGKPFTLTWE--DENIESILETWY 566

Query: 562 PGEAGAVALAEVIFGDHNP 580
           PGE G  A A++IFGD NP
Sbjct: 567 PGEEGGNATADIIFGDENP 585


>gi|262405256|ref|ZP_06081806.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294644754|ref|ZP_06722499.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294810589|ref|ZP_06769241.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345508031|ref|ZP_08787672.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|229444722|gb|EEO50513.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|262356131|gb|EEZ05221.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292639876|gb|EFF58149.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294442250|gb|EFG11065.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
          Length = 861

 Score =  305 bits (782), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 177/444 (39%), Positives = 245/444 (55%), Gaps = 38/444 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+  T+L   QRA DL+ RLTL+EK+S + N++PAIPRLGI  YEWW+EALHGV   
Sbjct: 22  QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
           G           AT FPQ I   ASF+  L Y +  A   EAR    ++     +    G
Sbjct: 82  GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           +TFW PN+NIFRDPRWGRGQET GEDP +TG+  ++ VRG+QG +     KL       A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KHF  +    W    R+ FDA  +  +DL +TY P F+  V++     +MCAYNR  G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
            P C    LL +  R +WG+ G + SDC A+S  Y    +   P  E A    ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDL 302

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CGS        AVK   + E EID +L  L + R  LG  +       + +I   V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNS 358

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             HQ LAL+ A++ +VLL+N + +LPL  +  + +A++GPNAN +    GNY G    ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416

Query: 450 TPLQALQNYVEN--TVYYPGCDTV 471
           T L+A++  +     +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440



 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 57/114 (50%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           ++ AV     AD ++   G+  + E EE+          DR D+ LP  Q++L+  + +A
Sbjct: 588 LNLAVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KK V+ +   G    I          +IL A YPG+AG  A+ + ++G++NPG
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|189463167|ref|ZP_03011952.1| hypothetical protein BACCOP_03878 [Bacteroides coprocola DSM 17136]
 gi|189430146|gb|EDU99130.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 865

 Score =  305 bits (782), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 180/445 (40%), Positives = 248/445 (55%), Gaps = 38/445 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           + FP+  T+L   QRA DL+ RLTL+EK+S + N++PAIPRLGI AY+WW+EALHGV   
Sbjct: 23  QQFPYQNTSLTPEQRASDLLERLTLEEKVSLMQNASPAIPRLGIKAYDWWNEALHGVGRA 82

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIG 153
           G           AT FPQ I  AASFD  L Y++  A+  EARA Y       N  +  G
Sbjct: 83  GI----------ATVFPQTIGMAASFDDELIYKVFTAVSDEARAKYTEFSKSGNLKRYQG 132

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           +TFW PNINIFRDPRWGRGQET GEDP +T +  V+ VRG+QG D     KL       A
Sbjct: 133 LTFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVRGLQGPDNMKYDKLH------A 186

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KH+  +    W    R+ F+A  +  +DL +TY P F++ V++     +MCAYNR  G
Sbjct: 187 CAKHYAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKALVQEADVKEVMCAYNRFEG 243

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
            P C    LL +  R +W + G I SDC A+S  +    +   P  E A    + +G D+
Sbjct: 244 EPCCGSNRLLMQILRDEWKYKGIIVSDCGAISDFWRKGDHETHPDKETASAGAVLSGTDL 303

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CG+   K    AV++  + E +ID ++  L + R  LG  + +     +  I   VV S
Sbjct: 304 ECGNNY-KSLPEAVQKGLIDEKQIDISVKRLLTARFELGEMDEHVC---WDSIPYSVVDS 359

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
            AH+ LAL+ A+  IVLL+N + +LPL   + + +ALIGPNAN +    GNY G    + 
Sbjct: 360 KAHKDLALEIARKSIVLLQNRNNILPL--KEDMKIALIGPNANDSVMQWGNYNGFPSHTS 417

Query: 450 TPLQALQNYV--ENTVYYPGCDTVA 472
           T  +AL+  +     +Y  GCD  +
Sbjct: 418 TLYEALKERIPANQLIYDFGCDRTS 442



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 60/114 (52%), Gaps = 13/114 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           ++  ++D  K AD +V   G+  + E EE+          DR  + LP  Q+ LI+ + +
Sbjct: 590 NLQASIDKVKAADVIVFAGGISPSLEGEEMPVNAEGFKGGDRTTIELPAIQRRLISELKK 649

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             K P+I V   G  V +      +   +IL A YPG+AG  A+A+V+FGD+NP
Sbjct: 650 LGK-PIIFVNYSGSAVGL--EPESKICDAILQAWYPGQAGGTAVADVLFGDYNP 700


>gi|295086418|emb|CBK67941.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
           XB1A]
          Length = 861

 Score =  305 bits (782), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 177/444 (39%), Positives = 245/444 (55%), Gaps = 38/444 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+  T+L   QRA DL+ RLTL+EK+S + N++PAIPRLGI  YEWW+EALHGV   
Sbjct: 22  QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
           G           AT FPQ I   ASF+  L Y +  A   EAR    ++     +    G
Sbjct: 82  GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           +TFW PN+NIFRDPRWGRGQET GEDP +TG+  ++ VRG+QG +     KL       A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KHF  +    W    R+ FDA  +  +DL +TY P F+  V++     +MCAYNR  G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
            P C    LL +  R +WG+ G + SDC A+S  Y    +   P  E A    ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPDKEHASAGAVRAGTDL 302

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CGS        AVK   + E EID +L  L + R  LG  +       + +I   V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNS 358

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             HQ LAL+ A++ +VLL+N + +LPL  +  + +A++GPNAN +    GNY G    ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416

Query: 450 TPLQALQNYVEN--TVYYPGCDTV 471
           T L+A++  +     +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440



 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 57/114 (50%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           ++ AV     AD ++   G+  + E EE+          DR D+ LP  Q++L+  + +A
Sbjct: 588 LNLAVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KK V+ +   G    I          +IL A YPG+AG  A+ + ++G++NPG
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|237719778|ref|ZP_04550259.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229451047|gb|EEO56838.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 861

 Score =  305 bits (780), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 177/444 (39%), Positives = 245/444 (55%), Gaps = 38/444 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+  T+L   QRA DL+ RLTL+EK+S + N++PAIPRLGI  YEWW+EALHGV   
Sbjct: 22  QLLPYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----G 153
           G           AT FPQ I   ASF+  L Y +  A   EAR    ++     +    G
Sbjct: 82  GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           +TFW PN+NIFRDPRWGRGQET GEDP +TG+  ++ VRG+QG +     KL       A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KHF  +    W    R+ FDA  +  +DL +TY P F+  V++     +MCAYNR  G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
            P C    LL +  R +WG+ G + SDC A+S  Y    +   P  E A    ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETYPDKEHASAGAVRAGTDL 302

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CGS        AVK   + E EID +L  L + R  LG  +       + +I   V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNS 358

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             HQ LAL+ A++ +VLL+N + +LPL  +  + +A++GPNAN +    GNY G    ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416

Query: 450 TPLQALQNYVEN--TVYYPGCDTV 471
           T L+A++  +     +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440



 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 57/114 (50%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           ++ AV     AD ++   G+  + E EE+          DR D+ LP  Q++L+  + +A
Sbjct: 588 LNLAVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKA 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KK V+ +   G    I          +IL A YPG+AG  A+ + ++G++NPG
Sbjct: 648 GKK-VVFINYSGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|189464583|ref|ZP_03013368.1| hypothetical protein BACINT_00926 [Bacteroides intestinalis DSM
           17393]
 gi|189436857|gb|EDV05842.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 879

 Score =  305 bits (780), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 183/469 (39%), Positives = 255/469 (54%), Gaps = 54/469 (11%)

Query: 13  LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
           LCLCF S       +QPP+     NP+          L   +RA DLV RLTL+EK + +
Sbjct: 30  LCLCFLSC------SQPPYK----NPA----------LSPEERANDLVGRLTLEEKAALM 69

Query: 73  VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
            N++PAIPRLGI AY+WW+EALHGV   G           AT FPQ I   ASF++ L Y
Sbjct: 70  QNTSPAIPRLGIKAYDWWNEALHGVGRAGL----------ATVFPQAIGMGASFNNELLY 119

Query: 133 RIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
            +  AI  EARA            +  G+T W PNINIFRDPRWGRGQET GEDP +T +
Sbjct: 120 DVFTAISDEARAKNTEFSKEGGLKRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTSQ 179

Query: 186 YAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLAD 244
             ++ VRG+QG     G+   KL   AC KH+  +    W    R+ F+A  +  +DL +
Sbjct: 180 MGMAVVRGLQGPE---GEKYDKLH--ACAKHYAVHSGPEW---NRHSFNAENIDPRDLWE 231

Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
           TY P F+  V++     +MCAYNR  G P C    LL    R +WG+   + SDC A+S 
Sbjct: 232 TYLPAFKDLVQKAHVKEVMCAYNRFEGEPCCGSNRLLMHILRDEWGYKEIVVSDCWAISD 291

Query: 305 IYDAEGYAKSP--EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
            Y+   +   P  + A    + +G D+ CG         AVK+  + E +ID +L  L  
Sbjct: 292 FYNKGAHETDPDKQHASAKAVLSGTDIECGDSYGS-LPEAVKEGLIDEKQIDISLKRLMK 350

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
            R  LG  +  P+   + +I   VV S  H+ LAL+ A++ +VLL+N+  LLPL  +K++
Sbjct: 351 ARFELGEMD-EPSQVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQSLLPL--NKNL 407

Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCD 469
            +A++GPNAN +    GNY G    +IT L+ ++ Y+  +  +Y PGCD
Sbjct: 408 KVAVVGPNANDSVMQWGNYNGFPSHTITLLEGIREYLPESQIIYEPGCD 456



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 57/113 (50%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           + + V+  K AD ++   G+    E EE+          DR  + LP  Q  L+  + +A
Sbjct: 606 LKQTVNKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 665

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KK ++ V   G  + +T     +   +IL A YPG+AG  A+A V+FGD+NP
Sbjct: 666 GKK-IVFVNFSGSAIALT--PESKTCDAILQAWYPGQAGGTAIANVLFGDYNP 715


>gi|384428895|ref|YP_005638255.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
 gi|341937998|gb|AEL08137.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
          Length = 888

 Score =  304 bits (779), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 170/422 (40%), Positives = 246/422 (58%), Gaps = 32/422 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            QRA  LV++++ +EK++Q +N+APAIPRLGIPAYEWW+E LHG+A  G           
Sbjct: 37  EQRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWNEGLHGIARNGY---------- 86

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIF
Sbjct: 87  ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIF 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +TG+ AV ++RG+QGD           +  A  KH   +   +
Sbjct: 147 RDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLE------HPRTIATPKHIAVH---S 197

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+ +D+  TY P F + + +G+A  +MCAYN ++G P+CA   LL+  
Sbjct: 198 GPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGR 257

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG+   +    A++
Sbjct: 258 VRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCGTAY-RALGTAIE 316

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           + ++ E+ +D++L  LF+ R RLG     P    + ++GA  + +  ++ LALQAA + I
Sbjct: 317 RGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESI 375

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTV 463
           VLLKN++  LPL    S  LA+IGPNA++   L  NY G S + +TPL  L Q +    V
Sbjct: 376 VLLKNANATLPL--KASTRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQV 433

Query: 464 YY 465
            Y
Sbjct: 434 RY 435



 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 13/104 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           +D VV  +GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+
Sbjct: 617 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLM 675

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            G  V + +AK   +  +I+ A YPG++G  A+A  + GD NPG
Sbjct: 676 SGSAVALNWAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPG 717


>gi|383110854|ref|ZP_09931672.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
 gi|313694427|gb|EFS31262.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
          Length = 861

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 181/472 (38%), Positives = 257/472 (54%), Gaps = 51/472 (10%)

Query: 13  LCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
           L +C  SLL         FSC       +  P+  T+L   QRA DL+ RLTL+EK++ +
Sbjct: 9   LGVCSLSLL---------FSC------AQKLPYQDTSLTAEQRAEDLLPRLTLEEKVALM 53

Query: 73  VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWY 132
            N++PAIPRLGI  Y+WW+EALHGV   G           AT FPQ I   ASF+  L Y
Sbjct: 54  QNASPAIPRLGIKEYDWWNEALHGVGRAGL----------ATVFPQSIGMGASFNDSLLY 103

Query: 133 RIGQAIGLEARA---LYNAGQAI----GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGK 185
            +  A+  EAR    +++    +    G+TFW PN+NIFRDPRWGRGQET GEDP +TG+
Sbjct: 104 EVFDAVSDEARVKSRIFSENGVLKRYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQ 163

Query: 186 YAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLAD 244
             ++ VRG+QG     GK     +  AC KHF  +    W    R+ FDA  +T +DL +
Sbjct: 164 LGMAVVRGLQGP--ENGKYD---KLHACAKHFAVHSGPEW---NRHSFDAENITPRDLWE 215

Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
           TY P F+  V++     +MCAYNR  G P C    LL +  R +WG+ G + SDC A+S 
Sbjct: 216 TYLPAFKDLVQKADVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISD 275

Query: 305 IYDAEGYAKSP--EDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
            Y    +   P  E A    + +G D+ CG         AVK   + E +ID +L  L +
Sbjct: 276 FYRPGTHGTHPDKEHASAGAVLSGTDLECGGEYGSLAD-AVKAGLIDEKQIDVSLKRLLT 334

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
            R  LG  +  P    + +I A  + S  HQ LAL+ A++ +VLL+N + +LPL  +  +
Sbjct: 335 ARFELGEMDEQPA---WAEIPASTLNSKEHQDLALRMARESLVLLQNKNDILPL--NTDL 389

Query: 423 SLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN--TVYYPGCDTVA 472
            +A++GPNAN +    GNY G    ++T L+A+++ +     +Y PGCD  +
Sbjct: 390 KVAVMGPNANDSVMQWGNYNGIPGHTVTLLEAVRSKLPEGQVMYEPGCDRTS 441



 Score = 47.0 bits (110), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 60/114 (52%), Gaps = 13/114 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           +++ AV+  K AD V+   G+  + E EE+          DR D+ LP  Q++L+  + +
Sbjct: 587 NLNLAVEKVKDADVVLFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPAVQRDLLKALKK 646

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           A KK V+ +   G  + +          +IL   YPG+AG  A+ +V+FGD+NP
Sbjct: 647 AGKK-VVFINYSGSAIGLV--PESNTCEAILQGWYPGQAGGTAIVDVLFGDYNP 697


>gi|160885419|ref|ZP_02066422.1| hypothetical protein BACOVA_03419 [Bacteroides ovatus ATCC 8483]
 gi|156109041|gb|EDO10786.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 861

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 177/444 (39%), Positives = 245/444 (55%), Gaps = 38/444 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+  T+L   QR  DL+ RLTL+EK+S + N++PAIPRLGI  YEWW+EALHGV   
Sbjct: 22  QLLPYQDTSLAAEQRTEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRA 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAG---QAIG 153
           G           AT FPQ I   ASF+  L Y +  A   EAR       ++G   +  G
Sbjct: 82  GL----------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGDSGVLKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           +TFW PN+NIFRDPRWGRGQET GEDP +TG+  ++ VRG+QG +     KL       A
Sbjct: 132 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDARYDKLH------A 185

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KHF  +    W    R+ FDA  +  +DL +TY P F+  V++     +MCAYNR  G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDV 329
            P C    LL +  R +WG+ G + SDC A+S  Y    +   P  E A    ++AG D+
Sbjct: 243 EPCCGSNRLLMQILRDEWGYEGIVVSDCGAISDFYRPGTHGTHPDKEHASAGAVRAGTDL 302

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CGS        AVK   + E EID +L  L + R  LG  +       + +I   V+ S
Sbjct: 303 ECGSEYASLAD-AVKAGLIDEKEIDISLKRLLTARFELGEMD---EQSAWSEIPTSVLNS 358

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             HQ LAL+ A++ +VLL+N + +LPL  +  + +A++GPNAN +    GNY G    ++
Sbjct: 359 KEHQALALRMARESLVLLQNKNNILPL--NTHLKVAVMGPNANDSVMQWGNYNGIPAHTV 416

Query: 450 TPLQALQNYVEN--TVYYPGCDTV 471
           T L+A++  +     +Y PGCD V
Sbjct: 417 TLLEAVRAKLPEGQIIYEPGCDRV 440



 Score = 38.5 bits (88), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 53/104 (50%), Gaps = 13/104 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           AD ++   G+  + E EE+          DR D+ LP  Q++L+  + +A KK V+ +  
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK-VVFINY 656

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            G    I          +IL A YPG+AG  A+ + ++G++NPG
Sbjct: 657 SGSA--IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPG 698


>gi|336463686|gb|EGO51926.1| hypothetical protein NEUTE1DRAFT_125528 [Neurospora tetrasperma
           FGSC 2508]
          Length = 788

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 201/540 (37%), Positives = 281/540 (52%), Gaps = 53/540 (9%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGT---IRGATSFPQVILTAASFDSYLWYRIGQA 137
           R+G+P Y WWSE LHGVAG   G+ FN T      ATSF   I   ASFD  L Y +G A
Sbjct: 13  RIGLPKYAWWSEGLHGVAG-SPGVTFNTTGYPFSYATSFANAINLGASFDDDLVYEVGTA 71

Query: 138 IGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGD 197
           I  EARA  N G   G+ +W PN+N ++DPRWGRG ETPGEDPL    Y  + + G++G+
Sbjct: 72  ISTEARAFANFGFG-GLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAMLAGLEGN 130

Query: 198 TFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQG 257
                  +   +  A CKH+ AYDL+ W G TRY+F+A VT+QDL++ Y PPF+ C +  
Sbjct: 131 -------ETVRKVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDS 183

Query: 258 RASGIMCAYNRV-----------------NGIPSCADRNLLSKTARRQWGF---HGYITS 297
           +   IMC+YN +                    P+CA+  L++   R  W +   + YITS
Sbjct: 184 KVGSIMCSYNALTIRDMAGGNPDEIINLTTAQPACANTYLMT-ILRDHWNWTEHNNYITS 242

Query: 298 DCDAV-SIIYDAEGYAKSPEDAVVDVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEI 353
           DC+A+   + D   ++++P +A     KAG D  C   GS L      A  Q  LPE+ I
Sbjct: 243 DCNAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTD-VVGAYNQSLLPEAVI 301

Query: 354 DRALHNLFSVRMRLGLFN------------GNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           D AL  L+   +R G  +            G+ +   +  +  + V +P+ Q LAL++A 
Sbjct: 302 DTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQELALRSAT 361

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           +GIVLLKNS  LLPL  S    +ALIG  AN+  T+ G Y+G       PL A Q    +
Sbjct: 362 EGIVLLKNSGSLLPLDFSSGKKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLS 421

Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMM-GLDQTQEKEELDRVDLVLPGRQQEL 520
             Y  G    A    +       A     VVL   G D T   E+LDR  +  P  Q +L
Sbjct: 422 FSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPKAQMKL 481

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           ++ +A    KP++++ L G  VD +F   + N+ SILW GYPG++G  A+ +V+ G   P
Sbjct: 482 LSELA-GLGKPLVVIQL-GDQVDDSFLLENGNVSSILWVGYPGQSGGTAVFDVLTGKKAP 539


>gi|390630430|ref|ZP_10258413.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
 gi|390484359|emb|CCF30761.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
          Length = 674

 Score =  303 bits (777), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 191/526 (36%), Positives = 285/526 (54%), Gaps = 53/526 (10%)

Query: 82  LGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLE 141
           + IP Y +W+EALHGVA  G           AT FPQ I  AA+FD +L   I   IG E
Sbjct: 1   MNIPEYNYWNEALHGVARAGV----------ATVFPQAIGLAATFDDHLINEIADVIGTE 50

Query: 142 ARALYNAGQA-------IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
            RA YN            G+TFW+PN+NIFRDPRWGRG ET GEDP +T K+ V++++G+
Sbjct: 51  GRAKYNEFTKHDDRDIYKGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGL 110

Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
           QG           L+ +A  KHF  +   +     R+ FDA V+ +DL +TY P F++ V
Sbjct: 111 QGQA-------KYLKLAATAKHFAVH---SGPEGLRHGFDAVVSDKDLYETYLPAFKAAV 160

Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKS 314
           ++     IM AYN V+G+P+     LL      +W F G++ SD  A   +++   Y K 
Sbjct: 161 EEADVESIMTAYNAVDGVPASVSEMLLKDILHDKWSFEGHVVSDYMAPEDVHENHKYTKD 220

Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
             + +   +KAG+++  G  +++    A+ +  + E EI  A+ +L++ R+RLG+F    
Sbjct: 221 AAETMGLAIKAGLNLVAGH-IEQSLHEALDRGLVTEEEITNAVISLYATRVRLGMF---A 276

Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
           T   +  I  +   + AH  L+  AA+   VLLKN  G+LPL K    ++A++GPNA+S 
Sbjct: 277 TDNEYDAIPYEANDTKAHNNLSEIAAEKSFVLLKND-GVLPLRKETMEAIAVVGPNAHSE 335

Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTV---YYPGC----DTVACSSASID----KAVD 483
             LLGNY G   RS T L+ +Q  + + V   Y  G     D  A   A  D    +AV 
Sbjct: 336 IALLGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGLFQDHAAEPLAKADERESEAVI 395

Query: 484 IAKGADHVVLMMGLDQTQEKEE---------LDRVDLVLPGRQQELITRVAEAAKKPVIL 534
            A+ +D VV ++GLD T E EE          D+ +L LPGRQ++L+ R+  A  KPV++
Sbjct: 396 AAEHSDVVVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSLPGRQRQLLERLL-AVGKPVVV 454

Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +L  G  + +   +   N+ +I+   YPG  G +A+A+V+FG  +P
Sbjct: 455 LLASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADVLFGAVSP 500


>gi|188990656|ref|YP_001902666.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
 gi|167732416|emb|CAP50610.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
          Length = 888

 Score =  303 bits (776), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 170/422 (40%), Positives = 245/422 (58%), Gaps = 32/422 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            QRA  LV++++ +EK++Q +N+APAIPRLGIPAYEWWSE LHG+A  G           
Sbjct: 37  EQRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNGY---------- 86

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIF
Sbjct: 87  ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIF 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +TG+ AV ++RG+QGD           +  A  KH   +   +
Sbjct: 147 RDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLE------HPRTIATPKHIAVH---S 197

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+ +D+  TY P F + + +G+A  +MCAYN ++G P+CA   LL+  
Sbjct: 198 GPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGR 257

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG+   +    A++
Sbjct: 258 VRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAY-RALGTAIE 316

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           + ++ E+ +D++L  LF+ R RLG     P    + ++GA  + +  ++ LALQAA + I
Sbjct: 317 RGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESI 375

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTV 463
           VLLKN++  LPL       LA+IGPNA++   L  NY G S + +TPL  L Q +    V
Sbjct: 376 VLLKNANATLPL--KAGTRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQV 433

Query: 464 YY 465
            Y
Sbjct: 434 RY 435



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 55/96 (57%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 683

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK   +  +I+ A YPG++G  A+A  + GD NPG
Sbjct: 684 WAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPG 717


>gi|427387362|ref|ZP_18883418.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725523|gb|EKU88394.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
           12058]
          Length = 865

 Score =  303 bits (775), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 177/440 (40%), Positives = 251/440 (57%), Gaps = 33/440 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+    L  S+RA DL+ R+TL+EKISQ+ N +PAI RLGIPAY WW+EALHGVA  GK 
Sbjct: 24  PYKNPDLTPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYNWWNEALHGVARAGK- 82

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTF 156
                    AT FPQ I  AA+FD+   +     +  EARA Y+  Q         G+TF
Sbjct: 83  ---------ATVFPQAIGLAATFDNQAVHETFSIVSDEARAKYHDFQRKGERDGYKGLTF 133

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W PNINI+RDPRWGRG ET GEDP +T    ++ V+G+QGD    GK     +  AC KH
Sbjct: 134 WTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGD--GTGKYD---KTHACAKH 188

Query: 217 FTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           +  +    W    R+ FDA+ ++ +DL +TY P F++ V +G+   +MCAYNR  G P C
Sbjct: 189 YAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRYEGEPCC 245

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGS 333
           +++ LL +  R  WG+   + SDC A+   Y    +   P    A  D + +G D+ CG 
Sbjct: 246 SNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLECGG 305

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
                 + AV++  + E +I+ ++  L   R +LG+F+ N T+  + +I   VV S  H 
Sbjct: 306 SYSSLNE-AVRKGLISEDKINESVFRLLRARFQLGMFDDN-TLVSWSEIPYSVVESKEHV 363

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
             AL+ A+  +VLL N + +LPL KS    +A++GPNAN +  L  NY G   +S+T L+
Sbjct: 364 AKALEMARKSMVLLTNKNNILPLSKSVR-KVAVLGPNANDSVMLWANYNGFPTKSVTILE 422

Query: 454 ALQNYV-ENTVYYP-GCDTV 471
            ++N + E  VYY  GCD V
Sbjct: 423 GIRNKLPEGAVYYEKGCDFV 442



 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 60/108 (55%), Gaps = 13/108 (12%)

Query: 483 DIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPV 532
           D A  AD ++ + GL  + E EE+          DR ++ LP  Q+E++  + +  K PV
Sbjct: 596 DKAAEADVIIFVGGLSSSLEGEEMPVDLPGFRKGDRTNIDLPQVQEEMLKALKKTGK-PV 654

Query: 533 ILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + VL  G  + + +     N+ +I+ A YPG+ G  A+A+V+FGD+NP
Sbjct: 655 VFVLCSGSTLALPWEA--ENLDAIIEAWYPGQQGGTAVADVLFGDYNP 700


>gi|21232323|ref|NP_638240.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|21114093|gb|AAM42164.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
          Length = 888

 Score =  303 bits (775), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 170/422 (40%), Positives = 245/422 (58%), Gaps = 32/422 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            QRA  LV++++ +EK++Q +N+APAIPRLGIPAYEWWSE LHG+A  G           
Sbjct: 37  EQRAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNGY---------- 86

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIF
Sbjct: 87  ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIF 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +TG+ AV ++RG+QGD           +  A  KH   +   +
Sbjct: 147 RDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLE------HPRTIATPKHIAVH---S 197

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+ +D+  TY P F + + +G+A  +MCAYN ++G P+CA   LL+  
Sbjct: 198 GPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGR 257

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG+   +    A++
Sbjct: 258 VRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAY-RALGTAIE 316

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           + ++ E+ +D++L  LF+ R RLG     P    + ++GA  + +  ++ LALQAA + I
Sbjct: 317 RGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESI 375

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTV 463
           VLLKN++  LPL       LA+IGPNA++   L  NY G S + +TPL  L Q +    V
Sbjct: 376 VLLKNANATLPL--KAGTRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQV 433

Query: 464 YY 465
            Y
Sbjct: 434 RY 435



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 55/96 (57%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 683

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK   +  +I+ A YPG++G  A+A  + GD NPG
Sbjct: 684 WAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPG 717


>gi|285018984|ref|YP_003376695.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
 gi|283474202|emb|CBA16703.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
           PC73]
          Length = 904

 Score =  303 bits (775), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 176/420 (41%), Positives = 242/420 (57%), Gaps = 32/420 (7%)

Query: 55  RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
           RA  LV+++T  EKI+Q +N APAIPRLGIPAYEWWSE LHG+A  G+          AT
Sbjct: 55  RATALVAKMTRAEKIAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGE----------AT 104

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYN-AG-------QAIGMTFWAPNINIFRD 166
            FPQ I  AAS+++ L + +G     EARA +N AG       +  G+T W+PNINIFRD
Sbjct: 105 VFPQAIGLAASWNTDLLHAVGTVTSTEARAKFNLAGGPGKNHARYGGLTIWSPNINIFRD 164

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRG ET GEDP +TG+ AV ++ G+QGD           +  A  KH   +   +  
Sbjct: 165 PRWGRGMETYGEDPYLTGQLAVGFIHGLQGDDPT------HPRTIATPKHLAVH---SGP 215

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
            + R+ FD  V+  D   TY P F + + +G A  +MCAYN ++GIP+CA   L+    R
Sbjct: 216 ESGRHGFDVDVSPHDFEATYSPAFRAAIVEGHAGSVMCAYNALHGIPACAADWLIDGRVR 275

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQK 346
             WGF G++ SDCDA+  +     Y      +    LKAG D+NCG +  +    A+ + 
Sbjct: 276 GNWGFKGFVVSDCDAIDDMTQFHYYRADNAGSAAAALKAGHDLNCG-YAYRDLGTALDRG 334

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
           +  E+ +DR+L  LF+ R RLG      +  P+ ++GA  + SP H+ LALQAAQ  +VL
Sbjct: 335 EAEEAMLDRSLVRLFAARYRLGELQPR-SKDPYARLGAKDIDSPTHRALALQAAQQSLVL 393

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT-VYY 465
           L+N +  LPL     + LA+IGPNA++   L  NY G S   +TPLQ L+     T V+Y
Sbjct: 394 LQNRNDTLPL--RPGLRLAVIGPNADALAALEANYQGTSVAPVTPLQGLRARFGTTQVHY 451



 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/96 (42%), Positives = 57/96 (59%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR DL LP  QQ L+ R A+A+ KP+I+VL+ G  V + 
Sbjct: 641 GLSPDVEGEELRIDVPGFDGGDRNDLSLPAAQQALLER-AKASGKPLIVVLMSGSAVALN 699

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK  ++  +IL A YPG++G  A+A+ + GD NPG
Sbjct: 700 WAK--QHADAILAAWYPGQSGGTAIAQALAGDINPG 733


>gi|390957160|ref|YP_006420917.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
 gi|390412078|gb|AFL87582.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
          Length = 908

 Score =  303 bits (775), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 185/438 (42%), Positives = 248/438 (56%), Gaps = 33/438 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +    L   QRA DLV R+TL+EK  Q+VN A AIPRL +PAY++W+E LHGVA  G   
Sbjct: 24  YLNPALTPQQRAADLVGRMTLEEKSLQMVNGAAAIPRLNVPAYDYWNEGLHGVARSGY-- 81

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFW 157
                   AT FPQ I  AA++D+ L  +IG  I  EARA  N            G+TFW
Sbjct: 82  --------ATMFPQAIGMAATWDAPLLKQIGDVIATEARAKNNEALRRNNHDIYFGLTFW 133

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           +PNINIFRDPRWGRGQET GEDP +T +  V+++ G+QG      K+       A  KHF
Sbjct: 134 SPNINIFRDPRWGRGQETYGEDPHLTTQLGVNFIEGLQGTDPKFYKV------IATPKHF 187

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
             +   +     R+KFD   T  DL DTY P F + +   +A  IMCAYNR++G P+C  
Sbjct: 188 AVH---SGPEEGRHKFDVEPTPHDLWDTYLPQFRAAIVDAKADSIMCAYNRIDGQPACGS 244

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVD--VLKAGMDVNCGSFL 335
           + LL    R  W F G++TSDC A+   +    +   P+    D   L AG D NCGS  
Sbjct: 245 KLLLVDILRNDWKFQGFVTSDCGAIDDFFRPNTHQTEPDAEHADKAALLAGTDTNCGSTY 304

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
           +K    AVK   + ES+ID +L  LF  R+RLGLF+   ++ P+ +I    V SPA+  +
Sbjct: 305 RK-LGDAVKSGLIKESDIDVSLRRLFEARVRLGLFDPAGSV-PYAQIPFSQVNSPANAAV 362

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           A +AA++ +VLLKN  G+LPL   K  ++A+IGPN  S  +L GNY G +     P+ AL
Sbjct: 363 AKRAAEESMVLLKND-GILPLKAGKYKTIAVIGPNGASLSSLEGNYNGMAHDPRMPVDAL 421

Query: 456 QNYVE--NTVYYPGCDTV 471
           ++ +   N VY PG   V
Sbjct: 422 RSALSGTNVVYAPGAPYV 439



 Score = 68.9 bits (167), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 44/113 (38%), Positives = 62/113 (54%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           + +A++ A  +D VV M+GL    E EE+          DR D+ LP  QQ L+  +  A
Sbjct: 621 LPEALEAANKSDLVVAMLGLSPDLEGEEMPVKLPGFVGGDRTDISLPASQQALLQGLI-A 679

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             KP I+VLL G  + I  A  D    +IL + YPGEAG+ ALA+ + G +NP
Sbjct: 680 TGKPTIVVLLNGSALAINLA--DEKANAILESWYPGEAGSTALADTLVGRNNP 730


>gi|256394357|ref|YP_003115921.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
 gi|256360583|gb|ACU74080.1| glycoside hydrolase family 3 domain protein [Catenulispora
           acidiphila DSM 44928]
          Length = 1357

 Score =  302 bits (774), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 209/612 (34%), Positives = 310/612 (50%), Gaps = 76/612 (12%)

Query: 34  DPSNPSTETFP-FCKTTLPISQRARDLVSRLTLDEKISQL-VNSAPAIPRLGIPAYEWWS 91
           D + P+  + P +  T+ P   RA DLVSR+TL EK +QL   SAPAIPRLG+  Y + +
Sbjct: 37  DTAAPAASSTPIYLNTSYPFEARAADLVSRMTLAEKAAQLNTTSAPAIPRLGVQQYTYQA 96

Query: 92  EALHGVAGVGKGIFFNGTIRG----ATSFPQVILTAASFDSYLWYRIGQAIGLEAR---- 143
           EA HG+  +G G   +G++ G    ATSFP    ++ S+D  L Y+   A+  EAR    
Sbjct: 97  EAQHGINYLG-GDQNSGSVAGNPPVATSFPTNFASSMSWDPALVYQETTAVSDEARGLVD 155

Query: 144 -ALYNAGQA-IG--------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRG 193
            +L+  GQ  +G        +TFWAP +N+ RDPRWGR  E  GEDP + G+ A ++V G
Sbjct: 156 KSLFGTGQNNLGPSASDYGSLTFWAPTVNLDRDPRWGRTDEAFGEDPYLVGQMAGAFVNG 215

Query: 194 VQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESC 253
            QG++  G  L G L+A+A  KH+   D++      R    + V+  DL D Y   F   
Sbjct: 216 FQGNSMTGQSLDGYLKAAATAKHYALNDVEQ----NRTGISSNVSDTDLRDYYTKQFADL 271

Query: 254 VKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDA--EGY 311
           ++    +G+M +YN +NG PS AD    ++ A+R +GF+GY+TSDC AV   Y     G+
Sbjct: 272 IENSHVAGLMTSYNAINGTPSVADTYTANQLAQRTYGFNGYVTSDCGAVGTAYRNFPAGH 331

Query: 312 AKSPEDAVVD--------------------------VLKAGMDVNCG--SFLQKHTKAAV 343
           A +P     D                           L+AG  VNCG   F  ++ +AA+
Sbjct: 332 AWAPPGWTTDGGDTNSIWTNTSTGAKISGAAGGEAYSLRAGTQVNCGGDEFSLQNIQAAI 391

Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVLALQAAQD 402
               L E  ID  L  LF++RM  G F+  P  + P+  I    + SPAHQ LA   A +
Sbjct: 392 SAGILSEGVIDSDLTKLFTIRMETGEFD--PASKVPYTSITKAQIQSPAHQALATSVADN 449

Query: 403 GIVLLKN------SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
            +VLLKN      S  LLP   SK  ++ ++G  AN  +  LG+Y+G     +  +Q L 
Sbjct: 450 SLVLLKNANVSGTSAPLLPASASKLANVVILGDMAN--QVTLGDYSGAPSLQVNAVQGLT 507

Query: 457 NYVE------NTVYYPGCDTVACSSASIDKAVDIA--KGADHVVLMMGLDQTQEKEELDR 508
             ++      N ++     +   +SA+   +   A  K AD VV+ +G +Q   +E  DR
Sbjct: 508 TAIKAANPSANILFDAAGTSSTTTSAATLSSATQAAIKKADLVVMFVGTNQNNAQEGNDR 567

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
             L +PG    LIT+          LV+   GPV I+      ++ +++++GY GE+   
Sbjct: 568 TTLNMPGNYDSLITQTTALGNPKTALVVQSDGPVKIS--DVQGSVPAVVFSGYNGESQGT 625

Query: 569 ALAEVIFGDHNP 580
           ALA+V+ G  NP
Sbjct: 626 ALADVLLGKQNP 637


>gi|66767544|ref|YP_242306.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|66572876|gb|AAY48286.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 888

 Score =  302 bits (774), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 170/422 (40%), Positives = 245/422 (58%), Gaps = 32/422 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            QRA  LV++++ +EK++Q +N+APAIPRLGIPAYEWWSE LHG+A  G           
Sbjct: 37  EQRAAALVAQMSREEKVAQSMNAAPAIPRLGIPAYEWWSEGLHGIARNGY---------- 86

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIF
Sbjct: 87  ATVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIF 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +TG+ AV ++RG+QGD           +  A  KH   +   +
Sbjct: 147 RDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLE------HPRTIATPKHIAVH---S 197

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+ +D+  TY P F + + +G+A  +MCAYN ++G P+CA   LL+  
Sbjct: 198 GPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNGR 257

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG+   +    A++
Sbjct: 258 VRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGTAY-RALGTAIE 316

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           + ++ E+ +D++L  LF+ R RLG     P    + ++GA  + +  ++ LALQAA + I
Sbjct: 317 RGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAKDIDNAGNRALALQAAAESI 375

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTV 463
           VLLKN++  LPL       LA+IGPNA++   L  NY G S + +TPL  L Q +    V
Sbjct: 376 VLLKNANATLPL--KAGTRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQV 433

Query: 464 YY 465
            Y
Sbjct: 434 RY 435



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 55/96 (57%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 683

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK   +  +I+ A YPG++G  A+A  + GD NPG
Sbjct: 684 WAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPG 717


>gi|423227459|ref|ZP_17213920.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392623089|gb|EIY17195.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 864

 Score =  302 bits (774), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 178/447 (39%), Positives = 256/447 (57%), Gaps = 33/447 (7%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
           N + +  P+    L  S+RA DL+ R+TL+EK+SQ+ N +PAI RLGIPAY+WW+EALHG
Sbjct: 16  NVTAQNEPYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHG 75

Query: 97  VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA----- 151
           VA  GK          AT FPQ I  AA+FD+   Y     +  EARA Y+  Q      
Sbjct: 76  VARAGK----------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERD 125

Query: 152 --IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
              G+TFW PNINI+RDPRWGRG ET GEDP +T    ++ V+G+QG    GG  K   +
Sbjct: 126 GYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG----GGTGKYD-K 180

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           A AC KH+  +    W    R+ FDA+ ++ +DL +TY P F++ VK+G+   +MCAYNR
Sbjct: 181 AHACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVKEGKVKEVMCAYNR 237

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAG 326
             G P C+++ LL +  R  WG+   + SDC A+   Y    +   P    A  D + +G
Sbjct: 238 FEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSG 297

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
            D+ CG       + AV++  + E +I+ ++  L   R +LG+F+ +  +  + +I   V
Sbjct: 298 TDLECGGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFDDD-ALVSWSEIPYSV 355

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           V S  H   AL+ A+  +VLL N +  LPL KS    +A++GPNAN +  L  NY G   
Sbjct: 356 VESKEHVAKALEMARKSMVLLTNKNHTLPLSKSIR-KVAVLGPNANDSVMLWANYNGFPT 414

Query: 447 RSITPLQALQNYV-ENTVYYP-GCDTV 471
           +S+T L+ +++ + E TVYY  GCD V
Sbjct: 415 KSVTILEGIKSKLPEGTVYYEKGCDYV 441



 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 60/111 (54%), Gaps = 13/111 (11%)

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAK 529
           +  D A  AD ++ + GL  T E EE+          DR ++ LP  Q E++  + +  K
Sbjct: 592 EVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQAEMLKALKKTGK 651

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            PVI VL  G  + + +     N+ +IL A YPG+ G  A+A+V+FGD+NP
Sbjct: 652 -PVIFVLCSGSTLALPWEA--ENLDAILEAWYPGQQGGTAVADVLFGDYNP 699


>gi|116181370|ref|XP_001220534.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
 gi|88185610|gb|EAQ93078.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
          Length = 549

 Score =  302 bits (773), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 193/519 (37%), Positives = 277/519 (53%), Gaps = 43/519 (8%)

Query: 33  CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
           CDP           K T P  +RA  LV  L ++EK+  LV+ +    RLG+PAY WWSE
Sbjct: 39  CDP-----------KATPP--ERAAALVKALNIEEKLQNLVDMSKGAERLGLPAYAWWSE 85

Query: 93  ALHGVAGVGKGIFFNGTIRG----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA 148
           ALHGVA    G+ FN T  G    ATSF   I  +A+FD  L Y++   I  EARA  NA
Sbjct: 86  ALHGVA-ASPGVRFNRTAGGRFSSATSFANSITLSAAFDDELVYKVADTISTEARAFANA 144

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           G A G+ +W PNIN ++DPRWGRG ETPGEDP+    Y  + + G++GD  +  K+    
Sbjct: 145 GLA-GLDYWTPNINPYKDPRWGRGHETPGEDPVRIKGYVKALLAGLEGDDPSIRKV---- 199

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
              A CKH+ AYDL+ W+GTTR++FDA V++QDL++ Y PPF+ C +  +    MC+YN 
Sbjct: 200 --VATCKHYAAYDLERWQGTTRHRFDAVVSLQDLSEYYLPPFQQCARDSKVGSFMCSYNA 257

Query: 269 VNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAVSIIYDAEGY-----AKSPEDAVV 320
           +NG P+CA   L+    R+ WG+   + YITSDC+A+        +      ++  +A  
Sbjct: 258 LNGTPACASTYLMDDILRKHWGWTEHNNYITSDCNAIQDFLPGPKWHNFSSTQTEAEAAA 317

Query: 321 DVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
              +AG D  C   G         A  Q  L E  ID AL  L+   +R+G F+   +  
Sbjct: 318 VAYQAGTDTVCEVPGWPPYTDVIGAYNQTLLSEEVIDTALKRLYEGLVRVGYFD-PASGS 376

Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP-KSKSVSLALIGPNANSAKT 436
           P+  IG + V +P  Q LALQ+  DG+VLLKN  G LPL  + K+V+L     N+ +   
Sbjct: 377 PYRSIGWEDVNTPEAQELALQSGTDGLVLLKND-GTLPLNLEDKTVALIGFWANSTNGGR 435

Query: 437 LLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASID----KAVDIAKGADHVV 492
           +LG Y+G      +P+ A +       Y  G      + A+ID    KA++ AK ++ ++
Sbjct: 436 ILGGYSGFPPYIHSPVDAAEKLNLTYHYASGPLAENITQAAIDDWVAKALEPAKKSNVIL 495

Query: 493 LMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKP 531
              G D +   E+LDR  +  P  Q  +I  ++   + P
Sbjct: 496 YFGGTDTSIAAEDLDRDSIAWPEIQLAVIEALSALRQAP 534


>gi|86143269|ref|ZP_01061671.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
 gi|85830174|gb|EAQ48634.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
          Length = 873

 Score =  302 bits (773), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 174/436 (39%), Positives = 242/436 (55%), Gaps = 30/436 (6%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
           F C     + + FPF    L +  R  DLVSR+TL+EKISQL++ APAI RL IP Y WW
Sbjct: 12  FCCCIQIEAQKQFPFQNEQLDLETRLNDLVSRMTLEEKISQLMSDAPAIERLNIPKYNWW 71

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-- 148
           +E+LHGVA  G           AT FPQ I  AAS+D+ L   +  AI  EARA ++   
Sbjct: 72  NESLHGVARAGY----------ATVFPQSISIAASWDAQLVREVATAISDEARAKHHEYL 121

Query: 149 -----GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
                    G+T W+PNINIFRDPRWGRG ET GEDP +TG     YV+G+QGD      
Sbjct: 122 RRDQHDIYQGLTMWSPNINIFRDPRWGRGHETYGEDPFLTGTLGAQYVKGLQGDD----- 176

Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
               L+  A  KHF  +   +    +R+ FDA  + +DL +TY P F   VK  +   +M
Sbjct: 177 -PEYLKVVATAKHFAVH---SGPEESRHYFDANTSERDLWETYLPAFRMLVKDAQVQSVM 232

Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
            AYNR  G  + +++ LL    R +WGF GY+ SDC A++ I++          A    L
Sbjct: 233 TAYNRFRGEAASSNK-LLFDILRNKWGFDGYVVSDCGAINDIWEDHKITADAASASALAL 291

Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
           + G D+NCG+   K  K A+    + E +I+ A+  LF  R++LG+F+    +  +  I 
Sbjct: 292 ETGTDLNCGA-TYKSLKEAIANGLITEEKINIAIERLFRARLKLGMFDTEENLS-YATIP 349

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
             V  + +H  LA +AAQ+ IVLLKN   +LPL K     +A+IGPNA++ ++L GNY G
Sbjct: 350 FSVNTNASHTALARKAAQESIVLLKNEAHMLPLSKDLK-QIAVIGPNAHNVQSLWGNYNG 408

Query: 444 PSCRSITPLQALQNYV 459
                +T +Q ++N V
Sbjct: 409 TPKNPVTVVQGIRNKV 424



 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 45/114 (39%), Positives = 73/114 (64%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           +++AV++A+ +D  +L++GL++  E EE+          DR  L LP  Q+EL+ R   A
Sbjct: 589 LERAVNLAEDSDVTILVLGLNERLEGEEMRIDVEGFSKGDRTALDLPLEQRELM-RALVA 647

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             KP++LVLL G  + I +A+   ++ +IL AGYPG+ G  A+A+V+FGD+NP 
Sbjct: 648 TGKPIVLVLLNGSALAINYAQ--EHVPAILSAGYPGQEGGNAIADVLFGDYNPA 699


>gi|325922365|ref|ZP_08184139.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
 gi|325547147|gb|EGD18227.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
          Length = 889

 Score =  301 bits (772), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 168/412 (40%), Positives = 238/412 (57%), Gaps = 31/412 (7%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           QRA  LV++++ +EK++Q +N APAIPRLGIPAYEWWSE LHG+A  G           A
Sbjct: 39  QRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------A 88

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIFR 165
           T FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIFR
Sbjct: 89  TVFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFR 148

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +TG+ AV ++RG+QGD  +        +  A  KH   +   + 
Sbjct: 149 DPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLD------HPRTIATPKHIAVH---SG 199

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
               R+ FD  V+ +D+  TY P F + +  G+A  +MCAYN ++G P+CA   LL+   
Sbjct: 200 PEPGRHSFDVDVSPRDVEATYTPAFRAALIDGQAGSVMCAYNSLHGTPACAADWLLNGRV 259

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQ 345
           R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG +  +    A+++
Sbjct: 260 RGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCG-YAYRALGTAIER 318

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            ++ E+ +D++L  LF+ R RLG     P   P+  +GA  + + A++ LAL+AA   IV
Sbjct: 319 GEVDEALLDQSLVRLFAARYRLGELEA-PHKDPYATLGAKDIDNTANRALALKAAAQSIV 377

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           LLKN    LPL       LA+IGPNA++   L  NY G S   +TPL  L+ 
Sbjct: 378 LLKNDANTLPL--KAGARLAVIGPNADALAALEANYQGTSSTPVTPLLGLRQ 427



 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 56/96 (58%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR D+ LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK   +  +I+ A YPG++G  A+A ++ GD NPG
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPG 718


>gi|405955586|gb|EKC22647.1| Putative beta-D-xylosidase 2 [Crassostrea gigas]
          Length = 745

 Score =  301 bits (771), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 186/563 (33%), Positives = 295/563 (52%), Gaps = 44/563 (7%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQL--------VNSAPAIPRLGIPAYEWWSE 92
           + +PF  T+LP   R +DLV RLT++E + Q+         + APA+PRLG+  + W +E
Sbjct: 24  QDYPFRNTSLPWDARVKDLVDRLTIEEIVVQMSRGGSGPRASPAPAVPRLGVGPFSWNTE 83

Query: 93  ALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN----- 147
            L G    G           ATSFPQ +  AA+F + +   +  A  +E RA +N     
Sbjct: 84  CLRGDVYAGN----------ATSFPQALGLAATFSTEVICDVASATSIEVRAKFNDYQRR 133

Query: 148 --AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLK 205
              G   G++ ++P INI R P WGR QET GEDP ++G+ A  +V+ +QGD        
Sbjct: 134 KIYGDHKGISCFSPVINIMRHPLWGRNQETYGEDPFLSGELAAIFVKCLQGDD------P 187

Query: 206 GKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCA 265
             ++A+A CKHF  +        +R+ FDA+V+ +D   T+ P F+ CV+ G  S +MC+
Sbjct: 188 TYIRANAGCKHFDVHGGPENIPVSRFSFDAKVSERDWRLTFLPAFKRCVQAGSYS-LMCS 246

Query: 266 YNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA 325
           +NR+NG+P+C ++ LL+   R +WGF GY+ SD +A+  I     Y  +  D     +KA
Sbjct: 247 FNRINGVPACGNKRLLTDILRTEWGFTGYVVSDQEAIENIMTYHHYTNNSVDTAALCVKA 306

Query: 326 GMDVNCGSFLQKHTK----AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
           G ++   +   K T      A+K  KL + ++ +++  LF  RMRLG F+  P   P+  
Sbjct: 307 GCNLELSTNEVKPTYFYIIDALKAGKLDKEDLVKSVSPLFYTRMRLGEFDP-PDHNPYNF 365

Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
           I   V+ S  H+ ++L AA    VLLKN  G LP+ K    +++++GP A++    +G+Y
Sbjct: 366 IDLSVIQSEEHRAISLNAAMKSFVLLKNKGGFLPITKLFD-TISVLGPMADNKYQQIGSY 424

Query: 442 AG---PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
           A    PS  + TPLQ L    +   Y  GC+  ACS  +  +       +D   + +G  
Sbjct: 425 APDVMPS-YTTTPLQGLSKLSKRVQYAAGCNDNACSKYNRTEIQRAVNSSDIFFVCLGTG 483

Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKK--PVILVLLCGGPVDITFAKYDRNIGSI 556
              E E+ DR  + LPG+Q +L+      + K  P++L+L  GGPV+IT+A     + +I
Sbjct: 484 PMIENEDHDRASMELPGQQAQLLKDAIMFSAKGVPIVLLLFNGGPVNITWADRSDRVVAI 543

Query: 557 LWAGYPGEAGAVALAEVIFGDHN 579
           +   +P +    A+  V+    N
Sbjct: 544 MECFFPAQETGEAVLRVVTNTGN 566


>gi|383114360|ref|ZP_09935124.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
 gi|313693934|gb|EFS30769.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
          Length = 863

 Score =  301 bits (771), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 170/439 (38%), Positives = 247/439 (56%), Gaps = 38/439 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +P+  T L + QRA DL+ RLTL+EK++ + N++PAIPRLGI  YEWW+EALHGVA  G 
Sbjct: 26  YPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
                     AT FPQ I  AASF+  L Y +  A+  EARA        GQ     G+T
Sbjct: 86  ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
            W PN+NIFRDPRWGRGQET GEDP ++G+  ++ VRG+QG +     KL       AC 
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKL------HACA 189

Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           KHF  +    W    R+ F+A  +  +DL +TY P F+  V++     +MCAYNR  G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
            C    LL++  R  WGF G + +DC A+   +  + +   P+   A  D + +G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLEC 306

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G   +  T  AVK+  + E +I+ ++  L   R  LG  N   +  P+  I   V+  P 
Sbjct: 307 GGNFKSITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPK 362

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+ LAL+ A + +VLL+N++ +LPL  ++ + +A+IGPNAN +    GNY G    ++T 
Sbjct: 363 HKELALKMAHESLVLLQNNNNILPL--NRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420

Query: 452 LQALQNYVENT--VYYPGC 468
           L+ ++  + +   +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 54/105 (51%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           + AD V+   G+    E E +          DR ++ LP  Q+E++  + +  KK  + V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              G  + I      +N  +IL A YPG+AG  A+A+V+FGD+NP
Sbjct: 657 NFSGSAMAIV--PETQNCDAILQAWYPGQAGGTAVADVLFGDYNP 699


>gi|293370402|ref|ZP_06616956.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292634550|gb|EFF53085.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 863

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 169/439 (38%), Positives = 247/439 (56%), Gaps = 38/439 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +P+  T L + QRA DL+ RLTL+EK++ + N++PAIPRLGI  YEWW+EALHGVA  G 
Sbjct: 26  YPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
                     AT FPQ I  AASF+  L Y +  A+  EARA        GQ     G+T
Sbjct: 86  ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
            W PN+NIFRDPRWGRGQET GEDP ++G+  ++ VRG+QG +     KL       AC 
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKL------HACA 189

Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           KHF  +    W    R+ F+A  +  +DL +TY P F+  V++     +MCAYNR  G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
            C    LL++  R  WGF G + +DC A+   +  + +   P+   A  D + +G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLEC 306

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G   +  T  AVK+  + E +I+ ++  +   R  LG  N   +  P+  I   V+  P 
Sbjct: 307 GGNFKSITD-AVKKDLISEEKINTSVKRVLKARFELGEMN---STHPWSNIPFSVIDCPK 362

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+ LAL+ A + +VLL+N++ +LPL  ++ + +A+IGPNAN +    GNY G    ++T 
Sbjct: 363 HKELALKMAHESLVLLQNNNNILPL--NRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420

Query: 452 LQALQNYVENT--VYYPGC 468
           L+ ++  + +   +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439



 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 54/105 (51%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           + AD V+   G+    E E +          DR ++ LP  Q+E++  + +  KK  + V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              G  + I      +N  +IL A YPG+AG  A+A+V+FGD+NP
Sbjct: 657 NFSGSAMAIV--PETQNCDAILQAWYPGQAGGTAVADVLFGDYNP 699


>gi|71731103|gb|EAO33170.1| Beta-glucosidase [Xylella fastidiosa subsp. sandyi Ann-1]
          Length = 882

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 179/440 (40%), Positives = 246/440 (55%), Gaps = 39/440 (8%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            Q A  LV+++T  EKI+Q +N+APAIPRLGIPAY+WWSE LHG+A  G           
Sbjct: 31  EQHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY---------- 80

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L   +G     EARA +N          +  G+T W+PNINIF
Sbjct: 81  ATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIF 140

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +TG+ AVS++RG+QGDT +        +  A  KHF  +   +
Sbjct: 141 RDPRWGRGMETYGEDPYLTGQLAVSFIRGLQGDTPD------HPRTIATPKHFAVH---S 191

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+  DL  TY P F + +  G A  +MCAYN ++G P+CA   LL+  
Sbjct: 192 GPEQGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTR 251

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF+G++ SDCDA+  +     + +    A    LK+G D+NCG+   +    A+ 
Sbjct: 252 LRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGN-TYRDLNQAIA 310

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           +  + ES +D+AL  LF+ R RLG         P+  IG   + +PAH+ LALQAA   +
Sbjct: 311 RGDIDESTLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSL 369

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ-------- 456
           VLLKNS   LPLP     +LA++GP+A+S   L  NY G S   +TPL  L+        
Sbjct: 370 VLLKNSGNTLPLP--PETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGLRTRFGTAKV 427

Query: 457 NYVENTVYYPGCDTVACSSA 476
           +Y +     PG  +    +A
Sbjct: 428 HYAQGASLAPGVPSTIPETA 447



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 65/121 (53%), Gaps = 13/121 (10%)

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQEL 520
           +A ++  + +A      AD +V  +GL    E EEL          DR  + LP  Q+ L
Sbjct: 594 LAPAAPQLAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETL 653

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  V +   KP+I+VL+ G  V + +A++  +  +IL A YPG++G  A+A+ + GD NP
Sbjct: 654 LQHV-KTTGKPLIVVLMSGSAVALNWAQHHAD--AILAAWYPGQSGGTAIAQALAGDVNP 710

Query: 581 G 581
           G
Sbjct: 711 G 711


>gi|380512525|ref|ZP_09855932.1| beta-glucosidase [Xanthomonas sacchari NCPPB 4393]
          Length = 885

 Score =  300 bits (768), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 168/407 (41%), Positives = 234/407 (57%), Gaps = 31/407 (7%)

Query: 59  LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
           LV+++T  EKI+Q +N+APAIPRLG+PAYEWWSE LHG+A  G+          AT FPQ
Sbjct: 40  LVAKMTRAEKIAQAMNAAPAIPRLGVPAYEWWSEGLHGIARNGE----------ATVFPQ 89

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYN-AG-------QAIGMTFWAPNINIFRDPRWG 170
            I  AA+++  L + +G     EARA +N AG       +  G+T W+PNINIFRDPRWG
Sbjct: 90  AIGLAATWNPELLHDVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWG 149

Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
           RG ET GEDP +TG+ AV ++ G+QGD           +  A  KH   +   +     R
Sbjct: 150 RGMETYGEDPYLTGRLAVGFIHGLQGDD------PAHPRTIATPKHLAVH---SGPEPGR 200

Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           + FD  V+  D   TY P F + +  G+A  +MCAYN ++G P+CA   L+    R  WG
Sbjct: 201 HGFDVDVSPHDFEATYSPAFRAAIVDGQAGSVMCAYNSLHGTPACAADWLIDGRVRGDWG 260

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
           F G++ SDCDA+  +     Y      +    LKAG D+NCG+   +    A  + +  E
Sbjct: 261 FKGFVVSDCDAIDDMTQFHYYRPDNAGSSAAALKAGHDLNCGTAY-RELGIAFDRGEADE 319

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
           + +DR+L  LF+ R RLG         P+ ++GA  + S AH+ LALQAAQ  +VLLKN+
Sbjct: 320 ALLDRSLVRLFAARYRLGELQPR-RNDPYARLGARDIDSAAHRALALQAAQQSLVLLKNA 378

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           +  LPL     + LA++GPNA++   L  NY G S + +TPLQ L+ 
Sbjct: 379 NATLPL--RPGLRLAVLGPNADALAALEANYQGTSVQPVTPLQGLRT 423



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 57/96 (59%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR DL LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 622 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 680

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +A  +++  +I+ A YPG++G  A+A+ + GD NPG
Sbjct: 681 WA--EQHADAIIAAWYPGQSGGTAIAQALAGDINPG 714


>gi|167521708|ref|XP_001745192.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776150|gb|EDQ89770.1| predicted protein [Monosiga brevicollis MX1]
          Length = 614

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 188/533 (35%), Positives = 276/533 (51%), Gaps = 33/533 (6%)

Query: 63  LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
           LT  E           + R+G+P Y+W   A+HGV         +GT+   TSFP  +  
Sbjct: 56  LTAREGGGGSPGPPGNVSRIGLPEYDWGMNAIHGVQSSCIKDD-DGTVYCPTSFPNPVNY 114

Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQA----------IGMTFWAPNINIFRDPRWGRG 172
             +++   +  +G+ IG+E RAL+ AG            IG+  W+PNINI R P WGR 
Sbjct: 115 GFTWNYSAYLELGRIIGVETRALWLAGAVEASTWSGRPHIGLDTWSPNINIARSPLWGRN 174

Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
           QE PGEDP + G++  +Y  G+QGD          LQA    KH+ AY L++  G TR+ 
Sbjct: 175 QEVPGEDPFMNGQFGKAYTLGLQGDD------DTYLQAIVTLKHWDAYSLEDSDGATRHN 228

Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
           F+A V+   L DTY P F   V +G+A G+MC+YN VNGIP+CA   LL    R  W F 
Sbjct: 229 FNAIVSNFSLMDTYWPAFRVAVTEGKAKGVMCSYNAVNGIPTCA-HPLLRTVLRDLWKFD 287

Query: 293 GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM-DVNCGSFLQKHTKAAVKQKKLPES 351
           GY++SD  AV  I D   Y  S   A    ++ G  D++ G+   K     V +      
Sbjct: 288 GYVSSDTGAVEDISDNHKYTPSWATAACAAIRDGQTDIDSGAVYMKSLLQGVSEGHCRME 347

Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
           ++D AL N   +R  LGLF+     Q +  +    V + A +   +    + +VLL+N +
Sbjct: 348 DVDNALRNTLRLRFELGLFDPVEN-QSYWHVPLAAVNTNASRATNMLHTLESMVLLQNKN 406

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS------ITPLQALQNYV--ENTV 463
            +LPL  +  V  ALIGP+A + + ++GNY G  C        ++P  AL + +  +   
Sbjct: 407 NVLPLASNTKV--ALIGPHAKAQEDMVGNYLGQLCPDNNFDCVVSPHDALVSILGTDAVT 464

Query: 464 YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITR 523
           Y PG +   CS + ID+AV +A  AD  VLM+G+D++ E E  DR  + LP  Q +L + 
Sbjct: 465 YAPGTNVTTCSQSHIDEAVSVATAADVAVLMLGIDESIEAESNDRKSIDLPECQHQLASA 524

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           +  A  KP ++VLL GG + I   K  +   +I+ AGYPG  G  A+A+ + G
Sbjct: 525 IF-AVGKPTVIVLLNGGMLAIENEK--QQADAIIEAGYPGFYGGTAIAQTLTG 574


>gi|160886913|ref|ZP_02067916.1| hypothetical protein BACOVA_04927 [Bacteroides ovatus ATCC 8483]
 gi|423288977|ref|ZP_17267828.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
           CL02T12C04]
 gi|423294866|ref|ZP_17272993.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
           CL03T12C18]
 gi|156107324|gb|EDO09069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
 gi|392668741|gb|EIY62235.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
           CL02T12C04]
 gi|392676057|gb|EIY69498.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
           CL03T12C18]
          Length = 863

 Score =  300 bits (767), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 170/439 (38%), Positives = 246/439 (56%), Gaps = 38/439 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +P+  T L   QRA DL+ RLTL+EK++ + N++PAIPRLGI  YEWW+EALHGVA  G 
Sbjct: 26  YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
                     AT FPQ I  AASF+  L Y +  A+  EARA        GQ     G+T
Sbjct: 86  ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
            W PN+NIFRDPRWGRGQET GEDP ++G+  ++ VRG+QG +     KL       AC 
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKL------HACA 189

Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           KHF  +    W    R+ F+A  +  +DL +TY P F+  V++     +MCAYNR  G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
            C    LL++  R  WGF G + +DC A+   +  + +   P+   A  D + +G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLEC 306

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G   +  T  AVK+  + E +I+ ++  L   R  LG  N   +  P+  I   V+  P 
Sbjct: 307 GGNFKSITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPK 362

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+ LAL+ A + +VLL+N++ +LPL  ++ + +A+IGPNAN +    GNY G    ++T 
Sbjct: 363 HKELALKMAHESLVLLQNNNNILPL--NRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420

Query: 452 LQALQNYVENT--VYYPGC 468
           L+ ++  + +   +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 54/105 (51%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           + AD V+   G+    E E +          DR ++ LP  Q+E++  + +  KK  + V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              G  + I      +N  +IL A YPG+AG  A+A+V+FGD+NP
Sbjct: 657 NFSGSAMAIV--PETQNCDAILQAWYPGQAGGTAVADVLFGDYNP 699


>gi|372209036|ref|ZP_09496838.1| glycoside hydrolase [Flavobacteriaceae bacterium S85]
          Length = 859

 Score =  300 bits (767), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 200/541 (36%), Positives = 295/541 (54%), Gaps = 43/541 (7%)

Query: 52  ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
           I  R  DL++ +TL+EKIS   +  P I RLGIP +EW+ EALHG+      I +N    
Sbjct: 32  IEDRVNDLLANMTLEEKISYCGSRIPEIKRLGIPYFEWYGEALHGI------ISWN---- 81

Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGR 171
             T FPQ I   A+++  L + +  AI  EARAL NAG+   M F +P +N+ RDPRWGR
Sbjct: 82  -CTQFPQNIAMGATWNPDLMFDVATAISNEARALKNAGKKEVMMF-SPTVNMARDPRWGR 139

Query: 172 GQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRY 231
             E   EDP +  + A  YVRG+QG   N  K    ++     KH+ A +++    T R 
Sbjct: 140 NGECYAEDPHLMSEMARMYVRGMQG---NDPKY---VKTVTTVKHYVANNVE----TKRE 189

Query: 232 KFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF 291
              + +  +DL + Y P +++C+    A+GIM A N +NGIP  A   L++   R +WGF
Sbjct: 190 WIHSNIGKKDLYEYYFPAYKTCIVDEEATGIMTALNGLNGIPCSAHDWLVNGVLRNEWGF 249

Query: 292 HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA--------AV 343
            GY+ +D  AV  +     YA S   A    +KAG+D  C  F  K  +A        A+
Sbjct: 250 KGYVIADWAAVQGLEKRMKYASSQAQAAAMAIKAGVDQEC--FRNKVRQAPMVQALPDAL 307

Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
           +Q  + E E+D  +  L  +R   G F+ +P++ P+  I   V+   AH+ LAL+AA+  
Sbjct: 308 QQGLITEKELDVTVKRLLRLRFMTGDFD-DPSLNPYSAIPTSVLECDAHKQLALKAAEQS 366

Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV 463
           IVLLKN   +LPL K    S+A+IGP A+  +  +G Y+G     ++PL  ++ Y    V
Sbjct: 367 IVLLKND-AVLPLKKDLK-SIAMIGPFAD--RCWMGIYSGHPKSKVSPLDGIKAYTNAKV 422

Query: 464 YYP-GCDTVACS--SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
            +  GC+  A       I +AV +AK ++ V+L++G D+T   E  DR  + LPG Q +L
Sbjct: 423 SFAQGCEVTAKEDDEQKIAEAVALAKKSEQVILVVGNDETTSTENTDRKSIKLPGNQHQL 482

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I +  +A  K VILVL+  GP  +T+ +  +NI  I+ A   G+    ALA+V+FGD NP
Sbjct: 483 I-KAVQAVNKNVILVLVPSGPTAVTWEQ--KNIPGIVCAWPNGQEQGTALAKVLFGDVNP 539

Query: 581 G 581
           G
Sbjct: 540 G 540


>gi|313205375|ref|YP_004044032.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312444691|gb|ADQ81047.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 858

 Score =  300 bits (767), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 183/463 (39%), Positives = 250/463 (53%), Gaps = 42/463 (9%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+    L    RA DL++RLTL EK + + N++PAIPRLGI AYEWW+EALHGV   
Sbjct: 20  QQLPYQNPKLSAEVRATDLLARLTLAEKAALMQNNSPAIPRLGIKAYEWWNEALHGVGRS 79

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIG 153
           G           AT FPQ I  AASF++ L +    A+  EARA  N         +  G
Sbjct: 80  GV----------ATVFPQAIGMAASFNNGLLFDAFTAVSDEARAKSNKFSEQGGLKRYQG 129

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           +T+W PN+NIFRDPRWGRGQET GEDP +T    V+ V+G+QG D     KL       A
Sbjct: 130 LTYWTPNVNIFRDPRWGRGQETYGEDPYLTSLMGVAVVKGLQGPDNAEYDKLH------A 183

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KHF  +    W    R+ F+A  +  +DL +TY P F++ V++     +MCAYNR   
Sbjct: 184 CAKHFAVHSGPEW---NRHSFNAENINPRDLWETYLPAFKALVQKADVKEVMCAYNRFED 240

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDV 329
            P C    LL++  R  W F G + SDC A+S  Y    +A  P+   A  + +  G D+
Sbjct: 241 EPCCGSNRLLTQILRNDWKFDGLVVSDCWAISDFYKPNAHATQPDATHAAANAVLNGTDL 300

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
            CGS   ++   AVK   + E  ID +L  L   R  LG  N +        I   VV S
Sbjct: 301 ECGSDF-RNLPEAVKAGLIEEKRIDVSLKRLLKARFELGEMNSDQVW----PISYSVVNS 355

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             HQ LAL+ A++ IVLL+N++ +LPL  SK + +A++GPNAN +    GNY G    ++
Sbjct: 356 EKHQNLALRMAEESIVLLQNNNNILPL--SKKLKIAVMGPNANDSVMQWGNYNGFPAHTV 413

Query: 450 TPLQALQNYV--ENTVYYPGCDT---VACSSASIDKAVDIAKG 487
           T L+A++        +Y PGCD    VA SS   + ++D  KG
Sbjct: 414 TLLEAMRKSFPGAQLIYEPGCDRTMDVAVSSLFQECSIDGKKG 456



 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 60/116 (51%), Gaps = 17/116 (14%)

Query: 475 SASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRV 524
           SASI K  D    AD VV   G+  + E EE+          DR D+ LP  Q+ L+  +
Sbjct: 586 SASIAKVKD----ADVVVFAGGIAPSLEGEEMRVTVPGFKGGDRTDIELPAIQRRLLQAL 641

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +A KK V+ V   G  + +      ++  +IL A YPG+AG  A+A V+ G++NP
Sbjct: 642 KDAGKK-VVFVNFSGSAMGLV--PETQSCEAILQAWYPGQAGGTAVANVLLGNYNP 694


>gi|224535242|ref|ZP_03675781.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224523140|gb|EEF92245.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 864

 Score =  299 bits (766), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 177/447 (39%), Positives = 255/447 (57%), Gaps = 33/447 (7%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
           N + +  P+    L  S+RA DL+ R+TL+EK+SQ+ N +PAI RLGIPAY+WW+EALHG
Sbjct: 16  NVTAQNEPYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHG 75

Query: 97  VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA----- 151
           VA  GK          AT FPQ I  AA+FD+   Y     +  EARA Y+  Q      
Sbjct: 76  VARAGK----------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERD 125

Query: 152 --IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
              G+TFW PNINI+RDPRWGRG ET GEDP +T    ++ V+G+QG    GG  K   +
Sbjct: 126 GYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG----GGTGKYD-K 180

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           A AC KH+  +    W    R+ FDA+ ++ +DL +TY   F++ VK+G+   +MCAYNR
Sbjct: 181 AHACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLSAFKTLVKEGKVKEVMCAYNR 237

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAG 326
             G P C+++ LL +  R  WG+   + SDC A+   Y    +   P    A  D + +G
Sbjct: 238 FEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSG 297

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
            D+ CG       + AV++  + E +I+ ++  L   R +LG+F+ +  +  + +I   V
Sbjct: 298 TDLECGGSYSSLNE-AVRKGLISEEKINESVFRLLRARFQLGMFD-DDALVSWSEIPYSV 355

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           V S  H   AL+ A+  +VLL N +  LPL KS    +A++GPNAN +  L  NY G   
Sbjct: 356 VESKEHVTKALEMARKSMVLLTNKNHTLPLSKSIR-KVAVLGPNANDSVMLWANYNGFPT 414

Query: 447 RSITPLQALQNYV-ENTVYYP-GCDTV 471
           +S+T L+ +++ + E TVYY  GCD V
Sbjct: 415 KSVTILEGIKSKLPEGTVYYEKGCDYV 441



 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 60/111 (54%), Gaps = 13/111 (11%)

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAK 529
           +  D A  AD ++ + GL  T E EE+          DR ++ LP  Q E++  + +  K
Sbjct: 592 EVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQAEMLKALKKTGK 651

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            PVI VL  G  + + +     N+ +IL A YPG+ G  A+A+V+FGD+NP
Sbjct: 652 -PVIFVLCSGSTLALPWEA--ENLDAILEAWYPGQQGGTAVADVLFGDYNP 699


>gi|397691073|ref|YP_006528327.1| beta-glucosidase [Melioribacter roseus P3M]
 gi|395812565|gb|AFN75314.1| beta-glucosidase [Melioribacter roseus P3M]
          Length = 923

 Score =  299 bits (766), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 173/426 (40%), Positives = 247/426 (57%), Gaps = 30/426 (7%)

Query: 48  TTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN 107
           TT    +R  DL+S +T +EKI QL N A +IPRLG+ AY +W+E+LHGV          
Sbjct: 109 TTGSYKERLNDLISLMTTEEKIKQLTNQADSIPRLGLRAYNYWNESLHGVLA-------- 160

Query: 108 GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDP 167
               GATSFPQ I   A++D  L  R+  A+  EARAL N     G+T+W+P INI RDP
Sbjct: 161 ---EGATSFPQAIALGATWDPRLVNRVATAVSDEARAL-NRLYGKGLTYWSPTINIARDP 216

Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           RWGR +E+  EDP +  +  V++++G+QGD          L+  A  KHF A    N + 
Sbjct: 217 RWGRNEESYSEDPYLLSRMGVAFIKGMQGDH------PYYLKTVATPKHFIA----NNEE 266

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R+   + V M++L + Y P F+S + + RA  IM AYN +N +PS A+  L++   RR
Sbjct: 267 ERRHTGSSDVDMRNLYEYYLPAFKSAIVEARAYSIMGAYNELNHVPSNANMFLMTDLLRR 326

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKK 347
           QWGF GY+ SDC A+  +     + K+  +AV   + AG D+NCG   ++  K A+ +  
Sbjct: 327 QWGFEGYVVSDCGAIHDMLYGHKFFKTGAEAVARSILAGCDLNCGQAYREFIKDALDEGL 386

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
           L E +ID AL  + S R RLG F+  P + P+  IG D + S  ++ LAL AA+  IVLL
Sbjct: 387 LREKDIDSALFRVLSARFRLGEFDP-PELVPYSSIGKDKLDSKENRRLALDAARKSIVLL 445

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN----TV 463
           KN+  +LP+ KSK  S+A+IGPNA  A+  LG Y+G     I+PL+ ++N  ++      
Sbjct: 446 KNND-ILPIDKSKIKSIAVIGPNAREAQ--LGIYSGFPNVLISPLEGIKNKADSLDIRVG 502

Query: 464 YYPGCD 469
           Y  GCD
Sbjct: 503 YVKGCD 508



 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 3/124 (2%)

Query: 458 YVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
           Y +N++      T        +KA  IA   D V+L++G+     +EELDR ++ LP  Q
Sbjct: 638 YFDNSLGATARLTWDLGQKDFEKAKKIAAENDLVILVLGITPGISQEELDRKEIELPSVQ 697

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
           +EL+ + AE     +++VL+ GGPV    A  ++   +I+   Y GE G  ALA+V+FGD
Sbjct: 698 RELVKQTAEVNPN-IVIVLVNGGPV--ALAGAEKYAKAIVENWYNGEFGGQALADVLFGD 754

Query: 578 HNPG 581
           +NPG
Sbjct: 755 YNPG 758


>gi|336275603|ref|XP_003352555.1| hypothetical protein SMAC_01389 [Sordaria macrospora k-hell]
 gi|380094444|emb|CCC07823.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 833

 Score =  299 bits (765), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 204/575 (35%), Positives = 284/575 (49%), Gaps = 80/575 (13%)

Query: 46  CKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIF 105
           C +T     RA  LV +LT+DEK+  LV+ +   PRLG+P Y WWSE LHGVAG   G+ 
Sbjct: 37  CDSTASAPDRAASLVEQLTIDEKLVNLVDQSKGAPRLGLPPYAWWSEGLHGVAG-SPGVV 95

Query: 106 FNGT---IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
           FN +      ATSF  VI   A+ D  L Y +G AI  EARA    G   G+ +W PNIN
Sbjct: 96  FNTSGYPFSYATSFANVITLGAALDDDLVYEVGTAISTEARAFAKFGFG-GLDYWTPNIN 154

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
            ++DPRWGRG ETPGEDPL    Y  + V G++G+    G ++   +  A CKHF AYDL
Sbjct: 155 PYKDPRWGRGAETPGEDPLRIKGYVKAMVAGLEGN----GTVR---KVIATCKHFAAYDL 207

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC------------------ 264
           + W+G TRY FDA V++QDL++ Y PPF+ C +  R   IMC                  
Sbjct: 208 ERWRGLTRYDFDAVVSLQDLSEYYLPPFQQCARDSRVGSIMCRYVSFFLPPFPSFPRLVT 267

Query: 265 --------------AYNRVNGIPSCADRNLLSKTARRQWGF---HGYITSDCDAV-SIIY 306
                         +YN +NG P+CA   L++   R  W +   + YITSDC+A+   + 
Sbjct: 268 RQSGNQVDIVDNFRSYNALNGTPACASTYLMTNILRDHWNWTNHNNYITSDCNAIQDFLP 327

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
           D   ++++P +A      AG D  C   G         A  Q  L ES ID AL  L+  
Sbjct: 328 DNHNFSQTPAEAAAAAYIAGTDTVCEVSGWPPYTDVVGAYNQSLLSESVIDTALRRLYEG 387

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
            +R G  +           G     SP     +             S   LPL  +   +
Sbjct: 388 LIRAGYLDH----------GRPASSSPDKAPFS-------------SPDFLPLDLTGK-T 423

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSAS--IDKA 481
           +ALIG  AN+ +T+ G Y+G       P+ A++  ++ + YY     V  + A      A
Sbjct: 424 VALIGHWANATRTIRGPYSGLPPFYHNPMYAVRQ-LKLSFYYANGPVVNSTDADTWTAAA 482

Query: 482 VDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGP 541
           +  A+ AD V+   G D T   E+LDR  +  P  Q  LI ++A+  K  V++ L  G  
Sbjct: 483 MLAAESADVVLYFGGTDTTVASEDLDRESIAWPKTQLTLIEKLAQVGKPMVVIQL--GDQ 540

Query: 542 VDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           VD T    ++NI SILW GYPG++G  A+ +V+ G
Sbjct: 541 VDDTPLLNNKNISSILWVGYPGQSGGTAVFDVLTG 575


>gi|298482082|ref|ZP_07000270.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298271639|gb|EFI13212.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 863

 Score =  298 bits (764), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 172/439 (39%), Positives = 245/439 (55%), Gaps = 38/439 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +P+  T L   QRA DL+ RLTL+EK++ + N++PAIPRLGI  YEWW+EALHGVA  G 
Sbjct: 26  YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
                     AT FPQ I  AASF+  L Y +  A+  EARA        GQ     G+T
Sbjct: 86  ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLT 135

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
            W PN+NIFRDPRWGRGQET GEDP ++G+  ++ VRG+QG +     KL       AC 
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAVVRGLQGPEDAEYDKL------HACA 189

Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           KHF  +    W    R+ F+A  +  +DL +TY P F+  V++     +MCAYNR  G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
            C    LL++  R  WGF G + +DC A+   +  + +   P+   A  D +  G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAVHASADAVLNGTDLEC 306

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G   +  T  AVK+  + E +I+ ++  L   R  LG    NPT  P+  I   V+  P 
Sbjct: 307 GGNFKSITD-AVKKGLISEEKINTSVKRLLKARFELGEM--NPT-HPWSNIPYSVIDCPK 362

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+ LAL+ A + +VLL+N + +LPL  ++ + +A+IGPNAN +    GNY G    ++T 
Sbjct: 363 HKELALKMAHESLVLLQNKNNILPL--NRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420

Query: 452 LQALQNYVENT--VYYPGC 468
           L+ ++  + +   +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439



 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 54/105 (51%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD V+   G+    E E +          DR ++ LP  Q+E++  + +  KK  + V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              G  + I      ++  +IL A YPG+AG  A+A+V+FGD+NP
Sbjct: 657 NFSGSAMAIV--PETQSCDAILQAWYPGQAGGTAVADVLFGDYNP 699


>gi|433679952|ref|ZP_20511614.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
 gi|430814928|emb|CCP42243.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
          Length = 909

 Score =  298 bits (764), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 174/427 (40%), Positives = 238/427 (55%), Gaps = 33/427 (7%)

Query: 62  RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
           ++T +EK++Q +N+APAIPRLG+PAYEWW+E LHG+A  G           AT FPQ I 
Sbjct: 67  KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------ATVFPQAIG 116

Query: 122 TAASFDSYLWYRIGQAIGLEARALYN-AG-------QAIGMTFWAPNINIFRDPRWGRGQ 173
            AA++++ L  ++G     EARA +N AG       +  G+T W+PNINIFRDPRWGRG 
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           ET GEDP +TG+ AV ++RG+QGD           +  A  KH   +   +     R+ F
Sbjct: 177 ETYGEDPYLTGQLAVGFIRGLQGDDLT------HPRTIATPKHLAVH---SGPEPGRHGF 227

Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
           D  V+  DL  TY P F + +  GRA  +MCAYN ++G P+CA   LL+   R  WGF G
Sbjct: 228 DVDVSPHDLEATYTPAFRAAIVDGRAGAVMCAYNSLHGTPACAADWLLNGRLRGDWGFTG 287

Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
           ++ SDCDAV  +     +      +    LKAG D+NCG +  +    A+ +    E+ +
Sbjct: 288 FVVSDCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEAVL 346

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
           D++L  LF+ R RLG         P+ ++GA  V S AH+ LALQAAQ  IVLL+N +  
Sbjct: 347 DQSLVRLFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQNRNAT 405

Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY--VENTVYYPGCDTV 471
           LPL     + LA+IGPNA++   L  NY G S   +TPL  L+      N  Y  G    
Sbjct: 406 LPL--RPGLRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAANLRYAQGAPLA 463

Query: 472 ACSSASI 478
           A  S  I
Sbjct: 464 AGVSGMI 470



 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 58/96 (60%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR DL LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 646 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 704

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK  ++  +I+ A YPG++G  A+A+V+ GD NPG
Sbjct: 705 WAK--QHADAIVAAWYPGQSGGTAIAQVLAGDVNPG 738


>gi|336415490|ref|ZP_08595829.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940369|gb|EGN02236.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
           3_8_47FAA]
          Length = 863

 Score =  298 bits (763), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 172/439 (39%), Positives = 245/439 (55%), Gaps = 38/439 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +P+  T L   QRA DL+ RLTL+EK++ + N++PAIPRLGI  YEWW+EALHGVA  G 
Sbjct: 26  YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
                     AT FPQ I  AASF+  L Y +  A+  EARA        GQ     G+T
Sbjct: 86  ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLT 135

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
            W PN+NIFRDPRWGRGQET GEDP ++G+  ++ VRG+QG +     KL       AC 
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKL------HACA 189

Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           KHF  +    W    R+ F+A  +  +DL +TY P F+  V++     +MCAYNR  G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
            C    LL++  R  WGF G + +DC A+   +  + +   P+   A  D +  G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLNGTDLEC 306

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G   +  T  AVK+  + E +I+ ++  L   R  LG    NPT  P+  I   V+  P 
Sbjct: 307 GGNFKSITD-AVKKGLISEEKINTSVKRLLKARFELGEM--NPT-HPWSNIPYSVINCPK 362

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+ LAL+ A + +VLL+N + +LPL  ++ + +A+IGPNAN +    GNY G    ++T 
Sbjct: 363 HKELALKMAHESLVLLQNKNNILPL--NRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420

Query: 452 LQALQNYVENT--VYYPGC 468
           L+ ++  + +   +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 54/105 (51%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD V+   G+    E E +          DR ++ LP  Q+E++  + +  KK  + V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              G  + I      ++  +IL A YPG+AG  A+A+V+FG++NP
Sbjct: 657 NFSGSAMAIV--PETQSCDAILQAWYPGQAGGTAVADVLFGNYNP 699


>gi|340369765|ref|XP_003383418.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 748

 Score =  298 bits (763), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 196/572 (34%), Positives = 288/572 (50%), Gaps = 55/572 (9%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEALH 95
           FPF   +LPI +R +D+V +L+LD+ + Q+ +        AP IP+  I  Y+W +E L 
Sbjct: 27  FPFRDPSLPIEERVKDIVDQLSLDQLVEQMAHGGAGSNGPAPGIPKFNIKPYQWGTECLS 86

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA-----LYNAGQ 150
           G    G           ATSFP  I  AASF+  L  ++  A   E RA     + N   
Sbjct: 87  GDVNAGD----------ATSFPMSIGMAASFNYDLLKQVSNATAYEVRAKNTAAVLNGSY 136

Query: 151 AI--GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
           A   G++ W+P +NI RDPRWGR QET GEDP ++G    ++V G+QGD          +
Sbjct: 137 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAFVTGLQGDD------PTYV 190

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
            A+A CKHF  +         R  FDA VTM D   T+ P F++CV+ G  S +MC+YNR
Sbjct: 191 IANAGCKHFDVHGGPEDTPLPRASFDANVTMIDWRMTFLPQFKACVEAGALS-LMCSYNR 249

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV------ 322
           +NG+P+CA++ LL+   R +W F GY+ SD  A+  I     YA    D V         
Sbjct: 250 INGVPACANKKLLTDILRNEWNFKGYVVSDQGALENIVTQHHYAP---DFVTAAADAANA 306

Query: 323 ---LKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
              L+ G     G  +  +   AV++  +    +  A+  LF VR +LG F+      P+
Sbjct: 307 GTCLEDGNSEGKGGNVFDNLDDAVEKGLVSVDTLKDAVSRLFYVRTKLGEFDPPDNNNPY 366

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG---LLPLPKSKSVSLALIGPNANSAKT 436
             I   ++ S  H  L++QAA + IVL+KN +     LPL         ++GP   +A T
Sbjct: 367 ANIPLSIIQSDEHIKLSIQAAMETIVLMKNDNDGSPFLPLAADDFKKACVVGPFIENADT 426

Query: 437 LLGNYAGPSCRS---ITPLQALQNY---VENTVYYPGC-DTVACSSASIDKAVDIAKGAD 489
           + G+Y+ P+  +   +TPL  ++      +   Y  GC D  AC      K     +G D
Sbjct: 427 MFGDYS-PTMMTDYIVTPLAGIKTTQIGSDLLNYEDGCTDGPACEIYDGYKVRTACEGVD 485

Query: 490 HVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA-KKPVILVLLCGGPVDITFAK 548
            V++  GL +  E E  D  D+ LPG Q  L+T    A+   P+IL+L    P+DI++AK
Sbjct: 486 LVIVTAGLSRYLEHEGHDISDIYLPGHQMSLLTDAESASGSAPIILLLFNANPLDISYAK 545

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +    +IL A YPG+   VA+A V+ G +NP
Sbjct: 546 SNPRFAAILEAYYPGQEAGVAIANVLTGSYNP 577


>gi|299148437|ref|ZP_07041499.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298513198|gb|EFI37085.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 863

 Score =  298 bits (763), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 172/439 (39%), Positives = 245/439 (55%), Gaps = 38/439 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +P+  T L   QRA DL+ RLTL+EK++ + N++PAIPRLGI  YEWW+EALHGVA  G 
Sbjct: 26  YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
                     AT FPQ I  AASF+  L Y +  A+  EARA        GQ     G+T
Sbjct: 86  ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLT 135

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
            W PN+NIFRDPRWGRGQET GEDP ++G+  ++ VRG+QG +     KL       AC 
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKL------HACA 189

Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           KHF  +    W    R+ F+A  +  +DL +TY P F+  V++     +MCAYNR  G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
            C    LL++  R  WGF G + +DC A+   +  + +   P+   A  D +  G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLNGTDLEC 306

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G   +  T  AVK+  + E +I+ ++  L   R  LG    NPT  P+  I   V+  P 
Sbjct: 307 GGNFKSITD-AVKKGLISEEKINTSVKRLLKARFELGEM--NPT-HPWSNIPYSVINCPK 362

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+ LAL+ A + +VLL+N + +LPL  ++ + +A+IGPNAN +    GNY G    ++T 
Sbjct: 363 HKELALKMAHESLVLLQNKNNILPL--NRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420

Query: 452 LQALQNYVENT--VYYPGC 468
           L+ ++  + +   +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 54/105 (51%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD V+   G+    E E +          DR ++ LP  Q+E++  + +  KK  + V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              G  + I      ++  +IL A YPG+AG  A+A+V+FGD+NP
Sbjct: 657 NFSGSAMAIV--PETQSCDAILQAWYPGQAGGTAVADVLFGDYNP 699


>gi|256393789|ref|YP_003115353.1| beta-glucosidase [Catenulispora acidiphila DSM 44928]
 gi|256360015|gb|ACU73512.1| Beta-glucosidase [Catenulispora acidiphila DSM 44928]
          Length = 1548

 Score =  298 bits (763), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 211/637 (33%), Positives = 303/637 (47%), Gaps = 102/637 (16%)

Query: 47  KTTLPI--------SQRARDLVSRLTLDEKISQL-VNSAPAIPRLGIPAYEWWSEALHGV 97
           KT LPI          RA DLVSR+TL EK+ QL  NS PAIPRLG+  Y +WSE  HGV
Sbjct: 58  KTPLPIYLDTHYSAQARAADLVSRMTLPEKVEQLSTNSGPAIPRLGVQQYTYWSEGQHGV 117

Query: 98  A--GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------- 147
              G  +    NG    +TSFP    +  S+D  L Y+   AI  EAR   +        
Sbjct: 118 NTLGANQDNGGNGGAVHSTSFPTNFASTMSWDPSLIYQETTAISDEARGFLDKSLFGVNQ 177

Query: 148 ------AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNG 201
                 A     +TFWAP +N+ RDPRWGR  E  GEDP +T   A ++V G +G+T  G
Sbjct: 178 NNLGPSAADYGSLTFWAPTVNMDRDPRWGRTDEAFGEDPYLTSTMAGAFVNGYEGNTPTG 237

Query: 202 GKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASG 261
               G L+ +A  KH+   D++      R    + V+  DL D Y   F S ++    SG
Sbjct: 238 QSKTGTLKVAATAKHYALNDVEQ----DRTGISSNVSDTDLHDYYTKQFASLIENAHVSG 293

Query: 262 IMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDA--EGYAKSPEDAV 319
           +M +YN +NG PS AD    ++ A+RQ+GF+GY+TSDC A+   Y +   G+  +P    
Sbjct: 294 LMTSYNAINGTPSVADTYTANQLAQRQFGFNGYVTSDCGAIGTAYQSFPSGHDWAPPGWT 353

Query: 320 VD--------------------------VLKAGMDVNC--GSFLQKHTKAAVKQKKLPES 351
            D                           L+AG D+NC  G        AA+    L E 
Sbjct: 354 TDGKSSTGTWTNTATGATVPAQAGGQAYALRAGTDLNCAGGENTYAQITAAISAGVLSEG 413

Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN-- 409
            ID AL  +F+VR+  G F+      P+  I    + SPAHQ LA + A + +VLLKN  
Sbjct: 414 VIDNALVKIFTVRVETGEFD-PAGSNPYTGITKAQIQSPAHQALATKVADNSLVLLKNQP 472

Query: 410 -----------------------SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
                                  +  LLPL  + +  + ++G  AN+    LGNY+    
Sbjct: 473 PAASGTSTTPPAASSAASSAAAAAKPLLPLSAAATAKIVIVGDMANAVT--LGNYSSDPA 530

Query: 447 RSITPLQALQNYVEN-----TVYYPGCDTVACSSASIDKA----VDIAKGADHVVLMMGL 497
             ++P+Q +   V       +V +  C T   +SA+   +     D+A GAD V++ +G 
Sbjct: 531 LKVSPVQGITAAVRKANPGASVTFDACGTSTTASAAASCSAQTLADVA-GADAVIVFVGT 589

Query: 498 DQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
           +Q    E  DR  + +PG    LI++VA      ++L +  GGPV I      ++  SI+
Sbjct: 590 NQQIADEGKDRTSIAMPGNYDSLISQVAAVGNPRMVLAVQSGGPVRID--DVQKDFASIV 647

Query: 558 WAGYPGEAGAVALAEVIFGDHNPG---EFTVIAVEMQ 591
           ++G+ GE+   ALA+V+FG  NP    +FT  A + Q
Sbjct: 648 FSGFNGESQGTALADVLFGAQNPDGHLDFTWYADDSQ 684


>gi|427385932|ref|ZP_18882239.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726971|gb|EKU89834.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
           12058]
          Length = 732

 Score =  297 bits (761), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 195/567 (34%), Positives = 302/567 (53%), Gaps = 61/567 (10%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVN---SAPAIPRLGIPAYEWWSEALHGVAGVG 101
           F    + +  R  DL+SRLTL++K +QL+N       +    I A + W++ LHGV    
Sbjct: 33  FLNQEMSMEARVADLMSRLTLEQK-AQLLNHRGKTVVVDGFSIRA-DQWNQCLHGVKWT- 89

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA--------GQAIG 153
                       T+FP  I   A++D+ L +R+   I  EARA+YN         G+  G
Sbjct: 90  ---------EPTTNFPTSIALGATWDTELIHRVATVISDEARAIYNGWKQDPEFRGEHKG 140

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           + + +P INI R+P WGR  E  GEDP  TG+  V+YV+G+QGD  +  KL      ++ 
Sbjct: 141 LIYRSPVINISRNPYWGRINEIFGEDPYHTGRMGVAYVKGLQGDDSHYLKL------AST 194

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+   +++      R K  A+V  + L + + P F+ C+ +G+A  +M +YN +NG+P
Sbjct: 195 LKHYAVNNVE----VDRMKLSAQVPERMLYEYWLPHFKDCIVEGKAQSVMASYNAINGVP 250

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK---SPEDAVVDVLKAGMDVN 330
           +  ++ LL+   + QWG  G++ SD   V  +   EG+ +   S E+AV   + AG D +
Sbjct: 251 NNINKLLLTDILKNQWGHEGFVVSDLGGVKTM--VEGHHQRQISCEEAVGRSIMAGCDFS 308

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
              + +K+   A+++  L E  ++ AL  +  VR RLG F+   ++ P+ +I  DV+   
Sbjct: 309 DAEY-EKYIPDALRKGYLTEERLNDALRRVLLVRFRLGEFDDFKSV-PYSRISPDVIGCK 366

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            H+ L+L+AA+  IVLLKN   LLP+ +S    +A+IGP A+      GNY G     +T
Sbjct: 367 EHRNLSLEAARKSIVLLKNEKKLLPIDRSIIKRVAVIGPYADLFNQ--GNYGGVPKDPVT 424

Query: 451 PLQALQNYVENTVYYPGCDTVACS----------------SASIDKAVDIAKGADHVVLM 494
           PLQ ++N V N V    C     +                 A + KAV++A+ +D V L 
Sbjct: 425 PLQGIKNAVGNNVEVLYCKGAQITPVKVRKGQPIPPRFDKEAEMKKAVEMARNSDVVFLF 484

Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
           +G     E E  DR  LVLPG Q EL+  V E  KK V++VL+  GPV +   K  +NI 
Sbjct: 485 VGTTADIEVEGRDRKTLVLPGNQNELVKAVYEVNKK-VVVVLMSAGPVAVPEVK--KNIP 541

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPG 581
           ++L A +PG+ G  A+A+V+FGD+NPG
Sbjct: 542 AVLQAWWPGDEGGNAIADVLFGDYNPG 568


>gi|28199699|ref|NP_780013.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
 gi|182682443|ref|YP_001830603.1| beta-glucosidase [Xylella fastidiosa M23]
 gi|417557804|ref|ZP_12208815.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
 gi|28057820|gb|AAO29662.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
 gi|182632553|gb|ACB93329.1| Beta-glucosidase [Xylella fastidiosa M23]
 gi|338179587|gb|EGO82522.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
          Length = 882

 Score =  297 bits (761), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 174/412 (42%), Positives = 237/412 (57%), Gaps = 31/412 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            Q A  LV+++T  EKI+Q +N+APAIPRLGIPAY+WWSE LHG+A  G           
Sbjct: 31  EQHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY---------- 80

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L   +G     EARA +N          +  G+T W+PNINIF
Sbjct: 81  ATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIF 140

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +T + AVS++RG+QGDT +        +  A  KHF  +   +
Sbjct: 141 RDPRWGRGMETYGEDPYLTSQLAVSFIRGLQGDTPD------HPRTIATPKHFAVH---S 191

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+  DL  TY P F + +  G A  +MCAYN ++G P+CA   LL+  
Sbjct: 192 GPEQGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTR 251

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF+G++ SDCDA+  +     + +    A    LK+G D+NCG+   +    A+ 
Sbjct: 252 LRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGNDLNCGN-TYRDLNQAIA 310

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           +  + ES +D+AL  LF+ R RLG         P+  IG   + +PAH+ LALQAA   +
Sbjct: 311 RGDIDESTLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSL 369

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           VLLKNS   LPLP     +LA++GP+A+S   L  NY G S   +TPL  L+
Sbjct: 370 VLLKNSGNTLPLP--PETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGLR 419



 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 64/121 (52%), Gaps = 13/121 (10%)

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQEL 520
           +A  +  + +A      AD +V  +GL    E EEL          DR  + LP  Q+ L
Sbjct: 594 LAPETPQLAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETL 653

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  V +   KP+I+VL+ G  V + +A++  +  +IL A YPG++G  A+A+ + GD NP
Sbjct: 654 LQHV-KTTGKPLIVVLMSGSAVALNWAQHHAD--AILAAWYPGQSGGTAIAQALAGDVNP 710

Query: 581 G 581
           G
Sbjct: 711 G 711


>gi|393781488|ref|ZP_10369683.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676551|gb|EIY69983.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
           CL02T12C01]
          Length = 850

 Score =  296 bits (759), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 178/440 (40%), Positives = 244/440 (55%), Gaps = 41/440 (9%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
            P+    L   QRA DL+ RLT++EKIS + N++P IPRLGI  YEWW+EALHGVA  G 
Sbjct: 14  LPYQNPDLTPEQRATDLLQRLTVEEKISLMQNNSPGIPRLGIRPYEWWNEALHGVARAGL 73

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEA----RALYNAGQAI---GMT 155
                     AT FPQ I  AASF+  L  ++  A+  EA    RA  + GQ     G+T
Sbjct: 74  ----------ATVFPQTIGMAASFNDSLVQKVFTAVSDEARAKNRAFNDQGQYKRYQGLT 123

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
            W PN+NIFRDPRWGRGQET GEDP +T +  V+ V+G+QG D+    KL       AC 
Sbjct: 124 MWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYDKLH------ACA 177

Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           KHF  +    W    R+ F+A  +  +DL +TY P F++ V++     +MCAYNR  G P
Sbjct: 178 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDP 234

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
            C    LL++  R +WGF+G + SDC A+S  + A+ +   P+   A  D + +G D+ C
Sbjct: 235 CCGSNRLLTQILRDEWGFNGIVVSDCGAISDFWGAKKHNTHPDAAHASADAVLSGTDLEC 294

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG-LFNGNPTMQPFGKIGADVVCSP 390
           GS  +K T  AVK   + E +ID ++  L   R  LG +   +P   P+      +V  P
Sbjct: 295 GSNYRKLTD-AVKAGIISEEQIDISVKRLLKARFELGEMEESHPWALPYS-----IVDCP 348

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            H+ LALQ A + + LL+N   +LPL K   V  A+IGPNAN +    GNY G    + T
Sbjct: 349 EHRHLALQIAHETMTLLQNKENILPLDKHAKV--AVIGPNANDSVMQWGNYNGTPSHTST 406

Query: 451 PLQALQNYV--ENTVYYPGC 468
            L AL++ +     +Y P C
Sbjct: 407 LLSALRSKLPAAQLIYEPVC 426



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/126 (30%), Positives = 62/126 (49%), Gaps = 13/126 (10%)

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLP 514
           Y   D V     +++  ++  K  + V+   G+    E EE+          DR D+ LP
Sbjct: 564 YFNFDLVEDIPLNMNATLEKLKDTEIVIFAGGISPLLEGEEMKVSAAGFKGGDRTDIELP 623

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
             Q+ ++  + +A KK VI V   G  + +T      N  +IL A YPG+ G  A+A+V+
Sbjct: 624 AVQRNVLAALKKAGKK-VIFVNFSGSAMALT--PETENCDAILQAWYPGQEGGTAVADVL 680

Query: 575 FGDHNP 580
           FGD+NP
Sbjct: 681 FGDYNP 686


>gi|294675412|ref|YP_003576028.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
 gi|294472176|gb|ADE81565.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
          Length = 875

 Score =  296 bits (758), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 169/430 (39%), Positives = 243/430 (56%), Gaps = 32/430 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
            P+    L  +QRA DL+SRLTLDEK+S +++++PAIPRLGIP ++WW+EALH   G+G+
Sbjct: 23  LPYQNANLSAAQRADDLLSRLTLDEKVSLMMDTSPAIPRLGIPQFQWWNEALH---GIGR 79

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MT 155
             F       AT FP  +  AAS+D  L +++  A+  EAR      +  G       ++
Sbjct: 80  NGF-------ATVFPITMAMAASWDDALLHQVFTAVSDEARVKAQQAKCTGDIKRYQSLS 132

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL--KGKLQASAC 213
           FW PNINIFRDPRWGRGQET GEDP +T K  ++ VRG+QG  +NG  L      +  AC
Sbjct: 133 FWTPNINIFRDPRWGRGQETYGEDPYLTAKMGLAVVRGLQGVGYNGEDLGVSKYRKLLAC 192

Query: 214 CKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
            KHF  +    W    R++F+   +  +DL +TY P F++ V++G+ + +MCAY R++G 
Sbjct: 193 AKHFAVHSGPEW---NRHEFNIENLPERDLWETYLPAFKALVQEGKVAEVMCAYQRIDGQ 249

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAV-SIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
             CA      +  R +WGF G ITSDC A+   +      +K   +A    + AG DV C
Sbjct: 250 ACCAQTRYEQQILRDEWGFDGLITSDCGAIRDFLPRWHNVSKDGAEASAKAVLAGTDVEC 309

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           GS   KH   AV++  + E++IDR+L  L   R  LG  + +  +  + KI   VV S A
Sbjct: 310 GSEY-KHLPEAVRRGDVKEADIDRSLRRLLIARFELGDMDSD-DLNAWTKIPETVVASQA 367

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPK------SKSVSLALIGPNANSAKTLLGNYAGPS 445
           H+ LAL+ A   IVLL+N   +LPL             + ++GPNAN +  + GNYAG  
Sbjct: 368 HKDLALKMALKSIVLLQNKIKVLPLGNPLNAGAGSDKDIVVMGPNANDSVMMWGNYAGYP 427

Query: 446 CRSITPLQAL 455
             ++T L  +
Sbjct: 428 THTVTALDGI 437


>gi|325299987|ref|YP_004259904.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
 gi|324319540|gb|ADY37431.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
          Length = 864

 Score =  296 bits (758), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 180/458 (39%), Positives = 250/458 (54%), Gaps = 39/458 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+    L   +RA DLV RLTL+EK S + N++PAIPRLGI AY+WW+EALHGV   
Sbjct: 23  QQLPYQNPKLTPEERANDLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGVGRA 82

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-------NAGQAIG 153
           G           AT FPQ I  AASFD  L Y++  A+  EARA Y       +  +  G
Sbjct: 83  GI----------ATVFPQTIGMAASFDDELLYQVFTAVSDEARAKYTQFRKEGDLKRYQG 132

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +TFW PN+NIFRDPRWGRGQET GEDP +T +  ++ VRG+QG          KL   AC
Sbjct: 133 LTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPE---DAPYDKLH--AC 187

Query: 214 CKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
            KHF  +    W    R++F+A  +  +DL +TY P F+  V++     +MCAYNR+ G 
Sbjct: 188 AKHFAVHSGPEW---NRHEFNAENIAPRDLWETYMPAFKDLVQKAHVKEVMCAYNRLEGE 244

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVD--VLKAGMDVN 330
           P C +  LL+   R +WG+ G + SDC A+S  +    +   P+ A      + +G D+ 
Sbjct: 245 PCCGNNRLLTHILRDEWGYQGIVVSDCGAISDFWRKGDHETHPDKAHASAGAVLSGTDLE 304

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
           CGS   K    AVK   + ES++D ++  L   R  LG  + +     +  I   VV   
Sbjct: 305 CGSNY-KSLPEAVKAGLIAESQLDISVKRLLKARFELGEMDKDVC---WDTIPYSVVDCQ 360

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
           AH+ LAL+ A++ IVLL+N + +LPL   K + +AL+GPNAN +    GNY G    + T
Sbjct: 361 AHKDLALRMARESIVLLQNRNNILPL--RKDMKIALVGPNANDSIMHWGNYNGFPSHTET 418

Query: 451 PLQALQNYV--ENTVYYPGCDT---VACSSASIDKAVD 483
             +AL+  +     +Y  GCD    VA  S     A+D
Sbjct: 419 LYEALKKRLPASQLIYEFGCDRTSPVALESVFAQCAID 456



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 57/114 (50%), Gaps = 13/114 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           ++    D  K AD ++   G+  T E EE+          DR  + LP  Q++L+  + +
Sbjct: 590 NLQATADKVKDADVILFAGGISPTLEGEEMPVDAEGFRGGDRTSIELPAIQRQLVGELKK 649

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             K P++ +   G  + +  A        ++ A YPG+AG  A+A+V+FGD+NP
Sbjct: 650 LGK-PIVFINYSGSAMGL--APESEICDGMIQAWYPGQAGGTAIADVLFGDYNP 700


>gi|424792251|ref|ZP_18218496.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422797157|gb|EKU25539.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 909

 Score =  296 bits (758), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 173/427 (40%), Positives = 237/427 (55%), Gaps = 33/427 (7%)

Query: 62  RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
           ++T +EK++Q +N+APAIPRLG+PAYEWW+E LHG+A  G           AT FPQ I 
Sbjct: 67  KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------ATVFPQAIG 116

Query: 122 TAASFDSYLWYRIGQAIGLEARALYN-AG-------QAIGMTFWAPNINIFRDPRWGRGQ 173
            AA++++ L  ++G     EARA +N AG       +  G+T W+PNINIFRDPRWGRG 
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           ET GEDP +TG+ AV ++ G+QGD           +  A  KH   +   +     R+ F
Sbjct: 177 ETYGEDPYLTGQLAVGFIHGLQGDDLT------HPRTIATPKHLAVH---SGPEPGRHGF 227

Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
           D  V+  DL  TY P F + +  GRA  +MCAYN ++G P+CA   LL+   R  WGF G
Sbjct: 228 DVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWGFTG 287

Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
           ++ SDCDAV  +     +      +    LKAG D+NCG +  +    A+ +    E+ +
Sbjct: 288 FVVSDCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEALL 346

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
           D++L  LF+ R RLG         P+ ++GA  V S AH+ LALQAAQ  IVLL+N +  
Sbjct: 347 DKSLVRLFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQNRNAT 405

Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY--VENTVYYPGCDTV 471
           LPL     + LA+IGPNA++   L  NY G S   +TPL  L+      N  Y  G    
Sbjct: 406 LPL--RPGLRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAANVRYAQGAPLA 463

Query: 472 ACSSASI 478
           A  S  I
Sbjct: 464 AGVSGMI 470



 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 58/96 (60%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR DL LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 646 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 704

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK  ++  +I+ A YPG++G  A+A+V+ GD NPG
Sbjct: 705 WAK--QHADAIVAAWYPGQSGGTAIAQVLAGDVNPG 738


>gi|402304900|ref|ZP_10823963.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400380686|gb|EJP33499.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 866

 Score =  296 bits (757), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 173/454 (38%), Positives = 250/454 (55%), Gaps = 36/454 (7%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
           FSC       + +P+    L   +RA DL SRLTL+EK   + NS+PAIPRLGIP +EWW
Sbjct: 14  FSC---VAGAQQYPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWW 70

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
           SEALHG+A  G           AT FPQ    AAS+D  L YR+  A   EA A  N  +
Sbjct: 71  SEALHGIARNGF----------ATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLAR 120

Query: 151 AI-------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG- 202
                    G++ W PNINIFRDPRWGRGQET GEDP +T +  ++ V G+QG  F    
Sbjct: 121 KSGDIKRYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDM 180

Query: 203 ----KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQG 257
               +     +  AC KH+  +    W    R+ FD  R+  +DL +TY P F+S V++G
Sbjct: 181 RPFTERPRYYKTLACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEG 237

Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY---AKS 314
               +MCAY R++G P C +   L +  R +WG++G + SDC A+S  Y  EG+    ++
Sbjct: 238 NVREVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFY-REGHHHVVET 296

Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
           P +A    ++AG DV CG+      + AV+Q  +    ID ++  L   R  +G F+ + 
Sbjct: 297 PAEASAMGVRAGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFD-SE 354

Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
            + P+   G +V+ S  H+ LAL  A++ + LL+N + LLPL K+  + +A++GPNAN +
Sbjct: 355 KLVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKN-GLRIAVMGPNANDS 413

Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC 468
             L GNY G    + T L+ +++ V    +  GC
Sbjct: 414 VMLWGNYTGYPISTTTILKGIRSKVPAARFVEGC 447



 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 13/106 (12%)

Query: 485 AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
           A  AD VV + G+    E EE+          DR  + LP  Q+E+I R+   A K V+ 
Sbjct: 608 AGDADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQREVI-RLLRQAGKLVVF 666

Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           V   GG V            ++L A Y GEAG  A+A+V+FGD+NP
Sbjct: 667 VNCSGGAV--ALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDYNP 710


>gi|315607027|ref|ZP_07882031.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315251081|gb|EFU31066.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 866

 Score =  296 bits (757), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 173/454 (38%), Positives = 250/454 (55%), Gaps = 36/454 (7%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
           FSC       + +P+    L   +RA DL SRLTL+EK   + NS+PAIPRLGIP +EWW
Sbjct: 14  FSC---VAGAQQYPYQNLQLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWW 70

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
           SEALHG+A  G           AT FPQ    AAS+D  L YR+  A   EA A  N  +
Sbjct: 71  SEALHGIARNGF----------ATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLAR 120

Query: 151 AI-------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG- 202
                    G++ W PNINIFRDPRWGRGQET GEDP +T +  ++ V G+QG  F    
Sbjct: 121 KSGDIKRYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDM 180

Query: 203 ----KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQG 257
               +     +  AC KH+  +    W    R+ FD  R+  +DL +TY P F+S V++G
Sbjct: 181 RPFTERPRYYKTLACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEG 237

Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY---AKS 314
               +MCAY R++G P C +   L +  R +WG++G + SDC A+S  Y  EG+    ++
Sbjct: 238 NVREVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFY-REGHHHVVET 296

Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
           P +A    ++AG DV CG+      + AV+Q  +    ID ++  L   R  +G F+ + 
Sbjct: 297 PAEASAMGVRAGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFD-SE 354

Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
            + P+   G +V+ S  H+ LAL  A++ + LL+N + LLPL K+  + +A++GPNAN +
Sbjct: 355 KLVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKN-GLRIAVMGPNANDS 413

Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC 468
             L GNY G    + T L+ +++ V    +  GC
Sbjct: 414 VMLWGNYTGYPISTTTILKGIRSKVPAARFVEGC 447



 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 13/106 (12%)

Query: 485 AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
           A  AD VV + G+    E EE+          DR  + LP  Q+E+I R+   A K V+ 
Sbjct: 608 AGDADVVVFVGGISPRLEGEEMKVDAPGFNGGDRTSIELPEAQREVI-RLLRQAGKLVVF 666

Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           V   GG V            ++L A Y GEAG  A+A+V+FGD+NP
Sbjct: 667 VNCSGGAV--ALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDYNP 710


>gi|390956994|ref|YP_006420751.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
 gi|390411912|gb|AFL87416.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
          Length = 742

 Score =  295 bits (756), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 190/554 (34%), Positives = 294/554 (53%), Gaps = 42/554 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   + PI  R  DL+ R TL EK  QL ++   +PRLG+P +  W++ LHGV       
Sbjct: 38  YRDMSRPIEDRITDLIKRFTLQEKAMQLNHTNRGVPRLGLPMWGGWNQTLHGV------- 90

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-----IGMTFWAP 159
               + +  T FP      A++D  L + +  A+  EARALYNA         G+ + +P
Sbjct: 91  ---WSKQPTTLFPIPTAMGATWDPELVHTVADAMSDEARALYNAHAEGPRTPHGLVYRSP 147

Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
            INI RDPRWGR QE   EDPL+TG+  V+YVRG+QGD          L+ +A  KHF  
Sbjct: 148 VINISRDPRWGRIQEVFSEDPLLTGRMGVAYVRGLQGDDLQ------HLKLAATVKHFAV 201

Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
            +++    + R   +A V  ++L + + P + + + +  A  +M +YN +NG+P   +  
Sbjct: 202 NNVE----SGRQHLNADVDERNLFEFWLPHWRAAIMEAHAQSVMSSYNAINGMPDAVNHW 257

Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYD---------AEGYAKSPEDAVVDVLKAGMDVN 330
           LL+   R++WGF G++T D  AV+++            + +++ P  A    ++AG D +
Sbjct: 258 LLTDVLRKKWGFDGFVTDDLGAVALLSGTRATNTSEPGQHFSEDPVVAAAAAIRAGNDSD 317

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
              F + +   AV++  L E ++D AL N+  V  RLG ++  P    + +IG DVV S 
Sbjct: 318 DVEF-ETNLPLAVQRGLLTEKDVDGALRNVLRVGFRLGAYD-PPQASKYSRIGMDVVRSQ 375

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR-SI 449
           AH+ L+ + A++ + LL N    LPL + +  S+A+IGP    A      Y  P+ + S+
Sbjct: 376 AHRDLSQRVAEESMTLLLNRRQFLPLQRDQVKSVAVIGPAGGEAYETGNYYGTPAVKTSV 435

Query: 450 TP-LQALQNYVENTVYYPGCDTVACS-SASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
           T  L+AL        Y  G   V  +    I++A ++A+ +D VVL +G +   E E  D
Sbjct: 436 TEGLRALLGSGVKVEYEKGAGYVDLADDKEIERAANLARKSDVVVLCLGTNLQVEAEGRD 495

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R DL LPG QQ L+  V  AA   V LVL+  GP+ +T+A    ++ +IL A YPGE G 
Sbjct: 496 RRDLNLPGAQQRLLEAVY-AANPKVALVLMNAGPLGVTWAH--DHVPAILSAWYPGELGG 552

Query: 568 VALAEVIFGDHNPG 581
            A+A  +FG +NPG
Sbjct: 553 AAIARTLFGLNNPG 566


>gi|256394020|ref|YP_003115584.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
 gi|256360246|gb|ACU73743.1| glycoside hydrolase family 3 domain protein [Catenulispora
           acidiphila DSM 44928]
          Length = 1212

 Score =  295 bits (756), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 207/596 (34%), Positives = 300/596 (50%), Gaps = 70/596 (11%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQL-VNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           + +T    ++RA DLVSR+TL EK++QL  NSAPAIPRLG+ +Y +WSE  HG+  +G  
Sbjct: 45  YLQTRYSFAERAADLVSRMTLPEKVAQLHTNSAPAIPRLGVQSYTYWSEGQHGINLLGAD 104

Query: 104 IFFNGTIRG--ATSFPQVILTAASFDSYLWYRIGQAIGLEAR-----ALYNAGQA-IG-- 153
               G   G  ATSFP  + +  S+D  L Y+   AI  E R     +L+   Q  IG  
Sbjct: 105 SNNGGAAGGPHATSFPTNLSSTMSWDPALVYQETTAISDEVRGELDKSLWGVAQNNIGPS 164

Query: 154 ------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
                 +T+WAP +N+ RDPRWGR  E  GEDP + GK A ++V G QG+T +G      
Sbjct: 165 ADDYGSLTYWAPTVNMDRDPRWGRTDEAFGEDPYLVGKMAGAFVAGYQGETIDGTPTSPY 224

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           L+ +A  KHF   + +N     R+   A  +  D+ D Y   F S V+    +G+M +YN
Sbjct: 225 LKVAATAKHFALNNNEN----DRHADSADASESDIRDYYTAQFRSLVEDSHVAGLMTSYN 280

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV----- 322
            +NG PS AD       A+R WGF GYITSDC AV  +  +  +  +P    V V     
Sbjct: 281 AINGTPSPADTYTTDALAQRTWGFDGYITSDCGAVGDVTASSSHDWAPPGWTVSVVNGTS 340

Query: 323 ---------------------LKAGMDVNC--GSFLQKHTKAAVKQKKLPESEIDRALHN 359
                                L+AG D NC  G     + +AA+K + L E  ID AL  
Sbjct: 341 TWTNTATGVQVPADAGGQAYALRAGTDANCTGGDATLGNIEAAIKAEILSEGVIDHALVQ 400

Query: 360 LFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVLALQAAQDGIVLLKN------SHG 412
           LF+VRM+ G F  +P  +  + +I    + SP HQ LA + A + +VLLKN      +  
Sbjct: 401 LFTVRMQTGEF--DPANKVAYTRITKAQIQSPEHQALAEKVAANSLVLLKNDPMPGSAAK 458

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE-----NTVYYPG 467
           +LP   +   ++ ++G  AN+    LG Y+G     +  +Q + + V+      TV +  
Sbjct: 459 VLPANPASLNNVVVVGDLANTVT--LGGYSGDPTLQVNAVQGITSAVKAANPNATVTFDA 516

Query: 468 CDTVACSSASIDKAVDIA---KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
           C T   ++A+   +       K AD VV+ +G D +   E  DR  L +PG    LI++V
Sbjct: 517 CGTSTTATAAASCSAATQAAIKTADLVVVFVGTDGSTAGESNDRASLAMPGNYDSLISQV 576

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           A       +L +   GPVDI   K D    +I+++ Y GE+   ALA+V+FG  NP
Sbjct: 577 AALGNPRTVLSMQTDGPVDIENVKGD--FPAIVYSAYNGESQGTALADVLFGKQNP 630


>gi|440733337|ref|ZP_20913088.1| beta-glucosidase [Xanthomonas translucens DAR61454]
 gi|440362904|gb|ELQ00083.1| beta-glucosidase [Xanthomonas translucens DAR61454]
          Length = 895

 Score =  295 bits (755), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 173/427 (40%), Positives = 237/427 (55%), Gaps = 33/427 (7%)

Query: 62  RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
           ++T +EK++Q +N+APAIPRLG+PAYEWW+E LHG+A  G           AT FPQ I 
Sbjct: 53  KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNGY----------ATVFPQAIG 102

Query: 122 TAASFDSYLWYRIGQAIGLEARALYN-AG-------QAIGMTFWAPNINIFRDPRWGRGQ 173
            AA++++ L  ++G     EARA +N AG       +  G+T W+PNINIFRDPRWGRG 
Sbjct: 103 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 162

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           ET GEDP +TG+ AV ++ G+QGD           +  A  KH   +   +     R+ F
Sbjct: 163 ETYGEDPYLTGQLAVGFIHGLQGDDLT------HPRTIATPKHLAVH---SGPEPGRHGF 213

Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
           D  V+  DL  TY P F + +  GRA  +MCAYN ++G P+CA   LL+   R  WGF G
Sbjct: 214 DVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWGFTG 273

Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
           ++ SDCDAV  +     +      +    LKAG D+NCG +  +    A+ +    E+ +
Sbjct: 274 FVVSDCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCG-YAYRDLGKAIARGDADEALL 332

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
           D++L  LF+ R RLG         P+ ++GA  V S AH+ LALQAAQ  IVLL+N +  
Sbjct: 333 DQSLVRLFAARYRLGELQPQ-RKDPYAQLGAKDVDSAAHRALALQAAQQSIVLLQNRNAT 391

Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY--VENTVYYPGCDTV 471
           LPL     + LA+IGPNA++   L  NY G S   +TPL  L+      N  Y  G    
Sbjct: 392 LPL--RPGLRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAANVRYAQGAPLA 449

Query: 472 ACSSASI 478
           A  S  I
Sbjct: 450 AGVSGMI 456



 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 58/96 (60%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EEL          DR DL LP  QQ L+ R A+A+ KP+++VL+ G  V + 
Sbjct: 632 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 690

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK  ++  +I+ A YPG++G  A+A+V+ GD NPG
Sbjct: 691 WAK--QHADAIVAAWYPGQSGGTAIAQVLAGDVNPG 724


>gi|256393466|ref|YP_003115030.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
 gi|256359692|gb|ACU73189.1| glycoside hydrolase family 3 domain protein [Catenulispora
           acidiphila DSM 44928]
          Length = 1343

 Score =  295 bits (755), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 205/597 (34%), Positives = 295/597 (49%), Gaps = 76/597 (12%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQL-VNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           +  T    ++RA DLVSR+TL EK +QL  NSAPAIPRLG+  Y +WSE  HGV  +G  
Sbjct: 49  YLDTHYSFAERAADLVSRMTLPEKAAQLQTNSAPAIPRLGVQEYTYWSEGQHGVNTLGAD 108

Query: 104 IFFNGTIRG---ATSFPQVILTAASFDSYLWYRIGQAIGLEAR-----ALYNAGQAI--- 152
               G + G   ATSFP       S+D  L Y+   A+  E R     +L+  GQ     
Sbjct: 109 -SNRGDVTGGVHATSFPVNFAATMSWDPALTYKETTAVSDEVRGFLDKSLWGTGQNNLGP 167

Query: 153 ------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
                  +TFWAPN+N+ RDP WGR  E+ GEDP +T   A ++V G QG +  G +   
Sbjct: 168 SASDYGALTFWAPNVNMDRDPLWGRTNESFGEDPYLTSTMAGAFVDGYQGQSMTGQQQTP 227

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            L+ +A  KH++  ++++    +R+   +  T  ++ D Y   F S V+    SGIM +Y
Sbjct: 228 YLKVAATAKHYSLNNIED----SRHTGSSDTTDANIRDYYTKQFASLVRDAHVSGIMTSY 283

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVD----- 321
           N VNG PS AD   + +  +  +GF GY TSDC A+  +Y A  +  +P     +     
Sbjct: 284 NAVNGTPSPADTYTVDELLQATYGFAGYTTSDCGAIGDVYGAASHGWAPPGWTSNGTSWT 343

Query: 322 -----------------VLKAGMDVNC--GSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
                             ++AG  +NC  G    ++  AA+    L    +D  L  LF+
Sbjct: 344 NNATGRQISAAAGGQAFAIRAGTQLNCAGGEMTAQNISAAIDLGLLSNGVVDATLTRLFT 403

Query: 363 VRMRLGLFNGNPTMQPFGKIG-----ADVVCSPAHQVLALQAAQDGIVLLKN------SH 411
           VRM  G F+      P GK+G      D + SPAHQ LA Q A + IVLL+N      S 
Sbjct: 404 VRMETGEFD------PAGKVGYTKITKDQIESPAHQALAEQVAANDIVLLQNGAVSGTSA 457

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE-----NTVYYP 466
            LLP+  +K+ S+ ++G  AN  K  LG Y+G     +  +Q +   V+      TV + 
Sbjct: 458 KLLPVDPAKTDSVVIVGDLAN--KVTLGGYSGEPTHEVNAVQGITAAVQAANPSATVTFD 515

Query: 467 GCDTVA--CSSASIDKAVDIA-KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITR 523
            C T     + AS   A   A K A  V+++ G D +   E  DR  L LPG    LI++
Sbjct: 516 ACGTGTQITTPASCSAATQAAIKSASLVLVVAGSDLSVADEANDRSTLALPGNYDSLISQ 575

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           V+        LV+   GP DI  A+ D    +I+++GY G++   ALA+V+FG  NP
Sbjct: 576 VSALGNPRTALVMQADGPYDIQDAQKD--FPAIVFSGYNGQSQGTALAQVLFGQQNP 630


>gi|164428543|ref|XP_964543.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
 gi|157072187|gb|EAA35307.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
          Length = 786

 Score =  295 bits (754), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 200/540 (37%), Positives = 279/540 (51%), Gaps = 54/540 (10%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGT---IRGATSFPQVILTAASFDSYLWYRIGQA 137
           RLG+P Y WWSE LHGVAG   G+ FN T      ATSF   I   ASFD  L Y +G A
Sbjct: 13  RLGLPKYAWWSEGLHGVAG-SPGVKFNTTGYPFSYATSFANAINLGASFDDDLVYEVGTA 71

Query: 138 IGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGD 197
           I  EARA  N G   G+ +W PN+N ++DPRWGRG ETPGEDPL    Y  + + G++G+
Sbjct: 72  ISTEARAFANFGFG-GLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAILAGLEGN 130

Query: 198 TFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQG 257
                  +   +  A CKH+ AYDL+ W G TRY+F+A VT+QDL++ Y PPF+ C +  
Sbjct: 131 -------ETVRKVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDS 183

Query: 258 RASGIMCAYNRV-----------------NGIPSCADRNLLSKTARRQWGF---HGYITS 297
           +   IMC+YN +                    P+CA   L++   R  W +   + YITS
Sbjct: 184 KVGSIMCSYNALTIRDMASGKPDEEINLTTAQPACAKPYLMT-ILRDHWNWTEHNNYITS 242

Query: 298 DCDAV-SIIYDAEGYAKSPEDAVVDVLKAGMDVNC---GSFLQKHTKAAVKQKKLPESEI 353
           DC+A+   + D   ++++P +A     KAG D  C   GS L      A  Q  LPE+ I
Sbjct: 243 DCNAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTD-VVGAYNQSLLPEAVI 301

Query: 354 DRALHNLFSVRMRLGLFN------------GNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           D AL  L+   +R G  +            G+ +   +  +  + V +P+ Q LAL++A 
Sbjct: 302 DTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQELALRSAT 361

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           +GIVLLKN+  LLPL  S    +ALIG  AN+  T+ G Y+G       PL A Q    +
Sbjct: 362 EGIVLLKNAGSLLPLDFSGK-KVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLS 420

Query: 462 TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMM-GLDQTQEKEELDRVDLVLPGRQQEL 520
             Y  G    A    +       A     VVL   G D T   E+LDR  +  P  Q +L
Sbjct: 421 FSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPETQMQL 480

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           ++ +A    KP++++ L G  VD +    + N+ SILW GYPG++G  A+ +V+ G   P
Sbjct: 481 LSELA-GLGKPLVVIQL-GDQVDDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAP 538


>gi|218186207|gb|EEC68634.1| hypothetical protein OsI_37026 [Oryza sativa Indica Group]
          Length = 1241

 Score =  295 bits (754), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 209/329 (63%), Gaps = 25/329 (7%)

Query: 136  QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ 195
            QA+  EARA+YN G+  G+T+W+PNIN+ RDPRWGR  ETPGEDP V G+YAV++VRG+Q
Sbjct: 916  QAVSTEARAMYNMGKG-GLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQ 974

Query: 196  G-----DTFNGGKLKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
                      GG    + L+ SACCKH+ AYDLD+W   TR++FDARV  +D+ +T+Q P
Sbjct: 975  DIPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRP 1034

Query: 250  FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD-A 308
            FE CV+ G  S +MC+YNRVNGIP+CAD  LLS+T RR WG HGYI SDCDAV ++ D A
Sbjct: 1035 FEMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNA 1094

Query: 309  EGYAKSPEDAVVDVLKAGMDVNCGS-------------FLQKHTKAAVKQKKLPESEIDR 355
                 +  +A    LKAG+D++CG              FL  +   AV + K+ ES+ID 
Sbjct: 1095 TWLGYTGAEASAAALKAGLDLDCGESWKNETDGHPLMDFLTTYGMEAVNKGKMRESDIDN 1154

Query: 356  ALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLP 415
            AL N +   MRLG F+    +  +  +G   +C+  H+ LAL  A+ GIVLLKN + LLP
Sbjct: 1155 ALTNQYMTLMRLGYFD---DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLP 1211

Query: 416  LPKSKSVSLALIGPNANSAKTLL-GNYAG 443
            L  +K   + + GP+  + + ++ G+Y G
Sbjct: 1212 LDANKVGFVNVRGPHVQAPEKIMDGDYTG 1240


>gi|393786770|ref|ZP_10374902.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
           CL02T12C05]
 gi|392658005|gb|EIY51635.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
           CL02T12C05]
          Length = 864

 Score =  294 bits (753), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 174/440 (39%), Positives = 241/440 (54%), Gaps = 41/440 (9%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
            P+    L   QRA DL+ RLT++EK+S + N++P I RLGI  YEWW+EALHGVA  G 
Sbjct: 28  LPYQDPNLTPEQRATDLLQRLTIEEKVSLMQNNSPGILRLGIKPYEWWNEALHGVARAGL 87

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA-------LYNAGQAIGMT 155
                     AT FPQ I  AASFD  L Y +  AI  EARA       L    +  G+T
Sbjct: 88  ----------ATVFPQTIGMAASFDDTLIYEVFNAISDEARAKNRHFNTLGQYKRYQGLT 137

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
            W PNINIFRDPRWGRGQET GEDP +T +  V+ V+G+QG D+    KL       AC 
Sbjct: 138 MWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPDSARYNKLH------ACA 191

Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           KHF  +    W    R+ F+A  +  +DL +TY P F++ V++     +MCAYNR  G P
Sbjct: 192 KHFAVHSGPEW---NRHSFNAENIIPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDP 248

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
            C    LL++  R +WGF G + SDC A+S  +  + +   P+   A  + +  G D+ C
Sbjct: 249 CCGSNRLLTQILRNEWGFKGIVVSDCGAISDFWGTKKHNTHPDAAHASAEAVLNGTDLEC 308

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG-LFNGNPTMQPFGKIGADVVCSP 390
           GS  +K T+ A+K   + E +I+ ++  L   R  LG + N +P   P+      +V SP
Sbjct: 309 GSNYRKLTE-AIKAGIISEKQINVSVKRLLKARFELGEMENIHPWTLPY-----SIVDSP 362

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            H+ LAL+ A + + LL+N   +LPL   K   +A+IGPNAN +    GNY G    + T
Sbjct: 363 KHRCLALKMAHETMTLLQNKGKVLPL--DKQARIAIIGPNANDSVMQWGNYNGTPSHTST 420

Query: 451 PLQALQNY--VENTVYYPGC 468
            L A +    + + +Y P C
Sbjct: 421 LLSAFRKRLPISHLIYEPVC 440



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 67/132 (50%), Gaps = 14/132 (10%)

Query: 459 VENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DR 508
           V++  Y+   D V     +I   ++  K  D ++   G+  + E EE+          DR
Sbjct: 573 VKDETYF-NFDLVEGLPLNISNTLEKLKDIDIIIFAGGISPSLEGEEMNVSATGFKGGDR 631

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
            D+  P  Q++++  + EA KK VILV   G  + +T     ++  +IL A YPGE G +
Sbjct: 632 TDIEFPAVQRKVLAALKEAGKK-VILVNFSGSAMALT--PETKSCDAILQAWYPGEEGGM 688

Query: 569 ALAEVIFGDHNP 580
           A+  V+FGD+NP
Sbjct: 689 AIVNVLFGDYNP 700


>gi|449489074|ref|XP_002195511.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Taeniopygia guttata]
          Length = 685

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 190/520 (36%), Positives = 280/520 (53%), Gaps = 41/520 (7%)

Query: 76  APAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG-ATSFPQVILTAASFDSYLWYRI 134
           AP IPRLGI  Y W +E L G          +G   G AT+FPQ +  AA+F   L YR+
Sbjct: 6   APPIPRLGIAPYNWNTECLRG----------DGEAPGWATAFPQALGLAAAFSPELIYRV 55

Query: 135 GQAIGLEARALYNAGQA-------IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
             A   E RA +N+  A        G++ ++P +NI R P WGR QET GEDP ++G+ A
Sbjct: 56  ANATATEVRAKHNSFAAAGRYSDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPFLSGELA 115

Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
            S+V+G+QG           ++ASA CKHF+ +     +    Y     V  +D   T+ 
Sbjct: 116 RSFVQGLQGPH------PRYVKASAGCKHFSVHG--GHENILLYLLT--VLERDWRMTFL 165

Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
           P F++CV+ G  S  MC+YNR+NG+P+CA++ LL+   R +WGF GY+ SD  AV +I  
Sbjct: 166 PQFQACVRAGSYS-FMCSYNRINGVPACANKKLLTDILRGEWGFDGYVVSDEGAVELIML 224

Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT----KAAVKQKKLPESEIDRALHNLFSV 363
              Y +S  +  V  + AG ++     ++ +       A+    +    +   +  LF  
Sbjct: 225 GHHYTRSFLETAVASVNAGCNLELSYGMRNNVFMRIPEALAMGNITLQMLRDRVRPLFYT 284

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
           RMRLG F+  P M P+  +   VV SP H+ L+L+AA    VLLKN  G LPL K++ +S
Sbjct: 285 RMRLGEFD-PPAMNPYSSLDLSVVQSPEHRNLSLEAAVKSFVLLKNVRGTLPL-KAQDLS 342

Query: 424 ---LALIGPNANSAKTLLGNYAG-PSCRSI-TPLQALQNYVENTVYYPGCDTVACSSASI 478
              LA++GP A++ + L G+YA  P  R I TP + L+    N  +  GC    C   S 
Sbjct: 343 SQHLAVVGPFADNPRVLFGDYAPVPEPRYIYTPRRGLEMLGANVSFAAGCSEPRCQRYSR 402

Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELIT-RVAEAAKKPVILVLL 537
            + V +   AD V++ +G     E E  DR DL LPG Q EL+   V  AA +PVIL+L 
Sbjct: 403 AELVKVVGAADVVLVCLGTGVDVETEAKDRSDLSLPGHQLELLQDAVQAAAGRPVILLLF 462

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
             GP+D+++A+    +G+IL   +P +A  +A+A V+ G+
Sbjct: 463 NAGPLDVSWAQAHDGVGAILACFFPAQATGLAIARVLLGE 502


>gi|393786911|ref|ZP_10375043.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
           CL02T12C05]
 gi|392658146|gb|EIY51776.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
           CL02T12C05]
          Length = 863

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 168/440 (38%), Positives = 242/440 (55%), Gaps = 35/440 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
            PF    LP+ +R  DLV RLTL EK+  + + + ++PRLGI  Y WW+EALHGV   G 
Sbjct: 22  LPFNNPDLPVEERVEDLVRRLTLHEKVLLMCDYSSSVPRLGIKQYNWWNEALHGVGRAGL 81

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMT 155
                     AT FPQ I  AA+FD     ++ + +  EARA Y+  +         G+T
Sbjct: 82  ----------ATVFPQAIGMAATFDDCAVKQVFECVSDEARAKYHHSENKDGSERYRGLT 131

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW PN+NIFRDPRWGRGQET GEDP +T +  ++ VRG+QG + +        +  AC K
Sbjct: 132 FWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPSESKYD-----KLHACAK 186

Query: 216 HFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           H+  +    W    R++FD   ++ +DL +TY P F++ V+QG    +MCAYNR  G P 
Sbjct: 187 HYALHSGPEW---NRHRFDVENISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEPC 243

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK---SPEDAVVDVLKAGMDVNC 331
           C    LL    R +WGF G + SDC A+S  Y  +G+ +   + E AV   +KAG D++C
Sbjct: 244 CGSNRLLYNILREEWGFDGLVVSDCGAISDFY-LKGHHETHSTKESAVAAAVKAGTDLDC 302

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G   Q   K AV++  + E +ID +L  L   R  LGL +    +  +  I   VV S  
Sbjct: 303 GVDYQSLEK-AVEKGIITEKQIDVSLSRLLKARFELGLMD-EEHLVSWSDIPYTVVDSEK 360

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+  AL+ A+  + LLKN +G LPL K     + +IGPNAN +  + GNY G    ++T 
Sbjct: 361 HRAKALEVARKSMTLLKNKNGTLPLSKHCG-KIVVIGPNANDSIMMWGNYNGFPSHTVTI 419

Query: 452 LQALQNYVE--NTVYYPGCD 469
           L+ + + ++    +Y  GC+
Sbjct: 420 LEGITHKLDAGQVIYDKGCE 439



 Score = 55.5 bits (132), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 59/103 (57%), Gaps = 13/103 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           A+ +V + G+    E EEL          DR  + LP  Q++L+  + +  K P+IL+L 
Sbjct: 600 AEAIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELYKTGK-PIILILC 658

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            G  + ++ A+ D    +I+ A YPG+AG  A+A+V+FGD+NP
Sbjct: 659 SGSAIGLS-AEVDL-ADAIIQAWYPGQAGGTAVADVLFGDYNP 699


>gi|170731072|ref|YP_001776505.1| beta-glucosidase [Xylella fastidiosa M12]
 gi|167965865|gb|ACA12875.1| Beta-glucosidase [Xylella fastidiosa M12]
          Length = 882

 Score =  292 bits (748), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 175/440 (39%), Positives = 244/440 (55%), Gaps = 39/440 (8%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            Q A  LV+++T  EKI+Q +N+APAIPRLGIPAY+WWSE LHG+A  G           
Sbjct: 31  EQHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNGY---------- 80

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L   +G     EARA +N          +  G+T W+PNINIF
Sbjct: 81  ATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIF 140

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +T + AVS++RG+QG+      +    +  A  KHF  +   +
Sbjct: 141 RDPRWGRGMETYGEDPYLTSQLAVSFIRGLQGN------IPDHPRTIATPKHFAVH---S 191

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+  DL  TY P F + +  G A  +MCAYN ++G P+CA   LL+  
Sbjct: 192 GPEPGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTR 251

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF+G++ SDCDA+  +     + +    A    LK+G D+NCG+   +    A+ 
Sbjct: 252 LRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKSGDDLNCGN-TYRDLNQAIA 310

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           +  + ES +D+AL  LF+ R RLG         P+  IG   + +PAH+ LALQAA   +
Sbjct: 311 RGDIDESTLDQALIRLFTARQRLGTLQPR-EHDPYAAIGIKHIDTPAHRALALQAAAQSL 369

Query: 405 VLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ-------- 456
           VLLKNS   LPL  +   +LA++GP+A+S   L  NY G S   +TPL  L+        
Sbjct: 370 VLLKNSGNTLPL--TPGTTLAVLGPDADSLTALEANYQGTSSTPVTPLIGLRTRFGTAKV 427

Query: 457 NYVENTVYYPGCDTVACSSA 476
           +Y +     PG  +    +A
Sbjct: 428 HYAQGASLAPGVPSTITETA 447



 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 64/121 (52%), Gaps = 13/121 (10%)

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQEL 520
           +A  +  + +A      AD +V  +GL    E EEL          DR  + LP  Q+ L
Sbjct: 594 LAPEAPQLAEAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETL 653

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  V +   KP+I+VL+ G  V + +A++  +  +IL A YPG++G  A+A+ + GD NP
Sbjct: 654 LQHV-KTTGKPLIVVLMSGSAVALNWAQHHAD--AILAAWYPGQSGGTAIAQALAGDVNP 710

Query: 581 G 581
           G
Sbjct: 711 G 711


>gi|334144838|ref|YP_004538047.1| beta-glucosidase [Novosphingobium sp. PP1Y]
 gi|333936721|emb|CCA90080.1| beta-glucosidase [Novosphingobium sp. PP1Y]
          Length = 889

 Score =  292 bits (747), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 168/411 (40%), Positives = 235/411 (57%), Gaps = 41/411 (9%)

Query: 51  PISQRAR----DLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFF 106
           P+  +AR    DLV+++TLDEK+ QL+N+APAIPRL IPAY WW+E+LHG  G       
Sbjct: 25  PVRAKARAMAADLVAKMTLDEKLGQLLNTAPAIPRLDIPAYNWWTESLHGALGS------ 78

Query: 107 NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWA 158
                  T+FP+ I  AA+FD+ L   +  AI  E R L+   +          G+  W+
Sbjct: 79  ----LPTTNFPEPIGLAATFDASLVKDVAGAISTEVRGLHALARKTGRMGRIGTGLDTWS 134

Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
           PNINIFRDPRWGRGQET GEDP +T +  VS+V G+QG   +           A  KHF 
Sbjct: 135 PNINIFRDPRWGRGQETYGEDPYLTARMGVSFVEGMQGPDPD------LPDVIATPKHFA 188

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
            +   N   +TR+  +  V+  DL DTY P F + + +GRA  +MCAYNRV+G P+CA +
Sbjct: 189 VH---NGPESTRHHANVFVSRHDLEDTYLPAFRAAIVEGRAGSVMCAYNRVDGQPACASQ 245

Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF---- 334
            LL +     WGF GY+ SDCDAV  I D   YA     AV   ++ G+D  C ++    
Sbjct: 246 ELLQEHLVDAWGFQGYVVSDCDAVKDISDNHKYAPDGAAAVAAAMRMGVDSECHTWTLSD 305

Query: 335 ---LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
              L    + A+++  +  S++DR L  LFS R+R G   G   +  F    AD V +PA
Sbjct: 306 TDGLTDRYREALERGLITVSDVDRTLIRLFSARLRNGDLPGVRKLSTFTSSAAD-VGTPA 364

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
           H  LAL+AA++ +VLLKN  G+LP  ++  + +A+IGP  ++ + L GNY+
Sbjct: 365 HGALALKAAEESLVLLKND-GILPF-QTAGMKVAVIGPFGDATRVLRGNYS 413



 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/122 (36%), Positives = 68/122 (55%), Gaps = 16/122 (13%)

Query: 472 ACSSASIDKAVDI---AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQ 518
           A    S D A D+   A+ AD +V ++GL    E EE           D+  L +P  QQ
Sbjct: 599 AWKRISTDPAGDMRRAAQAADVLVAVVGLTSDLEAEESPIEIPGFKGGDKTTLDIPADQQ 658

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           EL+ + A+A  KP+I+V + G P+++ +AK   N  +IL A YPG++G +A+A V+ G  
Sbjct: 659 ELLEQ-AKATGKPLIVVAMNGSPINLHWAK--ENADAILEAWYPGQSGGLAIANVLTGKA 715

Query: 579 NP 580
           NP
Sbjct: 716 NP 717


>gi|294146655|ref|YP_003559321.1| putative beta-glucosidase [Sphingobium japonicum UT26S]
 gi|292677072|dbj|BAI98589.1| putative beta-glucosidase [Sphingobium japonicum UT26S]
          Length = 874

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 179/439 (40%), Positives = 241/439 (54%), Gaps = 42/439 (9%)

Query: 55  RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
           +AR +V RLTLDEK++QL+N APAIPRLGIPAY WW+E+LHG  G              T
Sbjct: 36  KARAIVDRLTLDEKVAQLLNVAPAIPRLGIPAYNWWTESLHGAIGAVP----------TT 85

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRD 166
           +FP+ I  AA+FD+ L  ++  AIG E RAL+  G+          G+  W+PNINIFRD
Sbjct: 86  NFPEPIGLAATFDAPLIQQVAAAIGTEVRALHTLGRRTGHLGRIGTGLDTWSPNINIFRD 145

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRGQET GEDP +T +  V++V+G+QG   N   L       A  KHF  +   +  
Sbjct: 146 PRWGRGQETYGEDPYLTARIGVAFVQGMQG---NDPDLP---LVVATPKHFAVH---SGP 196

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
             +R+  +   T +DL DTY P F + + +G+A  IMCAYNRV+G P+C    LL    R
Sbjct: 197 EPSRHTDNIFATPRDLEDTYLPAFRAAIVEGKAGSIMCAYNRVDGQPACGSHMLLKDYLR 256

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF-------LQKHT 339
             WGF GY+ SDCDAV  IY+   YA      V   L+ G+D  C +        L    
Sbjct: 257 GAWGFKGYVVSDCDAVVDIYEHHKYAPDAATGVAVALRHGVDSECNNATLGGRPDLGDRY 316

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
           K ++ +  +   +ID AL  LFS R R G   G  + +P   I    + +P HQ LAL  
Sbjct: 317 KESLARGHIAMGDIDAALVRLFSARYRNGDLPGLSSRKP-NAIPPSAIGTPDHQALALST 375

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A+  +VLLKN  G+LPL     ++LA  GP A++ + L GNY+  S +S  P+   +   
Sbjct: 376 AEKSLVLLKND-GILPLRPGARIALA--GPLADATRVLRGNYS--SAKSAPPISVAEGLK 430

Query: 460 ENTVYYPGCDTVACSSASI 478
           +  V   G  TV   S SI
Sbjct: 431 Q--VMGAGAVTVIPFSPSI 447



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 52/96 (54%), Gaps = 13/96 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EE+          DR  L LP  Q+  + + A A  KP+++V + G  +D++
Sbjct: 611 GLTSDLEGEEMPVKVEGFDGGDRTTLDLPADQRAFLEK-ARALGKPLVIVAMNGSAIDLS 669

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +AK   N  +I+ A YPG++G +A+  V+ G  +PG
Sbjct: 670 WAK--ENAAAIVEAWYPGQSGGLAVGNVLSGRADPG 703


>gi|288927072|ref|ZP_06420962.1| beta-glucosidase [Prevotella buccae D17]
 gi|288336152|gb|EFC74543.1| beta-glucosidase [Prevotella buccae D17]
          Length = 866

 Score =  290 bits (743), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 171/454 (37%), Positives = 248/454 (54%), Gaps = 36/454 (7%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
           FSC       + +P+    L   +RA DL SRLTL+EK   + NS+PAIPRLGIP +EWW
Sbjct: 14  FSC---VAGAQQYPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWW 70

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
           SEALHG+A  G           AT FPQ    AAS+D  L Y +  A   EA A  N  +
Sbjct: 71  SEALHGIARNGF----------ATVFPQTTAMAASWDDELLYHVFCAASDEAVAKNNLAR 120

Query: 151 AI-------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG- 202
                    G++ W PNINIFRDPRWGRGQET GEDP +T +  ++ V G+QG  F    
Sbjct: 121 KSGDIKRYQGVSIWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDM 180

Query: 203 ----KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQG 257
               +     +  AC KH+  +    W    R+ FD  R+  +DL +TY P F+S V++G
Sbjct: 181 RPFTERPRYYKTLACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSLVQEG 237

Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGY---AKS 314
               +MCAY R++G P C +   L +  R +W ++G + SDC A+S  Y  EG+    ++
Sbjct: 238 NVREVMCAYQRIDGSPCCGNTRYLHQILRGEWEYNGLVVSDCGAISDFY-REGHHHVVET 296

Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
           P +A    ++AG DV CG+      + AV+Q  +    ID ++  L   R  +G F+ + 
Sbjct: 297 PAEASAMGVRAGTDVECGAVYATLPR-AVEQGLISREAIDTSVVRLLKARFEVGDFD-SE 354

Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
            + P+   G +V+ S  H+ LAL  A++ + LL+N + LLPL K+  + +A++GPNAN +
Sbjct: 355 KLVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKN-GLRIAVMGPNANDS 413

Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGC 468
             L GNY G    + T L+ +++ V    +  GC
Sbjct: 414 VMLWGNYTGYPISTTTILKGIRSKVPAARFVEGC 447



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 39/106 (36%), Positives = 53/106 (50%), Gaps = 13/106 (12%)

Query: 485 AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
           A  AD VV + G+    E EE+          DR  + LP  Q+E+I R+   A K V+ 
Sbjct: 608 AGDADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQREVI-RLLRQAGKLVVF 666

Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           V   GG V            ++L A Y GEAG  A+A+V+FGD+NP
Sbjct: 667 VNCSGGAV--ALVPETEACDAVLQAWYAGEAGGQAVADVLFGDYNP 710


>gi|315499711|ref|YP_004088514.1| beta-glucosidase [Asticcacaulis excentricus CB 48]
 gi|315417723|gb|ADU14363.1| Beta-glucosidase [Asticcacaulis excentricus CB 48]
          Length = 869

 Score =  290 bits (743), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 164/404 (40%), Positives = 240/404 (59%), Gaps = 33/404 (8%)

Query: 60  VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
           ++R+T+++K +Q+ N AP +P  G+ AYEWW+E LHGVA  G+          AT FPQ 
Sbjct: 40  IARMTVEQKAAQMQNRAPDLPSAGLTAYEWWNEGLHGVARAGE----------ATVFPQA 89

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAG-------QAIGMTFWAPNINIFRDPRWGRG 172
           I  AA+++  L  ++G  +  EARA +N+        +  G+T W+PNINIFRDPRWGRG
Sbjct: 90  IGLAATWNPALLKQVGDVVSTEARAKFNSTDPAGDHQRYYGLTLWSPNINIFRDPRWGRG 149

Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
           QET GEDP +T + A  +V G+QG      K+       A  KH   +         R+ 
Sbjct: 150 QETYGEDPFLTSRLAEGFVTGLQGPDPQHPKV------VASVKHLAVHSGPE---AGRHG 200

Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
           F A V+  DL  TY P F   V   +A  +MCAYN V G+P+CA   LL    R  WGF 
Sbjct: 201 FAASVSPYDLEMTYLPAFRYSVMTTKAQSVMCAYNAVGGVPACASDLLLKTYVREAWGFK 260

Query: 293 GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESE 352
           GY+ +DCDA+  +     Y  +  ++  + LKAG+D+NCG+      + AV++  +PES 
Sbjct: 261 GYVVTDCDAIYDMTRFHFYRLNDAESSAESLKAGVDLNCGNAYAALPE-AVQKGLIPESL 319

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
           +D++L+ L  VR RLG+ +G P+  P+ +I  + + +P  Q LALQAA+  +VLLKN +G
Sbjct: 320 MDQSLNRLLDVRKRLGI-DGAPS--PWARISPEAINTPQAQGLALQAAEQSLVLLKN-NG 375

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           +LPL   ++V  A+IGPNA++ +TL GNY G + + +TPL  L+
Sbjct: 376 VLPLKPGQTV--AVIGPNADTEETLRGNYNGIARQPVTPLTGLR 417



 Score = 55.1 bits (131), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 45/120 (37%), Positives = 63/120 (52%), Gaps = 13/120 (10%)

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQEL 520
           +A +   ID+AV  A+ AD +V  +GL    E EEL          DR DL LP R QE 
Sbjct: 583 IAPAQPQIDEAVKAAQAADAIVAFVGLSPDIEGEELQILVPGFDRGDRTDLGLP-RTQED 641

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + +  +A  KP+++VLL G  V + +A    +     W  YPGEAG  A+A  + G+ NP
Sbjct: 642 LLKAVKATGKPLVVVLLSGSAVALNWADAHADAVVAAW--YPGEAGGTAIARTLTGEANP 699


>gi|325103214|ref|YP_004272868.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
 gi|324972062|gb|ADY51046.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
           12145]
          Length = 866

 Score =  290 bits (742), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 165/424 (38%), Positives = 236/424 (55%), Gaps = 31/424 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +PF    LP  +R  DL+ RLT++EK+  + + +  I RLGI  Y WW+EALHGVA  G 
Sbjct: 24  YPFQDNRLPFDKRVDDLLQRLTVEEKVLLMQDVSRPIERLGIKQYNWWNEALHGVARAGL 83

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMT 155
                     AT FPQ I  AASFD    + +  A+  EARA +N        G+  G+T
Sbjct: 84  ----------ATVFPQPIGMAASFDRDALFNVFNAVSDEARAKHNYHLSQGSYGRYEGLT 133

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
            W P INIFRDPRWGRG ET GEDP +T    V  V+G+QG +   GK     +  AC K
Sbjct: 134 MWTPTINIFRDPRWGRGIETYGEDPYLTAVMGVQAVKGLQGPS--NGKYD---KLHACAK 188

Query: 216 HFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           HF  +    W    R+ FDA  +  +DL +TY P FE+ VK+ +   +MCAYNR  G P 
Sbjct: 189 HFAVHSGPEW---NRHSFDAANIKQRDLYETYLPAFEALVKEAKVQEVMCAYNRFEGDPC 245

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCG 332
           C    LL +  R++WGF G + +DC A++  +    +   P+   A    + +G D++CG
Sbjct: 246 CGSDRLLQQILRKKWGFEGIVVADCGAIADFFKENAHKTHPDAASASAAAVYSGTDLDCG 305

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
           S  +  T+ AVK+  + E +ID ++  L   R RLG  + + ++ P+ KI  +VV S AH
Sbjct: 306 SSYKALTE-AVKKGLIEEKDIDVSVRRLLMARFRLGEMD-DQSLVPWSKISYNVVASKAH 363

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
             +AL  A+  I LL+N + +LPL KS  + +A++GPNA  +    GNY G    +IT L
Sbjct: 364 NQIALDMARKSITLLQNKNNILPL-KSGGLKIAVMGPNAQDSVMQWGNYNGTPANTITIL 422

Query: 453 QALQ 456
           + ++
Sbjct: 423 EGIK 426



 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 65/117 (55%), Gaps = 13/117 (11%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
             A+I+K++    GAD VV + G+  + E EE+          DR D+ LP  Q++ +  
Sbjct: 589 EEANINKSIKNIAGADLVVFVGGISPSLEGEEMGVKLPGFRGGDRTDIQLPTIQRQFVKA 648

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + EA K+ VI +   G P+ +  A    N  +I+ A YPG+AG  A+A+V+FG +NP
Sbjct: 649 LKEAGKR-VIFINCSGSPIGL--ADEMANSEAIVQAWYPGQAGGQAVADVLFGKYNP 702


>gi|402307522|ref|ZP_10826545.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400378572|gb|EJP31427.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 858

 Score =  289 bits (739), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 176/463 (38%), Positives = 252/463 (54%), Gaps = 35/463 (7%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           + +  P+C   L   +RARDL+SRLTL+EK   +++ +PAIPRLGI  + WWSEALHG A
Sbjct: 18  TAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGAA 77

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG----- 153
            +G          G T FP+ +  AASF+  L  R+  A   E RA YN     G     
Sbjct: 78  NMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDEK 127

Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQ 209
              ++ W PN+NIFRDPRWGRGQET GEDP +T     + VRG+QG +T    KL     
Sbjct: 128 FHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETAKYRKL----- 182

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
             AC KH+  +    +   TR+  + A V+ +DL +TY P F++ V + +   +MCAY R
Sbjct: 183 -WACAKHYAVHSGPEY---TRHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQR 238

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           ++  P C++  LL +  R +WGF+  + SDC AV+ IY     +     A      AG D
Sbjct: 239 LDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTD 298

Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           V CG  +  K    AV++  + E+E+D+ +  L   R  LG  + +P +  + KI A V+
Sbjct: 299 VECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVM 357

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            S AH+ LAL  A+  +VLL+N  G+LPL K+    +A+IGPNA+    + GNY G   R
Sbjct: 358 DSKAHRQLALDMARQSLVLLQNKGGVLPL-KAGGEPIAVIGPNADDGPMMWGNYNGTPNR 416

Query: 448 SITPLQALQNYVENTVYYPGC---DTVACSSASIDKAVDIAKG 487
           ++T L  ++   +   Y  GC   DT   +S     A+D  KG
Sbjct: 417 TVTILDGIKARHKRVTYLKGCDLTDTKTVNSLLPQCAIDGRKG 459



 Score = 42.0 bits (97), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           +G   VV + G+    E EE+          DR ++ LP  Q++ +  + EA K   ++ 
Sbjct: 597 QGIRKVVFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK--TVVF 654

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + C G   I          +IL A Y G+ G  A+++V+FG  NP
Sbjct: 655 VNCSGSA-IALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNP 698


>gi|395492941|ref|ZP_10424520.1| glycoside hydrolase family protein [Sphingomonas sp. PAMC 26617]
          Length = 865

 Score =  288 bits (738), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 179/415 (43%), Positives = 240/415 (57%), Gaps = 34/415 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           PI  R  DL+ R+TL+EK +Q+ N APAIPRLGIP Y++W+EALHGVA  G+        
Sbjct: 20  PIEARVDDLMRRMTLEEKAAQMQNVAPAIPRLGIPPYDYWNEALHGVARAGE-------- 71

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINI 163
             AT FPQ I  AA++D  +    GQ +  E RA YN  QA        G+TFW+PNINI
Sbjct: 72  --ATVFPQAIGMAATWDRDMMLAEGQTVATEGRAKYNQAQAQKNYDRYYGLTFWSPNINI 129

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
           FRDPRWGRGQET GEDP +TG  AV +V GVQG   N       L+A A  KHF  +   
Sbjct: 130 FRDPRWGRGQETLGEDPYLTGTMAVPFVHGVQGTDAN------YLKAIATPKHFAVH--- 180

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           +     R++F+   + +DL++TY P F   +  GRA  +MCAYN V+   +CA+  LL  
Sbjct: 181 SGPEQLRHQFNVDPSPRDLSETYLPAFRRAIVDGRAESLMCAYNAVDTKAACANTMLLKD 240

Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK--A 341
           T R  WGF G++TSDC A+  I      + +  +     +KAG D  C  F  +      
Sbjct: 241 TLRGAWGFKGFVTSDCGAIDDITTGHHNSPTNPEGAALAVKAGTDTGC-DFKDEMLDLPR 299

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVLALQAA 400
           AVK   L E ++D AL  LF+ RM+LG+F  +P  + PF  I      SPAH+ LAL+AA
Sbjct: 300 AVKAGYLTEGDMDVALRRLFTARMKLGMF--DPAARVPFSTISIAENHSPAHRALALRAA 357

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           ++ IVLLKN  G+LPL  + +  +A++GP A S   L GNY G    ++ P+  +
Sbjct: 358 RESIVLLKND-GVLPL-AAGARRIAVVGPTAASLIALEGNYNGTPVGAVLPVDGM 410



 Score = 45.8 bits (107), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 47/95 (49%), Gaps = 13/95 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL+   E EE+          DR  + LP  Q +L+  +  A  KP+++VL  G    I 
Sbjct: 602 GLNAWLEGEEMPLQVPGFAGGDRTAIALPAAQSQLLDALF-ATGKPLVIVLQSGSA--IA 658

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
               +    ++L A YPGEAG  A+AEV+ G  NP
Sbjct: 659 LGAQEAKARAVLEAWYPGEAGGQAIAEVLSGTVNP 693


>gi|288925400|ref|ZP_06419334.1| beta-glucosidase [Prevotella buccae D17]
 gi|288337871|gb|EFC76223.1| beta-glucosidase [Prevotella buccae D17]
          Length = 858

 Score =  288 bits (738), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 176/463 (38%), Positives = 252/463 (54%), Gaps = 35/463 (7%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           + +  P+C   L   +RARDL+SRLTL+EK   +++ +PAIPRLGI  + WWSEALHG A
Sbjct: 18  TAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGAA 77

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG----- 153
            +G          G T FP+ +  AASF+  L  R+  A   E RA YN     G     
Sbjct: 78  NMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDEK 127

Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQ 209
              ++ W PN+NIFRDPRWGRGQET GEDP +T     + VRG+QG +T    KL     
Sbjct: 128 FHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETAKYRKL----- 182

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
             AC KH+  +    +   TR+  + A V+ +DL +TY P F++ V + +   +MCAY R
Sbjct: 183 -WACAKHYAVHSGPEY---TRHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQR 238

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           ++  P C++  LL +  R +WGF+  + SDC AV+ IY     +     A      AG D
Sbjct: 239 LDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTD 298

Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           V CG  +  K    AV++  + E+E+D+ +  L   R  LG  + +P +  + KI A V+
Sbjct: 299 VECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVM 357

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            S AH+ LAL  A+  +VLL+N  G+LPL K+    +A+IGPNA+    + GNY G   R
Sbjct: 358 DSKAHRQLALDMARQSLVLLQNKGGVLPL-KAGGEPIAVIGPNADDGPMMWGNYNGTPNR 416

Query: 448 SITPLQALQNYVENTVYYPGC---DTVACSSASIDKAVDIAKG 487
           ++T L  ++   +   Y  GC   DT   +S     A+D  KG
Sbjct: 417 TVTILNGIKVRHKRVTYLKGCDLTDTKTVNSLLPQCAIDGRKG 459



 Score = 42.0 bits (97), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           +G   VV + G+    E EE+          DR ++ LP  Q++ +  + EA K   ++ 
Sbjct: 597 QGIRKVVFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK--TVVF 654

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + C G   I          +IL A Y G+ G  A+++V+FG  NP
Sbjct: 655 VNCSGSA-IALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNP 698


>gi|371777646|ref|ZP_09483968.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 865

 Score =  288 bits (738), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 165/444 (37%), Positives = 245/444 (55%), Gaps = 32/444 (7%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  PF    L   +RA+DL+SRLT+ EK   L + + AIPRLGI  + WWSEALHG A  
Sbjct: 20  QVLPFQNPDLSSEERAKDLISRLTVQEKARLLCDQSEAIPRLGIKKFNWWSEALHGYAN- 78

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG------- 153
                 N ++   T FPQ I  AASF+  L + I  AI  EARA Y+  Q  G       
Sbjct: 79  ------NDSV---TVFPQPIGMAASFNEELVFEIFNAISDEARAKYHQAQRRGEENRRFL 129

Query: 154 -MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
            ++ W PN+NIFRDPRWGRGQET GEDP +T +  V  V+G+QG      K +  L   A
Sbjct: 130 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGP--EDAKYRKLL---A 184

Query: 213 CCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KH+T +    W   +R++ +   V+ ++  +TY P F++ V++     +MCAY+R++ 
Sbjct: 185 CAKHYTVHSGPEW---SRHELNINDVSPREFYETYMPAFKALVQKADVRQVMCAYHRLDD 241

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
            P C++  +L +  R +WG+   + +DC A+S  Y   G + +P  A    L AG D+ C
Sbjct: 242 EPCCSNTRILQRILRDEWGYEHMVVADCGAISDFYTTHGISSTPVHAAATGLLAGTDLEC 301

Query: 332 --GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
              ++  K    A+++  + E +IDR+L  +   R  LG  + N ++ P+ +I   V+  
Sbjct: 302 IWDNYHYKMLPEALEKDLITEKDIDRSLMRVLKGRFDLGEMDDN-SLVPWAQIPPSVLNC 360

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             H+ LA + AQ  IVLL+N + +LPL KS    +A++GPNA+    L GNY G   R+I
Sbjct: 361 EKHRQLAYKMAQQSIVLLQNKNKVLPLDKSSINKIAVVGPNADDEVVLWGNYNGTPIRTI 420

Query: 450 TPLQALQNYV--ENTVYYPGCDTV 471
           T L  +++ V  +N  Y    D V
Sbjct: 421 TVLDGIKSKVPHDNIFYDKAVDLV 444



 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 53/105 (50%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           +G + V+ + GL    E EE+          DR ++ LP  Q+  +  + EA K   ++ 
Sbjct: 601 EGIETVIFVGGLSGFLEGEEMPVSYPGFKGGDRTNIELPSVQRNCLKALKEAGK--TVIF 658

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + C G   I       +  +I+ A Y GE+G  A+A+V+FGD+NP
Sbjct: 659 VNCSGSA-IALEPETESCDAIIQAWYGGESGGQAIADVLFGDYNP 702


>gi|261880245|ref|ZP_06006672.1| beta-glucosidase [Prevotella bergensis DSM 17361]
 gi|270333079|gb|EFA43865.1| beta-glucosidase [Prevotella bergensis DSM 17361]
          Length = 854

 Score =  288 bits (736), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 169/440 (38%), Positives = 245/440 (55%), Gaps = 36/440 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           + FP+  T L   +RA DL SRLTL+EK   + N +PAIPRLGIP +EWWSEALH   G+
Sbjct: 21  QQFPYQNTDLSPKERAADLCSRLTLEEKSKIMQNGSPAIPRLGIPQFEWWSEALH---GI 77

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------- 152
           G+  F       AT FP  +  A+S+D  L  ++  A+  E R    A QA         
Sbjct: 78  GRNGF-------ATVFPITMGMASSWDDALLQKVFDAVSDEGRV--KAQQAKRSGTIKRY 128

Query: 153 -GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
            G++FW PNINIFRDPRWGRGQET GEDP +T +  ++ VRG+QG +    K +  L   
Sbjct: 129 QGLSFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPS--DSKYRKLL--- 183

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
           AC KHF  +    W    R+ F+   +  +DL +TY P F++ V+QG  + +MCAY R++
Sbjct: 184 ACAKHFAVHSGPEW---NRHTFNVEDLPERDLWETYLPAFKALVQQGDVAEVMCAYQRID 240

Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMD 328
           G P C +   L    R +W + G + SDC AV   +    +  SP+   A    + +G D
Sbjct: 241 GQPCCGNNRFLKSILRNEWNYQGMVVSDCWAVPDFWKKGHHEVSPDATHASAKAVLSGTD 300

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           V CGS    +   AV+   + E+++D ++  L   R  LG F+ +  + P+ KI   VV 
Sbjct: 301 VECGSDY-SNLPEAVRAGIIKEADVDVSVRRLLEARFALGDFDPD-ELVPWTKISESVVA 358

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           S AH+ LAL  A+  +VLL+N+  +LPL +S    + ++G NA  +  + GNY+G   ++
Sbjct: 359 SKAHKQLALDMARKSMVLLQNND-ILPLKRSGQ-KIVVVGANAIDSTMMWGNYSGYPTQT 416

Query: 449 ITPLQALQNYVENTVYYPGC 468
           +T LQ LQ   +   + PGC
Sbjct: 417 VTILQGLQTKSDQVTFIPGC 436



 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 57/104 (54%), Gaps = 13/104 (12%)

Query: 487 GADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVL 536
           GAD V+ + G+    E EE+          DR  + LP  Q+E+I  ++EA ++  I+ +
Sbjct: 598 GADVVIFVGGISPRLEGEEMEVSDPGFKGGDRTTIELPQAQREVIKALSEAGRR--IVFV 655

Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            C G   I      + + +IL A YPGE G  A+A+V+FGD+NP
Sbjct: 656 NCSGSA-IALTPESQRVDAILQAWYPGEQGGTAVADVLFGDYNP 698


>gi|393781363|ref|ZP_10369562.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676856|gb|EIY70278.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
           CL02T12C01]
          Length = 863

 Score =  287 bits (734), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 173/442 (39%), Positives = 241/442 (54%), Gaps = 39/442 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
            PF  + LP+ +RA+DL+ RLTL EK+  + + +  IPRLGI  Y WW+EALHGV   G 
Sbjct: 22  LPFNNSDLPVEERAQDLLQRLTLQEKVLLMCDYSSPIPRLGIKRYNWWNEALHGVGRAGL 81

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMT 155
                     AT FPQ I  AA+FD     +  + +  EARA Y+  +         G+T
Sbjct: 82  ----------ATVFPQAIGMAATFDDCAVRQAFECVSDEARAKYHHSENKEGSERYQGLT 131

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW PN+NIFRDPRWGRGQET GEDP +T +  ++ VRG+QG + +        +  AC K
Sbjct: 132 FWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGLAVVRGLQGPSESKYD-----KLHACAK 186

Query: 216 HFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           H+  +    W    R+ FD   ++ +DL +TY P F++ V+QG    +MCAYNR  G P 
Sbjct: 187 HYALHSGPEW---NRHSFDVDSISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEPC 243

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCG 332
           C    LL    R +WGF G + SDC A+S  Y    +   P  E AV   +KAG D++CG
Sbjct: 244 CGSNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHPTKEAAVAAAVKAGTDLDCG 303

Query: 333 ---SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
                LQK    AV++  + E +ID +L  L   R  LGL +    +  +  I   VV S
Sbjct: 304 VDYYALQK----AVEEGIITEKQIDVSLFRLLKARFELGLMD-EEHLVSWSDIPYTVVDS 358

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             H+  AL+ A+  + LLKN HG LPL K     +A+IGPNAN +  + GNY G    ++
Sbjct: 359 EKHREKALEMARKSMTLLKNDHGTLPLSKHCG-KIAVIGPNANDSVMMWGNYNGFPSHTV 417

Query: 450 TPLQALQNYV--ENTVYYPGCD 469
           T L+ + + +  E  +Y  GC+
Sbjct: 418 TILEGITHKLGAEQIIYDKGCE 439



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/103 (34%), Positives = 58/103 (56%), Gaps = 13/103 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           A+ +V + G+    E EEL          DR  + LP  Q++L+  + +  K PVIL+L 
Sbjct: 600 AEVIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELHKTGK-PVILILC 658

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            G  + ++ A+ D    +I+ A Y G+AG  A+A+V+FGD+NP
Sbjct: 659 SGSAIGLS-AEVDL-ADAIIQAWYLGQAGGTAVADVLFGDYNP 699


>gi|399025517|ref|ZP_10727513.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
 gi|398077894|gb|EJL68841.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
          Length = 875

 Score =  287 bits (734), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 167/457 (36%), Positives = 243/457 (53%), Gaps = 34/457 (7%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
           F C  S      +PF    LP+ QR  +L+  LT+DEKI  +++++ A+PRL IPAY WW
Sbjct: 11  FICAFSFGQNYKYPFRNPNLPVEQRIENLLGLLTVDEKIGMMMDNSKAVPRLEIPAYGWW 70

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN--- 147
           +EALHGVA  G           AT FPQ I  AA++D     +  + I  EARA YN   
Sbjct: 71  NEALHGVARAGT----------ATVFPQAIGMAAAWDVPEHLKTFEMISDEARAKYNKSF 120

Query: 148 -----AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
                 G+  G+TFW PNINIFRDPRWGRGQET GEDP +T    V+ V+G+QG+     
Sbjct: 121 DEASKTGRYEGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGND---- 176

Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
                 +  AC KHF  +    W    R+ ++A V+ +DL +TY P F+S V +G    +
Sbjct: 177 --PKYFKTHACAKHFAVHSGPEW---NRHSYNAEVSKRDLYETYLPAFKSLVLEGNVREV 231

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED--AVV 320
           MCAYN  +G P CA   LL++  R +W + G + SDC A++  Y  + +   P++     
Sbjct: 232 MCAYNAFDGQPCCASNTLLNEILRGKWKYDGMVVSDCWALADFYQEKYHGTHPDEKSTAA 291

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
           D LK   D+ CG       K ++    + E +ID ++  +      LG+ +   ++  + 
Sbjct: 292 DALKHSTDLECGDTYNNLNK-SLAGGLITEKDIDISMRRILKGWFELGMLDPKSSVL-WN 349

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           +I   VV S  H+  AL+ AQ  IVL+KN + +LP  K+    +A++GPNA+     LGN
Sbjct: 350 QIPYSVVDSDEHKKQALKMAQKSIVLMKNENNILPFNKNIK-KIAVVGPNADDEMMQLGN 408

Query: 441 YAGPSCRSITPLQALQNYVENT--VYYPGCDTVACSS 475
           Y G     +T L+ ++    NT  +Y  G +    SS
Sbjct: 409 YNGTPSSIVTILEGIKAKFPNTEIIYEKGSEVADPSS 445



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 61/115 (53%), Gaps = 16/115 (13%)

Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVA 525
           AS+ + V   K AD +V   GL  + E EE+          D+  + LP  Q+EL+  + 
Sbjct: 593 ASVKEKV---KDADVIVFAGGLSPSLEGEEMLVNAEGFKGGDKTSIELPKVQRELLAELR 649

Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  K PV+ VL  G  + +   + ++N   +L A Y G++G  A+A+V+ GD+NP
Sbjct: 650 KTGK-PVVFVLCTGSSLGL--EQDEKNYDVLLNAWYGGQSGGTAVADVLAGDYNP 701


>gi|270340289|ref|ZP_06007688.2| periplasmic beta-glucosidase [Prevotella bergensis DSM 17361]
 gi|270331970|gb|EFA42756.1| periplasmic beta-glucosidase [Prevotella bergensis DSM 17361]
          Length = 867

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 175/467 (37%), Positives = 250/467 (53%), Gaps = 40/467 (8%)

Query: 17  FTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA 76
           F SLL  V     PF   P+    +  P+    L   +RA DL  RLTL+EK S +++ +
Sbjct: 7   FVSLLMAV----LPFCNMPA----QQLPYQNPALSAHERAVDLCGRLTLEEKASLMLDDS 58

Query: 77  PAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQ 136
           PAIPRLGI  ++WWSEALHGVA +G            T FPQ I  AASF+  L Y++  
Sbjct: 59  PAIPRLGIKRFQWWSEALHGVANMGD----------VTVFPQPIGMAASFNDNLVYQVFN 108

Query: 137 AIGLEARALYNAGQAI--------GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAV 188
           A   E RA +N  QA         G++ W PN+NIFRDPRWGRGQET GEDP +T +   
Sbjct: 109 ATSDEMRAKWNELQASGKEVTRFHGLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGC 168

Query: 189 SYVRGVQG-DTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQ 247
           + VRG+QG +T    KL       AC KH+  +    W   T    D  V+ +DL +TY 
Sbjct: 169 AVVRGLQGPETAKYRKL------WACAKHYAIHSGPEWARHTDNVTD--VSPRDLWETYM 220

Query: 248 PPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYD 307
           P F+S V+      +MCAY R +  P C +  LL +  R +WGF   + SDC AVS  + 
Sbjct: 221 PAFKSLVQDANVREVMCAYQRWDDDPCCGNNRLLQRILRDEWGFKYLVVSDCGAVSDFWT 280

Query: 308 AEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMR 366
           +   + +  +A    + AG DV CG +++ K    AVK   + E+E+D  +  L   R  
Sbjct: 281 SHKSSSNARNAATKAVLAGTDVECGYNYVYKSIPEAVKYGAMTEAEMDEHVVRLLEGRFD 340

Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLAL 426
           LG  + N ++ P+  I +  +CS  H+ L+L  +   + LL+NS+ +LPL K K   +A+
Sbjct: 341 LGEMDDN-SIVPWSSIPSSALCSKKHRQLSLDMSLQTMTLLQNSNDVLPLNK-KEKKIAV 398

Query: 427 IGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTV 471
           IGPN ++   + GNY G    ++T L  +++ +     V + GCD V
Sbjct: 399 IGPNVDNEPMMWGNYNGTPRSTVTILDGIKSRLRKNQLVTFRGCDLV 445



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 53/110 (48%), Gaps = 13/110 (11%)

Query: 481 AVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKK 530
           A+   K  + VV + G+    E EE+          DR D+ LP  Q+E +  + +A K 
Sbjct: 597 AIKQLKDCETVVFVGGISPQLEGEEMPINVDGFKGGDRTDIELPKVQREFLKALKQAGKS 656

Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              +V +C     I      +   +IL A YPG+ G  A+A V+FG++NP
Sbjct: 657 ---VVFVCCSGSAIALTPETKTCDAILQAWYPGQEGGEAVARVLFGEYNP 703


>gi|281422547|ref|ZP_06253546.1| beta-glucosidase [Prevotella copri DSM 18205]
 gi|281403371|gb|EFB34051.1| beta-glucosidase [Prevotella copri DSM 18205]
          Length = 869

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 168/447 (37%), Positives = 241/447 (53%), Gaps = 31/447 (6%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
           N S +  P+    L   +RA DL  RLTL+EK S +++ +PAIPRLGI  ++WWSEALHG
Sbjct: 20  NASAQQLPYQNPALSAHERAVDLCGRLTLEEKASLMLDDSPAIPRLGIKRFQWWSEALHG 79

Query: 97  VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--- 153
           VA +G            T FP+ I  AASF+  + YR+  A   E RA +N  Q  G   
Sbjct: 80  VANMGD----------VTVFPEPIGMAASFNDRMVYRVFDATSDEMRAKWNELQQKGGDV 129

Query: 154 -----MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
                ++ W PN+NIFRDPRWGRGQET GEDP +T +   + VRG+QG      +     
Sbjct: 130 TRFHALSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVRGLQGPEDTKYR----- 184

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
           +  AC KH+  +    W   T    D  VT +DL +TY P F+S V+  +   +MCAY R
Sbjct: 185 KLWACAKHYAIHSGPEWARHTDNITD--VTPRDLWETYMPAFKSLVQDAKVREVMCAYQR 242

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
            +  P C +  LL +  R +WGF   + SDC AV+  ++    + +  +A    + AG D
Sbjct: 243 WDDEPCCGNTRLLQQILRDEWGFKYLVVSDCGAVTDFWENHKVSSNARNAAAKGVLAGTD 302

Query: 329 VNCG-SFLQKHTKAAVKQKKL-PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
           V CG +++ K    AVK   L  E E+D+ +  L   R  LG  + N  +  + KI   V
Sbjct: 303 VECGFNYIYKSVPEAVKYGALTEEEEVDKHVIRLLEGRFDLGEMDDNKIVS-WSKIPVSV 361

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           +CS AH+ L+L  A   + LL+N + +LPL K K   +A IGPN ++   + GNY G   
Sbjct: 362 LCSKAHRQLSLDMALQTMTLLQNKNEVLPLDK-KVKKIAFIGPNVDNEPMMWGNYNGTPR 420

Query: 447 RSITPLQALQNYVENT--VYYPGCDTV 471
           ++IT L  +++ ++    V + GCD V
Sbjct: 421 QTITILDGIKSRLKKNQVVTFKGCDLV 447



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K  + VV + G+    E EE+          DR ++ LP  Q+  +  + EA KK  ++ 
Sbjct: 604 KDCETVVFVGGISPQLEGEEMPIEISGFKGGDRTNIELPKVQRNFLKALKEAGKK--VVF 661

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + C G   I       +  +IL A YPG+ G  A+A V+FG++NP
Sbjct: 662 VNCSGSA-IALTPETESCDAILQAWYPGQEGGEAVARVLFGEYNP 705


>gi|315606832|ref|ZP_07881841.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315251497|gb|EFU31477.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 858

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 175/463 (37%), Positives = 250/463 (53%), Gaps = 35/463 (7%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           + +  P+C   L   +RARDL+SRLTL+EK   +++ +PAIPRLGI  + WWSEALHG A
Sbjct: 18  TAQLLPYCNPALSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWSEALHGAA 77

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG----- 153
            +G          G T FP+ +  AASF+  L  R+  A   E RA YN     G     
Sbjct: 78  NMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRMLNGGEDEK 127

Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQ 209
              ++ W PN+NIFRDPRWGRGQET GEDP +T     + VRG+QG +T    KL     
Sbjct: 128 FHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPETAKYRKL----- 182

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNR 268
             AC KH+  +    +   TR+  + A V+ +DL +TY P F++ V + +   +MCAY R
Sbjct: 183 -WACAKHYAVHSGPEY---TRHTANVADVSPRDLWETYLPAFKTLVTEAKVREVMCAYQR 238

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           ++  P C++  LL +  R +WGF+  + SDC AV+ IY     +     A      AG D
Sbjct: 239 LDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVHAAAKAAVAGTD 298

Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           V CG  +  K    AV++  + E+E+D+ +  L   R  LG  + +P +  + KI A V+
Sbjct: 299 VECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLVEWSKIPASVM 357

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            S AH+ LAL  A+  +VLL+N  G+LPL K+    + +IGPNA+    + GNY G   R
Sbjct: 358 DSKAHRQLALDMARQSLVLLQNKGGVLPL-KAGGDPITVIGPNADDGPMMWGNYNGTPNR 416

Query: 448 SITPLQALQNYVENTVYYPGC---DTVACSSASIDKAVDIAKG 487
           ++T L  ++       Y  GC   DT   +S     A+D  KG
Sbjct: 417 TVTILDGIKARHTRVTYLKGCDLTDTKTVNSLLPQCAIDGRKG 459



 Score = 42.0 bits (97), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           +G   VV + G+    E EE+          DR ++ LP  Q++ +  + EA K   ++ 
Sbjct: 597 QGIRKVVFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK--TVVF 654

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + C G   I          +IL A Y G+ G  A+++V+FG  NP
Sbjct: 655 VNCSGSA-IALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNP 698


>gi|94495010|ref|ZP_01301591.1| family 3 glycoside hydrolase [Sphingomonas sp. SKA58]
 gi|94425276|gb|EAT10296.1| family 3 glycoside hydrolase [Sphingomonas sp. SKA58]
          Length = 872

 Score =  286 bits (732), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 172/417 (41%), Positives = 227/417 (54%), Gaps = 48/417 (11%)

Query: 55  RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
           +AR +V +LTL EK  QL+N+APAIPRLGIPAY WW+E+LHG  G              T
Sbjct: 34  KARAIVRQLTLQEKTEQLLNTAPAIPRLGIPAYNWWTESLHGAIGPVP----------TT 83

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRD 166
           +FP+ I  AA+FD+ L + +   I  E +AL+  G+          G+  W+PNINIFRD
Sbjct: 84  NFPEPIGLAATFDAPLIHTVASVISTEVQALHTLGRQTGHLGRIGTGLDTWSPNINIFRD 143

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGRGQET GEDP +T +  V++++G+QGD  +           A  KHF  +   +  
Sbjct: 144 PRWGRGQETYGEDPYLTARMGVAFIQGMQGDNPD------LPDVVATPKHFAVH---SGP 194

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
             +R+  +   T +DL DTY P F + + +G+A  IMCAYNRV+G P+C    LL+   R
Sbjct: 195 EPSRHTDNIFATKRDLEDTYLPAFRAAIVEGKAGSIMCAYNRVDGEPACGSAMLLTDYLR 254

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF-------LQKHT 339
             WGF GY+ SDCDAV  IY    YA SP   V   L+ G+D  C +        L    
Sbjct: 255 NAWGFRGYVVSDCDAVVDIYAHHKYAPSPATGVSVALRRGVDSECNNATLSGQAGLGDRY 314

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNG----NPTMQPFGKIGADVVCSPAHQVL 395
           K A+    +   +ID AL  LFS R+R G   G     P   P   IG     +PAH  L
Sbjct: 315 KDALAANHISIGDIDSALVRLFSARLRNGDLPGLSARKPNSTPVSAIG-----TPAHGAL 369

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           AL AA   +VLLKN  G LPL     V + L GP A++ + L GNY+  S RS  P+
Sbjct: 370 ALDAAVKSLVLLKN-EGALPL--KPGVRIVLAGPLADATRVLRGNYS--STRSAPPI 421



 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 78/159 (49%), Gaps = 26/159 (16%)

Query: 446 CRSITPLQALQNYVENTVYYP----GCDTVACSSASIDKAV------DIAKG---ADHVV 492
            R   PL+ +   +     YP    G   VA + A   K V      D+A G   AD +V
Sbjct: 546 SRWAEPLKLVDVRLTKGERYPIRLEGQSGVAAAPAIFWKRVTDDLDRDLAAGVRDADVIV 605

Query: 493 LMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPV 542
            +MGL    E EE+          DR  L LP  Q   + + A+A  KP+ILV++ G  +
Sbjct: 606 AVMGLTSDLEGEEMPVKVEGFEGGDRTTLALPADQIAFLEK-AKATGKPLILVMMNGSAI 664

Query: 543 DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           D+ +AK   N  +IL A YPG++G +A+A V+ G  +PG
Sbjct: 665 DLGWAK--DNAAAILEAWYPGQSGGLAIANVLSGKADPG 701


>gi|399070782|ref|ZP_10749848.1| beta-glucosidase-like glycosyl hydrolase [Caulobacter sp. AP07]
 gi|398043993|gb|EJL36849.1| beta-glucosidase-like glycosyl hydrolase [Caulobacter sp. AP07]
          Length = 900

 Score =  286 bits (732), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 169/416 (40%), Positives = 234/416 (56%), Gaps = 41/416 (9%)

Query: 56  ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
           A  LV ++TLDEK+ QL+N+APAIPRLGIPAY WW+E+LHG  G              T+
Sbjct: 49  AAQLVGQMTLDEKVEQLLNTAPAIPRLGIPAYNWWTESLHGALGALP----------TTN 98

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--------MTFWAPNINIFRDP 167
           FP+ I  AA+FD  L + +  A+G E R L+   ++ G        +  W+PNINIFRDP
Sbjct: 99  FPEPIGLAATFDPALVHDVAGAVGSEVRGLHTLARSTGRMGRIGGGLDTWSPNINIFRDP 158

Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           RWGRGQET GEDP +T +  V++V+GVQG   +        Q  A  KHF  +   +   
Sbjct: 159 RWGRGQETYGEDPFLTARVGVAFVQGVQGPNPD------LPQVIATPKHFAVH---SGPE 209

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
           +TR+  +  V+  DL DTY P F + + +GRA  +MCAYNR++G P+CA   LL    R+
Sbjct: 210 STRHSANVFVSRHDLEDTYLPAFRAAITEGRAGSVMCAYNRIDGQPACASDLLLKDHLRQ 269

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF-------LQKHTK 340
            W F GY+ SDCDAV  I D   YA     AV   ++AG+D  C          L    +
Sbjct: 270 AWRFDGYVVSDCDAVKDISDHHRYAPDAAAAVAAAMRAGVDNECNGATLTDTDGLADRYR 329

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A+++  +  +++DRAL  LFS R+R G   G    +P   +    V +P H VLAL AA
Sbjct: 330 EALRRDLISVADVDRALVRLFSARLRTGDLPG--VRKPETPVALASVMTPEHAVLALAAA 387

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           +  +VLLKN  G+LPL       +A+IGP  ++ + L GNY+  S  S  P+  L+
Sbjct: 388 EKSLVLLKND-GVLPL--KADTRIAVIGPLGDATRVLRGNYS--SALSAPPISVLE 438



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 66/117 (56%), Gaps = 16/117 (13%)

Query: 477 SIDKAVDI---AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
           S D A D+   A  AD +V ++GL    E EE           D+  L +P  QQ L+ +
Sbjct: 615 SDDPAGDLKAAAAQADVLVAVVGLTSDLEAEESPVEIPGFKGGDKTTLDIPSDQQALLEQ 674

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            A+A  KP+++V + G P+++++AK   N  +IL A YPG++G +A+A V+ G  NP
Sbjct: 675 -AKALGKPLVVVTMNGSPLNLSWAK--DNASAILEAWYPGQSGGLAVANVLSGKTNP 728


>gi|393784569|ref|ZP_10372732.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665550|gb|EIY59074.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
           CL02T12C01]
          Length = 929

 Score =  285 bits (730), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 169/444 (38%), Positives = 248/444 (55%), Gaps = 33/444 (7%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +T PF   +L   +RA++LVS LTL+EKI+Q+ +   AIPRL I  Y +W+EA+HGVA  
Sbjct: 38  QTHPFQDESLSFHERAKNLVSLLTLEEKINQVGHQTLAIPRLNIKGYNYWNEAIHGVARS 97

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPN 160
           G           ATSFP     ++++D  L +    A   EAR +Y+  +  G+ +W P 
Sbjct: 98  GL----------ATSFPVSKAMSSTWDLPLIFDCAVATSDEAR-VYSNTKDKGLIYWCPT 146

Query: 161 INIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAY 220
           IN+ RDPRWGR +E  GEDP +TGK AV Y++G+QGD           +  A  KHF A 
Sbjct: 147 INMSRDPRWGRDEENYGEDPFLTGKIAVEYIKGMQGDD------PKYYKTIATAKHFAAN 200

Query: 221 DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNL 280
           + +  + +T    DAR    +L + Y P FE  VK+G    +M AYN +NGIP  A+  L
Sbjct: 201 NYEKGRHSTSSDMDAR----NLREYYLPAFEMAVKEGNVRSVMSAYNALNGIPCGANHEL 256

Query: 281 LSKTARRQWGFHGYITSDCDAVSIIYDA--EGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
           L    R +WGF+G++TSDC AV  +Y +    +  +  +A    +  G D+NCG+  Q +
Sbjct: 257 LIDILRTEWGFNGFVTSDCGAVDDVYQSNRHHFVNTAAEASAVSIVNGEDLNCGNTFQDY 316

Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
            K A+++  + E+++D AL  +F  R  +G F+ N +  P+  I  DV+    H+ LA +
Sbjct: 317 CKEAIEKGYMQEADLDTALVRVFEARFSVGEFD-NASNVPWRSISDDVLDCEEHRQLAYK 375

Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN- 457
           AAQ+ IVLLKN + +LPL K+KSV  A+IGP  N+    LG Y+G      TP   +   
Sbjct: 376 AAQEAIVLLKNDNNILPLDKTKSV--AVIGPFGNT--ITLGGYSGSPTALTTPFGGIAEK 431

Query: 458 --YVEN--TVYYPGCDTVACSSAS 477
             YV N  T+ +  C+  +  S S
Sbjct: 432 IGYVVNDGTIQFEDCEEQSVPSDS 455



 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/116 (37%), Positives = 67/116 (57%), Gaps = 3/116 (2%)

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
           Y GC     +  ++++A +IA  AD V+   G D T   E  DR +L LPG QQ+L+  V
Sbjct: 590 YKGCAVTGTAETNLERAKEIAAKADVVIFAAGTDLTVSDESHDRTNLNLPGDQQKLLEAV 649

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             +A   VIL+L     V I +AK   ++ +I+ A Y G+A   A+A+V++GD+NP
Sbjct: 650 Y-SANPNVILLLQTCSSVTINWAK--EHVPAIIEAWYGGQAQGKAIADVLYGDYNP 702


>gi|167524198|ref|XP_001746435.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775197|gb|EDQ88822.1| predicted protein [Monosiga brevicollis MX1]
          Length = 834

 Score =  285 bits (730), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 200/569 (35%), Positives = 281/569 (49%), Gaps = 50/569 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQL-------VNSAPAIPRLGIPAYEWWSEALH 95
           +PF    LP + R  DLV RLTL+EK+ QL       +  APA+ RLGI  + W SE   
Sbjct: 34  YPFRNPDLPWAARLDDLVGRLTLEEKLQQLQHGGAAQMTPAPAVERLGIGPFVWGSEC-- 91

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA---- 151
            V G+G     +G     T+FPQ +  AA+FD  L  R    I LE RA  N  +     
Sbjct: 92  -VTGLGT----DGNDPHGTAFPQPLGMAATFDPALLKRAAGTIALELRAQRNFDRENGVV 146

Query: 152 ---IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
               G++ W+P +NI R P WGR  ET GE P+++   A S+V G+QG+           
Sbjct: 147 KFHHGLSCWSPVVNINRHPLWGRNDETFGECPVLSSFMARSFVEGIQGNHTR------YY 200

Query: 209 QASACCKHFTAYD-LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
            A+A CKH   Y   DN     RY FDA V+  DL  T+   FE C   G   G MC+YN
Sbjct: 201 AAAAACKHLDVYGGPDNL----RYVFDADVSQADLTGTFLMAFEECAAAG-VMGYMCSYN 255

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
            + G+P+CA+   ++  AR QWGF GY+ SD  AV  I ++  Y  +     V  L AG 
Sbjct: 256 SIRGVPACANYRTMTFFAREQWGFEGYVVSDQGAVFRITESHNYTANQTLGAVAALNAGC 315

Query: 328 DVNCGSFLQK----HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
           D+      Q     +   A+  K    + ID ++  LF VRMRLG F+  P   P+  + 
Sbjct: 316 DMEDSDDAQHVAYYNLSLALDLKLTDMATIDASVSRLFYVRMRLGEFD-PPENDPWRSLN 374

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP-KSKSVSLALIGPNANSAKTLLGNYA 442
             +V SPAH  +A   A   IVLLKN +  LPL   +K+ S  L+GP A++A  ++G Y+
Sbjct: 375 MSIVSSPAHVEMARDVATASIVLLKNQNETLPLSAAAKNASYCLLGPFADNADLMMGKYS 434

Query: 443 GPSCRSITPL------QALQNYVENTV--YYPGCDTVACSSASIDKAVD-IAKGADHVVL 493
                ++T         ALQN  +     Y  GC    C           I +G D V+L
Sbjct: 435 PHGSTNVTVTYRAGLAAALQNASQTASFQYLEGCTGPFCDGLDTAAVTTFIQQGCDTVLL 494

Query: 494 MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEA--AKKPVILVLLCGGPVDITFAKYDR 551
            +G     E E LDR ++  PG Q  L+  V EA   K+ ++L++   GPVD+   + D 
Sbjct: 495 AVGTSYHVESESLDRSNMSFPGAQPTLVQTVLEALGTKQRLVLLVSTAGPVDLAALEQDT 554

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + +IL   Y G+    ALA+++ G+ +P
Sbjct: 555 RVAAILDLIYLGQTAGTALADILLGETSP 583


>gi|365121645|ref|ZP_09338561.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363645135|gb|EHL84409.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 868

 Score =  285 bits (730), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 169/454 (37%), Positives = 251/454 (55%), Gaps = 40/454 (8%)

Query: 31  FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWW 90
           FS    NP     P+    L   +RA DL++R+TL EK +Q+ N+   I RLG+  Y+WW
Sbjct: 17  FSFRAENP-----PYKNPELSPDERALDLLNRMTLKEKFAQMHNNTGGIERLGVRPYDWW 71

Query: 91  SEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ 150
           +EALHG+A  GK          AT FPQ I  AA+FD    Y +   +  E RA Y+  Q
Sbjct: 72  NEALHGIARAGK----------ATVFPQAIGLAATFDDTAVYEMFDMVSDEGRAKYHDFQ 121

Query: 151 AIGM-------TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
             GM       TFW PNINIFRDPRWGRG ET GEDP +T K  ++ V+G+QGD    G 
Sbjct: 122 RKGMYNGYKGLTFWTPNINIFRDPRWGRGMETYGEDPFLTTKMGLAVVKGLQGD----GT 177

Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGI 262
            K   +A AC KH+  +    W    R+ ++A  ++++DL +TY P F++ V +G+   +
Sbjct: 178 QKYD-KAHACAKHYAVHSGPEW---NRHSYNAENISIRDLRETYLPAFKALVTEGKVKEV 233

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDA---EGYAKSPEDAV 319
           MCAYNR  G P C+++ LL    + +WGF   I SDC A++  Y     E +A + + + 
Sbjct: 234 MCAYNRFEGEPCCSNKTLLINILKDEWGFDDVIVSDCGAIADFYTKGRHETHASAADASA 293

Query: 320 VDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             V+ +G D+ CG         A+++  + E++I+ ++  L   R  LG+F+ + ++  +
Sbjct: 294 DAVI-SGTDLECGGSYWA-LDEALEKGLITETKINESVFRLLRARFELGMFD-DDSLVSW 350

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
             I   VVC   H+  AL+ A+  +VLL N +  LPL KS    +A++GPNAN +  L  
Sbjct: 351 SSIPYSVVCCDKHKAKALEMARKSMVLLSNKNNTLPLSKSIK-KVAVMGPNANDSVMLWA 409

Query: 440 NYAGPSCRSITPLQALQNYVE--NTVYYPGCDTV 471
           NY G   RS+T L+ ++  +   + +Y  GCD V
Sbjct: 410 NYNGTPDRSVTILEGIKAKLPEGSVIYEKGCDYV 443



 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/118 (36%), Positives = 68/118 (57%), Gaps = 15/118 (12%)

Query: 475 SASID-KAV-DIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELIT 522
           S  ID KAV +  K AD ++ + G+  + E EE+          DR ++ LP  Q+ ++ 
Sbjct: 587 SRQIDYKAVAEKVKDADAIIFVGGISSSLEGEEMGVKYPGFRNGDRTNIDLPQVQKNMMK 646

Query: 523 RVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + E  K PVI VL  G  + +++   D+N+ +IL A YPG+ G  A+A+V+FGD+NP
Sbjct: 647 ALKETGK-PVIFVLCSGSTMALSWE--DKNMDAILQAWYPGQEGGTAVADVLFGDYNP 701


>gi|255690202|ref|ZP_05413877.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260624221|gb|EEX47092.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 853

 Score =  285 bits (729), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 167/424 (39%), Positives = 242/424 (57%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SRLT++EKIS L  ++P IPRLGI  Y   +EALHGV   G+        
Sbjct: 35  PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 86

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
              T FPQ I  AA+++  L  RI   I  EARA +N            +  +TFW+P +
Sbjct: 87  --FTVFPQAIGLAATWNPELQKRIATVISDEARARWNELDQGRNQKEQFSDVLTFWSPTV 144

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QGD  +       L+  +  KHF A  
Sbjct: 145 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPH------YLKIVSTPKHFAA-- 196

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P   +  LL
Sbjct: 197 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNNVPCTLNSWLL 254

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
            K  RR WGF GY+ SDC   S++ +A  Y K+ E A    +KAG+D+ CG     ++  
Sbjct: 255 QKVLRRDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDEYLL 314

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ    E++ID A +++ + RM+LGLF+G     P+ KI   V+ S  HQ +AL AA
Sbjct: 315 NAYKQYMASEADIDSAAYHVLTARMKLGLFDG-VERNPYAKISPSVIGSKEHQTVALNAA 373

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           ++ IVLLKN   +LPL   K  S+A++G NA   K   G+Y+G P    ++ LQ ++N V
Sbjct: 374 RECIVLLKNQKNMLPLNVKKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSILQGIKNRV 431

Query: 460 ENTV 463
            + V
Sbjct: 432 GDKV 435



 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 59/92 (64%), Gaps = 3/92 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + VV +MG++++ E+E  DR D+ LP  Q+E +  + +     +ILVL+ G  + + +  
Sbjct: 604 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IILVLVAGSSLAVNWE- 661

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + ++ +I+ A YPGE G  A+AEV+FGD+NP
Sbjct: 662 -NEHLPAIVNAWYPGEQGGTAVAEVLFGDYNP 692


>gi|313204104|ref|YP_004042761.1| beta-glucosidase [Paludibacter propionicigenes WB4]
 gi|312443420|gb|ADQ79776.1| Beta-glucosidase [Paludibacter propionicigenes WB4]
          Length = 871

 Score =  285 bits (728), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 168/444 (37%), Positives = 240/444 (54%), Gaps = 34/444 (7%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+    L   +RA+DL+SRLTL EK + + + + A+PRLGI  + WWSEALHG A  
Sbjct: 28  QQLPYKNPNLSSEERAKDLISRLTLSEKATLMCDISDAVPRLGIKKFNWWSEALHGFAN- 86

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM------ 154
                 NG +   T FP+ +  AASFD  L Y+I  A+  EARA YN     G+      
Sbjct: 87  ------NGNV---TVFPEPVGMAASFDDNLVYKIFDAVSDEARAKYNEATKKGLENARFL 137

Query: 155 --TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
             + W PN+NIFRDPRWGRGQET GEDP +T +  VS V+G+QG      K +  L   A
Sbjct: 138 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPA--DAKYRKLL---A 192

Query: 213 CCKHFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KHF  +    W   +R+  +   +  +DL +TY P F+S V++     +MCAY R++ 
Sbjct: 193 CAKHFAVHSGPEW---SRHSLNINNLDPRDLWETYLPAFKSLVQKSDVRQVMCAYQRLDD 249

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
            P C +  LL +  R +WG+   + +DC AVS  Y +   +     A    + AG DV C
Sbjct: 250 EPCCGNTQLLQRILRDEWGYKYMVVADCGAVSDFYTSHKVSSDAVHAASKGVWAGTDVEC 309

Query: 332 --GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
              + + K    AV +  + E+EI++ L N+   R  LG  + +  + P+ KI   VV +
Sbjct: 310 QWDNHIYKQLPDAVAKGLITEAEINKHLLNVLIGRFDLGEMDDD-ALVPWSKIPMSVVNN 368

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
             H+ LAL  A   + LL+N + +LPL KSK +  A++GPNAN    L GNY G   R+I
Sbjct: 369 EEHRKLALDMALKSMTLLQNKNNILPLSKSKKI--AVVGPNANDKPMLWGNYNGTPVRTI 426

Query: 450 TPLQALQNYV--ENTVYYPGCDTV 471
           T L  + + +     +Y  GCD V
Sbjct: 427 TILDGITSKLSANKVLYEKGCDLV 450



 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 42/74 (56%), Gaps = 3/74 (4%)

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR D+ LP  Q+  +  + EA K+  ++ + C G   I       +  +IL A Y GE+G
Sbjct: 638 DRTDIELPAVQRNCLKALKEAGKQ--VIFVNCSGSA-IALIPETESCDAILQAWYGGESG 694

Query: 567 AVALAEVIFGDHNP 580
             A+A+V+FGD+NP
Sbjct: 695 GQAVADVLFGDYNP 708


>gi|254295141|ref|YP_003061164.1| glycoside hydrolase [Hirschia baltica ATCC 49814]
 gi|254043672|gb|ACT60467.1| glycoside hydrolase family 3 domain protein [Hirschia baltica ATCC
           49814]
          Length = 897

 Score =  284 bits (727), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 185/500 (37%), Positives = 265/500 (53%), Gaps = 62/500 (12%)

Query: 20  LLTRVDSTQPPFSCDPSNPSTET-----------FPFCKTTLPISQRARDLVSRLTLDEK 68
           LL  + S     +C  S   TET           F F   +L   +RA DLVS +TL+EK
Sbjct: 8   LLGTIASLAFATACSSSQTDTETAQTTEEAKSSEFRFMDPSLSPKERALDLVSHMTLEEK 67

Query: 69  ISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDS 128
            +Q+ + A AIPRLG+  Y WW+EALHGVA  G           AT FPQ I  AA++D 
Sbjct: 68  AAQMYDKAAAIPRLGLHEYNWWNEALHGVARAGH----------ATVFPQAIGMAATWDE 117

Query: 129 YLWYRIGQAIGLEARALYN--AGQAI-----GMTFWAPNINIFRDPRWGRGQETPGEDPL 181
            L   +   I  E RA ++  A + +     G+TFW+PNINIFRDPRWGRGQET GEDP 
Sbjct: 118 DLMLEVANVISDEGRAKHHFYANEDVYAMYGGLTFWSPNINIFRDPRWGRGQETYGEDPY 177

Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQD 241
           +TG+ AV+++ G+QGD           ++ A  KH+  +   +    +R++ +   T  D
Sbjct: 178 LTGRMAVNFINGLQGDD------DKYFKSVATVKHYAVH---SGPEPSRHRDNYIATDAD 228

Query: 242 LADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDA 301
           L +TY P F++   +   + +MCAYN V G P+C    L+    R + GF GY+ SDC A
Sbjct: 229 LYETYLPAFKTAFDETEVASVMCAYNAVWGDPACGSERLMKDLLREELGFDGYVVSDCGA 288

Query: 302 VSIIY-----DAEGYAKSPEDAVVDV--------LKAGMDVNCGSFLQKHTKA---AVKQ 345
           +   Y      AEG A       VD         +  G D+NCG        A   AVK+
Sbjct: 289 IGDFYYDEEKKAEGTAPYAAHDHVDTRAQAAALSVNMGTDLNCGDGEGNKMDALPQAVKE 348

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E  ID+++  L+S   +LG+++ +P++ P+  I  D V SP+H   + +AA+  +V
Sbjct: 349 GLITEETIDQSVVRLYSALFKLGMYD-DPSLVPWSNISIDTVASPSHLEKSEEAARASLV 407

Query: 406 LLKNSHGLLPL-PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENT 462
           LLKN  G+LPL P +K   +A+IGPNA++  TL+ NY G     +T L+ ++  +  EN 
Sbjct: 408 LLKND-GILPLKPDTK---VAVIGPNADNWWTLVANYYGQPTAPVTALKGIKAKIGAENV 463

Query: 463 VYYPGCDTVACSSASIDKAV 482
            Y  G  T+A    S  KAV
Sbjct: 464 SYSVGS-TIAGDIYSNYKAV 482



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 53/100 (53%), Gaps = 13/100 (13%)

Query: 491 VVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
           ++   G+D   E EE+          DR  + LP  Q++L+  +  A  KPV+LV   G 
Sbjct: 633 ILFFGGIDANLEGEEMGVELDGFLGGDRTHINLPAPQEKLLKEL-HATGKPVVLVNFSGS 691

Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + + +   D N+ +I+ A YPGE    A+A++++G+ +P
Sbjct: 692 AMALNWE--DENLPAIVQAFYPGEKSGTAIADLLWGEFSP 729


>gi|300777563|ref|ZP_07087421.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
 gi|300503073|gb|EFK34213.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
          Length = 896

 Score =  284 bits (727), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 160/447 (35%), Positives = 243/447 (54%), Gaps = 34/447 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +PF    LP+++R  +L++ LT +EKI  +++++ A+PRL IPAY WW+EALHGVA  G 
Sbjct: 44  YPFRNPDLPVNERIENLLTLLTTEEKIGMMMDNSQAVPRLEIPAYGWWNEALHGVARAGI 103

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN--------AGQAIGM 154
                     AT FPQ I  AA++D    ++  + I  EARA YN         G+  G+
Sbjct: 104 ----------ATVFPQAIGMAATWDVPEHFKTFEMISDEARAKYNRSFDEALKTGRYEGL 153

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           TFW PNINIFRDPRWGRGQET GEDP +T    V+ V+G+QG   N  K     +  AC 
Sbjct: 154 TFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQG---NDPKF---FKTHACA 207

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KHF  +    W    R+ ++A ++ +DL +TY P F++ V++G    +MCAYN  +G P 
Sbjct: 208 KHFAVHSGPEW---NRHSYNAEISKRDLYETYLPAFKALVQEGNVREVMCAYNAFDGQPC 264

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV--DVLKAGMDVNCG 332
           CA+  LL++  R +W + G + SDC A++  +  + +   P++     D LK   D+ CG
Sbjct: 265 CANNTLLTEILRGKWKYDGMVVSDCWALADFFQKKYHGTHPDEKTTAADALKHSTDLECG 324

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
                  K ++    + E +ID ++  +      LG+ +   ++  +  I   VV S  H
Sbjct: 325 DTYNNLNK-SLASGLITEKDIDESMRRILKGWFELGMLDPKSSVH-WNTIPYSVVDSEEH 382

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           +  AL+ AQ  IVL+KN   +LPL ++    +A++GPNA+     LGNY G     +T L
Sbjct: 383 KKQALKMAQKSIVLMKNEKNILPLNRNIK-KIAVVGPNADDGLMQLGNYNGTPSSIVTIL 441

Query: 453 QALQNYVENT--VYYPGCDTVACSSAS 477
             ++    N   +Y  G +    SS +
Sbjct: 442 DGIKTKFPNAEIIYEKGSEVTDPSSRT 468



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 57/105 (54%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K AD +V   GL  + E EE+          D+  + LP  Q++L+  + +  K PV+ V
Sbjct: 621 KNADVIVFAGGLSPSLEGEEMMVNAEGFKGGDKTSIALPKVQRDLLAELRKTGK-PVVFV 679

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           L  G  + +   + ++N  ++L A Y G++G  A+A+V+ GD+NP
Sbjct: 680 LCTGSALGL--EQDEKNYDALLNAWYGGQSGGTAVADVLAGDYNP 722


>gi|325918730|ref|ZP_08180824.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325535054|gb|EGD06956.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 391

 Score =  284 bits (726), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 156/372 (41%), Positives = 217/372 (58%), Gaps = 29/372 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            QRA  LV++++ DEK++Q +N APAIPRL IPAYEWWSE LHG+A  G           
Sbjct: 33  EQRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNGY---------- 82

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMTFWAPNINIF 164
           AT FPQ I  AAS+++ L  ++G  +  EARA +N          +  G+T W+PNINIF
Sbjct: 83  ATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIF 142

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRG ET GEDP +TG+ AV ++RG+QGD  N        +  A  KH   +    
Sbjct: 143 RDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDDLN------HPRTIATPKHIAVHSGPE 196

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+ FD  V+ +D+  TY P F + +  G+A  +MCAYN ++G P+CA   LL+  
Sbjct: 197 ---PGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCAYNSLHGTPACAADWLLNGR 253

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
            R  WGF G++ SDCDAV  +     +      +    LKAG D+NCG    +    A++
Sbjct: 254 VRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKAGHDLNCG-HAYRELGTAIE 312

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
           + ++ E+ +D++L  LF+ R RLG     P   P+ ++GA  V + AH+ LALQAA + I
Sbjct: 313 RGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAKDVDNAAHRALALQAAAESI 371

Query: 405 VLLKNSHGLLPL 416
           VLLKN+   LPL
Sbjct: 372 VLLKNTATTLPL 383


>gi|333381842|ref|ZP_08473521.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332829771|gb|EGK02417.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 861

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 161/442 (36%), Positives = 243/442 (54%), Gaps = 32/442 (7%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+    L   +RA+DL+SRLTL EK+  + +++  + RLG+  + WWSEALHGVA  
Sbjct: 19  QIMPYKDANLTPEERAQDLLSRLTLKEKVGLMGDNSIEVTRLGVKKFAWWSEALHGVANQ 78

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN--------AGQAI 152
           G          G T FP+ I  AASF+  L Y +  AI  EARA ++          Q  
Sbjct: 79  G----------GVTVFPEPIGMAASFNDELLYHVFDAISDEARARFHFREKKGDERRQDN 128

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
           G++ W PN+NIFRDPRWGRGQET GEDP +T +  +S V G+QG      K K  L   A
Sbjct: 129 GLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGISVVNGLQGP--KDAKYKKLL---A 183

Query: 213 CCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KH+  +    W    R+  +   +  + L +TY P F+  V++   S +MCAY+R + 
Sbjct: 184 CAKHYAVHSGPEW---NRHVLNLNNLDNRHLWETYMPAFQVLVQKADVSQVMCAYHRQDD 240

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC 331
            P C + +LL +  R +WGF   + SDC A++  Y +   +     + V  + AG DV C
Sbjct: 241 DPCCGNNHLLKRILRDEWGFKRMVVSDCGAIADFYTSHKVSSDALHSAVKGVLAGTDVEC 300

Query: 332 GSFLQKHTKA-AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
           G     H    AV +  + E++ID+++  L + R RLG F+ N ++ P+  I   ++   
Sbjct: 301 GFGYTYHELVDAVSRGLIYEADIDKSVLRLLTERFRLGDFDDN-SIVPWANIPDTIINCK 359

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            HQ LAL+ A+  + LL+N + +LPL   K +  A+IGPNA+ AK + GNY G   +++T
Sbjct: 360 KHQALALEMARQSMTLLQNKNNILPLSSKKKI--AVIGPNADDAKLMWGNYNGIPVKTVT 417

Query: 451 PLQALQNYVENTVYYP-GCDTV 471
            L+ +++     ++Y  GCD V
Sbjct: 418 ILEGIKSIAGKDIFYEKGCDIV 439



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/109 (33%), Positives = 54/109 (49%), Gaps = 13/109 (11%)

Query: 482 VDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKP 531
           VD  K  D VV   G+    E EE+          DR D+ LP  Q+  I  + +A K+ 
Sbjct: 593 VDRLKDIDVVVFAGGISGELEGEEMPIEMPGFKGGDRTDIELPASQRNCIKALKKAGKR- 651

Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            ++++ C G   I       +  +IL A Y G++G  A+AEV+FG +NP
Sbjct: 652 -VIMVNCSGSA-IGLMPESESCEAILQAWYGGQSGGQAIAEVLFGKYNP 698


>gi|393724082|ref|ZP_10344009.1| beta-glucosidase [Sphingomonas sp. PAMC 26605]
          Length = 900

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 173/419 (41%), Positives = 234/419 (55%), Gaps = 47/419 (11%)

Query: 56  ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
           AR +V+R+T  EK+ QL+N APAIPRL IPAY WW+E+LHG  G              T+
Sbjct: 50  ARAIVARMTPAEKLPQLLNVAPAIPRLNIPAYNWWTESLHGAIGTVP----------TTN 99

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDP 167
           FP+ I  AASFD+ L +R+  AI  E RAL+   +          G+  W+PNINIFRDP
Sbjct: 100 FPEPIGLAASFDAPLVHRVAGAISTEVRALHTLARETGKLGRIGTGLDTWSPNINIFRDP 159

Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           RWGRGQET GEDP +T    V++V G+QG   +        +  A  KHF  +   +   
Sbjct: 160 RWGRGQETYGEDPYLTAHQGVAFVTGMQGPNPD------LPEVIATPKHFAVH---SGPE 210

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
           +TR+  +  V+  DL DTY P F + + +GRA  IMCAYNR++G P+CA+  LL    R 
Sbjct: 211 STRHAANVYVSPHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACANDLLLKDHLRG 270

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-------FLQKHTK 340
            WGF+GY+ SDCDAV  I D   YA     AV   LKAG+D  C +        L     
Sbjct: 271 AWGFNGYVVSDCDAVKDIADNHKYAPDQATAVAAALKAGVDNECNTQTIGDIGGLPDRFG 330

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNG---NPTMQPFGKIGADVVCSPAHQVLAL 397
            A+K+  + E ++DR L  LFS R+R G   G    P + P G I      +P H  LAL
Sbjct: 331 EALKRNLISEGDVDRTLVRLFSARIRNGDLPGIAAAPAVVPVGAI-----LTPDHAALAL 385

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           +AAQ  +V+LKN  G+LPL       +A+IGP  ++ + L GNY+  S +S  P+  ++
Sbjct: 386 EAAQKSLVMLKND-GVLPL--RPGARIAVIGPLGDATRVLRGNYS--STQSAPPVSVVE 439



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 48/75 (64%), Gaps = 3/75 (4%)

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+  L LP  QQ L+ R A A  KP+I++ + G P+++ +AK   N  +I+ A YPG+AG
Sbjct: 658 DKTTLDLPAEQQALLER-AHATGKPLIVIAMNGSPINLAWAK--DNAAAIVEAWYPGQAG 714

Query: 567 AVALAEVIFGDHNPG 581
            +A+A V+ G  +PG
Sbjct: 715 GLAVANVLTGKADPG 729


>gi|409195436|ref|ZP_11224099.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
           21150]
          Length = 867

 Score =  283 bits (723), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 162/419 (38%), Positives = 232/419 (55%), Gaps = 31/419 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
            +RA DL+  LTL+EK+S +V+   AI RLGI  Y WW+EALHGVA  G+          
Sbjct: 34  EERADDLLKELTLEEKVSLMVDRNTAIERLGIEEYNWWNEALHGVARAGQ---------- 83

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQAIGMTFWAPNINIFR 165
           AT FPQ +  AA+FD  +   +  A   EARA ++        G+  G+T W PNIN+FR
Sbjct: 84  ATVFPQPVGMAAAFDRDMVLDVFSAASDEARAKHHFFKERGERGRYQGLTMWTPNINVFR 143

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG E  GEDP + G    + V+G+QGD    GK     +  AC KH+  +    W
Sbjct: 144 DPRWGRGMEAYGEDPFMNGVLGTAVVKGLQGD--RSGKYD---KLHACAKHYAVHSGPEW 198

Query: 226 KGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
               R+ F+A  +  +DL +TY P F+  V  G    +MCAYNR  G P C +  LL   
Sbjct: 199 ---NRHSFNAENIRPRDLHETYLPAFKKLVIDGDVRMVMCAYNRFEGEPCCGNNQLLRDI 255

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCGSFLQKHTKAA 342
            R +WGF G + SDC A++  ++ + +A  P+   A  D + AG D+NCG       + A
Sbjct: 256 LRNEWGFDGVVVSDCWAINDFFNKDAHAMYPDAKTASTDAVLAGTDLNCGDSYPSLVE-A 314

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
           V+Q  + E ++D +L  L   R  LG  + +  ++ + KI   VV SP H  +AL+AA+ 
Sbjct: 315 VEQGLITEEQLDISLRRLLIARFELGEMDPDEEVE-WSKIPHSVVSSPTHSEMALEAARK 373

Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
            + LL N +G LPL K + +++A++GPNAN +    GNY G    + T LQ ++N + N
Sbjct: 374 SMTLLMNKNGALPL-KKEGLTVAVMGPNANDSLMQWGNYNGTPATTTTILQGIRNALGN 431



 Score = 62.0 bits (149), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 43/117 (36%), Positives = 63/117 (53%), Gaps = 15/117 (12%)

Query: 476 ASIDKAVDIAK--GADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
           A +D    +AK   AD VV   G+    E EE+          DR D+ LP  Q+E++  
Sbjct: 589 AKVDIPSSVAKVADADVVVFASGISPFLEGEEMGVDLPGFKGGDRTDIALPAIQKEMLKA 648

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + +A K+  I+++ C G   I F +      +IL A YPG+AG  A+AEV+FGD+NP
Sbjct: 649 LHKAGKE--IILVNCSGSA-IGFEEATDYSSAILQAWYPGQAGGQAVAEVLFGDYNP 702


>gi|110737298|dbj|BAF00595.1| xylosidase [Arabidopsis thaliana]
          Length = 303

 Score =  282 bits (722), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 138/232 (59%), Positives = 168/232 (72%), Gaps = 12/232 (5%)

Query: 8   LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDE 67
           LVF LLCL  +S     +S +P F+CDP+N  T T  FC+  +PI  R +DL+ RLTL E
Sbjct: 18  LVF-LLCLVHSS-----ESLRPLFACDPANGLTRTLRFCRANVPIHVRVQDLLGRLTLQE 71

Query: 68  KISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFD 127
           KI  LVN+A A+PRLGI  YEWWSEALHG++ VG G  F G   GATSFPQVI TAASF+
Sbjct: 72  KIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFPQVITTAASFN 131

Query: 128 SYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYA 187
             LW  IG+ +  EARA+YN G A G+T+W+PN+NI RDPRWGRGQETPGEDP+V  KYA
Sbjct: 132 QSLWEEIGRVVSDEARAMYNGGVA-GLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYA 190

Query: 188 VSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
            SYVRG+QG T  G +LK     +ACCKH+TAYDLDNW G  R+ F+A+V +
Sbjct: 191 ASYVRGLQG-TAAGNRLK----VAACCKHYTAYDLDNWNGVDRFHFNAKVNL 237


>gi|380692997|ref|ZP_09857856.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 837

 Score =  281 bits (720), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 164/424 (38%), Positives = 241/424 (56%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           PI +R +DL+S+LT++EK+S L  ++P I R+GI  Y   +EALHG+   GK        
Sbjct: 20  PIHERVQDLLSKLTIEEKVSLLRATSPGIERMGIDKYYMGNEALHGIIRPGK-------- 71

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
              T FPQ I  A+ ++  L + I   I  EARA +N  +         +  +TFW+P +
Sbjct: 72  --FTVFPQAIGLASMWNPELHHIIAGVISDEARARWNELERGKKQKDQFSDLLTFWSPTV 129

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QGD          L+A A  KHF A  
Sbjct: 130 NMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDH------PRYLKAVATPKHFAA-- 181

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  DA +T  DL + Y P FE C+++G+A  IM AYN +NG+P  A+  LL
Sbjct: 182 --NNEEHNRFYCDAAITETDLREYYFPAFEKCIREGKAESIMTAYNAINGVPCTANNWLL 239

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH-TK 340
           +K  ++ WGF+GYI SDC A  ++     Y K+PE A +  +KAG+DV CG ++  +   
Sbjct: 240 NKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKTPEAAAMIAIKAGLDVECGDYVFANPLL 299

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  +  +EID A + +   RMRLG+F+ +P   P+  +  ++V    H  LAL+AA
Sbjct: 300 NAYKQYMVSAAEIDSAAYRVLRARMRLGMFD-DPEKNPYNHLSPEIVGCKKHHDLALEAA 358

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           +  IVLLKN    LPL   K  S+A++G NA  A    G+Y+G P    ++ L  ++N V
Sbjct: 359 RQSIVLLKNQQNTLPLNAQKIKSIAVVGINA--ANCEFGDYSGTPVNAPVSVLDGIRNRV 416

Query: 460 ENTV 463
            N +
Sbjct: 417 GNEI 420



 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 63/101 (62%), Gaps = 3/101 (2%)

Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
           A  I + +D V+ +MG++Q+ E+E  DR  + LP + Q++  R A  A    I+VL+ G 
Sbjct: 581 ASKIIRESDVVIAVMGINQSIEREGQDRNSIELP-KDQQIFIREAYKANPNTIVVLVAGS 639

Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            + I +   D++I +I+ A YPGE G  A+AEV+FGD+NP 
Sbjct: 640 SMAIGWM--DQHIPAIIDAWYPGEQGGTAIAEVLFGDYNPA 678


>gi|218132025|ref|ZP_03460829.1| hypothetical protein BACEGG_03650 [Bacteroides eggerthii DSM 20697]
 gi|217985785|gb|EEC52125.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 888

 Score =  281 bits (719), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 168/434 (38%), Positives = 233/434 (53%), Gaps = 38/434 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG-VAGVG 101
           +PF    L + +R  +L+S LTL EK+S ++N + AI RLGIPAY WW EA HG +AG  
Sbjct: 28  YPFQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWGEACHGLIAG-- 85

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------- 153
                     G T FPQ I  AA+FD          +  EARA YN     G        
Sbjct: 86  ----------GVTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPLDGDIGPYVSA 135

Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
              +TFWAPNINIFRDPRWGRGQET GEDP +  +  ++ V G+QGD       +   + 
Sbjct: 136 IPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQGDD------EHYYKT 189

Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
            AC KH   Y + +     R++F+A V+M+DL +TY P FE+ V +G    +MCAY+   
Sbjct: 190 HACAKH---YGVHSGPEPLRHEFNAVVSMRDLWETYLPAFETLVVKGNVREVMCAYSAYE 246

Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMD 328
           G P CA   LL    R +WGF G + SDCDA++  Y    +   P    A  D +  G D
Sbjct: 247 GEPCCASNRLLVDILRNRWGFDGMVVSDCDAINDFYVKGRHETHPDAAAASADAVLTGTD 306

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           + CG       + AV++  + E ++D +L  + + R RLGL +    + P+  I   V+ 
Sbjct: 307 LECGRSYNALIE-AVEKGIIKEQDLDVSLRRILTERFRLGLLDPAKYV-PYSTIPGSVID 364

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
              H+  AL+ A +  VLLKN   +LPL K+   S+A++GPN N +  + GNY+G     
Sbjct: 365 CQEHRDHALKMAHESQVLLKNEGNILPLDKNIK-SIAIVGPNINDSIMMRGNYSGSPTHC 423

Query: 449 ITPLQALQNYVENT 462
           IT LQ L+N + NT
Sbjct: 424 ITILQGLKNKLPNT 437



 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 61/121 (50%), Gaps = 3/121 (2%)

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
           E +V +   D      A  ++  +  K  + ++ + G+  TQE E  +R  + LP  Q+ 
Sbjct: 603 EGSVSHLSVDMYERKLAVFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKR 662

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
            +  + E  K PVI V  C G   I  A  D    ++L A YPG+ G  A+A+V+FGD+N
Sbjct: 663 FLKAMHETGK-PVIYVN-CSGSA-IALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYN 719

Query: 580 P 580
           P
Sbjct: 720 P 720


>gi|373460605|ref|ZP_09552356.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
 gi|371955223|gb|EHO73027.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
          Length = 858

 Score =  281 bits (718), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 163/437 (37%), Positives = 239/437 (54%), Gaps = 30/437 (6%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +P+    L    RA+DL+SRLTL+EK   +++ +PAIPRLGI  + WWSEALHG A +G 
Sbjct: 31  YPYQNPNLSALTRAQDLLSRLTLEEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMGN 90

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-----NAGQA---IGM 154
                      T FP+ I  AASF+  L Y++  A   E RA Y     N G+      +
Sbjct: 91  ----------VTVFPEPIAMAASFNDALLYKVFSAASDEMRAQYHHRIRNGGEDEKFHSL 140

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           + W PN+NIFRDPRWGRGQET GEDP +T     + VRG+QG   +  +     +  AC 
Sbjct: 141 SVWTPNVNIFRDPRWGRGQETYGEDPYLTAVMGTAVVRGLQGPEDSKYR-----KLWACA 195

Query: 215 KHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           KH+  +    +   TR+  +   V+ +DL +TY P F++ V++ +   +MCAY  ++  P
Sbjct: 196 KHYAVHSGPEY---TRHTANLNNVSPRDLWETYLPAFKTLVEEAKVREVMCAYQALDDEP 252

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG- 332
            C +  LL +  R +WGF   + SDC AVS I+     +     A      AG DV CG 
Sbjct: 253 CCGNSRLLQQILRDEWGFQYLVVSDCGAVSDIWQNHKTSSDAVHATAKAALAGTDVECGF 312

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
           ++  K    AV++  + E E+D+ +  L   R  LG  + +P + P+ KI   V+ S AH
Sbjct: 313 NYTYKCIPEAVQRGLISEKEVDKHVLRLLEGRFDLGEMD-DPALVPWSKIPYSVMDSKAH 371

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + L+L  A+  IVLL+N   +LPL K+    +A+IGPNA++   + GNY G   R++T L
Sbjct: 372 RQLSLDMARQSIVLLQNKQNMLPLKKNNE-RIAVIGPNADNVPMMWGNYNGTPNRTVTIL 430

Query: 453 QALQNYVENTVYYPGCD 469
             ++   +N  Y  GCD
Sbjct: 431 DGIRAKHKNVKYIKGCD 447



 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           KG + VV + G+    E EE+          DR D+ LP  Q++ I  +  A K+  ++ 
Sbjct: 606 KGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAAGKQ--LVY 663

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + C G   I          +I+ A Y G+ G  A+A+V+FGD+NP
Sbjct: 664 VNCSGSA-IALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNP 707


>gi|299141953|ref|ZP_07035087.1| beta-glucosidase [Prevotella oris C735]
 gi|298576415|gb|EFI48287.1| beta-glucosidase [Prevotella oris C735]
          Length = 858

 Score =  281 bits (718), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 163/437 (37%), Positives = 239/437 (54%), Gaps = 30/437 (6%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +P+    L    RA+DL+SRLTL+EK   +++ +PAIPRLGI  + WWSEALHG A +G 
Sbjct: 31  YPYQNPNLSALTRAQDLLSRLTLEEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMGN 90

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-----NAGQA---IGM 154
                      T FP+ I  AASF+  L Y++  A   E RA Y     N G+      +
Sbjct: 91  ----------VTVFPEPIAMAASFNDALLYKVFSAASDEMRAQYHHRIRNGGEDEKFHSL 140

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           + W PN+NIFRDPRWGRGQET GEDP +T     + VRG+QG   +  +     +  AC 
Sbjct: 141 SVWTPNVNIFRDPRWGRGQETYGEDPYLTAVMGTAVVRGLQGPEDSKYR-----KLWACA 195

Query: 215 KHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           KH+  +    +   TR+  +   V+ +DL +TY P F++ V++ +   +MCAY  ++  P
Sbjct: 196 KHYAVHSGPEY---TRHTANLNNVSPRDLWETYLPAFKTLVEEAKVREVMCAYQALDDEP 252

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG- 332
            C +  LL +  R +WGF   + SDC AVS I+     +     A      AG DV CG 
Sbjct: 253 CCGNSRLLQQILRDEWGFQYLVVSDCGAVSDIWQNHKTSSDAVHATAKAALAGTDVECGF 312

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
           ++  K    AV++  + E E+D+ +  L   R  LG  + +P + P+ KI   V+ S AH
Sbjct: 313 NYTYKCIPEAVQRGLISEKEVDKHVLRLLEGRFDLGEMD-DPALVPWSKIPYSVMDSKAH 371

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + L+L  A+  IVLL+N   +LPL K+    +A+IGPNA++   + GNY G   R++T L
Sbjct: 372 RQLSLDMARQSIVLLQNKQNMLPLKKNNE-RIAVIGPNADNVPMMWGNYNGTPNRTVTIL 430

Query: 453 QALQNYVENTVYYPGCD 469
             ++   +N  Y  GCD
Sbjct: 431 DGIRAKHKNVKYIKGCD 447



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 52/105 (49%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           KG + VV + G+    E EE+          DR D+ LP  Q++ I  +  A K+  ++ 
Sbjct: 606 KGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAAGKQ--LVY 663

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + C G   I          +I+ A Y G+ G  A+A+V+FGD+NP
Sbjct: 664 VNCSGSA-IALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNP 707


>gi|317474349|ref|ZP_07933623.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909030|gb|EFV30710.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 877

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 168/434 (38%), Positives = 233/434 (53%), Gaps = 38/434 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG-VAGVG 101
           +PF    L + +R  +L+S LTL EK+S ++N + AI RLGIPAY WW EA HG +AG  
Sbjct: 17  YPFQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWGEACHGLIAG-- 74

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------- 153
                     G T FPQ I  AA+FD          +  EARA YN     G        
Sbjct: 75  ----------GVTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPLDGDIGPYVSA 124

Query: 154 ---MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
              +TFWAPNINIFRDPRWGRGQET GEDP +  +  ++ V G+QGD       +   + 
Sbjct: 125 IPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQGDD------EHYYKT 178

Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
            AC KH   Y + +     R++F+A V+M+DL +TY P FE+ V +G    +MCAY+   
Sbjct: 179 HACAKH---YGVHSGPEPLRHEFNAVVSMRDLWETYLPAFETLVVKGNVREVMCAYSAYE 235

Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMD 328
           G P CA   LL    R +WGF G + SDCDA++  Y    +   P    A  D +  G D
Sbjct: 236 GEPCCASNRLLVDILRNRWGFDGMVVSDCDAINDFYVKGRHETHPDAAAASADAVLTGTD 295

Query: 329 VNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVC 388
           + CG       + AV++  + E ++D +L  + + R RLGL +    + P+  I   V+ 
Sbjct: 296 LECGRSYNALIE-AVEKGIIKEQDLDVSLRRILTERFRLGLLDPAKYV-PYSTIPGSVID 353

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
              H+  AL+ A +  VLLKN   +LPL K+   S+A++GPN N +  + GNY+G     
Sbjct: 354 CQEHRDHALKMAHESQVLLKNEGNILPLDKNIK-SIAVVGPNINDSIMMRGNYSGSPTHC 412

Query: 449 ITPLQALQNYVENT 462
           IT LQ L+N + NT
Sbjct: 413 ITILQGLKNKLPNT 426



 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 61/121 (50%), Gaps = 3/121 (2%)

Query: 460 ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
           E +V +   D      A  ++  +  K  + ++ + G+  TQE E  +R  + LP  Q+ 
Sbjct: 592 EGSVSHLSVDMYERKLAVFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKR 651

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
            +  + E  K PVI V  C G   I  A  D    ++L A YPG+ G  A+A+V+FGD+N
Sbjct: 652 FLKAMHETGK-PVIYVN-CSGSA-IALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYN 708

Query: 580 P 580
           P
Sbjct: 709 P 709


>gi|380696428|ref|ZP_09861287.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 851

 Score =  280 bits (717), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 167/424 (39%), Positives = 243/424 (57%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SRLT++EKIS L  ++P IPRLGI  Y   +EALHGV   G+        
Sbjct: 34  PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 85

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
              T FPQ I  AA+++  L  R+   I  EARA +N    G+A        +TFW+P +
Sbjct: 86  --FTVFPQAIGLAATWNPELQRRVATVISDEARARWNELDQGRAQKEQFSDVLTFWSPTV 143

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QGD  +       L+  +  KHF A  
Sbjct: 144 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPH------YLKIVSTPKHFAA-- 195

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P   +  LL
Sbjct: 196 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNAWLL 253

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
            K  R+ WGF GY+ SDC   +++ +A  Y K+ E A    LKAG+D+ CG         
Sbjct: 254 QKVLRKDWGFQGYVVSDCGGPALLVNAHKYLKTKEAAATLSLKAGLDLECGDDVYDGPLL 313

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  + +++ID A +++ + RM+LGLF+G     P+ KI   V+ S  HQ +AL AA
Sbjct: 314 NAYKQYMVSDADIDSAAYHVLTARMKLGLFDG-VERNPYTKISPSVIGSKEHQQIALDAA 372

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           +  IVLLKN   +LPL  SK  S+A++G NA   K   G+Y+G P    ++ LQ ++N V
Sbjct: 373 RQCIVLLKNQKNMLPLNASKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSILQGIRNRV 430

Query: 460 ENTV 463
            + V
Sbjct: 431 GDQV 434



 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 59/93 (63%), Gaps = 3/93 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + V+ +MG++++ E+E  DR D+ LP  Q+E +  + +     +I++L+ G  + I +  
Sbjct: 603 ETVIAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNSN-MIVILVAGSSLAINWM- 660

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            D ++ +I+ A YPGE G  A+AEV+FGD+NP 
Sbjct: 661 -DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPA 692


>gi|298386950|ref|ZP_06996504.1| beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298260100|gb|EFI02970.1| beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 846

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 161/424 (37%), Positives = 242/424 (57%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           PI +R +DL+S+LT++EKIS L  ++P I R+GI  Y   +EALHG+   GK        
Sbjct: 29  PIHERIQDLLSKLTIEEKISLLRATSPGIERMGIDKYYMGNEALHGIIRPGK-------- 80

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
              T FPQ I  A+ ++  L + I   I  EARA +N  +         +  +TFW+P +
Sbjct: 81  --FTVFPQAIGLASMWNPELHHIIASVISDEARARWNELERGKKQKDQFSDLLTFWSPTV 138

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QGD          L++ +  KHF A  
Sbjct: 139 NMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDH------PRYLKSVSTPKHFAA-- 190

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  DA +T  D+ + Y P FE C+++G+A  IM AYN +NG+P  A+  LL
Sbjct: 191 --NNEEHNRFYCDAAITETDMREYYLPAFEKCIREGKAESIMTAYNAINGVPCTANNWLL 248

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL-QKHTK 340
           +K  ++ WGF+GYI SDC A  ++     Y K+PE A +  +KAG+D+ CG ++      
Sbjct: 249 NKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKTPEAAAMIAIKAGLDLECGDYVFGAPLL 308

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  +  +EID A +++   RMRLG+F+ +P   P+  +  ++V    H+ LAL+AA
Sbjct: 309 NAYKQYMVSTAEIDSAAYHVLRARMRLGMFD-DPEKNPYNHLSPEIVGCEKHKELALEAA 367

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           +  IVLLKN    LPL   K  S+A++G NA  A    G+Y+G P    ++ L  ++N V
Sbjct: 368 RQSIVLLKNQKNTLPLNAKKIKSIAVVGINA--ANCEFGDYSGTPVNAPVSVLDGIRNRV 425

Query: 460 ENTV 463
            N +
Sbjct: 426 GNEI 429



 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 63/101 (62%), Gaps = 3/101 (2%)

Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
           A  + + +D V+ +MG++Q+ E+E  DR  + LP + Q++  R A  A    I+VL+ G 
Sbjct: 590 ASKVIRESDVVIAVMGINQSIEREGQDRSSIELP-KDQQIFIREAYKANPNTIVVLVAGS 648

Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            + + +   D+NI +I+ A YPGE G  A+AEV+FGD+NP 
Sbjct: 649 SMAVGWM--DQNIPAIIDAWYPGEQGGTAIAEVLFGDYNPA 687


>gi|146301613|ref|YP_001196204.1| beta-glucosidase [Flavobacterium johnsoniae UW101]
 gi|146156031|gb|ABQ06885.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 864

 Score =  280 bits (715), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 163/448 (36%), Positives = 247/448 (55%), Gaps = 33/448 (7%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
           N + + +P+   +L   +RA DL+SRLTL+EK + + + + AIPRLGI  + WWSEALHG
Sbjct: 16  NLNAQQYPYKDPSLSSKKRAEDLISRLTLEEKAALMCDQSDAIPRLGIKKFNWWSEALHG 75

Query: 97  VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------- 148
            A        N  +   T FP+ I  AASFD  L +R+  A+  E RA YN         
Sbjct: 76  YAN-------NDNV---TVFPEPIGMAASFDDQLVFRVFDAVSDEGRAKYNQWIQNGNEN 125

Query: 149 GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKL 208
            + + ++ W PN+NIFRDPRWGRGQET GEDP +T +  VS V+G+QG      K +  L
Sbjct: 126 KRFLSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPA--DAKYRKLL 183

Query: 209 QASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
              AC KHF  +    W   +R++ +   V  ++L +TY P F++ V++     +MCAY 
Sbjct: 184 ---ACAKHFAVHSGPEW---SRHELNLNNVKPRELYETYLPAFKALVQEADVRQVMCAYQ 237

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
           R++  P C++  LL +  R +WGF   + SDC AV+  Y     +     A    + AG 
Sbjct: 238 RLDDEPCCSNTRLLQRILRDEWGFQYLVVSDCGAVTDFYTTHKVSSDEVHAASKAVLAGT 297

Query: 328 DVNC--GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
           DV C    +  K    AV++  + E EI+++L  +   R  LG  + + ++ P+ +I A 
Sbjct: 298 DVECVWDKYPFKKLPEAVEKDLIKEEEINKSLLRVLIGRFDLGEMDDD-SIVPWAQIPAS 356

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
           V+ S  HQ LAL+ AQ  + LL+N + +LPL K+ +  +A+IGPNA++   L GNY G  
Sbjct: 357 VLNSKEHQQLALEMAQKSMTLLQNKNNILPLNKNVN-KVAVIGPNADNEPMLWGNYNGTP 415

Query: 446 CRSITPLQALQNYV--ENTVYYPGCDTV 471
            ++IT  + ++  +     +Y   CD V
Sbjct: 416 NKTITIKKGIEGKLAANKVLYDKSCDLV 443



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 21/149 (14%)

Query: 451 PLQALQNYVENTVYYPGCDTVACSSASIDKAVDI--------AKGADHVVLMMGLDQTQE 502
           P++A   Y    ++    D  A       K  DI         KG D V+   GL    E
Sbjct: 557 PVEAGMKYKIEILFAQSNDWQANLEFDFGKEFDIDFKALIQKLKGIDTVIFAGGLSTLLE 616

Query: 503 KEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
            EE+          DR ++ LP  Q++ +  +  A KK  ++ + C G   I       +
Sbjct: 617 GEEMPVSFPGFKGGDRTNIELPAVQRKCLKELKAAGKK--VIFVNCSGSA-IALTPETES 673

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             +IL A YPGE+G  A+A+V+FGD+NP 
Sbjct: 674 CDAILQAWYPGESGGQAVADVLFGDYNPA 702


>gi|332184951|ref|ZP_08386700.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
 gi|332014675|gb|EGI56731.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
          Length = 886

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 163/401 (40%), Positives = 226/401 (56%), Gaps = 38/401 (9%)

Query: 56  ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
           A  LV ++T DEK+ QL+N+APAIPRLG+PAY WW+E+LHG  G              T+
Sbjct: 34  AATLVGKMTTDEKLYQLLNTAPAIPRLGVPAYNWWTESLHGALGPVP----------TTN 83

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDP 167
           FP+ +  AA+FD  L  R  QAI  E R L+   +          G+  W+PNINIFRDP
Sbjct: 84  FPEPVGLAATFDDALVGRAAQAISREVRGLHALARQTGRLGRIGTGLDSWSPNINIFRDP 143

Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           RWGRGQET GEDP +T +  V++VRG+QG   N      ++   A  KH+  +   +   
Sbjct: 144 RWGRGQETYGEDPFLTARMGVAFVRGMQGTDPN------RIDVVATPKHYAVH---SGPE 194

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
           +TR++ +  V+  DL DTY P F + + +G A  IMCAYNR++G P+CA   LL    R 
Sbjct: 195 STRHEANVYVSQHDLVDTYLPAFRAAIVEGGAGSIMCAYNRIDGQPACASDLLLKDYLRG 254

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-------K 340
           +WGF GY+ SDCDAV  I     YA  P  AV   ++AG+D  C +     T       +
Sbjct: 255 RWGFRGYVVSDCDAVKDIDANHHYAPDPATAVAAAMRAGVDNECNNATLTDTDGLTNPYR 314

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A+ +  +  ++IDR+L  LF+ R R G   G   +       AD + +PAHQ LAL+ A
Sbjct: 315 DALARGLISVADIDRSLTRLFAARYRTGDLPGVRPLSMQSSSPAD-IGTPAHQALALETA 373

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
           +  +VLLKN  G+LPL       +A+IGP  ++ + L GNY
Sbjct: 374 EKSLVLLKNK-GILPL--KAQARIAVIGPLGDATRVLRGNY 411



 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 61/106 (57%), Gaps = 13/106 (12%)

Query: 485 AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
           AK +D +V ++GL    E EE           D+  L LP  QQ L+ + A+A  KP+I+
Sbjct: 612 AKDSDALVAVVGLTSDLEAEEAPISVPGFKGGDKTTLDLPADQQALLEQ-AKATGKPLIV 670

Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           VL+ G PV++ +AK   N  +I+ A YPG+AG +A+  V+ G  NP
Sbjct: 671 VLMNGSPVNLAWAKA--NADAIVEAWYPGQAGGLAVGNVLSGKTNP 714


>gi|294675359|ref|YP_003575975.1| 1,4-beta-xylosidase [Prevotella ruminicola 23]
 gi|225016052|gb|ACN78955.1| xylosidase/arabinofuranosidase [Prevotella ruminicola]
 gi|294472720|gb|ADE82109.1| putative 1,4-beta-xylosidase [Prevotella ruminicola 23]
          Length = 861

 Score =  279 bits (714), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 166/444 (37%), Positives = 241/444 (54%), Gaps = 35/444 (7%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           S +T P+    L   +RA DL SRLTL+EK   +++ +PAIPRLGI  + WWSEALHG A
Sbjct: 18  SAQTLPYQNPNLSAKERAVDLCSRLTLEEKAMLMLDESPAIPRLGIKKFFWWSEALHGAA 77

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA------GQAI 152
            +G            T+FP+ +  AASF+ +L +++      E RA YN       G+ +
Sbjct: 78  NMG----------NVTNFPEPVGMAASFNPHLLFKVFDIASTEFRAQYNHRMYDLNGEDM 127

Query: 153 GM---TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQ 209
            M   + W PN+NIFRDPRWGRGQET GEDP +T    V  V+G+QG      + +   +
Sbjct: 128 KMRSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVQVVKGLQGP--EDARYR---K 182

Query: 210 ASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRV 269
             AC KH+  +    +   T    D  V+ +D  +TY P F++ VK  +   +MCAY R+
Sbjct: 183 LWACAKHYAVHSGPEYTRHTANLTD--VSARDFWETYMPAFKTLVKDAKVREVMCAYQRL 240

Query: 270 NGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK---AG 326
           +  P C    LL +  R +WGF   + SDC AVS  Y+     KS  DAV    K   AG
Sbjct: 241 DDDPCCGSTRLLQQILRDEWGFEYLVVSDCGAVSDFYENH---KSSSDAVHGTSKAVLAG 297

Query: 327 MDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
            DV CG ++  K    AV++  L E E+D+ +  L   R  LG  + +P++  + KI   
Sbjct: 298 TDVECGFNYAYKSLPEAVRKGLLSEKEVDKHVIRLLEGRFDLGEMD-DPSLVEWSKIPYS 356

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
            + + A   +AL  A+  IVLL+N + +LPL K+ +  +A+IGPNA++   + GNY G  
Sbjct: 357 AMSTKASANVALDMARQTIVLLQNKNNILPLKKN-AEKIAIIGPNAHNEPMMWGNYNGTP 415

Query: 446 CRSITPLQALQNYVENTVYYPGCD 469
             ++T L  ++   +  VY PGCD
Sbjct: 416 NHTVTILDGVKAKQKKLVYIPGCD 439



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 60/117 (51%), Gaps = 15/117 (12%)

Query: 477 SIDKAVDIA--KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRV 524
           +ID    IA  KG + V+   G+  + E EE+          DR  + LP  Q+E + + 
Sbjct: 589 NIDYQETIAQLKGINKVIFCGGIAPSLEGEEMPVNIEGFKGGDRTSIELPKVQREFL-KA 647

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +AA K VI V  C G   I       +  +I+ A YPG+ G  A+A+V+FGD+NPG
Sbjct: 648 LKAAGKQVIYVN-CSGSA-IALQPETESCDAIVQAWYPGQEGGTAVADVLFGDYNPG 702


>gi|389794138|ref|ZP_10197297.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
 gi|388432924|gb|EIL89908.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
          Length = 869

 Score =  279 bits (713), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 169/415 (40%), Positives = 233/415 (56%), Gaps = 40/415 (9%)

Query: 59  LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
           LV+R+TL EK+ QL+N APAIPRLGIP Y WW+E+LHG  G              T+FP+
Sbjct: 25  LVARMTLAEKLPQLLNVAPAIPRLGIPRYNWWTESLHGALGTLP----------TTNFPE 74

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDPRWG 170
            I   ASFD+ L  R+  AI  E R L+   +          G+  W+PN+NIFRDPRWG
Sbjct: 75  PIGLGASFDAPLVKRVAAAISTEVRGLHTLARQTGRLGRIGTGLDTWSPNLNIFRDPRWG 134

Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
           RGQET GEDP +T +  V++V G+QG   +      + +  A  KHF  +   N   +TR
Sbjct: 135 RGQETYGEDPFLTARMGVAFVEGMQGPNAD------RPEVIATPKHFAVH---NGPESTR 185

Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           ++ +  V+  DL DTY P F + + + +A  IMCAYNR++G P+CA+  LL    R  WG
Sbjct: 186 HEANVFVSRHDLVDTYLPAFRAAITEAKAGSIMCAYNRIDGQPACANDLLLKHYLRGAWG 245

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-------SFLQKHTKAAV 343
           F GY+ SDCDAV  I D   YA     AV   ++AG+D  C        + L+     A+
Sbjct: 246 FTGYVVSDCDAVKDISDHHHYAPDAATAVAAAVRAGVDNECNGATLSDTAGLEDRYSEAL 305

Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
           +++ +  ++IDRAL  LF+ R R+G   G P   P   +    + S A+  LAL AAQ  
Sbjct: 306 QRELISVADIDRALVRLFAARYRVGDLAGLPGTAP--AVPPSAIGSAANSQLALDAAQKT 363

Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP-SCRSITPLQALQN 457
           +VLLKN  G+LPL     V +A+IGP  ++ + L GNY+ P S   I+ L  LQ 
Sbjct: 364 LVLLKND-GVLPL--KAGVKVAVIGPLGDATRVLRGNYSSPLSGTPISVLDGLQQ 415



 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 65/116 (56%), Gaps = 16/116 (13%)

Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVA 525
           A +D+A   A  AD +V ++GL    E EE           D+  L LP  Q  ++ + A
Sbjct: 589 ADLDRA---AAQADVLVAVVGLTSDLEAEETSVTVPGFAGGDKTTLDLPADQIAMLKQ-A 644

Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            A  KP+I+V + G P+D+ ++K  +N  +IL A YPG++G +A+A V+ G  +PG
Sbjct: 645 RATGKPLIVVAMNGSPIDLAWSK--QNAAAILEAWYPGQSGGLAIANVLSGKTDPG 698


>gi|373952814|ref|ZP_09612774.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373889414|gb|EHQ25311.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 862

 Score =  278 bits (711), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 161/441 (36%), Positives = 239/441 (54%), Gaps = 30/441 (6%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           S +  P+    L    RA+DLV+RLTL EK+  + + + A+PRLGI  + WWSEALHG A
Sbjct: 18  SAQQLPYQNPALSSEARAKDLVTRLTLKEKVGLMKDVSEAVPRLGIKKFNWWSEALHGYA 77

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN-------AGQA 151
             G            T FP+ +  AASFD    + +  A+  EARA  N       + + 
Sbjct: 78  NQGP----------VTVFPEPVGMAASFDDQKLFHVFDAVSDEARAKNNEYRKQVESQRF 127

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
             ++ W PN+NIFRDPRWGRGQET GEDP +T +  VS V+G+QG      K +  L   
Sbjct: 128 HDLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGPA--DAKYRKLL--- 182

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
           AC KH+  +    W   +R++ +   VT +DL +TY P F+S V+      +MCAY R++
Sbjct: 183 ACAKHYAVHSGPEW---SRHEMNVTDVTPRDLWETYLPAFKSLVQDADVREVMCAYQRLD 239

Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
             P C +  LL +  R  WGF   + SDC A++  Y++   +     A    + +G DV 
Sbjct: 240 DEPCCGNSRLLGQILREDWGFKYLVVSDCGAITDFYNSHHSSSDATHASAKAVLSGTDVE 299

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
           C  +       AV +  + E +I+ ++  L + R  LG  + +  + P+ KI   VV S 
Sbjct: 300 CVGYAFDKIPDAVYRGLIKEKDINTSVVRLMTQRFELGEMDKD-ELVPWTKIPLSVVNSE 358

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            HQ LAL  A++ + LL+N++ +LPL KS    LA+IGPNAN ++ L GNY G   R+I 
Sbjct: 359 DHQKLALDMARETMTLLQNNNNILPLSKSIG-KLAVIGPNANDSQMLSGNYNGTPLRTIN 417

Query: 451 PLQALQNYV--ENTVYYPGCD 469
            L+ ++  +  ++ +Y  GCD
Sbjct: 418 ILEGIKTKLGADHVIYDAGCD 438



 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 59/114 (51%), Gaps = 13/114 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           + D  ++  K AD VV + G+    E EE+          DR D+ LP  Q+  I  + +
Sbjct: 588 TYDGIIEKVKDADIVVFVGGISPKLEGEEMPVQLPGFKGGDRTDIELPAVQRNCIEALRK 647

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           A KK  I+ + C G   I      +N  +IL A Y GE+G  A+A+V+FGD+NP
Sbjct: 648 AGKK--IVFVNCSGSA-IAMVPETQNCDAILQAWYAGESGGQAVADVLFGDYNP 698


>gi|237721201|ref|ZP_04551682.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229448997|gb|EEO54788.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 863

 Score =  278 bits (711), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 169/439 (38%), Positives = 243/439 (55%), Gaps = 38/439 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +P+  T L   QRA DL+ RLTL+EK++ + N++PAIPRLGI  YEWW+EALHGVA  G 
Sbjct: 26  YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAGL 85

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----LYNAGQAI---GMT 155
                     AT FPQ I  AASF+  L Y +  A+  EARA        GQ     G+T
Sbjct: 86  ----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACC 214
            W PN+NIFRDPRWGRGQET GEDP ++G+  ++ VRG+QG +     KL       AC 
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYDKL------HACA 189

Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           KHF  +    W    R+ F+A  +  +DL +TY P F+  V++     +MCAYNR  G P
Sbjct: 190 KHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYNRFEGDP 246

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNC 331
            C    LL++  R  WGF G + +DC A+   +  + +   P+   A  D + +G D+ C
Sbjct: 247 CCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLSGTDLEC 306

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           G   +  T  AVK+  + E +I+ ++  L   R  LG  N   +  P+  I   V+  P 
Sbjct: 307 GGNFKSITD-AVKKGLISEEKINTSVKRLLKARFELGEMN---STHPWSNIPFSVIDCPK 362

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+ LAL+ A + +VLL+N+     LP ++ + +A+IGPNAN +    GNY G    ++T 
Sbjct: 363 HKELALKMAHESLVLLQNN--NNLLPLNRQMKVAVIGPNANDSVMQWGNYNGFPSHTVTL 420

Query: 452 LQALQNYVENT--VYYPGC 468
           L+ ++  + +   +Y P C
Sbjct: 421 LEGIRAKLPDAQIIYEPVC 439



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 54/105 (51%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           + AD V+   G+    E E +          DR ++ LP  Q+E++  + +  KK  + V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGKK-TVFV 656

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              G  + I      +N  +IL A YPG+AG  A+A+V+FGD+NP
Sbjct: 657 NFSGSAMAIV--PETQNCDAILQAWYPGQAGGTAVADVLFGDYNP 699


>gi|317477153|ref|ZP_07936394.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316906696|gb|EFV28409.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 863

 Score =  278 bits (710), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 158/437 (36%), Positives = 243/437 (55%), Gaps = 29/437 (6%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           PIS R  +++ ++TL+EK++QL N + +IPRL +P+Y +W+E LHGVA  G+        
Sbjct: 55  PISVRIENIIRQMTLEEKVAQLSNESDSIPRLNLPSYNYWNECLHGVARAGE-------- 106

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  RI  AI  EAR  Y + G+  G+T+WAP IN+ RDPRW
Sbjct: 107 --VTVFPQAINLASTWDTLLVKRIASAISTEARLKYLDIGK--GLTYWAPTINMARDPRW 162

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
           GR +ET GEDP +T +  V++V+G+QGD  N       L+  A  KHF A + +N     
Sbjct: 163 GRNEETYGEDPYLTSRLGVAFVKGLQGDHPN------YLKTVATVKHFVANNQEN----D 212

Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
           R+   +++  + L + Y P +E+CVK+     IM AYN  NGIP      LL    R++W
Sbjct: 213 RFSSSSQIPTKQLYEYYFPAYEACVKEANVQSIMTAYNAFNGIPPSGSTWLLEDVLRKEW 272

Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLP 349
           GF G++ SDC A+ ++        S E+A    + +G D+ CG   +++  AAV++  + 
Sbjct: 273 GFDGFVVSDCGAIGVMNWQHRIVNSLEEAAALGINSGCDLECGGTYRENLVAAVQRGLVS 332

Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN 409
           E  IDRAL  + ++R +LG F+    + P+      ++     + LA +AA   I+LLKN
Sbjct: 333 EYAIDRALTRVLTMRFKLGEFDP-IELVPYNHYDKKLLAGEQFRRLAYEAAVKSIILLKN 391

Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYP 466
               LP+ K    S+A++GP A++    LG Y+G    +I+ LQ ++  V   V   Y  
Sbjct: 392 EDNFLPIDKKDVRSIAIVGPFADN--NYLGGYSGKPVHNISLLQGVKKMVGEEVEISYIE 449

Query: 467 GCDTVACSSASIDKAVD 483
           G   V+   +S   A D
Sbjct: 450 GTSVVSPVDSSYLLASD 466



 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 3/103 (2%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           IDK   I   AD V++ +G D    +E  D   + LP  Q+ L+  + +   + + L+L 
Sbjct: 599 IDKVKKIVSRADLVLVALGNDGKLARENRDLPSIYLPMTQELLLKEIYKVNPR-IALILQ 657

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            G P+   +A    ++ SIL A YPG+ G  ALA ++FG  NP
Sbjct: 658 TGNPLTSQWAA--EHVPSILQAWYPGQEGGAALAGILFGLENP 698


>gi|313203744|ref|YP_004042401.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443060|gb|ADQ79416.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 1286

 Score =  278 bits (710), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 174/465 (37%), Positives = 252/465 (54%), Gaps = 43/465 (9%)

Query: 5   KLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLT 64
           KLS +  L C  F + +    ST+ P              +  T+    +RA DL+SRLT
Sbjct: 7   KLSCLI-LSCFAFMAFMPAKVSTKKPI-------------YLNTSYSFEERAADLISRLT 52

Query: 65  LDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAA 124
           L+EK S L NS  AIPRLGI +   WSEALHG+ G   G   +  I G TSFP  +   +
Sbjct: 53  LEEKESLLGNSMAAIPRLGIKSMNVWSEALHGILG---GANQSVGISGPTSFPNSVALGS 109

Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
           ++D  L  R   AI  EARA+   G   G+T+W+P +   RDPRWGR  E+ GEDP +  
Sbjct: 110 AWDPALMQREAMAIADEARAINQTGTK-GLTYWSPVVEPIRDPRWGRTGESYGEDPFLAA 168

Query: 185 KYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
           + A  +VRG+ G+  T+        L++  C KH+ A + +  +  +    D+R    D+
Sbjct: 169 EIAGGFVRGMVGNDPTY--------LKSVPCAKHYFANNSEFDRHVSSSNMDSR----DM 216

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
            + Y  P++  ++Q     IM +YN VNG+P+ A +  L   ARR +G  GYIT DC A+
Sbjct: 217 REFYLAPYKKLIEQDNLPSIMSSYNAVNGVPTSASQLYLDTIARRTYGLKGYITGDCAAI 276

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFS 362
             IY    Y K+ E+A    LKAG+D +CGS  Q++  AA+K+  +  ++IDRAL N+F 
Sbjct: 277 EDIYTGHYYVKTAEEATAKGLKAGVDSDCGSIYQRYAIAALKKGLITMADIDRALLNIFI 336

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL------LPL 416
           VRMR G F+  P    + +   ++V SPA++ LA + A    VLLKN+  L      LPL
Sbjct: 337 VRMRTGEFD-PPAKVLYAQFQPNIVNSPANKALAKEIATKTPVLLKNNISLKTNRKALPL 395

Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRS--ITPLQALQNYV 459
             +    +ALIGP+A+  K  LG Y+G   +   ITP   ++ Y+
Sbjct: 396 NPADLKKIALIGPHAD--KVELGPYSGRPAQENMITPFAGIKKYI 438



 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/86 (44%), Positives = 51/86 (59%), Gaps = 1/86 (1%)

Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
           G D+    EE DR+ L+LPG Q ELI  VA A     I+V+   G V++   K  +NI  
Sbjct: 619 GTDEKTATEEADRLTLLLPGNQVELIKAVA-AVNPNTIVVMQTLGCVEVEEFKNLQNIPG 677

Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPG 581
           I+W GY G+A   A+A V+FG+ NPG
Sbjct: 678 IIWVGYNGQAQGDAIASVLFGEVNPG 703


>gi|325105296|ref|YP_004274950.1| beta-glucosidase [Pedobacter saltans DSM 12145]
 gi|324974144|gb|ADY53128.1| Beta-glucosidase [Pedobacter saltans DSM 12145]
          Length = 884

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 166/450 (36%), Positives = 244/450 (54%), Gaps = 37/450 (8%)

Query: 36  SNPSTETFP--FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
           SN  ++  P  F    LP+++R  +L+  LTL+EK+  ++NS+  + RLGIPAY+WW+EA
Sbjct: 21  SNLKSQEIPYKFRNPDLPVNERIENLLGLLTLEEKVGLMMNSSKPVGRLGIPAYDWWNEA 80

Query: 94  LHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA----- 148
           LHGVA  GK          AT FPQ I  AA+++     +    I  EARA YN      
Sbjct: 81  LHGVARSGK----------ATVFPQAIGMAATWNESGHKQTFDLISDEARAKYNEAIRNG 130

Query: 149 --GQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
             G+  G++FW PNINIFRDPRWGRGQET GEDP +T +  V+ VRG+QGD         
Sbjct: 131 ERGRYYGLSFWTPNINIFRDPRWGRGQETYGEDPYLTARLGVAAVRGLQGDD------PK 184

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
             +  AC KHF  +    W    R+ +DA  + +DL +TY P F++ VK+     +MCAY
Sbjct: 185 YFKTHACAKHFAVHSGPEW---NRHSYDATASGRDLWETYLPAFKALVKEANVQEVMCAY 241

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY---DAEGYAKSPEDAVVDVL 323
           N   G P C    LL+   R +W + G + SDC A+   +     E +  +   A   V+
Sbjct: 242 NAYEGQPCCGSDRLLTDILRNRWEYKGIVVSDCWAIDDFFRKGHHETHKDAAAAAADAVI 301

Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
            +  D+ CGS      + AV+Q  + + +ID +L  +      LG+ +    + P+ ++ 
Sbjct: 302 HS-TDLECGSAYTNLLE-AVRQGLISQQQIDISLRRVLRGWFELGMLDPAERL-PWSQLP 358

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
             +V S  H   AL+ A++ + LLKN+  +LPL KS    +A+IGPNA  +  L GNY G
Sbjct: 359 YQIVASKEHVQQALKVARESMTLLKNNGSILPLSKSIK-KIAVIGPNAADSVMLWGNYNG 417

Query: 444 PSCRSITPLQALQNYVENT--VYYPGCDTV 471
               ++T LQ ++N + +   +Y  GCD V
Sbjct: 418 TPNSTVTILQGIKNKLPHAEIIYDKGCDWV 447



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 56/105 (53%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           K  D +V   GL    E EE+          D++ + LP  Q+EL++ + ++  KPV+ V
Sbjct: 606 KEVDAIVYAGGLSPQLEGEEMPVNADGFRGGDKISIDLPKIQRELLSSL-KSTGKPVVFV 664

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           L  G    +   + ++N  ++L A Y G+    A+A+V+FGD+NP
Sbjct: 665 LCTGS--SLALEQDEKNYNALLCAWYGGQEAGTAVADVLFGDYNP 707


>gi|224068498|ref|XP_002302758.1| predicted protein [Populus trichocarpa]
 gi|222844484|gb|EEE82031.1| predicted protein [Populus trichocarpa]
          Length = 462

 Score =  277 bits (708), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 136/259 (52%), Positives = 186/259 (71%), Gaps = 2/259 (0%)

Query: 324 KAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
           +A +D++CG FL +HT+ AV++  L E+EI+ AL N  +V+MRLG+F+G P+ +P+G +G
Sbjct: 5   QASLDLDCGPFLGQHTEDAVRKGLLTEAEINNALLNTLTVQMRLGMFDGEPSSKPYGNLG 64

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
              VC+PAHQ LAL+AA+ GIVLLKN    LPL      S+A+IGPN+N   T++GNYAG
Sbjct: 65  PTDVCTPAHQELALEAARQGIVLLKNHGPPLPLSTRHHQSVAIIGPNSNVTVTMIGNYAG 124

Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQE 502
            +C   TPLQ +  Y + T+Y  GC  VAC S      A+D A+ AD  VL+MGLDQ+ E
Sbjct: 125 VACGYTTPLQGIGRYAK-TIYQQGCADVACVSDQQFVAAMDAARQADATVLVMGLDQSIE 183

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E  DR +L+LPGRQQELI++VA A+K P ILVL+ GGP+D++FA+ D  IG I+WAGYP
Sbjct: 184 AESRDRTELLLPGRQQELISKVAAASKGPTILVLMSGGPIDVSFAENDPKIGGIVWAGYP 243

Query: 563 GEAGAVALAEVIFGDHNPG 581
           G+AG  A+++V+FG  NPG
Sbjct: 244 GQAGGAAISDVLFGTTNPG 262


>gi|409196594|ref|ZP_11225257.1| beta-glucosidase [Marinilabilia salmonicolor JCM 21150]
          Length = 876

 Score =  277 bits (708), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 157/454 (34%), Positives = 247/454 (54%), Gaps = 44/454 (9%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +PF   +L   +R +DL+SRLT++EK + L + + AIPRLGI  + WWSEALHG A    
Sbjct: 33  YPFQNPSLSFEERVKDLISRLTVEEKAALLCDQSDAIPRLGIEKFNWWSEALHGYAN--- 89

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--------M 154
               N ++   T FPQ +  AASF+  L + I  A   E RA Y+  Q  G        +
Sbjct: 90  ----NDSV---TVFPQPVGMAASFNDELVFDIFNATSDEGRAKYHQAQRRGEENRRFLSL 142

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           + W PN+NIFRDPRWGRGQET GEDP +T +  V  V+G+QG      K +  L   AC 
Sbjct: 143 SVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGP--EDAKYRKLL---ACA 197

Query: 215 KHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
           KH+T +    W   +R++ +   V+ ++  +TY P F++ V+      +MCAY+R++  P
Sbjct: 198 KHYTVHSGPEW---SRHELNVNDVSPREFYETYMPAFKALVQDADVRQVMCAYHRLDDEP 254

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-- 331
            C++  +L +  R +WG+   + +DC A++  +   G + +P  A    L AG D+ C  
Sbjct: 255 CCSNTRILQRILRDEWGYEHLVAADCGAITDFHTTHGISSTPVHAAATGLLAGTDLECIW 314

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
            ++  K    A+++  + E  ID +L  +   R  LG  + +  + P+ +  A V+ +  
Sbjct: 315 DNYHYKMLPEALEKDLIKEEHIDSSLFRVLMGRFELGEMDDDAIV-PWAQTPASVLNNKE 373

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+ LA + AQ  + LL+N + +LPL KS +  +A+IGPNA+  + L GNY G   R+IT 
Sbjct: 374 HRELAYKMAQQSMTLLQNKNNVLPLDKSSTQKIAVIGPNADDEEVLWGNYNGTPIRTITI 433

Query: 452 LQALQNYVE-NTVYYPGCDTVACSSASIDKAVDI 484
           L  +++    + ++Y             DKAVD+
Sbjct: 434 LDGMKSKTNSDRIFY-------------DKAVDL 454



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 51/105 (48%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           +G + VV   GL  + E EE+          DR  + LP  Q+  +  + EA K  VI V
Sbjct: 612 EGIETVVFAGGLSGSLEGEEMPVSYPGFKGGDRTHIELPSVQRNALKALKEAGKT-VIFV 670

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              G    I       +  +IL A Y GE+G  A+A+V+FGD+NP
Sbjct: 671 NFSGSA--IALEPETESCDAILQAWYAGESGGQAIADVLFGDYNP 713


>gi|330996729|ref|ZP_08320604.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
 gi|329572574|gb|EGG54217.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
          Length = 852

 Score =  276 bits (705), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 169/427 (39%), Positives = 236/427 (55%), Gaps = 34/427 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           F     P  +R  DL+SRLT++EKIS LVN APAI RLGI  Y   +EALHGV  V  G 
Sbjct: 29  FRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPGD 86

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAIGMT 155
           F        T FPQ I  AA ++  L YRI  AI  EAR  +          AG +  +T
Sbjct: 87  F--------TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLLT 138

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW+P +N+ RDPRWGR  ET GEDP ++G   V++V+G+QG+          L+  +  K
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGNH------PRYLKTVSTPK 192

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           HF      N +   R   +A+V+ +DL + Y P FE C+ +G+A  IM AYN VN +P  
Sbjct: 193 HFAV----NNEEHNRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVNDVPCT 248

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
            +  L+    R  WGF+GYI SDC A   +     Y K+ E A    +KAG+D+ CG+ +
Sbjct: 249 VNTYLIKNVLRGDWGFNGYIVSDCSAPEWMITKHHYVKTREAAATLAVKAGLDLECGNQV 308

Query: 336 Q-KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             +    A +Q  + E++ID A + +   RM LGLF+ +P+  P+ +I   VV   AHQ 
Sbjct: 309 YGEGLLKAYRQYMVSEADIDSAAYRILRGRMMLGLFD-DPSQNPYNQIEPSVVGCKAHQD 367

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
           LAL+AA+  +VLLKN    LPL   K  S+A++G +A   +   G+Y+G P    +T L 
Sbjct: 368 LALEAARQSMVLLKNKDNFLPLNPQKVKSIAVVGISAGHCE--FGDYSGTPKNEPVTILD 425

Query: 454 ALQNYVE 460
            ++ Y E
Sbjct: 426 GIKQYAE 432



 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 39/98 (39%), Positives = 59/98 (60%), Gaps = 3/98 (3%)

Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
           +A   D  V ++G++++ E+E  DR  L LP  QQE I  + +     V+ VL+ G  + 
Sbjct: 600 VAAECDVTVAVLGINKSIEREGQDRFTLELPIDQQEFIKELYKVNPNTVV-VLVAGSSLA 658

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           + +   D N+ +IL A YPGE G  A+AEV+FGD+NPG
Sbjct: 659 VNWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPG 694


>gi|224536538|ref|ZP_03677077.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521794|gb|EEF90899.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 863

 Score =  275 bits (704), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 166/437 (37%), Positives = 233/437 (53%), Gaps = 34/437 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +L   +RA  LV  LTL+EK   +++ + ++ RLGI  Y WW+EALHGVA  G   
Sbjct: 23  YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMTFW 157
                   AT FPQ I  AASF+  + Y +  A+  EARA   Y A Q       G+T W
Sbjct: 81  --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T +  V  V+G+QG     GK     +  AC KHF
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPA--DGKYD---KLHACAKHF 187

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R+ F+A  +  +DL +TY PPFE+ VK+G+   +MCAYNR  G P C 
Sbjct: 188 AVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDPCCG 244

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R +WGF G + SDC A++  Y+  G+   P  E A    + +G D+ CGS 
Sbjct: 245 SDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVISGTDLECGSS 304

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             K    +VK+  + E  +D ++  L   R  LG  +  P    + KI   VV S AH  
Sbjct: 305 Y-KALIESVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASAAHDS 362

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LAL  A++ + LL N    LPL K   +++A++GPNAN +    GNY G    ++T L  
Sbjct: 363 LALNMARESMTLLMNKDNFLPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPAHTVTILDG 421

Query: 455 LQNYV---ENTVYYPGC 468
           ++N +   +  +Y  GC
Sbjct: 422 VRNLLGTDDKLIYEQGC 438



 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/113 (39%), Positives = 61/113 (53%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           I K+V+  K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  +  A
Sbjct: 590 IRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALHRA 649

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KK +ILV   G P  I      +   +IL A YPG+ G  A+AEV+FGD+NP
Sbjct: 650 GKK-IILVNCSGSP--IGLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDYNP 699


>gi|336399370|ref|ZP_08580170.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
 gi|336069106|gb|EGN57740.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
          Length = 862

 Score =  275 bits (703), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 171/471 (36%), Positives = 244/471 (51%), Gaps = 46/471 (9%)

Query: 12  LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
           LL  C    +  V++ Q P+  DP              L    RA+DL SRLTL+EK S 
Sbjct: 6   LLTGCLLMCVVGVNAQQSPYQ-DPG-------------LSFEARAKDLCSRLTLEEKASL 51

Query: 72  LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
           + + +PAIPRLGI  + WWSEALHG A        NG +   T FP+ I  AASF+  + 
Sbjct: 52  MCDVSPAIPRLGIKPFNWWSEALHGYAN-------NGDV---TVFPEPIGMAASFNPTMV 101

Query: 132 YRIGQAIGLEARALYNAGQAIG--------MTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
           Y++  A   EAR  YN   A G        ++ W PN+NIFRDPRWGRGQET GEDP +T
Sbjct: 102 YQVFTATSDEARGKYNQSMAEGKEDTRFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLT 161

Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDL 242
               V  V+G+QG      +     +  AC KHF  +    +   TR+  + A ++ +DL
Sbjct: 162 SVMGVEVVKGLQGPESTKYR-----KLYACAKHFAVHSGPEY---TRHTANLADISPRDL 213

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
            +TY P F++ V+Q     +MCAY R++  P C +  LL +  R +WGF   + SDC A+
Sbjct: 214 WETYLPAFKATVQQAGVREVMCAYQRLDDEPCCGNSRLLQQILRDEWGFRHMVVSDCGAI 273

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLF 361
           +  Y     +     A      AG DV CG  +       AV++  + E+E+D+ +  L 
Sbjct: 274 ADFYTNHHVSSDAVHAAAKGTLAGTDVECGFGYAYMKLPEAVRRGLVSEAEVDKHVIRLL 333

Query: 362 SVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKS 421
             R  LG+ + +P +  + KI   VV S AH+ LAL  A+  + LL+N + +LPL K + 
Sbjct: 334 KGRFELGVMD-DPKLVSWTKISPKVVDSDAHRQLALNMARQTMTLLQNRNNVLPLAKGEK 392

Query: 422 VSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV-YYPGCDTV 471
           +  A++GPNA     L GNY G   R+ T L+ ++      + Y  GCD V
Sbjct: 393 I--AVVGPNAADGPMLWGNYNGTPSRTTTILEGIRAKAGKDIPYLQGCDLV 441



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/111 (31%), Positives = 56/111 (50%), Gaps = 13/111 (11%)

Query: 481 AVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKK 530
           A+   +G   VV + G+    E EE+          DR  + LP  Q++ + +  +AA K
Sbjct: 593 AIRQLRGVRTVVFVGGISSKLEGEEMPVHVEGFKGGDRTSIELPAVQRDFL-KALKAAGK 651

Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            V+ V   G  + +T      +  +IL A Y GE G  A+A+V++GD+NPG
Sbjct: 652 TVVFVNCSGSAIALT--PEVESCDAILQAWYAGEEGGRAVADVLYGDYNPG 700


>gi|423223593|ref|ZP_17210062.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638218|gb|EIY32065.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 863

 Score =  275 bits (703), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 166/437 (37%), Positives = 233/437 (53%), Gaps = 34/437 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +L   +RA  LV  LTL+EK   +++ + ++ RLGI  Y WW+EALHGVA  G   
Sbjct: 23  YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMTFW 157
                   AT FPQ I  AASF+  + Y +  A+  EARA   Y A Q       G+T W
Sbjct: 81  --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T +  V  V+G+QG     GK     +  AC KHF
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPA--DGKYD---KLHACAKHF 187

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R+ F+A  +  +DL +TY PPFE+ VK+G+   +MCAYNR  G P C 
Sbjct: 188 AVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDPCCG 244

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R +WGF G + SDC A++  Y+  G+   P  E A    + +G D+ CGS 
Sbjct: 245 SDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVISGTDLECGSS 304

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             K    +VK+  + E  +D ++  L   R  LG  +  P    + KI   VV S AH  
Sbjct: 305 Y-KALIESVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASAAHDS 362

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LAL  A++ + LL N    LPL K   +++A++GPNAN +    GNY G    ++T L  
Sbjct: 363 LALNMARESMTLLMNKDNFLPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPAHTVTILDG 421

Query: 455 LQNYV---ENTVYYPGC 468
           ++N +   +  +Y  GC
Sbjct: 422 VRNLLGTDDKLIYEQGC 438



 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/113 (39%), Positives = 61/113 (53%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           I K+V+  K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  +  A
Sbjct: 590 IRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALHRA 649

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KK +ILV   G P  I      +   +IL A YPG+ G  A+AEV+FGD+NP
Sbjct: 650 GKK-IILVNCSGSP--IGLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDYNP 699


>gi|332881172|ref|ZP_08448831.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
 gi|357047867|ref|ZP_09109460.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
           11840]
 gi|332680886|gb|EGJ53824.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
 gi|355529206|gb|EHG98645.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
           11840]
          Length = 851

 Score =  275 bits (703), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 169/427 (39%), Positives = 234/427 (54%), Gaps = 34/427 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           F     P  +R  DL+SRLT++EKIS LVN APAI RLGI  Y   +EALHGV  V  G 
Sbjct: 29  FRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPGD 86

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAIGMT 155
           F        T FPQ I  AA ++  L YRI  AI  EAR  +          AG +  +T
Sbjct: 87  F--------TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLLT 138

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW+P +N+ RDPRWGR  ET GEDP ++G   V++V+G+QGD          L+  +  K
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGDH------PRYLKTVSTPK 192

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           HF      N +   R   +A+V+ +DL + Y P FE C+ +G+A  IM AYN VN +P  
Sbjct: 193 HFAV----NNEEHNRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVNDVPCT 248

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
            +  L+    R  WGF+GYI SDC A   +     Y K+ E A    +K G+D+ CG+ +
Sbjct: 249 VNTYLIKNVLRGDWGFNGYIVSDCSAPEWMITKHHYVKTREAAATLAVKVGLDLECGNQV 308

Query: 336 Q-KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             +    A +Q  + E++ID A + +   RM LGLF+  P+  P+ +I   VV   AHQ 
Sbjct: 309 YGEGLLKAYRQYMVSEADIDSAAYRILRGRMMLGLFDA-PSQNPYNQIEPSVVGCKAHQD 367

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
           LAL+AA+  +VLLKN    LPL   K  S+A++G +A   +   G+Y+G P    +T L 
Sbjct: 368 LALEAARQSMVLLKNKDNFLPLNPKKVKSIAVVGISAGHCE--FGDYSGTPKNEPVTILD 425

Query: 454 ALQNYVE 460
            ++ Y E
Sbjct: 426 GIKQYAE 432



 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/106 (38%), Positives = 61/106 (57%), Gaps = 3/106 (2%)

Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
           A    A  +A   D  V ++G++++ E+E  DR  L LP  QQE I  + +     V+ V
Sbjct: 591 ARFGDAGKVAAECDVTVAVLGINKSIEREGQDRFSLELPVDQQEFIKELYKVNPNTVV-V 649

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           L+ G  + + +   D N+ +IL A YPGE G  A+AEV+FGD+NPG
Sbjct: 650 LVAGSSMAVNWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPG 693


>gi|340347926|ref|ZP_08671025.1| beta-glucosidase [Prevotella dentalis DSM 3688]
 gi|433652779|ref|YP_007296633.1| beta-glucosidase-like glycosyl hydrolase [Prevotella dentalis DSM
           3688]
 gi|339608382|gb|EGQ13286.1| beta-glucosidase [Prevotella dentalis DSM 3688]
 gi|433303312|gb|AGB29127.1| beta-glucosidase-like glycosyl hydrolase [Prevotella dentalis DSM
           3688]
          Length = 849

 Score =  275 bits (703), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 167/427 (39%), Positives = 242/427 (56%), Gaps = 34/427 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
           +++A  L  +LTL EKIS L++ +PAIP+L IPA++WW+EALHGVA        NG    
Sbjct: 26  NRQAEALCRQLTLSEKISLLLDDSPAIPQLNIPAFQWWNEALHGVAR-------NGV--- 75

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIG-------LEARALYNAGQAIGMTFWAPNINIFR 165
           AT FPQ I  AA+FD  L  ++  A+          A+AL +  +  G++FW PNINIFR
Sbjct: 76  ATVFPQTIGMAAAFDDELLQQVFTAVSDEARAKAERAKALGHVRRYQGLSFWTPNINIFR 135

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFTAYDLDN 224
           DPRWGRGQET GEDP +TG+  ++ VRG+QG DT          +  AC KHF  +   +
Sbjct: 136 DPRWGRGQETYGEDPYLTGRMGLAVVRGLQGPDT------ASYYKTLACAKHFAVH---S 186

Query: 225 WKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
              + R++ +   ++ +DL +TY P F+S V +G+ + +MCAY R +G P C +  LL  
Sbjct: 187 GPESLRHEMNIENLSPRDLWETYLPAFKSLVTKGQVAEVMCAYQRFDGQPCCGNSRLLQH 246

Query: 284 TARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
             R +W F G + SDC A+S  +     G AK   +A     +AG DV CGS      + 
Sbjct: 247 ILRDEWNFKGLVVSDCGAISDFWIPGRHGVAKDAVEASAQAQRAGTDVECGSNYHS-LEE 305

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQ 401
           AVK   L ES+IDR++  +   R+ LG  + +  + P+  I   VV  PAH+ LAL+ A+
Sbjct: 306 AVKAGVLKESDIDRSVIRVLEARLALGDISPSADV-PWKTIPYAVVDCPAHRQLALRMAR 364

Query: 402 DGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN 461
           + +VLL+N H  LPL  S    + ++G NA  +  L  NY G    ++T L  ++   E+
Sbjct: 365 ESMVLLQNKHHTLPL--STHDKILVVGENAVDSMMLWANYNGTPSHTMTILDGIRQLAEH 422

Query: 462 TVYYPGC 468
             + PGC
Sbjct: 423 VEFMPGC 429



 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 61/122 (50%), Gaps = 13/122 (10%)

Query: 469 DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQ 518
           D V      + + ++ A+ AD VV + GL    E EE+          DR  + LP  Q+
Sbjct: 574 DVVHKQPFDLHQLIEKARQADAVVFVGGLSPRLEGEEMKVDLPGFSGGDRTSIELPVAQR 633

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           E I  ++  A+KP+++V   G    I      +N  +IL A YPGE G  A+A+V+FG  
Sbjct: 634 EAIQALS-VARKPIVMVNCSGSA--IALEPETKNCDAILQAWYPGEEGGRAVADVLFGKV 690

Query: 579 NP 580
           NP
Sbjct: 691 NP 692


>gi|189461690|ref|ZP_03010475.1| hypothetical protein BACCOP_02354 [Bacteroides coprocola DSM 17136]
 gi|189431577|gb|EDV00562.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 499

 Score =  275 bits (703), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 161/420 (38%), Positives = 241/420 (57%), Gaps = 34/420 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SRLT++EK+S L  ++P I RL IP Y   +EALHGV   G+        
Sbjct: 34  PLHERIMDLLSRLTVEEKVSLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-------- 85

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA--------GQ-AIGMTFWAPNI 161
              T FPQ I  AA+++  L Y++   I  EARA +N         GQ +  +TFW+P +
Sbjct: 86  --FTVFPQAIGLAATWNPELQYQVATVISDEARARWNELDQGKLQKGQFSDLLTFWSPTV 143

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++VRG+QGD          L+  +  KHF A  
Sbjct: 144 NMARDPRWGRTPETYGEDPYLSGTMGTAFVRGLQGDDAR------YLKVVSTPKHFAA-- 195

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R++ + +++ + L + Y P FE+C+K G+A+ IM AYN +N +P   +  LL
Sbjct: 196 --NNEEHNRFECNPQISEKQLREYYLPAFEACIKDGKAASIMSAYNAINNVPCTLNSWLL 253

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
           +K  R  WGF GY+ SDC   S++ +A  Y K+ E A    +KAG+D+ CG     +   
Sbjct: 254 TKVLRHDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYYEPLL 313

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  + +++ID   +++   RMRLGLF+ N    P+ KI   ++ S  HQ +AL+AA
Sbjct: 314 NAYKQYMVSDADIDSTAYHVLKARMRLGLFD-NGKNNPYTKISPSIIGSKLHQRVALEAA 372

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           +  IVLLKN + +LPL   K  S+A++G NA + +   G+Y+G P    I+ LQ ++  V
Sbjct: 373 RQCIVLLKNHNWVLPLDTKKLKSIAVVGINAGNCE--FGDYSGSPVIAPISILQGIKERV 430


>gi|326427096|gb|EGD72666.1| hypothetical protein PTSG_04397 [Salpingoeca sp. ATCC 50818]
          Length = 614

 Score =  274 bits (701), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 166/432 (38%), Positives = 241/432 (55%), Gaps = 23/432 (5%)

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           +PNINI RDPRWGR QE P EDPL+ G++   Y  G+Q      G+     +     KH+
Sbjct: 11  SPNININRDPRWGRNQEVPSEDPLLNGEFGKLYTMGLQQ-----GEDSRYTKVVVTLKHW 65

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
            AY L++  G TR+ FDA+V+   L DTY P F   V +G A G+MC+YN +NG P+C  
Sbjct: 66  DAYSLEDSDGFTRHNFDAKVSNFALMDTYWPAFRKAVMEGNAKGVMCSYNALNGRPTCT- 124

Query: 278 RNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM-DVNCGSFLQ 336
             LL+K  R  W F GY+TSD  A+  IY    Y  +   AV   L+ G  D++ G+   
Sbjct: 125 HPLLTKVLRDIWKFDGYVTSDTGAIEDIYAKHHYTANASAAVAAALRDGRCDMDSGAVYH 184

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM-QPFGKIGADVVCSPAHQVL 395
                AV   +    ++DRAL+N   +R  LGLF  +P   QP+ +I A  + +   Q L
Sbjct: 185 DALLDAVNSGECSMDDVDRALYNTLKLRFELGLF--DPIEDQPYWRINASSINTTYAQDL 242

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS------I 449
            ++   + ++LL+N +  LP  K + V  A+IGP+ N+ + L+GNY G  C         
Sbjct: 243 NMKITLESMILLQNHNNALPFKKGRKV--AVIGPHINAQEALVGNYLGQLCPDDSFDCIT 300

Query: 450 TPLQALQ--NYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
           +PL A++  N + NTV   G   +AC+ ASI +AV++AK AD+VVL++G++ T E E  D
Sbjct: 301 SPLAAIEAINGMSNTVSAMGSGVLACTDASIQEAVNVAKDADYVVLLIGINDTIEAESND 360

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R  + LP  Q +L   +A    K    VL+ GG + I   K  + + +I+ AGYPG  G 
Sbjct: 361 RTSIDLPQCQHKLTAAIAH-LNKTTAAVLINGGMLAIEQEK--KQLPAIIEAGYPGFYGG 417

Query: 568 VALAEVIFGDHN 579
            A+A+ IFGD+N
Sbjct: 418 AAIAKTIFGDNN 429


>gi|373461705|ref|ZP_09553443.1| hypothetical protein HMPREF9944_01707 [Prevotella maculosa OT 289]
 gi|371951597|gb|EHO69442.1| hypothetical protein HMPREF9944_01707 [Prevotella maculosa OT 289]
          Length = 662

 Score =  274 bits (701), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 167/425 (39%), Positives = 237/425 (55%), Gaps = 34/425 (8%)

Query: 55  RARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
           +A  L  +LTL EK+S L++ +PAIPRL IPA++WW+EALHGVA        NG    AT
Sbjct: 19  QAEALCRQLTLSEKVSLLLDDSPAIPRLNIPAFQWWNEALHGVAR-------NGV---AT 68

Query: 115 SFPQVILTAASFDSYLWYRIGQAIG-------LEARALYNAGQAIGMTFWAPNINIFRDP 167
            FPQ I  AA+FD  L  ++   +          A+AL +  +  G++FW PNINIFRDP
Sbjct: 69  VFPQTIGMAAAFDDELLQQVFTVVSDEARAKAERAKALGHVRRYQGLSFWTPNINIFRDP 128

Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           RWGRGQET GEDP +TG+  ++ VRG+QG DT          +  AC KHF  +   +  
Sbjct: 129 RWGRGQETYGEDPYLTGRMGLAVVRGLQGPDT------ACYYKTLACAKHFAVH---SGP 179

Query: 227 GTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
            + R++ D   ++ +DL +TY P F+S V +G+ + +MCAY R +G P C +  LL    
Sbjct: 180 ESLRHEMDIENLSPRDLWETYLPAFKSLVTKGQVAEVMCAYQRFDGQPCCGNSRLLQHIL 239

Query: 286 RRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
           R +W F G + SDC A+S  +     G AK   +A      AG DV CGS      + AV
Sbjct: 240 RDEWNFKGLVVSDCGAISDFWIPGRHGVAKDAVEASAQAQSAGTDVECGSNYHS-LEEAV 298

Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
           K   L ES+IDR++  +   R+ LG  + +  + P+  I   VV  PAH+ LAL+ A + 
Sbjct: 299 KAGVLKESDIDRSVIRVLEARLALGDISPSAVV-PWKTIPYAVVDCPAHRQLALRMAHES 357

Query: 404 IVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV 463
           +VLL+N H  LPL  S    + ++G NA  +  L  NY G    ++T L  ++   E+  
Sbjct: 358 MVLLQNRHHTLPL--STHDKILVVGENAVDSMMLWANYNGIPSHTVTVLDGIRQLAEHVE 415

Query: 464 YYPGC 468
           + PGC
Sbjct: 416 FMPGC 420


>gi|300726322|ref|ZP_07059774.1| beta-xylosidase B [Prevotella bryantii B14]
 gi|291292284|gb|ADD92014.1| Xyl3A [Prevotella bryantii B14]
 gi|299776347|gb|EFI72905.1| beta-xylosidase B [Prevotella bryantii B14]
          Length = 885

 Score =  274 bits (701), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 160/444 (36%), Positives = 235/444 (52%), Gaps = 29/444 (6%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           S+ ++   P+    L   +RA DL  RLTL+EK   + + +PAIPRLGI  + WWSEALH
Sbjct: 41  SDKTSTLLPYQNPNLSAYERAIDLCHRLTLEEKALLMQDESPAIPRLGIKKFFWWSEALH 100

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-- 153
           G A +G            T+FP+ I  A+SF+  L   +  A   E RA Y+     G  
Sbjct: 101 GAANMG----------NVTNFPEPIAMASSFNPTLLKSVFSAASDEMRAQYHHRMDNGGE 150

Query: 154 ------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
                 ++ W PN+NIFRDPRWGRGQET GEDP +T     + V G+QG   +  +    
Sbjct: 151 DEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGCAVVEGLQGPESSKYR---- 206

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            +  AC KHF  +   +   +TR+  +   ++ +DL +TY P F+S V+ G    +MCAY
Sbjct: 207 -KLWACAKHFAVH---SGPESTRHTANLNNISPRDLYETYLPAFQSTVQDGHVREVMCAY 262

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
            R++  P C++  LL +  R +WGF   + SDC AVS I+ +   +     A      AG
Sbjct: 263 QRLDDEPCCSNNRLLQQILREEWGFKYLVVSDCGAVSDIWQSHKTSSDAVHASRQATLAG 322

Query: 327 MDVNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD 385
            DV CG  +       AVK+  L E EID+ +  L   R  LG  + +  ++ + KI   
Sbjct: 323 TDVECGYGYTYAKIPEAVKRGLLTEEEIDKHVIRLLEGRFDLGEMDDSKLVE-WSKIPYS 381

Query: 386 VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS 445
           ++   AH  LAL  A+  IVLL+N   +LPL   K+  +A+IGPNA++   + GNY G  
Sbjct: 382 IMSCKAHAQLALDMARQSIVLLQNKGNILPLQLKKNERIAVIGPNADNKPMMWGNYNGTP 441

Query: 446 CRSITPLQALQNYVENTVYYPGCD 469
             +++ L+ ++   +N VY P CD
Sbjct: 442 NHTVSILEGIRKQYKNVVYLPACD 465



 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 61/115 (53%), Gaps = 15/115 (13%)

Query: 478 IDKAVDIA--KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVA 525
           +D A +IA  KG D V+ + G+  + E EE+          DR D+ +P  Q++ I  +A
Sbjct: 614 VDYAANIAQLKGIDKVIFVGGIAPSLEGEEMPVNIPGFKGGDRTDIEMPQVQRDFIKALA 673

Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           EA K+  I+++ C G   I      +   +I+ A YPG+ G  A+A+++ G  NP
Sbjct: 674 EAGKQ--IILVNCSGSA-IALTPEAQRCQAIIQAWYPGQEGGTAVADILMGKVNP 725


>gi|383113364|ref|ZP_09934136.1| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
 gi|382948729|gb|EFS32368.2| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
          Length = 850

 Score =  274 bits (701), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 162/424 (38%), Positives = 244/424 (57%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SRLT++EKIS L  ++P IPRLGI  Y   +EALHGV   G+        
Sbjct: 33  PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 84

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
              T FPQ I  AA+++  L  R+   I  EARA +N    G+         +TFW+P +
Sbjct: 85  --FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 142

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QGD          L+  +  KHF A  
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDD------PRYLKIVSTPKHFAA-- 194

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P   +  LL
Sbjct: 195 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAWLL 252

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
            K  R+ WGF GY+ SDC   S++ +A  Y K+ E A    ++AG+D+ CG     ++  
Sbjct: 253 KKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGDDVYDEYLL 312

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  + +++ID A  ++ + RM+LGLF+G     P+ +I   V+ S  HQ +AL AA
Sbjct: 313 NAYKQYMVSDADIDSAACHVLTARMKLGLFDGT-ERNPYTRISPSVIGSKEHQQIALDAA 371

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           ++ IVLLKN + +LPL  +K  S+A++G NA   K   G+Y+G P    ++ LQ +++ V
Sbjct: 372 RECIVLLKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSGAPVVDPVSILQGIKDRV 429

Query: 460 ENTV 463
            + V
Sbjct: 430 GDRV 433



 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 59/92 (64%), Gaps = 3/92 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + VV +MG++++ E+E  DR D+ LP  Q+E +  + +     +I+VL+ G  + + +  
Sbjct: 602 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAVNWM- 659

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            D +I +I+ A YPGE G  A+A+V+FGD+NP
Sbjct: 660 -DEHIPAIVNAWYPGEQGGTAVADVLFGDYNP 690


>gi|375145746|ref|YP_005008187.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361059792|gb|AEV98783.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 866

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 161/443 (36%), Positives = 234/443 (52%), Gaps = 31/443 (6%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+    L   +RA+DL++RLTL+EK S + + +PAIPRLGI  + WWSEALHG+A  
Sbjct: 22  QKLPYQNPDLSSEERAKDLITRLTLEEKASLMFDQSPAIPRLGIKKFNWWSEALHGLANN 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAI 152
            K           T FP+ +  AASFD  L Y+I  A   E RA Y+          + +
Sbjct: 82  DK----------VTVFPEPVGMAASFDDTLVYKIFDATSDEVRAKYHDAIRNGKENRRFL 131

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
            ++ W PNINIFRDPRWGRGQET GEDP +T +  V+ V+G+QG      + +  L   A
Sbjct: 132 SLSVWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPA--DARYRKLL---A 186

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
           C KHF  +    W   T   +  R   +DL +TY P F+S V +     +MCAY R++  
Sbjct: 187 CAKHFAVHSGPEWSRHTLNLYQVRP--RDLYETYLPAFKSLVMEADVRQVMCAYQRLDDE 244

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC- 331
           P C +  LL    R  WGF   + SDC A++  + +   +     A      AG DV C 
Sbjct: 245 PCCGNNRLLQTILRDNWGFKHVVVSDCGAITDFFTSHKVSSDAVHASAKGALAGTDVECV 304

Query: 332 -GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
              +  K+   AV +  + E EID+ L  + + R  LG  + +  + P+  I   VV + 
Sbjct: 305 WEGYAFKNLPEAVSRGLITEKEIDKHLLRVLAGRFDLGEMDDDAVV-PWAAIPMSVVNNE 363

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
           AH+ LAL  AQ+ + LL+N + +LPL K  +  +A+IGPNA++   L GNY G   R+I 
Sbjct: 364 AHRKLALDMAQESMTLLQNRNNILPL-KRNAKKIAVIGPNADNDPVLWGNYNGRPVRTIN 422

Query: 451 PLQALQNYV--ENTVYYPGCDTV 471
            +  ++  +  +  +Y   CD V
Sbjct: 423 IVSGIRAKMPPDGVLYDKACDLV 445



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 54/105 (51%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           KG D VV + GL    E EE+          DR D+ LP  Q+  +  +  A KK  ++ 
Sbjct: 602 KGIDEVVFVGGLSTLLEGEEMPVSYPGFKGGDRTDIQLPEVQRNCLKALKAAGKK--VIF 659

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + C G   I F     +  +IL A Y GE+G  A+A+V+FGD+NP
Sbjct: 660 VNCSGSA-IGFVPETESCDAILQAWYGGESGGQAVADVLFGDYNP 703


>gi|365121914|ref|ZP_09338824.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643627|gb|EHL82934.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1073

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 165/430 (38%), Positives = 230/430 (53%), Gaps = 45/430 (10%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
            +PF  TTL   +R +DL+SRL + EKIS L  ++PAIPRLGI  Y   +EALHGV   G
Sbjct: 26  NYPFRDTTLSHHERIKDLLSRLNVSEKISLLRATSPAIPRLGIDKYYHGNEALHGVVRPG 85

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAI 152
           K           T FPQ I  A+ ++      +  AI  EAR  +N         AG + 
Sbjct: 86  K----------FTVFPQAIGLASMWNPDFLQEVSTAISDEARGRWNELNQGKDQTAGASD 135

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
            +TFW+P IN+ RDPRWGR  ET GEDP +TG    ++VRG+QG+          ++  +
Sbjct: 136 LLTFWSPTINMARDPRWGRTPETYGEDPFLTGTLGTAFVRGLQGND------PKYIKVVS 189

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
             KHF A    N +   R   +A ++ +DL + Y P FE C+K+G+A  +M AYN VNGI
Sbjct: 190 TPKHFAA----NNEEHNRASGNAVISERDLREYYFPAFEKCIKEGQAQSVMSAYNAVNGI 245

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
           P   ++ LL+   R  WGF GY+ SDC A   I     Y  + E+A    +KAG+D+ CG
Sbjct: 246 PCTLNKWLLTDVLRDDWGFDGYVVSDCSAPEYIVSQHHYVDTYEEAASLCIKAGLDLECG 305

Query: 333 S------FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
                   L  + +  V       SEID A + +   RMRLGLF+ +P   P+ KI   +
Sbjct: 306 DNVYITPLLNAYNRGMVTM-----SEIDSAAYRVLRGRMRLGLFD-DPNENPYNKISPSI 359

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           V    H+ LAL+AA+  +VLLKN   +LP+      S+A++G NA  A    G+Y+G   
Sbjct: 360 VGCEKHRELALEAARQSLVLLKNDKDMLPIQTDNIKSIAVVGINA--ANCEFGDYSGTPV 417

Query: 447 RSITPLQALQ 456
              TP+  L+
Sbjct: 418 N--TPISVLE 425



 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/116 (38%), Positives = 68/116 (58%), Gaps = 4/116 (3%)

Query: 466 PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVA 525
           P  D+ +   A  D A +I +G+D  + ++G+D+T E+E  DR  + LP  QQ  I   A
Sbjct: 722 PSTDSESLLDAYGD-AGEIIRGSDLTIAVLGIDRTIEREGQDRSTIELPEDQQIFIEE-A 779

Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             A    ++VL+ G  + I +   D+NI ++L A YPGE G  A+AE +FGD+NPG
Sbjct: 780 YKANPNTVVVLVAGSSLAINWI--DQNIPAVLDAWYPGEQGGTAVAEALFGDYNPG 833


>gi|423240769|ref|ZP_17221883.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
           CL03T12C01]
 gi|392643731|gb|EIY37480.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
           CL03T12C01]
          Length = 864

 Score =  273 bits (699), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 163/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  + L   +RA DL+ +LTL+EK++ +++++  + RLGI  Y WW+EALHGVA  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I  AASF+    + I  A+  EARA   A  A G       +T W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T    V+ V+G+Q    N    K      AC KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK----IHACAKHF 189

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY  PFE+ VK+G+   +MCAYNR+ G P C 
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCG 246

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R++WG+ G + SDC A+   Y  +G+   P  E A    + +G D+ CGS 
Sbjct: 247 SDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPNAESASAAAVLSGTDLECGSS 306

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   ++A K+  + E +ID ++  L   R  LG  + +P+   + KI   VVCS  H  
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDS 364

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           L+L  A+  + LL N + +LPL K    ++A++GPNAN +    GNY G    +IT L+ 
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423

Query: 455 LQNYV---ENTVYYPGCDTVACS 474
           +++ +   +  +Y  GC  V  S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446



 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 59/114 (51%), Gaps = 13/114 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           +I   V   K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           A KK VI V   G P  I      +   +IL A YPG++G  A AEV+FGD+NP
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|423230604|ref|ZP_17217008.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
           CL02T00C15]
 gi|423244313|ref|ZP_17225388.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
           CL02T12C06]
 gi|392630748|gb|EIY24734.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
           CL02T00C15]
 gi|392642494|gb|EIY36260.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
           CL02T12C06]
          Length = 864

 Score =  273 bits (699), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 163/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  + L   +RA DL+ +LTL+EK++ +++++  + RLGI  Y WW+EALHGVA  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I  AASF+    + I  A+  EARA   A  A G       +T W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T    V+ V+G+Q    N    K      AC KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK----IHACAKHF 189

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY  PFE+ VK+G+   +MCAYNR+ G P C 
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCG 246

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R++WG+ G + SDC A+   Y  +G+   P  E A    + +G D+ CGS 
Sbjct: 247 SDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   ++A K+  + E +ID ++  L   R  LG  + +P+   + KI   VVCS  H  
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDS 364

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           L+L  A+  + LL N + +LPL K    ++A++GPNAN +    GNY G    +IT L+ 
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423

Query: 455 LQNYV---ENTVYYPGCDTVACS 474
           +++ +   +  +Y  GC  V  S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446



 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 59/114 (51%), Gaps = 13/114 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           +I   V   K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           A KK VI V   G P  I      +   +IL A YPG++G  A AEV+FGD+NP
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|237709184|ref|ZP_04539665.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|229456880|gb|EEO62601.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
          Length = 864

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 163/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  + L   +RA DL+ +LTL+EK++ +++++  + RLGI  Y WW+EALHGVA  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I  AASF+    + I  A+  EARA   A  A G       +T W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T    V+ V+G+Q    N    K      AC KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK----IHACAKHF 189

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY  PFE+ VK+G+   +MCAYNR+ G P C 
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCG 246

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R++WG+ G + SDC A+   Y  +G+   P  E A    + +G D+ CGS 
Sbjct: 247 SDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   ++A K+  + E +ID ++  L   R  LG  + +P+   + KI   VVCS  H  
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDS 364

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           L+L  A+  + LL N + +LPL K    ++A++GPNAN +    GNY G    +IT L+ 
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423

Query: 455 LQNYV---ENTVYYPGCDTVACS 474
           +++ +   +  +Y  GC  V  S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446



 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 59/114 (51%), Gaps = 13/114 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           +I   V   K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           A KK VI V   G P  I      +   +IL A YPG++G  A AEV+FGD+NP
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|345514226|ref|ZP_08793739.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|229437207|gb|EEO47284.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
          Length = 864

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 163/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  + L   +RA DL+ +LTL+EK++ +++++  + RLGI  Y WW+EALHGVA  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I  AASF+    + I  A+  EARA   A  A G       +T W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T    V+ V+G+Q    N    K      AC KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK----IHACAKHF 189

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY  PFE+ VK+G+   +MCAYNR+ G P C 
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCG 246

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R++WG+ G + SDC A+   Y  +G+   P  E A    + +G D+ CGS 
Sbjct: 247 SDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   ++A K+  + E +ID ++  L   R  LG  + +P+   + KI   VVCS  H  
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDS 364

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           L+L  A+  + LL N + +LPL K    ++A++GPNAN +    GNY G    +IT L+ 
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423

Query: 455 LQNYV---ENTVYYPGCDTVACS 474
           +++ +   +  +Y  GC  V  S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446



 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 59/114 (51%), Gaps = 13/114 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           +I   V   K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           A KK VI V   G P  I      +   +IL A YPG++G  A AEV+FGD+NP
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|291292288|gb|ADD92016.1| Xyl3C [Prevotella bryantii B14]
          Length = 857

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 166/438 (37%), Positives = 240/438 (54%), Gaps = 33/438 (7%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+   +L   QRA DL SRLTL+EK   + N +PAI RL IPA+EWWSEALHG A  
Sbjct: 22  QHLPYQNPSLSAEQRAEDLCSRLTLEEKCKLMQNGSPAIKRLNIPAFEWWSEALHGTARN 81

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLE-------ARALYNAGQAIG 153
           G           AT FP     AAS++  L  +I  AIG E       AR   N  +  G
Sbjct: 82  G----------FATVFPNTTGMAASWNDQLLLQIFSAIGNESRIKNTLARKSGNIKRYQG 131

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           ++ W PNINIFRDPRWGRGQET GEDP +TGK  ++ V G+QG      K     +  AC
Sbjct: 132 LSIWTPNINIFRDPRWGRGQETYGEDPYLTGKMGLAVVEGLQGP-----KNSKYYKLLAC 186

Query: 214 CKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
            KHF  +    +    R+ F+   +  +DL +TY P F++ +++G  + +MCAY+ ++G+
Sbjct: 187 AKHFAVHSGPEY---LRHSFNIENLPARDLWETYLPAFKTLIQEGNVAEVMCAYHSMDGL 243

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVS--IIYDAEGYAKSPEDAVVDVLKAGMDVN 330
           P C     L +  R+  GF G + SDC A+    I      A+    A    + AG DV 
Sbjct: 244 PCCGSNKYLQQILRQDLGFKGMVVSDCGAIGDFWIQGRHEVAQDAAQASAQAVLAGTDVE 303

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
           CG+   K    AVK+ ++ E +I+ ++  L   R +LG F+ +  M  + ++   ++   
Sbjct: 304 CGANYDK-LPEAVKRGEISEEKINVSVMRLLKARFKLGDFDSD-NMVEWTQLPESLIACS 361

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            H+ LA Q AQ+ + LLKN +G+LPL   K+  +A++G NAN +  L GNY G   ++I+
Sbjct: 362 KHKQLAYQMAQESMTLLKN-NGILPL--QKNARIAVMGANANDSIMLWGNYNGYPTKTIS 418

Query: 451 PLQALQNYVENTVYYPGC 468
            L+ LQN  ++  Y PGC
Sbjct: 419 ILEGLQNKSKHISYIPGC 436



 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 52/103 (50%), Gaps = 13/103 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           A  V+ + G+    E EE+          DR  + LP  Q+++I  +  + KK VI V  
Sbjct: 600 AQVVIYVGGISPRLEGEEMKVNELGFKGGDRTTIELPQSQRDMIALLHNSGKK-VIFVNC 658

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            GG   I      RN  +IL A Y GE G  A+A+V+FGD+NP
Sbjct: 659 SGGA--IALEPESRNADAILQAWYGGEMGGQAVADVLFGDYNP 699


>gi|167646366|ref|YP_001684029.1| beta-glucosidase [Caulobacter sp. K31]
 gi|167348796|gb|ABZ71531.1| Beta-glucosidase [Caulobacter sp. K31]
          Length = 898

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 165/403 (40%), Positives = 230/403 (57%), Gaps = 40/403 (9%)

Query: 56  ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
           A +LV ++TLDEK+ QL+N+APAIPRLG+PAY WW+E+LHG  G              T+
Sbjct: 46  AAELVGKMTLDEKLEQLLNTAPAIPRLGVPAYNWWTESLHGALGALP----------TTN 95

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--------MTFWAPNINIFRDP 167
           FP+ I  AA+FD+ L + +  AIG E + L+   +A G        +  W+PNINIFRDP
Sbjct: 96  FPEPIGLAATFDAPLVHDVAGAIGAEMQDLHALARATGRMGRIGTGLNTWSPNINIFRDP 155

Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           RWGRGQET GEDP +T +  V++V G+QGD  +  ++       A  KHF  +   +   
Sbjct: 156 RWGRGQETYGEDPFLTARMGVAFVEGIQGDDPDHPRI------IATPKHFAVH---SGPE 206

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
           +TR+  +  V+ +DL DTY P F + V +GRA  IMCAYNR++G P+CA   LL +  R 
Sbjct: 207 STRHGANVFVSRRDLEDTYLPAFRAAVVEGRAGSIMCAYNRIDGQPACASDLLLKEHLRG 266

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF-------LQKHTK 340
            W F GY+ SDCDAV  I D   YA     AV   L+AG+D  C          L    +
Sbjct: 267 AWKFDGYVVSDCDAVKDISDHHKYAPEAASAVAAALRAGVDNECNGATLTDTDGLAGRYR 326

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI-GADVVCSPAHQVLALQA 399
            A+ +  +  ++ID AL  LFS R R G           G++ G  VV +  H+ LAL A
Sbjct: 327 EALDRGLISTAQIDTALVRLFSARFRNGDLPAKGGSD--GRLAGPSVVTTHEHEALALAA 384

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA 442
           ++  +VLLKN  G+LPL     + +A+IGP  ++ + L GNY+
Sbjct: 385 SEKSLVLLKND-GVLPL--KPGLRIAVIGPLGDATRVLRGNYS 424



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 57/102 (55%), Gaps = 13/102 (12%)

Query: 489 DHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
           D +V ++GL    E EE           D+  L LP  Q+ ++ + A+A  KP+I+V + 
Sbjct: 628 DVLVAVVGLTSDMEAEEAPIEIPGFKGGDKTTLDLPADQRAMLEQ-AKALGKPLIVVAMN 686

Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           G P++  +AK   N  ++L A YPG++G +A+A V+ G  NP
Sbjct: 687 GSPLNFAWAK--DNASALLEAWYPGQSGGLAIANVLTGKTNP 726


>gi|219118959|ref|XP_002180246.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408503|gb|EEC48437.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 682

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 191/575 (33%), Positives = 285/575 (49%), Gaps = 55/575 (9%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL---------VNSAPAIPRLGIPAY 87
           N   +  P+C  +L I +R  DL+S LTLDEK+  +         +     + R+G+P Y
Sbjct: 64  NDVAKALPYCDMSLSIDERLEDLLSHLTLDEKVDMIGADPTQDVCMTHTMNVSRIGLPDY 123

Query: 88  EWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN 147
            W  E       VG         + AT F   +  AASF+   W+  G   G E RAL N
Sbjct: 124 YWLVET---NTAVGSACIAEN--KCATEFSGPLSIAASFNRSSWFLKGSVFGTEQRALMN 178

Query: 148 ---------AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDT 198
                    +G+ IG+T + PNIN  RDPR+GR  E PGEDP ++G+YA   V+G+Q   
Sbjct: 179 VHGERFHTHSGRHIGLTAFGPNINQQRDPRFGRSSELPGEDPFLSGQYAAHMVQGMQERD 238

Query: 199 FNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
            NG       +  A  KHFTAY  +  +G   Y     ++M DL DTY P +E  + QG 
Sbjct: 239 ANGYP-----KVLAYLKHFTAYSREEGRGNDDYN----ISMYDLFDTYLPQYEMGMVQGG 289

Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWG-FHGYITSDCDAVSIIYDAEGYAKSPED 317
           A+G+MC+YN VNGIP+CA+  LL+K  R++W     ++T+DC AV+ +      A     
Sbjct: 290 ATGVMCSYNAVNGIPACANDYLLNKILRQRWNRSDAHVTTDCGAVNNLRGKPIQAADEAQ 349

Query: 318 AVVDVLKAGMDVNCGSFLQKHT-KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
           A    L  G D+  GS L  H    A+      E  +++A+   +      G F+ +PT+
Sbjct: 350 AAAMALMNGADIEMGSTLFVHNLTTAITLGYATEEAVNQAIRRSYRPHFIAGRFD-DPTL 408

Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKT 436
             +  +G D + S  HQ + L+AA  G+VLLK+   +LP+  +    LA++GP   +   
Sbjct: 409 SEWFSLGLDDIQSKKHQEIQLEAALQGLVLLKHEDSILPI--AAGTKLAVLGPLGMTRSG 466

Query: 437 LLGNY--------AGPSC-RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKG 487
           L+ +Y         G  C  ++       N  E TV   G D  + +++ +++ + +A  
Sbjct: 467 LMSDYESDQSCFGGGHDCIPTLAESIGFINGKEFTVAAAGVDVDSRNTSDVERILQLAAD 526

Query: 488 ADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFA 547
            D +VL +G  +TQE+E  DR D  LPG+Q  L   V    +KPV+LVL+ GG + +   
Sbjct: 527 RDLIVLCLGNTKTQEQEGFDRKDTALPGQQYALFEAVL-TLRKPVVLVLVNGGQIAL--- 582

Query: 548 KYDRNIG---SILWAGYPGEAGAVALAEVIFGDHN 579
             D   G   +I+ A  P   G  ALA  +FG  N
Sbjct: 583 --DGMTGYPSAIIEAFNPNGIGGTALAASLFGQEN 615


>gi|393781366|ref|ZP_10369565.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676859|gb|EIY70281.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
           CL02T12C01]
          Length = 854

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 166/442 (37%), Positives = 243/442 (54%), Gaps = 37/442 (8%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +     P  +R  DL+S+LT++EKIS L  ++P IPRL I  Y   +EALHGV   G   
Sbjct: 28  YLDMNAPQHERILDLLSKLTIEEKISLLRATSPGIPRLQIDKYYHGNEALHGVVRPG--- 84

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN----AGQAIG-----MT 155
                    T FPQ I  AA ++  L   I  AI  EARA +N      + +G     +T
Sbjct: 85  -------NFTVFPQAIGLAAMWNPQLLNEISTAISDEARARWNELEQGKKQLGQFSDLLT 137

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW+P +N+ RDPRWGR  ET GEDP ++GK  VS+V+G+QGD          L+  +  K
Sbjct: 138 FWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQGDD------PRYLKIVSTPK 191

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           HF A    N +   R++ +  ++ +DL + Y P FE C+ +G+A+ IM AYN +N +P  
Sbjct: 192 HFAA----NNEEHNRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAINDVPCT 247

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SF 334
            +  LL K  R  WGF GY+ SDC A   +     Y K+ E A    ++AG+D+ CG + 
Sbjct: 248 LNNWLLKKVLRHDWGFDGYVVSDCGAPDFLVTHHKYVKTLEAAATLSIQAGLDLECGDNV 307

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             +    A KQ  + E+EID A +++   RMRLGLF+ +P + P+ KI   VV    H  
Sbjct: 308 YMEPLLNAYKQYMVTEAEIDSAAYHILRARMRLGLFD-DPNLNPYNKISPSVVGCEKHSQ 366

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
           LAL+AA+  IVLLKN    LPL   K  S+A++G NA + +   G+Y+G P  + ++ L+
Sbjct: 367 LALEAARQSIVLLKNEKKFLPLDLKKIKSIAVVGINAGNCE--FGDYSGTPVNQPVSILE 424

Query: 454 ALQNYVENTV---YYPGCDTVA 472
            ++  V + +   Y P   +V+
Sbjct: 425 GIKKRVGDQIEVMYSPWTSSVS 446



 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 59/101 (58%), Gaps = 3/101 (2%)

Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
           A D  +  D  + ++G++++ E+E  DR  + LP  QQ  I    +     V+ VL+ G 
Sbjct: 595 AGDAMRKCDLTIAVVGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVV-VLVAGS 653

Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            + I +   D +I +I+ A YPGEAG  A+AEV+FGD+NPG
Sbjct: 654 SLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPG 692


>gi|383649536|ref|ZP_09959942.1| beta-glucosidase [Sphingomonas elodea ATCC 31461]
          Length = 853

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 165/427 (38%), Positives = 237/427 (55%), Gaps = 32/427 (7%)

Query: 60  VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
           ++++T+++K +QL ++APA P +G+PAY+WW+E LHG+A  G           AT FPQ 
Sbjct: 20  IAKMTIEQKAAQLQSTAPADPVIGLPAYDWWNEGLHGLARDGY----------ATVFPQA 69

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRDPRWGRG 172
           I  AA++D  L +++G  +  EARA +NA           G+T W+PNINIFRDPRWGRG
Sbjct: 70  IGLAATWDVPLLHKVGDTVATEARAKFNAKPVTADRKIYEGLTIWSPNINIFRDPRWGRG 129

Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
           QET GEDP +TG  AV +++G+QG            +  A  KH   +         R  
Sbjct: 130 QETYGEDPFLTGHLAVGFIQGLQGPD------PAHPKVLATPKHLAVHSGPE---AGRDG 180

Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
           FD   + QDL  TY P F   + +G+A  +MCAYN ++G P+CA   LL+   R+ WGF 
Sbjct: 181 FDVDPSPQDLESTYTPAFRLALTEGKAQSVMCAYNSIHGTPACASGGLLNDRLRKDWGFT 240

Query: 293 GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESE 352
           G I SDCDAV+ I+    Y     +A    +KAGMD+NCG+      + AVK+  + E+E
Sbjct: 241 GLIVSDCDAVANIHLFHHYRLDAAEASAAAIKAGMDLNCGTTYAALPE-AVKRGLVSEAE 299

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
           +D AL      R  LG+  G     P+ +I      +PA + LAL+AA+  IVLLKN   
Sbjct: 300 VDVALKRALDARRALGIAFGG--ANPWSRIKPGERGTPAQRALALEAARKAIVLLKNDGD 357

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL-QNYVENTVYYPGCDTV 471
            LPL  +    +A+IG NA+    L GNY G +   +TPL  + + +  + V Y     +
Sbjct: 358 RLPL--APGGRIAVIGANADDLGVLEGNYHGTARDPVTPLDGIRRQFGADRVVYAQGAQL 415

Query: 472 ACSSASI 478
           A ++A I
Sbjct: 416 AEAAAVI 422



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 51/102 (50%), Gaps = 15/102 (14%)

Query: 489 DHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
           D +V ++GL    E E L          DR D+ LP  Q EL+ +      KP++LVL  
Sbjct: 580 DTIVAVLGLSPDLEGEALSVSIPGFVGGDRTDIALPRPQLELL-KALRKTGKPLVLVLTS 638

Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           G  V +  +  D    +IL A YPGE G  A+AE + G +NP
Sbjct: 639 GSAVAVDPSLAD----AILEAWYPGEEGGTAIAETLAGKNNP 676


>gi|265752711|ref|ZP_06088280.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           3_1_33FAA]
 gi|263235897|gb|EEZ21392.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           3_1_33FAA]
          Length = 864

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 163/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  + L   +RA DL+ +LTL+EK++ +++++  + RLGI  Y WW+EALHGVA  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I  AASF+    + I  A+  EARA   A  A G       +T W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T    V+ V+G+Q    N    K      AC KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK----IHACAKHF 189

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY  PFE+ VK+G+   +MCAYNR+ G P C 
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCG 246

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R++WG+ G + SDC A+   Y  +G+   P  E A    + +G D+ CGS 
Sbjct: 247 SDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   ++A K+  + E +ID ++  L   R  LG  + +P+   + KI   VVCS  H  
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDS 364

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           L+L  A+  + LL N + +LPL K    ++A++GPNAN +    GNY G    +IT L+ 
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423

Query: 455 LQNYV---ENTVYYPGCDTVACS 474
           +++ +   +  +Y  GC  V  S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446



 Score = 65.5 bits (158), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 59/114 (51%), Gaps = 13/114 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           +I   V   K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           A KK VI V   G P  I      +   +IL A YPG++G  A AEV+FGD+NP
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETQYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|212692496|ref|ZP_03300624.1| hypothetical protein BACDOR_01992 [Bacteroides dorei DSM 17855]
 gi|212664971|gb|EEB25543.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           dorei DSM 17855]
          Length = 864

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 163/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  + L   +RA DL+ +LTL+EK++ +++++  + RLGI  Y WW+EALHGVA  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I  AASF+    + I  A+  EARA   A  A G       +T W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T    V+ V+G+Q    N    K      AC KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK----IHACAKHF 189

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY  PFE+ VK+G+   +MCAYNR+ G P C 
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDPCCG 246

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R++WG+ G + SDC A+   Y  +G+   P  E A    + +G D+ CGS 
Sbjct: 247 SDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   ++A K+  + E +ID ++  L   R  LG  + +P+   + KI   VVCS  H  
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPSKVEWTKIPYSVVCSAEHDS 364

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           L+L  A+  + LL N + +LPL K    ++A++GPNAN +    GNY G    +IT L+ 
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423

Query: 455 LQNYV---ENTVYYPGCDTVACS 474
           +++ +   +  +Y  GC  V  S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446



 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 59/114 (51%), Gaps = 13/114 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           +I   V   K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           A KK VI V   G P  I      +   +IL A YPG++G  A AEV+FGD+NP
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|295132888|ref|YP_003583564.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294980903|gb|ADF51368.1| beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 855

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 160/443 (36%), Positives = 237/443 (53%), Gaps = 37/443 (8%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+    L   +RA DLV+RLTL+EK S + + + AIPRLGI  + WWSEALHG A     
Sbjct: 13  PYQNPNLSPEERAEDLVNRLTLEEKASLMFDVSEAIPRLGIKKFNWWSEALHGFAN---- 68

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMT 155
              N  +   T FP+ +  AASFD  L Y++  A   E RA Y+          + + ++
Sbjct: 69  ---NDDV---TVFPEPVGMAASFDDELVYQVFDATSDEVRAKYHEALRNGEENKRFLSLS 122

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
            W PN+NIFRDPRWGRGQET GEDP +T +  V  V+G+QG      K K  L   AC K
Sbjct: 123 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGP--EDAKYKKLL---ACAK 177

Query: 216 HFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           H+  +    W   +R++ +   V+ +DL +TY P F+  V+      +MCAY R++  P 
Sbjct: 178 HYAVHSGPEW---SRHELNLNNVSQRDLWETYLPAFKVLVQDANVRQVMCAYQRLDDEPC 234

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF 334
           C    LL +  R +WGF   + SDC A+   Y +   +     A    + AG DV C   
Sbjct: 235 CGSDRLLQQILREKWGFEHLVVSDCGAIQDFYTSHNVSSDAVHAAAKAVLAGTDVECQ-- 292

Query: 335 LQKHT----KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
             KH       AV++  + E +IDR++  +   R  LG  + +  + P+ +I A V+ + 
Sbjct: 293 WDKHNYKLLPEAVEKGLVKEEDIDRSVKRVLIGRFELGEMDPDEIV-PYAQIPASVINNE 351

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            H+ LAL+ A++ + LL+N + +LPL K +   +A+IGPNA+    L GNY G   R+I+
Sbjct: 352 EHRQLALKMARESMTLLQNKNNILPLSKGQD-RIAVIGPNADDEPMLWGNYNGTPVRTIS 410

Query: 451 PLQALQNYV--ENTVYYPGCDTV 471
            L  + + +  ++ VY   CD V
Sbjct: 411 ILDGITSKIGEKSIVYDKACDLV 433



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 53/105 (50%), Gaps = 13/105 (12%)

Query: 486 KGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           KG + V+ + GL    E EE+          DR D+ LP  Q+  +  + +A KK VI V
Sbjct: 590 KGIETVIFVGGLSTKLEGEEMPVSYPGFKGGDRTDIALPSVQRNCLKTLKDAGKK-VIFV 648

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              G    I       +  +IL A Y GE+G  A+A+V+FGD+NP
Sbjct: 649 NNSGSA--IGLVPETTSCDAILQAWYGGESGGQAVADVLFGDYNP 691


>gi|299149391|ref|ZP_07042448.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298512578|gb|EFI36470.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 853

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 160/418 (38%), Positives = 241/418 (57%), Gaps = 34/418 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SRLT++EKIS L  ++P IPRLGI  Y   +EALHGV   G+        
Sbjct: 36  PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 87

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
              T FPQ I  AA+++  L  R+   I  EARA +N    G+         +TFW+P +
Sbjct: 88  --FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QGD          L+  +  KHF A  
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDD------PRYLKIVSTPKHFAA-- 197

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P   +  LL
Sbjct: 198 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAWLL 255

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
            K  R+ WGF GY+ SDC   S++ +A  Y K+ E A    ++AG+D+ CG     ++  
Sbjct: 256 KKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGDDVYDEYLL 315

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  + +++ID A  ++ + RM+LGLF+G     P+ +I   V+ S  HQ +AL AA
Sbjct: 316 NAYKQYMVSDADIDSAACHVLTARMKLGLFDGT-ERNPYTRISPSVIGSKEHQQIALDAA 374

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQN 457
           ++ IVLLKN + +LPL  +K  S+A++G NA   K   G+Y+G P    ++ LQ +++
Sbjct: 375 RECIVLLKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSGAPVVDPVSILQGIKD 430



 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 59/92 (64%), Gaps = 3/92 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + VV +MG++++ E+E  DR D+ LP  Q+E +  + +     +I+VL+ G  + + +  
Sbjct: 605 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAVNWM- 662

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            D +I +I+ A YPGE G  A+A+V+FGD+NP
Sbjct: 663 -DEHIPAIVNAWYPGEQGGTAVADVLFGDYNP 693


>gi|300727409|ref|ZP_07060818.1| beta-xylosidase B [Prevotella bryantii B14]
 gi|299775289|gb|EFI71888.1| beta-xylosidase B [Prevotella bryantii B14]
          Length = 841

 Score =  273 bits (697), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 166/438 (37%), Positives = 240/438 (54%), Gaps = 33/438 (7%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +  P+   +L   QRA DL SRLTL+EK   + N +PAI RL IPA+EWWSEALHG A  
Sbjct: 6   QHLPYQNPSLSAEQRAEDLCSRLTLEEKCKLMQNGSPAIKRLNIPAFEWWSEALHGTARN 65

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLE-------ARALYNAGQAIG 153
           G           AT FP     AAS++  L  +I  AIG E       AR   N  +  G
Sbjct: 66  G----------FATVFPNTTGMAASWNDQLLLQIFSAIGNESRIKNTLARKSGNIKRYQG 115

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           ++ W PNINIFRDPRWGRGQET GEDP +TGK  ++ V G+QG      K     +  AC
Sbjct: 116 LSIWTPNINIFRDPRWGRGQETYGEDPYLTGKMGLAVVEGLQGP-----KNSKYYKLLAC 170

Query: 214 CKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
            KHF  +    +    R+ F+   +  +DL +TY P F++ +++G  + +MCAY+ ++G+
Sbjct: 171 AKHFAVHSGPEY---LRHSFNIENLPARDLWETYLPAFKTLIQEGNVAEVMCAYHSMDGL 227

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVS--IIYDAEGYAKSPEDAVVDVLKAGMDVN 330
           P C     L +  R+  GF G + SDC A+    I      A+    A    + AG DV 
Sbjct: 228 PCCGSNKYLQQILRQDLGFKGMVVSDCGAIGDFWIQGRHEVAQDAAQASAQAVLAGTDVE 287

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
           CG+   K    AVK+ ++ E +I+ ++  L   R +LG F+ +  M  + ++   ++   
Sbjct: 288 CGANYDK-LPEAVKRGEISEEKINVSVMRLLKARFKLGDFDSD-NMVEWTQLPESLIACS 345

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
            H+ LA Q AQ+ + LLKN+ G+LPL   K+  +A++G NAN +  L GNY G   ++I+
Sbjct: 346 KHKQLAYQMAQESMTLLKNN-GILPL--QKNARIAVMGANANDSIMLWGNYNGYPTKTIS 402

Query: 451 PLQALQNYVENTVYYPGC 468
            L+ LQN  ++  Y PGC
Sbjct: 403 ILEGLQNKSKHISYIPGC 420



 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 52/103 (50%), Gaps = 13/103 (12%)

Query: 488 ADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           A  V+ + G+    E EE+          DR  + LP  Q+++I  +  + KK VI V  
Sbjct: 584 AQVVIYVGGISPRLEGEEMKVNELGFKGGDRTTIELPQSQRDMIALLHNSGKK-VIFVNC 642

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            GG   I      RN  +IL A Y GE G  A+A+V+FGD+NP
Sbjct: 643 SGGA--IALEPESRNADAILQAWYGGEMGGQAVADVLFGDYNP 683


>gi|336417083|ref|ZP_08597412.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
           3_8_47FAA]
 gi|335936708|gb|EGM98626.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
           3_8_47FAA]
          Length = 850

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 160/418 (38%), Positives = 241/418 (57%), Gaps = 34/418 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SRLT++EKIS L  ++P IPRLGI  Y   +EALHGV   G+        
Sbjct: 33  PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 84

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
              T FPQ I  AA+++  L  R+   I  EARA +N    G+         +TFW+P +
Sbjct: 85  --FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 142

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QGD          L+  +  KHF A  
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDD------PRYLKIVSTPKHFAA-- 194

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P   +  LL
Sbjct: 195 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNDVPCTLNAWLL 252

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
            K  R+ WGF GY+ SDC   S++ +A  Y K+ E A    ++AG+D+ CG     ++  
Sbjct: 253 KKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIQAGLDLECGDDVYDEYLL 312

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  + +++ID A  ++ + RM+LGLF+G     P+ +I   V+ S  HQ +AL AA
Sbjct: 313 NAYKQYMVSDADIDSAACHVLTARMKLGLFDGT-ERNPYTRISPSVIGSKEHQQIALDAA 371

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQN 457
           ++ IVLLKN + +LPL  +K  S+A++G NA   K   G+Y+G P    ++ LQ +++
Sbjct: 372 RECIVLLKNKNNMLPLNVNKVKSIAVVGINA--GKCEFGDYSGAPVVDPVSILQGIKD 427



 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 59/92 (64%), Gaps = 3/92 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + VV +MG++++ E+E  DR D+ LP  Q+E +  + +     +I+VL+ G  + + +  
Sbjct: 602 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAVNWM- 659

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            D +I +I+ A YPGE G  A+A+V+FGD+NP
Sbjct: 660 -DEHIPAIVNAWYPGEQGGTAVADVLFGDYNP 690


>gi|393719789|ref|ZP_10339716.1| glycosyl hydrolase family protein [Sphingomonas echinoides ATCC
           14820]
          Length = 896

 Score =  272 bits (696), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 168/419 (40%), Positives = 232/419 (55%), Gaps = 47/419 (11%)

Query: 56  ARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATS 115
           A  LV+++T +EK+ QL+N APAIPRL IPAY WW+E+LHG  G              T+
Sbjct: 46  AATLVAQMTTEEKLPQLLNVAPAIPRLKIPAYNWWTESLHGALGPIP----------TTN 95

Query: 116 FPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDP 167
           FP+ I  AASFD+ L +++  AI  E RAL+  G+          G+  W+PNINIFRDP
Sbjct: 96  FPEPIGLAASFDAPLVHQVADAISTEVRALHTQGRQTGKLGRIGTGLDTWSPNINIFRDP 155

Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           RWGRGQET GEDP +  +  V++V G+QG   +  ++       +  KHF A+   +   
Sbjct: 156 RWGRGQETYGEDPYLAARMGVAFVTGMQGPNPDLPRVI------STPKHFAAH---SGPE 206

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
           +TR+  +  V+  DL DTY P F + + +GRA  IMCAYNR+ G P+CA   LL    R 
Sbjct: 207 STRHAANVYVSPHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIEGQPACASDLLLKDHLRG 266

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-------FLQKHTK 340
            WGF GY+ SDCDAV  I D   YA     AV   +KAG+D  C +        L     
Sbjct: 267 AWGFTGYVVSDCDAVKDIADNHKYAPDQATAVAAAMKAGVDNECNTQTIGDIGGLPDRFG 326

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLF---NGNPTMQPFGKIGADVVCSPAHQVLAL 397
            A+K+  + + +IDR L  LFS R+R G      G P + P   I      +P H  LAL
Sbjct: 327 DALKRNLISQGDIDRTLVRLFSARLRNGDLPGVAGAPAVVPVSAI-----LTPDHIALAL 381

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
            AA+  +VLLKN  G+LPL       +A++GP  ++ + L GNY+  S +S  P+  ++
Sbjct: 382 DAAEKSLVLLKND-GVLPL--RPGARIAVVGPLGDATRVLRGNYS--STQSAPPVSVVE 435



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 35/95 (36%), Positives = 52/95 (54%), Gaps = 13/95 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EE           D+  L LP  QQ L+   A+A  KP+I+V + G P+++ 
Sbjct: 633 GLTSDMEAEETGTDIPGFKGGDKTSLDLPAEQQALLEH-AKATGKPLIVVAMNGSPINLA 691

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +AK   N  +I+ A YPG++G +A+A V+ G  NP
Sbjct: 692 WAK--DNAAAIVEAWYPGQSGGLAVANVLTGKTNP 724


>gi|393786908|ref|ZP_10375040.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
           CL02T12C05]
 gi|392658143|gb|EIY51773.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
           CL02T12C05]
          Length = 854

 Score =  272 bits (695), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 162/430 (37%), Positives = 237/430 (55%), Gaps = 34/430 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +     P  +R  DL+S+LT++EKIS L  ++P IPRL I  Y   +EALHGV   G   
Sbjct: 28  YLDMNAPRHERILDLLSKLTIEEKISLLRATSPGIPRLHIDKYYHGNEALHGVVRPG--- 84

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN----AGQAIG-----MT 155
                    T FPQ I  AA ++  L   I   I  EARA +N      + +G     +T
Sbjct: 85  -------NFTVFPQAIGLAAMWNPQLLNEISTVISDEARARWNELEQGKKQLGQFSDLLT 137

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW+P +N+ RDPRWGR  ET GEDP ++GK  VS+V+G+QGD          L+  +  K
Sbjct: 138 FWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQGDD------PRYLKIVSTPK 191

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           HF A    N +   R++ +  ++ +DL + Y P FE C+ +G+A+ IM AYN +N +P  
Sbjct: 192 HFAA----NNEEHNRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAINDVPCT 247

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SF 334
            +  LL K  R  WGF GY+ SDC   S +     Y K+ E A    ++AG+D+ CG   
Sbjct: 248 LNNWLLKKVLRHDWGFDGYVVSDCGGPSFLVTHHKYVKTLEAAAALSIQAGLDLECGDEV 307

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             +    A KQ  + E+EID A +++   RMRLGLF+ +P + P+ KI   +V    H  
Sbjct: 308 YMEPLLNAYKQYMVSEAEIDSAAYHVLRARMRLGLFD-DPALNPYNKISPSIVGCEKHSK 366

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
           LAL+AA+  IVLLKN    LPL   K  S+A++G NA +++   G+Y+G P  + ++ L+
Sbjct: 367 LALEAARQSIVLLKNEKKFLPLDSKKIKSIAVVGINAGNSE--FGDYSGTPVNQPVSILE 424

Query: 454 ALQNYVENTV 463
            ++  V + V
Sbjct: 425 GIKKKVGDQV 434



 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 61/102 (59%), Gaps = 3/102 (2%)

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
           +A DI +  D  V ++G++++ E+E  DR  + LP  QQ  I    +     V+ VL+ G
Sbjct: 594 EAGDIMRKCDLTVAVLGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVV-VLVAG 652

Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             + I +   D +I +I+ A YPGEAG  A+AEV+FGD+NPG
Sbjct: 653 SSLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPG 692


>gi|395490413|ref|ZP_10421992.1| glycosyl hydrolase family protein [Sphingomonas sp. PAMC 26617]
          Length = 898

 Score =  272 bits (695), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 170/422 (40%), Positives = 226/422 (53%), Gaps = 59/422 (13%)

Query: 59  LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
           +V+R+T +EK+ QL+N APAIPRL IPAY WW+E+LHG  G              T+FP+
Sbjct: 51  IVARMTTEEKLPQLLNVAPAIPRLKIPAYNWWTESLHGALGAVP----------TTNFPE 100

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDPRWG 170
            I  AASFD+ L + +  AI  E RAL+  G+          G+  W+PNINIFRDPRWG
Sbjct: 101 PIGLAASFDAPLVHEVAGAISTEVRALHTMGRQTGKLGRIGTGLDTWSPNINIFRDPRWG 160

Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
           RGQET GEDP ++    V++V G+QG   +           A  KHF  +   +   +TR
Sbjct: 161 RGQETYGEDPYLSAHMGVAFVTGMQGPNPD------LPDVIATPKHFAVH---SGPESTR 211

Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           +  +  V+  DL DTY P F + + +GRA  IMCAYNR++G P+CA+  LL    R  WG
Sbjct: 212 HGANVFVSAHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACANDLLLKDHLRGAWG 271

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
           F GY+ SDCDAV  I D   YA     A+   +KAG D  C      HT+       LPE
Sbjct: 272 FTGYVVSDCDAVKDIADGHKYAPDQATAIAAAMKAGTDNEC------HTQTIGDMGGLPE 325

Query: 351 -------------SEIDRALHNLFSVRMRLGLF---NGNPTMQPFGKIGADVVCSPAHQV 394
                         +IDR+L  LFS R+R G      G P   P   I      SP H  
Sbjct: 326 RFGDSLRRGLISQDDIDRSLVRLFSARLRNGDLPGIAGAPAAVPVRAI-----LSPDHMA 380

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LALQA +  +VLLKN  G+LPL       +A+IGP  ++ + L GNY+  S +S  P+  
Sbjct: 381 LALQATEKSLVLLKND-GVLPL--RPGARIAVIGPLGDATRVLRGNYS--STQSAPPVSI 435

Query: 455 LQ 456
           ++
Sbjct: 436 VE 437



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 48/75 (64%), Gaps = 3/75 (4%)

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+  L LP  Q  ++ R A+A  KPVI+V + G P+++ +AK   N  +I+ A YPG++G
Sbjct: 656 DKTSLDLPAEQMAMLER-AKATGKPVIVVAMNGSPINLAWAK--DNAAAIVEAWYPGQSG 712

Query: 567 AVALAEVIFGDHNPG 581
            VA+A V+ G  +PG
Sbjct: 713 GVAVANVLTGKTDPG 727


>gi|393789623|ref|ZP_10377743.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
           CL02T12C05]
 gi|392650339|gb|EIY44008.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
           CL02T12C05]
          Length = 863

 Score =  272 bits (695), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 163/437 (37%), Positives = 236/437 (54%), Gaps = 33/437 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+    L   +RA DL+ RLTL EK++ + N +  I RLGI  Y+WW+EALHGVA  G  
Sbjct: 24  PYRNPNLSPEERAEDLLGRLTLKEKVTLMQNESFPIERLGIAHYDWWNEALHGVARAGI- 82

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMT 155
                    AT FP  +  A+SFD      I  A+  E RA Y+          +  G+T
Sbjct: 83  ---------ATVFPITMGMASSFDDKAIEDIFTAVSDEVRAKYHDAHRQGRRGVRCEGLT 133

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW PNINIFRDPRWGRGQET GEDP +T +  V+ V+G+QG          KL   AC K
Sbjct: 134 FWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGP---ADAKYDKLH--ACAK 188

Query: 216 HFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           H+  +     K   R+ F+A  ++ +DL +TY P F++ V++     +MCAYNR  G P 
Sbjct: 189 HYAVHSGPEAK---RHSFNAENISPRDLWETYLPAFKTLVQEADVKEVMCAYNRFEGDPC 245

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYA--KSPEDAVVDVLKAGMDVNCG 332
           C    LL++  R  WG+   + SDC A+S  + ++ +A  K+  DA    + +G D+ CG
Sbjct: 246 CGSNRLLTQILRDDWGYKHVVVSDCGAISDFFYSDRHATHKNAADASAAAVLSGTDLECG 305

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
                H   AVK+  + E  I+ +L  L   R  LG  + +  + P+ KI  D V    H
Sbjct: 306 -IEYAHLDKAVKEGLISEERINTSLFRLLKARFELGEMDDD-ALVPWSKISIDTVDCQMH 363

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + +AL   +  +VLL N +G+LPL K+    +A++GPNAN +    GNY G    ++T L
Sbjct: 364 KQMALDITRKSMVLLHN-NGVLPLGKT-GARIAVMGPNANDSVMQWGNYEGTPSHTVTVL 421

Query: 453 QALQNYVENTVYYPGCD 469
           + ++N + N +Y  GC+
Sbjct: 422 EGIRNKIGNVIYEKGCE 438



 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 55/106 (51%), Gaps = 14/106 (13%)

Query: 486 KGADHVVLMMGLDQTQEKEEL-----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
           K AD V+ + G+    E EE            DR  + LP  Q+ ++  + +A KK  ++
Sbjct: 597 KEADVVIFVGGISPNLEGEEKNFVNCPGFVGGDRTSIELPEVQRNILKALKKAGKK--VI 654

Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + C G   +      ++  +IL A YPG+AG  A+A++IFGD+NP
Sbjct: 655 FVNCSGSA-MALVPETQSCDAILQAWYPGQAGGTAVADIIFGDYNP 699


>gi|404254492|ref|ZP_10958460.1| glycosyl hydrolase family protein [Sphingomonas sp. PAMC 26621]
          Length = 898

 Score =  271 bits (694), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 170/422 (40%), Positives = 227/422 (53%), Gaps = 59/422 (13%)

Query: 59  LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
           +V+R+T +EK+ QL+N APAIPR+ IPAY WW+E+LHG  G              T+FP+
Sbjct: 51  IVARMTTEEKLPQLLNVAPAIPRIKIPAYNWWTESLHGALGAVP----------TTNFPE 100

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDPRWG 170
            I  AASFD+ L +++  AI  E RAL+  G+          G+  W+PNINIFRDPRWG
Sbjct: 101 PIGLAASFDAPLVHQVAGAISTEVRALHTMGRQTGKLGRIGTGLDTWSPNINIFRDPRWG 160

Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
           RGQET GEDP +     V++V G+QG   +           A  KHF  +   +   +TR
Sbjct: 161 RGQETYGEDPYLAAHMGVAFVTGMQGPNPD------LPDVIATPKHFAVH---SGPESTR 211

Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           +  +  V+  DL DTY P F + + +GRA  IMCAYNR++G P+CA+  LL    R  WG
Sbjct: 212 HGANVFVSAHDLEDTYLPAFRAAIVEGRAGSIMCAYNRIDGQPACANDLLLKDHLRGAWG 271

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
           F GY+ SDCDAV  I D   YA     A+   +KAG D  C      HT+       LPE
Sbjct: 272 FTGYVVSDCDAVKDIADGHKYAPDQATAIAAAMKAGTDNEC------HTQTIGDMGGLPE 325

Query: 351 -------------SEIDRALHNLFSVRMRLGLF---NGNPTMQPFGKIGADVVCSPAHQV 394
                         +IDR+L  LFS R+R G      G P   P   I      SP H  
Sbjct: 326 RFGDSLRRGLISQDDIDRSLVRLFSARLRNGDLPGIAGAPAAVPVRAI-----LSPDHMA 380

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LALQAA+  +VLLKN  G+LPL       +A+IGP  ++ + L GNY+  S +S  P+  
Sbjct: 381 LALQAAEKSLVLLKND-GVLPL--RPGARIAVIGPLGDATRVLRGNYS--STQSAPPVPI 435

Query: 455 LQ 456
           ++
Sbjct: 436 VE 437



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 48/75 (64%), Gaps = 3/75 (4%)

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           D+  L LP  Q  ++ R A+A  KPVI+V + G P+++ +AK   N  +I+ A YPG++G
Sbjct: 656 DKTSLDLPAEQMAMLER-AKATGKPVIVVAMNGSPINLAWAK--DNAAAIVEAWYPGQSG 712

Query: 567 AVALAEVIFGDHNPG 581
            +A+A V+ G  +PG
Sbjct: 713 GLAVANVLTGKTDPG 727


>gi|427384989|ref|ZP_18881494.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728250|gb|EKU91109.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
           12058]
          Length = 862

 Score =  271 bits (694), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 164/446 (36%), Positives = 242/446 (54%), Gaps = 38/446 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +T P+    L  ++RA+DLVSRLTL+EK   + + + AIPRLGI  + WWSEALHGVA  
Sbjct: 19  QTLPYQNPELSPAERAKDLVSRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG------- 153
           G            T FP+ +  AASF+  L + I  A   E RA +N     G       
Sbjct: 79  G----------NVTVFPEPVGMAASFNEKLVFEIFNATSDEMRAKHNERVRNGLEDTRFH 128

Query: 154 -MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
            ++ W PN+NIFRDPRWGRGQET GEDP +T +  ++ V+G+QG      K +  L   A
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGP--EDEKYRKLL---A 183

Query: 213 CCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KH+  +    W   +R+  +   ++ +DL +TY P F++ V++     +MCAY R++ 
Sbjct: 184 CAKHYAVHSGPEW---SRHSANLNNISPRDLWETYLPAFKALVQKADVREVMCAYQRLDD 240

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK---AGMD 328
            P C    LL +  R +WGF   + SDC A++  + +    KS  DAV   +K   AG D
Sbjct: 241 DPCCGSTRLLQQILRDEWGFKYLVVSDCGAIADFWTSH---KSSSDAVHAAVKGTMAGTD 297

Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           V CG  +  +    AV +  + E EID+ +  L   R  LG  + +P++  + +I   VV
Sbjct: 298 VECGYGYAYQKLPEAVSRGLITEEEIDKHVLRLLEGRFELGEMD-DPSLVKWSQIPMSVV 356

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
            S AH+ L+L  ++  + LL+N + +LPL KS    +A+IGPNA+    L GNY G   +
Sbjct: 357 NSKAHKDLSLNMSRQTMTLLQNKNNVLPLSKSIR-KIAVIGPNADDKPMLWGNYNGTPNQ 415

Query: 448 SITPLQALQNYVENT--VYYPGCDTV 471
           +IT L   +  ++    +Y  GCD V
Sbjct: 416 TITILDGFKTKLKKNQIIYMKGCDLV 441



 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 57/112 (50%), Gaps = 13/112 (11%)

Query: 479 DKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAA 528
           D+++   K  D VV + G+    E EE+          DR D+ LP  Q+  +  + EA+
Sbjct: 591 DESISKLKNVDMVVFVGGISPQLEGEEMPLNLPGFKNGDRTDIELPAVQRNFLKALKEAS 650

Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           K+  ++ + C G   +       +  +IL A Y GE G  A+A+V+FGD+NP
Sbjct: 651 KQ--VVFVNCSGS-SMALLPETESCDAILQAWYGGELGGQAVADVLFGDYNP 699


>gi|329963634|ref|ZP_08301109.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328528179|gb|EGF55158.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 863

 Score =  271 bits (693), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 176/458 (38%), Positives = 246/458 (53%), Gaps = 36/458 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T+LP  +RA  L+  LTL+EK+S +++ A  + RLGI  Y WW+EALHGVA  G   
Sbjct: 23  YKTTSLPPEERAELLLKELTLEEKVSLMMDGAKPVERLGIKPYNWWNEALHGVARAGL-- 80

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL--YNAGQAI-----GMTFW 157
                   AT FPQ I  AASF     Y +  A+  EARA   Y A Q       G+T W
Sbjct: 81  --------ATVFPQPIGMAASFSPEAVYGVFDAVSDEARAKNSYYASQGSYERYQGLTMW 132

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T +  V  V+G+QG   N G+     +  AC KHF
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQGG--NDGRYD---KLHACAKHF 187

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY PPFE+ VK+G+   +MCAYNR  G P C 
Sbjct: 188 AVHSGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 244

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R +WGF G + SDC A++  Y+  G+   P  E A    + +G D+ CGS 
Sbjct: 245 SNRLLMQILRDEWGFDGIVLSDCGAIADFYNDRGHKTHPDAESASAAAVLSGTDLECGSS 304

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             K    AVK  K+ E  +D A+  L + R  LG  +  P    + KI   VV S  H  
Sbjct: 305 Y-KALVEAVKLGKIEEKAVDVAVKRLLTARFALGEMD-EPDKVSWTKIPFSVVASAKHDS 362

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LAL  A+  + LL N + +LPL K   +++A++GPNAN +    GNY G    ++T L+ 
Sbjct: 363 LALDIARKSMTLLMNKNDILPL-KQGGLTVAVMGPNANDSVMQWGNYNGMPSHTVTILEG 421

Query: 455 LQNYV---ENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
           ++  +   +  +Y  GC  V    + I  A ++ K AD
Sbjct: 422 VRKALGADDKLIYEQGCGWV--DRSLIQSAFNLCKSAD 457



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/168 (30%), Positives = 71/168 (42%), Gaps = 20/168 (11%)

Query: 430 NANSAKTLLGNYAGPSCRSITPLQALQNY-VENTVYYPGCDTVACSSASIDKAVDIAKGA 488
           N    K     + G        +QA + Y +E    Y   D          K VDI K  
Sbjct: 535 NGEEVKGFSNKHGGRKSTYNLRVQAGKPYNIELDFEYLRSDAQLNFDLGFRKEVDIKKSV 594

Query: 489 DHV------VLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPV 532
             V      + + G+  + E EE+          DR D+ LP  Q+E+I  +  + KK V
Sbjct: 595 AVVKDADVVIFVGGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQREMIAALHRSGKK-V 653

Query: 533 ILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           I V   G P  I      +   +IL A YPG+AG  A+AEV+FG +NP
Sbjct: 654 IFVNCSGSP--IGLEPETKECEAILQAWYPGQAGGTAVAEVLFGKYNP 699


>gi|189464219|ref|ZP_03013004.1| hypothetical protein BACINT_00556 [Bacteroides intestinalis DSM
           17393]
 gi|189438009|gb|EDV06994.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 865

 Score =  271 bits (693), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 151/426 (35%), Positives = 237/426 (55%), Gaps = 29/426 (6%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           PIS R  +L+S++TL+EK++QL N   +IPRL +P+Y +W+E LHGVA  G+        
Sbjct: 55  PISARVENLISKMTLEEKVAQLSNETDSIPRLNLPSYNYWNECLHGVARAGE-------- 106

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALY-NAGQAIGMTFWAPNINIFRDPRW 169
              T FPQ I  A+++D+ L  ++  AI  EAR  Y   G+  G+T+W+P IN+ RDPRW
Sbjct: 107 --VTVFPQAINLASTWDTLLIKKVASAISTEARLKYLEIGK--GLTYWSPTINMARDPRW 162

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
           GR +ET GEDP +T +  V++V+G+QGD  +       L+  A  KHF A + +N     
Sbjct: 163 GRNEETYGEDPYLTSRLGVAFVKGLQGDHPD------YLKTVATIKHFVANNQEN----D 212

Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
           R+   +++  + L + Y P +E+CVK+  A  +M AYN  NG+       LL    R++W
Sbjct: 213 RFSSSSQIPTKQLYEYYFPAYEACVKEADAQSVMTAYNAFNGVAPSGSTWLLGDVLRKEW 272

Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLP 349
           GF G++ SDC A+ ++        S E+A    + +G D+ CG   ++   AAVK   + 
Sbjct: 273 GFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGINSGCDLECGGTYREKLVAAVKMGLVS 332

Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN 409
           E  ID+AL  + + R +LG F+    + P+      ++       LA +AA   IVLLKN
Sbjct: 333 EQAIDKALTRVLTARFKLGEFD-PIELVPYNHYDKKLLAGEKFGKLAYEAAVKSIVLLKN 391

Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYP 466
            +  LP+ K K  S+A++GP A++    LG Y+G    +++ LQ +++ V   V   Y  
Sbjct: 392 DNDFLPVDKKKIRSVAIVGPFADN--NYLGGYSGKPVHNVSLLQGVKDLVGKKVKVSYME 449

Query: 467 GCDTVA 472
           G   ++
Sbjct: 450 GTSVIS 455



 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 57/107 (53%), Gaps = 3/107 (2%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
           +S  IDK  +   GAD V++ +G D+   +E  D   + LP  Q+ L+  + +   +   
Sbjct: 595 NSDQIDKVKEFVSGADLVLVALGNDEKLARENRDLPSIYLPMTQELLLKEIYKVNPRTA- 653

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           L+L  G P+   +A    N+ +IL A YPG+ G  ALA ++FG  NP
Sbjct: 654 LILHTGNPLTSKWAA--ENVPAILQAWYPGQEGGKALAGILFGSENP 698


>gi|393789624|ref|ZP_10377744.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
           CL02T12C05]
 gi|392650340|gb|EIY44009.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
           CL02T12C05]
          Length = 855

 Score =  271 bits (693), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 163/428 (38%), Positives = 232/428 (54%), Gaps = 34/428 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           F   T P  +R  DL++RLT++EK+S LVN A  IPRL I  Y   +EALHGV   G+  
Sbjct: 29  FRDMTAPQHERILDLLNRLTVEEKVSLLVNDAREIPRLNIDKYNHGNEALHGVVRPGE-- 86

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAIGMT 155
                    T FPQ I  AA+++  L +R+  AI  EAR  +          AG +  +T
Sbjct: 87  --------FTVFPQAIGLAATWNPNLIFRVSTAISDEARGRWKELDYGKKQIAGGSDLLT 138

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW+P +N+ RDPRWGR  ET GEDP ++G+    +V+G+QGD          L+  +  K
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPFLSGRIGCEFVKGLQGDN------PRYLKTVSTPK 192

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           HF A    N +   R   +AR++ +DL + Y P FE C+  G+A  IM AYN VN +P  
Sbjct: 193 HFAA----NNEEHNRSSCNARMSERDLREYYLPAFERCIVDGKAQSIMMAYNAVNDVPCT 248

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
            +  L+ K  R  W F+GYI SDC A   +     Y K+ E A    LKAG+D+ CG  +
Sbjct: 249 VNIYLIKKVLRGDWNFNGYIVSDCSAPEWMVTKHKYVKNLEAAATLALKAGLDLECGDRV 308

Query: 336 QKH-TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
                  A  +  + E+EID A +++   RM LGLF+ +P+  P+ KI   V+    HQ 
Sbjct: 309 YTAPLLKAYNEYMVSEAEIDSAAYHILRGRMLLGLFD-DPSQNPYNKIEPSVIGCKEHQE 367

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
           LAL+ A+  +VLLKN    LPL + K  S+A++G +A  A    G+Y+G P    ++ L 
Sbjct: 368 LALETARQSMVLLKNQKNFLPLNRKKIRSIAVVGISA--AHCEFGDYSGNPKNTPVSVLD 425

Query: 454 ALQNYVEN 461
            ++ Y EN
Sbjct: 426 GIKKYAEN 433



 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 41/98 (41%), Positives = 60/98 (61%), Gaps = 3/98 (3%)

Query: 484 IAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
           +AK  D  V ++G++++ E+E  DR  L LP  QQE I  + +     V+ VL+ G  + 
Sbjct: 599 VAKECDVTVAVLGINKSIEREGQDRYSLELPIDQQEFIKELYKVNPNTVV-VLVAGSSMA 657

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           I +   D N+ +IL A YPGE G  A+AEV+FGD+NPG
Sbjct: 658 INWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPG 693


>gi|347736808|ref|ZP_08869356.1| putative exo-1,4-beta-xylosidase xlnD [Azospirillum amazonense Y2]
 gi|346919574|gb|EGY01048.1| putative exo-1,4-beta-xylosidase xlnD [Azospirillum amazonense Y2]
          Length = 903

 Score =  271 bits (693), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 171/418 (40%), Positives = 230/418 (55%), Gaps = 42/418 (10%)

Query: 54  QRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
           Q+A+ LV+R+TLDEK+ QLVN APAIPRL IPAY WW+E+LHG  G              
Sbjct: 43  QKAQSLVARMTLDEKLGQLVNVAPAIPRLDIPAYNWWTESLHGAIGAVP----------T 92

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--------MTFWAPNINIFR 165
           T+FP+ I  AA+FD+ L + +   + +E RAL+  G+A G        +  W+PNINI+R
Sbjct: 93  TNFPEPIGLAATFDAPLVHDVAGTVSVEVRALHTLGRATGHLGRIGTGLDTWSPNINIYR 152

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRGQET GEDP +T + AV++V G+QG   +           A  KHF  +   + 
Sbjct: 153 DPRWGRGQETYGEDPYLTARLAVAFVGGMQGPNPD------LPDVVATPKHFAVH---SG 203

Query: 226 KGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTA 285
             +TR+  +  V+  DL DTY P F + + + RA  IMCAYNR++G P+CA   LL    
Sbjct: 204 PESTRHAANVYVSPHDLEDTYLPAFRAAIVEARAGSIMCAYNRIDGQPACASDLLLKDHL 263

Query: 286 RRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG--------SFLQK 337
           R  W F GY+ SDCDAV  I D   YA     AV   LKAG+D  C           L  
Sbjct: 264 RGAWHFTGYVVSDCDAVKDIADNHKYAPDQAAAVAAALKAGVDNECNVATLFGGGGGLAA 323

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
             K A  +  +  ++ID AL  LFS R R G   G    +P   +    + +  HQ LA 
Sbjct: 324 RYKEAYTRGLIGAADIDLALVRLFSARYRNGDLPG--LDRPKQDVPVQAIGTAEHQQLAR 381

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           + A + +VLLKN  G+LPL     V +A+IGP+ +S + L GNY+  S  S  P+  L
Sbjct: 382 KVAGESMVLLKND-GVLPL--KPGVKVAVIGPHGDSTRVLRGNYS--SALSAPPVSVL 434



 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/95 (40%), Positives = 57/95 (60%), Gaps = 13/95 (13%)

Query: 496 GLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           GL    E EE+          D+  L LP  Q+ L+ + A+A  KPV++VLL G P+D++
Sbjct: 640 GLSADLEGEEMKRQIEGFAGGDKTSLDLPSEQRRLL-QAAKATGKPVVVVLLNGSPLDLS 698

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +AK   N  +I+ A YPG+AG +A+A+V+ G  NP
Sbjct: 699 WAK--DNAAAIVEAWYPGQAGGLAVADVLSGKTNP 731


>gi|383125190|ref|ZP_09945844.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
 gi|251838523|gb|EES66609.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
          Length = 853

 Score =  271 bits (693), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 163/424 (38%), Positives = 238/424 (56%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SRLT++EKIS L  ++P IPRLGI  Y   +EALHGV   G+        
Sbjct: 36  PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 87

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
              T FPQ I  AA+++  L  R+   I  EARA +N    G+         +TFW+P +
Sbjct: 88  --FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V G+QGD  +       L+  +  KHF A  
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPH------YLKIVSTPKHFAA-- 197

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P   +  LL
Sbjct: 198 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNPWLL 255

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
            K  R+ WGF GY+ SDC   +++ +A  Y K+ E A    +KAG+D+ CG         
Sbjct: 256 QKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLL 315

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  + +++ID A +++ + RM+LGLF+      P+ KI   V+ S  HQ +AL AA
Sbjct: 316 NAYKQYMVSDADIDSAAYHVLTARMKLGLFDSG-ERNPYTKISPSVIGSKEHQQIALDAA 374

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           +  IVLLKN    LPL   K  S+A++G NA   K   G+Y+G P    ++ LQ ++N V
Sbjct: 375 RQCIVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSILQGIRNRV 432

Query: 460 ENTV 463
            + V
Sbjct: 433 GDRV 436



 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 59/92 (64%), Gaps = 3/92 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + VV +MG++++ E+E  DR D+ LP  Q+E +  + +     +I+VL+ G  + I +  
Sbjct: 605 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAINWM- 662

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            D +I +I+ A YPGE G  A+AEV+FGD+NP
Sbjct: 663 -DEHIPAIVNAWYPGEQGGTAVAEVLFGDYNP 693


>gi|319787180|ref|YP_004146655.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
 gi|317465692|gb|ADV27424.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
           suwonensis 11-1]
          Length = 903

 Score =  271 bits (692), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 166/417 (39%), Positives = 237/417 (56%), Gaps = 48/417 (11%)

Query: 59  LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
           LV++LT +EK+ QL+N+APAIPRLG+P Y WW+E+LHG  G              T+F +
Sbjct: 54  LVAKLTTEEKLDQLLNTAPAIPRLGVPHYNWWTESLHGAMGTLP----------TTNFAE 103

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT--------FWAPNINIFRDPRWG 170
            I  AA+FD  L  R+GQ I  E R L+   +  G T         W+PNINIFRDPRWG
Sbjct: 104 PIGLAATFDDGLVKRVGQVIATEVRGLHALARQTGRTGRMGTGLNTWSPNINIFRDPRWG 163

Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
           RGQET GEDP +  +  V+YV+GVQG+      +       A  KH+  +   +   +TR
Sbjct: 164 RGQETYGEDPFLAARMGVAYVKGVQGEDPRWYDI------IATPKHYAVH---SGPESTR 214

Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           +  +  V+  DL DTY P F + + +G A  +MCAYNRV+G P+CA+  LL    R  W 
Sbjct: 215 HSANVYVSRHDLEDTYLPAFRAAIVEGGAGSVMCAYNRVDGQPACANDLLLKDYLRGAWD 274

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-------SFLQKHTKAAV 343
           F GY+ SDCDAV+ I     YA     AV   +++G+D  C        + L +  + A+
Sbjct: 275 FKGYVVSDCDAVTDINRHHHYAPDAASAVAAAMRSGVDNECNGATLVDMAGLARPYQEAL 334

Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGN-PTMQPFGKIGADV--VCSPAHQVLALQAA 400
           ++  +  +++DRAL  LF+ R R    NG+ P ++P     A    V  P H  LAL+ A
Sbjct: 335 ERGLISMADVDRALVRLFAARYR----NGDLPGLRPLSTETASPADVGKPEHAALALEVA 390

Query: 401 QDGIVLLKNSHGLLPL-PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           +  +VLLKNS G+LPL P++K   LA++GP A++ + L GNY+ P  +S  P+  L+
Sbjct: 391 EKSLVLLKNS-GVLPLRPQAK---LAVVGPLADATRVLRGNYSSP--QSGPPVSVLE 441



 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 58/106 (54%), Gaps = 13/106 (12%)

Query: 485 AKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
           A  +D VV ++GL    E EE           D+  L L   QQ L+   A A  KP+++
Sbjct: 629 AAQSDVVVAVVGLTSDLEAEEAPVEVPGFKGGDKTSLDLLADQQALL-EAARATGKPLVV 687

Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           V++ G PV++ +A+  ++  +I+ A YPG++G +A+  V+ G  NP
Sbjct: 688 VVMNGSPVNLAWAR--QHAAAIIEAWYPGQSGGLAIGNVLAGHANP 731


>gi|333380551|ref|ZP_08472242.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826546|gb|EGJ99375.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 854

 Score =  271 bits (692), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 161/427 (37%), Positives = 241/427 (56%), Gaps = 36/427 (8%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +     P   R  DL+SRLT++EKIS L  ++P IPRL IP Y   +E+LHGV   G+  
Sbjct: 30  YLDEKAPTHDRIMDLLSRLTIEEKISLLRATSPGIPRLQIPKYYHGNESLHGVVRPGR-- 87

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM---------T 155
                    T FPQ I  A+ ++  L ++I  AI  EAR  +N  +   +         T
Sbjct: 88  --------FTVFPQAIGLASMWNPELHHKIATAISDEARGRWNELEQGKLQTQRFTDLLT 139

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW+P +N+ RDPRWGR  ET GEDP ++G    ++VRG+QGD          L+  +  K
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPYLSGILGTAFVRGLQGDD------PRYLKIVSTPK 193

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           HF A    N +   R+  + +++ + L + Y P FE CVK G+++ IM AYN +N +P  
Sbjct: 194 HFAA----NNEEHNRFVCNPQISERQLREYYFPAFEMCVKDGKSASIMSAYNAINDVPCT 249

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-- 333
           A+  LL+K  R  WGF+GY+ SDC   S++  A  Y K+ E A    +KAG+D+ CG   
Sbjct: 250 ANPWLLTKVLRHDWGFNGYVVSDCGGPSLLVSAMKYVKTKEAAATLSIKAGLDLECGDDV 309

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
           ++Q     A  Q  +  ++ID A + +   RM LGLF+ +P + P+ KI   VV S  H+
Sbjct: 310 YMQPLLN-AYNQYMVSRADIDTAAYRVLRARMHLGLFD-DPDLNPYNKISPSVVGSAEHK 367

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPL 452
            LAL+AA+  IVLLKN++  LPL   K  S+A++G NA +++   G+Y+G P+   ++ L
Sbjct: 368 QLALEAARQSIVLLKNNNRTLPLNPKKVKSIAVVGINAGNSE--FGDYSGIPANAPVSIL 425

Query: 453 QALQNYV 459
           Q +++ V
Sbjct: 426 QGIKDKV 432



 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 60/93 (64%), Gaps = 3/93 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + V+ ++G+++T E+E  DR D+ LP  Q+E I  + +     +++VL+ G  + I +  
Sbjct: 605 EQVIAVLGINKTIEREGQDRYDIHLPADQEEFIREIYKVNPN-IVVVLVAGSSLAINWM- 662

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            D ++ +I+ A YPGE G  A+AEV+FG++NPG
Sbjct: 663 -DEHVPAIVNAWYPGEQGGTAVAEVLFGEYNPG 694


>gi|295689400|ref|YP_003593093.1| glycosyl hydrolase family protein [Caulobacter segnis ATCC 21756]
 gi|295431303|gb|ADG10475.1| glycoside hydrolase family 3 domain protein [Caulobacter segnis
           ATCC 21756]
          Length = 895

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 167/430 (38%), Positives = 239/430 (55%), Gaps = 44/430 (10%)

Query: 46  CKTTLPI----SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
            +TT P      Q A DLV+++T +EK+ QL+N+APAIPRLG+PAY WW+E+LHG  G  
Sbjct: 29  AQTTSPTRQMADQMAADLVAKMTTEEKLDQLLNTAPAIPRLGVPAYNWWTESLHGALGS- 87

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------- 153
                       T+FP+ +  AA+FD+ L   +G AIG E R L+   +A G        
Sbjct: 88  ---------LPTTNFPEPVGLAATFDTDLVKDVGGAIGAEVRGLHTLARATGRMGRIGTA 138

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASAC 213
           +  W+PNINIFRDPRWGRGQET GEDP +  +  V++V GVQG   +        Q  A 
Sbjct: 139 LNTWSPNINIFRDPRWGRGQETYGEDPHLAARMGVAFVEGVQGPDPD------LPQVIAT 192

Query: 214 CKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIP 273
            KH+  +   +   +TR+  +  V+ +DL DTY P F + + + +A  IMCAYNR++G P
Sbjct: 193 PKHYAVH---SGPESTRHHANVYVSRRDLEDTYLPAFRAAIVEAKAGSIMCAYNRIDGQP 249

Query: 274 SCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG- 332
           +CA   LL    R  W F GY+ SDCDAV  I D   YA     AV   ++AG+D  C  
Sbjct: 250 ACASDMLLKDYLRTAWKFDGYVVSDCDAVKDINDNHKYAPDAATAVAAAVRAGVDNECNG 309

Query: 333 ------SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
                 + L    + A++Q  +   ++D+AL  L+S R+R+G   G   +    K     
Sbjct: 310 ATLTDTAGLSNRYRDALQQGLITIGDVDQALVRLYSARLRVGDLPGVRPVDTAPK-APSA 368

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           V +P H+ LAL A++  +VLLKN  GLLPL     + +A+IGP  ++ + L GNY+  S 
Sbjct: 369 VMTPEHEALALAASEKSLVLLKND-GLLPL--KPGLRVAVIGPLGDATRVLRGNYS--SA 423

Query: 447 RSITPLQALQ 456
            S  P+  ++
Sbjct: 424 LSAPPISVVE 433



 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 59/102 (57%), Gaps = 13/102 (12%)

Query: 489 DHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
           D +V ++GL    E EE           D+  L +P  QQ L+ + A+A  KP+++V + 
Sbjct: 625 DVLVAVVGLTSDLEAEESPVEIPGFKGGDKTTLDIPADQQALLEQ-AKALGKPLVVVAMN 683

Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           G P+++++AK   N  +IL A YPG++G +A+A V+ G  NP
Sbjct: 684 GSPLNLSWAK--DNAAAILEAWYPGQSGGLAIANVLTGKTNP 723


>gi|423222970|ref|ZP_17209439.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392640546|gb|EIY34345.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 862

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 164/446 (36%), Positives = 243/446 (54%), Gaps = 38/446 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +T P+    L  ++RA+DLV RLTL+EK   + + + AIPRLGI  + WWSEALHGVA  
Sbjct: 19  QTLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG------- 153
           G            T FP+ +  AASF+  L + I  A+  E RA +N     G       
Sbjct: 79  G----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFH 128

Query: 154 -MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
            ++ W PN+NIFRDPRWGRGQET GEDP +T +  ++ V+G+QG      K +  L   A
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGP--ENEKYRKLL---A 183

Query: 213 CCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KH+  +    W   +R+  +   V+ +DL +TY P F++ V++     +MCAY R++ 
Sbjct: 184 CAKHYAVHSGPEW---SRHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDD 240

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK---AGMD 328
            P C +  LL +  R +WGF   + SDC A++  + +    KS  DAV   +K   AG D
Sbjct: 241 DPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSH---KSSSDAVHAAVKGTMAGTD 297

Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           V CG  +  +    AV +  + E E+D+ +  L   R  LG  + +P++  + KI   VV
Sbjct: 298 VECGYGYAYQKLPEAVSRGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVV 356

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
              AH+ L+L  ++  + LL+N + +LPL KS    +A+IGPNA+    L GNY G   +
Sbjct: 357 NCKAHKDLSLNMSRQTMTLLQNKNNVLPLSKSIR-KIAVIGPNADDKPMLWGNYNGTPNQ 415

Query: 448 SITPLQALQNYVENT--VYYPGCDTV 471
           +IT L   ++ ++    VY  GCD V
Sbjct: 416 TITILDGFKSKLKKNQIVYMKGCDLV 441



 Score = 45.8 bits (107), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 3/74 (4%)

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR D+ LP  Q+  +  + +A K+  ++ + C G   +       +  +IL A Y GE G
Sbjct: 629 DRTDIELPAVQRNFLKALKDAGKQ--VVFVNCSGS-SMALLPETESCDAILQAWYGGELG 685

Query: 567 AVALAEVIFGDHNP 580
             A+A+V+FGD+NP
Sbjct: 686 GYAVADVLFGDYNP 699


>gi|332185848|ref|ZP_08387595.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
 gi|332014206|gb|EGI56264.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
          Length = 838

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 160/411 (38%), Positives = 226/411 (54%), Gaps = 36/411 (8%)

Query: 62  RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
           +LT+++K +QLVN APA P L +PAY+WWSE LHG+A  G           AT FPQ + 
Sbjct: 24  KLTVEQKAAQLVNEAPAEPALDLPAYDWWSEGLHGIARNGP----------ATVFPQAMG 73

Query: 122 TAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAPNINIFRDPRWGRGQE 174
            AA++D  L  ++G  + +EARA +N        G   G+T W+PNINIFRDPRWGRGQE
Sbjct: 74  MAATWDPALIEKVGDVVAVEARAKFNTKPINADRGLFEGLTIWSPNINIFRDPRWGRGQE 133

Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
           T GEDP +T +  +++VRG+QG      K+       A  KH   +         R  FD
Sbjct: 134 TYGEDPFLTSRIGIAFVRGLQGPDLKHPKV------IATVKHLAVHSGPE---GGRDSFD 184

Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
             V+ +DL  TY P F + V +G+A  +MCAYN ++G P CA+  L+++  R  WGF G 
Sbjct: 185 VMVSPRDLEATYLPAFRATVTEGKALSLMCAYNAIHGTPVCANPMLMTERLRTDWGFKGL 244

Query: 295 ITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEID 354
             SDCDAV  I+           A    L+AG D+NCG+   +    A+K+  + E EID
Sbjct: 245 TVSDCDAVGNIWMFHHAQPDAPAAAAAALRAGTDLNCGN-TYRALPEALKRGLITEGEID 303

Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
            AL    +VR  L + +      P+ +I    + +PA++ LAL+ A+ GI+LL N + LL
Sbjct: 304 TALARALAVRRMLTVDS------PWNRIKPSQLGTPANRALALETARKGIILLDNPNDLL 357

Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYY 465
           PL   K   LA+ G +A+    L GNY G +   +TPL  ++      V Y
Sbjct: 358 PL---KGKKLAVFGADADDLGVLEGNYHGSAIDPVTPLDGIRAKFGRDVRY 405



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 15/111 (13%)

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAK 529
           +A+ +A  AD +V ++GL    E E L          DR ++VLP  Q+ L+  + +A  
Sbjct: 555 EALKVAGTADVLVAIVGLSPDLEGEALGVSVPGFAGGDRTEVVLPEPQRNLLAAL-QATG 613

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           KPV+ V++ G  V +     D    + L A YPG  G  ALAE++ GD NP
Sbjct: 614 KPVVAVIVSGSAVSLG----DIKPAATLAAFYPGAEGGTALAEILSGDVNP 660


>gi|423289663|ref|ZP_17268513.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
           CL02T12C04]
 gi|423298156|ref|ZP_17276215.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
           CL03T12C18]
 gi|392663697|gb|EIY57244.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
           CL03T12C18]
 gi|392667374|gb|EIY60884.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
           CL02T12C04]
          Length = 850

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 158/424 (37%), Positives = 240/424 (56%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SRLT++EKIS L  ++P IPRLGI  Y   +EALHGV   G+        
Sbjct: 33  PVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 84

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
              T FPQ I  AA+++  L  ++   I  EARA +N            +  +TFW+P +
Sbjct: 85  --FTVFPQAIGLAATWNPVLQQKVATVISDEARARWNELDQGRNQKEQFSDVLTFWSPTV 142

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QG+          L+  +  KHF A  
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGED------PRYLKIVSTPKHFVA-- 194

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P   +  LL
Sbjct: 195 --NNEEHNRFICNPQISEKQLREYYFPAFEMCVKKGKAASIMTAYNALNDVPCTLNAWLL 252

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
            K  R+ WGF GY+ SDC   S++ +A  Y K+ E A    +KAG+D+ CG     ++  
Sbjct: 253 QKVLRQDWGFRGYVVSDCGGPSLLVNAHKYVKTKETAATLSIKAGLDLECGDDVYDEYLL 312

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  + +++ID A  ++ + RM+LG+F+      P+ +I   V+ S  HQ +AL AA
Sbjct: 313 NAYKQYMVSDADIDSAACHVLAARMKLGMFDSK-ERNPYARISPSVIGSKDHQQVALDAA 371

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           ++ IVLLKN   +LPL   K  S+A++G NA + +   G+Y+G P    ++ LQ ++N V
Sbjct: 372 RECIVLLKNQKNMLPLNVDKLKSIAVVGINAGTCE--FGDYSGAPVIEPVSVLQGIKNRV 429

Query: 460 ENTV 463
              V
Sbjct: 430 GEKV 433



 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 59/93 (63%), Gaps = 3/93 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + VV +MG++++ E+E  DR D+ LP  Q+E +  + +     +I+VL+ G  + + +  
Sbjct: 602 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAVNWM- 659

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            D +I +I+ A YPGE G  A+A+V+FGD+NP 
Sbjct: 660 -DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPA 691


>gi|365122193|ref|ZP_09339098.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363642907|gb|EHL82241.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 853

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 162/436 (37%), Positives = 245/436 (56%), Gaps = 37/436 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P  +R  DL+SRLT++EKIS L  ++P IPRLGI  Y   +EALHGV  V  G F     
Sbjct: 36  PQHERIMDLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPGNF----- 88

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
              T FPQ I  A+ ++  L Y I  AI  EAR  +N            +  +TFW+P +
Sbjct: 89  ---TVFPQAIGLASMWNPELLYEISTAISDEARGRWNELNRGKDQKGFFSDLLTFWSPTV 145

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++GK  V++V+G+QG+          L+  +  KHF A  
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQGND------PRYLKIVSTPKHFAA-- 197

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R++ +  ++ ++L + Y P FESC+K+G+A  IM AYN +N +P   +  LL
Sbjct: 198 --NNEEHNRFECNPHISERNLREYYLPAFESCIKEGKAQSIMSAYNAINDVPCTLNPWLL 255

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ-KHTK 340
           ++  R++WGF+GY+ SDC     +     Y K+PE A    +KAG+D+ CG  +  +   
Sbjct: 256 TQVLRKEWGFNGYVVSDCGGPGFLVTHHKYVKTPEAAATLSIKAGLDLECGDNVYIEPLM 315

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  + +++ID A + +   RM LGLF+ +P   P+  I   +V    H+ LAL+AA
Sbjct: 316 NAYKQCMVTDADIDTAAYRILRARMMLGLFD-DPEKNPYNAISPSIVGCEKHRQLALEAA 374

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           +  +VLLKN    LPL   K  S+A++G NA + +   G+Y+G P    ++ L+ ++N V
Sbjct: 375 RQSLVLLKNEKNFLPLNPKKVKSIAVVGINAGNCE--FGDYSGTPVNAPVSVLEGIKNRV 432

Query: 460 EN---TVYYPGCDTVA 472
            +    VY P   +++
Sbjct: 433 GDDVKIVYAPWSSSIS 448



 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 38/93 (40%), Positives = 56/93 (60%), Gaps = 3/93 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           D  V ++G++++ E+E  DR  + LP  QQ L  + A  A    ++VL+ G  + I +  
Sbjct: 605 DVTVAVLGINKSIEREGQDRYTIELPADQQ-LFIKEAYKANPNTVVVLVAGSSLAINWI- 662

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            D NI +IL A YPGE G  A+AE +FGD+NPG
Sbjct: 663 -DENIPAILNAWYPGEQGGTAVAEALFGDYNPG 694


>gi|393773722|ref|ZP_10362116.1| glycosyl hydrolase family 3 N terminal domain protein
           [Novosphingobium sp. Rr 2-17]
 gi|392720897|gb|EIZ78368.1| glycosyl hydrolase family 3 N terminal domain protein
           [Novosphingobium sp. Rr 2-17]
          Length = 846

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 183/489 (37%), Positives = 259/489 (52%), Gaps = 55/489 (11%)

Query: 60  VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQV 119
           +++  L EK+ QL ++APA   +G+PAY+WW+E LHG+A  G+          AT FPQ 
Sbjct: 17  IAKADLAEKVGQLQSTAPAAKAMGLPAYDWWNEGLHGLARNGE----------ATVFPQA 66

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYN-----AGQAI--GMTFWAPNINIFRDPRWGRG 172
           I  AA+FD  L  ++G  I  EARA +N     A + I  G+T W+PNINIFRDPRWGRG
Sbjct: 67  IGLAATFDPVLMEKVGTVISTEARAKFNTRPVNADRRIYEGLTIWSPNINIFRDPRWGRG 126

Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
           QET GEDP ++G  AV +++G+QG            +  A  KHF  +         R  
Sbjct: 127 QETYGEDPFLSGTLAVGFIKGLQGTD------PAHPRVIATPKHFAVHSGPE---AGRDS 177

Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
           FD  V+ QD   TY P F   V +GR   +MCAYN  +G P CAD  L+++  R+ WGF 
Sbjct: 178 FDVDVSPQDREATYLPAFRKAVTEGRPLSVMCAYNSTHGTPVCADDALINQRLRKDWGFK 237

Query: 293 GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESE 352
           G++ SDCDA++ ++    YA    +A    +KAG D +CG+      + AV++  + E+ 
Sbjct: 238 GFVVSDCDAIANVWMFHHYAADAAEASAASIKAGTDFDCGNTYAALPQ-AVERGLVDEAT 296

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
           IDRAL      R  LG+  G     P+G+I    V + A + LAL+AA+  IVLL+N + 
Sbjct: 297 IDRALARSLQARHSLGIAFG--AANPWGRIKPSQVHTLASRALALEAARKSIVLLQNDNA 354

Query: 413 LLPL-PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTV 471
            LPL P +K   LA++G NA+    +  +Y G +   ITPLQ ++          G   V
Sbjct: 355 RLPLKPGTK---LAVVGANADDLSVIEASYHGTAADPITPLQGIRRRF-------GAANV 404

Query: 472 ACSSASIDKAVDIAKGADHVVLMMGLDQTQEK-----EELDRVDLVLPGRQQELI----T 522
           A +  S+     +A+GA  +V    L  +Q K     E LD    V+  RQ   +    T
Sbjct: 405 AYAQGSV-----LAEGAAVIVPETAL-ASQGKPGLKGEYLDATGKVVLTRQDRRVDFNYT 458

Query: 523 RVAEAAKKP 531
           R A A   P
Sbjct: 459 RYAPAGIDP 467



 Score = 55.5 bits (132), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 56/111 (50%), Gaps = 15/111 (13%)

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAK 529
           +A+   + AD +V ++GL    E E L          DR D+ LP  Q +L+  +  A  
Sbjct: 564 EAIAATRNADAIVAVLGLSPDLEGEALQVQVPGFVGGDRSDIALPRPQAQLLEAL-HATG 622

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           KPVI+VL  G  + I     D    ++L A YPGE G  ALAE + G +NP
Sbjct: 623 KPVIVVLTSGSAIAIDPKLAD----AVLAAWYPGEEGGTALAETLGGINNP 669


>gi|29347188|ref|NP_810691.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339087|gb|AAO76885.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 853

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 162/424 (38%), Positives = 238/424 (56%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SRLT++EKIS L  ++P IPRLGI  Y   +EALHGV   G+        
Sbjct: 36  PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 87

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
              T FPQ I  AA+++  L  R+   I  EARA +N    G+         +TFW+P +
Sbjct: 88  --FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V G+QGD  +       L+  +  KHF A  
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPH------YLKIVSTPKHFAA-- 197

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P   +  LL
Sbjct: 198 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNPWLL 255

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
            K  R+ WGF GY+ SDC   +++ +A  Y K+ E A    +KAG+D+ CG         
Sbjct: 256 QKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLL 315

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  + +++ID A +++ + RM+LGLF+      P+ KI   V+ S  HQ +AL AA
Sbjct: 316 NAYKQYMVSDADIDSAAYHVLTARMKLGLFDSG-ERNPYTKISPSVIGSKEHQQIALDAA 374

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           +  +VLLKN    LPL   K  S+A++G NA   K   G+Y+G P    ++ LQ ++N V
Sbjct: 375 RQCVVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSILQGIRNRV 432

Query: 460 ENTV 463
            + V
Sbjct: 433 GDRV 436



 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 59/92 (64%), Gaps = 3/92 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + VV +MG++++ E+E  DR D+ LP  Q+E +  + +     +I+VL+ G  + I +  
Sbjct: 605 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAINWM- 662

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            D +I +I+ A YPGE G  A+AEV+FGD+NP
Sbjct: 663 -DEHIPAIVNAWYPGEQGGTAVAEVLFGDYNP 693


>gi|393720890|ref|ZP_10340817.1| beta-glucosidase [Sphingomonas echinoides ATCC 14820]
          Length = 563

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 162/415 (39%), Positives = 223/415 (53%), Gaps = 33/415 (7%)

Query: 63  LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
           LT  +K +QL +SAPA P  G+PAY+WW+E LHG+A  G           AT FPQ I  
Sbjct: 20  LTTAQKAAQLQSSAPADPAAGLPAYDWWNEGLHGLARNGY----------ATVFPQAIGM 69

Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAI-------GMTFWAPNINIFRDPRWGRGQET 175
           AA++D+ L  RIG+ +  EARA +NA           G+T W+PNINIFRDPRWGRGQET
Sbjct: 70  AATWDAPLLNRIGEVVATEARAKFNAQPVAADRKIYQGLTIWSPNINIFRDPRWGRGQET 129

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
            GEDP +TG   V+++ G+QG      K+       A  KHF  +         R  FD 
Sbjct: 130 YGEDPYLTGHLGVAFITGLQGPDPKHPKV------IATPKHFVVHSGPE---AGRDGFDV 180

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
             + QDL  TY P F   V +G+A  +MCAYN ++G+P+CA+  L+    R  WGF G+ 
Sbjct: 181 DPSPQDLEATYLPAFRLAVTEGKAQSLMCAYNAIHGVPACANAPLMVDRLRHDWGFTGFT 240

Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDR 355
            SDCDAV+ ++    Y      A    LK G D+NCG+      + A+++  +   +ID 
Sbjct: 241 VSDCDAVANVHLFHHYRLDAAGAAAASLKGGTDLNCGTTYAALPE-ALQRGLVTSQDIDT 299

Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLP 415
           A     + R  LG+  G     P+ +I    + +P H  +AL+AA+  IVLLKN    LP
Sbjct: 300 AFGRAMAARKALGIAYG--ATSPWARISPRDIGTPEHHAVALEAARKAIVLLKNEGDRLP 357

Query: 416 LPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY--VENTVYYPGC 468
           L       +ALIG +A+    L GNY G +   +TPL+ L+     EN +Y  G 
Sbjct: 358 L--KTGTRIALIGADADDIGVLQGNYHGTAIAPVTPLEGLRQQFGAENVLYAQGS 410


>gi|224537265|ref|ZP_03677804.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521119|gb|EEF90224.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 885

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 164/446 (36%), Positives = 243/446 (54%), Gaps = 38/446 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +T P+    L  ++RA+DLV RLTL+EK   + + + AIPRLGI  + WWSEALHGVA  
Sbjct: 19  QTLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG------- 153
           G            T FP+ +  AASF+  L + I  A+  E RA +N     G       
Sbjct: 79  G----------NVTVFPEPVGMAASFNDKLVFDIFNAVSDEMRAKHNERVRNGLEDVRFH 128

Query: 154 -MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
            ++ W PN+NIFRDPRWGRGQET GEDP +T +  ++ V+G+QG      K +  L   A
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGP--ENEKYRKLL---A 183

Query: 213 CCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KH+  +    W   +R+  +   V+ +DL +TY P F++ V++     +MCAY R++ 
Sbjct: 184 CAKHYAVHSGPEW---SRHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDD 240

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK---AGMD 328
            P C +  LL +  R +WGF   + SDC A++  + +    KS  DAV   +K   AG D
Sbjct: 241 DPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSH---KSSSDAVHAAVKGTMAGTD 297

Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           V CG  +  +    AV +  + E E+D+ +  L   R  LG  + +P++  + KI   VV
Sbjct: 298 VECGYGYAYQKLPEAVSKGLITEEEVDKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVV 356

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
              AH+ L+L  ++  + LL+N + +LPL KS    +A+IGPNA+    L GNY G   +
Sbjct: 357 NCKAHKDLSLNMSRQTMTLLQNKNNVLPLSKSIR-KIAVIGPNADDKPMLWGNYNGTPNQ 415

Query: 448 SITPLQALQNYVENT--VYYPGCDTV 471
           +IT L   ++ ++    VY  GCD V
Sbjct: 416 TITILDGFKSKLKKNQIVYMKGCDLV 441



 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 65/136 (47%), Gaps = 17/136 (12%)

Query: 455 LQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL-------- 506
           ++N+  N  +  G +     S SI K     KG D VV + G+    E EE+        
Sbjct: 571 VENWAANLKFNLGKEFPINYSESISKL----KGIDVVVFVGGISPQLEGEEMPVNIPGFK 626

Query: 507 --DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
             DR D+ LP  Q+  +  + +A K+  ++ + C G   +       +  +IL A Y GE
Sbjct: 627 GGDRTDIELPAVQRNFLKALKDAGKQ--VVFVNCSGS-SMALLPETESCDAILQAWYGGE 683

Query: 565 AGAVALAEVIFGDHNP 580
            G  A+A+V+FGD+NP
Sbjct: 684 LGGYAVADVLFGDYNP 699


>gi|427384502|ref|ZP_18881007.1| hypothetical protein HMPREF9447_02040 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727763|gb|EKU90622.1| hypothetical protein HMPREF9447_02040 [Bacteroides oleiciplenus YIT
           12058]
          Length = 862

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 163/437 (37%), Positives = 233/437 (53%), Gaps = 34/437 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T+L   +RA  LV  LTL+EK   +++++ ++ RLGI  Y WW+EALHGVA  G   
Sbjct: 22  YKNTSLSPEERAELLVKELTLEEKARLMMDASQSVERLGIKPYNWWNEALHGVARAGL-- 79

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQAI----GMTFW 157
                   AT FPQ I  AASFD  + Y +  A+  E RA    Y +  +     G+T W
Sbjct: 80  --------ATVFPQPIGMAASFDPEMVYEVFNAVSDEVRAKNTYYTSRDSRERYQGLTMW 131

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T +  V  V+G+QG     GK     +  AC KHF
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPA--DGKYD---KLHACAKHF 186

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R+ F+A  +  +DL +TY PPFE+ VK+G    +MCAYNR  G P C 
Sbjct: 187 AVHSGPEW---NRHSFNAENINPRDLYETYLPPFEALVKEGEVKEVMCAYNRFEGEPCCG 243

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCGSF 334
              LL +  R +WGF G + +DC A++  Y+  G+   P    A    + +G D++CGS 
Sbjct: 244 SDRLLMQILRGEWGFDGIVVADCGAIADFYNDRGHHTHPNAESASAAAVVSGTDLDCGSS 303

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             K    +VK+  + E  +D ++  L   R  LG  +  P    + KI   VV S AH  
Sbjct: 304 Y-KALVESVKKGLIAEETVDVSVKRLLKARFELGEMD-EPEKVSWTKIPFSVVASAAHDS 361

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LAL+ A+  + LL N    LPL K   +++A++GPNAN +    GNY G    ++T L  
Sbjct: 362 LALEIARKSMTLLMNKDNFLPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420

Query: 455 LQNYV---ENTVYYPGC 468
           ++N +   +  +Y  GC
Sbjct: 421 VRNILGAEDKLIYEQGC 437



 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 43/113 (38%), Positives = 60/113 (53%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           I K+V+  K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  +  A
Sbjct: 589 IRKSVERVKDADVVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALHRA 648

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KK +ILV   G P  I      +   ++L A YPG+ G  A+AEV+FG +NP
Sbjct: 649 GKK-IILVNCSGSP--IGLEPETKKCEALLQAWYPGQQGGTAVAEVLFGTYNP 698


>gi|298387490|ref|ZP_06997042.1| beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298259697|gb|EFI02569.1| beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 853

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 163/424 (38%), Positives = 237/424 (55%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SRLT++EKIS L  ++P IPRLGI  Y   +EALHGV   G+        
Sbjct: 36  PVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPGR-------- 87

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
              T FPQ I  AA+++  L  R+   I  EARA +N    G+         +TFW+P +
Sbjct: 88  --FTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLTFWSPTV 145

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V G+QGD  +       L+  +  KHF A  
Sbjct: 146 NMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPH------YLKIVSTPKHFAA-- 197

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE CVK+G+A+ IM AYN +N +P   +  LL
Sbjct: 198 --NNEEHNRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALNDVPCTLNSWLL 255

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
            K  R+ WGF GY+ SDC   +++ +A  Y K+ E A    +KAG+D+ CG         
Sbjct: 256 QKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDGPLL 315

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  + +++ID A  ++ + RM+LGLF+      P+ KI   V+ S  HQ +AL AA
Sbjct: 316 NAYKQYMVSDADIDSAACHVLTARMKLGLFDSG-ERNPYTKISPSVIGSKEHQQIALDAA 374

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           +  IVLLKN    LPL   K  S+A++G NA   K   G+Y+G P    ++ LQ ++N V
Sbjct: 375 RQCIVLLKNQKNRLPLNADKLKSIAVVGINA--GKCEFGDYSGAPVVEPVSILQGIRNRV 432

Query: 460 ENTV 463
            + V
Sbjct: 433 GDRV 436



 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 59/92 (64%), Gaps = 3/92 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + VV +MG++++ E+E  DR D+ LP  Q+E +  + +     +I+VL+ G  + + +  
Sbjct: 605 ETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLVAGSSLAVNWM- 662

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            D ++ +I+ A YPGE G  A+AEV+FGD+NP
Sbjct: 663 -DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNP 693


>gi|313204103|ref|YP_004042760.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443419|gb|ADQ79775.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 1278

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 156/425 (36%), Positives = 235/425 (55%), Gaps = 31/425 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T     +RA DLVSR+TL+EK SQL N+ P IPRLG+  Y+ W EALHGV G     
Sbjct: 39  YLNTAYSFKERAADLVSRMTLEEKQSQLGNTMPPIPRLGVNKYDVWGEALHGVVGRNNN- 97

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
             +G I  ATSFP  +   +++D  L  R    +  EAR  +N      +T+W+P I   
Sbjct: 98  --SGMI--ATSFPNSVAVGSTWDPALIKRETSVVADEARG-FNHDLIFTLTYWSPVIEPA 152

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGD--TFNGGKLKGKLQASACCKHFTAYDL 222
           RDPRWGR  ET GEDP +  +    +V+G+ GD  T+        L+   C KH+ A   
Sbjct: 153 RDPRWGRTAETFGEDPFLVSQIGSGFVQGLMGDDPTY--------LKTVPCGKHYFA--- 201

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
            N     R+   A +  +D+ + Y  P+ + +++ +   IM AY+ VNG+P  A + L+ 
Sbjct: 202 -NNSEFNRHNGSANMDDRDMREFYLTPYRTLIQKDKLPSIMTAYSAVNGVPMSASKFLVD 260

Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
             A+R +G  GY+T DCDAV+ + ++  YAKS  +A    LK G+D +CG   Q     A
Sbjct: 261 TIAKRTYGLDGYVTGDCDAVADVVNSHRYAKSKAEAAAMGLKTGVDSDCGGIYQTSALEA 320

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
           +KQ  + E+++D+AL N++++RMRLG F+    + P+  I   ++  P+H  LAL+ A  
Sbjct: 321 LKQGLISEADMDKALVNIYTIRMRLGEFDPQ-NIVPYAGIKPSIINDPSHNDLALEIATK 379

Query: 403 GIVLLKNS------HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPS--CRSITPLQA 454
             VLLKN+         LPL       +A++GP A+  K  LG+Y+G +     ITPL+ 
Sbjct: 380 SPVLLKNNLVGKSGKKALPLNAGTIKKIAVLGPQAD--KVELGDYSGEADPKYKITPLEG 437

Query: 455 LQNYV 459
           ++NY+
Sbjct: 438 IKNYI 442



 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/107 (41%), Positives = 66/107 (61%), Gaps = 3/107 (2%)

Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
           A+  + +D+A  AD  V+ +G DQT  +EE DR  + LPG Q ELI  +  AA  P  +V
Sbjct: 603 ATDKETLDMAASADVAVVFVGTDQTTGREESDRFAITLPGNQNELIKSI--AAVNPNTIV 660

Query: 536 LLCG-GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           ++ G G V++   K + N+  I++ GY G+A   A+A+V+FGD NPG
Sbjct: 661 VIQGMGMVEVEQFKNNPNVAGIIFTGYNGQAQGTAMAKVLFGDVNPG 707


>gi|300785890|ref|YP_003766181.1| beta-glucosidase [Amycolatopsis mediterranei U32]
 gi|384149201|ref|YP_005532017.1| beta-glucosidase [Amycolatopsis mediterranei S699]
 gi|399537773|ref|YP_006550435.1| beta-glucosidase [Amycolatopsis mediterranei S699]
 gi|299795404|gb|ADJ45779.1| beta-glucosidase [Amycolatopsis mediterranei U32]
 gi|340527355|gb|AEK42560.1| beta-glucosidase [Amycolatopsis mediterranei S699]
 gi|398318543|gb|AFO77490.1| beta-glucosidase [Amycolatopsis mediterranei S699]
          Length = 1218

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 195/596 (32%), Positives = 288/596 (48%), Gaps = 71/596 (11%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQL-VNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           +  T    ++RA DLV+R+TL EK+ QL  NSAPAIPRLG+  Y +WSE  HG+  +G  
Sbjct: 45  YRDTHYSFAERAADLVARMTLPEKVLQLRTNSAPAIPRLGVQQYTYWSEGQHGLNTLGAN 104

Query: 104 IFFNGTIRG---ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN------AGQAIG- 153
              +GT+ G   ATSFP  + +  S+D  L  +   AI  EAR + +      A   IG 
Sbjct: 105 TN-DGTVTGGVHATSFPTNLASTMSWDPELIQQETTAISDEARGMLDKSLWGVAQNNIGP 163

Query: 154 -------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKG 206
                  +T+WAP +N+ RDPRWGR  E  GEDP +  K A ++V G QG T +G     
Sbjct: 164 DKNNYGSLTYWAPTVNLDRDPRWGRTDEGFGEDPYLVAKMAGAFVNGYQGQTASGRPATP 223

Query: 207 KLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            L+ +A  KH+   +++N     R+   +  T  +L D Y   F + ++    SG+M +Y
Sbjct: 224 YLKVAATAKHYALNNVEN----DRHADSSDTTEANLRDYYTKQFRNLIQDAHVSGLMTSY 279

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV---- 322
           N +NG PS +D    +  A+R +GF GY TSDC AV  +Y    +  +P           
Sbjct: 280 NAINGTPSPSDTYTANAIAQRTYGFDGYTTSDCGAVGDVYAPGSHNWAPPGWTTATSNGG 339

Query: 323 ----------------------LKAGMDVNCGSFLQK--HTKAAVKQKKLPESEIDRALH 358
                                 L+AG  +NC        + + A+K   L E  +D AL 
Sbjct: 340 TQWTNTATGQQVAGAAGGQAYALRAGTQLNCTGTEATVANIQEAIKAGVLSEGVLDNALV 399

Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKN------SHG 412
           ++F+ RM+ G F+  P    + KI  DV+ SP HQ LA + A   +VLLKN      +  
Sbjct: 400 HVFTTRMQTGEFD-PPDRVAYTKITKDVIQSPEHQALAAKVAAHSLVLLKNDPVPGTAAP 458

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE-----NTVYYPG 467
           LLP   +K  ++ ++G  A   K  LG Y+G     +  +Q + + V+      TV +  
Sbjct: 459 LLPADPAKLGTVVVVGDLA--GKVTLGGYSGEPALQVNAVQGITSAVKAANPAATVTFDA 516

Query: 468 CDTVACSSASIDKAVDI---AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRV 524
           C T   ++ +   + +     K AD VV+  G D     E  DR  + +PG    LI +V
Sbjct: 517 CGTSTATTTAASCSAETLAALKTADLVVVFAGTDGNVATEGRDRTTIAMPGNYDSLIDQV 576

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             A      L +  GG V +  A     I  I+++GY GE+   ALA+V+FG  NP
Sbjct: 577 KAAGNPRTALAVQAGGAVSLGHAA---GIPGIVFSGYNGESQGTALADVLFGKQNP 629


>gi|198425902|ref|XP_002120563.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 996

 Score =  269 bits (687), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 190/557 (34%), Positives = 289/557 (51%), Gaps = 54/557 (9%)

Query: 54  QRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEALHGVAGVGKGIFF 106
           +R+   VSRLT+ E + Q+          APAI RLGI  Y+W +E L G A  G     
Sbjct: 449 KRSFSEVSRLTVKELVLQISRGGAGDNGPAPAITRLGIGPYQWNTECLRGYAMNGD---- 504

Query: 107 NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-------GQAIGMTFWAP 159
                 AT FPQ I  AA+FD  L Y++ + I LEARA +N        G   G++ ++P
Sbjct: 505 ------ATCFPQPIGLAATFDQGLIYKLAKTIALEARAKHNNFTKNGNFGDHTGLSCFSP 558

Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
            INI R P WGR QET GEDP+++   A +YV G+QGD            A+A CKHF A
Sbjct: 559 VINILRHPLWGRNQETFGEDPVMSSLMARAYVTGLQGDEI-------YYPATANCKHFAA 611

Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
           YD      ++R  F+A V+++DL  TY P F  CV  G A GI+C+YN +NG P+CA  +
Sbjct: 612 YDGPENIPSSRLSFNANVSIEDLGRTYFPAFRECVHSG-AFGIVCSYNAINGEPACAS-S 669

Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
            L    R ++ F GY++SD  A+        Y KS   + V    AG+D+   S+ + + 
Sbjct: 670 YLQTILRDKFNFKGYVSSDESAIEFFDIYFKYTKSNLLSAVAAFDAGVDLELTSYGKNNR 729

Query: 340 KA----AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
            +    AV+Q  + E+ + R+   LF  RM LG F+       +  +  DVV S AH+  
Sbjct: 730 YSLLNQAVEQGLVTEAALRRSAKRLFRTRMALGEFDPQ-EFNHWLNVPIDVVQSLAHRKQ 788

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR---SITPL 452
           A++ A    VLLKN  G+LPL   +  ++A++GP  N+++ L G+Y  P+       +PL
Sbjct: 789 AVEVAAKSFVLLKND-GVLPL-NHRYENVAIVGPFINNSEALTGDYH-PNYNLKYFSSPL 845

Query: 453 QALQNYVENTV--YYPGC------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKE 504
            A  +   + V  +  GC      +   C++ +     ++  G+D V++ +G     E E
Sbjct: 846 FAANSLSSSGVARFTTGCVGTNNQNLPICATYNSTHVKEVVTGSDIVLVTLGTGTGIEAE 905

Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
             DR+ + LPG+Q ++I  V + A  PVI+VL   GP+D+++     N  +++   +  +
Sbjct: 906 GRDRLSMELPGKQLDMIKDVVKYANGPVIVVLFNAGPLDVSWVM--GNTAAVIACHFSAQ 963

Query: 565 AGAVALAEVIFGDHNPG 581
               A+ EV+ G  NP 
Sbjct: 964 MTGEAMLEVLTGIVNPA 980


>gi|189464310|ref|ZP_03013095.1| hypothetical protein BACINT_00651 [Bacteroides intestinalis DSM
           17393]
 gi|189438100|gb|EDV07085.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 864

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 161/424 (37%), Positives = 240/424 (56%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SRLT++EKIS L  ++P IPRL IP Y   +EALHGV   G+        
Sbjct: 34  PMHERIMDLLSRLTVEEKISLLRATSPGIPRLDIPKYYHGNEALHGVVRPGR-------- 85

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
              T FPQ I  AA+++  L  ++   I  EARA +N    G+         +TFW+P +
Sbjct: 86  --FTVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTV 143

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QGD          L+  +  KHF A  
Sbjct: 144 NMARDPRWGRTPETYGEDPYLSGVMGTAFVKGLQGDD------DRYLKIVSTPKHFAA-- 195

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE+CVK G+++ IM AYN +N +P   +  LL
Sbjct: 196 --NNEEHNRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLL 253

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
           +K  R  WGF GY+ SDC   S++ +A  Y K+ E A    +KAG+D+ CG     +   
Sbjct: 254 TKVLREDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVFDEPLL 313

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
           +A +Q  +  ++ID A + +   RM+LGLF+      P+ KI   VV S  HQ +AL AA
Sbjct: 314 SAYRQYMVTNADIDSAAYRVLRARMQLGLFDSGEK-NPYTKISPAVVGSAKHQEVALNAA 372

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           ++ IVLLKN   +LPL   K  S+A++G NA + +   G+Y+G P    I+ LQ +++ V
Sbjct: 373 RECIVLLKNQKKMLPLNAKKVKSIAVVGINAGNCE--FGDYSGSPVIAPISVLQGIKDRV 430

Query: 460 ENTV 463
            + V
Sbjct: 431 GDGV 434



 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 60/93 (64%), Gaps = 3/93 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + VV ++G++++ E+E  DR D+ LP  Q E +  + +     +++VL+ G  + + +  
Sbjct: 603 ETVVAVLGINKSIEREGQDRYDIQLPADQMEFLQEIYKVNPN-IVVVLVAGSSLAVNWM- 660

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            D ++ +I+ A YPGE+G  A+AEV+FGD+NPG
Sbjct: 661 -DEHVPAIVNAWYPGESGGKAVAEVLFGDYNPG 692


>gi|189464698|ref|ZP_03013483.1| hypothetical protein BACINT_01042 [Bacteroides intestinalis DSM
           17393]
 gi|189436972|gb|EDV05957.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 862

 Score =  268 bits (686), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 163/446 (36%), Positives = 243/446 (54%), Gaps = 38/446 (8%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           +T P+    L  ++RA+DLV RLTL+EK   + + + AIPRLGI  + WWSEALHGVA  
Sbjct: 19  QTLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQ 78

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG------- 153
           G            T FP+ +  AASF+  L + I  A+  E RA +N     G       
Sbjct: 79  G----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFH 128

Query: 154 -MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
            ++ W PN+NIFRDPRWGRGQET GEDP +T +  ++ V+G+QG      K +  L   A
Sbjct: 129 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGP--ENEKYRKLL---A 183

Query: 213 CCKHFTAYDLDNWKGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           C KH+  +    W   +R+  +   V+ +DL +TY P F++ V++     +MCAY R++ 
Sbjct: 184 CAKHYAVHSGPEW---SRHTANLNNVSPRDLWETYLPAFKALVQKADVREVMCAYQRLDD 240

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK---AGMD 328
            P C +  LL +  R +WGF   + SDC A++  + +    KS  DAV   +K   AG D
Sbjct: 241 DPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSH---KSSSDAVHAAVKGTMAGTD 297

Query: 329 VNCG-SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
           V CG  +  +    AV +  + E E+++ +  L   R  LG  + +P++  + KI   VV
Sbjct: 298 VECGYGYAYQKLPEAVSRGLITEEEVNKHVLRLMEGRFELGEMD-DPSLVNWTKIPMSVV 356

Query: 388 CSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCR 447
              AH+ L+L  ++  + LL+N + +LPL KS    +A+IGPNA+    L GNY G   +
Sbjct: 357 NCKAHKDLSLNMSRQTMTLLQNKNNVLPLSKSIR-KIAVIGPNADDKPMLWGNYNGTPNQ 415

Query: 448 SITPLQALQNYVENT--VYYPGCDTV 471
           +IT L   ++ ++    VY  GCD V
Sbjct: 416 TITILDGFKSKLKKNQIVYMKGCDLV 441



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 65/136 (47%), Gaps = 17/136 (12%)

Query: 455 LQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL-------- 506
           ++N+  N  +  G +     S SI K     KG D VV + G+    E EE+        
Sbjct: 571 VENWAANLKFNLGKEFPINYSESISKL----KGIDVVVFVGGISPQLEGEEMPVNIPGFK 626

Query: 507 --DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
             DR D+ LP  Q+  +  + +A K+  ++ + C G   +       +  +IL A Y GE
Sbjct: 627 GGDRTDIELPAVQRNFLKALKDAGKQ--VVFVNCSGS-SMALLPETESCDAILQAWYGGE 683

Query: 565 AGAVALAEVIFGDHNP 580
            G  A+A+V+FGD+NP
Sbjct: 684 LGGYAVADVLFGDYNP 699


>gi|294777452|ref|ZP_06742903.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
 gi|294448520|gb|EFG17069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
          Length = 864

 Score =  268 bits (686), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 161/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  ++L   +RA DL+ +LTL+EK++ +++++  + RLGI  Y WW+EALHGVA  G   
Sbjct: 24  YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I  AASF+    + I  A+  EARA   A  A G       +T W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T    V+ V+G+Q    +  +   K+ A  C KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ--CMDANQKYDKIHA--CAKHF 189

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY  PFE+ VK+ +   +MCAYNR+ G P C 
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCCG 246

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R+ WG+ G + SDC A+   Y  +G+   P  E A    + +G D+ CGS 
Sbjct: 247 SDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   ++A K+  + E +ID ++  L   R  LG  + +P    + KI   VVCS  H  
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHDS 364

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           L+L  A+  + LL N + +LPL K    ++A++GPNAN +    GNY G    +IT L+ 
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423

Query: 455 LQNYV---ENTVYYPGCDTVACS 474
           +++ +   +  +Y  GC  V  S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446



 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 59/114 (51%), Gaps = 13/114 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           +I   V   K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  + +
Sbjct: 591 NIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           A KK VI V   G P  I      +   +IL A YPG++G  A AEV+FGD+NP
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|255572559|ref|XP_002527213.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
 gi|223533389|gb|EEF35139.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
          Length = 454

 Score =  268 bits (686), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 134/256 (52%), Positives = 181/256 (70%), Gaps = 1/256 (0%)

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
           MD+NCGS+  ++ ++AV + KL E +IDRAL NLFSV++RLGLF+G+     F K+G + 
Sbjct: 1   MDINCGSYAIRNAQSAVDKGKLREEDIDRALLNLFSVQLRLGLFDGDRINGHFSKLGPED 60

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           VC+  H+ LAL+AA+ GIVLLKN    LPL K    SLA+IGP AN+  +L G+Y G SC
Sbjct: 61  VCTEEHKKLALEAARQGIVLLKNEKKFLPLNKKAVSSLAIIGPLANNGGSLGGDYTGYSC 120

Query: 447 RSITPLQALQNYVENTVYYPGCDTVAC-SSASIDKAVDIAKGADHVVLMMGLDQTQEKEE 505
              +    +Q Y++ T Y  GC  V+C S     +A+ IAK AD V+++ G+D +QE E+
Sbjct: 121 NPQSLFDGVQAYIKRTSYAVGCSNVSCDSDDQFPEAIHIAKTADFVIVVAGIDLSQETED 180

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            DR+ L+LPG+Q  L++ VA A+KKPVILVL  GGPVD++FAK D  I SILW GYPGEA
Sbjct: 181 RDRISLLLPGKQMALVSYVAAASKKPVILVLTGGGPVDVSFAKRDSRIASILWIGYPGEA 240

Query: 566 GAVALAEVIFGDHNPG 581
           GA ALA++IFG++NPG
Sbjct: 241 GAKALADIIFGEYNPG 256


>gi|150003731|ref|YP_001298475.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|319640047|ref|ZP_07994774.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
 gi|345517061|ref|ZP_08796539.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|149932155|gb|ABR38853.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
 gi|254833833|gb|EET14142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|317388325|gb|EFV69177.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
          Length = 864

 Score =  268 bits (685), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 161/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  ++L   +RA DL+ +LTL+EK++ +++++  + RLGI  Y WW+EALHGVA  G   
Sbjct: 24  YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I  AASF+    + I  A+  EARA   A  A G       +T W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T    V+ V+G+Q    +  +   K+ A  C KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ--CMDANQKYDKIHA--CAKHF 189

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY  PFE+ VK+ +   +MCAYNR+ G P C 
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCCG 246

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R+ WG+ G + SDC A+   Y  +G+   P  E A    + +G D+ CGS 
Sbjct: 247 SDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   ++A K+  + E +ID ++  L   R  LG  + +P    + KI   VVCS  H  
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHDS 364

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           L+L  A+  + LL N + +LPL K    ++A++GPNAN +    GNY G    +IT L+ 
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423

Query: 455 LQNYV---ENTVYYPGCDTVACS 474
           +++ +   +  +Y  GC  V  S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446



 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/114 (38%), Positives = 59/114 (51%), Gaps = 13/114 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           +I   V   K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  + +
Sbjct: 591 NIKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKALCD 650

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           A KK VI V   G P  I      +   +IL A YPG++G  A AEV+FGD+NP
Sbjct: 651 AGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAAAEVLFGDYNP 701


>gi|423223731|ref|ZP_17210200.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638106|gb|EIY31959.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 854

 Score =  268 bits (685), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 160/424 (37%), Positives = 241/424 (56%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+ +R  DL+SRLT++EKIS L  ++P I RL IP Y   +EALHGV   G+        
Sbjct: 34  PMHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-------- 85

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
              T FPQ I  AA+++  L  ++   I  EARA +N    G+         +TFW+P +
Sbjct: 86  --FTVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTV 143

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QGD          L+  +  KHF A  
Sbjct: 144 NMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGDD------DRYLKIVSTPKHFAA-- 195

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE+CVK G+++ IM AYN +N +P   +  LL
Sbjct: 196 --NNEEHNRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLL 253

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
           +K  R+ WGF GY+ SDC   S++ +A  Y K+ E A    +KAG+D+ CG     +   
Sbjct: 254 TKVLRKDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAAALSIKAGLDLECGDDVYDQPLL 313

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
           +A +Q  + +++ID A + +   RM LGLF+      P+ KI   V+ S  HQ +AL AA
Sbjct: 314 SAYRQYMVTDADIDSAAYRVLRARMELGLFDSG-EQNPYTKISPAVIGSAEHQEVALNAA 372

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           ++ IVLLKN   +LPL   K  S+A++G NA S++   G+Y+G P    I+ LQ +++ V
Sbjct: 373 RECIVLLKNQKKMLPLNAKKVKSIAVVGINAGSSE--FGDYSGLPVIAPISVLQGIKDRV 430

Query: 460 ENTV 463
            + V
Sbjct: 431 GDDV 434



 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 61/93 (65%), Gaps = 3/93 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + VV ++G++++ E+E  DR D+ LP  QQE +  + +     +++VL+ G  + I +  
Sbjct: 603 ETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVNPN-IVVVLVAGSSLAINWM- 660

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            D +I +I+ A YPGE+G  A+AEV+FGD+NPG
Sbjct: 661 -DEHIPAIVNAWYPGESGGKAVAEVLFGDYNPG 692


>gi|423313129|ref|ZP_17291065.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
           CL09T03C04]
 gi|392686343|gb|EIY79649.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
           CL09T03C04]
          Length = 864

 Score =  268 bits (685), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 161/443 (36%), Positives = 241/443 (54%), Gaps = 33/443 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  ++L   +RA DL+ +LTL+EK++ +++++  + RLGI  Y WW+EALHGVA  G   
Sbjct: 24  YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-------MTFW 157
                   AT FPQ I  AASF+    + I  A+  EARA   A  A G       +T W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T    V+ V+G+Q    +  +   K+ A  C KHF
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ--CMDANQKYDKIHA--CAKHF 189

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY  PFE+ VK+ +   +MCAYNR+ G P C 
Sbjct: 190 AVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDPCCG 246

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R+ WG+ G + SDC A+   Y  +G+   P  E A    + +G D+ CGS 
Sbjct: 247 SDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDAESASAAAVLSGTDLECGSS 306

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   ++A K+  + E +ID ++  L   R  LG  + +P    + KI   VVCS  H  
Sbjct: 307 YKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSAEHDS 364

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           L+L  A+  + LL N + +LPL K    ++A++GPNAN +    GNY G    +IT L+ 
Sbjct: 365 LSLDIARKSMTLLLNKNNILPL-KRGGQTIAVMGPNANDSVMQWGNYNGTPKHTITLLEG 423

Query: 455 LQNYV---ENTVYYPGCDTVACS 474
           +++ +   +  +Y  GC  V  S
Sbjct: 424 IRSAMGENDKLIYEQGCSWVERS 446



 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 60/117 (51%), Gaps = 13/117 (11%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
              +I   V   K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  
Sbjct: 588 EEVNIKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFRKGDRTDIELPAVQRELIKA 647

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + +A KK VI V   G P  I      +   +IL A YPG++G  A+AEV+FGD+NP
Sbjct: 648 LCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQSGGKAVAEVLFGDYNP 701


>gi|294674604|ref|YP_003575220.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
 gi|294474051|gb|ADE83440.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
          Length = 869

 Score =  268 bits (684), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 159/436 (36%), Positives = 233/436 (53%), Gaps = 51/436 (11%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +P+  T L   +R  +L+S LT +EK+  ++N + +I RLGIP+Y WWSEA HGV     
Sbjct: 23  YPYLDTKLSNKERVENLLSLLTPEEKVGLMMNKSISIDRLGIPSYNWWSEACHGVRQ--- 79

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA-----------LYNAGQA 151
                    G T +PQ I  AA+F++ L+Y +   +  EARA           L+N    
Sbjct: 80  --------DGYTVYPQPIGMAAAFNAQLFYDVFSQVSDEARANWNRTDHNDPKLFNVPMG 131

Query: 152 IG-------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
           +        +TFW PN+NIFRDPRWGRGQET GEDP +     V  V G+QG   N  K 
Sbjct: 132 VTYYPGNPELTFWCPNVNIFRDPRWGRGQETCGEDPYLNAVLGVQTVLGMQG---NNDKY 188

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
               +  AC KH+  +   +     R+  +   T +DL +TY P F++ VK+G    +MC
Sbjct: 189 ---FKTHACAKHYAVH---SGPEPLRHSMNVEPTNRDLWETYLPAFKALVKKGNVREVMC 242

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED---AVVD 321
           AY R  G P C    LL    R +WG+   + +DCDA++  ++  G  ++ +D   A VD
Sbjct: 243 AYQRFEGKPCCTSDRLLIDILRNKWGYDAIVLTDCDAINNFFN-RGQHETHKDGLSASVD 301

Query: 322 VLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGK 381
            +  G D+ CG       +  +K+  + ES++D  L      R  LG+F+    M P+ K
Sbjct: 302 AVLNGTDLECGKVFMSLVE-GLKKGLIKESDLDNHLRKTLMGRFELGMFDP-ADMLPWAK 359

Query: 382 IGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY 441
           +GADV+ S  +  +A+QAA++ +VLL+N  G+LPL KS   +LA++GPNA+    L GNY
Sbjct: 360 LGADVISSEKNDAMAVQAARESMVLLENKAGILPLSKSIK-TLAVLGPNADDVNMLNGNY 418

Query: 442 AGPSCRSITPLQALQN 457
            G      TP  A Q+
Sbjct: 419 GG------TPTAAHQH 428



 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/96 (41%), Positives = 54/96 (56%), Gaps = 3/96 (3%)

Query: 486 KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
           K AD +V++ G+    E E  D+ D+ LP  QQ L+ +      KPVI V  C G   I 
Sbjct: 614 KEADAIVIIGGISAQMEGEGGDKQDIELPKVQQMLV-KAMHKTGKPVIFVN-CSGSA-IA 670

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           F   +    ++L A Y G+ GA ALAEV+FGD+NPG
Sbjct: 671 FGSVEGEYDALLQAWYAGQGGAKALAEVLFGDYNPG 706


>gi|329956868|ref|ZP_08297436.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523625|gb|EGF50717.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 864

 Score =  267 bits (683), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 161/432 (37%), Positives = 234/432 (54%), Gaps = 33/432 (7%)

Query: 53  SQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
           ++RA DLV +LTL+EK++ +++++  + RLGI  Y WW+EALHGVA  G           
Sbjct: 32  AERAEDLVKQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG----------W 81

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA-------IGMTFWAPNINIFR 165
           AT FPQ I  AASF     +    A+  EARA   A  A        G+T W P +NI+R
Sbjct: 82  ATVFPQPIGMAASFSPEALHTAFVAVSDEARAKNAAYSAEGSYKRYQGLTIWTPTVNIYR 141

Query: 166 DPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNW 225
           DPRWGRG ET GEDP +     VS V+G+Q    N    K      AC KHF  +    W
Sbjct: 142 DPRWGRGIETYGEDPYLASVMGVSVVKGLQCLDENEKYDK----VHACAKHFAVHSGPEW 197

Query: 226 KGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
               R+ F+A  ++ +DL +TY PPFE+ VK+G+   +MCAYNR  G P C    LL+  
Sbjct: 198 ---NRHSFNAENISPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGEPCCGSNRLLNHI 254

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYA--KSPEDAVVDVLKAGMDVNCGSFLQKHTKAA 342
            RR+WG+ G + +DC A+S  ++ +G+        A    + +G D+ CGS  +  T+  
Sbjct: 255 LRREWGYDGIVVADCSAISDFHNDKGHKTHADAASASSAAVLSGTDLECGSNYRSLTE-G 313

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
           VK+  + E++IDR++  L   R  LG  +  P    + +I   VVCS  H  L+L  A+ 
Sbjct: 314 VKKGFIDEADIDRSVKRLLQARFELGEMD-EPDQVRWAQIPYSVVCSDKHDSLSLDMARK 372

Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--- 459
            + LL N +  LPL +    ++A++GPNAN +    GNY G   R+IT L  +++ +   
Sbjct: 373 SMTLLLNKNNALPLERG-GTTIAVMGPNANDSVMQWGNYNGLPKRTITILDGIRSAMGKD 431

Query: 460 ENTVYYPGCDTV 471
           +  +Y  GC  V
Sbjct: 432 DKLIYEQGCSWV 443



 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 60/117 (51%), Gaps = 13/117 (11%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
             A I ++V   K AD V+   G+    E EE+          DR D+ LP  Q+E+I  
Sbjct: 588 EEADIQRSVAKVKDADVVIFAGGISPQLEGEEMGVKLPGFRGGDRTDIELPAVQREMIKA 647

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + +A KK VI V   G P  I          +IL A YPG++G  A+AEV+FGD+NP
Sbjct: 648 LHDAGKK-VIFVNCSGSP--IAMEPETEYCQAILQAWYPGQSGGKAVAEVLFGDYNP 701


>gi|365121873|ref|ZP_09338785.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363644185|gb|EHL83481.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 850

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 156/424 (36%), Positives = 234/424 (55%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P  +R  DL+SRLT++EKIS L  ++P IPRL I  Y   +EALHG+   G         
Sbjct: 33  PQHERIMDLLSRLTIEEKISLLRATSPGIPRLEIEKYYHGNEALHGIVRPG--------- 83

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQ---------AIGMTFWAPNI 161
              T FPQ I  A+ ++    Y I   I  EARA +N            +  +TFW+P +
Sbjct: 84  -NFTVFPQAIGLASMWNPDFLYEISTVISDEARARWNELNRGKDQKRLFSDLLTFWSPTV 142

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++GK  V++V+G+QG+          L+  +  KHF A  
Sbjct: 143 NMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQGND------PRYLKVVSTPKHFAA-- 194

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R++ + +++ +DL + Y P FE C+  G+A  IM AYN +N +P   +  LL
Sbjct: 195 --NNEEHNRFECNPQISERDLREYYLPAFERCIIDGKAQSIMTAYNAINDVPCTLNTWLL 252

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQ-KHTK 340
            K  R  WGF+GY+ SDC A S++     Y K+PE A    LKAG+D+ CG  +  +   
Sbjct: 253 KKVLRTDWGFNGYVVSDCGAPSLLVTHHKYVKTPEAAATLALKAGLDLECGDNVYIEPLM 312

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A KQ  + E+EID A + +   RM LGLF+ +P   P+  +   +V    H+ +AL+AA
Sbjct: 313 NAYKQYMVSEAEIDTAAYRILRARMMLGLFD-DPAKNPYNALSPSIVGCEKHKNMALEAA 371

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           +  +VLLKN +  LP+   K  S+A++G NA + +   G+Y+G P    ++ L  ++N V
Sbjct: 372 RQSLVLLKNENNFLPINPKKIKSIAVVGINAGNCE--FGDYSGKPVNVPVSVLDGIRNRV 429

Query: 460 ENTV 463
            + +
Sbjct: 430 GDDI 433



 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 54/93 (58%), Gaps = 3/93 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           D  + +MG++++ E+E  DR  + LP + QEL    A      + +VL+ G  + + +  
Sbjct: 602 DMTIAVMGINKSIEREGRDRDHIELP-KDQELFIEEAYKLNPKMAVVLVAGSSLAVNWM- 659

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            D ++ +IL A YPGE G  A+AE +FGD+NP 
Sbjct: 660 -DEHVPAILNAWYPGEQGGTAVAEALFGDYNPA 691


>gi|242206820|ref|XP_002469265.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
 gi|220731725|gb|EED85567.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
          Length = 312

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 138/298 (46%), Positives = 177/298 (59%), Gaps = 12/298 (4%)

Query: 34  DPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEA 93
           D +N    T   C T+    +RA  L+S  TL+EKI+   N+AP +PRLG+PAY+WW EA
Sbjct: 22  DCANGPLTTNTVCDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEA 81

Query: 94  LHGVAGVGKGIFF--NGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
           LHGVA    G+ F  +G    ATSFPQ IL  A+FD  L   +   +  EARA  NA ++
Sbjct: 82  LHGVA-ESPGVIFAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRS 140

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
            G+ FW PNIN F+DPRWGRGQETPGEDP     Y  + + G+QG     G      +  
Sbjct: 141 -GIDFWTPNINPFKDPRWGRGQETPGEDPFHLQSYVYNLITGLQG-----GLDPEYKRIV 194

Query: 212 ACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           A CKHF AYDL+NW+G  RY FDA V++QDL++ Y   F +C +       MC+YN VNG
Sbjct: 195 ATCKHFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNG 254

Query: 272 IPSCADRNLLSKTARRQWGFHG---YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG 326
           +PSCA+  LL    R  WG+     YITSDCDA+  IY+   Y  +  + V D L AG
Sbjct: 255 VPSCANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVADALNAG 312


>gi|449527525|ref|XP_004170761.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
          Length = 241

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 126/205 (61%), Positives = 153/205 (74%), Gaps = 6/205 (2%)

Query: 33  CDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSE 92
           CD  N  T    FC+ +L I +R +DL+ RLTL EKI  LVN+A A+PRLGI  YEWWSE
Sbjct: 34  CDKRNVGTRNMGFCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSE 93

Query: 93  ALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI 152
           ALHGV+ VG G  F GT  GATSFPQVI TAASF+  LW  IG+ +  EARA+YN G A 
Sbjct: 94  ALHGVSNVGPGTKFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTA- 152

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
           G+T+W+PN+NIFRDPRWGRGQETPGEDP++  KYA +YV+G+QG   N GK   +L+ +A
Sbjct: 153 GLTYWSPNVNIFRDPRWGRGQETPGEDPILAAKYAANYVQGLQG---NDGKK--RLKVAA 207

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARV 237
           CCKH+TAYDLDNW G  RY F+A+V
Sbjct: 208 CCKHYTAYDLDNWNGVDRYHFNAKV 232


>gi|296081550|emb|CBI20073.3| unnamed protein product [Vitis vinifera]
          Length = 309

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 126/249 (50%), Positives = 173/249 (69%), Gaps = 21/249 (8%)

Query: 136 QAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQ 195
           Q +  EARA+YN G A G+TFW+PN+NIF+DPRWGRGQETPGEDPL++ KYA  YVRG+Q
Sbjct: 77  QVVSTEARAMYNVGLA-GLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQ 135

Query: 196 GDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVK 255
               +G     +L+ +ACCKH+TAYDLDNWKG   + F+A VT QD+ DT+QPPF+SCV 
Sbjct: 136 QSD-DGSP--DRLKVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTFQPPFKSCVI 192

Query: 256 QGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH-GYITSDCDAVSIIYDAEGYAKS 314
            G  + +MC+YN+   I                + +H  Y+ SDCD+V + Y+++ Y K+
Sbjct: 193 DGNVASVMCSYNQFYLI----------------YKYHLQYLVSDCDSVDVFYNSQHYTKT 236

Query: 315 PEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
           PE+AV   + AG+D+NCGSFL +HT+AAVK   + ES +D+A+ N F+  MRLG F+GNP
Sbjct: 237 PEEAVAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNP 296

Query: 375 TMQPFGKIG 383
           +   +GK+G
Sbjct: 297 SKAIYGKLG 305


>gi|317474225|ref|ZP_07933501.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909535|gb|EFV31213.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 858

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 158/424 (37%), Positives = 241/424 (56%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           PI +R  DL+SRLT++EKIS L  ++P I RL IP Y   +EALHGV   G+        
Sbjct: 35  PIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-------- 86

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
              T FPQ I  AA+++  L  ++   I  EARA +N    G+         +TFW+P +
Sbjct: 87  --FTVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLTFWSPTV 144

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QG+          L+  +  KHF A  
Sbjct: 145 NMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSR------YLKIVSTPKHFAA-- 196

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE+CVK+G+++ IM AYN +N +P   +  LL
Sbjct: 197 --NNEEHNRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALNDVPCTLNAWLL 254

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
           +K  R  WGF GY+ SDC   +++ +A  Y K+ E A    +KAG+D+ CG         
Sbjct: 255 TKVLREDWGFKGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDAPLL 314

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A +Q  + +++ID A + +   RM+LGLF+      P+ KI   V+ S  HQ +AL AA
Sbjct: 315 NAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEN-NPYTKISPKVIGSKEHQKVALDAA 373

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           ++ IVLLKN + +LPL   K  S+A++G NA   ++  G+Y+G P    ++ LQ +++ V
Sbjct: 374 RECIVLLKNQNKMLPLDAKKIKSIAVVGINA--GRSEFGDYSGLPVIAPVSILQGIKDRV 431

Query: 460 ENTV 463
            +TV
Sbjct: 432 GDTV 435



 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/163 (29%), Positives = 84/163 (51%), Gaps = 19/163 (11%)

Query: 435 KTLLGNYAGPSCRSITP---LQALQNYVENTVYYPGCDTVACS----------SASID-- 479
           K L+  + G + R+ T    L+A ++Y     YY   D                  +D  
Sbjct: 534 KMLIDAWQGHAIRTDTATIYLEAGRDYRLKAEYYDNRDYAVAKLQWKVPQIGKKTRLDLY 593

Query: 480 -KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
            +A  + +  + VV ++G+++  E+E  DR D+ LP  Q+E +  + +     +++VL+ 
Sbjct: 594 GEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVLVA 652

Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           G  + I +   D +I +I+ A YPGE+G  A+AEV+FGD+NPG
Sbjct: 653 GSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPG 693


>gi|218130696|ref|ZP_03459500.1| hypothetical protein BACEGG_02285 [Bacteroides eggerthii DSM 20697]
 gi|217987040|gb|EEC53371.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 858

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 158/424 (37%), Positives = 241/424 (56%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           PI +R  DL+SRLT++EKIS L  ++P I RL IP Y   +EALHGV   G+        
Sbjct: 35  PIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-------- 86

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
              T FPQ I  AA+++  L  ++   I  EARA +N    G+         +TFW+P +
Sbjct: 87  --FTVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLTFWSPTV 144

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QG+          L+  +  KHF A  
Sbjct: 145 NMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSR------YLKIVSTPKHFAA-- 196

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE+CVK+G+++ IM AYN +N +P   +  LL
Sbjct: 197 --NNEEHNRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALNDVPCTLNAWLL 254

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
           +K  R  WGF GY+ SDC   +++ +A  Y K+ E A    +KAG+D+ CG         
Sbjct: 255 TKVLREDWGFKGYVVSDCGGPALLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDAPLL 314

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A +Q  + +++ID A + +   RM+LGLF+      P+ KI   V+ S  HQ +AL AA
Sbjct: 315 NAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEN-NPYTKISPKVIGSKEHQKVALDAA 373

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           ++ IVLLKN + +LPL   K  S+A++G NA   ++  G+Y+G P    ++ LQ +++ V
Sbjct: 374 RECIVLLKNQNKMLPLDAKKIKSIAVVGINA--GRSEFGDYSGLPVIAPVSILQGIKDRV 431

Query: 460 ENTV 463
            +TV
Sbjct: 432 GDTV 435



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/163 (29%), Positives = 84/163 (51%), Gaps = 19/163 (11%)

Query: 435 KTLLGNYAGPSCRSITP---LQALQNYVENTVYYPGCDTVACS----------SASID-- 479
           K L+  + G + R+ T    L+A ++Y     YY   D                  +D  
Sbjct: 534 KMLIDAWQGHAIRTDTATIYLEAGRDYRLKAEYYDNRDYAVAKLQWKVPQIGKKTRLDLY 593

Query: 480 -KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
            +A  + +  + VV ++G+++  E+E  DR D+ LP  Q+E +  + +     +++VL+ 
Sbjct: 594 GEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVLVA 652

Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           G  + I +   D +I +I+ A YPGE+G  A+AEV+FGD+NPG
Sbjct: 653 GSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPG 693


>gi|317480996|ref|ZP_07940076.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
 gi|316902889|gb|EFV24763.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
          Length = 864

 Score =  265 bits (678), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 175/458 (38%), Positives = 241/458 (52%), Gaps = 36/458 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +LP  +RA  L+  LTL+EK+S +++S+  + RLGI  Y WW+EALHGVA  G   
Sbjct: 22  YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMTFW 157
                   AT FPQ I  AASF     Y +  A+  EARA   Y A Q       G+T W
Sbjct: 80  --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T +  V  V+G+QG   N GK     +  AC KHF
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGKYD---KLHACAKHF 186

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY PPFE+ VK+G+   +MCAYNR  G P C 
Sbjct: 187 AVHSGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 243

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R +WGF G + SDC A++  Y   G+   P  E A    + +G D+ CGS 
Sbjct: 244 SDRLLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAAAVLSGTDLECGSS 303

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   + AVKQ K+ E  +D A+  L + R  LG  +  P    +  I   VV S  H  
Sbjct: 304 YEALVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDS 361

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LAL  A+  + LL N    LPL K   +++A++GPNAN +    GNY G    ++T L  
Sbjct: 362 LALDMARKSMTLLMNKDNTLPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420

Query: 455 LQNYV---ENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
           ++  +   +  +Y  GC  V    A I    +  K AD
Sbjct: 421 IRKALGTDDRLIYEQGCGWV--ERAQIQSVFNRCKTAD 456



 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 47/117 (40%), Positives = 63/117 (53%), Gaps = 13/117 (11%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
           +   + K+V+  K AD VV + G+    E EE+          DR D+ LP  Q+ELI  
Sbjct: 586 NEVDVRKSVERVKDADVVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAA 645

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  A KK V+LV   G P  I         G+IL A YPG+AG  A+AEV+FGD+NP
Sbjct: 646 LHHAGKK-VVLVNCSGSP--IGLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNP 699


>gi|298480647|ref|ZP_06998843.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298273081|gb|EFI14646.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 862

 Score =  265 bits (678), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 175/460 (38%), Positives = 245/460 (53%), Gaps = 37/460 (8%)

Query: 44  PFCKT-TLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           P  KT +L   +RA  L+  LTL+EK+S +++ + A+ RLGI  Y WW+EALHGVA  G 
Sbjct: 20  PTYKTISLSPEKRAELLLKELTLEEKVSLMMDGSKAVERLGIKPYNWWNEALHGVARAGL 79

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMT 155
                     AT FPQ I  AASF     Y +  A+  EARA   Y A Q       G+T
Sbjct: 80  ----------ATVFPQPIGMAASFSPQTVYGVFDAVSDEARAKNTYYASQGSYERYQGLT 129

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
            W P +N++RDPRWGRG ET GEDP +T +  V  V+G+QG   N GK     +  AC K
Sbjct: 130 MWTPTVNVYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQGG--NDGKYD---KLHACAK 184

Query: 216 HFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           HF  +    W    R++F+   +  +DL +TY PPFE+ VK+ +   +MCAYNR  G P 
Sbjct: 185 HFAVHSGPEW---NRHEFNVENIKPRDLYETYLPPFEALVKEAKVKEVMCAYNRFEGDPC 241

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCG 332
           C    LL +  R +WGF G + SDC A++  Y+  G+   P  E A    + +G D+ CG
Sbjct: 242 CGSDRLLMQILRNEWGFDGIVLSDCGAIADFYNDRGHKTHPDAESASAAAVLSGTDLECG 301

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
           S   K    AVK  K+ E ++D A+  L + R  LG  +  P    + KI   VV S  H
Sbjct: 302 SSY-KALIEAVKLGKIDEKDVDVAVKRLLAARFALGEMD-EPDKVSWTKIPFSVVASEKH 359

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
             LAL  A+  + LL N + +LPL K   +++A++GPNAN +    GNY G    ++T L
Sbjct: 360 DSLALDIARKSMTLLMNKNDILPL-KRGGLTIAVMGPNANDSVMQWGNYNGMPSHTVTVL 418

Query: 453 QALQNYV---ENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
             ++  +   +  +Y  GC  V    + I  A ++ K AD
Sbjct: 419 DGVRKALDANDKLIYEQGCGWV--DRSLIQSAFNLCKSAD 456



 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 46/113 (40%), Positives = 63/113 (55%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           I K+V+  KGAD V+ + G+  + E EE+          DR D+ LP  Q+ELI  +  +
Sbjct: 589 IRKSVEKVKGADVVIFVGGISPSLEGEEMGVNLPGFKRGDRTDIELPAVQRELIAALHHS 648

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KK VILV   G P  I      +   +IL A YPG+AG  A+AEV+FG +NP
Sbjct: 649 GKK-VILVNCSGSP--IGLEPETKKCEAILQAWYPGQAGGTAVAEVLFGKYNP 698


>gi|270294390|ref|ZP_06200592.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270275857|gb|EFA21717.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 864

 Score =  265 bits (677), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 175/458 (38%), Positives = 241/458 (52%), Gaps = 36/458 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +LP  +RA  L+  LTL+EK+S +++S+  + RLGI  Y WW+EALHGVA  G   
Sbjct: 22  YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMTFW 157
                   AT FPQ I  AASF     Y +  A+  EARA   Y A Q       G+T W
Sbjct: 80  --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T +  V  V+G+QG   N GK     +  AC KHF
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGKYD---KLHACAKHF 186

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY PPFE+ VK+G+   +MCAYNR  G P C 
Sbjct: 187 AVHSGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 243

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R +WGF G + SDC A++  Y   G+   P  E A    + +G D+ CGS 
Sbjct: 244 SDRLLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAVAVLSGTDLECGSS 303

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   + AVKQ K+ E  +D A+  L + R  LG  +  P    +  I   VV S  H  
Sbjct: 304 YEALVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDS 361

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LAL  A+  + LL N    LPL K   +++A++GPNAN +    GNY G    ++T L  
Sbjct: 362 LALDMARKSMTLLMNKDNTLPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420

Query: 455 LQNYV---ENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
           ++  +   +  +Y  GC  V    A I    +  K AD
Sbjct: 421 IRKALGTDDRLIYEQGCGWV--ERAQIQSVFNRCKTAD 456



 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 47/117 (40%), Positives = 63/117 (53%), Gaps = 13/117 (11%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
           +   + K+V+  K AD VV + G+    E EE+          DR D+ LP  Q+ELI  
Sbjct: 586 NEVDVRKSVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAA 645

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  A KK V+LV   G P  I         G+IL A YPG+AG  A+AEV+FGD+NP
Sbjct: 646 LHRAGKK-VVLVNCSGSP--IGLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNP 699


>gi|160891087|ref|ZP_02072090.1| hypothetical protein BACUNI_03534 [Bacteroides uniformis ATCC 8492]
 gi|156859308|gb|EDO52739.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           uniformis ATCC 8492]
          Length = 865

 Score =  264 bits (675), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 173/458 (37%), Positives = 242/458 (52%), Gaps = 36/458 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +LP  +RA  L+  LTL+EK+S +++S+  + RLGI  Y WW+EALHGVA  G   
Sbjct: 23  YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 80

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQA----IGMTFW 157
                   AT FPQ I  AASF     Y +  A+  EARA    Y + ++     G+T W
Sbjct: 81  --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYTSQESHERYQGLTMW 132

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T +  V  V+G+QG   N GK     +  AC KHF
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGKYD---KLHACAKHF 187

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY PPFE+ VK+G+   +MCAYNR  G P C 
Sbjct: 188 AVHSGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 244

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGSF 334
              LL +  R +WGF G + SDC A++  Y   G+   P  E A    + +G D+ CGS 
Sbjct: 245 SDRLLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHPDAESASAAAVLSGTDLECGSS 304

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   + AVKQ K+ E  +D A+  L + R  LG  +  P    +  I   VV S  H  
Sbjct: 305 YEALVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDS 362

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LAL  A+  + LL N    LPL K   +++A++GPNAN +    GNY G    ++T L  
Sbjct: 363 LALDMARKSMTLLMNKDNTLPL-KRGGLTIAVMGPNANDSVMQWGNYNGMPPHTVTILDG 421

Query: 455 LQNYV---ENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
           ++  +   +  +Y  GC  V    A I    +  K AD
Sbjct: 422 IRKALGSDDRLIYEQGCGWV--ERAQIQSVFNRCKTAD 457



 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 47/117 (40%), Positives = 63/117 (53%), Gaps = 13/117 (11%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
           +   + K+V+  K AD VV + G+    E EE+          DR D+ LP  Q+ELI  
Sbjct: 587 NEVDVRKSVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAA 646

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  A KK V+LV   G P  I         G+IL A YPG+AG  A+AEV+FGD+NP
Sbjct: 647 LHHAGKK-VVLVNCSGSP--IGLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNP 700


>gi|393782347|ref|ZP_10370532.1| hypothetical protein HMPREF1071_01400 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673618|gb|EIY67077.1| hypothetical protein HMPREF1071_01400 [Bacteroides salyersiae
           CL02T12C01]
          Length = 862

 Score =  264 bits (675), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 160/437 (36%), Positives = 230/437 (52%), Gaps = 33/437 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+    L   QRA DL+ RLTL+EK+  + N +  + RLGI  Y+WW+EALHGVA  G  
Sbjct: 23  PYKNPDLSPEQRAEDLLGRLTLEEKVRLMQNESFPVERLGIAHYDWWNEALHGVARAGI- 81

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--------QAIGMT 155
                    AT FP  +  A++FD  L   +   +  EARA Y+          +  G+T
Sbjct: 82  ---------ATVFPITMGMASTFDDKLVEDVFTVVSDEARAKYHNAHREGRRGVRCEGLT 132

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW PN+NIFRDPRWGRGQET GEDP +T +  V+ V G+QG          KL   AC K
Sbjct: 133 FWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVNGLQGP---ADAKYDKLH--ACAK 187

Query: 216 HFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           H+  +     K   R+ F+A  ++ +DL +TY P F+  V++     +MCAYNR  G P 
Sbjct: 188 HYAVHSGPEAK---RHYFNAENISPRDLWETYLPAFKDLVQEADVKEVMCAYNRFEGDPC 244

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCG 332
           C    LL++  R +WG+   + SDC A+S  +  D     K   DA    + +G D+ CG
Sbjct: 245 CGSNRLLTQILRDEWGYKHMVVSDCGAISDFFYKDRHATHKDAADASAAAVLSGTDLECG 304

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
                H + AVK+  + E  I+ +L  L   R  LG  + +  + P+ KI  D V    H
Sbjct: 305 -IEYAHLEEAVKKGLISEERINTSLRRLLKARFELGEMD-DDALVPWSKISIDTVDCETH 362

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + +AL   +  +VLL N +G+LPL K+    +A++GPNA  +    GNY G    + T L
Sbjct: 363 KQMALDVTRKSMVLLHN-NGVLPLAKT-GTRIAVMGPNAVDSVMQWGNYKGTPSHTSTIL 420

Query: 453 QALQNYVENTVYYPGCD 469
           + ++N + N  Y  GC+
Sbjct: 421 EGIRNKIGNVPYEKGCE 437



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 54/104 (51%), Gaps = 14/104 (13%)

Query: 488 ADHVVLMMGLDQTQEKEEL-----------DRVDLVLPGRQQELITRVAEAAKKPVILVL 536
           AD V+ + G+    E E+            DR  + LP  Q+ ++  + +A KK  ++ +
Sbjct: 598 ADVVIFVGGISPDLEGEDKYFVNCPGFSGGDRTTIELPEVQRNILKALKQAGKK--VIFV 655

Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            C G   +      ++  +IL A YPG+AG  A+A+V+FGD+NP
Sbjct: 656 NCSGSA-VALVPETKSCDAILQAWYPGQAGGTAVADVLFGDYNP 698


>gi|427384377|ref|ZP_18880882.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727638|gb|EKU90497.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
           12058]
          Length = 1050

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 158/424 (37%), Positives = 239/424 (56%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P  +R  DL+SRLT++EKIS L  ++P I RL IP Y   +EALHGV   G+        
Sbjct: 35  PTHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR-------- 86

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNI 161
              T FPQ I  AA+++  L  ++   I  EARA +N    G+         +TFW+P +
Sbjct: 87  --FTVFPQAIGLAATWNPVLQEQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTV 144

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP ++G    ++V+G+QG+          L+  +  KHF A  
Sbjct: 145 NMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGNDSR------YLKIVSTPKHFAA-- 196

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R+  + +++ + L + Y P FE+CVK G+++ IM AYN +N +P   +  LL
Sbjct: 197 --NNEEHNRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLL 254

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
           +K  R  WGF GY+ SDC   S++ +A  Y K+ E A    +KAG+D+ CG     +   
Sbjct: 255 TKVLRNDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDEPLL 314

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
           +A +Q  + +++ID A + +   RM+LGLF+      P+ KI   V+ S  HQ +AL AA
Sbjct: 315 SAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEK-NPYTKISPAVIGSKEHQEVALNAA 373

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           ++ IVLLKN   +LPL   K  S+A++G NA S++   G+Y+G P    ++ LQ +++ V
Sbjct: 374 RECIVLLKNQKKMLPLNAKKIKSIAVVGINAGSSE--FGDYSGLPVIAPVSVLQGIKDRV 431

Query: 460 ENTV 463
              V
Sbjct: 432 GEDV 435



 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 61/93 (65%), Gaps = 3/93 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + VV ++G++++ E+E  DR D+ LP  Q+E +  + +     +++VL+ G  + + +  
Sbjct: 604 ETVVAVLGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IVVVLVAGSSLAVNWM- 661

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            D ++ +I+ A YPGE+G  A+AEV+FGD+NPG
Sbjct: 662 -DEHVPAIVNAWYPGESGGKAVAEVLFGDYNPG 693


>gi|94442924|emb|CAJ91136.1| beta-xylosidase [Platanus x acerifolia]
          Length = 231

 Score =  263 bits (673), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 119/234 (50%), Positives = 166/234 (70%), Gaps = 5/234 (2%)

Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
           A SYVRG+Q       +LK     +ACCKH+TAYDLDNWKG  R+ F+ARV+ QDL DT+
Sbjct: 1   AASYVRGLQQPYGRDSRLK----VAACCKHYTAYDLDNWKGIDRFHFNARVSKQDLEDTF 56

Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
             PF  CV +G+ + +MC+YN+VNGIP+CAD NLL  T R +W  +GYI +DCD++ ++Y
Sbjct: 57  NVPFRECVVEGKVASVMCSYNQVNGIPTCADPNLLRNTIRGEWRLNGYIVTDCDSIGVLY 116

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMR 366
           D + Y  +PE++  D +KAG+D++CG FL  HT+ A+ ++ L E  +D AL N  +V+MR
Sbjct: 117 DTQHYTSTPEESAADAIKAGVDLDCGPFLAVHTQEAITRRMLSEVYVDGALANTLAVQMR 176

Query: 367 LGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
           LG+F+G P+ QPFG +G   VC+ AHQ LAL+AA+ GIVL+KN  G LPL  ++
Sbjct: 177 LGMFDGEPSAQPFGHLGPRDVCTSAHQQLALEAARQGIVLMKN-QGSLPLSTAR 229


>gi|423305378|ref|ZP_17283377.1| hypothetical protein HMPREF1072_02317 [Bacteroides uniformis
           CL03T00C23]
 gi|423311197|ref|ZP_17289166.1| hypothetical protein HMPREF1073_03916 [Bacteroides uniformis
           CL03T12C37]
 gi|392679729|gb|EIY73108.1| hypothetical protein HMPREF1073_03916 [Bacteroides uniformis
           CL03T12C37]
 gi|392681368|gb|EIY74727.1| hypothetical protein HMPREF1072_02317 [Bacteroides uniformis
           CL03T00C23]
          Length = 864

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 174/458 (37%), Positives = 240/458 (52%), Gaps = 36/458 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +LP  +RA  L+  LTL+EK+S +++S+  + RLGI  Y WW+EALHGVA  G   
Sbjct: 22  YKNASLPAEERAGLLLKELTLEEKVSLMMDSSKPVERLGIKPYNWWNEALHGVARAGL-- 79

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMTFW 157
                   AT FPQ I  AASF     Y +  A+  EARA   Y A Q       G+T W
Sbjct: 80  --------ATVFPQPIGMAASFSPETVYEVFTAVSDEARAKNAYYASQGSRERYQGLTMW 131

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
            P +NI+RDPRWGRG ET GEDP +T +  V  V+G+QG   N GK     +  AC KHF
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGKYD---KLHACAKHF 186

Query: 218 TAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
             +    W    R++F+A  +  +DL +TY PPFE+ VK+G+   +MCAYNR  G P C 
Sbjct: 187 AVHSGPEW---NRHEFNAENIKPRDLYETYLPPFEALVKEGKVKEVMCAYNRFEGDPCCG 243

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAK--SPEDAVVDVLKAGMDVNCGSF 334
              LL +  R +WGF G + SDC A++  Y   G+      E A    + +G D+ CGS 
Sbjct: 244 SDRLLMQILRDEWGFDGIVLSDCGAIADFYRDYGHKTHLDAESASAAAVLSGTDLECGSS 303

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
            +   + AVKQ K+ E  +D A+  L + R  LG  +  P    +  I   VV S  H  
Sbjct: 304 YEALVE-AVKQGKIDEKAVDVAVKRLLTARFALGEMD-EPEKVSWTGIPFSVVASAGHDS 361

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LAL  A+  + LL N    LPL K   +++A++GPNAN +    GNY G    ++T L  
Sbjct: 362 LALDMARKSMTLLMNKDNTLPL-KRGGLTIAVMGPNANDSVMQWGNYNGMPPHTVTILDG 420

Query: 455 LQNYV---ENTVYYPGCDTVACSSASIDKAVDIAKGAD 489
           ++  +   +  +Y  GC  V    A I    +  K AD
Sbjct: 421 IRKALGSDDRLIYEQGCGWV--ERAQIQSVFNRCKTAD 456



 Score = 72.0 bits (175), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 47/117 (40%), Positives = 63/117 (53%), Gaps = 13/117 (11%)

Query: 474 SSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITR 523
           +   + K+V+  K AD VV + G+    E EE+          DR D+ LP  Q+ELI  
Sbjct: 586 NEVDVRKSVERVKDADIVVFVGGVSPNLEGEEMGVELPGFRGGDRTDIELPAVQRELIAA 645

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  A KK V+LV   G P  I         G+IL A YPG+AG  A+AEV+FGD+NP
Sbjct: 646 LHRAGKK-VVLVNCSGSP--IGLEPETGRCGAILQAWYPGQAGGTAVAEVLFGDYNP 699


>gi|189464498|ref|ZP_03013283.1| hypothetical protein BACINT_00840 [Bacteroides intestinalis DSM
           17393]
 gi|189438288|gb|EDV07273.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 862

 Score =  262 bits (669), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 167/457 (36%), Positives = 241/457 (52%), Gaps = 40/457 (8%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T+L   +RA  LV  LTL+EK+  +++ +  + RL I  Y WW+EALHGVA  G   
Sbjct: 22  YKNTSLSPEERADLLVKELTLEEKVQLMMDGSRPVERLDIKPYNWWNEALHGVARAGL-- 79

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA---LYNAGQA----IGMTFW 157
                   AT FPQ I  AASF     Y +  A+  EARA    Y++  +     G+T W
Sbjct: 80  --------ATVFPQPIGMAASFSPETVYDVFTAVSDEARAKNTYYSSKDSRERYQGLTMW 131

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDT-FNGGKLKGKLQASACCKH 216
            P +NI+RDPRWGRG ET GEDP +T +  V  V+G+QG    N  KL       AC KH
Sbjct: 132 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGPADGNYDKLH------ACAKH 185

Query: 217 FTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           F  +    W    R+ F+A  +  +DL +TY PPFE+ VK+G    +MCAYNR  G P C
Sbjct: 186 FAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGEVKEVMCAYNRFEGEPCC 242

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCGS 333
               LL +  R +WGF G + SDC A++  Y+  G+   P+   A    + +G D+ CGS
Sbjct: 243 GSNRLLMQILRDEWGFDGIVVSDCGAIADFYNDRGHHTHPDAESASAAAVVSGTDLECGS 302

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
              K    +VK+  + E +++ ++  L   R  LG  +    +  + KI   VV S AH 
Sbjct: 303 SY-KALVESVKKGLISEEKVNTSVRRLLKARFELGEMDDLEKVS-WAKIPFSVVASAAHD 360

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            LAL+ A+  + LL N    LPL +   +++A++GPNAN +    GNY G    ++T L 
Sbjct: 361 SLALKIARKSMTLLMNKDNFLPLRRG-GLTVAVMGPNANDSVMQWGNYNGMPPHTVTILD 419

Query: 454 ALQNYV---ENTVYYPGC----DTVACSSASIDKAVD 483
            ++N +   +  +Y  GC     T+  S+ S  K+VD
Sbjct: 420 GVRNLLGADDKLIYEQGCPWVERTLIHSAFSQCKSVD 456



 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 44/113 (38%), Positives = 61/113 (53%), Gaps = 13/113 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           I K+V+  K AD V+   G+  + E EE+          DR D+ LP  Q+ELI  +  A
Sbjct: 589 IQKSVEQVKEADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQRELIDALYRA 648

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KK ++LV   G P  I      +   +IL A YPG+ G  A+AEV+FGD+NP
Sbjct: 649 GKK-IVLVNCSGSP--IGLEPETKKCEAILQAWYPGQQGGTAVAEVLFGDYNP 698


>gi|413919686|gb|AFW59618.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 475

 Score =  262 bits (669), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 137/263 (52%), Positives = 181/263 (68%), Gaps = 4/263 (1%)

Query: 320 VDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
           + V  AG+D+NCG+FL +HT AAV+  KL ES++DRA+ N     MRLG F+G+P   PF
Sbjct: 26  IAVAAAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPF 85

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
           G +G   VC+P++Q LA +AA+ GIVLLKN+ G LPL  +   S+A+IGPNAN++ T++G
Sbjct: 86  GNLGPSDVCTPSNQELAREAARQGIVLLKNT-GKLPLSATSIKSMAVIGPNANASFTMIG 144

Query: 440 NYAGPSCRSITPLQALQNYVENTVYYPGCDTVACS--SASIDKAVDIAKGADHVVLMMGL 497
           NY G  C+  TPLQ L   V  TVY PGC  V CS  S  +D A   A  AD  VL++G 
Sbjct: 145 NYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGA 203

Query: 498 DQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
           DQ+ E+E LDR  L+LPG+Q +L++ VA A+  P ILV++ GGP DI+FAK    I +IL
Sbjct: 204 DQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAIL 263

Query: 558 WAGYPGEAGAVALAEVIFGDHNP 580
           W GYPGEAG  A+A+V+FG HNP
Sbjct: 264 WVGYPGEAGGAAIADVLFGYHNP 286


>gi|357489449|ref|XP_003615012.1| Auxin-induced beta-glucosidase, partial [Medicago truncatula]
 gi|355516347|gb|AES97970.1| Auxin-induced beta-glucosidase, partial [Medicago truncatula]
          Length = 190

 Score =  262 bits (669), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 124/177 (70%), Positives = 141/177 (79%), Gaps = 2/177 (1%)

Query: 8   LVFPLLCLCFTSLLTRVDSTQ--PPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTL 65
           + F  L L     L   DS    PP+SCD +NP T++ PFC   L I+QRA+D+VSRLTL
Sbjct: 11  ISFIFLFLTRYHRLVHADSPTHVPPYSCDTTNPLTKSLPFCNLNLTITQRAKDIVSRLTL 70

Query: 66  DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAAS 125
           DEKISQLVN+AP+IPRLGIP+Y+WW EALHGVA  GKGI  NG++ GATSFPQVILTAAS
Sbjct: 71  DEKISQLVNTAPSIPRLGIPSYQWWDEALHGVANAGKGIRLNGSVAGATSFPQVILTAAS 130

Query: 126 FDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLV 182
           FDS LWY+I + IG EAR +YNAGQA GMTFWAPNINIFRDPRWGRGQET GEDPLV
Sbjct: 131 FDSKLWYQISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLV 187


>gi|380692851|ref|ZP_09857710.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 854

 Score =  261 bits (668), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 162/437 (37%), Positives = 229/437 (52%), Gaps = 33/437 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+    L   +RA DL+ RLTL EKI  + NS+ A+ RLGI  Y WWSE LHGVA  G  
Sbjct: 15  PYKNHQLSPKERADDLLGRLTLKEKIGLMQNSSNAVERLGIAPYNWWSEGLHGVARNGL- 73

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--------GMT 155
                    AT FP  +  A++FD     +I  A+  E RA ++             G+T
Sbjct: 74  ---------ATVFPITMGMASTFDDEAIEKIYTAVSDEGRAKFHDAHRRNRYGCGNEGLT 124

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW PN+NIFRDPRWGRGQET GEDP +T +  VS V+G+QG            +  AC K
Sbjct: 125 FWNPNVNIFRDPRWGRGQETFGEDPYLTTRMGVSVVKGMQGPADAAYD-----KTHACAK 179

Query: 216 HFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           H+  +     K   R+ FD   ++ +DL +TY P F++ V++     +MCAY R  G P 
Sbjct: 180 HYAVHSGPEAK---RHSFDVEHLSPRDLWETYLPAFKALVQEADVKEVMCAYQRFEGEPC 236

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIY--DAEGYAKSPEDAVVDVLKAGMDVNCG 332
           C    LL++  R +WG+   + SDC A+S  +  D  G  K   DA    +  G D+ CG
Sbjct: 237 CDSNRLLTQILRDEWGYKHLVVSDCGAISDFFYKDRHGTHKDAADASASAVINGTDLECG 296

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
                H + AVK+  + E  I+ +L  L   R  LG  + + ++  + KIG D V    H
Sbjct: 297 -VEYAHLEEAVKRGLITEERINTSLRRLLEARFALGEMDPD-SIVAWSKIGIDTVDCDLH 354

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + +AL   +  +VLL N+ G+LPL K+ S  +A++GPNA  +    GNY G    + T L
Sbjct: 355 RQMALDITRKSMVLLHNN-GILPLAKT-STRIAVMGPNAVDSVMQWGNYKGVPSHTYTIL 412

Query: 453 QALQNYVENTVYYPGCD 469
           + ++N V N  Y  GC+
Sbjct: 413 EGIRNKVGNVPYEKGCE 429



 Score = 46.6 bits (109), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 38/110 (34%), Positives = 55/110 (50%), Gaps = 14/110 (12%)

Query: 482 VDIAKGADHVVLMMGLDQTQEKEEL-----------DRVDLVLPGRQQELITRVAEAAKK 530
           VD  K AD V+ + G+    E EE            DR  + LP  Q++ I +  + A K
Sbjct: 584 VDKVKDADIVIFLGGISPDLEGEEKHFVNCPGFSGGDRTSIELPQVQRD-ILKALKKAGK 642

Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            V+ V   G  V +       +  +IL A YPG+AG +A+A+V+FGD NP
Sbjct: 643 KVVFVNCSGSAVALVPEM--ESCDAILQAWYPGQAGGLAVADVLFGDFNP 690


>gi|297738404|emb|CBI27605.3| unnamed protein product [Vitis vinifera]
          Length = 581

 Score =  261 bits (667), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 143/376 (38%), Positives = 212/376 (56%), Gaps = 51/376 (13%)

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYN 267
           L+ S+CCKH+  YD+D+W           V+ QD+ +T+  PFE                
Sbjct: 19  LKVSSCCKHYATYDIDSW---------LNVSEQDMKETFFSPFE---------------- 53

Query: 268 RVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYA-KSPEDAVVDVLKAG 326
                             R +W  HGYI SDC  + +I D + Y  +S  DAV   L+AG
Sbjct: 54  ------------------RDEWDLHGYIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAG 95

Query: 327 MDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV 386
           +D+ CG +       +V   K+ + E+DRAL N++ + MR+G F+G P  +    +G   
Sbjct: 96  LDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYE---SLGLKD 152

Query: 387 VCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSC 446
           +C+  H  LA +AA+ GIVLLKN + +LPL   K   L L+GP+AN+ + ++GNYAG   
Sbjct: 153 ICAADHIELAREAARQGIVLLKNDYEVLPLKPGKK--LVLVGPHANATEVMIGNYAGLPY 210

Query: 447 RSITPLQALQNYVENTVYYPGCDTVACSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEE 505
           + ++PL+A    + N  Y  GC   +CS+ +   +A + AK A+  ++ +G D + E E 
Sbjct: 211 KYVSPLEAFSA-IGNVTYATGCLDASCSNDTYFSEAKEAAKFAEVTIIFVGTDLSIEAEF 269

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
           +DRVD +LPG Q ELI +VAE +  PVILV+L G  +DITFAK +  I +ILW G+PGE 
Sbjct: 270 VDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQ 329

Query: 566 GAVALAEVIFGDHNPG 581
           G  A+A+V+FG +NPG
Sbjct: 330 GGHAIADVVFGKYNPG 345


>gi|332881173|ref|ZP_08448832.1| glycosyl hydrolase family 3 protein, partial [Capnocytophaga sp.
           oral taxon 329 str. F0087]
 gi|332680887|gb|EGJ53825.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
          Length = 675

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 160/446 (35%), Positives = 231/446 (51%), Gaps = 35/446 (7%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           S+P+ +  P+    L   +RA DL+ RLTL EKIS + N +P + RLGI  Y WWSEALH
Sbjct: 17  SSPAQDE-PYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALH 75

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-- 153
           GVA  G           AT +P  +  A+ FD  L   I   +  E RA ++  +  G  
Sbjct: 76  GVARNGL----------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRY 125

Query: 154 ------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
                 +TFW PN+NIFRDPRWGRGQET GEDP +T +  V+ VRG+QG      K    
Sbjct: 126 GRGNEGLTFWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQGPA--DAKYD-- 181

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            +  AC KH+  +     K   R+ FD   +  +DL +TY P F++ V++     +MCAY
Sbjct: 182 -KTHACAKHYAVHSGPEAK---RHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAY 237

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA- 325
            R  G P C    LL++  R +WG+   + SDC A+S  +  +G  ++  DA      A 
Sbjct: 238 QRFEGEPCCGSNRLLTQILRDEWGYKHLVVSDCGAISDFF-YQGRHETHPDAATSSASAV 296

Query: 326 --GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
             G D+ CG     H   AV++  + E  ID +L  L   R  LG  + +  + P+ +I 
Sbjct: 297 INGTDLECG-VEYAHLDEAVERGLITEHRIDTSLRRLLEARFALGEMD-DDALVPWSRIS 354

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
            D V    H+ +AL   +  +VLL N+ G+LPL K  +  +A++GPNA  +    GNY G
Sbjct: 355 IDTVDCGTHRRMALDVTRKSMVLLHNN-GILPLDKGDAGKIAVMGPNAVDSVMQWGNYKG 413

Query: 444 PSCRSITPLQALQNYVENTVYYPGCD 469
               + T L+ ++  + N  Y  GC+
Sbjct: 414 VPAHTYTILEGIRGAIGNVPYEKGCE 439


>gi|357047866|ref|ZP_09109459.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
           YIT 11840]
 gi|355529205|gb|EHG98644.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
           YIT 11840]
          Length = 676

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 160/446 (35%), Positives = 231/446 (51%), Gaps = 35/446 (7%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           S+P+ +  P+    L   +RA DL+ RLTL EKIS + N +P + RLGI  Y WWSEALH
Sbjct: 17  SSPAQDE-PYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALH 75

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-- 153
           GVA  G           AT +P  +  A+ FD  L   I   +  E RA ++  +  G  
Sbjct: 76  GVARNGL----------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRY 125

Query: 154 ------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
                 +TFW PN+NIFRDPRWGRGQET GEDP +T +  V+ VRG+QG      K    
Sbjct: 126 GRGNEGLTFWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVRGMQGPA--DAKYD-- 181

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            +  AC KH+  +     K   R+ FD   +  +DL +TY P F++ V++     +MCAY
Sbjct: 182 -KTHACAKHYAVHSGPEAK---RHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAY 237

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA- 325
            R  G P C    LL++  R +WG+   + SDC A+S  +  +G  ++  DA      A 
Sbjct: 238 QRFEGEPCCGSNRLLTQILRDEWGYKHLVVSDCGAISDFF-YQGRHETHPDAATSSASAV 296

Query: 326 --GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
             G D+ CG     H   AV++  + E  ID +L  L   R  LG  + +  + P+ +I 
Sbjct: 297 INGTDLECG-VEYAHLDEAVERGLITEHRIDTSLRRLLEARFALGEMD-DDALVPWSRIS 354

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
            D V    H+ +AL   +  +VLL N+ G+LPL K  +  +A++GPNA  +    GNY G
Sbjct: 355 IDTVDCGTHRRMALDVTRKSMVLLHNN-GILPLDKGDAGKIAVMGPNAVDSVMQWGNYKG 413

Query: 444 PSCRSITPLQALQNYVENTVYYPGCD 469
               + T L+ ++  + N  Y  GC+
Sbjct: 414 VPAHTYTILEGIRGAIGNVPYEKGCE 439


>gi|87200432|ref|YP_497689.1| beta-glucosidase [Novosphingobium aromaticivorans DSM 12444]
 gi|87136113|gb|ABD26855.1| Beta-glucosidase [Novosphingobium aromaticivorans DSM 12444]
          Length = 849

 Score =  259 bits (662), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 158/395 (40%), Positives = 223/395 (56%), Gaps = 32/395 (8%)

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
           EK +QL ++  A     +PAY++W+EALHG+A        NG    AT FPQ I  AA++
Sbjct: 30  EKAAQLQSTVTA-SSPDLPAYDYWNEALHGLA-------RNGV---ATVFPQAIGLAATW 78

Query: 127 DSYLWYRIGQAIGLEARALYNA------GQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
           D+ L  RIG  +  EARA YNA       +  G+T W+PNINIFRDPRWGRGQET GEDP
Sbjct: 79  DAPLLERIGTVVSTEARAKYNALPGKDRRRYQGLTIWSPNINIFRDPRWGRGQETYGEDP 138

Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
           ++TG  A +YVRG+QG   +  ++       A  KH  A+         R  F+ + +  
Sbjct: 139 VLTGTLATAYVRGLQGPDLDHPRVI------ATPKHLVAHSGPE---AGRDSFNVQSSAY 189

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           D+  TY P F   + +G+A  +MC+YN V+G+P C    LL++  R+ WGF G + SDCD
Sbjct: 190 DMEATYLPAFRRALTEGKALSVMCSYNSVHGVPVCGADWLLNQRVRKDWGFDGLVVSDCD 249

Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNL 360
           A+  I   + Y ++   A    + AGMD+NCG       + A+ Q       +DRAL   
Sbjct: 250 AIGNINHYQRYRQTNAAASAAAINAGMDLNCGRTYAALPE-ALAQGLTTREVVDRALART 308

Query: 361 FSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
           F+ R +LG   G     P+  I A+ V +  H+ LAL+AA+  +VLLKN +G+LPL   K
Sbjct: 309 FAARRKLGDAFG--ATSPWATIPANTVDTAEHRALALEAARKSLVLLKN-NGVLPL--RK 363

Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
              +A++GPNA+S  TL  NY G + + +TPL   
Sbjct: 364 GARIAVVGPNADSLDTLEANYHGTAAQPVTPLDGF 398



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 47/74 (63%), Gaps = 3/74 (4%)

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR D+ LP  QQ+L+  + +A  KP+++VLL G  V + + K   N  +++ A YPG++G
Sbjct: 601 DRTDIALPRAQQDLLETL-KATGKPLVVVLLSGSAVAMPWVK--ENADAVVAAWYPGQSG 657

Query: 567 AVALAEVIFGDHNP 580
             A+A+++ G  NP
Sbjct: 658 GTAIADLVDGTLNP 671


>gi|218131278|ref|ZP_03460082.1| hypothetical protein BACEGG_02889 [Bacteroides eggerthii DSM 20697]
 gi|217986495|gb|EEC52831.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 862

 Score =  258 bits (660), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 167/441 (37%), Positives = 237/441 (53%), Gaps = 36/441 (8%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T+L   +RA  L+  LTL+EK++ +++++ A+ RLGI  Y WW+EALHGVA  G   
Sbjct: 22  YKTTSLAPEKRAELLLKELTLEEKVALMMDNSQAVERLGIKPYNWWNEALHGVARAGL-- 79

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--------GMTF 156
                   AT FPQ I  AASF     Y +  A+  EARA  NA  A         G+T 
Sbjct: 80  --------ATVFPQPIGMAASFSPESVYEVFSAVSDEARAK-NANYASQGSYARYQGLTM 130

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W P +NI+RDPRWGRG ET GEDP +T +  V  V+G+QG   N GK     +  AC KH
Sbjct: 131 WTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGKYD---KLHACAKH 185

Query: 217 FTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           F  +    W    R+ F+   ++ +DL +TY PPFE+ VK+ +   +MCAYN   G P C
Sbjct: 186 FAVHSGPEW---NRHSFNVEDLSARDLYETYLPPFEALVKEAKVKEVMCAYNSFEGEPCC 242

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGS 333
               LL +  R  WGF G I SDC A++  Y+  G+   P  E A    +  G D+ CGS
Sbjct: 243 GSNRLLMQILRNDWGFDGIILSDCGAIADFYNEHGHKAYPDAESASAAAVLNGTDLECGS 302

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
             +   K AV+  ++ E +ID+A+  L   R  LG  + +P    + KI   VV S  H 
Sbjct: 303 SYKALVK-AVRAGQINEKDIDKAVMRLLEARFALGEMD-DPDNVSWTKIPFSVVASAEHD 360

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            LAL  A+  + LL+N+  +LPL K   +++A++GPNAN +    GNY G    ++T L 
Sbjct: 361 SLALNMARKSMTLLQNTGNILPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILG 419

Query: 454 ALQNYV---ENTVYYPGCDTV 471
            ++  +   +  +Y  GC  V
Sbjct: 420 GIRKALGADDKLIYEQGCSWV 440



 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 47/114 (41%), Positives = 62/114 (54%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           I+ +V   K AD VV   G+    E EE+          DR D+ LP  Q+ELI+ +  A
Sbjct: 589 INASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIELPAIQRELISALHRA 648

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KK +ILV   G P  I      +N  +IL A YPG+ G  A+AEV+FGD+NPG
Sbjct: 649 GKK-IILVNCSGSP--IALEPETKNCEAILQAWYPGQQGGTAVAEVLFGDYNPG 699


>gi|317476310|ref|ZP_07935559.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316907336|gb|EFV29041.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 862

 Score =  258 bits (660), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 167/441 (37%), Positives = 237/441 (53%), Gaps = 36/441 (8%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T+L   +RA  L+  LTL+EK++ +++++ A+ RLGI  Y WW+EALHGVA  G   
Sbjct: 22  YKTTSLAPEKRAELLLKELTLEEKVALMMDNSQAVERLGIKPYNWWNEALHGVARAGL-- 79

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--------GMTF 156
                   AT FPQ I  AASF     Y +  A+  EARA  NA  A         G+T 
Sbjct: 80  --------ATVFPQPIGMAASFSPESVYEVFSAVSDEARAK-NANYASQGSYARYQGLTM 130

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           W P +NI+RDPRWGRG ET GEDP +T +  V  V+G+QG   N GK     +  AC KH
Sbjct: 131 WTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGKYD---KLHACAKH 185

Query: 217 FTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           F  +    W    R+ F+   ++ +DL +TY PPFE+ VK+ +   +MCAYN   G P C
Sbjct: 186 FAVHSGPEW---NRHSFNVEDLSARDLYETYLPPFEALVKEAKVKEVMCAYNSFEGEPCC 242

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSP--EDAVVDVLKAGMDVNCGS 333
               LL +  R  WGF G I SDC A++  Y+  G+   P  E A    +  G D+ CGS
Sbjct: 243 GSNRLLMQILRNDWGFDGIILSDCGAIADFYNEHGHKAYPDAESASAAAVLNGTDLECGS 302

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
             +   K AV+  ++ E +ID+A+  L   R  LG  + +P    + KI   VV S  H 
Sbjct: 303 SYKALVK-AVRAGQINEKDIDKAVMRLLEARFALGEMD-DPDNVSWTKIPFSVVASAEHD 360

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            LAL  A+  + LL+N+  +LPL K   +++A++GPNAN +    GNY G    ++T L 
Sbjct: 361 SLALDMARKSMTLLQNTGNILPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPPHTVTILG 419

Query: 454 ALQNYV---ENTVYYPGCDTV 471
            ++  +   +  +Y  GC  V
Sbjct: 420 GIRKALGADDKLIYEQGCSWV 440



 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 47/114 (41%), Positives = 62/114 (54%), Gaps = 13/114 (11%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEA 527
           I+ +V   K AD VV   G+    E EE+          DR D+ LP  Q+ELI+ +  A
Sbjct: 589 INASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIELPAIQRELISALHRA 648

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KK +ILV   G P  I      +N  +IL A YPG+ G  A+AEV+FGD+NPG
Sbjct: 649 GKK-IILVNCSGSP--IALEPETKNCEAILQAWYPGQQGGTAVAEVLFGDYNPG 699


>gi|217967241|ref|YP_002352747.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
 gi|217336340|gb|ACK42133.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
           DSM 6724]
          Length = 762

 Score =  258 bits (660), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 178/529 (33%), Positives = 280/529 (52%), Gaps = 64/529 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIPA     E L G              +GAT FPQ I  A++F+  L  R+   I  
Sbjct: 87  RLGIPAI-IHEECLSGFMA-----------KGATVFPQAIGMASTFEPELIRRVSDVIRQ 134

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
             RA  N  Q +     +P ++I RDPRWGR +ET GEDP +  + A  YV+G+QG+ + 
Sbjct: 135 HMRAA-NVHQGL-----SPVLDIPRDPRWGRTEETFGEDPYLVSRMAAEYVKGLQGEDWR 188

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            G +       A  KHFTAY +       R    A+V  ++L + +  PFE  +K+G+A 
Sbjct: 189 EGII-------ATVKHFTAYGISE---GARNLGPAKVGERELREVFLFPFEVAIKEGQAG 238

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M AY+ ++G+P  + + LL+K  R +WGF GY+ SD  A+ ++ +    AK  ++A V
Sbjct: 239 SLMNAYHEIDGVPCASSKFLLTKILRWEWGFKGYVVSDYIAIRMLENFHRVAKDAKEAAV 298

Query: 321 DVLKAGMDVNCGSF--LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
             L+AG+D+   S     +    AVK+  + E  I+ ++  +   +  LGLF+G+    P
Sbjct: 299 LALEAGIDIELPSVDCYGEPLIQAVKEGLISEEVINASVERVLRAKFMLGLFDGDLEKDP 358

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
             K   D+   P  + L+ + A+  IVLLKN  G+LPL K+   ++A+IGPNA++ + L 
Sbjct: 359 --KKVYDIFDKPEFRELSREVARRSIVLLKND-GILPLSKNIR-TVAVIGPNADNPRNLH 414

Query: 439 GNYAG----PS---------------CRSITPLQALQNYVE---NTVYYPGCDTVACSSA 476
           G+Y+     PS                R+++ L+ ++N V      +Y  GC+ ++ S  
Sbjct: 415 GDYSYTAHIPSVSETLEGVKIPEECAVRTVSILEGIKNKVSAETQVLYAKGCEILSDSKE 474

Query: 477 SIDKAVDIAKGADHVVLMMG-----LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKP 531
             D+A++IAK AD ++ +MG       +    E  DR  L L G Q++L+  + +   KP
Sbjct: 475 GFDEAIEIAKRADVIIAVMGEESGLFHRGISGEGNDRTTLELFGIQRDLLRELHKLG-KP 533

Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           ++LVL+ G P  + +     N+ +IL A YPGE G  A+A+VIFGD+NP
Sbjct: 534 IVLVLVNGRPQALKWEH--ENLNAILEAWYPGEEGGDAVADVIFGDYNP 580


>gi|393782348|ref|ZP_10370533.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673619|gb|EIY67078.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
           CL02T12C01]
          Length = 852

 Score =  258 bits (659), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 160/441 (36%), Positives = 229/441 (51%), Gaps = 38/441 (8%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           F     P  +R  DL+SRLT++EKIS LVN A  IPRL I  Y   +EALHG+   G+  
Sbjct: 30  FRDMNAPQHERLLDLLSRLTIEEKISLLVNDAREIPRLNIDKYYHGNEALHGIVRPGE-- 87

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN---------AGQAIGMT 155
                    T FPQ I  AA+++  L + +  AI  EAR  +          AG +  +T
Sbjct: 88  --------FTVFPQAIGLAATWNPGLIFEVSSAISDEARGRWKELDYGKKQIAGASDLLT 139

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW+P +N+ RDPRWGR  ET GEDP +TG     +V+G+QGD          L+  +  K
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLTGVIGCEFVKGLQGDH------PRYLKTVSTPK 193

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           HF A    N +   R   +AR++ +DL + Y P FE C+   +A  IM AYN VNG+P  
Sbjct: 194 HFAA----NNEEHNRSSCNARMSERDLREFYLPSFERCIVDAKAQSIMMAYNAVNGVPCT 249

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
            +  L+    R  WGF+GYI SDC A   +     Y +  + A    +KAG+D+ CG  +
Sbjct: 250 VNTYLIKNVLRGDWGFNGYIVSDCSAPEWMVTKHKYVRDLDAAATLAIKAGLDLECGDRV 309

Query: 336 QKH-TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
                  A  +  + +++ID A + +   RM LGLF+ +P+  P+ +I   V+    HQ 
Sbjct: 310 YTAPLLKAYNESMVSKADIDSAAYRVLRGRMLLGLFD-DPSQNPYNQIEPSVIGCKKHQE 368

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQ 453
           LAL+ A+  +VLLKN    LPL   K  S+A++G NA   +   G+Y+G P    ++ L 
Sbjct: 369 LALETARQSMVLLKNQKNFLPLNLKKVKSIAVVGINAGHCE--FGDYSGIPKNAPVSVLD 426

Query: 454 ALQNYVE----NTVYYPGCDT 470
            ++ Y E      VY P   T
Sbjct: 427 GIRKYAEKANVEVVYAPWVST 447



 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 59/97 (60%), Gaps = 3/97 (3%)

Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
           AK  D  V ++G++++ E+E  DR  L LP  QQE I  + +     V+ VL+ G  + I
Sbjct: 601 AKECDVTVAVLGINKSIEREGQDRYSLELPTDQQEFIRELYKVNPNTVV-VLVAGSSLAI 659

Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +   D N+ +IL A YPGE G  A+AEV+FGD+NPG
Sbjct: 660 NWI--DENVPAILNAWYPGEQGGTAIAEVLFGDYNPG 694


>gi|365121891|ref|ZP_09338802.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363644131|gb|EHL83433.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 855

 Score =  258 bits (659), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 163/424 (38%), Positives = 237/424 (55%), Gaps = 34/424 (8%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           PI  R  DL+SR+T++EK+S ++++AP IPRL I  Y   +EALHG+   GK        
Sbjct: 38  PIHDRVMDLLSRMTVEEKVSLMIHNAPGIPRLEIDKYYHGNEALHGIVRPGK-------- 89

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA---------GQAIGMTFWAPNI 161
              T FPQ I  AAS++  L Y+I  AI  EAR  +NA         G +  ++FW+P +
Sbjct: 90  --FTVFPQAIGMAASWNPELIYKISTAISDEARGKWNALGLGKKQLDGSSDLLSFWSPTV 147

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           N+ RDPRWGR  ET GEDP +TG    ++V+G+QG   N  K    L+A A  KHF A  
Sbjct: 148 NMARDPRWGRTPETYGEDPHLTGTLGCAFVKGLQG---NHPKY---LKAVATPKHFAA-- 199

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             N +   R   +A ++ +DL + Y P FE C+ +G+A  IM AYN VNGIP   +  L+
Sbjct: 200 --NNEEHNRAHCNAVISERDLREYYLPSFEKCIVEGKAQSIMTAYNAVNGIPCTVNTYLI 257

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
            K  R  WGF GY+ +DC A + +     Y K  E A V + KAG D+ C  +   +   
Sbjct: 258 KKVLREDWGFQGYVVTDCSAPAWMVTQHKYVKDYETAAVLMAKAGSDMECADNVYTQPLL 317

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A    ++ +++ID   ++L   RM LGLF+ +P   P+ KI  + V    HQ LAL+ A
Sbjct: 318 NAYYNYRVSDADIDSIAYHLLRGRMLLGLFD-DPEKNPYNKISPEKVGCKEHQELALETA 376

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYV 459
           +  +VLLKN +  LP+   K  S+A++G NA+  +   G+Y+G P   S+T L+ ++  V
Sbjct: 377 RQSLVLLKNENNFLPINPKKIKSIAVVGINADRCE--FGDYSGTPVNESVTVLEGIKRLV 434

Query: 460 ENTV 463
            + V
Sbjct: 435 GDQV 438



 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 58/97 (59%), Gaps = 3/97 (3%)

Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
           AK  D  V ++G+D++ E+E  DR  L LP  QQE I  + +   K V+ VL+ G  + I
Sbjct: 603 AKECDITVAVLGIDKSIEREGQDRYTLELPADQQEFIREIYKINPKTVV-VLVAGSSIAI 661

Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            +   D NI +I+ A YPGE G  A+AE +FG +NPG
Sbjct: 662 NWI--DENIPAIIDAWYPGEQGGTAVAEALFGKYNPG 696


>gi|423683845|ref|ZP_17658684.1| glycoside hydrolase family 3 [Bacillus licheniformis WX-02]
 gi|383440619|gb|EID48394.1| glycoside hydrolase family 3 [Bacillus licheniformis WX-02]
          Length = 981

 Score =  258 bits (659), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 173/508 (34%), Positives = 257/508 (50%), Gaps = 36/508 (7%)

Query: 12  LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
           L  +C        D  QP  +   S      +PF   +LP+ +R  DL+SRLTL+EK+S 
Sbjct: 10  LGAMCTILSAAGFDPPQP--AAAKSQADEYEYPFQDPSLPVQKRVSDLLSRLTLEEKVSL 67

Query: 72  LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
           +    PA+PRLGIPA++  +EALHGVA +G+          AT FPQ    A ++D YL 
Sbjct: 68  MHQYQPAVPRLGIPAFKTGTEALHGVAWLGE----------ATVFPQAFGLAHTWDRYLI 117

Query: 132 YRIGQAIGLEARALYNAGQAI-GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
            +IG A+G E R  ++   A  G+  WAP +++ RDPRWGR +E   EDP +TG+ A +Y
Sbjct: 118 KQIGSAVGDEVRGFHHLDPAANGVNVWAPVVDLLRDPRWGRNEEGYSEDPFLTGEIATAY 177

Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
             G++GD          L+     KHF AY+ +  +G +    D R    ++ + Y  PF
Sbjct: 178 ASGLRGDH------PFYLKTVPTLKHFLAYNNETDRGFSSSSIDPR----NMHEYYLKPF 227

Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW-GFHGYITSDCDAVSIIYDAE 309
           E+ + +  A G+M AYN VN  P+     LL  T +R+W G   +I SD    S I +  
Sbjct: 228 ETAISKKAAYGLMPAYNSVNDKPAILS-PLLDSTVKRRWAGDDFFIVSDAFDPSGIVNDH 286

Query: 310 GYAKSPEDAVVDVLKAGMD--VNCG---SFLQKHTKAAVKQKKLPESEIDRALHNLFSVR 364
            Y  S E A    +KAG+D   + G      +     A+KQ  + E ++D+AL N FS+R
Sbjct: 287 KYYDSHEKAHAHAVKAGIDNFTDQGENPELTRNALTGALKQGLISEKDLDQALANTFSIR 346

Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSL 424
            R G F+ +  + P+ ++  DV+ SP HQ+LA +AA+  IVLLKN   LLP    K+ S+
Sbjct: 347 FRTGEFDPD-ELNPYSRLTDDVINSPKHQLLAKKAAEKAIVLLKNDQDLLPFNARKNESI 405

Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAV 482
           A+IGP  N+       Y+G      TPL  + + +  +   +  G +     SA   K V
Sbjct: 406 AVIGPFGNALYEDW--YSGTMPYRKTPLDEIADKIGKDRVSFAEGLEQSGFKSAPTGKFV 463

Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVD 510
              +      L    D+ ++    D  D
Sbjct: 464 TAGRDGKQ-PLTAAADKLEKSTAFDAAD 490



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 61/104 (58%), Gaps = 4/104 (3%)

Query: 479 DKAVDIAKGADHVVLMMGLD-QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           D+AV +AK AD  ++ +G +     +E  DR D+ LP  Q+ L+  V++A     +LV+ 
Sbjct: 584 DEAVQLAKSADKAIVFVGNNPYINGRETEDRKDITLPPAQENLLKAVSKANTN-TVLVVT 642

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
              P  + +AK   +I +I+++ + G+    ALA++++GD NPG
Sbjct: 643 SSYPFALNWAK--AHIPAIIYSAHGGQEAGSALADILYGDVNPG 684


>gi|52081845|ref|YP_080636.1| glycoside hydrolase family 3 [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|404490730|ref|YP_006714836.1| glycoside hydrolase [Bacillus licheniformis DSM 13 = ATCC 14580]
 gi|52005056|gb|AAU24998.1| putative Glycoside Hydrolase Family 3 [Bacillus licheniformis DSM
           13 = ATCC 14580]
 gi|52349736|gb|AAU42370.1| putative glycoside hydrolase [Bacillus licheniformis DSM 13 = ATCC
           14580]
          Length = 981

 Score =  258 bits (658), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 173/508 (34%), Positives = 257/508 (50%), Gaps = 36/508 (7%)

Query: 12  LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
           L  +C        D  QP  +   S      +PF   +LP+ +R  DL+SRLTL+EK+S 
Sbjct: 10  LGAMCTILSAAGFDPPQP--AAAKSQADEYEYPFQDPSLPVQKRVSDLLSRLTLEEKVSL 67

Query: 72  LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
           +    PA+PRLGIPA++  +EALHGVA +G+          AT FPQ    A ++D YL 
Sbjct: 68  MHQYQPAVPRLGIPAFKTGTEALHGVAWLGE----------ATVFPQAFGLAHTWDRYLI 117

Query: 132 YRIGQAIGLEARALYNAGQAI-GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
            +IG A+G E R  ++   A  G+  WAP +++ RDPRWGR +E   EDP +TG+ A +Y
Sbjct: 118 KQIGSAVGDEVRGFHHLDPAANGVNVWAPVVDLLRDPRWGRNEEGYSEDPFLTGEIATAY 177

Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
             G++GD          L+     KHF AY+ +  +G +    D R    ++ + Y  PF
Sbjct: 178 ASGLRGDH------PFYLKTVPTLKHFLAYNNETDRGFSSSSIDPR----NMHEYYLKPF 227

Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW-GFHGYITSDCDAVSIIYDAE 309
           E+ + +  A G+M AYN VN  P+     LL  T +R+W G   +I SD    S I +  
Sbjct: 228 ETAISKKAAYGLMPAYNSVNDKPAILS-PLLDSTVKRRWAGDDFFIVSDAFDPSGIVNDH 286

Query: 310 GYAKSPEDAVVDVLKAGMD--VNCG---SFLQKHTKAAVKQKKLPESEIDRALHNLFSVR 364
            Y  S E A    +KAG+D   + G      +     A+KQ  + E ++D+AL N FS+R
Sbjct: 287 KYYDSHEKAHAHAVKAGIDNFTDQGENPELTRNALTGALKQGLISEKDLDQALANTFSIR 346

Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSL 424
            R G F+ +  + P+ ++  DV+ SP HQ+LA +AA+  IVLLKN   LLP    K+ S+
Sbjct: 347 FRTGEFDPD-ELNPYSRLTDDVINSPKHQLLAKKAAEKAIVLLKNDQDLLPFNARKNESI 405

Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAV 482
           A+IGP  N+       Y+G      TPL  + + +  +   +  G +     SA   K V
Sbjct: 406 AVIGPFGNALYEDW--YSGTMPYRKTPLDEIADKIGKDRVSFAEGLEQSGFKSALTGKFV 463

Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVD 510
              +      L    D+ ++    D  D
Sbjct: 464 TAGRDGKQ-PLTAAADKLEKSAAFDAAD 490



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 61/104 (58%), Gaps = 4/104 (3%)

Query: 479 DKAVDIAKGADHVVLMMGLD-QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           D+AV +AK AD  ++ +G +     +E  DR D+ LP  Q+ L+  V++A     +LV+ 
Sbjct: 584 DEAVQLAKSADKAIVFVGNNPYINGRETEDRKDITLPPAQENLLKAVSKANTN-TVLVVT 642

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
              P  + +AK   +I +I+++ + G+    ALA++++GD NPG
Sbjct: 643 SSYPFALNWAK--AHIPAIIYSAHGGQEAGSALADILYGDVNPG 684


>gi|319647763|ref|ZP_08001981.1| hypothetical protein HMPREF1012_03020 [Bacillus sp. BT1B_CT2]
 gi|317390104|gb|EFV70913.1| hypothetical protein HMPREF1012_03020 [Bacillus sp. BT1B_CT2]
          Length = 981

 Score =  258 bits (658), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 173/508 (34%), Positives = 257/508 (50%), Gaps = 36/508 (7%)

Query: 12  LLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQ 71
           L  +C        D  QP  +   S      +PF   +LP+ +R  DL+SRLTL+EK+S 
Sbjct: 10  LGAMCTILSAAGFDPPQP--AAAKSQADEYEYPFQDPSLPVQKRVSDLLSRLTLEEKVSL 67

Query: 72  LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
           +    PA+PRLGIPA++  +EALHGVA +G+          AT FPQ    A ++D YL 
Sbjct: 68  MHQYQPAVPRLGIPAFKTGTEALHGVAWLGE----------ATVFPQAFGLAHTWDRYLI 117

Query: 132 YRIGQAIGLEARALYNAGQAI-GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
            +IG A+G E R  ++   A  G+  WAP +++ RDPRWGR +E   EDP +TG+ A +Y
Sbjct: 118 KQIGSAVGDEVRGFHHLDPAANGVNVWAPVVDLLRDPRWGRNEEGYSEDPFLTGEIATAY 177

Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
             G++GD          L+     KHF AY+ +  +G +    D R    ++ + Y  PF
Sbjct: 178 ASGLRGDH------PFYLKTVPTLKHFLAYNNETDRGFSSSSIDPR----NMHEYYLKPF 227

Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW-GFHGYITSDCDAVSIIYDAE 309
           E+ + +  A G+M AYN VN  P+     LL  T +R+W G   +I SD    S I +  
Sbjct: 228 ETAISKKAAYGLMPAYNSVNDKPAILS-PLLDSTVKRRWAGDDFFIVSDAFDPSGIVNDH 286

Query: 310 GYAKSPEDAVVDVLKAGMD--VNCG---SFLQKHTKAAVKQKKLPESEIDRALHNLFSVR 364
            Y  S E A    +KAG+D   + G      +     A+KQ  + E ++D+AL N FS+R
Sbjct: 287 KYYDSHEKAHAHAVKAGIDNFTDQGENPELTRNALTGALKQGLISEKDLDQALANTFSIR 346

Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSL 424
            R G F+ +  + P+ ++  DV+ SP HQ+LA +AA+  IVLLKN   LLP    K+ S+
Sbjct: 347 FRTGEFDPD-ELNPYSRLTDDVINSPKHQLLAKKAAEKAIVLLKNDQDLLPFNARKNESI 405

Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAV 482
           A+IGP  N+       Y+G      TPL  + + +  +   +  G +     SA   K V
Sbjct: 406 AVIGPFGNALYEDW--YSGTMPYRKTPLDEIADKIGKDRVSFAEGLEQSGFKSALTGKFV 463

Query: 483 DIAKGADHVVLMMGLDQTQEKEELDRVD 510
              +      L    D+ ++    D  D
Sbjct: 464 TAGRDGKQ-PLTAAADKLEKSAAFDAAD 490



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 61/104 (58%), Gaps = 4/104 (3%)

Query: 479 DKAVDIAKGADHVVLMMGLD-QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           D+AV +AK AD  ++ +G +     +E  DR D+ LP  Q+ L+  V++A     +LV+ 
Sbjct: 584 DEAVQLAKSADKAIVFVGNNPYINGRETEDRKDITLPPAQENLLKAVSKANTN-TVLVVT 642

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
              P  + +AK   +I +I+++ + G+    ALA++++GD NPG
Sbjct: 643 SSYPFALNWAK--AHIPAIIYSAHGGQEAGSALADILYGDVNPG 684


>gi|323452325|gb|EGB08199.1| hypothetical protein AURANDRAFT_4612, partial [Aureococcus
           anophagefferens]
          Length = 262

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 134/266 (50%), Positives = 171/266 (64%), Gaps = 10/266 (3%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGV-AGV 100
           + PFC+  LP ++R  DLV R+  DE    LVN A  +PRL +  + WWSEALHGV AG 
Sbjct: 2   SLPFCRGDLPRAKRVADLVDRVEPDEVQKLLVNGAAGLPRLWLQPHNWWSEALHGVQAGC 61

Query: 101 GKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPN 160
            +    +G+ R  T FP  I TAASF++ L+  +G  IG EARAL N G   G TFW+PN
Sbjct: 62  AQAD--DGSSRCPTGFPAAISTAASFNATLFRAVGSVIGAEARALANEGVTNGFTFWSPN 119

Query: 161 INIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAY 220
           +NI RDPRWGRGQETPGEDPL+ GKY  ++V G +    + G     + ASAC KHF AY
Sbjct: 120 LNILRDPRWGRGQETPGEDPLLNGKYGENFVLGFEHPDGSSGD---AIAASACPKHFFAY 176

Query: 221 DLDNW---KGTTRYKFD-ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           +L+N    K   R+ FD A ++  +L  TY PPFE  +  G+ASG+MC+YN VNG PSCA
Sbjct: 177 NLENCFKVKDNCRHTFDMANLSQGELEATYLPPFEQAISSGKASGLMCSYNAVNGTPSCA 236

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAV 302
           +   +   AR +WGF GY+TSDC AV
Sbjct: 237 NAWGIETLARGKWGFEGYVTSDCHAV 262


>gi|330995911|ref|ZP_08319806.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
 gi|329574250|gb|EGG55825.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
          Length = 865

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 159/441 (36%), Positives = 230/441 (52%), Gaps = 33/441 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+    L   +RA DL+ RLTL EK+S ++NS+PA+ RLGI  Y WWSEALHGVA  G  
Sbjct: 24  PYKNPDLSPRERAEDLLKRLTLKEKVSLMINSSPAVERLGIKPYNWWSEALHGVARSGI- 82

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--------MT 155
                    AT +P  +  A+ FD      +   +  EARA Y+     G        +T
Sbjct: 83  ---------ATVYPITMGMASVFDDEAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLT 133

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW PN+NIFRDPRWGRGQET GEDP +T +  V+ VRG+QG          KL   AC K
Sbjct: 134 FWTPNVNIFRDPRWGRGQETWGEDPYLTSRMGVAVVRGLQGPA---DAKYDKLH--ACAK 188

Query: 216 HFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           H+  +     K   R+ FD  ++  +DL +TY P F+  V++     +MCAY R  G P 
Sbjct: 189 HYAVHSGPEAK---RHYFDVEQLDPRDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPC 245

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCG 332
           C  + LL +  R +WGF   + SDC A+   ++   +   P+   A    + +G D+ CG
Sbjct: 246 CGSKRLLHQILRDEWGFKYVVVSDCGAIGDFFNPGLHETHPDAATASASAVTSGTDLECG 305

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
                  +AAV +  + E  ID +L  L   R  LG  + + ++ P+ +IG D V    H
Sbjct: 306 WGDYMQLEAAVDRGLITEHRIDTSLCRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTH 364

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + +AL  A+  +VLL N  G+LPL K++   + ++GPNA  +    GNY G    + T L
Sbjct: 365 KQMALDIARKSLVLLHND-GVLPLDKTRG-DVVVMGPNAVDSVMQWGNYEGTPSHTYTVL 422

Query: 453 QALQNYVENTVYYP-GCDTVA 472
             ++  +   V Y  GC+ ++
Sbjct: 423 DGIRERLGRDVRYEKGCNLLS 443



 Score = 47.0 bits (110), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 34/106 (32%), Positives = 53/106 (50%), Gaps = 14/106 (13%)

Query: 486 KGADHVVLMMGLDQTQEKEEL-----------DRVDLVLPGRQQELITRVAEAAKKPVIL 534
           K A  VV + G+    E EE            DR  + LP  Q++++  +  A KK  ++
Sbjct: 599 KDAGTVVFVGGISPDLEGEEKNHVNCPGFAGGDRTSIELPQVQRDILKALKAAGKK--VV 656

Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + C G   +       +  +IL A YPG+AG +A+A+V+FGD NP
Sbjct: 657 FVNCSGSA-MALVPELESCDAILQAWYPGQAGGLAVADVLFGDFNP 701


>gi|323451833|gb|EGB07709.1| hypothetical protein AURANDRAFT_64764 [Aureococcus anophagefferens]
          Length = 819

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 202/589 (34%), Positives = 296/589 (50%), Gaps = 80/589 (13%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +   +LP + R   L   + L++ I QLVN+APA+  + +PAY W ++  HGV G     
Sbjct: 71  YLDASLPEADRLAWLADNVPLEDMIGQLVNAAPAVDAVDLPAYNWLNDNEHGVKGTAH-- 128

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA--------------GQ 150
                   AT +P      AS+   L +R+G AIG E+RA +N               G+
Sbjct: 129 --------ATVYPMGASLGASWSVDLAWRVGAAIGNESRATHNGLADKSGNACGSTSTGE 180

Query: 151 AI----GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGKL 204
            +    G+T +APN+N+ RDPRWGR +E  GEDP +T + AV  V G+QG  +    G  
Sbjct: 181 VVANGCGITLYAPNVNLVRDPRWGRAEEVYGEDPHLTAELAVGMVTGLQGNAEGSTSGPG 240

Query: 205 KGKLQASACCKHFTAY-------DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQG 257
            G L   ACCKHF A+       DL       R   DA V+ +DL +TY P  ++CV + 
Sbjct: 241 GGPLVTGACCKHFAAHFAVYQNEDLP----ADRMVLDANVSSRDLWETYLPVMKACVVRA 296

Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED 317
           +A+        VNG P+CA   LL+   R  WGF G++ SD DA S +     Y  + E+
Sbjct: 297 KAT-------HVNGKPTCAHPELLNDVLRESWGFDGFVVSDYDAWSNLVTTHKYVSTWEE 349

Query: 318 AVVDVLKAGMDVNCGSFLQKHTKA---AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
           A    + AGMD   G        A   AV+   +  + + R+   L  VR+RLG+F+   
Sbjct: 350 AAAAGINAGMDQEGGFGDYSPVDALPDAVRNGTVAAATVRRSFERLMRVRLRLGMFDPPA 409

Query: 375 TMQPFGK-IGADVVCSPAHQV-LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN 432
           +   +G+    D  C  A ++ LA +AA++GIVL KN+ G LPL  +K   +AL+GP  +
Sbjct: 410 STAVYGEAYQCDYQCETAAKLALAREAAREGIVLFKNAGGALPL--AKGARIALVGPQVD 467

Query: 433 SAKTLLG--NYA---GPSCRSITPLQALQNYVENTVYYPGCDTVACSS-ASIDKAVDIAK 486
             + LLG  NYA   GP    +T  + L+  V N     GCD+VAC++   +D A  +A 
Sbjct: 468 DWRVLLGAVNYAFEDGPDVAPVTIQKGLEA-VANVSVAAGCDSVACAALVDVDGAKRLAA 526

Query: 487 GADHVVLMMG---------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKP 531
            AD  V+++G                D   E E  DR  + LPG Q  L+  +  A+ + 
Sbjct: 527 AADATVVVLGDSFGATDGWPLCRGTRDDGCESESHDRATIELPGEQVALVAALRAASSR- 585

Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           ++ VL+ GG V +  A  D +    LW   PG+ G  ALA+V+FGD++P
Sbjct: 586 LVCVLVHGGAVALGAAADDCDAVLDLWV--PGQMGGAALADVLFGDYSP 632


>gi|409196584|ref|ZP_11225247.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
           21150]
          Length = 1272

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 153/447 (34%), Positives = 234/447 (52%), Gaps = 29/447 (6%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           +  T     +RA DLVSR+T +EK SQ  N+ PAIPRLG+  Y +W EALHG+ G     
Sbjct: 33  YLNTAYSFEERAADLVSRMTPEEKQSQFGNTMPAIPRLGVNGYNFWGEALHGIMGRN--- 89

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
             N +   ATSFP  +   A++D  L  R  + I  EAR  +N      +T+W+P I   
Sbjct: 90  --NNSGMTATSFPNSVAAGATWDPELIKRETEVISDEARG-FNHDLIFTLTYWSPVIEPA 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGR  ET GEDP +  +    +++G+ GD          L+   C KH+ A    N
Sbjct: 147 RDPRWGRTAETFGEDPFLVSEIGKGFIQGMMGDD------PRYLKTVPCGKHYLA----N 196

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
                R+   + +  +D+ + Y  P+ + ++      IM +YN VNG+P  A + L+   
Sbjct: 197 NTEFNRHSGSSELDQRDMYEYYLLPYRTLIRDYNLPSIMTSYNAVNGVPVSASKFLVDTI 256

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVK 344
           AR+ +G  GY+TSDC A+  I     YA +  +A    + +G+D++CG   Q +   A++
Sbjct: 257 ARKLYGLDGYVTSDCGAIDDIVRGHHYADNYTEAAAMGISSGVDIDCGGVYQNNALKALE 316

Query: 345 QKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGI 404
              + + E+D+AL N+F+ RMR G F+    M P+  I    V  P+H  LA++ A    
Sbjct: 317 AGLITQGELDKALINIFTTRMRTGEFDPK-GMVPYAGIKPHKVNDPSHHDLAIEIATKTP 375

Query: 405 VLLKNSH------GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP--SCRSITPLQALQ 456
           VLLKN++        LP+  SK   +A++GP A+  +  LG+Y+GP      ITPL  +Q
Sbjct: 376 VLLKNNNVANTEKKALPIDLSKIKKIAVLGPQADLVE--LGDYSGPVEPDLRITPLAGIQ 433

Query: 457 NYVENTVYYPGCDTVACSSASIDKAVD 483
           NY++   +    + V+ S  +  K  D
Sbjct: 434 NYIKK--HNLNTEVVSTSGGNTSKKTD 458



 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/102 (40%), Positives = 62/102 (60%), Gaps = 1/102 (0%)

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCG 539
           + +D+A  AD  ++ +G DQ+  +EE DR  + LPG Q E+I  VA+A     I+V+   
Sbjct: 600 ETLDMAASADVAIVFVGTDQSTGREESDRFSITLPGNQNEVIKAVADANPN-TIVVMQTM 658

Query: 540 GPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           G V++   K   NI  I+W GY G+A   A+A+++FGD NPG
Sbjct: 659 GMVEVEEFKNHPNIKGIIWTGYNGQAQGTAMAKILFGDVNPG 700


>gi|330996730|ref|ZP_08320605.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella
           xylaniphila YIT 11841]
 gi|329572575|gb|EGG54218.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
          Length = 725

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 160/446 (35%), Positives = 230/446 (51%), Gaps = 35/446 (7%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           S+P+ +  P+    L   +RA DL+ RLTL EKIS + N +P + RLGI  Y WWSEALH
Sbjct: 17  SSPAQDE-PYKNPDLSPQERADDLLKRLTLKEKISLMQNQSPGVERLGIKPYNWWSEALH 75

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG-- 153
           GVA  G           AT +P  +  A+ FD  L   I   +  E RA ++  +  G  
Sbjct: 76  GVARNGL----------ATVYPITMGMASVFDDKLIEDIYVTVSDEGRAKFHDARRHGRY 125

Query: 154 ------MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGK 207
                 +TFW PN+NIFRDPRWGRGQET GEDP +T +  V+ V+G+QG      K    
Sbjct: 126 GRGNEGLTFWNPNVNIFRDPRWGRGQETWGEDPYLTTRMGVAVVQGMQGPA--DAKYD-- 181

Query: 208 LQASACCKHFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAY 266
            +  AC KH+  +     K   R+ FD   +  +DL +TY P F++ V++     +MCAY
Sbjct: 182 -KTHACAKHYAVHSGPEAK---RHSFDVENLEPRDLWETYLPAFKALVQEADVKEVMCAY 237

Query: 267 NRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA- 325
            R  G P C    LL++  R +WG+   + SDC A+S  +  +G  ++  DA      A 
Sbjct: 238 QRFEGEPCCGSNRLLTQILRDEWGYKHLVVSDCGAISDFF-YQGRHETHPDAATSSASAV 296

Query: 326 --GMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG 383
             G D+ CG     H   AV++  + E  ID +L  L   R  LG  + +  + P+ +I 
Sbjct: 297 INGTDLECG-VEYAHLDEAVERGLITEHRIDTSLRRLLEARFALGEMD-DDALVPWSRIS 354

Query: 384 ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
            D V    H+ +AL   +  +VLL N+ G+LPL K     +A++GPNA  +    GNY G
Sbjct: 355 IDTVDCDMHRQMALDVTRKSMVLLHNN-GILPLDKGDVGKIAVMGPNAVDSVMQWGNYKG 413

Query: 444 PSCRSITPLQALQNYVENTVYYPGCD 469
               + T L+ ++  V N  Y  GC+
Sbjct: 414 VPAHTYTILEGIRMEVGNVPYEKGCE 439



 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 57/112 (50%), Gaps = 14/112 (12%)

Query: 480 KAVDIAKGADHVVLMMGLDQTQEKEEL-----------DRVDLVLPGRQQELITRVAEAA 528
           + V+  K A+ ++ + G+    E E+            DR  + LP  Q++++  +  A 
Sbjct: 592 EVVERVKDAETIIFVGGISPNLEGEDKYFVYCPGFAGGDRTSIELPQVQRDILKALKAAG 651

Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           KK  ++ + C G   +       +  +IL A YPG+AG +A+A+V+FGD NP
Sbjct: 652 KK--VVFVNCSGSA-VALVPELESCDAILQAWYPGQAGGLAVADVLFGDFNP 700


>gi|357046682|ref|ZP_09108302.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
           YIT 11840]
 gi|355530484|gb|EHG99896.1| glycosyl hydrolase family 3 protein, partial [Paraprevotella clara
           YIT 11840]
          Length = 677

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 160/441 (36%), Positives = 228/441 (51%), Gaps = 33/441 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+    L   +RA DL+ RLTL EK S ++NS+PA+ RLGI  Y WWSEALHGVA  G  
Sbjct: 24  PYKNPDLSPRERAEDLLKRLTLKEKASLMINSSPAVERLGIKPYNWWSEALHGVARSGV- 82

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--------MT 155
                    AT +P  +  A+ FD      +   +  EARA Y+     G        +T
Sbjct: 83  ---------ATVYPITMGMASVFDDKAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLT 133

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW PN+NIFRDPRWGRGQET GEDP +T +  V+ VRG+QG          KL   AC K
Sbjct: 134 FWTPNVNIFRDPRWGRGQETWGEDPYLTSRMGVAVVRGLQGPA---DAKYDKLH--ACAK 188

Query: 216 HFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           H+  +     K   R+ FD  ++  +DL +TY P F+  V++     +MCAY R  G P 
Sbjct: 189 HYAVHSGPEAK---RHYFDVEQLDPRDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPC 245

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCG 332
           C    LL +  R +WGF   + SDC A+   ++   +   P+   A    + +G D+ CG
Sbjct: 246 CGSERLLHQILRDEWGFKYVVVSDCGAIGDFFNPGLHETHPDAATASASAVTSGTDLECG 305

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
                  +AAV +  + E  ID +L  L   R  LG  + + ++ P+ +IG D V    H
Sbjct: 306 WGDYMQLEAAVDRGLITEHRIDTSLCRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTH 364

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + +AL  A+  +VLL N  G+LPL K++   + ++GPNA  +    GNY G    + T L
Sbjct: 365 KQMALDIARKSLVLLHND-GVLPLDKTRG-DVVVMGPNAVDSVMQWGNYEGTPSHTYTVL 422

Query: 453 QALQNYVENTVYYP-GCDTVA 472
             ++  +   V Y  GCD ++
Sbjct: 423 DGIRERLGRDVRYEKGCDLLS 443


>gi|332877556|ref|ZP_08445303.1| glycosyl hydrolase family 3 protein, partial [Capnocytophaga sp.
           oral taxon 329 str. F0087]
 gi|332684662|gb|EGJ57512.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
          Length = 676

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 160/441 (36%), Positives = 228/441 (51%), Gaps = 33/441 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+    L   +RA DL+ RLTL EK S ++NS+PA+ RLGI  Y WWSEALHGVA  G  
Sbjct: 24  PYKNPDLSPRERAEDLLKRLTLKEKASLMINSSPAVERLGIKPYNWWSEALHGVARSGV- 82

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIG--------MT 155
                    AT +P  +  A+ FD      +   +  EARA Y+     G        +T
Sbjct: 83  ---------ATVYPITMGMASVFDDKAVEAVYTTVSDEARAKYHDAHKKGRYGQMNEGLT 133

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW PN+NIFRDPRWGRGQET GEDP +T +  V+ VRG+QG          KL   AC K
Sbjct: 134 FWTPNVNIFRDPRWGRGQETWGEDPYLTSRMGVAVVRGLQGPA---DAKYDKLH--ACAK 188

Query: 216 HFTAYDLDNWKGTTRYKFDA-RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           H+  +     K   R+ FD  ++  +DL +TY P F+  V++     +MCAY R  G P 
Sbjct: 189 HYAVHSGPEAK---RHYFDVEQLDPRDLWETYLPAFKVLVQEADVKEVMCAYQRFEGEPC 245

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE--DAVVDVLKAGMDVNCG 332
           C    LL +  R +WGF   + SDC A+   ++   +   P+   A    + +G D+ CG
Sbjct: 246 CGSERLLHQILRDEWGFKYVVVSDCGAIGDFFNPGLHETHPDAATASASAVTSGTDLECG 305

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
                  +AAV +  + E  ID +L  L   R  LG  + + ++ P+ +IG D V    H
Sbjct: 306 WGDYMQLEAAVDRGLITEHRIDTSLCRLLEARFALGEMD-DDSLVPWSRIGIDTVDCQTH 364

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + +AL  A+  +VLL N  G+LPL K++   + ++GPNA  +    GNY G    + T L
Sbjct: 365 KQMALDIARKSLVLLHND-GVLPLDKTRG-DVVVMGPNAVDSVMQWGNYEGTPSHTYTVL 422

Query: 453 QALQNYVENTVYYP-GCDTVA 472
             ++  +   V Y  GCD ++
Sbjct: 423 DGIRERLGRDVRYEKGCDLLS 443


>gi|62733930|gb|AAX96039.1| Similar to F28K19.27 [Oryza sativa Japonica Group]
          Length = 244

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 120/193 (62%), Positives = 146/193 (75%), Gaps = 9/193 (4%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           FC   L   QRA DLV+ LTL EK+SQL + A  + RLG+PAYEWWSE LHG++  G+GI
Sbjct: 50  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 109

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
            FNGT+R  TSFPQVILTAA+FD+ LW R+G+A+G EARALYN GQA G+T W+PN+NIF
Sbjct: 110 RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 169

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRGQETPGEDP+   +YAV++V G+QG    GG      +ASACCKH TAYDLD 
Sbjct: 170 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQGI---GG------EASACCKHATAYDLDY 220

Query: 225 WKGTTRYKFDARV 237
           W    RY +D++V
Sbjct: 221 WNNVVRYNYDSKV 233


>gi|253574725|ref|ZP_04852065.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251845771|gb|EES73779.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 953

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 163/455 (35%), Positives = 246/455 (54%), Gaps = 33/455 (7%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           +T  +PF    LP+++R  DLVSR TLDEKI  +      +PRLG+  Y+  +E  HGVA
Sbjct: 4   NTYQYPFQNPDLPLAERVDDLVSRFTLDEKIELMCQFQTEVPRLGVKKYKHGTEGAHGVA 63

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFW 157
            +G+          ATSFPQ    A +++  L  +IG  IG EAR  +     + G+T W
Sbjct: 64  WLGE----------ATSFPQNTGLACTWNPELMRQIGSVIGDEARVYFQRNPEVNGLTIW 113

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           AP +++ RDPRWGR +E  GEDP +TG      V+G+QGD           +A A  KHF
Sbjct: 114 APTVDMERDPRWGRTEEAYGEDPHLTGALTTELVKGLQGDH------PFYYKAVATLKHF 167

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
            A + +  +G+     D R    +  + Y   FE   ++GRA  +M AYN +NG P C  
Sbjct: 168 YANNNEIGRGSESVSIDPR----NKREYYLKAFERAFREGRAGSMMTAYNGINGTP-CNL 222

Query: 278 RNLLSKTARRQWGFHGYITSDC-DAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSF 334
            + +++  + +W   G++  D  D +  + D + Y +S  +AV   +KAG+D   +    
Sbjct: 223 NSEVNEIVKGEWEMDGFVVGDAGDVMGTVLDHK-YVESYAEAVAGSIKAGIDSMTDEQPI 281

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
           +    + A++Q  L ES++DRAL N F VR RLG F+      P+ ++  + +C+P H  
Sbjct: 282 MFHALRNALEQGLLAESDLDRALRNAFRVRFRLGEFDPE-ERNPYSRVPEEKLCAPEHAA 340

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           LAL+A+++ IVLLKN  GLLPLP++   S+A++GP AN A T    Y+G     ITPLQ 
Sbjct: 341 LALKASREAIVLLKN-EGLLPLPQTLG-SVAVVGPLANEAYTDW--YSGTPPYRITPLQG 396

Query: 455 LQNYVEN--TVYYPGCDTVACSSASIDKAVDIAKG 487
           ++  + +   ++  G D +   SA   K V I  G
Sbjct: 397 IKEKMGDRPVLFGTGLDRIRLRSALTGKYVRITSG 431



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 61/111 (54%), Gaps = 4/111 (3%)

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQ-TQEKEELDRVDLVLPGRQQELITRVAEAAK 529
           V    + I++AV  AK A+  V+ +G +     KE +DRVD+VLP  QQ LI  V  A  
Sbjct: 549 VEIVESGIEQAVQAAKSAETAVVFVGNNPFINAKETIDRVDIVLPPAQQALIQAV-RAVN 607

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
              ++V++   P  + +   + ++ SIL+  +  +    A+ +V+FGD+NP
Sbjct: 608 PNTVVVIVGSYPFAVNWE--NAHVPSILFTSHSAQELGHAVTDVLFGDYNP 656


>gi|373956830|ref|ZP_09616790.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373893430|gb|EHQ29327.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 823

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 191/610 (31%), Positives = 293/610 (48%), Gaps = 97/610 (15%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS------- 91
           +  +T PI  R  DL+ ++TL+EK  QL        I +  +P  EW    W        
Sbjct: 73  YEDSTQPIEARLNDLIGQMTLEEKTCQLATLYGYKRILKDSVPTPEWKNEIWKDGIANID 132

Query: 92  EALHGVAGVGK------------------------------GI---FFNGTIRG-----A 113
           E L+G    GK                              GI   F N  IRG     A
Sbjct: 133 EHLNGFITWGKTSDLPLVTDVKKHVWAMNQTQRFFIEQTRLGIPVDFTNEGIRGVEAYQA 192

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRG 172
           T+FP  +    ++D  L  ++G   G+EARAL       G T  +AP +++ RD RWGR 
Sbjct: 193 TAFPTQLNMGMTWDKPLVNQMGNITGMEARAL-------GYTNVYAPILDVARDQRWGRL 245

Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYK 232
           +E  GEDP +  +  V   +G+Q +           Q +A  KHF  Y  +        +
Sbjct: 246 EEVYGEDPYLVARLGVEMAKGMQQNN----------QIAATAKHFAVYSANKGGREGLAR 295

Query: 233 FDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH 292
            D +V  +++ +    PF+  +K+    G+M +YN  +GIP       L +  R+++GF 
Sbjct: 296 TDPQVAPREVENILLYPFKKVIKEAGLMGVMSSYNDYDGIPISGSSYWLIQRLRQEFGFK 355

Query: 293 GYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG----SFLQKHTKAAVKQKKL 348
           GY+ SD DA+  +Y+    A   +DAV     AGM+V         +  + +  VK+ KL
Sbjct: 356 GYVVSDSDALEYLYNKHHVAADLKDAVYQAFMAGMNVRTTFRTPDSIIIYARQLVKEGKL 415

Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
           P   I+  + ++  V+ +LGLF+ +P +Q   +  A +V   A+Q +ALQA+++ IVLLK
Sbjct: 416 PIDTINSRVRDVLRVKFKLGLFD-HPYVQD-AEASAKLVNCAANQAVALQASKESIVLLK 473

Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYP 466
           N   +LPL  SK  +LA+IGPNA +      +Y   + +SI  L+ +Q  V     +Y  
Sbjct: 474 NKGAILPL--SKQQTLAVIGPNALNDDYAHTHYGPLASKSINILEGIQAKVGAGKVLYAL 531

Query: 467 GCDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
           GC+ V                   A ID AV IA+ AD  V+++G +     E   R  L
Sbjct: 532 GCNLVDKHWPESEILPQDPDQAEQAKIDSAVTIARHADVAVVVLGGNTQTAGENKSRTSL 591

Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
            LPG Q  L+ +  +A  KPV++VL+   P+ I +   D++I  I++AGYPG  G  A+A
Sbjct: 592 DLPGYQLRLV-KAVKATGKPVVVVLIGSQPMTINWI--DQHIDGIIYAGYPGTQGGTAVA 648

Query: 572 EVIFGDHNPG 581
           +V+FGD+NPG
Sbjct: 649 DVLFGDYNPG 658


>gi|337749758|ref|YP_004643920.1| glycoside hydrolase family protein [Paenibacillus mucilaginosus
           KNP414]
 gi|336300947|gb|AEI44050.1| glycoside hydrolase family 3 domain protein [Paenibacillus
           mucilaginosus KNP414]
          Length = 937

 Score =  256 bits (653), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 173/479 (36%), Positives = 254/479 (53%), Gaps = 37/479 (7%)

Query: 39  STET-FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGV 97
           +TET  PF  T LP+ +R +DLVSRLTL+EKI+ +      IPRLGI  Y+  +E  HGV
Sbjct: 2   TTETKMPFQDTRLPLEERVQDLVSRLTLEEKINLMCQYQEEIPRLGIAKYKHGTEGAHGV 61

Query: 98  AGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTF 156
           A +G+          AT FPQ +  A +++  L   IG  IG EAR  Y     I G+T 
Sbjct: 62  AWLGE----------ATVFPQNVGLACTWNPELMREIGSVIGDEARVYYQRNPEINGLTI 111

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           WAP +++ RDPRWGR +E  GEDP +TG+ +   V+G+QGD          L+ +A  KH
Sbjct: 112 WAPTVDMERDPRWGRTEEAYGEDPYLTGRLSTELVKGMQGDH------PFYLKTAATLKH 165

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           F    L N     R +  A +  +++ + Y   FE   ++G A  +M AYN VNG P C 
Sbjct: 166 F----LGNNNEMDRGECSASIDPRNMREYYLKAFEPAFREGGAQSMMTAYNSVNGTP-CN 220

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSF 334
               ++   + +WG  G++ SD   V    +   Y  S  +AV   +++G+D   +    
Sbjct: 221 LNPDVNAIVKGEWGMDGFVVSDAGDVLGTVNEHRYFASYAEAVAASVRSGIDSITDDAGI 280

Query: 335 LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQV 394
             +  + A++   L E ++DRA+ N F VR+RLG F+G P   P+  +    +C+P H  
Sbjct: 281 TLRAVRDALEGGLLAEEDLDRAVGNAFRVRIRLGEFDG-PEENPYAHVPEAKLCAPEHAA 339

Query: 395 LALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA 454
           L+LQAA++G VLLKN  GLLPL  +K  S+A+IGP A+   T    Y+G     ITPL+ 
Sbjct: 340 LSLQAAREGFVLLKN-EGLLPL--AKPASIAVIGPLADVVHTDW--YSGTPPYRITPLEG 394

Query: 455 LQNYVE--NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL-DRVD 510
           +   +     ++  G D +   S    + V I  GAD   L    D T+E+ E+ +R D
Sbjct: 395 VAERMAPGTVIHRTGGDRIRIKSLRTGRYVRIG-GADGFSLTA--DGTEEEAEVFERTD 450


>gi|329928264|ref|ZP_08282164.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328937943|gb|EGG34345.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 934

 Score =  255 bits (652), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 166/474 (35%), Positives = 247/474 (52%), Gaps = 33/474 (6%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +PF    LP+ +R  DLVSRLTLDEKI  +      IPRLG+  Y+  +E  HGVA +G+
Sbjct: 3   YPFQDPDLPLEERVHDLVSRLTLDEKIELMCQYQTEIPRLGVQKYKHGTEGAHGVAWLGE 62

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNI 161
                     AT FPQ    A +++  L   IG  I  EAR  Y   +AI G+T WAP +
Sbjct: 63  ----------ATVFPQNTGLACTWNPELMREIGSVIAEEARVYYQRDRAINGLTIWAPTV 112

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           ++ RDPRWGR +E  GEDP +TG+ +   V+G+QGD           +  A  KHF   +
Sbjct: 113 DLERDPRWGRTEEAYGEDPHLTGQLSTGLVKGMQGDH------PFYYKTVATLKHFYGNN 166

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
            +  +G+     D R    +  + Y   FE+  ++G+A  +M AYN +NG P C   + +
Sbjct: 167 NEADRGSASVSIDPR----NKREYYLKAFEAPFREGKAGSMMTAYNGINGTP-CNLNHEV 221

Query: 282 SKTARRQWGFHGYITSDC-DAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQKH 338
           +   +++WG  G++  D  D +  + D   Y  S  +AV D +KAG+D   +      + 
Sbjct: 222 NDIVKQEWGMDGFVVGDAGDVLGTVMDHH-YVASYAEAVADSVKAGIDSITDDQDISFRA 280

Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
            + A++Q  L E  +D AL N F VR RLG F+      P+ ++    +C+P H  L+L+
Sbjct: 281 LRDALEQGLLEEQHLDHALRNTFRVRFRLGEFDPE-ERNPYSRVPESKLCAPEHAELSLR 339

Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
           AA++ IVLLKN  GLLPLP+    S A+IGP AN A T    Y+G     ITPLQ +Q  
Sbjct: 340 AARESIVLLKND-GLLPLPRDPFKSAAVIGPLANEAFTDW--YSGTPPYRITPLQGVQAK 396

Query: 459 V--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
               +  ++ G D V   SA     V ++   +  +L+ G  ++ +    +R D
Sbjct: 397 AGDRSVQFHTGLDQVRLRSAVSGTYVALSS-EEQGILLAGTSESADAAVFERND 449



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 49/79 (62%), Gaps = 3/79 (3%)

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
           KE +DR D+ LP  QQ LI  V EA  + V+ V++ G P  + + +   ++ +IL+  + 
Sbjct: 564 KETIDRPDITLPPAQQALIQAVFEANPRTVV-VIVGGYPFAVNWEQ--EHLPAILFTSHA 620

Query: 563 GEAGAVALAEVIFGDHNPG 581
           G+    A+A+V++GD+NPG
Sbjct: 621 GQELGHAVADVLYGDYNPG 639


>gi|7671419|emb|CAB89360.1| beta-glucosidase-like protein [Arabidopsis thaliana]
 gi|9758998|dbj|BAB09525.1| unnamed protein product [Arabidopsis thaliana]
          Length = 411

 Score =  255 bits (652), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 123/216 (56%), Positives = 157/216 (72%)

Query: 365 MRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSL 424
           MRLG F+GNP  QP+G +G   VC+  ++ LA++ A+ GIVLLKNS G LPL  S   +L
Sbjct: 1   MRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGSLPLSPSAIKTL 60

Query: 425 ALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDI 484
           A+IGPNAN  KT++GNY G +C+  TPLQ L+  V  T Y+ GC  V C+ A +D A  +
Sbjct: 61  AVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTCTEADLDSAKTL 120

Query: 485 AKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDI 544
           A  AD  VL+MG DQT EKE LDR+DL LPG+QQEL+T+VA+AA+ PV+LV++ GG  DI
Sbjct: 121 AASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVVLVIMSGGGFDI 180

Query: 545 TFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           TFAK D  I SI+W GYPGEAG +A+A+VIFG HNP
Sbjct: 181 TFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNP 216


>gi|29348418|ref|NP_811921.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29340322|gb|AAO78115.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 863

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 156/438 (35%), Positives = 233/438 (53%), Gaps = 35/438 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+    L   +RA DL+ RLTL EKI  + NS+ A+ RLG+  Y WWSEALHGVA  G  
Sbjct: 24  PYKNPKLTPEERAEDLLGRLTLKEKIGLMKNSSFAVERLGVAPYNWWSEALHGVARNGL- 82

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--------GMT 155
                    AT FP  +  A++FD     R+  A+  E RA ++             G+T
Sbjct: 83  ---------ATVFPITMGMASTFDDEAIERVYVAVSDEGRAKFHDAHRSNRYGYGNEGLT 133

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW PN+NIFRDPRWGRGQET GEDP +T +  V+ V+G+QG            +A AC K
Sbjct: 134 FWNPNVNIFRDPRWGRGQETFGEDPYLTTRMGVAVVKGMQGPADAEYD-----KAHACVK 188

Query: 216 HFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           H+  +     K   R+ FD   ++ +DL +TY P F++ V++     +MCAY R+ G P 
Sbjct: 189 HYAVHSGPEAK---RHSFDVEDLSPRDLWETYLPAFKALVQEADVKEVMCAYQRLEGEPC 245

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIY---DAEGYAKSPEDAVVDVLKAGMDVNC 331
           C    LL++  R +WG+   + SDC A+   +     E +  + + +   V+  G D+ C
Sbjct: 246 CDSNRLLTQILRDEWGYKHLVVSDCGAIDDFFVKGRHETHKDAADASASAVIN-GTDLEC 304

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           GS +  H + AVKQ  + E  ID +L  L   R  LG  + + ++ P+ +I  D V    
Sbjct: 305 GS-IYSHLEEAVKQGLITEERIDTSLRRLLKARFALGEMDPD-SIVPWSRISIDTVDCDL 362

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+ +AL  A+  +VLL N +G+LPL K+    +A++GPNA  +    GNY G    + T 
Sbjct: 363 HKQMALDLARKSMVLLCN-NGVLPLAKT-GARIAVMGPNAVDSVMQWGNYEGVPSHTYTI 420

Query: 452 LQALQNYVENTVYYPGCD 469
           L+ ++  + +  +  GC+
Sbjct: 421 LEGIRCKIGDVPFEKGCE 438



 Score = 46.6 bits (109), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 58/115 (50%), Gaps = 14/115 (12%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL-----------DRVDLVLPGRQQELITRVA 525
           S +  VD  K AD V+   G+  + E EE+           DR  + LP  Q++ I +  
Sbjct: 588 SPEMVVDRVKEADIVIFAGGISPSLEGEEMYSVNSPGFAGGDRTSIELPQVQRD-ILKAL 646

Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + A K V+ V   G  V +       +  +IL A YPG++G +A+A+V+FGD NP
Sbjct: 647 KKAGKKVVFVNCSGSAVALVPEM--ESCDAILQAWYPGQSGGLAVADVLFGDFNP 699


>gi|30316196|sp|P83344.1|XYNB_PRUPE RecName: Full=Putative beta-D-xylosidase; AltName: Full=PpAz152
 gi|19879972|gb|AAM00218.1|AF362990_1 beta-D-xylosidase, partial [Prunus persica]
          Length = 461

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 129/262 (49%), Positives = 181/262 (69%), Gaps = 2/262 (0%)

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
           D +KAG+D++CG FL  HT+AAV++  + + EI+ AL N  +V+MRLG+F+G P+   +G
Sbjct: 2   DAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQYG 61

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
            +G   VC+PAHQ LAL+AA+ GIVLL+N    LPL   +  ++A+IGPN++   T++GN
Sbjct: 62  NLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSTRRHRTVAVIGPNSDVTVTMIGN 121

Query: 441 YAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASI-DKAVDIAKGADHVVLMMGLDQ 499
           YAG +C   TPLQ +  Y   T++  GC  V C+   +   A   A+ AD  VL+MGLDQ
Sbjct: 122 YAGVACGYTTPLQGIGRYTR-TIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQ 180

Query: 500 TQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
           + E E +DR  L+LPG QQEL++RVA A++ P ILVL+ GGP+D+TFAK D  I +I+W 
Sbjct: 181 SIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWV 240

Query: 560 GYPGEAGAVALAEVIFGDHNPG 581
           GYPG+AG  A+A V+FG  NPG
Sbjct: 241 GYPGQAGGTAIANVLFGTANPG 262


>gi|383124608|ref|ZP_09945271.1| hypothetical protein BSIG_1643 [Bacteroides sp. 1_1_6]
 gi|251841237|gb|EES69318.1| hypothetical protein BSIG_1643 [Bacteroides sp. 1_1_6]
          Length = 863

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 156/438 (35%), Positives = 233/438 (53%), Gaps = 35/438 (7%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P+    L   +RA DL+ RLTL EKI  + NS+ A+ RLG+  Y WWSEALHGVA  G  
Sbjct: 24  PYKNPKLTPEERAEDLLGRLTLKEKIGLMKNSSFAVERLGVAPYNWWSEALHGVARNGL- 82

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--------GMT 155
                    AT FP  +  A++FD     R+  A+  E RA ++             G+T
Sbjct: 83  ---------ATVFPITMGMASTFDDEAIERVYVAVSDEGRAKFHDAHRSNRYGYGNEGLT 133

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           FW PN+NIFRDPRWGRGQET GEDP +T +  V+ V+G+QG            +A AC K
Sbjct: 134 FWNPNVNIFRDPRWGRGQETFGEDPYLTTRMGVAVVKGMQGPADAEYD-----KAHACVK 188

Query: 216 HFTAYDLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           H+  +     K   R+ FD   ++ +DL +TY P F++ V++     +MCAY R+ G P 
Sbjct: 189 HYAVHSGPEAK---RHSFDVEDLSPRDLWETYLPAFKALVQEADVKEVMCAYQRLEGEPC 245

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIY---DAEGYAKSPEDAVVDVLKAGMDVNC 331
           C    LL++  R +WG+   + SDC A+   +     E +  + + +   V+  G D+ C
Sbjct: 246 CDSNRLLTQILRDEWGYKHLVVSDCGAIDDFFVKGRHETHKDAADASASAVIN-GTDLEC 304

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           GS +  H + AVKQ  + E  ID +L  L   R  LG  + + ++ P+ +I  D V    
Sbjct: 305 GS-IYSHLEEAVKQGLITEERIDTSLRRLLKARFALGEMDPD-SIVPWSRISIDTVDCDL 362

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H+ +AL  A+  +VLL N +G+LPL K+    +A++GPNA  +    GNY G    + T 
Sbjct: 363 HKQMALDLARKSMVLLCN-NGVLPLAKT-GARIAVMGPNAVDSVMQWGNYEGVPSHTYTI 420

Query: 452 LQALQNYVENTVYYPGCD 469
           L+ ++  + +  +  GC+
Sbjct: 421 LEGIRCKIGDVPFEKGCE 438



 Score = 47.8 bits (112), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 39/115 (33%), Positives = 58/115 (50%), Gaps = 14/115 (12%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL-----------DRVDLVLPGRQQELITRVA 525
           S +  VD  K AD V+   G+  + E EE+           DR  + LP  Q++ I +  
Sbjct: 588 SPEMVVDRVKEADIVIFAGGISPSLEGEEMYSVNSPGFAGGDRTSIELPQVQRD-ILKAL 646

Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + A K V+ V   G  V +       +  +IL A YPG+AG +A+A+V+FGD NP
Sbjct: 647 KKAGKKVVFVNCSGSAVALVPEM--ESCDAILQAWYPGQAGGLAVADVLFGDFNP 699


>gi|115436096|ref|NP_001042806.1| Os01g0296700 [Oryza sativa Japonica Group]
 gi|113532337|dbj|BAF04720.1| Os01g0296700, partial [Oryza sativa Japonica Group]
          Length = 522

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 135/323 (41%), Positives = 195/323 (60%), Gaps = 19/323 (5%)

Query: 269 VNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI-IYDAEGYAKSPEDAVVDVLKAGM 327
           +NG+P+CAD  LL++T RR W  HGYI SDCD+V + + DA+    +  +A    +KAG+
Sbjct: 1   INGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGL 60

Query: 328 DVNCG-------SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
           D++CG        F   +   AV+Q KL ES +D AL NL+   MRLG F+G P ++   
Sbjct: 61  DLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFFDGIPELE--- 117

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP--NANSAKTLL 438
            +GA  VC+  H+ LA  AA+ G+VLLKN   LLPL   K  S+AL G   + N+   +L
Sbjct: 118 SLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQHINATDVML 177

Query: 439 GNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
           G+Y G  CR +TP   ++  V +T  +      AC   S D A   AK  D  +++ GL+
Sbjct: 178 GDYRGKPCRVVTPYDGVRKVVSSTSVH------ACDKGSCDTAAAAAKTVDATIVVAGLN 231

Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
            + E+E  DR DL+LP  Q   I  VAEA+  P++LV++  G VD++FA+ +  IG+++W
Sbjct: 232 MSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVW 291

Query: 559 AGYPGEAGAVALAEVIFGDHNPG 581
           AGYPGE G  A+A+V+FG +NPG
Sbjct: 292 AGYPGEEGGTAIADVLFGKYNPG 314


>gi|224536364|ref|ZP_03676903.1| hypothetical protein BACCELL_01238, partial [Bacteroides
           cellulosilyticus DSM 14838]
 gi|224522024|gb|EEF91129.1| hypothetical protein BACCELL_01238 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 808

 Score =  253 bits (645), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 153/411 (37%), Positives = 231/411 (56%), Gaps = 34/411 (8%)

Query: 64  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTA 123
           T++EKIS L  ++P I RL IP Y   +EALHGV   G+           T FPQ I  A
Sbjct: 1   TVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPGR----------FTVFPQAIGLA 50

Query: 124 ASFDSYLWYRIGQAIGLEARALYNA---GQAIG------MTFWAPNINIFRDPRWGRGQE 174
           A+++  L  ++   I  EARA +N    G+         +TFW+P +N+ RDPRWGR  E
Sbjct: 51  ATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMARDPRWGRTPE 110

Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
           T GEDP ++G    ++V+G+QGD          L+  +  KHF A    N +   R+  +
Sbjct: 111 TYGEDPYLSGIMGTAFVKGLQGDD------DRYLKIVSTPKHFAA----NNEEHNRFVCN 160

Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
            +++ + L + Y P FE+CVK G+++ IM AYN +N +P   +  LL+K  R+ WGF GY
Sbjct: 161 PQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLRKDWGFKGY 220

Query: 295 ITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-SFLQKHTKAAVKQKKLPESEI 353
           + SDC   S++ +A  Y K+ E A    +KAG+D+ CG     +   +A +Q  + +++I
Sbjct: 221 VVSDCGGPSLLVNAHKYVKTKEAAAALSIKAGLDLECGDDVYDQPLLSAYRQYMVTDADI 280

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
           D A + +   RM LGLF+      P+ KI   V+ S  HQ +AL AA++ IVLLKN   +
Sbjct: 281 DSAAYRVLRARMELGLFDSG-EQNPYTKISPAVIGSAEHQEVALNAARECIVLLKNQKKM 339

Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNYAG-PSCRSITPLQALQNYVENTV 463
           LPL   K  S+A++G NA S++   G+Y+G P    I+ LQ +++ V + V
Sbjct: 340 LPLNARKVKSIAVVGINAGSSE--FGDYSGLPVIAPISVLQGIKDRVGDDV 388



 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 61/93 (65%), Gaps = 3/93 (3%)

Query: 489 DHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAK 548
           + VV ++G++++ E+E  DR D+ LP  QQE +  + +     +++VL+ G  + I +  
Sbjct: 557 ETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVNPN-IVVVLVAGSSLAINWM- 614

Query: 549 YDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            D +I +I+ A YPGE+G  A+AEV+FGD+NPG
Sbjct: 615 -DEHIPAIVNAWYPGESGGKAVAEVLFGDYNPG 646


>gi|167764323|ref|ZP_02436448.1| hypothetical protein BACSTE_02707 [Bacteroides stercoris ATCC
           43183]
 gi|167697728|gb|EDS14307.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           stercoris ATCC 43183]
          Length = 862

 Score =  253 bits (645), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 163/437 (37%), Positives = 231/437 (52%), Gaps = 34/437 (7%)

Query: 48  TTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFN 107
           T+L   +RA  L+  LTL+EK++ +++++  + RLGI  Y WW+EALHGVA  G      
Sbjct: 25  TSLTPDKRAELLLKELTLEEKVALMMDTSQPVERLGIKPYNWWNEALHGVARAGL----- 79

Query: 108 GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA--LYNAGQAI-----GMTFWAPN 160
                AT FPQ I  AASF     Y +  A+  EARA   Y A Q       G+T W P 
Sbjct: 80  -----ATVFPQPIGMAASFSPQTVYEVFNAVSDEARAKNTYYASQGSYERYQGLTMWTPT 134

Query: 161 INIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAY 220
           +NI+RDPRWGRG ET GEDP +T +  V  V+G+QG   N G+     +  AC KHF  +
Sbjct: 135 VNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGT--NDGRYD---KLHACAKHFAVH 189

Query: 221 DLDNWKGTTRYKFDAR-VTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
               W    R+ F+   ++ +DL +TY PPFE+ VK+     +MCAYN   G P C    
Sbjct: 190 SGPEW---NRHSFNVENLSTRDLYETYLPPFEALVKEAGVKEVMCAYNSFEGEPCCGSNR 246

Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA--GMDVNCGSFLQK 337
           LL +  R  WGF G + SDC A++  Y+  G+    +           G D+ CGS  + 
Sbjct: 247 LLMQILRNDWGFDGIVLSDCGAIADFYNEYGHKAYSDAKSASAAAVLNGTDLECGSSYKA 306

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
             KAA ++ K+ E +ID+A+  L   R  LG  +  P    + KI   VV S AH  L L
Sbjct: 307 LVKAA-QEGKIDEKDIDKAVLRLLEARFALGEMDA-PEDVSWTKIPFSVVASAAHDSLVL 364

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
             A+  + LL+N   +LPL K   +++A++GPNAN +    GNY G    ++T L  ++N
Sbjct: 365 DMARKSMTLLQNKDNILPL-KRGGLTVAVMGPNANDSVMQWGNYNGMPSHTVTILDGIRN 423

Query: 458 YV---ENTVYYPGCDTV 471
            +   +  +Y  GC  V
Sbjct: 424 ALGADDKLIYEQGCSWV 440



 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/115 (41%), Positives = 63/115 (54%), Gaps = 13/115 (11%)

Query: 477 SIDKAVDIAKGADHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAE 526
           +ID +V   K AD VV   G+    E EE+          DR D+ LP  Q+ELI  +  
Sbjct: 588 NIDASVACVKDADVVVFAGGISPLLEGEEMGVNLPGFRGGDRTDIELPAVQRELIHALHR 647

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           A KK +ILV   G P  I      +N  +IL A YPG+AG  A+A+V+FGD+NPG
Sbjct: 648 AGKK-IILVNCSGSP--IALEPETKNCEAILQAWYPGQAGGTAVADVLFGDYNPG 699


>gi|116194378|ref|XP_001223001.1| hypothetical protein CHGG_03787 [Chaetomium globosum CBS 148.51]
 gi|88179700|gb|EAQ87168.1| hypothetical protein CHGG_03787 [Chaetomium globosum CBS 148.51]
          Length = 572

 Score =  252 bits (644), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 197/580 (33%), Positives = 272/580 (46%), Gaps = 101/580 (17%)

Query: 16  CFTSLLTRVDSTQPPFSCDP---SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL 72
           C   +   V     PF   P   S P  E    C  +LP ++RA  LV+ LT +EK+  L
Sbjct: 7   CLVGMGAAVYGLDAPFQKYPDCTSGPLAEN-KVCDRSLPPAERAAALVAALTNEEKLQNL 65

Query: 73  VNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNG--TIRGATSFPQVILTAASFDSYL 130
           V+ +    RLG+PAY WWSEALHGVA      F +G      +TSFP  +L AA+FD  L
Sbjct: 66  VSKSKGASRLGLPAYNWWSEALHGVAYAPGTQFRDGPGDFNSSTSFPMPLLMAAAFDDQL 125

Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
              IG  IG+E RA  NAG + GM +W PN+N FRDPRWGRG ETPGED L   +YA S 
Sbjct: 126 IEDIGTVIGIEGRAFGNAGWS-GMDYWTPNVNPFRDPRWGRGSETPGEDILRVKRYAASM 184

Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
           VRG++G          + +  A CKH+ A D ++WKG TR+ F+A V+ QDLA+ Y  PF
Sbjct: 185 VRGLEGP-------GPERRIVATCKHYAANDFEDWKGATRHNFNAEVSAQDLAEYYLSPF 237

Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFH-GYITSD----CDAVSII 305
           +                                     W  H  Y+TSD    C  VS  
Sbjct: 238 QHT---------------------------------GNWTEHDNYVTSDWRGPCWDVSAK 264

Query: 306 Y-----DAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNL 360
           +       EG A   ED               +     +  A  Q  L    +DRAL  L
Sbjct: 265 HPLRQDHVEGTALCFEDPA-------WTPAASTRAPPTSTGASSQGLLSWDTVDRALRRL 317

Query: 361 FSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
           +   +R+G F+G+    P  K+G   V  P  Q LALQ A +GIVLLKN  G LP+P +K
Sbjct: 318 YQGIIRVGYFDGD--QSPHAKLGWSDVNQPEAQRLALQVAAEGIVLLKND-GTLPIP-AK 373

Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDK 480
             ++A+IG  A++   L G  A                                    + 
Sbjct: 374 KPNVAMIGFWADAGDKLSGGSA-------------------------------KDTWTEG 402

Query: 481 AVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG 540
           A+  A+ AD+++   GLD +   E  DR+ +  P  Q  L+ ++  A KKP+++V + G 
Sbjct: 403 ALAAAQDADYILYFGGLDTSAAGETKDRMTIDWPAAQLALLEKLG-ALKKPLVVVQM-GD 460

Query: 541 PVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +D T       + SILWA +PG+ G  A+ +++ G  +P
Sbjct: 461 QIDDTPLLEMDAVNSILWANWPGQDGGTAVLQLLSGAKSP 500


>gi|261404262|ref|YP_003240503.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261280725|gb|ACX62696.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 934

 Score =  251 bits (642), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 167/477 (35%), Positives = 247/477 (51%), Gaps = 39/477 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +PF    LP+ +R  DLVSRLTLDEKI  +      IPRLG+  Y+  +E  HGVA +G+
Sbjct: 3   YPFQDPDLPLEERVHDLVSRLTLDEKIELMCQYQTEIPRLGVQKYKHGTEGAHGVAWLGE 62

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNI 161
                     AT FPQ    A +++  L   IG  I  EAR  Y   +AI G+T WAP +
Sbjct: 63  ----------ATVFPQNTGLACTWNPELMREIGSVIAEEARVYYQRDRAINGLTIWAPTV 112

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           ++ RDPRWGR +E  GEDP +TG+ +   V+G+QGD           +  A  KHF   +
Sbjct: 113 DLERDPRWGRTEEAYGEDPHLTGELSTGLVKGMQGDH------PFYYKTVATLKHFYGNN 166

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
            +  +G+     D R    +  + Y   FE+  ++G+A  +M AYN +NG P C   + +
Sbjct: 167 NEADRGSASVSIDPR----NKREYYLKAFEAPFREGKAGSMMTAYNGINGTP-CNLNHEV 221

Query: 282 SKTARRQWGFHGYITSDC-DAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQKH 338
           +   +++WG  G++  D  D +  + D   Y  S  +AV   +KAG+D   +      + 
Sbjct: 222 NDIVKQEWGMDGFVVGDAGDVLGTVMDHH-YVASYAEAVAGSVKAGIDSITDDQDISFRA 280

Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
            + A++Q  L E  +D AL N F VR RLG F+      P+ ++    +C+P H  L+L+
Sbjct: 281 LRDALEQGLLEEQHLDHALRNTFRVRFRLGEFDPE-ERNPYSRVPETKLCAPEHAELSLR 339

Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ-- 456
           AA++ IVLLKN  GLLPLP+    S A+IGP AN A T    Y+G     ITPLQ +Q  
Sbjct: 340 AARESIVLLKND-GLLPLPRDPFKSAAVIGPLANEAFTDW--YSGTPPYRITPLQGVQAK 396

Query: 457 ---NYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
               YV+   ++ G D V   SA     V ++   +  +L+ G   + +    +R D
Sbjct: 397 AGDRYVQ---FHTGLDQVRLRSAVSGTYVALSS-EEQGILLAGTSASADAAVFERND 449



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 49/80 (61%), Gaps = 5/80 (6%)

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG-PVDITFAKYDRNIGSILWAGY 561
           KE +DR D+ LP  QQ LI  V EA   P  +V++ GG P  + + +   ++ +IL+  +
Sbjct: 564 KETIDRPDITLPPAQQALIQAVFEA--NPHTVVVIVGGYPFAVNWEQ--EHVPAILFTSH 619

Query: 562 PGEAGAVALAEVIFGDHNPG 581
            G+    A+A+V++GD+NPG
Sbjct: 620 AGQELGHAVADVLYGDYNPG 639


>gi|393782428|ref|ZP_10370612.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673256|gb|EIY66719.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
           CL02T12C01]
          Length = 596

 Score =  251 bits (642), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 151/437 (34%), Positives = 234/437 (53%), Gaps = 28/437 (6%)

Query: 154 MTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG-DTFNGGKLKGKLQASA 212
           MT+W+PN+NIFRDPRWGRGQET GEDP +T +   +YVRG+QG D F        L+A+A
Sbjct: 1   MTYWSPNVNIFRDPRWGRGQETYGEDPYLTAEIGKAYVRGLQGNDPFF-------LKAAA 53

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGI 272
           C KH+  +   +     R++F+A  + +DL +TY P FE+ VK+ +   +M AYNRV G 
Sbjct: 54  CAKHYAVH---SGPEALRHEFNASPSKRDLFETYLPAFEALVKEAKVEAVMGAYNRVYGE 110

Query: 273 PSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG 332
            +     LL+   R++WGF G++ SDC AV  IY     AK   +A    LK+G+++NCG
Sbjct: 111 SASGSFFLLTDILRKKWGFKGHVVSDCGAVDDIYGGHKIAKDVAEASAIALKSGLNLNCG 170

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
                  K A+++K + E ++D AL  L   R++LG    +    P+  I   V+ S  H
Sbjct: 171 GSFHA-LKEALERKLITEVDLDNALMPLMMTRLKLGNLTDDDE-SPYKNISDSVIASYTH 228

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
            ++A + AQ  +VLLKN++  LPL K    ++ + GP A     ++GNY G S RS T L
Sbjct: 229 AMVAREVAQKSMVLLKNNNHTLPLKKDVK-TIFVTGPYAADTYVMMGNYYGVSPRSNTFL 287

Query: 453 QALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
           Q +   V       Y  G      +    D  V   + A+  ++++GL    E EE D +
Sbjct: 288 QGIAAKVSGGTSINYKIGILPTTPNMNPADWTVGEVRAAEVAIVVIGLSGIDEGEEGDAI 347

Query: 510 ---------DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAG 560
                    +L LP  Q + +  ++      ++ V+  G P+D+   +      +++ A 
Sbjct: 348 ASSHRGDKQNLKLPEHQLKFLRDISRNRWNKLVTVITGGSPIDLE--EVSELSDAVIMAW 405

Query: 561 YPGEAGAVALAEVIFGD 577
           YPG+ G +AL +++FGD
Sbjct: 406 YPGQEGGMALGDLLFGD 422


>gi|379722647|ref|YP_005314778.1| glycoside hydrolase family protein [Paenibacillus mucilaginosus
           3016]
 gi|378571319|gb|AFC31629.1| glycoside hydrolase family 3 domain-containing protein
           [Paenibacillus mucilaginosus 3016]
          Length = 937

 Score =  251 bits (641), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 173/480 (36%), Positives = 256/480 (53%), Gaps = 39/480 (8%)

Query: 39  STET-FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGV 97
           +TET  PF  T LP+ +R +DLVSRLTL+EKI+ +      IPRLGI  Y+  +E  HGV
Sbjct: 2   TTETKMPFQDTRLPLEERVQDLVSRLTLEEKINLMCQYQEEIPRLGIAKYKHGTEGAHGV 61

Query: 98  AGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTF 156
           A +G+          AT FPQ +  A +++  L   IG  IG EAR  Y     I G+T 
Sbjct: 62  AWLGE----------ATVFPQNVGLACTWNPELMREIGSVIGDEARVYYQRNPEINGLTI 111

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           WAP +++ RDPRWGR +E  GEDP +TG+ +   V+G+QGD          L+ +A  KH
Sbjct: 112 WAPTVDMERDPRWGRTEEAYGEDPYLTGRLSTELVKGMQGDH------PFYLKTAATLKH 165

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           F    L N     R +  A +  +++ + Y   FE   ++G A  +M AYN VNG P C 
Sbjct: 166 F----LGNNNEMDRGECSASIDPRNMREYYLKAFEPAFREGGAQSMMTAYNSVNGTP-CN 220

Query: 277 DRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV---NCGS 333
               ++   + +WG  G++ SD   V    +   Y  S  +AV   +++G+D    + G 
Sbjct: 221 LNPDVNAIVKGEWGMDGFVVSDAGDVLGTVNEHRYFASYAEAVAASVRSGIDSITDDVGI 280

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
            L +  + A++   L E ++DRA+ N F VR+RLG F+ +P   P+  +    +C+P H 
Sbjct: 281 TL-RAIRDALEGGLLAEEDLDRAVGNAFRVRIRLGEFD-SPEENPYAHVPEAKLCAPEHA 338

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            L+LQAA++G VLLKN  GLLPL  +K  S+A+IGP A+   T    Y+G     ITPL+
Sbjct: 339 ALSLQAAREGFVLLKN-EGLLPL--AKPASIAVIGPLADVVHTDW--YSGTPPYRITPLE 393

Query: 454 ALQNYVE--NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL-DRVD 510
            +   +     ++  G D +   S    + V +  GAD + L    D T E+ E+ +R D
Sbjct: 394 GVAERMAPGTVIHRTGGDRIRIKSLRTGRYVRLG-GADGLSLTA--DGTAEEAEVFERTD 450


>gi|402299776|ref|ZP_10819350.1| glycoside hydrolase family protein [Bacillus alcalophilus ATCC
           27647]
 gi|401725066|gb|EJS98379.1| glycoside hydrolase family protein [Bacillus alcalophilus ATCC
           27647]
          Length = 937

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 153/462 (33%), Positives = 249/462 (53%), Gaps = 33/462 (7%)

Query: 38  PSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGV 97
           P    FPF  T L +  R +DLV+R TLDEKI  ++   P I RLGI A++  +EA HG+
Sbjct: 2   PQAVKFPFQNTELLVEDRVKDLVNRFTLDEKIELMMQHQPEIKRLGIKAHKQGTEAAHGI 61

Query: 98  AGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG-QAIGMTF 156
           A +G+          AT FPQ I  +++++  L  +IG  IG EAR  Y       G+T 
Sbjct: 62  AWLGE----------ATVFPQNIGLSSTWNKDLMNKIGTVIGDEARVFYQKDPSKHGLTL 111

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           WAP +++ RDPRWGR +E  GEDP +TG+     V+G+QG+          L++ A  KH
Sbjct: 112 WAPTVDMERDPRWGRTEEAYGEDPELTGQLTTELVKGMQGND------PDYLKSVATLKH 165

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           F    L N     R K  A +  +++ + Y   F+  + +G+A  IM AYN +NG P+  
Sbjct: 166 F----LGNNNEINRDKCSASIDPRNMHEYYLKAFKPAIVEGKAKSIMTAYNSINGTPALL 221

Query: 277 DRNLLSKTARRQWGFHGYITSDC-DAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGS 333
               + +  +++WG  G+I SD  D V I+ D + Y ++  +A+ D +K+G+D   +   
Sbjct: 222 -HPYVEEVVKQEWGMDGFIVSDAGDVVGIVNDHK-YYETYSEALADTIKSGIDSITDDAE 279

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
              +  + A+ +  L E+++D+AL N F VR RLG F  +    P+  +    + +  H+
Sbjct: 280 ISLQAMRDALSKGLLVEADLDKALINTFKVRFRLGEF--DEERNPYAFVSESKLLAKEHK 337

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            L+ QAAQ+ +VLLKN  GLLPL +  + S+A+IGP ++    L+  Y+G     IT L+
Sbjct: 338 QLSYQAAQEQVVLLKND-GLLPLNEGTTESIAVIGPLSDDV--LVDWYSGTPSYRITALE 394

Query: 454 ALQNYVEN--TVYYPGCDTVACSSASIDKAVDIAKGADHVVL 493
           A+++ ++N   ++  G   +   S      + + +  +H ++
Sbjct: 395 AIEHKLQNKKVIHKSGNPLIRIRSVQTGSYIGLDQSFNHKLM 436



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 61/112 (54%), Gaps = 4/112 (3%)

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQ-TQEKEELDRVDLVLPGRQQELITRVAEAAK 529
           +    A I++AV  A   D+ V+++G +     KE +DR D+VLP  Q++L+  V +   
Sbjct: 537 IEVIEAGIEEAVKAAAEQDYAVVVLGNNPFINGKECIDREDIVLPPEQEKLLQAVKQVNP 596

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
              I VL+   P  + +A  + N+  I++  + G     A+A+V+FGD+NP 
Sbjct: 597 N-TIAVLVSSYPYALNWA--NENLSGIMYTSHAGPELGKAVADVLFGDYNPA 645


>gi|206901280|ref|YP_002250567.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
 gi|206740383|gb|ACI19441.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
          Length = 762

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 184/562 (32%), Positives = 286/562 (50%), Gaps = 68/562 (12%)

Query: 52  ISQRARDLVSRLTLDEKISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFN 107
           I Q +R    R    EK  +L N          RLGIPA     E L G           
Sbjct: 54  IGQISRVAGERGVSPEKAVELANKIQKFLKEKTRLGIPAI-IHEECLSGFMA-------- 104

Query: 108 GTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDP 167
              +GAT FPQ I  A++F+  L  R+   I    +A  N  Q +     +P ++I RDP
Sbjct: 105 ---QGATVFPQAIGMASTFEPELIRRVSDVIRQHMKAA-NVHQGL-----SPVLDIPRDP 155

Query: 168 RWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKG 227
           RWGR +ET GEDP +  + A  YV+G+QG+ +  G +       A  KHFTAY +     
Sbjct: 156 RWGRTEETFGEDPYLVSRMATEYVKGLQGEDWREGIV-------ATVKHFTAYGISE--- 205

Query: 228 TTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARR 287
             R    A+V  ++L + +  PFE  +K+G+A  +M AY+ ++G+P  + + LL+K  R 
Sbjct: 206 GARNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRW 265

Query: 288 QWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAAVKQ 345
           +WGF GY+ SD  AV ++ +    A+  ++A V  L+AG+D+   S     +    AVK+
Sbjct: 266 EWGFKGYVVSDYIAVRMLENFHKVARDAKEAAVLALEAGIDIELPSVDCYGEPLIQAVKE 325

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + E  I+ ++  +   +  LGLF+ N    P  K   +V   P  + L+ + A+  IV
Sbjct: 326 GLISEEVINASVERVLRAKFMLGLFDDNLEKDP--KKVYEVFDKPEFRDLSREVARRSIV 383

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG----PS---------------C 446
           LLKN  G LPL K+    +A+IGPNA++ + L G+Y+     PS                
Sbjct: 384 LLKND-GTLPLSKNLK-KVAVIGPNADNPRNLHGDYSYTAHIPSIAEGLEGVKVEEKCVV 441

Query: 447 RSITPLQALQNYVE---NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG-----LD 498
           R+++ L+ ++N V      +Y  GCD ++ S     +A+++AK AD ++ +MG       
Sbjct: 442 RTVSILEGIRNKVSPETEVLYAKGCDIISDSKDGFAEAIEMAKEADVIIAVMGEESGLFH 501

Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
           +    E  DR  L L G Q++L+  + +   KP++LVL+ G P  + +     N+ +IL 
Sbjct: 502 RGISGEGNDRTTLELFGVQRDLLKELHKLG-KPIVLVLINGRPQALKWEH--ENLNAILE 558

Query: 559 AGYPGEAGAVALAEVIFGDHNP 580
           A YPGE G  A+A+VIFGD+NP
Sbjct: 559 AWYPGEEGGNAVADVIFGDYNP 580


>gi|224025503|ref|ZP_03643869.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
           18228]
 gi|224018739|gb|EEF76737.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
           18228]
          Length = 787

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 175/533 (32%), Positives = 281/533 (52%), Gaps = 72/533 (13%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP   +  E  HG   +G           AT FP  +  A++++  L  ++G+ IGL
Sbjct: 125 RLGIPVL-FAEECPHGHMAIG-----------ATVFPTSMGQASTWNESLIRQMGEVIGL 172

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EAR     G  IG   + P ++I R+PRW R +ET GEDP +TG    ++V+G+QG  F 
Sbjct: 173 EARL---QGANIG---YGPVLDIAREPRWSRVEETFGEDPYLTGILGTAFVQGMQGKDFK 226

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD---ARVTMQDLADTYQPPFESCVKQG 257
            G+        +  KH  AY      G  R   +   A + ++ L D Y P F+  V+ G
Sbjct: 227 DGR-----HVYSTLKHLAAY------GVPRGGHNGGPADMGLRALLDEYLPGFQRAVEVG 275

Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED 317
           +A+ +M +YN ++G+P  +++ L+    R++WGF G++ SD  ++  I  A   A + ED
Sbjct: 276 KAATVMTSYNSIDGVPCTSNKFLIDSLLRKRWGFDGFVYSDLASIDGIAGAH-VAANLED 334

Query: 318 AVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
           A +  ++AG D++ G+   +    AV+  K+ ES I+RA+ N+  ++ R+GLF   P + 
Sbjct: 335 AAIQAVEAGTDMDLGANAYRRLVKAVQTGKVKESAINRAVSNVLRLKFRMGLFE-QPYVS 393

Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
           P  +  A +V    H++LA + A++G VLLKN +G+LPL K K +  A+IGPNA+     
Sbjct: 394 P--EEAARLVNCEDHRMLARKIAREGTVLLKN-NGILPLGKVKRI--AVIGPNADVMYNY 448

Query: 438 LGNYAGPSCRS--ITPLQALQNYVENTV--YYPGCDTVACSSASIDKAVDIAKGADHVVL 493
           LG+Y  P  RS  +T L AL+N + +    Y  GC     + ++I +AV+ A+ AD V+L
Sbjct: 449 LGDYTAPQERSKVVTLLDALRNRMPDVRIDYVKGCAIRDTTQSNIKEAVEAARKADLVIL 508

Query: 494 MMG--------------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEA 527
            +G                          L   +  E  DR  L L G Q++LI  +A A
Sbjct: 509 AVGGSSARDFKTKYINTGAATVDSENSGILSDMECGEGFDRATLDLLGDQEKLIRAIA-A 567

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +KP++ V + G P+++  A    +  ++L A YPGE G   + +V+ G++NP
Sbjct: 568 TEKPLVTVYIAGRPLNMNLASEVSD--ALLTAWYPGEQGGNGIVDVLTGEYNP 618


>gi|312126424|ref|YP_003991298.1| glycoside hydrolase family 3 domain-containing protein
           [Caldicellulosiruptor hydrothermalis 108]
 gi|311776443|gb|ADQ05929.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
           hydrothermalis 108]
          Length = 771

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 175/511 (34%), Positives = 263/511 (51%), Gaps = 71/511 (13%)

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRG 172
           AT FPQ I  A +FD+ L   + + I L+ +A+  A QA+     AP I++ RD RWGR 
Sbjct: 103 ATVFPQSIGVACTFDNELVKEMAKVIRLQMKAV-GAHQAL-----APLIDVARDARWGRV 156

Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD----NWKGT 228
           +ET GEDP +    AVSYV G+QG  F         +  A  KHF  Y +     NW   
Sbjct: 157 EETFGEDPYLVANMAVSYVEGIQGKNFEE-------KIIATGKHFVGYAMSEGGMNWAPV 209

Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
                   +  ++L + Y  PFE  VK      IM AY+ ++GIP  A+R LL++ AR +
Sbjct: 210 -------HIPERELREVYLYPFEVAVKVAGLKSIMPAYHEIDGIPCHANRKLLTEIARDE 262

Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN---CGSFLQKHTKAAVKQ 345
           WGF G   SD   V  I D     K+ E+A    L AG+D+       F +K  + A+K+
Sbjct: 263 WGFDGIFVSDYSGVKNILDYHKSVKTYEEAAYISLWAGLDIELPRIECFTEKFIE-ALKE 321

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            K   + +D A+  +  ++ RLGLF+ NP ++    I  ++  +   + LA + AQ+ +V
Sbjct: 322 GKFDMAVVDAAVKRVLEMKFRLGLFD-NPFVKTENII--ELFDNKEQRSLARKVAQESMV 378

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE----- 460
           LLKN  G+LPL +     +A+IGPNANS + LLG+Y+ P+  S T +  ++  V+     
Sbjct: 379 LLKND-GILPLKEKDLKKVAVIGPNANSVRNLLGDYSYPAHISTTEMFFMKEEVDLGDED 437

Query: 461 ------------------------NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
                                     VY  GCD  +   +S ++A   A+GAD V++++G
Sbjct: 438 AFVKKVVNIKSVYEVIKERIGKHTEVVYAKGCDVNSQDKSSFEEAKKAAQGADVVIVVVG 497

Query: 497 ------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
                 LD T   E  DR  L LPG Q+ELI  +++  +  ++++L+ G PV      + 
Sbjct: 498 DKAGLKLDCTS-GESRDRASLKLPGVQEELIEEISKVNQN-IVVILVNGRPV--ALENFW 553

Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +   +IL A +PGE GA A+A+VIFG +NPG
Sbjct: 554 QKSKAILEAWFPGEEGAEAIADVIFGKYNPG 584


>gi|312136056|ref|YP_004003394.1| glycoside hydrolase family 3 domain-containing protein
           [Caldicellulosiruptor owensensis OL]
 gi|311776107|gb|ADQ05594.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
           owensensis OL]
          Length = 771

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 178/515 (34%), Positives = 264/515 (51%), Gaps = 79/515 (15%)

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRG 172
           AT FPQ I  A +FD+ L   + + I L+ +A+  A QA+     AP I++ RD RWGR 
Sbjct: 103 ATVFPQSIGVACTFDNELVKEMAKVIRLQMKAV-GAHQAL-----APLIDVARDARWGRV 156

Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD----NWKGT 228
           +ET GEDP +    AVSYV G+QG  F         +  A  KHF  Y +     NW   
Sbjct: 157 EETFGEDPYLVANMAVSYVEGIQGKDFEE-------KVIATGKHFVGYAMSEGGMNWAPV 209

Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
                   +  ++L + Y  PFE  VK      IM AY+ ++GIP  A+R LL++ AR +
Sbjct: 210 -------HIPERELREVYLYPFEVAVKVAGLKSIMPAYHEIDGIPCHANRKLLTEIARNE 262

Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN---CGSFLQKHTKAAVKQ 345
           WGF G   SD   V  I D     K+ E+A    L AG+D+       F +K  + A+K+
Sbjct: 263 WGFDGIFVSDYSGVKNILDYHKSVKTYEEAAYISLWAGLDIELPRIECFTEKFIE-ALKE 321

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            K   + +D A+  +  ++ RLGLF+ NP ++    I  ++  +   + LA + AQ+ +V
Sbjct: 322 GKFDMAVVDAAVKRVLEMKFRLGLFD-NPFVKTENII--ELFDNKEQRSLARKVAQESMV 378

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE----- 460
           LLKN  G+LPL +     +A+IGPNANS + LLG+Y+ P+  S T +  ++  V+     
Sbjct: 379 LLKND-GILPLKEKDLKKVAVIGPNANSVRNLLGDYSYPAHISTTEMFFMKEEVDLGDED 437

Query: 461 ------------------------NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
                                     VY  GCD  +   +S ++A   A+GAD V++++G
Sbjct: 438 AFVKKVVNIKSVYEVIKERIGKHTEVVYAKGCDVNSQDKSSFEEAKKAAQGADVVIVVVG 497

Query: 497 ------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
                 LD T   E  DR  L LPG Q+ELI  +A+  +  ++++L+ G PV +      
Sbjct: 498 DKAGLKLDCTS-GESRDRASLKLPGVQEELIEEIAKVNQN-IVVILVNGRPVAL------ 549

Query: 551 RNI----GSILWAGYPGEAGAVALAEVIFGDHNPG 581
            NI     +IL A +PGE GA A+A+VIFG +NPG
Sbjct: 550 ENIWQKSKAILEAWFPGEEGAEAIADVIFGKYNPG 584


>gi|206901921|ref|YP_002251428.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
 gi|206741024|gb|ACI20082.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
          Length = 756

 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 173/497 (34%), Positives = 271/497 (54%), Gaps = 50/497 (10%)

Query: 92  EALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA 151
           EALHG              +G+T FPQ I  A++++  L Y++  AIG E R+     + 
Sbjct: 138 EALHGCMA-----------KGSTIFPQAIGMASTWNPELIYQVATAIGKETRS-----RG 181

Query: 152 IGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQAS 211
           I     +P INI RDPR GR +ET GEDP +  + AV+Y++GVQ           +    
Sbjct: 182 IHQVL-SPTINIARDPRCGRTEETYGEDPYLASRMAVAYIKGVQ-----------EQGVI 229

Query: 212 ACCKHFTA-YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
           A  KHF A +  D  + +    F  R+    L + Y P F + +++  A  +M AYN ++
Sbjct: 230 ATPKHFVANFVGDGGRDSYPIHFSERL----LREIYFPAFRASIEEAGALSLMAAYNSLD 285

Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
           GIP  +++ LL++  R++WGF GY+ SD  +V  +      A+S  +A    L+AG+D+ 
Sbjct: 286 GIPCSSNKWLLTRILRKEWGFKGYVVSDYFSVLHLMTKHKVAESKAEAAKLSLEAGLDME 345

Query: 331 C-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP-FGKIGADVVC 388
              S   +     +++ KL +  +D A+  +  V+  +GLF+ NP + P + +   D  C
Sbjct: 346 LPDSDCFEEIPGLIRESKLSQDTLDEAVRRVLRVKFWIGLFD-NPFVDPDYAERIND--C 402

Query: 389 SPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS 448
           S  H+ LAL+ A++ IVLLKN  G+LPL K    S+A+IGPNA  A   LG Y+G   + 
Sbjct: 403 S-EHRELALRVARESIVLLKN-EGILPLNKDIR-SIAVIGPNA--AVPRLGGYSGYGVKV 457

Query: 449 ITPLQALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQ-EKE 504
           +TPL+ ++N + + V   +  GC     S +  D+A+ IA+ +D  +L MG    + E E
Sbjct: 458 VTPLEGIKNKLGDKVKVYFAEGCGLNDTSKSGFDEAIKIAQKSDVAILFMGNSVPETEGE 517

Query: 505 ELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGE 564
           + DR +L LPG Q++LI  +      PVI+VL+ G    IT   +   + +++ A YPGE
Sbjct: 518 QRDRHNLNLPGVQEDLIKEICN-TNTPVIVVLINGSA--ITMMNWIDKVQAVIEAWYPGE 574

Query: 565 AGAVALAEVIFGDHNPG 581
            G  A+A+V+FGD+NPG
Sbjct: 575 EGGNAIADVLFGDYNPG 591


>gi|430749766|ref|YP_007212674.1| beta-glucosidase-like glycosyl hydrolase [Thermobacillus composti
           KWC4]
 gi|430733731|gb|AGA57676.1| beta-glucosidase-like glycosyl hydrolase [Thermobacillus composti
           KWC4]
          Length = 971

 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 163/469 (34%), Positives = 247/469 (52%), Gaps = 50/469 (10%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +PF    LP+  R  DL+ RLT +EKI+ +     A+PRLGI  Y+  +EA HG+A +G 
Sbjct: 7   YPFRDPDLPLETRVNDLIGRLTTEEKINLMAQYQDAVPRLGIAPYKHGTEAAHGIAWLGP 66

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNI 161
                     AT+FPQ I  A ++D+ L  R+G  IG EAR  Y    A+ G+T WAP +
Sbjct: 67  ----------ATTFPQPIGLACTWDADLLRRVGSVIGDEARVWYRRNPAVNGLTLWAPTV 116

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           ++ RDPRWGR +E  GEDP + GK A + +RG+QGD          L+A A  KHF   +
Sbjct: 117 DLERDPRWGRTEEAYGEDPFLVGKLASALIRGIQGDH------PFYLKAVATLKHFLGNN 170

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
            +  +G      D R    ++ + Y   FE+  K+G A  +M AYN VNG+P+    NLL
Sbjct: 171 NEAGRGDKSVSIDPR----NMREYYLKVFETAFKEGGALSMMTAYNAVNGVPA----NLL 222

Query: 282 ---SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQ 336
              +   +++WG +G++ SD   V+      GY  S   AV   ++AG+D   +    ++
Sbjct: 223 PMITSVVKQEWGMNGFVVSDAFDVTGTVRDHGYMASLPAAVAASIRAGIDSITDDAKVVK 282

Query: 337 KHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLA 396
           +  + A++   L E+++D AL N F VR RLG F+ +    P+ +IG   +  P H+ L+
Sbjct: 283 QAIREALEMGLLTENDLDVALRNTFRVRFRLGEFDPD-DRNPYAEIGESHMMRPEHEALS 341

Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL---- 452
           L+AA+  IVLLKN  G+LPL   +   +A+IGP A+        Y+G    ++TPL    
Sbjct: 342 LEAARKSIVLLKND-GVLPLSADRIRKIAVIGPLADVVYRDW--YSGTLPYAVTPLAGIV 398

Query: 453 QALQNYVE------------NTVYYPGCDTVACSSASIDKAVDIAKGAD 489
           +AL   V+              ++  G   V   S++  + V IA+G D
Sbjct: 399 EALAGRVQPHGPAGAVDAGDRVIHAKGTTRVRLKSSTTGRYVRIAEGED 447


>gi|160901716|ref|YP_001567297.1| glycoside hydrolase family 3 protein [Petrotoga mobilis SJ95]
 gi|160359360|gb|ABX30974.1| glycoside hydrolase family 3 domain protein [Petrotoga mobilis
           SJ95]
          Length = 777

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 181/541 (33%), Positives = 281/541 (51%), Gaps = 81/541 (14%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIPA+    E L G              RGAT FPQ+I  A++++  L  R+  +I  
Sbjct: 91  RLGIPAF-MHEECLSGY-----------MTRGATIFPQMIGAASTWEPPLIERMTTSIRN 138

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           + +AL   G   G++   P +++ RDPRWGR +ET GEDP +  K  V+YV+G+Q D   
Sbjct: 139 QMKAL---GIHQGLS---PVVDVTRDPRWGRTEETFGEDPYLIAKMGVAYVKGLQSDDLK 192

Query: 201 GGKLKGKLQASACCKHFTAYDLD----NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
            G +       A  KHF  Y +     NW         A +  ++L +T+  PFE+ +K+
Sbjct: 193 NGIV-------ATLKHFVGYGVSEGGMNWA-------PAHIPERELKETFLFPFEAAIKE 238

Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
           G+   +M AY+ ++GIP  A   LL +  R +WGF G + SD  A++ + +    A + E
Sbjct: 239 GKVKSVMNAYHEIDGIPCGASETLLRRILREEWGFDGIVVSDYFAINSLMEYHKIALNKE 298

Query: 317 DAVVDVLKAGMDVNCGSF--LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
           +A +  LKAG+DV   SF   ++  K A++  +  E+ ID+++ N+  ++  +GLF  NP
Sbjct: 299 EAAIKALKAGIDVELPSFDCYKEPLKNAIENGEFSEAFIDKSVRNILRLKFEMGLFE-NP 357

Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK-SKSVSLALIGPNANS 433
            +    K+  D + +P  + LA + A+  IVLLKN  G++PL K SK   +A+IGPNANS
Sbjct: 358 YVD-LEKV-PDNLDTPEDRKLAYEIAKKSIVLLKND-GIVPLKKNSKIKKVAVIGPNANS 414

Query: 434 AKTLLGNYA-------------GPSC-------RSITPL----QALQNYVE----NTVYY 465
           A+ L G+Y              G S         S  P+    ++L+  +E     T Y 
Sbjct: 415 ARNLTGDYTYLTHLETLKQGAFGTSAMEGITFSESELPIKTIYESLKEKLEKLNVETSYA 474

Query: 466 PGCDTVACSSASIDKAVDIAKGADHVVLMMG------LDQTQEKEELDRVDLVLPGRQQE 519
            GC+    +   I +AV++A+ +D  +L++G      LD T   E  D   L+LPG Q +
Sbjct: 475 KGCEINDDNKEMIKEAVELAENSDVALLVLGDKSGLTLDCTT-GESRDSSTLILPGVQLD 533

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           L+  V      PVI+VL+ G P  + +    +N+ +I  A  PGE G  ALA++I GD +
Sbjct: 534 LLKSVINTG-TPVIVVLVNGRPYSLDWV--SKNVSAIFEAWLPGEEGGNALADIILGDES 590

Query: 580 P 580
           P
Sbjct: 591 P 591


>gi|452973347|gb|EME73169.1| glycoside hydrolase [Bacillus sonorensis L12]
          Length = 989

 Score =  249 bits (635), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 162/477 (33%), Positives = 248/477 (51%), Gaps = 35/477 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           FPF    L + +R  DL+SRLTL+EK+S +    PAIPRLGIPA++  +EALHGVA +GK
Sbjct: 48  FPFQNPGLSVEKRVSDLISRLTLEEKVSLMHQYQPAIPRLGIPAFKTGTEALHGVAWLGK 107

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNI 161
                     AT FPQ +  A +++  L  R+G A+G E R  ++   A  G+  WAP +
Sbjct: 108 ----------ATVFPQAVGLAHTWNPALIKRVGSAVGDEVRGFHHLNPAANGVNVWAPVV 157

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           ++ RDPRWGR +E   EDP +TGK + +Y  G++GD          L+     KHF  Y+
Sbjct: 158 DLLRDPRWGRNEEGYSEDPFLTGKISAAYASGLKGDH------PFYLKTVPTLKHFLGYN 211

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
            +  +G +    D R    ++ + Y  PFE+ + +  A G+M AYN +N  P+     LL
Sbjct: 212 NETDRGFSSSSIDPR----NMQEYYIKPFETAITEKAAYGLMPAYNSINDKPAILS-PLL 266

Query: 282 SKTARRQW-GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCG---SFL 335
               + +W G H +I SD    S I +   Y  + E+A    LKAG+D   + G   +  
Sbjct: 267 HTAVKNKWAGKHFFIVSDAFDPSGIVNDHRYYDNHEEAHAHALKAGIDSFTDQGENPALT 326

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
           +     A+K+  + E ++D+AL N+FS+R R G F+ +    P+ ++  +V+ SP HQ L
Sbjct: 327 KNAVTGALKKGLIAEKDLDQALANVFSLRFRTGEFDPD-KRNPYSRLTENVINSPKHQKL 385

Query: 396 ALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL 455
           A + A+  IVLLKNS  LLPL +     +A+IGP  ++       Y+G     ITPL  +
Sbjct: 386 AKKTAEQSIVLLKNSKQLLPLERRADEKIAVIGPFGDALYEDW--YSGTMPYQITPLAGI 443

Query: 456 QNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
            + +  +   +  G +     SA   K V      +   L    D+ +++E  D  D
Sbjct: 444 TDKIGKDRVSFAEGIEQSGFRSALTGKYVTAQD--EKQPLSATADELKKRETFDLAD 498



 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/133 (30%), Positives = 70/133 (52%), Gaps = 15/133 (11%)

Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD-QTQEKEELDR 508
           TPL A + + +  V            +++++A  +AK AD  ++ +G +     +E  DR
Sbjct: 574 TPLTAAEKFAKTVV-----------KSAVEEAAALAKSADKTIVFVGNNPYINGRETQDR 622

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
            D+ LP  Q+ELI  V +A      LV+    P  + +A  DR++ SIL++ + G+    
Sbjct: 623 KDITLPPAQEELIKAVTKANPN-TALVVTSSYPFALNWA--DRHVPSILYSAHGGQEAGR 679

Query: 569 ALAEVIFGDHNPG 581
           ALA+++FGD  PG
Sbjct: 680 ALADILFGDKAPG 692


>gi|365122063|ref|ZP_09338970.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643257|gb|EHL82578.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 819

 Score =  249 bits (635), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 183/613 (29%), Positives = 281/613 (45%), Gaps = 102/613 (16%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV- 97
           F     PI +R +DL+S++ LDEK  QL        +    +P  EW    W + +  + 
Sbjct: 53  FENPKQPIEKRVQDLLSQMNLDEKTCQLATLYGYKRVMSDSLPTPEWKNKIWKDGIANID 112

Query: 98  ---AGVGKGI-----------------------------------FFNGTIRG-----AT 114
               GVG+G                                    F N TI G     AT
Sbjct: 113 EQLNGVGRGAKIAQDLIYPFSKHAEAINKTQKWFIEETRLGIPVDFSNETIHGLNHTKAT 172

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQ 173
             P  I   +++++ L Y+ G   G EA+AL       G T  +AP +++ RDPRWGR  
Sbjct: 173 PLPAPIGIGSTWNAPLVYKAGSIAGKEAKAL-------GYTNIYAPILDLARDPRWGRVL 225

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           E  GEDP +        V+G+Q           +   +A  KHF  Y +         + 
Sbjct: 226 ECYGEDPFLVATLGTQMVKGIQ-----------EQGVAATLKHFAVYSVPKGGRDGSVRT 274

Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
           D  V  +++   +  PF+  ++     G+M +YN  +G+P  A    L++  R+++GF G
Sbjct: 275 DPHVAPREMHQMHLYPFKKVIQDAHPMGVMSSYNDWDGVPVTASYYFLTQLLRQEFGFDG 334

Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG----SFLQKHTKAAVKQKKLP 349
           Y+ SD DAV  +Y+    A++ E+AV  VL+AG++V              +  VK+ +L 
Sbjct: 335 YVVSDSDAVEYVYNKHHVAETYEEAVRMVLEAGLNVRTTFAAPDIFILPARKLVKEGRLS 394

Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQPF---GKIGADVVCSPAHQVLALQAAQDGIVL 406
              ID  + ++  V+ RLGLF+     QPF    K    +V +  ++   L   +  +VL
Sbjct: 395 MKVIDERVADVLRVKFRLGLFD-----QPFVADPKAADKIVGADKNKDFVLDIQRQSLVL 449

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV--- 463
           LKN + LLPL K+K   + + GP A     ++  Y      +IT  + ++NY+ N V   
Sbjct: 450 LKNENNLLPLDKNKLSRILITGPLAKEENYMVSRYGPQELENITVYEGIKNYLGNKVAVD 509

Query: 464 YYPGCDT---------------VACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDR 508
           Y  GC                        I  AV+ AK +D V+ ++G D+    E   R
Sbjct: 510 YALGCKVKDAKWPESEIIHSPLTTEEQQEIQNAVEKAKLSDIVIAVLGEDEESTGESKSR 569

Query: 509 VDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAV 568
             L LPGRQQ+L+  +  A  KPV+LVL+ G P+ I +A  DR I +IL A +PG+ G  
Sbjct: 570 SGLDLPGRQQQLLEALY-ATGKPVVLVLINGQPLTINWA--DRYIPAILEAWFPGQMGGT 626

Query: 569 ALAEVIFGDHNPG 581
           A+AE +FGD+NPG
Sbjct: 627 AIAETLFGDYNPG 639


>gi|304406516|ref|ZP_07388172.1| Beta-glucosidase [Paenibacillus curdlanolyticus YK9]
 gi|304344574|gb|EFM10412.1| Beta-glucosidase [Paenibacillus curdlanolyticus YK9]
          Length = 939

 Score =  249 bits (635), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 167/451 (37%), Positives = 236/451 (52%), Gaps = 33/451 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           FPF    LP+ +R  DLVSRLTL+EKI  +     AI RLGI AY+  +E  HG+A +G+
Sbjct: 7   FPFRNPALPLEERVSDLVSRLTLEEKIHNMPQYQSAIERLGIQAYKHGTEGAHGIAWLGE 66

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNI 161
                     ATSFPQ I    ++D  L   IG  IG EAR  Y     I G+T WAP +
Sbjct: 67  ----------ATSFPQPIGLGCTWDQELMKEIGDVIGTEARVFYKRNPQINGLTLWAPTV 116

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           ++ RDPRWGR +E  GEDP +T + AV+  +G+QGD          L+A A  KHF   +
Sbjct: 117 DMERDPRWGRTEEAYGEDPHLTSELAVALTQGMQGDH------PKYLKAVATLKHFLGNN 170

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
            +  +G+     D R    ++ + Y   FE    +G A  +M AYN VNG P      + 
Sbjct: 171 NEINRGSCSVSIDPR----NMREYYFKAFEPVFVKGGAQSMMTAYNSVNGTPVLLHETVD 226

Query: 282 SKTARRQWGFHGYITSDC-DAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQKH 338
           S   + QW   G+I SD  D + I+ D   Y  S   AV + +KAG+D   +    +   
Sbjct: 227 S-VVKEQW-MDGFIVSDAGDMIGIVNDHH-YYDSLAQAVAESIKAGIDSITDEAKAVTDA 283

Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
            + A++Q  L E+++D+AL N F VRMRLG F+    + P+  I    +  P HQ L+L+
Sbjct: 284 IRDALEQGLLGEADLDKALSNTFRVRMRLGEFDPQE-LVPYAAIDDSSLMKPEHQALSLK 342

Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
           AA+  IVLLKN  G LPL   K   +A+IGP  ++       Y+G     ITPLQA+Q  
Sbjct: 343 AARKNIVLLKND-GALPLDAEKLSKIAVIGPLGDT--VYRDWYSGTFPYKITPLQAIQQK 399

Query: 459 V--ENTVYYPGCDTVACSSASIDKAVDIAKG 487
           +  ++  +  G D +   S +  + + IA G
Sbjct: 400 LAGKSVAFADGNDRIKLRSRANGQYLAIADG 430



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 58/104 (55%), Gaps = 4/104 (3%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQ-TQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
           +  AV+ AK ++  ++ +G +     KEE DR  L L   Q+ L+  V EA   P  +V+
Sbjct: 543 LQAAVEAAKQSEVAIVFVGNNPLVNGKEENDRPGLELAASQERLVKAVFEA--NPNTVVV 600

Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + GG    T    + NI +++++ + G+    ALA+V+FGD+NP
Sbjct: 601 VTGG-YPFTLNWIEANIPAVIYSSHSGQEHGAALADVLFGDYNP 643


>gi|332882179|ref|ZP_08449812.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
 gi|357047409|ref|ZP_09109015.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
           11840]
 gi|332679875|gb|EGJ52839.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
 gi|355529667|gb|EHG99093.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
           11840]
          Length = 840

 Score =  248 bits (633), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 156/401 (38%), Positives = 221/401 (55%), Gaps = 34/401 (8%)

Query: 52  ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
           +  +  D+   LT++EKI  L   APA+ RLGI  Y+WWSE L+GVA  GK         
Sbjct: 23  VGGKVSDIRKGLTVEEKIDLLCADAPAVERLGIIRYDWWSECLNGVARAGK--------- 73

Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN--------AGQAIGMTFWAPNINI 163
            AT+FP+ I   A++D  L  RI  AI  EARA +         + +  G+TF++P +NI
Sbjct: 74  -ATAFPKPIGLGATWDVDLIKRISTAISDEARAKHRKVMEEKGYSPRQFGLTFFSPTLNI 132

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
            RDPRWGR  E   EDPL+TG+  V+YV+G+QGD          L+A A  KHF A + +
Sbjct: 133 ARDPRWGRTSECFSEDPLLTGEMGVAYVQGLQGDDARW------LKAVATIKHFVANNEE 186

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
           N     R    A V    L + Y P F   V +G A+ +M AYN +NG+P CA+  LL+ 
Sbjct: 187 N----RRAGGSADVDELSLREYYFPAFREAVVRGGAASVMGAYNALNGVPCCANAYLLND 242

Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAV 343
             R++WGF G + SD  AV  IY    YA +P +A    LK+G D++     +   + A 
Sbjct: 243 VLRKEWGFRGVVISDGSAVEKIYTHHKYASTPAEAAAMALKSGCDMSLRDEYRDGLRRAY 302

Query: 344 KQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDG 403
            +K + + ++DRA+  +  +R+RLGL +G+    P+  I   VV  PAH+ LAL+AA+  
Sbjct: 303 TEKLIDDDDLDRAVERVLELRVRLGL-DGDIGGNPYVDIPYSVVECPAHRELALEAAEKS 361

Query: 404 IVLLKNSHGLLPLP--KSKSVSLALIGPNANSAKTLLGNYA 442
           +VLLKN  GLLPL         +ALIG   N+     G+Y+
Sbjct: 362 MVLLKND-GLLPLELGSGDVRKIALIGDAFNT--VYYGDYS 399


>gi|302872724|ref|YP_003841360.1| glycoside hydrolase family protein [Caldicellulosiruptor
           obsidiansis OB47]
 gi|302575583|gb|ADL43374.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
           obsidiansis OB47]
          Length = 771

 Score =  248 bits (632), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 173/511 (33%), Positives = 261/511 (51%), Gaps = 71/511 (13%)

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRG 172
           AT FPQ I  A +FD+ L   + + IGL+ +A+  A QA+     AP I++ RD RWGR 
Sbjct: 103 ATVFPQSIGVACTFDNELVKEMAKVIGLQMKAV-GAHQAL-----APLIDVARDARWGRV 156

Query: 173 QETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD----NWKGT 228
           +ET GEDP +    AVSYV G+QG  F         +  A  KHF  Y +     NW   
Sbjct: 157 EETFGEDPYLVANMAVSYVEGIQGKDFEE-------KVIATGKHFVGYAMSEGGMNWAPV 209

Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
                   +  ++L + Y  PFE  VK      IM AY+ ++GIP  A+R LL++ AR +
Sbjct: 210 -------HIPERELREVYLYPFEVAVKVAGLKSIMPAYHEIDGIPCHANRKLLTEIARNE 262

Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN---CGSFLQKHTKAAVKQ 345
           WGF G   SD   V  I D     K+ E+A    L AG+D+       F +K  + A+K+
Sbjct: 263 WGFDGIFVSDYSGVKNILDYHKSVKTYEEAAYISLWAGLDIELPRIECFTEKFIE-ALKE 321

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            K   + +D A+  +  ++ RLGLF+ NP ++    I  ++  +   + LA + AQ+ IV
Sbjct: 322 GKFDMAVVDAAVKRVLEMKFRLGLFD-NPFVKTENII--ELFDNKEQRDLARKVAQESIV 378

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE----- 460
           LLKN  G+LPL +     +A+IGPNANS + LLG+Y+ P+  S T +  ++  V+     
Sbjct: 379 LLKND-GILPLKEKDLKKVAVIGPNANSVRNLLGDYSYPAHISTTEMFFMKEEVDLGDED 437

Query: 461 ------------------------NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
                                     VY  GCD  +   +  ++A   A+ AD V++++G
Sbjct: 438 AFVKKVVNIKSVYEVIKERIGEKTEVVYAKGCDVNSHDKSGFEEAKKAAQDADVVIVVVG 497

Query: 497 ------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
                 LD T   E  DR  L LPG Q+ELI  +A+  +  ++++L+ G PV        
Sbjct: 498 DKAGLKLDCTS-GESRDRASLKLPGVQEELIEEIAKVNQN-IVVILVNGRPV--ALENIW 553

Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +   ++L A +PGE GA A+A+++FG +NPG
Sbjct: 554 QKSKAVLEAWFPGEEGAEAIADIVFGKYNPG 584


>gi|374311316|ref|YP_005057746.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358753326|gb|AEU36716.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 773

 Score =  247 bits (631), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 200/628 (31%), Positives = 297/628 (47%), Gaps = 81/628 (12%)

Query: 5   KLSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLT 64
           K  LV   +CL   +L           S   S  +     + ++ LP+  R  DL+ R+T
Sbjct: 14  KRFLVLTGVCLSSVTLFPAYSQ-----SIASSGKTKTVLIYEQSNLPLETRLADLLGRMT 68

Query: 65  LDEKISQL-----------VNSAPAIPRLGIPAYEWWSEALHGVAGVGK----------- 102
           L+EK+ QL             S  ++P    P     ++AL G  G G            
Sbjct: 69  LEEKVRQLDFYSGTDSLLDRGSKNSLPSKQSPFSTAKADALFGSLGAGAIHDLDPTPEQY 128

Query: 103 ----------------GIFFNGTIRG---ATSFPQVILTAASFDSYLWYRIGQAIGLEAR 143
                            +F    + G    T FP  +  A+++D  +  + G AI  EAR
Sbjct: 129 NTIQRWVIEHNRLHIPALFIEEGLHGFDTGTVFPAPLNLASTWDPSVAEKTGSAIAAEAR 188

Query: 144 ALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGK 203
           A       +GM   AP +++ RDPRWGR +E  GEDP +TG+  ++YVRG QG++ N   
Sbjct: 189 A-----TGVGMIL-APVLDLARDPRWGRIEEDFGEDPYLTGQMGLAYVRGAQGESLNTDH 242

Query: 204 LKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIM 263
                   A  KHF A+        T       +  ++L       FE   +QG A   M
Sbjct: 243 -----NVVAEPKHFAAHGSPEGGTNTS---PVHIGERELRSVMLKSFEPAFRQGHAMATM 294

Query: 264 CAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
            AY+ ++GIP  AD  LL    R++WGF G + SD  A+  +Y     A SP+ A    +
Sbjct: 295 AAYHEIDGIPVTADPYLLKTILRQEWGFQGMVLSDLGAIRRLYQLHQVASSPKAASCLAI 354

Query: 324 KAGMDVNCGSF----LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
           K+G+D+    F     QK     V +  LP++++DRA   +  ++  LGLF+  P + P 
Sbjct: 355 KSGVDMQFYDFDHDVFQKALIDCVHEGSLPQADVDRAASAVLRLKFTLGLFD-RPYVDP- 412

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
             + A    S  H  ++LQ+A++ +VLLKN +GLLP  KS    +A+IGPNA+ A+   G
Sbjct: 413 -TLNAKAYRSKPHLDVSLQSARESLVLLKNENGLLPFSKSIQ-RIAVIGPNADVAR--YG 468

Query: 440 NYAGPSC-RSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
           +Y   +    I+ LQ ++         P       S   I  AV  AK AD V+L +G  
Sbjct: 469 DYEEEANGLHISILQGVKAEA------PHAQVEFDSGKDIAAAVAKAKSADVVILGLGEW 522

Query: 499 QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILW 558
           +    E  DR  L LPG Q++L+  +  A  KPV+LVL  G P+ I +AK   ++G+I+ 
Sbjct: 523 RGISGEAFDRTSLDLPGEQEKLLEAIT-ATNKPVVLVLENGRPLTIGWAK--AHVGAIVE 579

Query: 559 AGYPGEAGAVALAEVIFGDHNP-GEFTV 585
           A YPGE G  A+AE +FGD+NP G  T+
Sbjct: 580 AWYPGEFGGQAIAETLFGDNNPAGRLTI 607


>gi|383640112|ref|ZP_09952518.1| glycosyl hydrolase family protein [Sphingomonas elodea ATCC 31461]
          Length = 884

 Score =  247 bits (630), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 166/412 (40%), Positives = 232/412 (56%), Gaps = 47/412 (11%)

Query: 62  RLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVIL 121
           +LT +EK+ QL+N APAIPRL IP Y WW+E+LHG  G              T+FP+ I 
Sbjct: 41  QLTPEEKLGQLLNVAPAIPRLNIPRYNWWTESLHGALGTVP----------TTNFPEPIG 90

Query: 122 TAASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDPRWGRGQ 173
            AASFD  L + +  AI  E + L+   +          G+  W+PNINIFRDPRWGRGQ
Sbjct: 91  LAASFDVPLVHDVAAAISSEVQGLHALARQTGRMGRIGTGLDTWSPNINIFRDPRWGRGQ 150

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           ET GEDP +T +  V++V G+QG   +           A  KHF  +   +   +TR++ 
Sbjct: 151 ETYGEDPFLTARMGVAFVTGMQGPNPD------LPDVIATPKHFAVH---SGPESTRHEA 201

Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
           +  V+  DL DTY P F + V +G+A  +MCAYNR++G P+CA   LL    R  WGF G
Sbjct: 202 NVFVSRHDLEDTYLPAFRAAVVEGKAGSVMCAYNRIDGRPACASDELLKDHLRGAWGFKG 261

Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-------FLQKHTKAAVKQK 346
           Y+ SDCDAV  I D   YA     AV   ++AG+D  C +        L    + A+ + 
Sbjct: 262 YVVSDCDAVKDISDNHHYAPDGATAVAAAIRAGVDNECHTQTLTDTGGLTDRYREALAKG 321

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGN-PTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
            + + EIDR L  LFS R+R    NG+ P ++  G   +D + +PAH+ LAL AA+  +V
Sbjct: 322 LITQGEIDRTLVRLFSARLR----NGDLPGIRTTGGKRSD-IGAPAHRALALAAAEKSLV 376

Query: 406 LLKNSHGLLPL-PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQ 456
           LLKN  GLLPL P S+   +A+IGP  ++ + L GNY+  S +S  P+  ++
Sbjct: 377 LLKND-GLLPLKPGSR---IAVIGPLGDATRVLRGNYS--SSQSGEPISVVE 422



 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 54/102 (52%), Gaps = 13/102 (12%)

Query: 489 DHVVLMMGLDQTQEKEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLC 538
           D +V ++GL    E EE           D+  L L   QQ L+   A A  KP+++V + 
Sbjct: 614 DVLVAVVGLTSDLEAEESPVQIPGFKGGDKTSLDLLPDQQALL-EAARATGKPLVVVAMN 672

Query: 539 GGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           G P+++++AK   +  +IL A YPGE G  A+A V+ G  +P
Sbjct: 673 GSPINLSWAK--DHAAAILEAWYPGELGGTAIANVLSGKTSP 712


>gi|217968103|ref|YP_002353609.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
 gi|217337202|gb|ACK42995.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
           DSM 6724]
          Length = 756

 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 183/516 (35%), Positives = 273/516 (52%), Gaps = 54/516 (10%)

Query: 72  LVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLW 131
           L NS   IP L         EALHG              +G+T FPQ I  A++++  L 
Sbjct: 124 LENSEIKIPVL------IHDEALHGCMA-----------KGSTIFPQAIGMASTWNPELI 166

Query: 132 YRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYV 191
           Y++  AIG E R+     + I     +P INI RDPR GR +ET GEDP +  + AV+Y+
Sbjct: 167 YQVATAIGKETRS-----RGIHQVL-SPTINIARDPRCGRTEETYGEDPYLASRMAVAYI 220

Query: 192 RGVQGDTFNGGKLKGKLQASACCKHFTA-YDLDNWKGTTRYKFDARVTMQDLADTYQPPF 250
           +GVQ           +    A  KHF A +  D  + +    F  R+    L + Y P F
Sbjct: 221 KGVQ-----------EQGVIATPKHFAANFVGDGGRDSYPIHFSERL----LREVYFPAF 265

Query: 251 ESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEG 310
           ++ +K+  A  +M AYN ++GIP  +++ LL+   R++WGF GY+ SD  +V  +     
Sbjct: 266 KASIKEAGALSLMAAYNSLDGIPCSSNKWLLTDVLRKEWGFKGYVVSDYFSVLHLMTKHK 325

Query: 311 YAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGL 369
            A+S  +A    L+AG+D+    S   +     VK  KL E  I+ A+  +  V+   GL
Sbjct: 326 VAESKAEAARLALEAGLDMELPDSDCFEEMINLVKGGKLSEETINEAVRRILGVKFWAGL 385

Query: 370 FNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGP 429
           F+ NP + P       V     H+ LAL+ A++ IVLLKN  G+LPL K    S+A+IGP
Sbjct: 386 FD-NPFVDP--DYAERVNDCAEHRELALRVARESIVLLKN-EGILPLSKDIG-SIAVIGP 440

Query: 430 NANSAKTLLGNYAGPSCRSITPLQALQNYVEN--TVYY-PGCDTVACSSASIDKAVDIAK 486
           NA  A   LG Y+G   + +TPL+ ++N +EN   +Y+  GC     S +  D+A+ IA+
Sbjct: 441 NA--AVPRLGGYSGYGVKIVTPLEGIKNKMENKAKIYFAEGCGLNDTSKSGFDEAIKIAQ 498

Query: 487 GADHVVLMMGLDQTQ-EKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDIT 545
            +D  +L +G    + E E+ DR +L LPG Q+ELI  +      PVI+VL+ G    IT
Sbjct: 499 KSDVAILFVGNSVPETEGEQRDRHNLNLPGVQEELIKEICN-TNTPVIVVLINGSA--IT 555

Query: 546 FAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
              +   + +++ A YPGE G  A+A+V+FGD+NPG
Sbjct: 556 MMNWIDKVQAVIEAWYPGEEGGNAIADVLFGDYNPG 591


>gi|358342292|dbj|GAA27551.2| probable beta-D-xylosidase 7 [Clonorchis sinensis]
          Length = 826

 Score =  244 bits (624), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 179/586 (30%), Positives = 276/586 (47%), Gaps = 72/586 (12%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNS--------APAIPRLGIPAYEWWSEALH 95
           PF   +LP + R  DL++RLT +E I Q+ N         AP I RL I AY+W +    
Sbjct: 28  PFRNPSLPANFRVDDLLARLTNEELIQQVSNGGAGPQHGPAPGIARLNISAYQWRTNP-- 85

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI--- 152
              G G+           T FPQ +   A+FD +  YR+ +A GLE RA +N  +A    
Sbjct: 86  ---GDGR----------ITPFPQPVNLGATFDVHTVYRVARATGLEMRARWNRAKAKKTY 132

Query: 153 ----GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG---------DTF 199
               G+  +AP +N+ R P WGR QET GEDP + GK A ++VRG+ G         D  
Sbjct: 133 RDGNGIHLFAPVVNLLRHPLWGRNQETFGEDPFMIGKLARTFVRGLGGWKNAEPQSLDEQ 192

Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
           N       L   A CKHF  +        +R  F+A VT  DL  TY P F +C++ G  
Sbjct: 193 NLSSQPDVLLVGANCKHFAVHTGPEDFPVSRLSFEANVTDVDLWQTYLPAFRACLEAGAV 252

Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS-IIYDAEGYAKSPEDA 318
           S +MCAY+ +NG P C +  LL++  R++W F G++ +DC A+  +I+  + +    E A
Sbjct: 253 S-VMCAYSGINGTPDCINHWLLTELLRQKWKFKGFVVTDCGALQFVIWKHQIFNHYNETA 311

Query: 319 VVDVLKAGMDVNCGSFLQKHTKAAVKQ----KKLPESEIDRALHNLFSVRMRLGLFNGNP 374
           +  V +AG+++           + +        L   ++      LF  R+  G FN   
Sbjct: 312 MAAV-RAGVNLENSVVYATEVFSTLPHLLASGSLSRDQLIEMARPLFLTRLMQGEFNP-V 369

Query: 375 TMQPFGKIG-ADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS------LALI 427
            M P+  +   + + +  H+ +AL      IVLL+N    LPL  + S S      +A++
Sbjct: 370 EMDPYRLLAPEEAILNEDHRRVALATTARSIVLLQNRDRFLPLKNNMSDSGGPLRHIAIV 429

Query: 428 GPNANSAKTLLGNY-AGPSCRSITPLQALQNYVENTVYYPG--CDTVACSSASIDKAVDI 484
           GP A S   L G+Y   P      PL    + +   ++      D   CSS + D A+  
Sbjct: 430 GPFATSVTELYGHYRTAPEPEIEVPLSKGLSQLSRRMHASDICTDGGRCSSLN-DDALHS 488

Query: 485 AKGADH---VVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKK----------- 530
             G D    +VL +G     E E +DR ++ LPG+Q EL+    + +             
Sbjct: 489 TLGYDDLDLIVLSLGTGSEVEGENVDRQNITLPGKQPELLEETLKLSSGLGNSGLSKRTV 548

Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           P+IL++   GP++I+ A  + N+ +I W G+PG     A+  ++ G
Sbjct: 549 PIILLVFSAGPINISRAVENENVKAIFWCGFPGPLVGDAMRHLLLG 594


>gi|380692929|ref|ZP_09857788.1| glycoside hydrolase family protein [Bacteroides faecis MAJ27]
          Length = 777

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 192/603 (31%), Positives = 293/603 (48%), Gaps = 99/603 (16%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
           PI +R +DL+S++ +DEK  Q+     +  +    +P  +W    W + +  +     G+
Sbjct: 38  PIEERVKDLLSQMNMDEKTCQMATLYGSGRVLADALPTEKWKSEIWKDGIGNIDEEHNGL 97

Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
           GK                             GI   F N  IRG     AT FP      
Sbjct: 98  GKFGSEYAFPYAKHVKAIHDIQRWFVEETRLGIPVDFTNEGIRGVCHEKATFFPAQCGQG 157

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           ++++  L  RIG+    EA AL       G T  ++P ++I +DPRWGR  E  GEDP +
Sbjct: 158 STWNKELIARIGEVEAKEAVAL-------GYTNIYSPILDIAQDPRWGRAVECYGEDPYL 210

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
            G+     ++ +Q         K KL A+   KHF  Y +      G TR   D  V  +
Sbjct: 211 VGQLGKQMIQSLQ---------KHKLVATP--KHFAVYSIPVGGRDGGTRT--DPHVAPR 257

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           ++   Y  PF    ++  A G+M +YN  +G P       L++  R++WGF GY+ SD D
Sbjct: 258 EMRTLYLEPFRVAFQEAGALGVMSSYNDYDGEPITGSYRFLTQILRQEWGFKGYVVSDSD 317

Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG----SFLQKHTKAAVKQKKLPESEIDRA 356
           AV  I      A + E+AVV  + AG++V       +   K  ++A+ + K+ ++ ID+ 
Sbjct: 318 AVEFISSKHKVADNNEEAVVQSVNAGLNVRTNFSSPAGFIKPLRSAIAKGKVSQATIDQR 377

Query: 357 LHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
           +  +  V+  LGLF+ NP  +  GK+   +V    HQ +AL+AA+  IVLLKN   LLPL
Sbjct: 378 VSEILYVKFWLGLFD-NP-YRGDGKLADKIVHCKEHQAVALEAARQSIVLLKNQDNLLPL 435

Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYV--ENTVYYPGC----- 468
            K+   S+A+IGPNA+  K L+  Y GPS   I T  + ++  +     VY  GC     
Sbjct: 436 QKTLK-SVAVIGPNADEQKELICRY-GPSNAPIKTVYKGIKEALPGAKVVYKKGCEIVDP 493

Query: 469 ----------DTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
                     D        +D+A++ AK A+ V++++G  +   +EE  R  L LPGRQ+
Sbjct: 494 HFPESEVLPFDITPKEQQIMDEAIEAAKSAEVVIMVLGGSEVTVREERSRTSLDLPGRQE 553

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           EL+  V +   KP ILV++ G    I +AK  + + +IL A +PGE    A+AE IFGD+
Sbjct: 554 ELLKAVCKLG-KPTILVMIDGRASSINYAK--KYVPAILHAWFPGEFCGQAVAETIFGDN 610

Query: 579 NPG 581
           NPG
Sbjct: 611 NPG 613


>gi|312794525|ref|YP_004027448.1| glycoside hydrolase family 3 domain-containing protein
           [Caldicellulosiruptor kristjanssonii 177R1B]
 gi|312181665|gb|ADQ41835.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
           kristjanssonii 177R1B]
          Length = 770

 Score =  243 bits (620), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 190/615 (30%), Positives = 292/615 (47%), Gaps = 115/615 (18%)

Query: 50  LPISQRARDLVSRLTLDEKISQLV-----------------------NSAPAIPRLGIPA 86
           + I +R  DL+ ++T++EK+ QL                        N    I RL   +
Sbjct: 1   MSIEKRVNDLLQKMTIEEKVYQLTSILVQDILENDKFSPQKAKEKIPNGIGQITRLAGAS 60

Query: 87  YEWWSEALHGVAGVGKGIFFNGTI----------------RGATSFPQVILTAASFDSYL 130
                EA      + K +  N  +                +GAT FPQ I  A +FD+ +
Sbjct: 61  NLSPEEAAKTANEIQKFLIENTRLGIPAMIHEESCSGFMAKGATVFPQSIGVACTFDNEI 120

Query: 131 WYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSY 190
              + + I  + +A+  A QA+     AP I++ RD RWGR +ET GEDP +    AVSY
Sbjct: 121 VEELAKVIRTQMKAV-GAHQAL-----APLIDVARDARWGRVEETFGEDPYLVANMAVSY 174

Query: 191 VRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD----NWKGTTRYKFDARVTMQDLADTY 246
           V+G+QGD    G +       A  KHF  Y +     NW           +  ++L + Y
Sbjct: 175 VKGLQGDDIKDGIV-------ATGKHFVGYAMSEGGMNWAPV-------HIPERELREVY 220

Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
             PFE  VK      IM AY+ ++GIP  A+R LL+  AR +WGF G   SD   V  + 
Sbjct: 221 LYPFEVAVKVAGLKSIMPAYHEIDGIPCHANRKLLTDIARGEWGFDGIYVSDYSGVKNLL 280

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCGS---FLQKHTKAAVKQKKLPESEIDRALHNLFSV 363
           D     K+ E+A    L AG+D+       F ++  K A+K+ K   + +D A+  +  +
Sbjct: 281 DYHKSVKTYEEAAALSLWAGLDIELPKIECFTEEFIK-ALKEGKFDMALVDAAVKRVLEM 339

Query: 364 RMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVS 423
           + RLGLF+ NP ++  G +  ++  +   + L+ + AQ+ +VLLKN    LPL K     
Sbjct: 340 KFRLGLFD-NPYIKTEGVV--ELFDNKEQRQLSRKVAQESMVLLKND-SFLPLSKDLK-K 394

Query: 424 LALIGPNANSAKTLLGNYAGPSCRSITPLQAL-----------QNYVEN----------- 461
           +A+IGPNANS + LLG+Y+ P+   I  L+             + +V+N           
Sbjct: 395 IAVIGPNANSVRNLLGDYSYPA--HIATLEMFFIKEDRGVGNEEEFVKNVINMKSIFEAI 452

Query: 462 ---------TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG------LDQTQEKEEL 506
                     VY  GCD  +   +  ++A   A+GAD V+L++G      LD T   E  
Sbjct: 453 KDKVSSNTEVVYAKGCDVNSQDKSGFEEAKKAAEGADAVILVVGDKAGLRLDCTS-GESR 511

Query: 507 DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
           DR  L LPG Q++L+  +       V+ VL+ G PV + +     N+ ++L A +PGE G
Sbjct: 512 DRASLRLPGVQEDLVKEIVSVNPNTVV-VLVNGRPVALDWIM--ENVKAVLEAWFPGEEG 568

Query: 567 AVALAEVIFGDHNPG 581
           A A+A+++FGD+NPG
Sbjct: 569 ADAVADILFGDYNPG 583


>gi|410097219|ref|ZP_11292201.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409224537|gb|EKN17469.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 805

 Score =  242 bits (618), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 183/602 (30%), Positives = 290/602 (48%), Gaps = 97/602 (16%)

Query: 51  PISQRARDLVSRLTLDEKISQL--VNSAPAIPRLGIPAYEW----WSEAL---------- 94
           PI +R  DL+ ++T++EK  QL  +    A+ +  +P  EW    W + +          
Sbjct: 67  PIDKRVEDLLKQMTVEEKTCQLGTIYGYGAVLKDTLPTDEWKTRIWKDGIGNIDEHLNGE 126

Query: 95  ------------HGVAGVGKGIFF-------------NGTIRG-----ATSFPQVILTAA 124
                       H  A      FF             N  IRG     +T FP  I    
Sbjct: 127 WKRTSLDFPYSNHAEAMNKVQAFFVEETRLGIPADLTNEGIRGLKHEKSTFFPAQIGQGC 186

Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLVT 183
           ++D  L Y IG+  G EA+AL       G T  ++P +++ RDPRWGR  E+ GED  + 
Sbjct: 187 TWDKELIYEIGRITGEEAKAL-------GYTNIYSPILDLSRDPRWGRTVESYGEDSYLA 239

Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
           G+     V G+Q +           +  +  KHF  Y +         + D   + Q++ 
Sbjct: 240 GELGRQQVLGIQSN-----------RVVSTPKHFAIYGIPGGGRDCYSRTDPHASPQEVH 288

Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
           + +  PF    ++  A G MC++N  NG P  A   L+++  R QWGF GY+ SD  A+ 
Sbjct: 289 ELHLEPFRIAFQEAGALGTMCSHNDYNGTPVSASHYLMTELLRNQWGFKGYVVSDSWAID 348

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT------KAAVKQKKLPESEIDRAL 357
                     + E+AV   L AG++V   +F ++        + A+++  + ES +D+ +
Sbjct: 349 KNVKFYHIVDTEEEAVASELNAGLNVR--TFFEQSEVFIEALRRALQKGLVEESTLDQRV 406

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
             +  V+  LGLF+ +P ++   K+   +V S  ++ ++L+AA++ IVLLKN +  LPL 
Sbjct: 407 REVLYVKFWLGLFD-DPYVKD-TKLADKIVNSDKNREVSLRAARESIVLLKNENNTLPLS 464

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCDTVACS 474
           K+   ++A+IGP A+  K+L   Y   +   IT LQ L+N +    N +Y  GC+    +
Sbjct: 465 KTLK-NIAVIGPQADEVKSLTSRYGSHNPNVITGLQGLKNLLGENVNLMYAKGCNVRDKN 523

Query: 475 ---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
                             ID+AV+IAK A+  ++ +G D     E   RV+L L GRQ+E
Sbjct: 524 FPQSDVMYFELSDKEKEEIDEAVEIAKKAEVAIIYVGDDFRTIGESRSRVNLDLSGRQKE 583

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           L+ R  +A   PV+LVL  G PV + +   D N+ +I+ A YPGE    A+AEV+FGD+N
Sbjct: 584 LV-RAVQATGTPVVLVLFNGRPVTLNWE--DANLPAIVEAWYPGEFSGQAVAEVLFGDYN 640

Query: 580 PG 581
           PG
Sbjct: 641 PG 642


>gi|393786524|ref|ZP_10374660.1| hypothetical protein HMPREF1068_00940 [Bacteroides nordii
           CL02T12C05]
 gi|392660153|gb|EIY53770.1| hypothetical protein HMPREF1068_00940 [Bacteroides nordii
           CL02T12C05]
          Length = 841

 Score =  242 bits (618), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 177/611 (28%), Positives = 282/611 (46%), Gaps = 98/611 (16%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV- 97
           F   + P+ +R +DL+S++T++EK  QL        + +  +P   W    W + +  + 
Sbjct: 82  FEDPSQPVEKRVKDLLSQMTIEEKSCQLATLYGFGRVLKDSLPTPAWKEAIWKDGIANID 141

Query: 98  ---AGVGKGI-----------------------------------FFNGTIRG-----AT 114
               GVG+G                                    F N  I G     AT
Sbjct: 142 EQLNGVGRGAKRVPHLIVPFSNHVKAINETQRWFIEETRLGIPVDFSNEGIHGLNHTKAT 201

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQ 173
             P  I   +++++ L    G+ +G EAR L       G T  +AP +++ RDPRWGR  
Sbjct: 202 PLPAPIAIGSTWNTELVREAGEIVGKEARVL-------GYTNVYAPILDVVRDPRWGRTL 254

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           E  GEDP + G+  V  V G+Q               +A  KHF  Y           + 
Sbjct: 255 ECYGEDPYLIGELGVQMVDGIQSQG-----------VAATLKHFAVYSSPKGGRDGNCRT 303

Query: 234 DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
           D  VT ++L + Y  PF+  ++Q    G+M +YN  NG P  +    L+K  R ++GF G
Sbjct: 304 DPHVTPRELHEIYLYPFKHVIQQSHPMGVMSSYNDWNGEPVTSSYYFLTKLLREEYGFDG 363

Query: 294 YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKL 348
           Y+ SD  AV  ++     A+  ++AV  VL+AG++V         F+    +  + + K+
Sbjct: 364 YVVSDSQAVEFVHTKHQVAEDYDEAVRQVLEAGLNVRTHFTPPADFILP-IRRLLAENKI 422

Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
             + ID+ +  + +V+ RLGLF+      P  K   +V  +  H     +  +  +VLLK
Sbjct: 423 SMATIDKRVSEVLAVKFRLGLFDAPYRDNP--KEADEVAGADKHSEFVKEMQRQSLVLLK 480

Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYY 465
           N   LLPL K +   + + GP A+    ++  Y      +IT LQ +++Y++     VY 
Sbjct: 481 NDGQLLPLNKKEIKKVLVTGPLADEDNFMISRYGPNGLPTITVLQGIKDYLKGDVEVVYS 540

Query: 466 PGCDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
            GC+ +               A   A +DKAV  A+ AD ++ +MG D+ +  E   R  
Sbjct: 541 KGCNIIDKEWPASEVLPAVLTAEEVADMDKAVSEAQSADVIIAVMGEDEYRVGESRSRTS 600

Query: 511 LVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVAL 570
           L LPGRQ+EL+ +   A  KPV+LVL+ G P+ I +   D+N+ +IL A +P   G   +
Sbjct: 601 LELPGRQRELL-QALHATGKPVVLVLINGQPLTINWE--DQNLPAILEAWFPSFQGGKII 657

Query: 571 AEVIFGDHNPG 581
           AE +FGD+NPG
Sbjct: 658 AETLFGDYNPG 668


>gi|150002739|ref|YP_001297483.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|294776994|ref|ZP_06742455.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
 gi|149931163|gb|ABR37861.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
 gi|294449242|gb|EFG17781.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
          Length = 788

 Score =  242 bits (618), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 180/601 (29%), Positives = 283/601 (47%), Gaps = 95/601 (15%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
           P+  R +DL+S++TL+EK  Q+     +  + +  +P   W    W + +  +     G+
Sbjct: 49  PLEDRVQDLLSQMTLEEKTCQMATLYGSGRVLKDALPQNNWKTEVWKDGIGNIDEEHNGL 108

Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
                                               F N  IRG     AT FP      
Sbjct: 109 GAFKSEYSFPYAKHVNAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 168

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L  RIG+    EA AL       G T  ++P ++I +DPRWGR  ET GEDP +
Sbjct: 169 ATWNKKLIARIGEVEAKEAVAL-------GYTNIYSPILDIAQDPRWGRCVETYGEDPYL 221

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            G+     +  +Q           K    A  KHF  Y +       + + D  V  +++
Sbjct: 222 VGELGKQMITSLQ-----------KYNLVATPKHFAVYSIPIGGRDGKTRTDPHVAPREM 270

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
              Y  PF    ++  A G+M +YN  +G P     + L++  R++WGF GY+ SD +AV
Sbjct: 271 RTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAV 330

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
             I +    A + ED +   + AG+++         F+    K AV   K+ +  +D+ +
Sbjct: 331 EFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFILPLRK-AVDNGKISQETLDKRV 389

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
             +  ++ RLGLF+ NP  +  GK    +V S  HQ ++L+AA+  +VLLKN   LLPL 
Sbjct: 390 AEILRIKFRLGLFD-NP-YRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNETNLLPLS 447

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTV---- 471
           KS   S+A+IGPNAN    L+  Y   +    T  Q ++  + +T  +Y  GCD +    
Sbjct: 448 KSIR-SIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHTEVIYKKGCDIIDPHF 506

Query: 472 -----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
                      A     +++A+  AK A+ VV+++G ++   +E+  R  L LPGRQ+EL
Sbjct: 507 PESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEEL 566

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  V  A  KP+ILV+L G    I +A    +I +IL A +PGE    A+AE +FGD+NP
Sbjct: 567 LKAVC-ATGKPIILVMLDGRASSINYAA--AHIPAILHAWFPGEFCGQAVAEALFGDYNP 623

Query: 581 G 581
           G
Sbjct: 624 G 624


>gi|270296098|ref|ZP_06202298.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
 gi|270273502|gb|EFA19364.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
          Length = 798

 Score =  242 bits (617), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 180/602 (29%), Positives = 293/602 (48%), Gaps = 97/602 (16%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
           P+  R ++L+S++TL+EK  Q+     +  +    +P+  W    W + +  +     G+
Sbjct: 59  PLEARVQNLLSQMTLEEKSCQMATLYGSGRVLNDALPSDNWKNEVWKDGIGNIDEEHNGL 118

Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
                                               F N  IRG     AT FP      
Sbjct: 119 GSFKSAYSFPYAHHVKTKHAIQRWFVENTRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 178

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L  +IG+A   EA  L       G T  ++P ++I +DPRWGR  ET GEDP  
Sbjct: 179 ATWNKELIAQIGEAEAREASVL-------GYTNIYSPILDIAQDPRWGRCVETYGEDPYH 231

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            G+     +  +Q         K KL ++   KHF  Y +       + + D  V  +++
Sbjct: 232 AGQMGKQMILSLQ---------KNKLVSTP--KHFAVYSIPVGGRDGKTRTDPHVAPREM 280

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
              Y  PF     +  A G+M +YN  +G P     + L++  R++WGF GY+ SD +AV
Sbjct: 281 RTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAV 340

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
             I      A   EDAV   + AG+++         F+    ++AVK+ K+ +  +++ +
Sbjct: 341 EFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFILP-LRSAVKKGKISQETLNQRV 399

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
             +  V+  LGLF+ NP      + G  +V SP HQ LAL+AA+  +VLLKN H  LPL 
Sbjct: 400 AEILRVKFWLGLFD-NPYRGDEKRAG-QIVHSPEHQQLALEAARQSLVLLKNEHQTLPLS 457

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL-QALQNYV--ENTVYYPGCDTV--- 471
           KS   S+A+IGPNA+  + L+  Y GP+   IT + + ++  +   + VY  GCD +   
Sbjct: 458 KSIR-SVAVIGPNADERQQLICRY-GPANAHITTIYEGIKKMLPQADVVYKKGCDIIDPH 515

Query: 472 ------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
                       A  +  +++A++ AKGA+  V+++G ++   +E+  R  L LPGRQ+E
Sbjct: 516 FPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSRTSLDLPGRQKE 575

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           L+ ++ +   KPV+LV++ G    I FA    ++ +I+ A +PGE G  A+AE +FGD+N
Sbjct: 576 LLKKICQLG-KPVVLVMIDGRASSINFAA--THVPAIIHAWFPGEFGGQAIAEALFGDYN 632

Query: 580 PG 581
           PG
Sbjct: 633 PG 634


>gi|410097652|ref|ZP_11292633.1| hypothetical protein HMPREF1076_01811 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409223742|gb|EKN16677.1| hypothetical protein HMPREF1076_01811 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 780

 Score =  242 bits (617), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 177/535 (33%), Positives = 281/535 (52%), Gaps = 74/535 (13%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP   +  E  HG   +G            T FP  +  A+++++ L +R+G+AI L
Sbjct: 125 RLGIPVL-FAEECPHGHMAIG-----------TTVFPTSLSQASTWNAELMHRMGEAIAL 172

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EAR+    G  IG   + P ++I R+PRW R +ET GEDP++T    V++++G+QG + N
Sbjct: 173 EARS---QGANIG---YGPVLDIAREPRWSRMEETFGEDPVLTTHLGVAFMKGMQGKSQN 226

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            GK        +  KHF AY +            A V M+ L   Y PPF+  V++G A+
Sbjct: 227 DGK-----HLYSTLKHFAAYGIPE---AGHNGARANVGMRQLFSDYLPPFKKAVEEGVAT 278

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            IM +YN ++G+P  +++ LL+   R QWGF G++ SD  ++  I  A   AK  ++A V
Sbjct: 279 -IMTSYNTIDGVPCTSNKYLLTDVLRDQWGFKGFVYSDLTSIEGIVGAR-VAKDNKEAAV 336

Query: 321 DVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             LKAG+D++  G+   K+ + A+++  +   +++RA+ N+  ++ R+GLF  NP + P 
Sbjct: 337 LALKAGLDMDLGGNAYGKNLQKALEEGAITMDDLNRAVANVLRLKFRMGLFE-NPYVSP- 394

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
            +    VV S AH+ LA + A++GIVLLKN  G+LPL K+   ++A+IGPNA+     LG
Sbjct: 395 -EQAKQVVRSKAHKELAREVAREGIVLLKN-EGVLPLKKNIG-NIAVIGPNADMMYNQLG 451

Query: 440 NYAGPSCRS--ITPLQALQNYVENTV---YYPGCDTVACSSASID--------------- 479
           +Y  P  R   +T L  ++  V  +    Y  GC     ++++I                
Sbjct: 452 DYTAPQEREEIVTVLDGIRKAVSPSTKVNYVKGCAIRDITTSNITAAVEAARAADAVVLV 511

Query: 480 ---------KAVDIAKGADHVV-----LMMGLDQTQEKEELDRVDLVLPGRQQELITRVA 525
                    K   I  GA  V      L+  +D     E  DR  L L G Q++L+  VA
Sbjct: 512 VGGSSARDFKTKYIGTGAADVSNDGNQLLSDMDC---GEGYDRSTLRLLGDQEKLLKAVA 568

Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            A  KP++++ + G  +++  A       ++L A YPGE G  A+A+V+FGD+NP
Sbjct: 569 -ATGKPLVVIYIQGRTLNMNLA--SEKAQALLTAWYPGEQGGTAIADVLFGDYNP 620


>gi|423303655|ref|ZP_17281654.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
           CL03T00C23]
 gi|423307623|ref|ZP_17285613.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
           CL03T12C37]
 gi|392688019|gb|EIY81310.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
           CL03T00C23]
 gi|392689492|gb|EIY82769.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
           CL03T12C37]
          Length = 801

 Score =  241 bits (616), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 180/602 (29%), Positives = 293/602 (48%), Gaps = 97/602 (16%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
           P+  R ++L+S++TL+EK  Q+     +  +    +P+  W    W + +  +     G+
Sbjct: 62  PLEVRVQNLLSQMTLEEKSCQMATLYGSGRVLNDALPSDNWKNEVWKDGIGNIDEEHNGL 121

Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
                                               F N  IRG     AT FP      
Sbjct: 122 GSFKSAYSFPYAHHVKTKHAIQRWFVENTRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 181

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L  +IG+A   EA  L       G T  ++P ++I +DPRWGR  ET GEDP  
Sbjct: 182 ATWNKELIAQIGEAEAREASVL-------GYTNIYSPILDIAQDPRWGRCVETYGEDPYH 234

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            G+     +  +Q         K KL ++   KHF  Y +       + + D  V  +++
Sbjct: 235 AGQMGKQMILSLQ---------KNKLVSTP--KHFAVYSIPVGGRDGKTRTDPHVAPREM 283

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
              Y  PF     +  A G+M +YN  +G P     + L++  R++WGF GY+ SD +AV
Sbjct: 284 RTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAV 343

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
             I      A   EDAV   + AG+++         F+    ++AVK+ K+ +  +++ +
Sbjct: 344 EFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFILP-LRSAVKKGKISQETLNQRV 402

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
             +  V+  LGLF+ NP      + G  +V SP HQ LAL+AA+  +VLLKN H  LPL 
Sbjct: 403 AEILRVKFWLGLFD-NPYRGDEKRAG-QIVHSPEHQQLALEAARQSLVLLKNEHQTLPLS 460

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL-QALQNYV--ENTVYYPGCDTV--- 471
           KS   S+A+IGPNA+  + L+  Y GP+   IT + + ++  +   + VY  GCD +   
Sbjct: 461 KSIR-SVAVIGPNADERQQLICRY-GPANAHITTIYEGIKKMLPQADVVYKKGCDIIDPH 518

Query: 472 ------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQE 519
                       A  +  +++A++ AKGA+  V+++G ++   +E+  R  L LPGRQ+E
Sbjct: 519 FPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSRTSLDLPGRQEE 578

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           L+ ++ +   KPV+LV++ G    I FA    ++ +I+ A +PGE G  A+AE +FGD+N
Sbjct: 579 LLKKICQLG-KPVVLVMIDGRASSINFAA--THVPAIIHAWFPGEFGGQAIAEALFGDYN 635

Query: 580 PG 581
           PG
Sbjct: 636 PG 637


>gi|410096880|ref|ZP_11291865.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225497|gb|EKN18416.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 799

 Score =  241 bits (616), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 166/524 (31%), Positives = 262/524 (50%), Gaps = 58/524 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP  ++ +E +HG+         N T   AT  P  I   ++++  L ++ G   G 
Sbjct: 139 RLGIPV-DFSNEGIHGL---------NHT--KATPLPAPINIGSTWNRDLVHQAGDIAGK 186

Query: 141 EARAL-YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
           EA+AL YN         +AP +++ RDPRWGR  ET GEDP + G+  +  V+G+Q    
Sbjct: 187 EAKALGYN-------NVYAPILDVARDPRWGRVLETYGEDPYLVGELGIQMVKGIQQ--- 236

Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
           NG         ++  KHF  Y +         + D  V  ++L + +  PF+  V++   
Sbjct: 237 NG--------VASTLKHFAVYSIPKGGRDAAVRTDPHVAPRELHEIHLYPFKRVVQKAHP 288

Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
            G+M +YN  +G+P  A    L++  R+++GF GYI SD +AV  +      A S E+AV
Sbjct: 289 KGVMSSYNDWDGVPVTASYYFLTQLLRQEYGFKGYIVSDSEAVEFVQTKHHVADSYEEAV 348

Query: 320 VDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
             V++AG++V       K      +  VK+ KL    +DR + ++  V+  LGLF+    
Sbjct: 349 RQVVEAGLNVRTNFTHPKDYILPVRKLVKEGKLSMKSVDRMVADVLRVKFELGLFDSPYV 408

Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
             P  K    +V +  H+   L   +  +VLLKN + LLPL K+++  + + GP A    
Sbjct: 409 KDP--KAADKIVGADKHRDFVLDMQKQSLVLLKNENNLLPLDKNQTKKVLIAGPLAKETN 466

Query: 436 TLLGNYAGPSCRSITPLQALQNYVEN---TVYYPGCDTVACS---------------SAS 477
            ++  Y      +IT    +++Y+ N    VY  GC+    +                  
Sbjct: 467 YMISRYGPQGLDNITVYDGIKDYLGNQTEVVYAKGCEVKDANWPDSEIVPTPLTDEEKKG 526

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           I +A   A   D ++ ++G D++   E   R  L LPGRQQ+L+  +  A  KPV+LVL+
Sbjct: 527 IAEAATAAADCDVIIAVLGEDESCTGESKSRTGLDLPGRQQQLLEAL-HATGKPVVLVLI 585

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            G P+ I +A  DRNI SIL A +PG+ G  A+A+ +FGD+NPG
Sbjct: 586 NGQPLTINWA--DRNIPSILEAWFPGQLGGEAIAQTLFGDYNPG 627


>gi|116312033|emb|CAJ86398.1| OSIGBa0125M19.1 [Oryza sativa Indica Group]
          Length = 247

 Score =  241 bits (616), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 120/203 (59%), Positives = 151/203 (74%), Gaps = 6/203 (2%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           ++P+   +PFC  TLP   RAR LVS LTLDEKI+QL N+A   PRLG+P +EWWSE+LH
Sbjct: 31  ASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLH 90

Query: 96  GVAGVGKGI-FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
           GV   G G+ F +G +R AT FPQVIL+AA+F+  LW    +AI +EARA++NAGQA G+
Sbjct: 91  GVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQA-GL 149

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           TFWAPNIN+FRDPRWGRGQETPGEDP V   Y+V YV+G Q D       +G++  SACC
Sbjct: 150 TFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDY----GEEGRMMLSACC 205

Query: 215 KHFTAYDLDNWKGTTRYKFDARV 237
           KH+ AYDL+ W+G TRY F+A+V
Sbjct: 206 KHYIAYDLEKWRGFTRYTFNAKV 228


>gi|261880507|ref|ZP_06006934.1| xylosidase [Prevotella bergensis DSM 17361]
 gi|270332847|gb|EFA43633.1| xylosidase [Prevotella bergensis DSM 17361]
          Length = 948

 Score =  241 bits (615), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 163/506 (32%), Positives = 259/506 (51%), Gaps = 52/506 (10%)

Query: 105 FFNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWA 158
           F N  IRG     AT+FP  +    +++  L  ++G   G EAR L       G T  +A
Sbjct: 168 FTNEGIRGIESYKATNFPTQLGLGTTWNRQLIRQVGYITGREARLL-------GYTNVYA 220

Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
           P +++ RD RWGR +E  GE P +  +  +   RG+Q D           Q ++  KHF 
Sbjct: 221 PILDVGRDQRWGRYEEIYGESPFLVAELGIQMTRGLQTD----------FQVASTAKHFA 270

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
           AY  +        + D ++  +++ + +  P+E  V++    G M +YN  +GIP     
Sbjct: 271 AYSNNKGGREGMSRVDPQMPPREVENIHLYPWERVVQEAGLLGAMSSYNDYDGIPIQGSY 330

Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GS 333
           + L++  R ++GF GYI SD DA+  ++     A   ++AV   + AG++V C      S
Sbjct: 331 HWLTEVLRHRFGFRGYIVSDSDALEYLFSKHHTAADMKEAVYQAVMAGLNVRCTFRSPDS 390

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
           F+    +  +++ ++P S IDR + ++  V+   G+F+ NP      K     V S  +Q
Sbjct: 391 FVLP-LRELIREGRIPMSVIDRLVGDILRVKFITGIFD-NPYQMNL-KAADQEVNSERNQ 447

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            +ALQA++  IVLLKN   LLPL +SK   + + GPNA+ A   L +Y   +    T L+
Sbjct: 448 AVALQASRQSIVLLKNQDRLLPLDRSKLRRILVCGPNADDASYALTHYGPLAVDVTTVLE 507

Query: 454 ALQNYVENTV---YYPGCDTV---------------ACSSASIDKAVDIAKGADHVVLMM 495
            +++ VEN +   Y  GCD V               +     ID AV +AK +D  ++++
Sbjct: 508 GIRDKVENNIEVSYAKGCDVVDPHWPESEIIGYPMTSQEQQDIDHAVALAKESDVAIVVL 567

Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
           G +     E   R  L LPGRQ +L+ +  +A  KPV+LVL+ G P+ + +A  DR I +
Sbjct: 568 GGNSRTCGENKSRSSLDLPGRQLDLL-KAVQATGKPVVLVLINGRPLSVNWA--DRFIPA 624

Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPG 581
           I+ A YPG  G  A+A+V+FGD+NPG
Sbjct: 625 IVEAWYPGSQGGTAVADVLFGDYNPG 650


>gi|167038437|ref|YP_001666015.1| glycoside hydrolase family 3 [Thermoanaerobacter pseudethanolicus
           ATCC 33223]
 gi|320116830|ref|YP_004186989.1| glycoside hydrolase family 3 domain-containing protein
           [Thermoanaerobacter brockii subsp. finnii Ako-1]
 gi|166857271|gb|ABY95679.1| glycoside hydrolase, family 3 domain protein [Thermoanaerobacter
           pseudethanolicus ATCC 33223]
 gi|319929921|gb|ADV80606.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
           brockii subsp. finnii Ako-1]
          Length = 784

 Score =  241 bits (614), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 185/616 (30%), Positives = 299/616 (48%), Gaps = 103/616 (16%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSA------------------------PAIP 80
           +  +T  + +R  DL+ ++T++EK++QL NS                           I 
Sbjct: 5   YLDSTQSVEKRVEDLLQQMTIEEKVAQL-NSIWVYEILDDMKFSFDKAKRLMSYGIGQIT 63

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTI----------------RGATSFPQVILTAA 124
           RLG  +     E +     + K +  N  +                +GAT FPQ I  A+
Sbjct: 64  RLGGASNLSPRETVRIANQIQKFLIENTRLGIPALIHEESCSGYMAKGATIFPQTIGVAS 123

Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
           ++++ +  ++   I  + +A+  A QA+     AP ++I RDPRWGR +ET GEDP +  
Sbjct: 124 TWNNEIVEKMASVIREQMKAV-GARQAL-----APLLDITRDPRWGRTEETFGEDPYLVM 177

Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
           +  VSY+RG+Q ++   G +       A  KHF  Y   N +G   +   A +  ++L +
Sbjct: 178 RMGVSYIRGLQTESLKEGIV-------ATGKHFVGYG--NSEGGMNWA-PAHIPERELRE 227

Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
            +  PFE+ VK+ + S IM  Y+ ++G+P    + LL+   R+ WGF G + SD  A+S 
Sbjct: 228 VFLYPFEAAVKEAKLSSIMPGYHELDGVPCHKSKKLLNDILRKDWGFEGIVVSDYFAISQ 287

Query: 305 IYDAEGYAKSPEDAVVDVLKAGMDVNCGS--FLQKHTKAAVKQKKLPESEIDRALHNLFS 362
           +Y+        + A    L+AG+DV   S  +     +  ++  ++    ++ A+  +  
Sbjct: 288 LYEYHHVTSDKKGAAKLALEAGVDVELPSTDYYGLPLRELIESGEIDIDFVNEAVKRVLK 347

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
           ++  LGLF  NP +    +   ++  +   + LA + AQ+ IVLLKN + LLPL K    
Sbjct: 348 IKFELGLFE-NPYINE--EKAVEIFDTNEQRELAYKIAQESIVLLKNENNLLPLKKDLK- 403

Query: 423 SLALIGPNANSAKTLLGNYAGP-------SCRS----------------------ITPLQ 453
           S+A+IGPNA+S + ++G+YA P         R                       +T LQ
Sbjct: 404 SIAVIGPNADSIRNMIGDYAYPCHIESLLEMRETDNVFNTPLPESLEAKDIYVPIVTVLQ 463

Query: 454 ALQNYVENT---VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG-----LDQTQEKEE 505
            ++  V +    +Y  GCD +  S     +AV+IAK AD  V+++G      D     E 
Sbjct: 464 GIKAKVSSNTEVLYAKGCDVLNNSKDGFKEAVEIAKQADVAVVVVGDKSGLTDGCTSGES 523

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            DR DL LPG Q+ELI  + E    PVI+VL+ G P+ I++      I +I+ A  PGE 
Sbjct: 524 RDRADLNLPGVQEELIKAIYETG-TPVIVVLINGRPMSISWIA--EKIPAIIEAWLPGEE 580

Query: 566 GAVALAEVIFGDHNPG 581
           G  A+A+VIFGD+NPG
Sbjct: 581 GGRAVADVIFGDYNPG 596


>gi|198274480|ref|ZP_03207012.1| hypothetical protein BACPLE_00628 [Bacteroides plebeius DSM 17135]
 gi|198272682|gb|EDY96951.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           plebeius DSM 17135]
          Length = 912

 Score =  241 bits (614), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 179/607 (29%), Positives = 288/607 (47%), Gaps = 99/607 (16%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEAL-----HGVAG 99
           P+++R  DL+S++T++EK  Q+V       + +  +P  +W    W + +     H  A 
Sbjct: 24  PLNERIEDLLSQMTVEEKTCQMVTLYGYQRVLKDSLPTPDWKNQLWKDGIGAIDEHLNAF 83

Query: 100 VGKGI------------------------------------FFNGTIRG-----ATSFPQ 118
            G G+                                    F N  IRG     AT+FP 
Sbjct: 84  RGWGVPPMQNELVWPASNHAWALNEVQRFFVEETRLGIPADFTNEGIRGVENYIATNFPT 143

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
            +    +++  L  +IG   G EAR L       G T  +AP +++ RD RWGR +E  G
Sbjct: 144 QLALGHTWNRELIRQIGYITGREARLL-------GYTNVYAPILDVGRDQRWGRYEEVYG 196

Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
           E P +  +  ++  +G+Q D          +Q ++  KHF AY  +        + D ++
Sbjct: 197 ESPYLVAELGIAMGKGLQTD----------MQVASTAKHFIAYSNNKGAREGFARVDPQM 246

Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
           + +++ + +  PF   +++    G+M +YN  +G P  +    L++  R   GF GY+ S
Sbjct: 247 SWREVENIHAYPFTRVIQEAGILGVMSSYNDYDGFPIQSSYYWLTQRLRGTMGFRGYVVS 306

Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPESE 352
           D DAV  +Y     AK  ++AV   ++AG++V C +F    +     +  +++  L    
Sbjct: 307 DSDAVEYLYSKHKTAKDMKEAVRQSVEAGLNVRC-TFRSPESYVLPLRELIQEGGLSMET 365

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
           ID  + ++  V+   GLF+  P       +    V S AHQ +ALQA+++G+VLLKN++ 
Sbjct: 366 IDNRVRDILRVKFLTGLFD-TPYQTDLA-LADKEVNSEAHQQVALQASREGLVLLKNANN 423

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN---TVYYPGCD 469
           LLPL KS+   +A+ GPNA+ A   L +Y   +    T L+ ++  V+      Y  GCD
Sbjct: 424 LLPLDKSQIKRIAVCGPNADEASFALTHYGPVAVEVTTVLEGIKQQVKEGTKVTYTKGCD 483

Query: 470 TV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
            V               A     I KAVD  K +D  V+++G       E   R  L LP
Sbjct: 484 LVDANWPESEIISYPLTAEEKTEIQKAVDNVKESDVAVVVLGGGIRTCGENKSRTSLDLP 543

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           G QQ+L+  +  A  KPV+LVL+ G P+ I +A  D+ + +IL A YPG  G  A+AE +
Sbjct: 544 GHQQQLLEAIV-ATGKPVVLVLINGRPLSINWA--DKFVPAILEAWYPGSQGGTAIAEAL 600

Query: 575 FGDHNPG 581
           FGD+NPG
Sbjct: 601 FGDYNPG 607


>gi|354583533|ref|ZP_09002432.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353198174|gb|EHB63648.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 931

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 161/475 (33%), Positives = 244/475 (51%), Gaps = 36/475 (7%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVG 101
            +PF    LP+ +R  DLVSR TL+EKI  +      IPRLG+  Y+  +E  HGVA +G
Sbjct: 2   NYPFQNPDLPLEERVNDLVSRFTLEEKIELMCQYQAEIPRLGVQKYKHGTEGAHGVAWLG 61

Query: 102 KGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPN 160
           +          AT FPQ    A +++  L  +IG  I  EAR  Y   +A+ G+T WAP 
Sbjct: 62  E----------ATVFPQNTGLACTWNPELMRQIGSVIADEARVYYQRDKAVNGLTIWAPT 111

Query: 161 INIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAY 220
           +++ RDPRWGR +E  GEDP +TG+ +   V+G+QGD           +  A  KHF   
Sbjct: 112 VDMERDPRWGRTEEAYGEDPHLTGELSTGLVQGMQGDH------PFYYKTVATLKHFYGN 165

Query: 221 DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNL 280
           + +  +G      D R    +  + Y   FE   ++G A  +M AYN +NG P C   + 
Sbjct: 166 NNEVDRGNASVSIDPR----NKREYYLKAFEPAFRKGGAKSMMTAYNGINGTP-CNLNDE 220

Query: 281 LSKTARRQWGFHGYITSDC-DAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQK 337
           +++  +++WG  G++  D  D +  + D   Y  S  +AV   +KAG+D   +      +
Sbjct: 221 VNRIVKQEWGMDGFVVGDAGDVLGTVMDHH-YVDSYAEAVARSVKAGIDSITDDQEISFR 279

Query: 338 HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLAL 397
             + A++Q  L E ++D AL N F VR RLG F+      P+  +    +C+P H  L+L
Sbjct: 280 ALRDALEQGLLTEQDLDHALRNTFRVRFRLGEFDPE-DRNPYSHVPESKLCAPEHAELSL 338

Query: 398 QAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQN 457
           +AA++ +VLLKN  GLLPL  +   S A+IGP A+ A T    Y+G     ITPLQ +  
Sbjct: 339 RAARESVVLLKND-GLLPL-NNNMPSAAVIGPLADEAFTDW--YSGTPPYRITPLQGITE 394

Query: 458 YV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
            +      ++ G D V   SA+  + V +   AD   L+ G +++ E    +R D
Sbjct: 395 KMNGRKVNFHTGLDQVRLRSAASGQYVTL---ADDNRLLFGTNESSEAALFERND 446



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 49/79 (62%), Gaps = 3/79 (3%)

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
           KE +DR D+VLP  QQ+LI  V  AA    ++V++   P  + + +   ++ SI++  + 
Sbjct: 561 KETIDRPDIVLPPSQQQLIQAV-HAANPNTVVVIVGSYPFAVNWEQ--EHVPSIVFTSHA 617

Query: 563 GEAGAVALAEVIFGDHNPG 581
           G+    A+A+V++GD+NPG
Sbjct: 618 GQELGRAVADVLYGDYNPG 636


>gi|423313768|ref|ZP_17291703.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
           CL09T03C04]
 gi|392684303|gb|EIY77631.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
           CL09T03C04]
          Length = 788

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 179/601 (29%), Positives = 282/601 (46%), Gaps = 95/601 (15%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
           P+  R +DL+S++TL+EK  Q+     +  + +  +P   W    W + +  +     G+
Sbjct: 49  PLEDRVQDLLSQMTLEEKTCQMATLYGSGRVLKDALPQNNWKTEVWKDGIGNIDEEHNGL 108

Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
                                               F N  IRG     AT FP      
Sbjct: 109 GAFKSEYSFPYAKHVNAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 168

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L  RIG+    EA AL       G T  ++P ++I +DPRWGR  ET GEDP +
Sbjct: 169 ATWNKKLIARIGEVEAKEAVAL-------GYTNIYSPILDIAQDPRWGRCVETYGEDPYL 221

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            G+     +  +Q           K    A  KHF  Y +       + + D  V  +++
Sbjct: 222 VGELGKQMITSLQ-----------KYNLVATPKHFAVYSIPIGGRDGKTRTDPHVAPREM 270

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
              Y  PF    ++  A G+M +YN  +G P     + L++  R++WGF GY+ SD +AV
Sbjct: 271 RTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAV 330

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
             I +    A + ED +   + AG+++         F+    K AV   K+ +  +D+ +
Sbjct: 331 EFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFILPLRK-AVDNGKISQETLDKRV 389

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
             +  ++ RLGLF+ NP  +  GK    +V S  HQ ++L+AA+  +VLLKN   LLPL 
Sbjct: 390 AEILRIKFRLGLFD-NP-YRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNETNLLPLS 447

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN--TVYYPGCDTV---- 471
           KS   S+A+IGPNAN    L+  Y   +    T  Q ++  + +   +Y  GCD +    
Sbjct: 448 KSIR-SIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHAEVIYKKGCDIIDPHF 506

Query: 472 -----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
                      A     +++A+  AK A+ VV+++G ++   +E+  R  L LPGRQ+EL
Sbjct: 507 PESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEEL 566

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  V  A  KPVILV+L G    I +A    ++ +IL A +PGE    A+AE +FGD+NP
Sbjct: 567 LKAVC-ATGKPVILVMLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEALFGDYNP 623

Query: 581 G 581
           G
Sbjct: 624 G 624


>gi|326389315|ref|ZP_08210883.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
           ethanolicus JW 200]
 gi|325994678|gb|EGD53102.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
           ethanolicus JW 200]
          Length = 784

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 190/616 (30%), Positives = 300/616 (48%), Gaps = 103/616 (16%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSA------------------------PAIP 80
           +  +T  + +R  DL+ ++T++EK++QL NS                           I 
Sbjct: 5   YLDSTQSVEKRVEDLLQQMTIEEKVAQL-NSIWVYEILDDMKFSFDKAKRLMSYGISQIT 63

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTI----------------RGATSFPQVILTAA 124
           RLG  +     E +     + K +  N  +                +GAT FPQ I  A+
Sbjct: 64  RLGGASNLSPRETVRIANQIQKFLIENTRLGIPALIHEESCSGYMAKGATIFPQTIGVAS 123

Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
           ++++ +  ++   I  + +A+  A QA+     AP ++I RDPRWGR +ET GEDP +  
Sbjct: 124 TWNNEIVEKMASVIREQMKAV-GARQAL-----APLLDITRDPRWGRTEETFGEDPYLVM 177

Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
           +  VSY+RG+Q ++   G +       A  KHF  Y   N +G   +   A +  ++L +
Sbjct: 178 RMGVSYIRGLQTESLREGIV-------ATGKHFVGYG--NSEGGMNWA-PAHIPERELRE 227

Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
            +  PFE+ VK+ + S IM  Y+ ++G+P    + LL+   R+ WGF G + SD  A+S 
Sbjct: 228 VFLYPFEAAVKEAKLSSIMPGYHELDGVPCHKSKKLLNDILRKDWGFEGIVVSDYFAISQ 287

Query: 305 IYDAEGYAKSPEDAVVDVLKAGMDVNCGS--FLQKHTKAAVKQKKLPESEIDRALHNLFS 362
           +Y+        + A    L+AG+DV   S  +     +  ++  ++    ++ A+  +  
Sbjct: 288 LYEYHHVTSDKKGAAKLALEAGVDVELPSTDYYGLPLRELIESGEIDIDFVNEAVKRVLK 347

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
           ++  LGLF  NP +    +   ++  +   + LA + AQ+ IVLLKN + LLPL K    
Sbjct: 348 IKFELGLFE-NPYINE--EKAVEIFDTNEQRELAYKIAQESIVLLKNENNLLPLKKDLK- 403

Query: 423 SLALIGPNANSAKTLLGNYAGP-------SCRSI-----TP----LQALQNYV------- 459
           S+A+IGPNA+S + ++G+YA P         R       TP    L+A   YV       
Sbjct: 404 SIAVIGPNADSIRNMIGDYAYPCHIESLLEMRETDNVFNTPLPESLEAKDIYVPIVTVLQ 463

Query: 460 -------ENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG-----LDQTQEKEE 505
                   NT  +Y  GCD +  S     +AV+IAK AD  V+++G      D     E 
Sbjct: 464 GIKAKASSNTEVLYAKGCDVLNNSKDGFKEAVEIAKQADVAVVVVGDKSGLTDGCTSGES 523

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            DR DL LPG Q+ELI  V E    PVI+VL+ G P+ I++      I +I+ A  PGE 
Sbjct: 524 RDRADLNLPGVQEELIKAVYETG-TPVIVVLINGRPMSISWIA--EKIPAIIEAWLPGEE 580

Query: 566 GAVALAEVIFGDHNPG 581
           G  A+A+VIFGD+NPG
Sbjct: 581 GGRAVADVIFGDYNPG 596


>gi|344995394|ref|YP_004797737.1| glycoside hydrolase family protein [Caldicellulosiruptor
           lactoaceticus 6A]
 gi|343963613|gb|AEM72760.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
           lactoaceticus 6A]
          Length = 770

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 181/543 (33%), Positives = 275/543 (50%), Gaps = 84/543 (15%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIPA     E+  G              +GAT FPQ I  A +FD+ +   + + I +
Sbjct: 83  RLGIPAM-IHEESCSGFMA-----------KGATVFPQSIGVACTFDNEIVEELAKVIRI 130

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           + +A   + QA+     AP I++ RD RWGR +ET GEDP +    AVSYV+G+QGD   
Sbjct: 131 QMKAT-GSHQAL-----APLIDVARDARWGRVEETFGEDPYLVANMAVSYVKGIQGDDIK 184

Query: 201 GGKLKGKLQASACCKHFTAYDLD----NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
            G +       A  KHF  Y +     NW           +  ++L + Y  PFE  VK 
Sbjct: 185 DGIV-------ATGKHFVGYAMSEGGMNWAPV-------HIPERELREVYLYPFEVAVKV 230

Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
                IM AY+ ++GIP  A+R LL+  AR +WGF G   SD   V  I D     K+  
Sbjct: 231 AGLKSIMPAYHEIDGIPCHANRKLLTDIARGEWGFDGIYVSDYSGVRNILDYHKAVKTYA 290

Query: 317 DAVVDVLKAGMDVNCGS---FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
           +A    L AG+D+       F ++  K A+K+ K   + +D A+  +  ++ RLGLF+ N
Sbjct: 291 EAAYISLWAGLDIELPKIECFTEEFIK-ALKEGKFDMAVVDAAVKRVLEMKFRLGLFD-N 348

Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
           P ++  G +  ++  +   + L+ + AQ+ +VLLKN +  LPL K     +A+IGPNA+S
Sbjct: 349 PYIKTEGIL--ELFDNKEQRELSRKVAQESMVLLKNDN-FLPLSKDVK-KIAVIGPNADS 404

Query: 434 AKTLLGNYAGPS----------------------CRSITPL----QALQNYVEN---TVY 464
            + LLG+Y+ P+                       R +  +    +A+++ V+N    VY
Sbjct: 405 VRNLLGDYSYPAHIATLEMFFIKEDRGVGNEEEFVRKVINMKSIFEAVKDRVQNKAEVVY 464

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMG------LDQTQEKEELDRVDLVLPGRQQ 518
             GCD      +  ++A   A+GAD V+L++G      LD T   E  DR  L LPG Q+
Sbjct: 465 AKGCDVNTQDESGFEEAKKAAQGADVVILVVGDKAGLRLDCTS-GESRDRASLKLPGVQE 523

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           +LI  V++  +  +++VL+ G PV        +   +IL A +PGE GA A+A+V+FGD+
Sbjct: 524 KLIEEVSKVNEN-IVVVLVNGRPV--ALEGIWQKAKAILEAWFPGEEGAEAVADVLFGDY 580

Query: 579 NPG 581
           NPG
Sbjct: 581 NPG 583


>gi|301096878|ref|XP_002897535.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
 gi|262106995|gb|EEY65047.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
          Length = 537

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 187/582 (32%), Positives = 274/582 (47%), Gaps = 85/582 (14%)

Query: 20  LLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAI 79
           ++ R    + P +C   +   +  PFC T+L  ++R  DL++RL L EK   L  +A A 
Sbjct: 1   MIGRAVDVEIPRAC--VSDENQKLPFCNTSLSTTERVEDLLARLPLGEKAILL--TARAS 56

Query: 80  PR-----LGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRI 134
           PR     +G+P Y W +  +HGV          GT    TSFP  +   A FD      +
Sbjct: 57  PRGNMTSIGLPEYNWGANCVHGVQSTC------GT-NCPTSFPNPVNMGAIFDPEAVREM 109

Query: 135 GQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGV 194
            Q IG E  AL +                     W  G  T      + G  A+   R  
Sbjct: 110 AQVIGWE-EALTS--------------------DWTAGHPTLTSTETLAGG-AIPRHRLQ 147

Query: 195 QGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCV 254
           +G   +       LQA    KH+ AY  +N+ G  R +FDA V+  D ADTY P F S +
Sbjct: 148 EGKRED----SRYLQAVVTLKHYIAYSYENYGGGNRKEFDAIVSPYDFADTYFPAFRSSI 203

Query: 255 KQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKS 314
             G A+G+MC+YN +NG+P+CA+                   +D  A+  I D   Y  +
Sbjct: 204 VDGNATGVMCSYNSINGVPTCAN------------------NADSGAIEAISDRHHYVAT 245

Query: 315 PEDAVVDVLKAGMDVNCGSFL--QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNG 372
             +A    + AG DVN G      K     V+  +L E  +D AL +   +R  LGLF+ 
Sbjct: 246 RCEAARIAILAGTDVNSGRLFGYMKCLPELVRSNQLEEKALDDALRHTLKLRFELGLFDP 305

Query: 373 NPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN 432
               QP+  +    V +P  + LAL  A   IVLL+N++ +LPL   K   LA++GP+A 
Sbjct: 306 IDD-QPYWHVKPSDVNTPKSRQLALNLATKSIVLLQNNNSVLPL--RKGTKLAVLGPHAE 362

Query: 433 SAKTLLGNYAGPSCRS--------ITPLQALQ--NYVENTVYYPGCDTVACSSASIDKA- 481
           S + LLGNY G  C           TP++A+   N    + Y  GC+    S+   ++  
Sbjct: 363 SKRGLLGNYLGQMCHGGYSEVGCIQTPMEAVSTTNGAATSTYALGCNISGNSTDGFEETK 422

Query: 482 VDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGP 541
           V + + AD VVL +G+D + E+E  DR D+ LP  Q +L+ RV  A  KP ++VL+ GG 
Sbjct: 423 VKVTQEADAVVLFLGIDTSIEEEVNDRNDIKLPSIQMQLLQRV-RAVGKPTVVVLINGGV 481

Query: 542 V---DITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +   D+          +++ A YPG  GA A+A+V+FGD NP
Sbjct: 482 LGAEDLIL-----QTDALVEAFYPGFFGAQAMADVLFGDANP 518


>gi|160892207|ref|ZP_02073210.1| hypothetical protein BACUNI_04671 [Bacteroides uniformis ATCC 8492]
 gi|156858685|gb|EDO52116.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           uniformis ATCC 8492]
          Length = 990

 Score =  240 bits (613), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 182/610 (29%), Positives = 287/610 (47%), Gaps = 105/610 (17%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW------------------- 89
           P+  R  +L+ ++TLDEK  Q+V       + +  +P  EW                   
Sbjct: 107 PLEARIENLLQQMTLDEKTCQMVTLYGYKRVLKDDLPTPEWKELLWKDGIGAIDEHLNGF 166

Query: 90  -------------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSFPQ 118
                        W  + H           V     GI   F N  IRG     AT+FP 
Sbjct: 167 QQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVESYRATNFPT 226

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
            +    +++  L  ++G   G EAR L       G T  +AP +++ RD RWGR +E  G
Sbjct: 227 QLGLGHTWNRELIRQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYG 279

Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
           E P +  +  +  VRG+Q +           Q +A  KHF AY  +        + D ++
Sbjct: 280 ESPYLVAELGIEMVRGLQHNH----------QVAATGKHFAAYSNNKGAREGMARVDPQM 329

Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
           + +++ + +  PF+  +++    G+M +YN  +GIP       L+   R + GF GY+ S
Sbjct: 330 SPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVS 389

Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESE 352
           D DAV  +Y   G AK  ++AV   ++AG++V C      SF+    +  VK+  L E  
Sbjct: 390 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLP-LRELVKEGGLSEEV 448

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV---VCSPAHQVLALQAAQDGIVLLKN 409
           I+  + ++  V+  +GLF+      P+    AD    V    ++ +ALQA+++ IVLLKN
Sbjct: 449 INDRVRDILRVKFLIGLFDA-----PYQTDLADADREVEKEENEAIALQASRESIVLLKN 503

Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYP 466
           +  LLPL  + +  +A+ GPNAN     L +Y   +    T L+ +Q   +     +Y  
Sbjct: 504 AGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTK 563

Query: 467 GCDTVAC---------------SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
           GCD V                   A IDKAV+ A+ AD  ++++G  Q    E   R  L
Sbjct: 564 GCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSL 623

Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
            LPGRQ +L+  + +A  KPV+L+L+ G P+ I +A  D+ + +IL A YPG  G  ALA
Sbjct: 624 DLPGRQLQLLQAI-QATGKPVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALA 680

Query: 572 EVIFGDHNPG 581
           +++FGD+NPG
Sbjct: 681 DILFGDYNPG 690


>gi|337748463|ref|YP_004642625.1| beta-glucosidase [Paenibacillus mucilaginosus KNP414]
 gi|336299652|gb|AEI42755.1| Beta-glucosidase [Paenibacillus mucilaginosus KNP414]
          Length = 973

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 155/434 (35%), Positives = 227/434 (52%), Gaps = 32/434 (7%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+  R  DL+++LTLDEKI  +     AI RLGI AY+  +EA HG+A +G+        
Sbjct: 15  PLEDRVIDLIAQLTLDEKIELMPQYQIAIERLGIQAYKHGTEAAHGMAWLGE-------- 66

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNINIFRDPRW 169
             ATS+PQ I  A ++DS L  RIG AIG EAR  Y       G+T WAP +++ RDPRW
Sbjct: 67  --ATSYPQPIGLACTWDSELLKRIGSAIGDEARGFYKQNPNFNGLTLWAPTVDLERDPRW 124

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
           GR +E  GEDP++ GK A +  +G+QGD          L+A A  KHF   + +  +G T
Sbjct: 125 GRTEEAYGEDPVLAGKLAAALTQGIQGDH------PFYLKAVATLKHFIGNNNEAGRGDT 178

Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
               D R    +L + Y   FE   K+G A  +M AYN VNG+P+     L+    + +W
Sbjct: 179 SVSLDPR----NLREYYLKVFEITFKEGGAQSMMTAYNAVNGVPANLS-ELVIDIVKGEW 233

Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVV-DVLKAGMD--VNCGSFLQKHTKAAVKQK 346
           G +G++ SD   V+       Y ++ ++AV   + + G+D   +  + + +    A+K  
Sbjct: 234 GMNGFVVSDAFDVTGTMRDHHYVETLKEAVARSIREGGIDSITDDAAVVTEAIHEALKDG 293

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            L E+++D AL N F VR RLG F+  P   P+  I    +  P H  LA +A+Q  +VL
Sbjct: 294 LLTENDLDTALRNTFRVRFRLGEFD-PPERNPYATIDESAILHPEHAKLAREASQKAVVL 352

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYP 466
           LKN    LPL   K   +A+IGP A+        Y+G    ++TPLQ ++  +       
Sbjct: 353 LKNDEKTLPLQADKLSKVAVIGPLADVVYQDW--YSGSLPYAVTPLQGIRERLAA----E 406

Query: 467 GCDTVACSSASIDK 480
           G D V   +A  D+
Sbjct: 407 GKDAVTSYAAGTDR 420



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 54/111 (48%), Gaps = 4/111 (3%)

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQ-TQEKEELDRVDLVLPGRQQELITRVAEAAK 529
           + C S+ +D AV  A+GAD  ++ +G       KE +DR D+ L   Q++L   V E   
Sbjct: 566 IECVSSGLDAAVASARGADAAIVFVGNHPLINGKETMDRPDITLAESQEKLALAVIEKNP 625

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             +I+V+   G              +I++  + G+    A+A V+FGD NP
Sbjct: 626 NTIIVVV---GSYPFALNALQEKAKAIIYTSHAGQELGRAVAAVLFGDVNP 673


>gi|423303577|ref|ZP_17281576.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
           CL03T00C23]
 gi|423307700|ref|ZP_17285690.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
           CL03T12C37]
 gi|392687941|gb|EIY81232.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
           CL03T00C23]
 gi|392689569|gb|EIY82846.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
           CL03T12C37]
          Length = 942

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 183/608 (30%), Positives = 287/608 (47%), Gaps = 101/608 (16%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW------------------- 89
           P+  R  +L+ ++TLDEK  Q+V       + +  +P  EW                   
Sbjct: 59  PLEARIENLLQQMTLDEKTCQMVTLYGYKRVLKDDLPTPEWKELLWKDGIGAIDEHLNGF 118

Query: 90  -------------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSFPQ 118
                        W  + H           V     GI   F N  IRG     AT+FP 
Sbjct: 119 QQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVESYRATNFPT 178

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
            +    +++  L  ++G   G EAR L       G T  +AP +++ RD RWGR +E  G
Sbjct: 179 QLGLGHTWNRELIRQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYG 231

Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
           E P +  +  +  VRG+Q +           Q +A  KHF AY  +        + D ++
Sbjct: 232 ESPYLVAELGIEMVRGLQHNH----------QVAATGKHFAAYSNNKGAREGMARVDPQM 281

Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
           + +++ + +  PF+  +++    G+M +YN  +GIP       L+   R + GF GY+ S
Sbjct: 282 SPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVS 341

Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESE 352
           D DAV  +Y   G AK  ++AV   ++AG++V C      SF+    +  VK+  L E  
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLP-LRELVKEGGLSEEV 400

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKNSH 411
           I+  + ++  V+  +GLF+  P        GAD  V    ++ +ALQA+ + +VLLKN+ 
Sbjct: 401 INDRVRDILRVKFLIGLFDA-PYQTDLA--GADREVEKEENEAIALQASHESVVLLKNAD 457

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN---TVYYPGC 468
            LLPL  + +  +A+ GPNAN     L +Y   +    T L+ +Q   ++    +Y  GC
Sbjct: 458 ELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKSKAEVLYTKGC 517

Query: 469 DTVAC---------------SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           D V                   A IDKAV+ A+ AD  V+++G  Q    E   R  L L
Sbjct: 518 DLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRTSLDL 577

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PGRQ +L+  + +A  KPV+L+L+ G P+ I +A  D+ + +IL A YPG  G  ALA++
Sbjct: 578 PGRQLQLLQAI-QATGKPVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADI 634

Query: 574 IFGDHNPG 581
           +FGD+NPG
Sbjct: 635 LFGDYNPG 642


>gi|371940122|dbj|BAL45498.1| glycoside hydrolase [Bacillus licheniformis]
          Length = 980

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 166/515 (32%), Positives = 256/515 (49%), Gaps = 37/515 (7%)

Query: 6   LSLVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTL 65
           L  V  ++     ++L+   +  PP +   +      +PF   +LP+ +R  DL+SRLTL
Sbjct: 2   LKTVRTIMLAAMCTILSAA-AVNPPPTAAKTQADEYEYPFQDPSLPVQKRVSDLLSRLTL 60

Query: 66  DEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAAS 125
           +EK+S +    PAIPRLGIPA++  +EALHGVA +G+          AT FPQ +  A +
Sbjct: 61  EEKVSLMHQYQPAIPRLGIPAFKTGTEALHGVAWLGE----------ATVFPQAVGLAHT 110

Query: 126 FDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
           +D  L  +IG A+G E R  ++   A  G+  WAP +++ RDPRWGR +E   EDP +TG
Sbjct: 111 WDRSLIKQIGSAVGDEVRGFHHLDPAANGVNVWAPVVDLLRDPRWGRNEEGYSEDPFLTG 170

Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
           + A +Y  G++G+          L+     KHF AY+ +  +G +    D R    ++ +
Sbjct: 171 EIATAYASGLRGEH------PFYLKTVPTLKHFLAYNNETDRGFSSSSIDPR----NMNE 220

Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW-GFHGYITSDCDAVS 303
            Y  PF++ +    A  +M AYN VN  P+     LL+ T + +W G   +I SD    S
Sbjct: 221 YYLKPFKTAISAKAAYSLMPAYNSVNDKPAILSP-LLNSTVKGKWAGDDFFIVSDAFDPS 279

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMD------VNCGSFLQKHTKAAVKQKKLPESEIDRAL 357
            I     Y  S E A    +KAG+D       N G   +     A+K   + E E+D+AL
Sbjct: 280 GIVTDHKYYDSHEKAHAHAVKAGIDNFTDQGENPG-LTKTALTGALKNGLMSEKELDQAL 338

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
            N FS+R R G F+ +  + P+ ++   V+ SP H+ LA +AA+  IVLLKN   LLP  
Sbjct: 339 ANTFSIRFRTGEFDPD-ELNPYSRLTDGVINSPKHRQLAKKAAEKAIVLLKNDRNLLPFN 397

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCDTVACSS 475
             K+ ++A+IGP  N+       Y+G      TPL  + + +  +   +  G +     S
Sbjct: 398 TRKNENIAVIGPFGNALYEDW--YSGTMPYRKTPLDGIADKIGKDRVSFAEGLEQSGFKS 455

Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVD 510
           A   K V   +      L    D+ ++    D  D
Sbjct: 456 ALTGKFVTAGRDGKQ-PLTASADKLEKSAAFDAAD 489



 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 64/111 (57%), Gaps = 4/111 (3%)

Query: 472 ACSSASIDKAVDIAKGADHVVLMMGLD-QTQEKEELDRVDLVLPGRQQELITRVAEAAKK 530
           A   +S ++AV +AK AD  V+ +G +     +E  DR D+ LP  Q+ L+  V++A   
Sbjct: 576 AIIKSSTEEAVRLAKTADKAVVFVGNNPYINGRETEDRKDITLPPAQENLLKAVSKANPN 635

Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             +LV+    P  + +AK   +I +I+++ + G+    ALA+++FGD NPG
Sbjct: 636 -TVLVVTSSYPFALNWAKV--HIPAIIYSAHGGQEAGSALADILFGDENPG 683


>gi|392305174|emb|CCI71537.1| beta-glucosidase [Paenibacillus polymyxa M1]
          Length = 973

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 151/444 (34%), Positives = 236/444 (53%), Gaps = 33/444 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           + F    + + +R RDLVSRLTL+EKI  ++   PAI RLG+ AY+  +EA HG+A +G+
Sbjct: 10  YLFQDPKIHLDERVRDLVSRLTLEEKIESMLQYQPAIDRLGVAAYKHGTEAAHGIAWLGE 69

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNI 161
                     ATSFPQ +  A ++D+ L  +IG  I  EAR  Y     + G+T WAP +
Sbjct: 70  ----------ATSFPQPVGLACTWDTELMRQIGSVIADEARVYYRRDPKLNGLTLWAPTV 119

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           ++ RDPRWGR +E  GEDP +TG+ +   V+G+QGD          L+A A  KHF   +
Sbjct: 120 DMERDPRWGRNEEAYGEDPYLTGELSKELVKGIQGDH------PVYLKAVATLKHFLGNN 173

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
            +  +G      D R    +L + Y   FE    +G A  +M +YN +NG P+      +
Sbjct: 174 NEVDRGVGSSSIDPR----NLREYYLRAFEKPFTEGGAQSMMTSYNLINGTPATLYHG-V 228

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQKHT 339
           +   R +WG  G++ SD   V  I +   Y  S    V + ++AG+D   +     ++  
Sbjct: 229 NDIVRGEWGMDGFVVSDAGDVMGIVNDHQYYDSHTPGVAESIRAGIDSITDDAELSKQAI 288

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
           + A++Q  L E ++DRAL + F VR RLG F+      P+  IG + + +   + L+L+A
Sbjct: 289 REALEQGLLQEEDLDRALFHTFRVRFRLGEFD-PAADNPYASIGEEALMTEQARELSLRA 347

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A++ IVLLKN   LLPL  ++   +A+IG + N  +     Y+G     +TPL+A++  +
Sbjct: 348 AREQIVLLKNEQELLPLNPAQCGKVAVIGSHGN--EVFHDWYSGTLPYRVTPLEAIRAKL 405

Query: 460 EN------TVYYPGCDTVACSSAS 477
           E+       VY    D V  ++A+
Sbjct: 406 ESGSAEERVVYSDAKDRVTLTAAT 429



 Score = 39.7 bits (91), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 42/79 (53%), Gaps = 5/79 (6%)

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG-PVDITFAKYDRNIGSILWAGY 561
           KEE DR  L L   QQ +I  V      P  +V+L G  P  I + +   +I +I++  +
Sbjct: 580 KEENDRPGLELAASQQRIIEEVYRV--NPNTIVVLTGSYPFAIPWLQ--EHIPAIVYTSH 635

Query: 562 PGEAGAVALAEVIFGDHNP 580
            G+    A+A+V+FGD+ P
Sbjct: 636 AGQEYGTAIADVLFGDYAP 654


>gi|317480750|ref|ZP_07939836.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
 gi|316903091|gb|EFV24959.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
          Length = 942

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 182/610 (29%), Positives = 287/610 (47%), Gaps = 105/610 (17%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW------------------- 89
           P+  R  +L+ ++TLDEK  Q+V       + +  +P  EW                   
Sbjct: 59  PLEARIENLLQQMTLDEKTCQMVTLYGYKRVLKDDLPTPEWKELLWKDGIGAIDEHLNGF 118

Query: 90  -------------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSFPQ 118
                        W  + H           V     GI   F N  IRG     AT+FP 
Sbjct: 119 QQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVESYRATNFPT 178

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
            +    +++  L  ++G   G EAR L       G T  +AP +++ RD RWGR +E  G
Sbjct: 179 QLGLGHTWNRELIRQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYG 231

Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
           E P +  +  +  VRG+Q +           Q +A  KHF AY  +        + D ++
Sbjct: 232 ESPYLVAELGIEMVRGLQHNH----------QVAATGKHFAAYSNNKGAREGMARVDPQM 281

Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
           + +++ + +  PF+  +++    G+M +YN  +GIP       L+   R + GF GY+ S
Sbjct: 282 SPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVS 341

Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESE 352
           D DAV  +Y   G AK  ++AV   ++AG++V C      SF+    +  VK+  L E  
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLP-LRELVKEGGLSEEV 400

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADV---VCSPAHQVLALQAAQDGIVLLKN 409
           I+  + ++  V+  +GLF+      P+    AD    V    ++ +ALQA+++ IVLLKN
Sbjct: 401 INDRVRDILRVKFLIGLFDA-----PYQTDLADADREVEKEENEAIALQASRESIVLLKN 455

Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYP 466
           +  LLPL  + +  +A+ GPNAN     L +Y   +    T L+ +Q   +     +Y  
Sbjct: 456 AGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTK 515

Query: 467 GCDTVAC---------------SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
           GCD V                   A IDKAV+ A+ AD  ++++G  Q    E   R  L
Sbjct: 516 GCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSL 575

Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
            LPGRQ +L+  + +A  KPV+L+L+ G P+ I +A  D+ + +IL A YPG  G  ALA
Sbjct: 576 DLPGRQLQLLQAI-QATGKPVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALA 632

Query: 572 EVIFGDHNPG 581
           +++FGD+NPG
Sbjct: 633 DILFGDYNPG 642


>gi|310644496|ref|YP_003949255.1| glycoside hydrolase family protein [Paenibacillus polymyxa SC2]
 gi|309249447|gb|ADO59014.1| Glycoside hydrolase family 3 domain protein [Paenibacillus polymyxa
           SC2]
          Length = 976

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 151/456 (33%), Positives = 238/456 (52%), Gaps = 33/456 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           + F    + + +R RDLVSRLTL+EKI  ++   PAI RLG+ AY+  +EA HG+A +G+
Sbjct: 13  YLFQDPKIHLDERVRDLVSRLTLEEKIESMLQYQPAIDRLGVAAYKHGTEAAHGIAWLGE 72

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNI 161
                     ATSFPQ +  A ++D+ L  +IG  I  EAR  Y     + G+T WAP +
Sbjct: 73  ----------ATSFPQPVGLACTWDTELMRQIGSVIADEARVYYRRDPKLNGLTLWAPTV 122

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           ++ RDPRWGR +E  GEDP +TG+ +   V+G+QGD          L+A A  KHF   +
Sbjct: 123 DMERDPRWGRNEEAYGEDPYLTGELSKELVKGIQGDH------PVYLKAVATLKHFLGNN 176

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
            +  +G      D R    +L + Y   FE    +G A  +M +YN +NG P+      +
Sbjct: 177 NEVDRGVGSSSIDPR----NLREYYLRAFEKPFTEGGAQSMMTSYNLINGTPATLYHG-V 231

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQKHT 339
           +   R +WG  G++ SD   V  I +   Y  S    V + ++AG+D   +     ++  
Sbjct: 232 NDIVRGEWGMDGFVVSDAGDVMGIVNDHQYYDSHTPGVAESIRAGIDSITDDAELSKQAI 291

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
           + A++Q  L E ++DRAL + F VR RLG F+      P+  IG + + +   + L+L+A
Sbjct: 292 REALEQGSLQEEDLDRALFHTFRVRFRLGEFD-PAADNPYASIGEEALMTEQARELSLRA 350

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A++ IVLLKN    LPL  ++   +A+IG + N  +     Y+G     +TPL+A++  +
Sbjct: 351 AREQIVLLKNEQEFLPLNPAQCGKVAVIGSHGN--EVFRDWYSGTLPYRVTPLEAIRAKL 408

Query: 460 EN------TVYYPGCDTVACSSASIDKAVDIAKGAD 489
           E+       VY    D V  ++A+    + +    D
Sbjct: 409 ESGGAEERVVYSDAKDRVTLTAATDGVKITVNSSGD 444



 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 42/79 (53%), Gaps = 5/79 (6%)

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG-PVDITFAKYDRNIGSILWAGY 561
           KEE DR  L L   QQ LI  V      P  +V+L G  P  I + +   +I +I++  +
Sbjct: 583 KEENDRPGLELAASQQRLIEEVYRV--NPNTIVVLTGSYPFAIPWLQ--EHIPAIIYTSH 638

Query: 562 PGEAGAVALAEVIFGDHNP 580
            G+    A+A+V+FGD+ P
Sbjct: 639 AGQEHGTAIADVLFGDYAP 657


>gi|386724230|ref|YP_006190556.1| beta-glucosidase [Paenibacillus mucilaginosus K02]
 gi|384091355|gb|AFH62791.1| beta-glucosidase [Paenibacillus mucilaginosus K02]
          Length = 973

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 155/434 (35%), Positives = 227/434 (52%), Gaps = 32/434 (7%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTI 110
           P+  R  DL+++LTLDEKI  +     AI RLGI AY+  +EA HG+A +G+        
Sbjct: 15  PLEDRVNDLIAQLTLDEKIELMPQYQIAIERLGIQAYKHGTEAAHGMAWLGE-------- 66

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNINIFRDPRW 169
             ATS+PQ I  A ++DS L  RIG AIG EAR  Y       G+T WAP +++ RDPRW
Sbjct: 67  --ATSYPQPIGLACTWDSELLKRIGSAIGDEARGFYKQNPNFNGLTLWAPTVDLERDPRW 124

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
           GR +E  GEDP++ GK A +  +G+QGD          L+A A  KHF   + +  +G T
Sbjct: 125 GRTEEAYGEDPVLAGKLAAALTQGIQGDH------PFYLKAVATLKHFIGNNNEAGRGDT 178

Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
               D R    +L + Y   FE   K+G A  +M AYN VNG+P+     L+    + +W
Sbjct: 179 SVSLDPR----NLREYYLKVFEITFKEGGAQSMMTAYNAVNGVPANLS-ELVIDIVKGEW 233

Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVV-DVLKAGMD--VNCGSFLQKHTKAAVKQK 346
           G +G++ SD   V+       Y ++ ++AV   + + G+D   +  + + +    A+K  
Sbjct: 234 GMNGFVVSDAFDVTGTMRDHHYVETLKEAVARSIREGGIDSITDDAAVVTEAIHEALKDG 293

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            L E+++D AL N F VR RLG F+  P   P+  I    +  P H  LA +A+Q  +VL
Sbjct: 294 LLTENDLDTALRNTFRVRFRLGEFD-PPERNPYATIDESAILHPEHAKLAREASQKAVVL 352

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYP 466
           LKN    LPL   K   +A+IGP A+        Y+G    ++TPLQ ++  +       
Sbjct: 353 LKNDGKTLPLQADKLSKVAVIGPLADVVYQDW--YSGSLPYAVTPLQGIRERLAA----E 406

Query: 467 GCDTVACSSASIDK 480
           G D V   +A  D+
Sbjct: 407 GKDAVTSYAAGTDR 420



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/111 (29%), Positives = 54/111 (48%), Gaps = 4/111 (3%)

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQ-TQEKEELDRVDLVLPGRQQELITRVAEAAK 529
           + C S+ +D AV  A+GAD  ++ +G       KE +DR D+ L   Q++L   V E   
Sbjct: 566 IECVSSGLDAAVASARGADAAIVFVGNHPLINGKETMDRPDITLAESQEKLALAVIEKNP 625

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             +I+V+   G              +I++  + G+    A+A V+FGD NP
Sbjct: 626 NTIIVVV---GSYPFALNALQEKAKAIIYTSHAGQELGRAVAAVLFGDVNP 673


>gi|312621303|ref|YP_004022916.1| glycoside hydrolase family 3 domain-containing protein
           [Caldicellulosiruptor kronotskyensis 2002]
 gi|312201770|gb|ADQ45097.1| glycoside hydrolase family 3 domain protein [Caldicellulosiruptor
           kronotskyensis 2002]
          Length = 770

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 182/543 (33%), Positives = 275/543 (50%), Gaps = 84/543 (15%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIPA     E+  G              +GAT FPQ I  A +FD+ +   + + I  
Sbjct: 83  RLGIPAM-IHEESCSGFMA-----------KGATVFPQSIGVACTFDNEIVEELAKVIKT 130

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           + +A+  A QA+     AP I++ RD RWGR +ET GEDP +    AVSYV+G+QGD   
Sbjct: 131 QMKAV-GAHQAL-----APLIDVARDARWGRVEETFGEDPYLVANMAVSYVKGIQGDDIK 184

Query: 201 GGKLKGKLQASACCKHFTAYDLD----NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
            G +       A  KHF  Y +     NW           +  ++L + Y  PFE  VK 
Sbjct: 185 DGIV-------ATGKHFVGYAMSEGGMNWAPV-------HIPERELREVYLYPFEVAVKV 230

Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
                IM AY+ ++GIP  A+R LL+  AR +WGF G   SD   V  I D     K+  
Sbjct: 231 AGLKSIMPAYHEIDGIPCHANRKLLTDIARGEWGFDGIFVSDYAGVRNILDYHKAVKTYA 290

Query: 317 DAVVDVLKAGMDVNCGS---FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
           +A    L AG+D+       F ++  K A+K+ K   + +D A+  +  ++ RLGLF+ N
Sbjct: 291 EAAYISLWAGLDIELPKIECFTEEFIK-ALKEGKFDMAVVDAAVKRVLEMKFRLGLFD-N 348

Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
           P ++  G +  ++  +   + L+ + AQ+ +VLLKN +  LPL       +A+IGPNA+S
Sbjct: 349 PYIKTEGIL--ELFDNKEQRELSRKVAQESMVLLKNDN-FLPLSNDVK-KIAVIGPNADS 404

Query: 434 AKTLLGNYAGPS----------------------CRSI----TPLQALQNYVEN---TVY 464
            + LLG+Y+ P+                       R +    + L+A+++ V+N    VY
Sbjct: 405 VRNLLGDYSYPAHIATLEMFFIKEDKGVGNEEEFVRKVINIKSILEAIKDRVQNKAEVVY 464

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMG------LDQTQEKEELDRVDLVLPGRQQ 518
             GCD      +  ++A   A+GAD V+L++G      LD T   E  DR  L LPG Q+
Sbjct: 465 AKGCDVNNQDESGFEEAKKAAQGADVVILVVGDKAGLRLDCTS-GESRDRASLKLPGVQE 523

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           +LI  V++  +  +++VL+ G PV        +   +IL A +PGE GA A+A+V+FGD+
Sbjct: 524 KLIEEVSKVNEN-IVVVLVNGRPV--ALEGIWQKAKAILEAWFPGEEGAEAVADVLFGDY 580

Query: 579 NPG 581
           NPG
Sbjct: 581 NPG 583


>gi|222635353|gb|EEE65485.1| hypothetical protein OsJ_20901 [Oryza sativa Japonica Group]
          Length = 481

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 114/193 (59%), Positives = 144/193 (74%), Gaps = 5/193 (2%)

Query: 145 LYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
           +YN GQA G+TFW+PN+NIFRDPRWGRGQETPGEDP V  +YA +YVRG+Q    + G+L
Sbjct: 1   MYNGGQA-GLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQQQPSSGRL 59

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
           K     +ACCKHFTAYDLDNW GT R+ F+A VT QDL DT+  PF SCV  GRA+ +MC
Sbjct: 60  K----LAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSCVVDGRAASVMC 115

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLK 324
           +YN+VNG+P+CAD   L  T RR+WG  GYI SDCD+V + Y  + Y ++ EDAV   L+
Sbjct: 116 SYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVAATLR 175

Query: 325 AGMDVNCGSFLQK 337
           AG+D++CG FL +
Sbjct: 176 AGLDLDCGPFLAQ 188



 Score = 99.8 bits (247), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 46/68 (67%), Positives = 55/68 (80%)

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PGRQ ELI+ VA+A+K PVILVL+ GGP+DI FA+ D  I  ILWAGYPG+AG  A+A+V
Sbjct: 220 PGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADV 279

Query: 574 IFGDHNPG 581
           IFG HNPG
Sbjct: 280 IFGHHNPG 287


>gi|404405497|ref|ZP_10997081.1| glycoside hydrolase family protein [Alistipes sp. JC136]
          Length = 804

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 162/527 (30%), Positives = 254/527 (48%), Gaps = 65/527 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP  E+ +E +HG+               AT  P  I   ++++  L +R G+  G 
Sbjct: 145 RLGIPV-EFTNEGIHGL-----------NHSRATPLPAPIAIGSTWNRALVHRAGEIAGH 192

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EAR L           +AP +++ RDPRWGR  E  GEDP +  +  V  VRG+Q     
Sbjct: 193 EARVLGYK------NVYAPILDVARDPRWGRVVECYGEDPFLIAELGVEMVRGIQSQG-- 244

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
                     ++  KH+ AY +         + D  +  ++L   Y  PF   +++    
Sbjct: 245 ---------VASTLKHYAAYSVPKGGRDGNCRTDPHIAPRELHQMYLYPFRRVIRESGPM 295

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
           G+M +YN  +G+P  A R  L+   R ++GF GY+ SD +AV  ++     A++ EDAV 
Sbjct: 296 GVMSSYNDWDGVPVTASRYFLTDLLRHEYGFDGYVVSDSEAVEYVHTKHAVAETYEDAVR 355

Query: 321 DVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
            VL+AG++V         F+    +  V++ +L    +D+ +  +  V+ RLGLF+ NP 
Sbjct: 356 QVLEAGLNVRTNFSPPARFILP-VRKLVREGRLSMEVVDQRVREVLRVKFRLGLFD-NPY 413

Query: 376 MQP---FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN 432
             P     + GAD      H+   L   +  +VLLKN    LPL K K+  + + GP A+
Sbjct: 414 NDPREAVAEAGAD-----KHRDFVLDIQRQSLVLLKNEDKTLPLDKKKTARVLVAGPLAD 468

Query: 433 SAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPGCDTV---------------ACS 474
               ++  Y      ++T L  ++NY+ +     Y  GCD V               A  
Sbjct: 469 EDNFMISRYGPNDLPTVTVLDGIRNYLGDGAEVRYAKGCDVVDAGFPDSELTATPLTAAE 528

Query: 475 SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVIL 534
            A I++AV  A G D +V ++G D  +  E   R  L LPGRQQ+L+  +  A   PV+L
Sbjct: 529 RAGINEAVKQAAGCDVIVAVLGEDDERVGESHSRTSLELPGRQQQLLEAL-HATGVPVVL 587

Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           VL+ G P+ + +A   +N+ +IL   +P   G  A+AE +FGD+NPG
Sbjct: 588 VLINGQPLTVNWAA--QNVPAILEGWFPSVEGGTAIAETLFGDYNPG 632


>gi|1749831|emb|CAA91219.1| beta-xylo-glucosidase [Thermoanaerobacter brockii]
          Length = 730

 Score =  239 bits (611), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 167/510 (32%), Positives = 265/510 (51%), Gaps = 62/510 (12%)

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWG 170
           +GAT FPQ I  A+++++ +  ++   I  + +A+  A QA+     AP ++I RDPRWG
Sbjct: 56  KGATIFPQTIGVASTWNNEIVEKMASVIREQMKAV-GARQAL-----APLLDITRDPRWG 109

Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
           R +ET GEDP +  +  VSY+RG+Q ++   G +       A  KHF  Y   N +G   
Sbjct: 110 RTEETFGEDPYLVMRMGVSYIRGLQTESLKEGIV-------ATGKHFVGYG--NSEGGMN 160

Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           +   A +  ++L + +  PFE+ VK+ + S IM  Y+ ++G+P    + LL+   R+ WG
Sbjct: 161 WA-PAHIPERELREVFLYPFEAAVKEAKLSSIMPGYHELDGVPCHKSKKLLNDILRKDWG 219

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS--FLQKHTKAAVKQKKL 348
           F G + SD  A+S +Y+        + A    L+AG+DV   S  +     +  ++  ++
Sbjct: 220 FEGIVVSDYFAISQLYEYHHVTSDKKGAAKLALEAGVDVELPSTDYYGLPLRELIESGEI 279

Query: 349 PESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
               ++ A+  +  ++  LGLF  NP +    +   ++  +   + LA + AQ+ IVLLK
Sbjct: 280 DIDFVNEAVKRVLKIKFELGLFE-NPYINE--EKAVEIFDTNEQRELAYKIAQESIVLLK 336

Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGP-------SCRS------------- 448
           N + LLPL K    S+A+IGPNA+S + ++G+YA P         R              
Sbjct: 337 NENNLLPLKKDLK-SIAVIGPNADSIRNMIGDYAYPCHIESLLEMRETDNVFNTPLPESL 395

Query: 449 ---------ITPLQALQNYVENT---VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
                    +T LQ ++  V +    +Y  GCD +  S     +AV+IAK AD  V+++G
Sbjct: 396 EAKDIYVPIVTVLQGIKAKVSSNTEVLYAKGCDVLNNSKDGFKEAVEIAKQADVAVVVVG 455

Query: 497 -----LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
                 D     E  DR DL LPG Q+ELI  + E    PVI+VL+ G P+ I++     
Sbjct: 456 DKSGLTDGCTSGESRDRADLNLPGVQEELIKAIYETG-TPVIVVLINGRPMSISWIA--E 512

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            I +I+ A  PGE G  A+A+VIFGD+NPG
Sbjct: 513 KIPAIIEAWLPGEEGGRAVADVIFGDYNPG 542


>gi|7259476|gb|AAF43783.1|AF135015_2 xylosidase/arabinosidase [Thermoanaerobacter ethanolicus JW 200]
          Length = 784

 Score =  239 bits (610), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 190/616 (30%), Positives = 301/616 (48%), Gaps = 103/616 (16%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSA------------------------PAIP 80
           +  +T  + +R  DL+ ++T++EK++QL NS                           I 
Sbjct: 5   YLDSTQSVEKRVEDLLQQMTIEEKVAQL-NSIWVYEILDDMKFSFDKAKRLMSYGISQIT 63

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTI----------------RGATSFPQVILTAA 124
           RLG  +     E +     + K +  N  +                +GAT FPQ I  A+
Sbjct: 64  RLGGASNLSPRETVRIANQIQKFLIENTRLGIPALIHEESCSGYMPKGATIFPQTIGVAS 123

Query: 125 SFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTG 184
           ++++ +  ++   I  + +A+  A QA+     AP ++I RDPRWGR +ET GEDP +  
Sbjct: 124 TWNNEIVEKMASVIREQMKAV-GARQAL-----APLLDITRDPRWGRTEETFGEDPYLVM 177

Query: 185 KYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLAD 244
           +  VSY+RG+Q ++   G +       A  KHF  Y   N +G   +   A +  ++L +
Sbjct: 178 RMGVSYIRGLQTESLREGIV-------ATGKHFVGYG--NSEGGMNWA-PAHIPERELRE 227

Query: 245 TYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSI 304
            +  PFE+ VK+ + S IM  Y+ ++G+P    + LL+   R+ WGF G + SD  A+S 
Sbjct: 228 VFLYPFEAAVKEPKLSSIMPGYHELDGVPCHKSKKLLNDILRKDWGFEGIVVSDYFAISQ 287

Query: 305 IYDAEGYAKSPEDAVVDVLKAGMDVNCGS--FLQKHTKAAVKQKKLPESEIDRALHNLFS 362
           +Y+        + A    L+AG+DV   S  +     +  ++  ++    ++ A+  +  
Sbjct: 288 LYEYHHVTSDKKGAAKLALEAGVDVELPSTDYYGLPLRELIESGEIDIDFVNEAVKRVLK 347

Query: 363 VRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSV 422
           ++  LGLF  NP +    +   ++  +   + LA + AQ+ IVLLKN + LLPL K    
Sbjct: 348 IKFELGLFE-NPYINE--EKAVEIFDTNEQRELAYKIAQESIVLLKNENNLLPLKKDLK- 403

Query: 423 SLALIGPNANSAKTLLGNYAGP-SCRSI-----------TP----LQALQNYV------- 459
           S+A+IGPNA+S + ++G+YA P    S+           TP    L+A   YV       
Sbjct: 404 SIAVIGPNADSIRNMIGDYAYPCHIESLLEMRETDNVFNTPLPESLEAKDIYVPIVTVLQ 463

Query: 460 -------ENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG-----LDQTQEKEE 505
                   NT  +Y  GCD +  S     +AV+IAK AD  V+++G      D     E 
Sbjct: 464 GIKAKASSNTEVLYAKGCDVLNNSKDGFKEAVEIAKQADVAVVVVGDKSGLTDGCTSGES 523

Query: 506 LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEA 565
            DR DL LPG Q+ELI  V E    PVI+VL+ G P+ I++      I +I+ A  PGE 
Sbjct: 524 RDRADLNLPGVQEELIKAVYETG-TPVIVVLINGRPMSISWIA--EKIPAIIEAWLPGEE 580

Query: 566 GAVALAEVIFGDHNPG 581
           G  A+A+VIFGD+NPG
Sbjct: 581 GGRAVADVIFGDYNPG 596


>gi|270296173|ref|ZP_06202373.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270273577|gb|EFA19439.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 942

 Score =  239 bits (610), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 183/608 (30%), Positives = 287/608 (47%), Gaps = 101/608 (16%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW------------------- 89
           P+  R  +L+ ++TLDEK  Q+V       + +  +P  EW                   
Sbjct: 59  PLEARIENLLQQMTLDEKTCQVVTLYGYKRVLKDDLPTPEWKELLWKDGIGAIDEHLNGF 118

Query: 90  -------------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSFPQ 118
                        W  + H           V     GI   F N  IRG     AT+FP 
Sbjct: 119 QQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVESYRATNFPT 178

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
            +    +++  L  ++G   G EAR L       G T  +AP +++ RD RWGR +E  G
Sbjct: 179 QLGLGHTWNRELIRQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYG 231

Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
           E P +  +  +  VRG+Q +           Q +A  KHF AY  +        + D ++
Sbjct: 232 ESPYLVAELGIEMVRGLQHNH----------QVAATGKHFAAYSNNKGAREGMARVDPQM 281

Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
           + +++ + +  PF+  +++    G+M +YN  +GIP       L+   R + GF GY+ S
Sbjct: 282 SPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRGEMGFRGYVVS 341

Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESE 352
           D DAV  +Y   G AK  ++AV   ++AG++V C      SF+    +  VK+  L E  
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLP-LRELVKEGGLSEEV 400

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKNSH 411
           I+  + ++  V+  +GLF+  P        GAD  V    ++ +ALQA+++ IVLLKN+ 
Sbjct: 401 INDRVRDILRVKFLIGLFDA-PYQTDLA--GADREVEKEENEAIALQASRESIVLLKNAG 457

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGC 468
            LLPL  + +  +A+ GPNAN     L +Y   +    T L+ +Q   +     +Y  GC
Sbjct: 458 ELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGC 517

Query: 469 DTVAC---------------SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           D V                   A IDKAV+ A+ AD  ++++G  Q    E   R  L L
Sbjct: 518 DLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDL 577

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PGRQ +L+  + +A  KPV+L+L+ G P+ I +A  D+ + +IL A YPG  G  ALA++
Sbjct: 578 PGRQLQLLQAI-QATGKPVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADI 634

Query: 574 IFGDHNPG 581
           +FGD+NPG
Sbjct: 635 LFGDYNPG 642


>gi|383778895|ref|YP_005463461.1| putative glycosyl hydrolase [Actinoplanes missouriensis 431]
 gi|381372127|dbj|BAL88945.1| putative glycosyl hydrolase [Actinoplanes missouriensis 431]
          Length = 969

 Score =  239 bits (609), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 162/473 (34%), Positives = 238/473 (50%), Gaps = 40/473 (8%)

Query: 27  TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPA 86
           + P  + DP       +PF    LP+  R  DLV RLTLDEKIS L    PAIPRLG+  
Sbjct: 20  STPAHAADPG------YPFRDPALPLETRVDDLVGRLTLDEKISLLHQYQPAIPRLGLGV 73

Query: 87  YEWWSEALHGVAGVGKGIFFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
           ++  +EALHGVA           I   AT FPQ +  A+++D  L  R+G A+G E R  
Sbjct: 74  FKSGTEALHGVAWSNSYTRNGAKIDATATVFPQAVGLASTWDQELLTRVGAAVGDELRGF 133

Query: 146 YNAGQAI-GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKL 204
           +     + G+  WAP +N+ RDPRWGR +E   EDPL++G  A +Y +G+QG   +    
Sbjct: 134 HAQDPVVWGLNTWAPVVNLLRDPRWGRNEEGYSEDPLLSGAVATAYGKGLQGPDPD---- 189

Query: 205 KGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMC 264
              L+ +   KH+ AY+ +  +  T      RV    L +  + PF++ ++   A+G+M 
Sbjct: 190 --HLKTAPTLKHYAAYNNETKRDVTSSNVPQRV----LNEYDRAPFKAALENDAATGVMA 243

Query: 265 AYNRVNGIPSCADRNLLSKTARRQWGFHG-YITSDCDAVSIIYDAEGYAKSPEDAVVDVL 323
           +YN +NG P+  DR+L  +T  R W     Y  SD  A + +  ++ Y  +  +A   +L
Sbjct: 244 SYNLINGRPATVDRDL--ETIVRDWSDRRLYNVSDAWAPTNLTGSQAYYATQAEANAAIL 301

Query: 324 KAGMDVNCGSFLQKHTKA---------AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
           KAG+D    SF    T A         A++Q+ L E+EID ++ N  S+R RLG F  +P
Sbjct: 302 KAGLD----SFTVNDTNAAPTVTAINEALQQRLLTEAEIDESVGNALSIRFRLGEF--DP 355

Query: 375 TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSA 434
              P   I  DV+ SPAH+ LA + A +  VLLKN   LLPL     V  A++GP A+  
Sbjct: 356 DGGPHADITPDVIDSPAHRALARETAAEAAVLLKNDRSLLPLKPGGDV--AVVGPLAD-- 411

Query: 435 KTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKG 487
           K     Y G     +T L  ++   +      G D +A    +  + +    G
Sbjct: 412 KLYSDWYGGQLPYEVTVLDGVRERAKAVTTSAGADRIALKEITTGRYLTATDG 464



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 55/104 (52%), Gaps = 4/104 (3%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQ-TQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
           + +A D A+ +   V+++G D     +E  DR  L L  RQ++LI  V  A    V+ VL
Sbjct: 583 VREAADAARRSRTAVVVVGTDPFVAGREVHDRTSLGLGERQEDLIEAVRRANPNTVV-VL 641

Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
               P  IT+A+  R++  ILW  + G     A+A+V+FG  NP
Sbjct: 642 QSSYPQSITWAQ--RHVPGILWTTHAGAETGHAVADVLFGAVNP 683


>gi|375254464|ref|YP_005013631.1| glycosyl hydrolase family 3, C-terminal domain-containing protein
           [Tannerella forsythia ATCC 43037]
 gi|363407375|gb|AEW21061.1| glycosyl hydrolase family 3, C-terminal domain protein [Tannerella
           forsythia ATCC 43037]
          Length = 775

 Score =  239 bits (609), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 173/551 (31%), Positives = 274/551 (49%), Gaps = 76/551 (13%)

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
           E ++ L   A    RLGIP + +  E +HG   +G            T FP  I  A+++
Sbjct: 107 EALNALQKYAMENTRLGIPIF-FAEECMHGHMAIG-----------TTVFPTSIGQASTW 154

Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
           +  L  ++G AI  E R+    G  I    + P +++ R+PRW R +ET GEDP+++G  
Sbjct: 155 NRTLIEKMGAAIAHETRS---QGAHIA---YGPVLDLAREPRWSRVEETFGEDPVLSGIL 208

Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
             ++VRG+QG  F  G+        +  KH  AY +       R    A++  ++L   +
Sbjct: 209 GSAFVRGLQGKDFADGR-----HTYSTLKHLAAYGIPVGGHNGR---QAQIGARELIAEH 260

Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
             PFE  VK G A  +M +YN V+G+P  ++  +L K  R +W F+G++ SD  ++  I 
Sbjct: 261 LLPFEMAVKAG-AQSVMTSYNAVDGVPCTSNTYILKKILRGEWDFNGFVVSDLGSIEGIA 319

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVNCG------SFLQKHTKAAVKQKKLPESEIDRALHNL 360
                A   + A    L AG++++ G      +  Q HT + +       SEID A+  +
Sbjct: 320 TTHRVAPDIKHAAAMALNAGVEMDLGGVAYTRNMEQAHTDSLISM-----SEIDDAVSRI 374

Query: 361 FSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSK 420
             ++  +GLF  +P +QP      +++ S  H  LA + A++ IVLLKN+  LLPL K+ 
Sbjct: 375 LRLKFEMGLFE-SPYVQP--SRTTEIIRSKEHNRLARKVAEESIVLLKNNANLLPLSKNI 431

Query: 421 SVSLALIGPNANSAKTLLGNYAGPSCRS--ITPLQALQNYVENTV---YYPGCDTVACSS 475
             S+A+IGPNA++    LG+Y  P      +T L+ ++N V  T    Y  GC     + 
Sbjct: 432 G-SIAVIGPNADNLYNQLGDYTAPQPEEHIVTILEGIRNAVSPTTVIRYVKGCAVRDTTQ 490

Query: 476 ASIDKAVDIAKGADHVVLMMG--------------------------LDQTQEKEELDRV 509
           ++ID+AV  A  ++ VVL++G                          +   +  E  DR 
Sbjct: 491 SNIDEAVRAANASNAVVLVVGGSSARDFHTKYIETGAATVSSRENELIPDMESGEGYDRK 550

Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
            L L G Q++LI  +A A  KP+I+V + G P+++  A  D+   ++L A YPGE G  A
Sbjct: 551 SLTLLGHQEKLIESIA-ATGKPLIMVYIQGRPLNMNLA--DKKASALLTAWYPGEEGGNA 607

Query: 570 LAEVIFGDHNP 580
           +A VIFGD NP
Sbjct: 608 VANVIFGDVNP 618


>gi|386346119|ref|YP_006044368.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6578]
 gi|339411086|gb|AEJ60651.1| glycoside hydrolase family 3 domain protein [Spirochaeta
           thermophila DSM 6578]
          Length = 942

 Score =  239 bits (609), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 152/437 (34%), Positives = 230/437 (52%), Gaps = 31/437 (7%)

Query: 50  LPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGT 109
           LP+ +R  DLVSRLTL+EKIS +     A+PRLGI  Y    EA HGVA VGK       
Sbjct: 17  LPLEERVNDLVSRLTLEEKISLIPVRQAAVPRLGIDPYPIGGEAAHGVAWVGK------- 69

Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-GQAIGMTFWAPNINIFRDPR 168
              AT FPQ +  + ++D  L  RIG+ IG+EARA  +  G   G+T WAP +++ RDPR
Sbjct: 70  ---ATVFPQPVGLSCTWDRDLLGRIGEVIGVEARAYRDILGTEFGLTLWAPTVDLVRDPR 126

Query: 169 WGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGT 228
           WGR +E  GEDP +  + A S VRG+QGD          L+  A  KHF A + +  +G 
Sbjct: 127 WGRTEEAYGEDPCLASELAGSLVRGMQGDH------PFYLRMGATLKHFFANNNETDRGI 180

Query: 229 TRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQ 288
           +  +    +    + + Y   F   +++ R + IM AYN VNGIP C     + +  + +
Sbjct: 181 SSSEMHPWL----MHNYYLEVFRRIIERARVTCIMTAYNAVNGIP-CLIHPAVKRLVKEE 235

Query: 289 WGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV---NCGSFLQKHTKAAVKQ 345
           WG  G++ +D     +      Y +   +++   +++G+D    +  + +    + A+++
Sbjct: 236 WGLPGFVVTDAADFGMTVGMHAYFEDHVESIAATIRSGVDAITEDDRTIVIDSLRHALER 295

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             L E ++D AL N+F +R RLG F+  P   P+  I  D +C+P+H  +A +AAQ  +V
Sbjct: 296 GLLKEEDLDTALRNIFRIRFRLGEFD-PPDRNPYAGITKDALCAPSHARIAREAAQKSVV 354

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE--NTV 463
           LLKN  GLLPL       +A++GP A+   T    Y+G     +TPL AL+  +      
Sbjct: 355 LLKN-RGLLPLHAHTLKRVAVVGPLAHEVHT--DWYSGTLPYVVTPLDALKERLTGAEVW 411

Query: 464 YYPGCDTVACSSASIDK 480
           Y  G   V   SA+  +
Sbjct: 412 YEEGSSRVRFRSAAYHR 428



 Score = 46.6 bits (109), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 56/104 (53%), Gaps = 4/104 (3%)

Query: 478 IDKAVDIAKGADHVVLMMGLD-QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
           +++  ++A  AD V+   G +     KE +DR DL+LP  QQ L+ R A  A     L+L
Sbjct: 549 LERVRELAGKADVVLAFCGNNPYINGKETVDRPDLILPPLQQALV-RTAFEANPRTALIL 607

Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +   P  I +   D ++ +IL++ + G+    AL +V+ GD +P
Sbjct: 608 IGSYPFSIAWE--DEHLPAILYSSHGGQEMGRALTDVLLGDVSP 649


>gi|147826476|emb|CAN72807.1| hypothetical protein VITISV_033721 [Vitis vinifera]
          Length = 236

 Score =  239 bits (609), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 110/218 (50%), Positives = 152/218 (69%), Gaps = 9/218 (4%)

Query: 31  FSCDPSNPS-----TETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIP 85
           + CD S  +      ++F FC  +L   +RA+DLVSR+TL EK+ Q V++A  + RLG+P
Sbjct: 17  YVCDESRYALLGLDMKSFAFCDKSLSYEERAKDLVSRMTLQEKVMQSVHTASGVRRLGLP 76

Query: 86  AYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARAL 145
            Y WWSEALHG++ +G G+FF+ TI GATSFP VIL+ A+F+  LW  +G+ +  E RA+
Sbjct: 77  EYSWWSEALHGISNLGPGVFFDETIPGATSFPTVILSTAAFNQTLWKTLGRVVSTEGRAM 136

Query: 146 YNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--DTFNGGK 203
           YN G A G+TFW+PNIN+ RD RWGR QET GEDP + G++AV+YVRG+Q    T N   
Sbjct: 137 YNLGHA-GLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTD 195

Query: 204 LKGK-LQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
           L  + L+ S+CCKH+ AYD+D+W    R+ FDARV++ 
Sbjct: 196 LNSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVSLN 233


>gi|375311103|ref|ZP_09776360.1| beta-glucosidase-related glycosidase, partial [Paenibacillus sp.
           Aloe-11]
 gi|375076843|gb|EHS55094.1| beta-glucosidase-related glycosidase, partial [Paenibacillus sp.
           Aloe-11]
          Length = 566

 Score =  239 bits (609), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 150/444 (33%), Positives = 236/444 (53%), Gaps = 33/444 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           + F    + + +R RDLVSRLTL+EKI  ++   PAI RLG+ AY+  +EA HG+A +G+
Sbjct: 10  YLFQDPKIHLDERVRDLVSRLTLEEKIESMLQYQPAIDRLGVAAYKHGTEAAHGIAWLGE 69

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNI 161
                     ATSFPQ +  A ++D+ L  +IG  IG EAR  Y     + G+T WAP +
Sbjct: 70  ----------ATSFPQPVGLACTWDTELMRQIGSVIGDEARVYYRRDPKLNGLTLWAPTV 119

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           ++ RDPRWGR +E  GEDP +TG+ +   V+G+QGD          L+A A  KHF   +
Sbjct: 120 DMERDPRWGRNEEAYGEDPYLTGELSKELVKGIQGDH------PVYLKAVATLKHFLGNN 173

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
            +  +G+     D R    +L + Y   FE    +G A  +M +YN +NG P+      +
Sbjct: 174 NEVDRGSGSSSIDPR----NLREYYLRAFEKPFTEGGAQSMMTSYNLINGTPATLYHG-V 228

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQKHT 339
           +   R +WG  G++ SD   V  I +   Y  S    V + ++ G+D   +     ++  
Sbjct: 229 NDIVRGEWGMDGFVVSDAGDVMGIANDHQYYDSHTPGVAESIRVGIDSITDDAELSKQAI 288

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
           + A+ Q  L E+++DRAL + F VR RLG F+      P+  IG + + +   + L+L+A
Sbjct: 289 REALAQGTLEEADLDRALFHTFRVRFRLGEFD-PAADNPYALIGEEHLMTEQARELSLRA 347

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL---- 455
           A++ +VLLKN  GLLPL  ++   +A+IG   N  +     Y+G     +TPL+A+    
Sbjct: 348 AREQVVLLKNERGLLPLDPAQCGKVAVIGTLGN--EVYRDWYSGTLPYRVTPLEAICAKL 405

Query: 456 --QNYVENTVYYPGCDTVACSSAS 477
             ++  E  +Y    D V  ++A+
Sbjct: 406 ESEDTKERVMYRDAKDRVVLTAAA 429


>gi|375143423|ref|YP_005005864.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361057469|gb|AEV96460.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 793

 Score =  239 bits (609), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 184/606 (30%), Positives = 290/606 (47%), Gaps = 102/606 (16%)

Query: 52  ISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALHGVA 98
           ++ R  DL+S++TLDEK  Q+        + +  +P   W    W        E L+G A
Sbjct: 50  VNARTADLLSKMTLDEKTCQMATLYGWHRVLKDSLPTDSWKNAIWKDGIANIDEHLNGFA 109

Query: 99  GVGK------------------------------GI---FFNGTIRG-----ATSFPQVI 120
           G GK                              GI   F N  IRG     AT FP  +
Sbjct: 110 GWGKTAPIDLVKDMEKHVWAMNETQRFFIEQTRLGIPADFTNEGIRGVEAYEATGFPTEL 169

Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGED 179
               +++  L ++ G   G EARAL       G T  +AP +++ RD RWGR +E+ GED
Sbjct: 170 NMGMTWNKELVHQEGIITGREARAL-------GYTNVYAPIMDVARDQRWGRLEESYGED 222

Query: 180 PLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM 239
           P +     ++  +G+Q D        GK+ ++A  KHF  Y  +      + + D +V  
Sbjct: 223 PYLVASMGIALAKGIQQD--------GKVASTA--KHFAVYSANKGAREGQARTDPQVAP 272

Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
           +++ +    PF+  +K+    G+M +YN  +GIP       L +  R + GF GY+ SD 
Sbjct: 273 REVENLLLYPFKKVIKEAGIMGVMSSYNDYDGIPVSGSNYWLIQRLRVEMGFTGYVVSDS 332

Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG----SFLQKHTKAAVKQKKLPESEIDR 355
           DA+  +      A + ++AV     AGM+V         +  + +  VK+ ++P   I+ 
Sbjct: 333 DALEYLATKHHVAANLKEAVFQAFMAGMNVRTTFKAPDSIIIYLRQLVKEGRIPMDTINH 392

Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGAD---VVCSPAHQVLALQAAQDGIVLLKNSHG 412
            + ++  V+ RLGLF+      P+ +  A+   VV S A Q +ALQA+++ +VLLKN++ 
Sbjct: 393 RVADVLRVKFRLGLFD-----HPYVESAAETRKVVNSDASQQIALQASRESVVLLKNNNN 447

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCDT 470
           +LPL KS    +A++GPNA        +Y      S+  LQ +Q  +     +Y  G D 
Sbjct: 448 ILPLVKSLD-KIAVVGPNATDDDYAHTHYGPLGSPSVNVLQGIQAKLGAGKVLYAKGVDL 506

Query: 471 V---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPG 515
           V               A   A +D AV+I K A   ++++G +     E   R DL LPG
Sbjct: 507 VDKNWPESEILPEPMDAGEQAMLDSAVNITKQAQMAIVVLGGNTRTAGESKSRTDLDLPG 566

Query: 516 RQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIF 575
            Q EL+ +  +A  KPV++VLL   P+ I +   D+ I  I++AGYPG  G +A+A+V+F
Sbjct: 567 HQLELV-KAIKATGKPVVVVLLGTQPMTINWI--DKYIDGIVYAGYPGVKGGIAVADVLF 623

Query: 576 GDHNPG 581
           GD+NPG
Sbjct: 624 GDYNPG 629


>gi|182413194|ref|YP_001818260.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
 gi|177840408|gb|ACB74660.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
           PB90-1]
          Length = 859

 Score =  238 bits (608), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 182/613 (29%), Positives = 280/613 (45%), Gaps = 100/613 (16%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS------ 91
           P+  ++ PI  R  DL++R++L+EK +QL      P + +   P   W    W       
Sbjct: 60  PYEDSSRPIDARIEDLLARMSLEEKTAQLTTLYGFPRVLKDERPTSAWREAMWKDGIGNI 119

Query: 92  -EALHGVAGVGKGI----------------------------------FFNGTIRG---- 112
            E L+G  G    +                                  F N  IRG    
Sbjct: 120 DEHLNGNTGWTNNLADPVHDLPWSLHARALNEVQRWFIEQTRLGIPVDFTNEGIRGLLHS 179

Query: 113 -ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWG 170
            ATSFP  +  A+++D  L   IG+  G EARAL       G T  ++P +++ RDPRWG
Sbjct: 180 KATSFPAELAVASTWDPALVREIGRITGREARAL-------GYTNIYSPVLDLARDPRWG 232

Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
           R  ET GEDP + G   V  VRG+Q +              +  KHF  Y +        
Sbjct: 233 RTIETYGEDPFLVGTLGVEQVRGLQAE-----------HVVSTLKHFAVYSIPKGGRDGE 281

Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
            + D + T +++   +  PF   +++  A G+M +YN  +G+P       LS+  R QWG
Sbjct: 282 ARTDPQATWREVQTIFLEPFRRAIREAGALGVMASYNDYDGVPVEGSALFLSEILRGQWG 341

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG----SFLQKHTKAAVKQK 346
           F GY+ SD  AV  I+     A +P DA+   ++AG+++       +   +  +  V+  
Sbjct: 342 FRGYVVSDSAAVEFIHSKHRVAPTPADAIRQAVEAGLNIRTNFTPPAAYAEPLRQLVRDG 401

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIV 405
           KL  + ID  + ++  V+ +LGLF+      P     AD VV +P H V+A +A ++ IV
Sbjct: 402 KLAMATIDARVRDVLRVKFQLGLFDRPYVADP---AAADRVVRAPEHLVVAQRAGREAIV 458

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVY- 464
           LLKN   LLPL ++K   + + GP A+ A      Y       +TPL  L+  +   V  
Sbjct: 459 LLKNEPALLPLDRAKLQRVLVAGPLADDAHAWWSRYGAQRLDFVTPLPGLRAKLGAAVEV 518

Query: 465 ------------YPGCDTV-----ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELD 507
                       +P  D +     A   A I+ AV  A+  D ++ ++G      +E   
Sbjct: 519 RYAKGVEAKDAAWPASDVLKDPPSAEVRAGIEAAVAAAQNVDVIIAVLGETDELCRESSS 578

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R+ L LPG QQEL+  +  A  KP++LVL  G P+ + +A   R++ +I+   +PGE G 
Sbjct: 579 RISLALPGYQQELLEAL-HATGKPLVLVLSNGRPLSVVWAA--RHVPAIVELWFPGEDGG 635

Query: 568 VALAEVIFGDHNP 580
            ALA V+ GD NP
Sbjct: 636 AALAAVLLGDANP 648


>gi|390454819|ref|ZP_10240347.1| beta-glucosidase-related glycosidase [Paenibacillus peoriae KCTC
           3763]
          Length = 998

 Score =  238 bits (608), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 146/422 (34%), Positives = 230/422 (54%), Gaps = 27/422 (6%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           + F    + + +R RDLVSRLTL+EKI  ++   PAI RLG+ AY+  +EA HG+A +G+
Sbjct: 10  YLFQDPKIHLDERVRDLVSRLTLEEKIESMLQYQPAIDRLGVAAYKHGTEAAHGIAWLGE 69

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNI 161
                     ATSFPQ +  A ++D+ L  ++G  IG EAR  Y     + G+T WAP +
Sbjct: 70  ----------ATSFPQPVGLACTWDTELMRQVGSVIGDEARVYYRRDPKLNGLTLWAPTV 119

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           ++ RDPRWGR +E  GEDP +TG+ +   V+G+QGD          L+A A  KHF   +
Sbjct: 120 DMERDPRWGRNEEAYGEDPYLTGELSKELVKGIQGDH------PVYLKAVATLKHFLGNN 173

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
            +  +G+     D R    +L + Y   FE    +G A  +M +YN +NG P+      +
Sbjct: 174 NEVDRGSGSSSIDPR----NLREYYLRAFEKPFTEGGAQSMMTSYNLINGTPATLYHG-V 228

Query: 282 SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQKHT 339
           +   R +WG  G++ SD   V  I +   Y  S    V + ++AG+D   +     ++  
Sbjct: 229 NDIVRGEWGMDGFVVSDAGDVMGIANDHQYYDSHTLGVAESIRAGIDSITDDAELSKQAI 288

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
           + A+ Q  L E+++DRAL + F VR RLG F+      P+  IG + + +   + L+L+A
Sbjct: 289 REALTQGTLEETDLDRALFHTFRVRFRLGEFD-PAADNPYALIGEEALMTEQARELSLRA 347

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV 459
           A++ +VLLKN  GLLPL  ++   +A+IG   N  +     Y+G     +TPL+A++  +
Sbjct: 348 AREQVVLLKNERGLLPLDPAQCGKVAVIGTLGN--EVYHDWYSGTLPYRVTPLEAIRAKL 405

Query: 460 EN 461
           E+
Sbjct: 406 ES 407



 Score = 43.1 bits (100), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG-PVDITFAKYDRNIGSILWAGY 561
           KEE DR  L L   QQ LI  V  A   P  +V+L G  P  I + +   +I +I++  +
Sbjct: 591 KEENDRPGLELAASQQRLIEEVYRA--NPNTIVVLTGSYPFAIPWVQ--EHIPAIVYTSH 646

Query: 562 PGEAGAVALAEVIFGDHNP 580
            G+    A+A+V+FGD+ P
Sbjct: 647 AGQEHGTAVADVLFGDYAP 665


>gi|253574420|ref|ZP_04851761.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251846125|gb|EES74132.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 782

 Score =  238 bits (608), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 171/552 (30%), Positives = 271/552 (49%), Gaps = 67/552 (12%)

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
           E ++ +   A    RLGIP      E  HG   +G           AT FP  +   +++
Sbjct: 106 EAVNAIQRYAIENSRLGIPIL-IGEECSHGHMAIG-----------ATVFPVPLSLGSTW 153

Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
           +  L+  + +A+  E RA        G   ++P +++ RDPRWGR +E  GED  +  + 
Sbjct: 154 NVELYREMCRAVARETRA------QGGAVTYSPVLDVVRDPRWGRTEECFGEDAYLISEM 207

Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
           AV+ V G+QG++     L G+   +A  KHF  Y         R      +  ++L +  
Sbjct: 208 AVASVEGLQGES-----LDGEDSVAATLKHFVGYGSSE---GGRNAGPVHMGRRELLEVD 259

Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
             PF   V+ G AS IM AYN ++G+P   +  LL    R +WGF G + +DC A+ ++ 
Sbjct: 260 LLPFRKAVEAGAAS-IMPAYNEIDGVPCTTNEELLDGVLRGEWGFDGMVITDCGAIDMLA 318

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRM 365
                A+   DA +  ++AG+D+   G    KH   AV+  +L E  +DRA+  + +++ 
Sbjct: 319 SGHDVAEDGRDAAIQAIRAGIDMEMSGVMFGKHLVEAVRSGQLEEEVLDRAVRRVLTLKF 378

Query: 366 RLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLA 425
           RLGLF   P   P  +    V+ S  H  LA Q A +G+VLLKN  G+LPL  + + ++A
Sbjct: 379 RLGLFE-RPYADP--ERAERVIGSAEHVELARQLASEGVVLLKNKDGVLPL-SADAGTIA 434

Query: 426 LIGPNANSAKTLLGNYAGPSCRS--ITPLQALQNYV----ENTVYYPGCDTVACSSASID 479
           +IGPNA++    LG+Y  P  RS   T L  +++ +    E  +Y PGC     S    D
Sbjct: 435 VIGPNADAGYNQLGDYTSPQPRSKVTTVLGGIRSKLAETPERVLYAPGCRINGNSREGFD 494

Query: 480 KAVDIAKGADHVVLMMGLDQTQE-------------------------KEELDRVDLVLP 514
            A+  A+ AD VV+++G    ++                          E +DR++L L 
Sbjct: 495 VALSCAEKADTVVMVVGGSSARDFGEGTIDLRTGASKVTDNAESDMDCGEGIDRMNLSLS 554

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           G Q ELI  + +   KP+++V + G P+   +   D +  +IL A YPG+ G  A+A+++
Sbjct: 555 GVQLELIQEIHKLG-KPLVVVYINGRPIAEPW--IDEHADAILEAWYPGQEGGHAIADIL 611

Query: 575 FGDHNP-GEFTV 585
           FGD NP G  T+
Sbjct: 612 FGDVNPSGRLTI 623


>gi|125541196|gb|EAY87591.1| hypothetical protein OsI_09002 [Oryza sativa Indica Group]
          Length = 194

 Score =  238 bits (606), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 113/193 (58%), Positives = 141/193 (73%), Gaps = 14/193 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           FC   L   QRA DLV+ LTL EK+SQL + A  + RLG+PAYEWWSE LHG++  G+GI
Sbjct: 5   FCDARLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 64

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
            FNGT+R  TSF     TAA+FD+ LW R+G+A+G EARALYN GQA G+T W+PN+NIF
Sbjct: 65  RFNGTVRAVTSF-----TAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 119

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGRGQETPGEDP+   +YAV++V G+Q       ++ G  +ASACCKH TAYDLD 
Sbjct: 120 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQ-------EIGG--EASACCKHATAYDLDY 170

Query: 225 WKGTTRYKFDARV 237
           W    RY +D++V
Sbjct: 171 WNNVVRYNYDSKV 183


>gi|15614471|ref|NP_242774.1| glucan 1,4-beta-glucosidase [Bacillus halodurans C-125]
 gi|10174526|dbj|BAB05627.1| glucan 1,4-beta-glucosidase [Bacillus halodurans C-125]
          Length = 926

 Score =  238 bits (606), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 158/454 (34%), Positives = 232/454 (51%), Gaps = 33/454 (7%)

Query: 52  ISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIR 111
           + +R   LVS LTLDEKI  +     A+PRLGI  Y+  +E  HGVA +GK         
Sbjct: 17  LDERVESLVSELTLDEKIRLMCQYQEAVPRLGIGKYKHGTEGAHGVAWLGK--------- 67

Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG-QAIGMTFWAPNINIFRDPRWG 170
            AT FPQ I    +++  L  +IG  I  EAR  Y    +  G+T WAP +++ RDPRWG
Sbjct: 68  -ATVFPQNIGLGCTWNEELMKKIGDVIAEEARVYYQENPEQNGLTIWAPTVDMERDPRWG 126

Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
           R +E  GEDP +TGK     ++G+QGD          L+ +A  KHF A + +  +    
Sbjct: 127 RTEEAYGEDPYLTGKLTTELIKGMQGDH------PFYLKTAATLKHFLANNNEEARTECS 180

Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
              D R    +L + Y   FE  VK+G    +M AYN VNG   C     ++   + +WG
Sbjct: 181 VSIDPR----NLREYYLKAFEPQVKEGHVQCVMTAYNAVNGT-LCNMNPDVNAILKEEWG 235

Query: 291 FHGYITSD-CDAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQKHTKAAVKQKK 347
             G++ SD CD V  + D + + +S  +AV   +K G+D   +    + +    A++Q  
Sbjct: 236 LQGFVVSDACDVVGSVDDHQ-FVESYAEAVALSIKNGIDNVTDDEKIIHRAIGDALEQGL 294

Query: 348 LPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLL 407
           L E ++D+AL  +F VR+RLG F+      P+  +  + +C+P H  LALQAA++ IVLL
Sbjct: 295 LSEEDLDQALKRVFRVRIRLGEFD-QENKNPYATVAKEKLCAPEHSELALQAAKEAIVLL 353

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQA-LQNYVENTVYYP 466
           KN   LLPL   K V  A+IGP AN   T    Y+G    +I+PL+  +Q   E  +Y  
Sbjct: 354 KNDD-LLPLSPEKKV--AVIGPLANEIYTDW--YSGTPPYTISPLEGIMQKANEPVLYAD 408

Query: 467 GCDTVACSSASIDKAVDIAKGADHVVLMMGLDQT 500
           G D +    A     +   +G+  +    G  +T
Sbjct: 409 GSDRIRLREAMSGNGIVAEEGSSQLTCHDGQGET 442



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 60/105 (57%), Gaps = 4/105 (3%)

Query: 478 IDKAVDIAKGADHVVLMMGLD-QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
           ID+AV+ AK AD  V+ +G       KE +DR DL LP  QQ LI  V +  +  V+ V+
Sbjct: 534 IDQAVEAAKQADTAVVFVGNQPYINGKECVDRPDLALPPAQQRLIEEVVKVNRNTVV-VI 592

Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +   P  I + K   N+ +I++  + G+    A+++V+FGD+NP 
Sbjct: 593 VGSYPYTIPWVK--ENVPAIVYTAHGGQEFGRAVSDVLFGDYNPA 635


>gi|222099590|ref|YP_002534158.1| Beta-mannanase [Thermotoga neapolitana DSM 4359]
 gi|2429092|gb|AAB70867.1| beta-xylosidase [Thermotoga neapolitana]
 gi|221571980|gb|ACM22792.1| Beta-mannanase [Thermotoga neapolitana DSM 4359]
          Length = 778

 Score =  238 bits (606), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 192/627 (30%), Positives = 288/627 (45%), Gaps = 143/627 (22%)

Query: 51  PISQRARDLVSRLTLDEKISQ--------------------------------------- 71
           P+  R +DL+SR+TL+EKI+Q                                       
Sbjct: 10  PVEVRVKDLLSRMTLEEKIAQLGSVWGYELIDERGKFKREKAKDLLKNGIGQITRPGGST 69

Query: 72  ---------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
                    LVN          RLGIPA     E L G  G+G            T+FPQ
Sbjct: 70  NLEPQEAAELVNEIQRFLVEETRLGIPAM-IHEECLTGYMGLG-----------GTNFPQ 117

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGE 178
            I  A+++D  L  ++  AI  + R L  A Q +     AP +++ RDPRWGR +ET GE
Sbjct: 118 AIAMASTWDPDLIEKMTAAIREDMRKL-GAHQGL-----APVLDVARDPRWGRTEETFGE 171

Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD----NWKGTTRYKFD 234
            P +  +  VSYV+G+QG+    G         A  KHF  Y       NW  T      
Sbjct: 172 SPYLVARMGVSYVKGLQGENIKEG-------VVATVKHFAGYSASEGGKNWAPTN----- 219

Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
             +  ++  + +  PFE+ VK+ R   +M +Y+ ++G+P  A+R LL+   R+ WGF G 
Sbjct: 220 --IPEREFREVFLFPFEAAVKEARVLSVMNSYSEIDGVPCAANRRLLTDILRKDWGFEGI 277

Query: 295 ITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEI 353
           + SD  AV+++ +    AK   ++    L+AG+DV    +   +H K  V++  +PES I
Sbjct: 278 VVSDYFAVNMLGEYHRIAKDKSESARLALEAGIDVELPKTDCYQHLKDLVEKGIVPESLI 337

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
           D A+  +  ++  LGLF  NP       +  +     +H+ LAL+ A+  I+LLKN  G 
Sbjct: 338 DEAVSRVLKLKFMLGLFE-NPY------VDVEKAKIESHRDLALEIARKSIILLKND-GT 389

Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNY-----------------AGPSC---------R 447
           LPL K+K V  ALIGPNA   + LLG+Y                   P           +
Sbjct: 390 LPLQKNKKV--ALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKK 447

Query: 448 SI--------TPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG--- 496
           SI        + L A +    +  Y  GC+      +   +A+++AK +D  ++++G   
Sbjct: 448 SIEEHMKSIPSVLDAFKEEGIDFEYAKGCEVTGEDRSGFKEAIEVAKRSDVAIVVVGDRS 507

Query: 497 ---LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
              LD T   E  D  +L LPG Q+EL+  +A+   KPV+LVL+ G P  +     DR +
Sbjct: 508 GLTLDCTT-GESRDMANLKLPGVQEELVLEIAKTG-KPVVLVLITGRPYSLKNL-VDR-V 563

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNP 580
            +IL    PGEAG  A+ +VI+G  NP
Sbjct: 564 NAILQVWLPGEAGGRAIVDVIYGKVNP 590


>gi|374320933|ref|YP_005074062.1| beta-glucosidase-related glycosidase [Paenibacillus terrae HPL-003]
 gi|357199942|gb|AET57839.1| beta-glucosidase-related glycosidase [Paenibacillus terrae HPL-003]
          Length = 976

 Score =  237 bits (605), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 153/445 (34%), Positives = 239/445 (53%), Gaps = 35/445 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           + F    L + +R RDLVSRLTL+EKI  ++   PAI RLG+ AY+  +EA HG+A +G+
Sbjct: 10  YLFQNPKLHLDERVRDLVSRLTLEEKIESMLQYQPAIERLGVAAYKHGTEAAHGIAWLGE 69

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNI 161
                     ATSFPQ +  A ++D+ L  ++G  I  EAR  Y     + G+T WAP +
Sbjct: 70  ----------ATSFPQPVGLACTWDTELMRQVGSVISDEARVYYRRDPVLNGLTLWAPTV 119

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           ++ RDPRWGR +E  GEDP +TG+ A   V+G+QGD          L+A A  KHF   +
Sbjct: 120 DMERDPRWGRNEEAYGEDPYLTGELAKELVKGIQGDH------PVYLKAVATLKHFLGNN 173

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
            +  +G+     D R    +L + Y   FE    +G+A  +M +YN +NG P+      +
Sbjct: 174 NEVDRGSDSSSIDPR----NLREYYLKAFEKPFTEGKAQSMMTSYNLINGTPATLYHG-V 228

Query: 282 SKTARRQWGFHGYITSDC-DAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQKH 338
           +   R +WG  G++ SD  D + I+ D + Y  S    V + ++AG+D   +     ++ 
Sbjct: 229 NDIVRDEWGMDGFVVSDAGDVMGIVKDHQ-YYDSHTPGVAESIRAGIDSITDDADLSKQA 287

Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
            + A+ Q  L E+++D+AL + F VR RLG F+   T  P+  IG + + +   + L+L+
Sbjct: 288 IRDALVQGMLQETDLDQALFHTFRVRFRLGEFD-PATDNPYALIGEEALMTEQARELSLR 346

Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNY 458
           AA++ IVLLKN   LLPL  +    +A+IG   N  +     Y+G     +TPL+A++  
Sbjct: 347 AAREQIVLLKNERSLLPLDPAGCGKVAVIGALGN--EVYRDWYSGTFPYFVTPLEAIRAK 404

Query: 459 VEN------TVYYPGCDTVACSSAS 477
           +E        VY    D V  ++A+
Sbjct: 405 LETKDAQDRVVYRDAKDRVIFAAAA 429



 Score = 42.7 bits (99), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG-PVDITFAKYDRNIGSILWAGY 561
           KEE DR  L L   QQ LI  V  A   P  +V+L G  P  I + +   +I +I++  +
Sbjct: 573 KEENDRPGLELAASQQRLIEEVYRA--NPNTIVVLTGSYPFAIPWVQ--EHIPAIVYTSH 628

Query: 562 PGEAGAVALAEVIFGDHNP 580
            G+    A+A+V+FGD+ P
Sbjct: 629 AGQEHGTAVADVLFGDYAP 647


>gi|334365132|ref|ZP_08514098.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
           sp. HGB5]
 gi|313158675|gb|EFR58064.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
           sp. HGB5]
          Length = 771

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 167/525 (31%), Positives = 254/525 (48%), Gaps = 59/525 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT+FP     A++++  L  R+G+ I  
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------ATTFPTAPGQASTWNPELIERMGKVIAA 167

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R         G   + P ++I RDPRW R +E+ GED  +T +   +YVRG       
Sbjct: 168 EIRL------QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRGT-----G 216

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            G L     A +  KHF AY                +  ++L +TY PPFE+ VK G A 
Sbjct: 217 SGDLSQSRHALSTLKHFIAYGASEGGQNGGSNL---LGERELRETYLPPFEAAVKAG-AR 272

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M AYN V+GIP  A+R +L+   R +WGF G++ SD  ++  +++  G A S  +A V
Sbjct: 273 SVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREAAV 332

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             L+AG+D +         + A +   + E+EIDRA+  + +++  +GLF  NP +    
Sbjct: 333 QALRAGVDADLKGGAFASLREAAEAGDVAEAEIDRAVERVLALKFEMGLFE-NPYID--- 388

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           +  A  V   AH  LAL+AA+  + LL+N  G LPL   +   +A+IGPNA++    LG+
Sbjct: 389 EAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLGD 448

Query: 441 YAGPSCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
           Y      + T    L+  +  +  VY  GC       + I  AV  A+G D  V+++G  
Sbjct: 449 YTAQQTAANTVRDGLEKLLGRDRVVYSRGCTVRGGDRSEIAAAVSAARGTDAAVVVIGGS 508

Query: 499 QTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
             ++                        E  DR  L L G Q+EL+ R+ +A   P+I+V
Sbjct: 509 SARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALLGEQEELLRRI-KATGTPLIVV 567

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + G P+D+  A    +  ++L A YPG  G  A+AE I G +NP
Sbjct: 568 CIAGRPLDLRRASEQAD--ALLMAWYPGARGGDAVAETILGRNNP 610


>gi|319901412|ref|YP_004161140.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
 gi|319416443|gb|ADV43554.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
           P 36-108]
          Length = 944

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 167/507 (32%), Positives = 260/507 (51%), Gaps = 54/507 (10%)

Query: 105 FFNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWA 158
           F N  IRG     AT+FP  +    +++  L ++IG   G EAR L       G T  +A
Sbjct: 160 FTNEGIRGVESYKATNFPTQLGLGHTWNRELIHKIGFITGREARML-------GYTNVYA 212

Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
           P +++ RD RWGR +E  GE P +  +  +  VRG+Q   +N        Q +A  KHF 
Sbjct: 213 PILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQ---YNH-------QVAATGKHFA 262

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
           AY  +        + D +++ +++ + +  PF   +++    G+M +YN  +GIP     
Sbjct: 263 AYSNNKGAREGMSRVDPQISPREVENIHIYPFRRVIREAGLLGVMSSYNDYDGIPIQGSH 322

Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GS 333
             L+   R + GF GY+ SD DAV  +Y   G AK  ++A+   ++AG+++ C      S
Sbjct: 323 YWLTTRLRGEIGFRGYVVSDSDAVEYLYTKHGTAKDMKEAIRQSVEAGLNIRCTFRSPDS 382

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAH 392
           F+    +  VK+  L E  I+  + ++  V+   GLF+  P        GAD  V    +
Sbjct: 383 FVLP-LRELVKEGGLSEEIINDRVRDILRVKFLTGLFD-TPYQSDLA--GADREVEKEEN 438

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
             +ALQA+++ IVLLKN + +LPL  S    +A+ GPNA+     L +Y   +   IT L
Sbjct: 439 GSIALQASRESIVLLKNENNMLPLDLSTVKRIAVCGPNADEKNYALTHYGPLAVEVITVL 498

Query: 453 QALQNYVE---NTVYYPGCDTV---------------ACSSASIDKAVDIAKGADHVVLM 494
           + +Q+ V      +Y  GCD V               A   A I+KA + A+ +D  V++
Sbjct: 499 KGIQDKVSGKAEVLYTKGCDLVDANWPESEIINHPLTADEQAEINKAAENARQSDVAVVV 558

Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
           +G  Q    E   R  L LPGRQ +L+  + +A  KPVILVL+ G P+ + +A  D+ + 
Sbjct: 559 LGGGQRTCGENKSRSSLDLPGRQLQLLQAI-QATGKPVILVLINGRPLSVNWA--DKYVP 615

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPG 581
           +IL A YPG  G +ALA+V+FGD+NPG
Sbjct: 616 AILEAWYPGAKGGIALADVLFGDYNPG 642


>gi|150003144|ref|YP_001297888.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|149931568|gb|ABR38266.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
          Length = 785

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 178/532 (33%), Positives = 272/532 (51%), Gaps = 69/532 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP   +  E  HG   +G            T FP  +  A++++  L  ++G+AI L
Sbjct: 131 RLGIPVL-FAEECPHGHMAIG-----------TTVFPTALSAASTWNEGLMLKMGEAIAL 178

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EAR     G  IG   + P +++ R+PRW R +ET GEDP++T    V+ ++G+QG   N
Sbjct: 179 EARL---QGANIG---YGPVLDVAREPRWSRMEETFGEDPVLTTIMGVAMMKGMQGKVQN 232

Query: 201 GGKLKGKLQASACCKHFTAYDL-DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
            GK        A  KHF AY + ++    +R    A   M+ L   Y PPF   VK+G A
Sbjct: 233 DGK-----HLYATLKHFAAYGVPESGHNGSR----ANCGMRQLLSEYLPPFRKAVKEG-A 282

Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
             +M +YN ++G+P  A++ LL+   R QWGF G++ SD  ++  I      AK  ++A 
Sbjct: 283 GTLMTSYNAIDGVPCTANKELLTDVLRNQWGFKGFVYSDLISIEGIVGMRA-AKDNKEAA 341

Query: 320 VDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
           V  LKAG+D++  G+   K+ K A ++  +  +++DRA+ N+  ++ ++GLF  NP + P
Sbjct: 342 VKALKAGLDMDLGGNAFGKNLKKAYEEGLITMADLDRAVGNVLRLKFQMGLFE-NPYVSP 400

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
             ++   +V S  H+ LA Q A++G+VLLKN  G+LPL K     LA+IGPNA+     L
Sbjct: 401 --ELAKKLVHSKEHKELARQVAREGVVLLKN-EGVLPLSKHIG-HLAVIGPNADEMYNQL 456

Query: 439 GNYAGPSCRS--ITPLQALQNYVENT---VYYPGC---DTVAC----------------- 473
           G+Y  P  R    T L  ++  V  +    Y  GC   DT A                  
Sbjct: 457 GDYTAPQVREEVATVLDGIRAAVSESTRVTYVKGCAVRDTTATDIPAAVAAAQKADAVVL 516

Query: 474 ----SSASIDKAVDIAKGADHVVL-MMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAA 528
               SSA   K   I+ GA  V      L      E  DR  L L G Q++LI+ VA   
Sbjct: 517 VVGGSSARDFKTKYISTGAATVSEDAKTLPDMDCGEGFDRSSLRLLGDQEKLISAVASTG 576

Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KP+++V + G  +++  A       ++L A YPGE G + +A+++FGD++P
Sbjct: 577 -KPLVVVYIQGRTMNMNLAA--EKAQALLTAWYPGEQGGMGIADILFGDYSP 625


>gi|390945417|ref|YP_006409177.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
           17242]
 gi|390421986|gb|AFL76492.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
           17242]
          Length = 771

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 167/525 (31%), Positives = 254/525 (48%), Gaps = 59/525 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT+FP     A++++  L  R+G+ I  
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------ATTFPTAPGQASTWNPELIERMGKVIAA 167

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R         G   + P ++I RDPRW R +E+ GED  +T +   +YVRG       
Sbjct: 168 EIRL------QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRGT-----G 216

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            G L     A +  KHF AY                +  ++L +TY PPFE+ VK G A 
Sbjct: 217 SGDLSQSRHALSTLKHFIAYGASEGGQNGGSNL---LGERELRETYLPPFEAAVKAG-AR 272

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M AYN V+GIP  A+R +L+   R +WGF G++ SD  ++  +++  G A S  +A V
Sbjct: 273 SVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREAAV 332

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             L+AG+D +         + A +   + E+EIDRA+  + +++  +GLF  NP +    
Sbjct: 333 QALRAGVDADLKGGAFASLREAAEAGDVAEAEIDRAVERVLALKFEMGLFE-NPYID--- 388

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           +  A  V   AH  LAL+AA+  + LL+N  G LPL   +   +A+IGPNA++    LG+
Sbjct: 389 EAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLGD 448

Query: 441 YAGPSCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD 498
           Y      + T    L+  +  +  VY  GC       + I  AV  A+G D  V+++G  
Sbjct: 449 YTAQQTAANTVRDGLEKLLGRDRVVYSRGCTVRGGDRSEIAAAVSAARGTDAAVVVIGGS 508

Query: 499 QTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
             ++                        E  DR  L L G Q+EL+ R+ +A   P+I+V
Sbjct: 509 SARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALLGEQEELLRRI-KATGTPLIVV 567

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            + G P+D+  A    +  ++L A YPG  G  A+AE I G +NP
Sbjct: 568 CIAGRPLDLRRASEQAD--ALLMAWYPGARGGDAVAETILGHNNP 610


>gi|6006601|emb|CAB56857.1| beta-mannanase [Thermotoga neapolitana]
          Length = 821

 Score =  237 bits (604), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 192/627 (30%), Positives = 289/627 (46%), Gaps = 143/627 (22%)

Query: 51  PISQRARDLVSRLTLDEKISQ--------------------------------------- 71
           P+  R +DL+SR+TL+EKI+Q                                       
Sbjct: 53  PVEVRVKDLLSRMTLEEKIAQLGSVWGYELIDERGKFKREKAKDLLKNGIGQITRPGGST 112

Query: 72  ---------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
                    LVN          RLGIPA     E L G  G+G            T+FPQ
Sbjct: 113 NLEPQEAAELVNEIQRFLVEETRLGIPAM-IHEECLTGYMGLG-----------GTNFPQ 160

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGE 178
            I  A+++D  L  ++  AI  + R L  A Q +     AP +++ RDPRWGR +ET GE
Sbjct: 161 AIAMASTWDPDLIEKMTAAIREDMRKL-GAHQGL-----APVLDVARDPRWGRTEETFGE 214

Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD----NWKGTTRYKFD 234
            P +  +  VSYV+G+QG+    G +       A  KHF  Y       NW  T      
Sbjct: 215 SPYLVARMGVSYVKGLQGENIKEGVV-------ATVKHFAGYSASEGGKNWAPTN----- 262

Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
             +  ++  + +  PFE+ VK+ R   +M +Y+ ++G+P  A+R LL+   R+ WGF G 
Sbjct: 263 --IPEREFREVFLFPFEAAVKEARVLSVMNSYSEIDGVPCAANRRLLTDILRKDWGFEGI 320

Query: 295 ITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEI 353
           + SD  AV+++ +    AK   ++    L+AG+DV    +   +H K  V++  +PES I
Sbjct: 321 VVSDYFAVNMLGEYHRIAKDKSESARLALEAGIDVELPKTDCYQHLKDLVEKGIVPESLI 380

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGL 413
           D A+  +  ++  LGLF  NP       +  +     +H+ LAL+ A+  I+LLKN  G 
Sbjct: 381 DEAVSRVLKLKFMLGLFE-NPY------VDVEKAKIESHRDLALEIARKSIILLKND-GT 432

Query: 414 LPLPKSKSVSLALIGPNANSAKTLLGNY-----------------AGPSC---------R 447
           LPL K+K V  ALIGPNA   + LLG+Y                   P           +
Sbjct: 433 LPLQKNKKV--ALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKK 490

Query: 448 SI--------TPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG--- 496
           SI        + L A +    +  Y  GC+      +   +A+++AK +D  ++++G   
Sbjct: 491 SIEEHMKSIPSVLDAFKEEGIDFEYAKGCEVTGEDRSGFKEAIEVAKRSDVAIVVVGDRS 550

Query: 497 ---LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNI 553
              LD T   E  D  +L LPG Q+EL+  +A+   KPV+LVL+ G P  +     DR +
Sbjct: 551 GLTLDCTT-GESRDMANLKLPGVQEELVLEIAKTG-KPVVLVLITGRPYSLKNL-VDR-V 606

Query: 554 GSILWAGYPGEAGAVALAEVIFGDHNP 580
            +IL    PGEAG  A+ +VI+G  NP
Sbjct: 607 NAILQVWLPGEAGGRAIVDVIYGKVNP 633


>gi|308071263|ref|YP_003872868.1| beta-glucosidase-related glycosidase [Paenibacillus polymyxa E681]
 gi|305860542|gb|ADM72330.1| Beta-glucosidase-related glycosidase [Paenibacillus polymyxa E681]
          Length = 984

 Score =  237 bits (604), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 151/445 (33%), Positives = 237/445 (53%), Gaps = 35/445 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           + F    + + +R RDLV RLTL+EK+  ++   PAI RLG+ AY+  +EA HG+A +G+
Sbjct: 10  YLFQDPKIHLDERVRDLVLRLTLEEKVESMLQYQPAIDRLGVAAYKHGTEAAHGIAWLGE 69

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNI 161
                     ATSFPQ +  A ++D+ L   IG  I  EAR  Y     + G+T WAP +
Sbjct: 70  ----------ATSFPQPVGLACTWDTELMRLIGSVIADEARVYYRRDPKLNGLTLWAPTV 119

Query: 162 NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
           ++ RDPRWGR +E  GEDP +TG+ +   V+G+QGD          L+A A  KHF   +
Sbjct: 120 DMERDPRWGRNEEAYGEDPYLTGELSKELVKGIQGDH------PVYLKAVATLKHFLGNN 173

Query: 222 LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
            +  +G      D R    +L + Y   FE    +G A  +M +YN +NG P+      +
Sbjct: 174 NEVDRGIGSSSIDPR----NLREYYLRAFEKPFTEGGAQSMMTSYNLINGTPATLYHG-V 228

Query: 282 SKTARRQWGFHGYITSDC-DAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQKH 338
           +   R +WG  G++ SD  D + I+ D + YA S    V + ++AG+D   +     ++ 
Sbjct: 229 NDIVRDEWGMDGFVVSDAGDIMGIVNDHQYYA-SHTPGVAESIRAGIDSITDDAELSKQA 287

Query: 339 TKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQ 398
            + A+ Q  L E ++DRAL + F VR RLG F+      P+  IG + + +   + L+L+
Sbjct: 288 IREALAQGTLQEEDLDRALFHTFRVRFRLGEFD-PAADNPYASIGEEALMTEQARELSLR 346

Query: 399 AAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQAL--- 455
           AA++ IVLLKN H LLPL  ++   +A+IG + N  +     Y+G     +TPL+A+   
Sbjct: 347 AAREQIVLLKNKHELLPLNPARCGKVAVIGTHGN--EVYRDWYSGTLPYRVTPLEAIRAK 404

Query: 456 ---QNYVENTVYYPGCDTVACSSAS 477
              ++  E  VY    + V  ++A+
Sbjct: 405 LEGEDAKERVVYRDAKNRVTLTAAN 429



 Score = 42.7 bits (99), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG-PVDITFAKYDRNIGSILWAGY 561
           KEE DR  L L   QQ LI  V      P ++V+L G  P  I + +   +I +I++  +
Sbjct: 591 KEENDRRGLELAASQQRLIEEVYRV--NPNMIVILTGSYPFAIPWVQ--EHIPAIVYTSH 646

Query: 562 PGEAGAVALAEVIFGDHNP 580
            G+    A+A+V+FGD+ P
Sbjct: 647 AGQEHGTAIADVLFGDYAP 665


>gi|354582345|ref|ZP_09001247.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353199744|gb|EHB65206.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 765

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 168/552 (30%), Positives = 273/552 (49%), Gaps = 67/552 (12%)

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
           E ++++   A    RLGIP      E  HG   +G           AT FP  +   +++
Sbjct: 89  EAVNEIQRYAVEHSRLGIPIL-IGEECSHGHMAIG-----------ATVFPVPLSLGSTW 136

Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
           ++ L+  + +A+ LE R+        G   ++P +++ RDPRWGR +E  GEDP + G++
Sbjct: 137 NTELYREMCRAVALETRS------QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLIGEF 190

Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
           A + V G+QG++     L G+   +A  KHF  Y         R      +  ++L +  
Sbjct: 191 AAASVEGLQGES-----LDGEASVAATLKHFVGYGSSE---GGRNAGPVHMGTRELMEVD 242

Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
             PF+  V+ G AS IM AYN ++G+P   +  LL    R++WGF G + +DC A++++ 
Sbjct: 243 MYPFKKAVEAGAAS-IMPAYNEIDGVPCTVNEELLDGVLRKEWGFDGMVITDCGAINMLA 301

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRM 365
                A+   DA V  + AG+D+   G     + + AV++K+L  S +D A+  + +++ 
Sbjct: 302 AGHDTAEDGMDAAVSAISAGIDMEMSGEMFGMYLERAVQEKRLDVSVLDEAVRRVLTLKF 361

Query: 366 RLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLA 425
           +LGLF  NP   P       V+    H+ +A Q A +GIVLLKN    LPL K   V +A
Sbjct: 362 KLGLFE-NPYADP--ARAEQVIGCSRHREMARQLAAEGIVLLKNEGSTLPLSKEDGV-IA 417

Query: 426 LIGPNANSAKTLLGNYAGPS--CRSITPLQALQNYV----ENTVYYPGCDTVACSSASID 479
           +IGPNA+     LG+Y  P    R +T L+ ++  +       +Y PGC     S    +
Sbjct: 418 VIGPNADQGYNQLGDYTSPQPPSRVVTVLEGIRAKLGGDKGRVLYAPGCRINGDSREGFE 477

Query: 480 KAVDIAKGADHVVLMMGLDQTQE-------------------------KEELDRVDLVLP 514
            A+  A  AD VVL++G    ++                          E +DR+ L L 
Sbjct: 478 LALSCAGQADTVVLVLGGSSARDFGEGTIDLRTGASKVTGNDWSDMDCGEGIDRMTLQLS 537

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           G Q EL   + +  K+ +++V + G P+   +   DR+  +IL A YPG+ G  A+A+++
Sbjct: 538 GVQLELAREIHKLGKR-LVVVYINGRPIAEPW--IDRHADAILEAWYPGQEGGHAVADIL 594

Query: 575 FGDHNP-GEFTV 585
           FGD NP G+ T+
Sbjct: 595 FGDVNPSGKLTI 606


>gi|189464211|ref|ZP_03012996.1| hypothetical protein BACINT_00548 [Bacteroides intestinalis DSM
           17393]
 gi|189438001|gb|EDV06986.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 814

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 174/539 (32%), Positives = 271/539 (50%), Gaps = 68/539 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  R+G+AI  
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRRMGRAIAT 196

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GED  + G    + V+G QG+   
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQGEF-- 248

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
             + KGK+ A+   KHF AY    W         A V  +++ +   PPF   V  G A 
Sbjct: 249 -PRTKGKVIAT--LKHFAAY---GWTEGGHNGGSAHVGNREMEEAIYPPFREAVAAG-AL 301

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++GIP  A+ NLL+   + +W F G++ SD  A+  + +  G A +  +A V
Sbjct: 302 SVMSSYNEIDGIPCTANSNLLTGLLKERWQFKGFVVSDLYAIGGLRE-HGVADTDYEAAV 360

Query: 321 DVLKAGMDVNCGSFLQK-HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+ +       AVK+  + E  I++A+  + +++  +GLF+      PF
Sbjct: 361 KAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSRILALKFHMGLFD-----HPF 415

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              +    VV S  H  LA + A+  I+LLKN + LLPL K K+ ++A+IGPNA++   +
Sbjct: 416 VDEREPEQVVASTEHLELAREVARQSIILLKNKNELLPLNK-KTKTIAVIGPNADNIYNM 474

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P   S  +T L  ++  V N    +Y  GC     S +   +A++ A+ +D VV
Sbjct: 475 LGDYTAPQSESSVVTVLDGIRQKVSNDTHIIYAKGCAVRDSSKSGFQEAIEAARQSDVVV 534

Query: 493 LMMG-----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAK 529
           ++MG                       +   +  E  DR  L L GRQ+ELI  V +   
Sbjct: 535 MVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEGYDRSTLELLGRQRELIREVGK-LN 593

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
           KP++LVL+ G P  +     +  + +I+ A YPG  G  A+A+V+FGD+NP     I+V
Sbjct: 594 KPIVLVLIKGRP--LLLEGIEAEVDAIVDAWYPGMQGGNAVADVLFGDYNPAGRLTISV 650


>gi|329962030|ref|ZP_08300041.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328530678|gb|EGF57536.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 941

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 191/663 (28%), Positives = 302/663 (45%), Gaps = 115/663 (17%)

Query: 8   LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTET---FPFCKTTL---------PISQR 55
           L+  +L L  ++LLT   + + P +  P+           F K  +          +  R
Sbjct: 4   LIAAVLLLSNSALLTAQKTMKVPATYKPTKSEMYHKGWIDFNKNGVMDVYEDPAATVDAR 63

Query: 56  ARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW------------------------ 89
             DL+ ++TLDEK  Q+V       + +  +P  EW                        
Sbjct: 64  VEDLLKQMTLDEKTCQMVTLYGYKRVLKDALPTPEWKQMLWKDGIGAIDEHLNGFQQWGL 123

Query: 90  --------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSFPQVILTA 123
                   W  + H           V     GI   F N  IRG     AT+FP  +   
Sbjct: 124 PPSDNENVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESYKATNFPTQLGLG 183

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
            +++  L +++G   G EAR L       G T  +AP +++ RD RWGR +E  GE P +
Sbjct: 184 HTWNRALIHKVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYGESPYL 236

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
             +  +  VRG+Q               +A  KHF AY  +        + D + +  ++
Sbjct: 237 VAELGIEMVRGLQQ------------HVAATGKHFAAYSNNKGAREGMARVDPQTSPHEV 284

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
            + +  PF   +K+    G+M +YN  +GIP       L+   R + GF GY+ SD DAV
Sbjct: 285 ENIHIYPFRRVIKEAGLLGVMSSYNDYDGIPIQGSYYWLTTRLRDEMGFRGYVVSDSDAV 344

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
             +Y   G AK  ++AV   ++AG++V C      SF+    +  VK+  L E  ++  +
Sbjct: 345 EYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLP-LRELVKEGGLDEETVNDRV 403

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKNSHGLLPL 416
            ++  V+  +GLF+  P        GAD  V    ++ +ALQA+++ +VLLKN +  LPL
Sbjct: 404 RDILRVKFLIGLFDA-PYQTDLA--GADKEVEKEENEAVALQASRESVVLLKNENSTLPL 460

Query: 417 PKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCDTVAC 473
             +    +A+ GPNA+     L +Y   +    T L+ +Q+ V      +Y  GCD V  
Sbjct: 461 NINTVKKIAVCGPNADEDGYALTHYGPLAVEVTTVLKGIQDKVNGKAEVLYTKGCDLVDA 520

Query: 474 S---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQ 518
           +                A I+KAV+ A+ AD  V+++G  Q    E   R  L LPGRQ 
Sbjct: 521 NWPESEIIDYPLTPDEQAEINKAVENARRADVAVVVLGGGQRTCGENKSRSSLDLPGRQL 580

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           +L+  V +A  KPV+L+L+ G P+ + +A  D+ + +IL A YPG  G VALA+++FGD+
Sbjct: 581 QLLQAV-QATGKPVVLILINGRPLSVNWA--DKYVPAILEAWYPGSKGGVALADILFGDY 637

Query: 579 NPG 581
           NPG
Sbjct: 638 NPG 640


>gi|427383551|ref|ZP_18880271.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728735|gb|EKU91590.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
           12058]
          Length = 939

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 185/607 (30%), Positives = 290/607 (47%), Gaps = 101/607 (16%)

Query: 52  ISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALHGVA 98
           +  R  DL+S++TLDEK  Q+V       + +  +P  EW    W        E L+G  
Sbjct: 57  LDARIEDLLSQMTLDEKTCQMVTLYGYKRVLKDDLPTPEWKQMLWKDGIGAIDEHLNGFQ 116

Query: 99  GVGK-------------------------------GI---FFNGTIRG-----ATSFPQV 119
             G                                GI   F N  IRG     AT+FP  
Sbjct: 117 QWGLPPSDNPNVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVESYRATNFPTQ 176

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGE 178
           +    +++  L +++G   G EAR L       G T  +AP +++ RD RWGR +E  GE
Sbjct: 177 LGLGHTWNRKLIHQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYGE 229

Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
            P +  +  +  VRG+Q +           Q +A  KHF AY  +        + D +++
Sbjct: 230 SPYLVAELGIEMVRGMQHNH----------QVAATGKHFVAYSNNKGAREGMARVDPQMS 279

Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
            +++   +  PF+  +K+    G+M +YN  +GIP       L+K  R + GF GY+ SD
Sbjct: 280 PREVEMIHVYPFKRVIKEAGMLGVMSSYNDYDGIPIQGSYYWLTKRLRGEMGFRGYVVSD 339

Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPESEI 353
            DAV  +Y     AK  ++AV   ++AG++V C +F    +     +  VK+  L E  I
Sbjct: 340 SDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEDII 398

Query: 354 DRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA-HQVLALQAAQDGIVLLKNSHG 412
           +  + ++  V+  +GLF+  P        GAD     A ++ +ALQA+++ ++LLKN + 
Sbjct: 399 NDRVRDILRVKFLIGLFDA-PYQTDLA--GADKEVEKAENEAVALQASRESLILLKNENN 455

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCD 469
           +LPL  +   ++A+ GPNAN     L +Y   +   IT L+ ++   E     +Y  GCD
Sbjct: 456 VLPLDINNIKTIAVCGPNANEEGYALTHYGPLAVEVITVLEGIRQKAEGKAEVLYAKGCD 515

Query: 470 TVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
            V  +                A I+KAV+ A+ AD  V+++G  Q    E   R  L LP
Sbjct: 516 LVDANWPESELIEYPMTNEEQAEINKAVENARKADVAVVVLGGGQRTCGENKSRSSLDLP 575

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           GRQ +L+  V +A  KPV+LVL+ G P+ I +A  D+ + +IL   YPG  G  A+A+V+
Sbjct: 576 GRQLKLLQAV-QATGKPVVLVLINGRPLSINWA--DKFVPAILETWYPGSKGGTAVADVL 632

Query: 575 FGDHNPG 581
           FGD+NPG
Sbjct: 633 FGDYNPG 639


>gi|423281958|ref|ZP_17260843.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
           615]
 gi|404582445|gb|EKA87139.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
           615]
          Length = 805

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 177/539 (32%), Positives = 265/539 (49%), Gaps = 68/539 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  ++G+ I +
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GEDP + G    + VRG QG+T N
Sbjct: 188 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 241

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            GK        A  KHF +Y    W         A +  ++L +   PPF   V  G A 
Sbjct: 242 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 292

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P    R LL+   + +W F G++ SD  AV  + +  G A +  +A +
Sbjct: 293 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 351

Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+    +   AAVK+  +  + ID+A+  + S++ ++GLF+      PF
Sbjct: 352 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 406

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              K  A +V S  H  LA + A+  IVLLKN   LLPL K    +LA+IGPNA++   +
Sbjct: 407 VDEKQAAQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 465

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P      +T L  ++  V      +Y  GC     S      A++ A+ AD VV
Sbjct: 466 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 525

Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
           ++MG     D + E EE                    DR  L L GRQ EL+  ++    
Sbjct: 526 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 584

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
           KPV+LVL+ G P+ +  A  +    +I+ A YPG  G  A+A+V+FGD+NP     ++V
Sbjct: 585 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLSV 641


>gi|317477144|ref|ZP_07936385.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316906687|gb|EFV28400.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 814

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 174/539 (32%), Positives = 271/539 (50%), Gaps = 68/539 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  R+G+AI  
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRRMGRAIAT 196

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GED  + G    + V+G QG+   
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQGEF-- 248

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
             + KGK+ A+   KHF AY    W         A V  +++ +   PPF   V  G A 
Sbjct: 249 -PRTKGKVIAT--LKHFAAY---GWTEGGHNGGSAHVGNREMEEAIYPPFREAVAAG-AL 301

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++GIP  A+ NLL+   +++W F G++ SD  A+  + +  G A +  +A V
Sbjct: 302 SVMSSYNEIDGIPCTANSNLLTGLLKKRWQFKGFVVSDLYAIGGLRE-HGVADTDYEAAV 360

Query: 321 DVLKAGMDVNCGSFLQK-HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+ +       AVK+  + E  I++A+  + +++  +GLF+      PF
Sbjct: 361 KAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSRILALKFHMGLFD-----HPF 415

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              +    VV S  H  LA + A+  I+LLKN + LLPL K K  ++A+IGPNA++   +
Sbjct: 416 VDEREPEQVVASTEHLELAREVARQSIILLKNKNELLPLNK-KMKTIAVIGPNADNIYNM 474

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P   S  +T L  ++  V N    +Y  GC     S +   +A++ A+ +D VV
Sbjct: 475 LGDYTAPQSESSVVTVLDGIRQKVSNDTHIIYAKGCAVRDSSKSGFQEAIEAARQSDVVV 534

Query: 493 LMMG-----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAK 529
           ++MG                       +   +  E  DR  L L GRQ+ELI  V +   
Sbjct: 535 MVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEGYDRSTLELLGRQRELIREVGK-LN 593

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
           KP++LVL+ G P  +     +  + +I+ A YPG  G  A+A+V+FGD+NP     I+V
Sbjct: 594 KPIVLVLIKGRP--LLLEGIEAEVDAIVDAWYPGMQGGNAVADVLFGDYNPAGRLTISV 650


>gi|167533461|ref|XP_001748410.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773222|gb|EDQ86865.1| predicted protein [Monosiga brevicollis MX1]
          Length = 781

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 193/617 (31%), Positives = 282/617 (45%), Gaps = 130/617 (21%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVN----SAPAIPRLGIPAYEWWSEALHGVA 98
            P+   TLP+S+R   LV+ +T+ E   QL      +APAI RLG+P+Y+W +  LHG+ 
Sbjct: 29  LPWFDPTLPLSERVALLVNNMTIKEVSMQLSTNFDTAAPAIERLGVPSYDWRNNFLHGL- 87

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQA------- 151
            V  GI        AT FPQ I  AA++ + L   IG+   +E RA YN  +        
Sbjct: 88  -VDNGI--------ATMFPQAIGLAATWSTELLTAIGRVGAVEQRAKYNINKTANGDVPM 138

Query: 152 -IGMTFW-----------APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQG--D 197
             G+  W            PN    R     RGQET GEDP++TG   V+ V+G+Q   D
Sbjct: 139 NYGLNVWQVAGHESSTLVKPN----RSSNACRGQETYGEDPVLTGTLGVAIVKGLQTLPD 194

Query: 198 TFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQG 257
           + +       L   A  KHF AY LD  K   R  FD  ++M DL  TY P F++ V++G
Sbjct: 195 SAD------YLALGATPKHFVAYSLD--KTPPRLSFDPTISMVDLRQTYFPAFKAVVQEG 246

Query: 258 RASGIMC-------------------------------------------AYNRVNGIPS 274
           RA+ +MC                                           +YN VNG P 
Sbjct: 247 RATSLMCRFVILTTHKAVHICWIACLRARATNVCKPLSMTSALASRTLPHSYNGVNGYPM 306

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-S 333
           CA   +L    R +WGF G+ TSD DA+      + Y+ +   A    L AG+D+N G +
Sbjct: 307 CAS-PMLDLIVRNEWGFDGFFTSDSDAIVFFVSEQNYSTNTIHAAAAALNAGVDLNSGPA 365

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
           +L+ H   A +   + E  +  +   LF+ R+R G F+ +  + PF       + SP HQ
Sbjct: 366 YLELHD--AYEHGLVTEQALRTSAERLFTFRLRTGEFDPD-ELVPFSSYDERNISSPVHQ 422

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            L L+ A++ +VLLKNS  +LPL  +    +A+IG +   A                   
Sbjct: 423 ALNLRVAEESLVLLKNSQDVLPLDLASLKHVAIIGLSTTLAT------------------ 464

Query: 454 ALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD----------QTQEK 503
              N V  T +  GCD  +  ++ I  AV+ A+ AD V L++GL              E 
Sbjct: 465 ---NNVTFT-FEEGCDIESNDTSRIPAAVEAARQADVVFLVLGLHVCTEHANTPYHNAEC 520

Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
           E  DR  + LPG Q+EL   +  A  + VI + + GGPV       +R+  ++L A Y G
Sbjct: 521 EGHDRESIGLPGVQKELAESIFSAHTR-VITIYVNGGPVASPIVA-ERS-AALLEAWYGG 577

Query: 564 EAGAVALAEVIFGDHNP 580
           +    A++ +IFG  +P
Sbjct: 578 QHAGTAVSNIIFGKVSP 594


>gi|329954674|ref|ZP_08295734.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328527215|gb|EGF54219.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 855

 Score =  236 bits (602), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 188/602 (31%), Positives = 272/602 (45%), Gaps = 103/602 (17%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQL---------------------------- 72
           +  P+ +  LP+  R  DL+ R+TLDEKI+Q+                            
Sbjct: 23  QQLPYQQAALPVEVRVDDLLQRMTLDEKIAQIRHVHSWDIFNGQELDENKLAKVVGDKCW 82

Query: 73  --VNSAPAI------------------PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRG 112
             V   P                     RL IP +   +EALHG A             G
Sbjct: 83  GFVEGFPLTGENCRKNMRKIQEYMLNHTRLRIPCFNV-AEALHGSAH-----------EG 130

Query: 113 ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGR 171
           +T +PQ I   ++F+  L YR  + I  E         A GM    AP +++ RD RWGR
Sbjct: 131 STIYPQNIALGSTFNPDLAYRKAKMIAGEL-------HAQGMKQILAPCVDVVRDLRWGR 183

Query: 172 GQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRY 231
            +E+ GEDP + G + ++ V+G Q               S   KHF  +   N  G    
Sbjct: 184 VEESYGEDPFLCGLFGIAEVKGYQEQGI-----------SPMLKHFGPHG--NPLGGLNL 230

Query: 232 KFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF 291
                  ++DL D Y  PFE  +K      +M  YN  N IP+ A   LL+   R+QWGF
Sbjct: 231 A-SVECGVRDLHDIYLKPFEMIIKSQPLKAVMSTYNSWNRIPNSASHYLLTDVLRKQWGF 289

Query: 292 HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPES 351
            GY+ SD  A+ ++      A SPE+  +  L AG+DV   S      K AVK+ +LP S
Sbjct: 290 TGYVYSDWGAIEMLQTFHHTANSPEECALQALMAGLDVEASSECYPALKQAVKEGRLPVS 349

Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFG-KIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
            ID A+  + + +   GLF       PFG K  A  V S     LA + A + +VLLKN 
Sbjct: 350 YIDEAVRRVLTAKFETGLFE-----DPFGEKYRAGKVHSEESVRLAREIADESVVLLKNE 404

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPG 467
             LLPL + +  ++A+IGPNA+  +     ++  +   +TPLQ ++  V N V   Y  G
Sbjct: 405 GKLLPLDEKRLDAVAVIGPNADQVQFGDYTWSRSNKDGVTPLQGIRRLVGNKVKVHYAKG 464

Query: 468 CDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQ 518
           CD ++  ++ I KAV+ A+ +D  +L +G            T   E  D  DL L G Q 
Sbjct: 465 CDMMSPDTSLIAKAVEAARKSDVAILFVGSSSASLARDYSNTNCGEGFDLNDLTLAGAQS 524

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
            LI  V  A   PV+LVL+ G P  +++ K   +I +IL   Y GE    ++A+++FG  
Sbjct: 525 GLIKAVY-ATGTPVVLVLVSGKPFVLSWEK--EHIPAILAQWYAGEQEGNSIADILFGRV 581

Query: 579 NP 580
           NP
Sbjct: 582 NP 583


>gi|307718113|ref|YP_003873645.1| glucan 1,4-beta-glucosidase [Spirochaeta thermophila DSM 6192]
 gi|306531838|gb|ADN01372.1| glucan 1,4-beta-glucosidase [Spirochaeta thermophila DSM 6192]
          Length = 952

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 149/439 (33%), Positives = 231/439 (52%), Gaps = 31/439 (7%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGI 104
           F    LP+ +R  DL+SRLTL+EK+S +      +PRLGI  Y    EA HGVA VG+  
Sbjct: 22  FRDPDLPLEERVSDLISRLTLEEKLSLIPVRQAGVPRLGIDPYPIGGEAAHGVAWVGE-- 79

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA-GQAIGMTFWAPNINI 163
                   AT FPQ +  + ++D  L  R+G+ IG+EARA  +  G   G+T WAP +++
Sbjct: 80  --------ATVFPQPVGLSCTWDKDLLRRVGEVIGVEARAYRDILGTEFGLTLWAPTVDL 131

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
            RDPRWGR +E  GEDP +  + A S VRG+QGD          L+  A  KHF A + +
Sbjct: 132 VRDPRWGRTEEAYGEDPCLASELAGSLVRGMQGDH------PFYLRMGATLKHFFANNNE 185

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
             +G +  +    +    + + Y   F   +++ R + IM AYN VNG P C     + +
Sbjct: 186 TDRGISSSEIHPWL----MHNYYLEVFRRIIERARVTCIMTAYNAVNGTP-CLLHPAVRE 240

Query: 284 TARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV---NCGSFLQKHTK 340
             + +WG  G++ +D     +      Y +   +++   L+AG+D    +  + + +  +
Sbjct: 241 LVKERWGLPGFVVTDAADFGMTVGMHAYFEDHAESIAATLRAGVDALTEDDRTLVIESLR 300

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A+++  L E ++D AL N+F +R RLG F+  P   P+  I  + +C+P+H  LA +AA
Sbjct: 301 RALERGLLEEKDLDAALRNIFRIRFRLGEFD-PPERNPYAGITREALCAPSHTRLAREAA 359

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE 460
           +  +VLLKN  G+LPL  S    +A++GP A+   T    Y+G     +TPLQAL+  + 
Sbjct: 360 RKSVVLLKNE-GILPLSASSLRRVAVVGPLADEVHT--DWYSGTLPYVVTPLQALKERLP 416

Query: 461 --NTVYYPGCDTVACSSAS 477
                Y  G   V   SA+
Sbjct: 417 GVEVWYEEGSSRVRLRSAA 435



 Score = 42.7 bits (99), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 52/97 (53%), Gaps = 4/97 (4%)

Query: 485 AKGADHVVLMMGLDQ-TQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVD 543
           A+ AD V+  MG +     KE +DR DL LP  Q  L+ R A  A    +LVL+   P  
Sbjct: 566 AEAADVVLAFMGNNPFINGKETVDRPDLALPPLQHRLV-RTAFEANPRTVLVLIGSYPFA 624

Query: 544 ITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + +   D ++ ++L++ + G+    AL +V+ G+ +P
Sbjct: 625 VPWE--DAHLPALLYSSHGGQEMGQALVDVLLGEVSP 659


>gi|265765465|ref|ZP_06093740.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
 gi|263254849|gb|EEZ26283.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
          Length = 814

 Score =  236 bits (601), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 176/539 (32%), Positives = 264/539 (48%), Gaps = 68/539 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  ++G+ I +
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GEDP + G    + VRG QG+T N
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGAMGTALVRGFQGETLN 250

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            GK        A  KHF +Y    W         A +  ++L +   PPF   V  G A 
Sbjct: 251 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 301

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P    R LL+   + +W F G++ SD  AV  + +  G A +  +A +
Sbjct: 302 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 360

Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+    +   AAVK+  +  + ID+A+  + S++ ++GLF+      PF
Sbjct: 361 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 415

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              K    +V S  H  LA + A+  IVLLKN   LLPL K    +LA+IGPNA++   +
Sbjct: 416 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 474

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P      +T L  ++  V      +Y  GC     S      A++ A+ AD VV
Sbjct: 475 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 534

Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
           ++MG     D + E EE                    DR  L L GRQ EL+  ++    
Sbjct: 535 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 593

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
           KPV+LVL+ G P+ +  A  +    +I+ A YPG  G  A+A+V+FGD+NP     ++V
Sbjct: 594 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLSV 650


>gi|60680320|ref|YP_210464.1| beta-glucosidase [Bacteroides fragilis NCTC 9343]
 gi|60491754|emb|CAH06512.1| putative beta-glucosidase [Bacteroides fragilis NCTC 9343]
          Length = 814

 Score =  235 bits (600), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 176/539 (32%), Positives = 264/539 (48%), Gaps = 68/539 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  ++G+ I +
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GEDP + G    + VRG QG+T N
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 250

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            GK        A  KHF +Y    W         A +  ++L +   PPF   V  G A 
Sbjct: 251 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 301

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P    R LL+   + +W F G++ SD  AV  + +  G A +  +A +
Sbjct: 302 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 360

Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+    +   AAVK+  +  + ID+A+  + S++ ++GLF+      PF
Sbjct: 361 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 415

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              K    +V S  H  LA + A+  IVLLKN   LLPL K    +LA+IGPNA++   +
Sbjct: 416 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 474

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P      +T L  ++  V      +Y  GC     S      A++ A+ AD VV
Sbjct: 475 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCTVRDSSRTGFKDAIETARNADAVV 534

Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
           ++MG     D + E EE                    DR  L L GRQ EL+  ++    
Sbjct: 535 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 593

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
           KPV+LVL+ G P+ +  A  +    +I+ A YPG  G  A+A+V+FGD+NP     ++V
Sbjct: 594 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLSV 650


>gi|255532174|ref|YP_003092546.1| glycoside hydrolase family protein [Pedobacter heparinus DSM 2366]
 gi|255345158|gb|ACU04484.1| glycoside hydrolase family 3 domain protein [Pedobacter heparinus
           DSM 2366]
          Length = 799

 Score =  235 bits (600), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 175/607 (28%), Positives = 289/607 (47%), Gaps = 102/607 (16%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALHG- 96
           P++ R  +L+S++TL+EK  Q+        + +  +P  EW    W        E L+G 
Sbjct: 54  PLNARIDNLLSQMTLEEKTCQMATLYGWKRVLKDSLPTKEWKTAIWKDGIANIDEHLNGF 113

Query: 97  -----------VAGVGKGI---------------------FFNGTIRG-----ATSFPQV 119
                      V  + K +                     F N  IRG     AT FP  
Sbjct: 114 LTWGVTSTSELVTDIKKHVWAMNETQRFFIEQTRLGIPVDFTNEGIRGVEAYEATGFPTQ 173

Query: 120 ILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGE 178
           +    +++  L  ++G+  G EARAL       G T  +AP +++ RD RWGR +E  GE
Sbjct: 174 LNMGMTWNRNLIRKMGRITGQEARAL-------GYTNVYAPILDVARDQRWGRLEEVYGE 226

Query: 179 DPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVT 238
           DP +  +  V    G+Q +           Q ++  KHF  Y  +        + D +V+
Sbjct: 227 DPYLVARLGVEMTLGMQENN----------QIASTAKHFAVYSANKGAREGLARTDPQVS 276

Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
            +++ D    PF+  +++    G+M +YN  NGIP       L++  R+ +GF GY+ SD
Sbjct: 277 PREVEDIMLYPFKKVIQEAGIMGVMSSYNDYNGIPITGSEYWLTQRLRKDFGFGGYVVSD 336

Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG----SFLQKHTKAAVKQKKLPESEID 354
            DA+  +Y+    A + ++AV     AG++V         +  + +  V + ++P   I+
Sbjct: 337 SDALEYLYNKHHVAANLKEAVFQAFMAGLNVRTTFRPPDSIIIYARQLVNEGRIPIETIN 396

Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGA---DVVCSPAHQVLALQAAQDGIVLLKNSH 411
             + ++  V+ +LGLF+     QP+ K  A    +V S AHQ +ALQA+++ IVLLKN++
Sbjct: 397 SRVKDVLRVKFKLGLFD-----QPYVKDAAASEKLVNSIAHQAVALQASKESIVLLKNNN 451

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCD 469
            +LPL +S    +A+IGPNA        +Y     +S   L+ ++N +  +   Y  GC+
Sbjct: 452 QILPLSRSLK-KIAVIGPNAADNDYAHTHYGPLQSKSTNILEGIRNKIGADKVWYAKGCE 510

Query: 470 TV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
            V               A + A I+ AV+ A  AD  ++++G +     E   R  L LP
Sbjct: 511 LVDKNWPESEIFPEDPDATAIALIEDAVNTAMKADVAIVVLGGNTKTAGENKSRTTLELP 570

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           G Q  LI  + +   KPV+ V++   P+ I +   D+ I  I++AGYPG  G +A+A+V+
Sbjct: 571 GFQLNLIKAIQKTG-KPVVAVMIGTQPMGINWI--DKYIDGIVYAGYPGVKGGIAVADVL 627

Query: 575 FGDHNPG 581
           FGD+NPG
Sbjct: 628 FGDYNPG 634


>gi|237712573|ref|ZP_04543054.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|345512524|ref|ZP_08792050.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|423239901|ref|ZP_17221016.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
           CL03T12C01]
 gi|229435409|gb|EEO45486.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|229453894|gb|EEO59615.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|392644890|gb|EIY38624.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
           CL03T12C01]
          Length = 788

 Score =  235 bits (600), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 178/601 (29%), Positives = 282/601 (46%), Gaps = 95/601 (15%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
           P+ +R +DL+S++TL+EK  Q+     +  + +  +P   W    W + +  +     G+
Sbjct: 49  PLEERVQDLLSQMTLEEKSCQMATLYGSGRVLKDALPQDNWKTEVWKDGIGNIDEEHNGL 108

Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
                                               F N  IRG     AT FP      
Sbjct: 109 GTFKSEYSFPYTKHVDAKHAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 168

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L  RIG+    EA AL       G T  ++P ++I +DPRWGR  ET GEDP +
Sbjct: 169 ATWNKELIARIGEVEAKEAVAL-------GYTNIYSPILDIAQDPRWGRCVETYGEDPYL 221

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            G+     +  +Q           K    A  KHF  Y +       + + D  V  +++
Sbjct: 222 VGELGKQMITSLQ-----------KHNLVATPKHFAVYSIPVGGRDGKTRTDPHVAPREM 270

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
              Y  PF    ++  A G+M +YN  +G P     + L++  R++WGF GY+ SD +AV
Sbjct: 271 RTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAV 330

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
             I      A + ED +   + AG+++         F+    K AV   K+ +  +D+ +
Sbjct: 331 EFISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFILPLRK-AVADGKISQETLDKRV 389

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
             +  V+  LGLF+ NP  +  GK    +V S  HQ ++L+AA+  +VLLKN   LLPL 
Sbjct: 390 AEILRVKFWLGLFD-NP-YRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNEMNLLPLS 447

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTV---- 471
           KS   S+A+IGPNA+    L+  Y   +    T  Q ++  + +T  +Y  GCD +    
Sbjct: 448 KSLR-SIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTEVIYRKGCDIIDPHF 506

Query: 472 -----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
                         +  +++A+  AK A+ VV+++G ++   +E+  R  L LPGRQ+EL
Sbjct: 507 PESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEEL 566

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  V  A  KPV+LVLL G    I +A    ++ +IL A +PGE    A+AE +FGD+NP
Sbjct: 567 LKAVC-ATGKPVVLVLLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEALFGDYNP 623

Query: 581 G 581
           G
Sbjct: 624 G 624


>gi|423229063|ref|ZP_17215468.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
           CL02T00C15]
 gi|423244903|ref|ZP_17225977.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
           CL02T12C06]
 gi|392634816|gb|EIY28728.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
           CL02T00C15]
 gi|392640944|gb|EIY34735.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
           CL02T12C06]
          Length = 788

 Score =  235 bits (600), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 176/600 (29%), Positives = 280/600 (46%), Gaps = 93/600 (15%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
           P+ +R +DL+S++TL+EK  Q+     +  + +  +P   W    W + +  +     G+
Sbjct: 49  PLEERVQDLLSQMTLEEKSCQMATLYGSGRVLKDALPQDNWKTEVWKDGIGNIDEEHNGL 108

Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
                                               F N  IRG     AT FP      
Sbjct: 109 GTFKSEYSFPYTKHVDAKHAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 168

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 183
           A+++  L  RIG+    EA AL           ++P ++I +DPRWGR  ET GEDP + 
Sbjct: 169 ATWNKELIARIGEVEAKEAVALEYT------NIYSPILDIAQDPRWGRCVETYGEDPYLV 222

Query: 184 GKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLA 243
           G+     +  +Q           K    A  KHF  Y +       + + D  V  +++ 
Sbjct: 223 GELGKQMITSLQ-----------KHNLVATPKHFAVYSIPVGGRDGKTRTDPHVAPREMR 271

Query: 244 DTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVS 303
             Y  PF    ++  A G+M +YN  +G P     + L++  R++WGF GY+ SD +AV 
Sbjct: 272 TLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAVE 331

Query: 304 IIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRALH 358
            I      A + ED +   + AG+++         F+    K AV   K+ +  +D+ + 
Sbjct: 332 FISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFILPLRK-AVADGKISQETLDKRVA 390

Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
            +  V+  LGLF+ NP  +  GK    +V S  HQ ++L+AA+  +VLLKN   LLPL K
Sbjct: 391 EILRVKFWLGLFD-NP-YRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNEMNLLPLSK 448

Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTV----- 471
           S   S+A+IGPNA+    L+  Y   +    T  Q ++  + +T  +Y  GCD +     
Sbjct: 449 SLR-SIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTEVIYRKGCDIIDPHFP 507

Query: 472 ----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
                        +  +++A+  AK A+ VV+++G ++   +E+  R  L LPGRQ+EL+
Sbjct: 508 ESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEELL 567

Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             V  A  KPV+LVLL G    I +A    ++ +IL A +PGE    A+AE +FGD+NPG
Sbjct: 568 KAVC-ATGKPVVLVLLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEALFGDYNPG 624


>gi|423248809|ref|ZP_17229825.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
           CL03T00C08]
 gi|423253758|ref|ZP_17234689.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
           CL03T12C07]
 gi|392655387|gb|EIY49030.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
           CL03T12C07]
 gi|392657750|gb|EIY51381.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
           CL03T00C08]
          Length = 805

 Score =  235 bits (600), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 176/539 (32%), Positives = 264/539 (48%), Gaps = 68/539 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  ++G+ I +
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GEDP + G    + VRG QG+T N
Sbjct: 188 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 241

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            GK        A  KHF +Y    W         A +  ++L +   PPF   V  G A 
Sbjct: 242 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 292

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P    R LL+   + +W F G++ SD  AV  + +  G A +  +A +
Sbjct: 293 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 351

Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+    +   AAVK+  +  + ID+A+  + S++ ++GLF+      PF
Sbjct: 352 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 406

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              K    +V S  H  LA + A+  IVLLKN   LLPL K    +LA+IGPNA++   +
Sbjct: 407 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 465

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P      +T L  ++  V      +Y  GC     S      A++ A+ AD VV
Sbjct: 466 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 525

Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
           ++MG     D + E EE                    DR  L L GRQ EL+  ++    
Sbjct: 526 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 584

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
           KPV+LVL+ G P+ +  A  +    +I+ A YPG  G  A+A+V+FGD+NP     ++V
Sbjct: 585 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLSV 641


>gi|340377403|ref|XP_003387219.1| PREDICTED: probable beta-D-xylosidase 2-like, partial [Amphimedon
           queenslandica]
          Length = 435

 Score =  235 bits (600), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 151/432 (34%), Positives = 219/432 (50%), Gaps = 44/432 (10%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEW 89
           N     FPF   +LPI +R +D+V RLTL + + Q+ +        AP IP L I  Y+W
Sbjct: 18  NSYVPEFPFRDPSLPIEERVKDIVDRLTLAQLVEQMAHGGAGNNGPAPGIPDLNIKPYQW 77

Query: 90  WSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARA----- 144
            +E L G  G G           ATSFP  I  AA+F+  L  ++  A   E RA     
Sbjct: 78  GTECLSGDVGAGD----------ATSFPMSIGMAATFNYDLLKQVSNATAYEVRAKNTAA 127

Query: 145 LYNAGQAI--GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGG 202
           + N   A   G++ W+P +NI RDPRWGR QET GEDP ++     ++V G+QGD     
Sbjct: 128 VLNGSYAFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSSYLGQAFVTGLQGDD---- 183

Query: 203 KLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGI 262
                + A+A CKHF  +         R+ FDA VTM+D   T+ P F++CV+ G  S +
Sbjct: 184 --PTYVIANAGCKHFDVHGGPEDTPVPRFSFDANVTMRDWRMTFLPQFKACVEAGALS-L 240

Query: 263 MCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDV 322
           MC+YNR+NG+P+CA++ LL+   R +W F GY+ SD  A+  I     YA     A  + 
Sbjct: 241 MCSYNRINGVPACANKKLLTDILRNEWNFKGYVVSDQGALENIVTQHHYAPDFVTAAAEA 300

Query: 323 LKAGMDVNCGSFLQK------HTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTM 376
             AG  +  G+   K      +   AV++  +    +  A+  LF VR +LG F+     
Sbjct: 301 ANAGTCLEDGNSEDKDGNVFENLDDAVEKGLVSVDTLKDAVSRLFYVRTKLGEFDPPDNN 360

Query: 377 QPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG------LLPLPKSKSVSLALIGPN 430
            P+  I   ++ S  H  +ALQAA + IVL+KN         LLPL  +K     ++GP 
Sbjct: 361 NPYANIPLSIIQSDTHIKIALQAAMESIVLMKNEFNGSPFLPLLPL-ANKFKRACVVGPF 419

Query: 431 ANSAKTLLGNYA 442
             +  T+ G+Y+
Sbjct: 420 IKNPDTMFGDYS 431


>gi|423258860|ref|ZP_17239783.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
           CL07T00C01]
 gi|423264169|ref|ZP_17243172.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
           CL07T12C05]
 gi|387776440|gb|EIK38540.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
           CL07T00C01]
 gi|392706435|gb|EIY99558.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
           CL07T12C05]
          Length = 805

 Score =  235 bits (600), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 176/539 (32%), Positives = 264/539 (48%), Gaps = 68/539 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  ++G+ I +
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GEDP + G    + VRG QG+T N
Sbjct: 188 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGAMGTALVRGFQGETLN 241

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            GK        A  KHF +Y    W         A +  ++L +   PPF   V  G A 
Sbjct: 242 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 292

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P    R LL+   + +W F G++ SD  AV  + +  G A +  +A +
Sbjct: 293 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 351

Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+    +   AAVK+  +  + ID+A+  + S++ ++GLF+      PF
Sbjct: 352 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 406

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              K    +V S  H  LA + A+  IVLLKN   LLPL K    +LA+IGPNA++   +
Sbjct: 407 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 465

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P      +T L  ++  V      +Y  GC     S      A++ A+ AD VV
Sbjct: 466 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 525

Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
           ++MG     D + E EE                    DR  L L GRQ EL+  ++    
Sbjct: 526 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 584

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
           KPV+LVL+ G P+ +  A  +    +I+ A YPG  G  A+A+V+FGD+NP     ++V
Sbjct: 585 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLSV 641


>gi|375357172|ref|YP_005109944.1| putative beta-glucosidase [Bacteroides fragilis 638R]
 gi|301161853|emb|CBW21397.1| putative beta-glucosidase [Bacteroides fragilis 638R]
          Length = 814

 Score =  235 bits (599), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 176/539 (32%), Positives = 264/539 (48%), Gaps = 68/539 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  ++G+ I +
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GEDP + G    + VRG QG+T N
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 250

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            GK        A  KHF +Y    W         A +  ++L +   PPF   V  G A 
Sbjct: 251 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 301

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P    R LL+   + +W F G++ SD  AV  + +  G A +  +A +
Sbjct: 302 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 360

Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+    +   AAVK+  +  + ID+A+  + S++ ++GLF+      PF
Sbjct: 361 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 415

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              K    +V S  H  LA + A+  IVLLKN   LLPL K    +LA+IGPNA++   +
Sbjct: 416 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 474

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P      +T L  ++  V      +Y  GC     S      A++ A+ AD VV
Sbjct: 475 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 534

Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
           ++MG     D + E EE                    DR  L L GRQ EL+  ++    
Sbjct: 535 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 593

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
           KPV+LVL+ G P+ +  A  +    +I+ A YPG  G  A+A+V+FGD+NP     ++V
Sbjct: 594 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLSV 650


>gi|423269263|ref|ZP_17248235.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
           CL05T00C42]
 gi|423273173|ref|ZP_17252120.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
           CL05T12C13]
 gi|392701685|gb|EIY94842.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
           CL05T00C42]
 gi|392708205|gb|EIZ01313.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
           CL05T12C13]
          Length = 805

 Score =  234 bits (598), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 176/539 (32%), Positives = 264/539 (48%), Gaps = 68/539 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  ++G+ I +
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GEDP + G    + VRG QG+T N
Sbjct: 188 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 241

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            GK        A  KHF +Y    W         A +  ++L +   PPF   V  G A 
Sbjct: 242 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 292

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P    R LL+   + +W F G++ SD  AV  + +  G A +  +A +
Sbjct: 293 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 351

Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+    +   AAVK+  +  + ID+A+  + S++ ++GLF+      PF
Sbjct: 352 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 406

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              K    +V S  H  LA + A+  IVLLKN   LLPL K    +LA+IGPNA++   +
Sbjct: 407 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 465

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P      +T L  ++  V      +Y  GC     S      A++ A+ AD VV
Sbjct: 466 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 525

Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
           ++MG     D + E EE                    DR  L L GRQ EL+  ++    
Sbjct: 526 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 584

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
           KPV+LVL+ G P+ +  A  +    +I+ A YPG  G  A+A+V+FGD+NP     ++V
Sbjct: 585 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLSV 641


>gi|383117091|ref|ZP_09937838.1| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
 gi|382973702|gb|EES87886.2| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
          Length = 805

 Score =  234 bits (598), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 176/539 (32%), Positives = 264/539 (48%), Gaps = 68/539 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  ++G+ I +
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GEDP + G    + VRG QG+T N
Sbjct: 188 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 241

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            GK        A  KHF +Y    W         A +  ++L +   PPF   V  G A 
Sbjct: 242 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 292

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P    R LL+   + +W F G++ SD  AV  + +  G A +  +A +
Sbjct: 293 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 351

Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+    +   AAVK+  +  + ID+A+  + S++ ++GLF+      PF
Sbjct: 352 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 406

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              K    +V S  H  LA + A+  IVLLKN   LLPL K    +LA+IGPNA++   +
Sbjct: 407 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 465

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P      +T L  ++  V      +Y  GC     S      A++ A+ AD VV
Sbjct: 466 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADTVV 525

Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
           ++MG     D + E EE                    DR  L L GRQ EL+  ++    
Sbjct: 526 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 584

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
           KPV+LVL+ G P+ +  A  +    +I+ A YPG  G  A+A+V+FGD+NP     ++V
Sbjct: 585 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLSV 641


>gi|336408356|ref|ZP_08588849.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
 gi|335937834|gb|EGM99730.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
          Length = 805

 Score =  234 bits (598), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 176/539 (32%), Positives = 264/539 (48%), Gaps = 68/539 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  ++G+ I +
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GEDP + G    + VRG QG+T N
Sbjct: 188 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 241

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            GK        A  KHF +Y    W         A +  ++L +   PPF   V  G A 
Sbjct: 242 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 292

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P    R LL+   + +W F G++ SD  AV  + +  G A +  +A +
Sbjct: 293 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 351

Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+    +   AAVK+  +  + ID+A+  + S++ ++GLF+      PF
Sbjct: 352 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 406

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              K    +V S  H  LA + A+  IVLLKN   LLPL K    +LA+IGPNA++   +
Sbjct: 407 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 465

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P      +T L  ++  V      +Y  GC     S      A++ A+ AD VV
Sbjct: 466 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 525

Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
           ++MG     D + E EE                    DR  L L GRQ EL+  ++    
Sbjct: 526 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 584

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
           KPV+LVL+ G P+ +  A  +    +I+ A YPG  G  A+A+V+FGD+NP     ++V
Sbjct: 585 KPVVLVLIKGRPLLMEGAIQEAE--AIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLSV 641


>gi|409198859|ref|ZP_11227522.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
           [Marinilabilia salmonicolor JCM 21150]
          Length = 777

 Score =  234 bits (598), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 168/530 (31%), Positives = 266/530 (50%), Gaps = 67/530 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT FP  I  A+++   L   +G+ IG 
Sbjct: 124 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGLASTWAPELIEEVGKVIGR 171

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R     G  IG   + P +++ RD RW R +ET GEDP++ G    + VRG+ GD   
Sbjct: 172 EVRL---QGGHIG---YGPVLDLVRDARWSRVEETYGEDPVLAGIMGAAMVRGMAGDG-- 223

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTM---QDLADTYQPPFESCVKQG 257
              L  K    A  KHF AY      GT     +  +T     D+   Y PPF+  +  G
Sbjct: 224 ---LHDKHGLIATLKHFLAY------GTPEAGQNGGITTIGPLDVMQNYLPPFKEAINAG 274

Query: 258 RASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPED 317
             S +M +YN ++GIPS ++  LL+   R++W F G++ SD  ++  +Y +   A+S ++
Sbjct: 275 ALS-VMTSYNSIDGIPSTSNHYLLTDVLRKKWNFDGFVVSDLASIEGLYGSHYVAQSVQE 333

Query: 318 AVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
           A    L AG+DV+ GS        +V   ++ ES ID A+  +  ++  +GLF+ NP + 
Sbjct: 334 AANMALSAGVDVDLGSNAFVKLVNSVNAGEVQESLIDTAVTRVLRLKFEMGLFD-NPYVD 392

Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
           P       +V S  H  LA + AQ+ +VLLKN H +LPL +     + ++GPNA++   +
Sbjct: 393 P--TKAEKMVRSAEHIALAREVAQNTLVLLKNDHEILPLKRDPG-KILVVGPNADNQYNM 449

Query: 438 LGNYAGP--SCRSITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVL 493
           LG+Y  P  S   +T L+ + N +  E   Y  GC     S+ +IDKAV+ A+ +D ++ 
Sbjct: 450 LGDYTAPQDSDNVVTILEGVLNKLSPEQVEYVKGCAIRDTSNVNIDKAVEAAEHSDVILA 509

Query: 494 MMG-----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKK 530
           ++G                       +      E  DR  L L G+Q +L+  + +   K
Sbjct: 510 VVGGSSARDFKTEYLETGAAVANNQSVSDMDAGEGFDRATLDLLGKQTDLLKALKKTG-K 568

Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           P+++V + G P+ + +A    N  +++ A YPG+ G  A+A+V+FGD+NP
Sbjct: 569 PLVVVYIQGRPMKMNWAA--ENADALICAWYPGQEGGNAIADVLFGDYNP 616


>gi|254445290|ref|ZP_05058766.1| Glycosyl hydrolase family 3 C terminal domain protein
           [Verrucomicrobiae bacterium DG1235]
 gi|198259598|gb|EDY83906.1| Glycosyl hydrolase family 3 C terminal domain protein
           [Verrucomicrobiae bacterium DG1235]
          Length = 730

 Score =  234 bits (598), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 179/569 (31%), Positives = 278/569 (48%), Gaps = 53/569 (9%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +PF    LP  +R  DL++ +TL+EK+  L+   P IPRL +  Y   SE  HGVA  G 
Sbjct: 27  YPFQDPDLPNEERIDDLITCMTLEEKV-DLMGFVPGIPRLDV-KYTRISEGYHGVAQGGP 84

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAG--QAIGMTFWAPN 160
             +        T FPQ    AA++D  L  R+      E R LY +   Q  G+   APN
Sbjct: 85  SNWGKRNPTPTTQFPQAYGLAATWDPALISRVSANQATEVRYLYQSPKYQRSGLVVMAPN 144

Query: 161 INIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAY 220
            ++ RDPRWGR +E  GEDP +TG  A ++  G+ GD          L+A++  KHF A 
Sbjct: 145 ADLARDPRWGRTEEVYGEDPFLTGTLAAAFASGLAGDH------PRYLKATSLLKHFLAN 198

Query: 221 DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNL 280
             ++ +  +   FD R+      + Y  PFE  ++ G A  +M AYN +NG P+     +
Sbjct: 199 SNEDDRFFSSSDFDERL----WREYYAKPFEMAIRDGGARSMMAAYNAINGTPAHV-HPM 253

Query: 281 LSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTK 340
           L      +WG  G I +D   ++ + +          A    +KAG+++    FL  HT+
Sbjct: 254 LRDIVMGEWGLDGTICTDGGGLAHLVNQHKTYPDLPTATAACIKAGINL----FLDNHTQ 309

Query: 341 A---AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH----Q 393
           A   AV+Q  + E+EID  +     + + LGL +  P + P+  IG +    P       
Sbjct: 310 AALDAVEQSLVTEAEIDDVIRGRIRLFLDLGLLDP-PELVPYSNIGHEPGLEPWELPETH 368

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
               +  +  IVLLKN + +LPL  SK  S+A++GP AN+  TLL  Y+G    +I P  
Sbjct: 369 AFVREVTRKSIVLLKNENNILPLDPSKINSVAIVGPLANT--TLLDWYSGTPPYAIPPRD 426

Query: 454 ALQNYVENTVYYP-----GCDTVACSSASIDKAVDIAKGADHVVLMMG------------ 496
            ++ Y  N+  +P     G + VA  S   D A+++A   D  ++++G            
Sbjct: 427 GIEGYA-NSGPFPSPAKFGSNWVADMS---DTALEVAASRDVAIVVVGNHPESNAGWGVV 482

Query: 497 LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSI 556
              ++ KE +DR +++L   Q+E I +V  AA    I+VL+   P  + +A    N  +I
Sbjct: 483 TSPSEGKEAVDRQEIILQPDQEEFIQKVY-AANPNTIVVLVSNFPYAMPWAA--ENAPAI 539

Query: 557 LWAGYPGEAGAVALAEVIFGDHNPGEFTV 585
           +   +  +    ALA+V+FGD+NPG  TV
Sbjct: 540 VHITHASQEQGNALADVLFGDYNPGGKTV 568


>gi|223936933|ref|ZP_03628842.1| Beta-glucosidase [bacterium Ellin514]
 gi|223894502|gb|EEF60954.1| Beta-glucosidase [bacterium Ellin514]
          Length = 774

 Score =  234 bits (597), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 179/535 (33%), Positives = 273/535 (51%), Gaps = 71/535 (13%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP   +  E LHG A            R  TSFPQ I   A+F+  L  ++      
Sbjct: 112 RLGIPVM-FHEECLHGHAA-----------RDGTSFPQPIGLGATFNPALVEKLYAMTAH 159

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R +    QA+      P +++ RD RWGR +ET GEDP +  +  ++ VRG QGD   
Sbjct: 160 ETR-VRGGHQAL-----TPVVDVARDARWGRVEETYGEDPFLNTQLGIAAVRGFQGD--- 210

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
               K K    A  KHF A+       + +      V+ + L +T+  PF  C+K+G A 
Sbjct: 211 -ASFKDKKHVIATLKHFAAHGQPE---SGQNCAPVNVSERLLRETFLHPFRDCLKKGGAI 266

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV---SIIYDAEGY--AKSP 315
            +M +YN ++G+PS A R LL    R++WGF G++ SD  A+   S   D+ G+  A   
Sbjct: 267 SVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSDYYAIWELSHRPDSHGHHVAADK 326

Query: 316 EDAVVDVLKAGMDV-----NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLF 370
           ++A V  +KAG+++     +C     +H    V++K L E+E+D  +  +   + ++GLF
Sbjct: 327 KEACVLAVKAGVNIEFPEPDC----YRHLVELVRKKVLHETELDELIAPMLLWKFKMGLF 382

Query: 371 NGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPN 430
           + +P + P  +  A VV    H+ LA +AA++ I LLKN + LLPL  +K  ++A+IGPN
Sbjct: 383 D-DPYVDP--EEAARVVGCEVHRELASEAARETITLLKNENDLLPLNPAKLKTVAVIGPN 439

Query: 431 ANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGC----------DTVACSSAS 477
           AN  ++LLG Y+G    ++T L  ++  +      V+  GC          D V  S  +
Sbjct: 440 AN--RSLLGGYSGVPAHNVTVLDGIKARLGGAVKVVHAEGCKITVGGSWQQDEVLASDPA 497

Query: 478 -----IDKAVDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAE 526
                ID+AV +A  AD V++ +G ++   +E        DR  L L G Q ELI R   
Sbjct: 498 EDRKQIDEAVKVAWSADVVIVAIGGNEQTSREAWSLKHMGDRTSLDLIGHQDELI-RALL 556

Query: 527 AAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           A  KPV+ ++  G P+ I      +N+ +IL   Y G+    A+A V+FGDHNPG
Sbjct: 557 ATGKPVVALVFNGRPLAINHVA--QNVPAILECWYLGQECGSAVAAVLFGDHNPG 609


>gi|229580225|ref|YP_002838625.1| glycoside hydrolase family protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|229581131|ref|YP_002839530.1| glycoside hydrolase family protein [Sulfolobus islandicus
           Y.N.15.51]
 gi|228010941|gb|ACP46703.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|228011847|gb|ACP47608.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           Y.N.15.51]
          Length = 754

 Score =  234 bits (597), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 177/569 (31%), Positives = 281/569 (49%), Gaps = 77/569 (13%)

Query: 50  LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
           L I Q  R   SRL L  K +++LVN          ++ IPA     E L G+ G     
Sbjct: 45  LGIGQITRVAGSRLGLKPKEVAKLVNKIQKFLVENTKMKIPAI-IHEECLSGLMGYS--- 100

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
                   +T+FPQ I  A++++  L   I   I  + R L    Q +     +P +++ 
Sbjct: 101 --------STAFPQAIGLASTWNPELVMDIASVIRSQGR-LVGVNQCL-----SPVLDVC 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           +DPRWGR +ET GEDP +     ++Y+ G+QGD           Q  A  KHF A+    
Sbjct: 147 KDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----------QLVATAKHFAAH---G 193

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           +    R      V  ++L +T+  PFE  VK G+   IM AY+ ++GIP   +  LL+  
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNI 253

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
            R++WGF G + SD D +  +      A +  +A +  L++G+D+   +     +    A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYGEPLVNA 313

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
           +K+  +PES IDRA+  +  ++ RLGL +      PF    +  + +     + LAL+ A
Sbjct: 314 LKEGLVPESLIDRAVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTA 368

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
           ++ IVLLKN + +LPL K+ +  +A+IGPNAN  + +LG+Y              +T LQ
Sbjct: 369 RESIVLLKNENNILPLSKNVN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427

Query: 454 ALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQE 502
            +   V     +Y  GCD  + S     +A++IA+ AD ++ +MG         +D   E
Sbjct: 428 GIVKKVGESKVLYAKGCDIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSE 487

Query: 503 KEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
           +E            DR  L LPG Q+EL+  + +   KP+ILVL+ G P  +  +     
Sbjct: 488 EEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTG-KPIILVLINGRP--LVLSSIINY 544

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           + +++ A +PGE G  A+A+VIFGD+NPG
Sbjct: 545 VKAVIEAWFPGEEGGNAIADVIFGDYNPG 573


>gi|423217451|ref|ZP_17203947.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
           CL03T12C61]
 gi|392628610|gb|EIY22636.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
           CL03T12C61]
          Length = 946

 Score =  234 bits (597), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 186/610 (30%), Positives = 291/610 (47%), Gaps = 101/610 (16%)

Query: 49  TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALH 95
           T PI  R  DL+S++TL+EK  Q+V       + +  +P  EW    W        E L+
Sbjct: 57  TAPIDARIEDLLSQMTLEEKTCQMVTLYGYKRVLKDDLPTSEWKNQLWKDGIGAIDEHLN 116

Query: 96  GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
           G    G                                GI   F N  IRG     AT+F
Sbjct: 117 GFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNF 176

Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
           P  +    +++  L +++G   G EAR L       G T  +AP +++ RD RWGR +E 
Sbjct: 177 PTQLGLGHTWNRQLIHQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEV 229

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
            GE P +  +  +  VRG+Q +           Q +A  KHF AY  +        + D 
Sbjct: 230 YGESPYLVAELGIEMVRGMQHNH----------QVAATGKHFIAYSNNKGAREGMARVDP 279

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
           +++ +++   +  PF+  +++    G+M +YN  +G P  +    L+   R + GF GY+
Sbjct: 280 QMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYV 339

Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
            SD DAV  +Y   G AK  ++AV   ++AG++V C +F    +     +  VK+  L E
Sbjct: 340 VSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSE 398

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKN 409
             I+  + ++  V+  +GLF+  P        GAD  V    ++ +ALQA+++ IVLLKN
Sbjct: 399 EVINDRVRDILRVKFLVGLFD-TPYQTDLK--GADEEVEKKENEEVALQASRESIVLLKN 455

Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYP 466
              +LPL  SK   +A+ GPNA+     L +Y   +    + L+ +Q  ++   + +Y  
Sbjct: 456 EKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADVLYTK 515

Query: 467 GCDTVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
           GCD V  +                  IDKAV  AK AD  ++++G  Q    E   R  L
Sbjct: 516 GCDLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENKSRSSL 575

Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
            LPGRQ +L+  V  A  KPV+LVL+ G P+ I +A  D+ + +IL A YPG  G +A+A
Sbjct: 576 DLPGRQLDLLKAVV-ATGKPVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGIAVA 632

Query: 572 EVIFGDHNPG 581
           +++FGD+NPG
Sbjct: 633 DILFGDYNPG 642


>gi|153809292|ref|ZP_01961960.1| hypothetical protein BACCAC_03604 [Bacteroides caccae ATCC 43185]
 gi|149128062|gb|EDM19283.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           caccae ATCC 43185]
          Length = 946

 Score =  234 bits (597), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 186/610 (30%), Positives = 291/610 (47%), Gaps = 101/610 (16%)

Query: 49  TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALH 95
           T PI  R  DL+S++TL+EK  Q+V       + +  +P  EW    W        E L+
Sbjct: 57  TAPIDARIEDLLSQMTLEEKTCQMVTLYGYKRVLKDDLPTSEWKNQLWKDGIGAIDEHLN 116

Query: 96  GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
           G    G                                GI   F N  IRG     AT+F
Sbjct: 117 GFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNF 176

Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
           P  +    +++  L +++G   G EAR L       G T  +AP +++ RD RWGR +E 
Sbjct: 177 PTQLGLGHTWNRQLIHQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEV 229

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
            GE P +  +  +  VRG+Q +           Q +A  KHF AY  +        + D 
Sbjct: 230 YGESPYLVAELGIEMVRGMQHNH----------QVAATGKHFIAYSNNKGAREGMARVDP 279

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
           +++ +++   +  PF+  +++    G+M +YN  +G P  +    L+   R + GF GY+
Sbjct: 280 QMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYV 339

Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
            SD DAV  +Y   G AK  ++AV   ++AG++V C +F    +     +  VK+  L E
Sbjct: 340 VSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSE 398

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKN 409
             I+  + ++  V+  +GLF+  P        GAD  V    ++ +ALQA+++ IVLLKN
Sbjct: 399 EVINDRVRDILRVKFLVGLFD-TPYQTDLK--GADEEVEKKENEEVALQASRESIVLLKN 455

Query: 410 SHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYP 466
              +LPL  SK   +A+ GPNA+     L +Y   +    + L+ +Q  ++   + +Y  
Sbjct: 456 EKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADVLYTK 515

Query: 467 GCDTVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
           GCD V  +                  IDKAV  AK AD  ++++G  Q    E   R  L
Sbjct: 516 GCDLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENKSRSSL 575

Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
            LPGRQ +L+  V  A  KPV+LVL+ G P+ I +A  D+ + +IL A YPG  G +A+A
Sbjct: 576 DLPGRQLDLLKAVV-ATGKPVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGIAVA 632

Query: 572 EVIFGDHNPG 581
           +++FGD+NPG
Sbjct: 633 DILFGDYNPG 642


>gi|383119099|ref|ZP_09939838.1| hypothetical protein BSHG_1822 [Bacteroides sp. 3_2_5]
 gi|251946311|gb|EES86688.1| hypothetical protein BSHG_1822 [Bacteroides sp. 3_2_5]
          Length = 859

 Score =  234 bits (597), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 184/604 (30%), Positives = 283/604 (46%), Gaps = 108/604 (17%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------ 71
            F +   +LP+  R +DL+SR+TL+EKI+Q                              
Sbjct: 23  NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82

Query: 72  --------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
                         L+N         PRLGIP +   +E+LHG               G+
Sbjct: 83  FIEGITLPGKECLTLMNEVQKYMREKPRLGIPVFTL-TESLHG-----------SVHDGS 130

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
           T FPQ I   ++F+  L Y +  AI  E  A     Q I  +   P I++ RD RWGR +
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSL-TPVIDVCRDLRWGRVE 184

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           E  GEDP +  +  VS VRG   +           Q S   KHF A+      GT +   
Sbjct: 185 ECFGEDPFLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GTPQGGL 227

Query: 234 D-ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           + A V+   ++L   Y   FE+ VK+ +   +M +YN  N  P+ +   L+++  R +W 
Sbjct: 228 NLASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWD 287

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
           F GY+ SD  A+ ++      A++  +A +  L AG+D           +  V+   L  
Sbjct: 288 FQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDV 347

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
             ID+A+  + + +  +GLF   P   P  K    VV +PAH  LA + A++ IVLL+N 
Sbjct: 348 KYIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNE 404

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNY--AGPSCRSITPLQALQNYVENTV---YY 465
           + +LPL  +K  S+A+IGPNA+  +   G+Y  +  +   +T L+AL+  V N +   Y 
Sbjct: 405 NNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKERVSNQLTLNYA 462

Query: 466 PGCDTVACSSASIDKAVDIAKGADHVVLMMG-------LDQTQEK--EELDRVDLVLPGR 516
            GCD V    +   +AVD+AK +D  ++++G        D +     E  D  DL L G 
Sbjct: 463 KGCDLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGV 522

Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           Q++L+  +  A  KPVI+VLL G P  +++ K   NI  I+   YPGE G +ALA+++ G
Sbjct: 523 QEDLVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLG 579

Query: 577 DHNP 580
             NP
Sbjct: 580 KVNP 583


>gi|261408260|ref|YP_003244501.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261284723|gb|ACX66694.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 763

 Score =  234 bits (597), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 169/530 (31%), Positives = 265/530 (50%), Gaps = 64/530 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP   +  E  HG   +G           AT FP  +   +++++ L+  I +A+  
Sbjct: 104 RLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTWNTELFRSISRAVAA 151

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E RA    G A     ++P +++ RDPRWGR +ET GEDP +  ++AV+ V+G+QG+   
Sbjct: 152 ETRA--QGGSAT----YSPVLDVVRDPRWGRTEETFGEDPHLVTEFAVAAVQGLQGE--- 202

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
             +L       A  KHF  Y         R      + +++L +    PF   V+ G  S
Sbjct: 203 --RLDSHTSLLATLKHFAGYGASE---GGRNGAPVHMGLRELHEVDLLPFRKAVEAGALS 257

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M AYN ++G+P  +   LL    R  WGF G++ +DC A+ ++      A S  +A  
Sbjct: 258 -VMTAYNEIDGVPCTSSGYLLQDVLREAWGFDGFVITDCGAIHMLACGHNTAGSGVEAAA 316

Query: 321 DVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             LKAG+D+   G+  + H   A++Q  + E +++RA   +  ++ RLGLF+  P + P 
Sbjct: 317 QSLKAGVDMEMSGTMFRAHLHQALEQGLITEEDLNRAAGRVLELKFRLGLFD-RPYVDP- 374

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
                 V+    H  LA QAA +GIVLLKN   LLPL  S S ++A+IGPNA++    LG
Sbjct: 375 -AWAEQVIGCKEHIALAYQAAAEGIVLLKNEGNLLPL-DSSSGTIAVIGPNAHAPYHQLG 432

Query: 440 NYAGPS--CRSITPLQALQNYVENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMM 495
           +Y  P    + +T L  ++  + ++  +Y PGC     S     +A+  A+ AD +V+++
Sbjct: 433 DYTSPQPPGQIVTVLDGIRRRLGDSRVLYAPGCRIQGDSREGFPRALACAEQADVIVMVL 492

Query: 496 GLDQTQE-------------------------KEELDRVDLVLPGRQQELITRVAEAAKK 530
           G    ++                          E +DR  L L G Q EL+  + +   K
Sbjct: 493 GGSSARDFGEGTIDLRTGASVVTGHAESDMECGEGIDRSTLTLMGVQLELLQELHKLG-K 551

Query: 531 PVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           PVI+V + G P  IT    D +I SI+ A YPG+ G  A+A+++FGD NP
Sbjct: 552 PVIVVYINGRP--ITEPWIDEHIPSIVEAWYPGQEGGSAIADMLFGDINP 599


>gi|319901343|ref|YP_004161071.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
 gi|319416374|gb|ADV43485.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
           P 36-108]
          Length = 781

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 173/532 (32%), Positives = 275/532 (51%), Gaps = 69/532 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP   +  E  HG   +G           AT FP  +  A+++D  L  ++G+AI L
Sbjct: 127 RLGIPVL-FAEECPHGHMAIG-----------ATVFPTALSAASTWDESLMQQMGEAIAL 174

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EAR     G  IG   + P +++ R+PRW R +ET GEDP++T    V+ ++G+QGD  N
Sbjct: 175 EARL---QGANIG---YGPVLDVAREPRWSRMEETFGEDPVLTSVMGVALMKGMQGDVQN 228

Query: 201 GGKLKGKLQASACCKHFTAYDL-DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
            GK        +  KHF AY + ++    +R    A   M+ L   Y PPF+  V+ G A
Sbjct: 229 DGK-----HLYSTLKHFAAYGVPESGHNGSR----ANSGMRQLFSEYLPPFKKAVEAG-A 278

Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
             IM +YN ++G+P  +++ LL++  R QWGF G++ SD  ++  I      AK  ++A 
Sbjct: 279 GTIMTSYNSIDGVPCTSNKFLLTEVLRNQWGFKGFVYSDLISIEGIVGMRA-AKDNKEAA 337

Query: 320 VDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP 378
              L+AG+D++  G    ++ K A ++  +   ++DRA+ N+  ++ ++GLF  NP + P
Sbjct: 338 AKALRAGLDMDLGGDAFGRNLKQAYEEGLITMDDLDRAVSNVLRLKFQMGLFE-NPYVSP 396

Query: 379 FGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLL 438
             +     + S  H+ LA + A++G+VLLKN  G+LPL K     +A+IGPNA+     L
Sbjct: 397 --EQAGKHIRSREHKELARRVAREGVVLLKND-GVLPLDKHLK-RIAVIGPNADMMYNQL 452

Query: 439 GNYAGPSCRS--ITPLQALQNYVENT---VYYPGCDTVACSSASIDKAVDIAKGADHVVL 493
           G+Y  P  R   +T L  ++  V  T   VY  GC     + + I  AV  A+ AD V+L
Sbjct: 453 GDYTAPQDRKEIVTVLDGVRAAVSKTTQVVYVKGCAVRDTTESDIPAAVAAAQRADAVIL 512

Query: 494 MMGLDQTQE-------------------------KEELDRVDLVLPGRQQELITRVAEAA 528
           ++G    ++                          E  DR  L L G Q++LI  VA A 
Sbjct: 513 VVGGSSARDFKTKYISTGAATVSEDIKVLPDMDCGEGFDRSSLRLLGDQEKLINAVA-AT 571

Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            KP++++ + G  +++  A       ++L A YPGE G   +A+++FGD+NP
Sbjct: 572 GKPLVVIYIAGRAMNMNLAA--DKARALLAAWYPGEQGGAGIADILFGDYNP 621


>gi|305663349|ref|YP_003859637.1| glycoside hydrolase family protein [Ignisphaera aggregans DSM
           17230]
 gi|304377918|gb|ADM27757.1| glycoside hydrolase family 3 domain protein [Ignisphaera aggregans
           DSM 17230]
          Length = 757

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 181/598 (30%), Positives = 289/598 (48%), Gaps = 90/598 (15%)

Query: 50  LPISQRARDLVSRLTLDEKISQLVN----------------------------------S 75
           + I +R R+L+ R++++EKI+QL++                                  S
Sbjct: 1   MDIDERVRELIGRMSIEEKIAQLISIPLESVLDGKKFSVEKAREVLKYGVGEILRIGGSS 60

Query: 76  APAIPRLGIPAYEWWSEALHGVAGVG-KGIFFNGTIRG-----ATSFPQVILTAASFDSY 129
           A   PR  +  Y      L     +G   I    +I G     AT FP  +  A+++D  
Sbjct: 61  ARLSPREAVEIYNAIQRFLTRETRLGIPAIVHEESIAGLLAPTATVFPIPLALASTWDPD 120

Query: 130 LWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVS 189
           L YR+  AI  +  A+ +          AP +++ R+PRWGR +ET GED  +     ++
Sbjct: 121 LVYRVAVAIRRQIMAIGSRHT------LAPVLDLCREPRWGRCEETYGEDSYLAASMGIA 174

Query: 190 YVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPP 249
           YV+G+QGD    G +       A  KHF  + +       R      V +++L + Y  P
Sbjct: 175 YVKGIQGDDIRYGVI-------ATGKHFVGHGVPE---GGRNIASIHVGLRELLEIYMYP 224

Query: 250 FESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAE 309
           FE+ VK+     IM AY+ ++ +P  A++ LL+   R  WGF G   SD + V  ++   
Sbjct: 225 FEATVKEANLLSIMPAYHDIDNVPCHANKWLLTDILRGSWGFKGIAVSDYEGVKQLHTIH 284

Query: 310 GYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLG 368
             A+   +A V  +KAG+D+   S    K    AV++  + E  I+RA+  +  ++  LG
Sbjct: 285 RVARDCMEAAVKAIKAGVDIEYPSGECFKQLVEAVRKGLIDEDTINRAVERVLKLKFMLG 344

Query: 369 LFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIG 428
           LF  NP +    K+    + + A + LA + A+  IVLLKN  G+LPL K    ++A+IG
Sbjct: 345 LFE-NPFIDE-TKV-PTTLDNEADRELAREVARKAIVLLKND-GILPL-KRDIKTIAVIG 399

Query: 429 PNANSAKTLLGNY---------------AGPSCRSITPLQALQNYVE---NTVYYPGCDT 470
           PNAN    +LG+Y                 PS R +T L+A+++ V      +Y  GCDT
Sbjct: 400 PNANDPWAMLGDYHYDAHIGSFDGTYGKISPSVRIVTVLEAIKSRVSPSTEVLYAKGCDT 459

Query: 471 VACSSASIDKAVDIAKGADHVVLMMG-------LDQTQEKEELDRVDLVLPGRQQELITR 523
           +    +   +A++IAK AD ++ +MG       L      E +DR  L LPG Q+EL+  
Sbjct: 460 IGDDRSGFGEAIEIAKRADIIIAVMGDRSGLFNLKMFTSGEGVDRASLKLPGVQEELLKE 519

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +A    KP+ILVL+ G P  +  +     + +I+ A  PGE G  A+A+++FGD++PG
Sbjct: 520 LASLG-KPIILVLINGRP--LALSSILPYVNAIVEAWRPGEEGGNAIADILFGDYSPG 574


>gi|380694149|ref|ZP_09859008.1| glycoside hydrolase 3 [Bacteroides faecis MAJ27]
          Length = 946

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 195/667 (29%), Positives = 309/667 (46%), Gaps = 121/667 (18%)

Query: 8   LVFPLLCLCFTSLLTRVDSTQPPFSCDPSNPSTETFP--------------FCKTTLPIS 53
           L+F  L L  ++LL+   +T+ P    P    TE +               +   T  I 
Sbjct: 4   LLFTALFLGSSTLLSAQKATKTPAVYKPVR--TEMYKKGWIDLNKNGIKDVYEDPTATID 61

Query: 54  QRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALHGVAGV 100
            R  DL+S++TL+EK  Q+V       + +  +P  EW    W        E L+G    
Sbjct: 62  ARIEDLLSQMTLEEKTCQMVTLYGYKRVLKDDLPTPEWKNQLWKDGIGAIDEHLNGFQQW 121

Query: 101 GK-------------------------------GI---FFNGTIRG-----ATSFPQVIL 121
           G                                GI   F N  IRG     AT+FP  + 
Sbjct: 122 GLPPSDNENIWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNFPTQLG 181

Query: 122 TAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDP 180
              +++  L +++G   G EAR L       G T  +AP +++ RD RWGR +E  GE P
Sbjct: 182 LGHTWNRRLIHQVGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYGESP 234

Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQ 240
            +  +  +  VRG+Q +           Q +A  KHF AY  +        + D +++ +
Sbjct: 235 YLVAELGIEMVRGMQHNH----------QIAATGKHFIAYSNNKGAREGMARVDPQMSPR 284

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           ++  T+  PF+  +++    G+M +YN  +G P  +    L+   R + GF GY+ SD D
Sbjct: 285 EVEMTHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDSD 344

Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPESEIDR 355
           AV  +Y   G AK  ++AV   ++AG++V C +F    +     +  VK+  L E  I+ 
Sbjct: 345 AVEYLYTKHGTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEEVIND 403

Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFG---KIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
            + ++  V+  +GLF+      P+    K   + V   A++ +ALQA+++ IVLLKN   
Sbjct: 404 RVRDILRVKFLVGLFD-----HPYQIDLKGADEEVEKAANEEIALQASRESIVLLKNDKN 458

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCD 469
           +LPL  S    +A+ GPNA+     L +Y   +    + L+ +Q  ++     +Y  GCD
Sbjct: 459 ILPLDASGIQKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKGKAEVLYTKGCD 518

Query: 470 TVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLP 514
            V  +                  I+KAVD  K AD  V+++G  Q    E   R  L LP
Sbjct: 519 LVDANWPESELIDYPLTDEEQKEIEKAVDQTKQADVAVVVLGGGQRTCGENKSRSSLDLP 578

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           GRQ +L+  VA A  KPV+LVL+ G P+ I +A  D+ + +I+ A YPG  G  A+A+V+
Sbjct: 579 GRQLDLLKAVA-ATGKPVVLVLINGRPLSINWA--DKFVPAIVEAWYPGSKGGKAVADVL 635

Query: 575 FGDHNPG 581
           FG++NPG
Sbjct: 636 FGEYNPG 642


>gi|146295632|ref|YP_001179403.1| glycoside hydrolase family protein [Caldicellulosiruptor
           saccharolyticus DSM 8903]
 gi|145409208|gb|ABP66212.1| glycoside hydrolase, family 3 domain protein [Caldicellulosiruptor
           saccharolyticus DSM 8903]
          Length = 770

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 179/543 (32%), Positives = 274/543 (50%), Gaps = 84/543 (15%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIPA     E+  G              +GAT FPQ I  A +FD+ +   + + I  
Sbjct: 83  RLGIPAM-IHEESCSGFMA-----------KGATVFPQSIGVACTFDNEIVEELAKVIRT 130

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           + +A+  A QA+     AP I++ RD RWGR +ET GEDP +    AVSYV+G+QGD   
Sbjct: 131 QMKAV-GAHQAL-----APLIDVARDARWGRVEETFGEDPYLVANMAVSYVKGLQGDDIK 184

Query: 201 GGKLKGKLQASACCKHFTAYDLD----NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQ 256
            G +       A  KHF  Y +     NW           +  ++L + Y  PFE  VK 
Sbjct: 185 KGIV-------ATGKHFVGYAMSEGGMNWA-------PVHIPERELREVYLYPFEVAVKV 230

Query: 257 GRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPE 316
                IM AY+ ++GIP  A+R LL+  AR +WGF G   SD   V  + D     K+ E
Sbjct: 231 AGLKSIMPAYHEIDGIPCHANRKLLTDIARNEWGFDGIYVSDYSGVRNLLDYHKSVKTYE 290

Query: 317 DAVVDVLKAGMDVNCGS---FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGN 373
           +A    L AG+D+       F ++  K A+K+ K   + +D A+  +  ++ RLGLF+ N
Sbjct: 291 EAAALSLWAGLDIELPKIECFTEEFIK-ALKEGKFDMTLVDAAVKRVLEMKFRLGLFD-N 348

Query: 374 PTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
           P ++  G +  ++  +   + L+ + AQ+ +VLLKN +  LPL K  S  +A+IGPNANS
Sbjct: 349 PYIKTDGIV--ELFDNKEQRQLSRRVAQESMVLLKNDN-FLPLSKD-SKKIAVIGPNANS 404

Query: 434 AKTLLGNYAGPS----------------------CRSITPLQALQNYVEN-------TVY 464
            + LLG+Y+ P+                       R++  ++++   +++        VY
Sbjct: 405 VRNLLGDYSYPAHIATLEMFFIKEDKGVGNEEEFVRNVINMKSIFEAIKDKVSSNTEVVY 464

Query: 465 YPGCDTVACSSASIDKAVDIAKGADHVVLMMG------LDQTQEKEELDRVDLVLPGRQQ 518
             GCD  +   +  ++A   A  AD V+L++G      LD T   E  DR  L LPG Q+
Sbjct: 465 AKGCDVNSQDRSGFEEAKKAAADADAVILVVGDKAGLRLDCTS-GESRDRASLRLPGVQE 523

Query: 519 ELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDH 578
           +L+  +       V+ VL+ G PV + +     N+ ++L A +PGE GA A+A+V+FGD+
Sbjct: 524 DLVKEIVSVNPNTVV-VLVNGRPVALDWIM--ENVKAVLEAWFPGEEGANAVADVLFGDY 580

Query: 579 NPG 581
           NPG
Sbjct: 581 NPG 583


>gi|431797765|ref|YP_007224669.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
 gi|430788530|gb|AGA78659.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
          Length = 799

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 170/543 (31%), Positives = 273/543 (50%), Gaps = 71/543 (13%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP      E  HG   +G            T FP  I  A++++  L   +  AI L
Sbjct: 137 RLGIPMM-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPALIQEMAAAIAL 184

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EAR     G  IG   + P +++ R+PRW R +ET GEDP +  +   + V G QG++  
Sbjct: 185 EARL---QGGHIG---YGPVLDLAREPRWSRVEETYGEDPYINSQMGRAMVSGFQGESIA 238

Query: 201 GGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGR 258
            GK        +  KHFTAY +      GT+       V  ++L ++Y PPF++ V +G 
Sbjct: 239 SGK-----NVISTLKHFTAYGVPEGGHNGTS-----VSVGQRELHESYLPPFKAAVAEGA 288

Query: 259 ASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDA 318
            S +M AYN ++G+P  ++ +LL+   R  WGF+G++ SD  ++S +  +    ++ E A
Sbjct: 289 LS-VMTAYNSIDGVPCTSNGHLLNDVLRDDWGFNGFVVSDLGSISGLRGSHHVTETAEGA 347

Query: 319 VVDVLKAGMDVNCGSF-LQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQ 377
               + AG+D + G +   K+  AAV+   + +  +D A+  +  V+  +GLF  NP + 
Sbjct: 348 AQLAINAGVDSDLGGYGFGKNLLAAVQAGGVSQEVLDEAVRRVLKVKFDMGLFE-NPYVD 406

Query: 378 PFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
           P       +V S  H  LA + A++ +VLLKN + LLPL K K  S+A+IGPNA++    
Sbjct: 407 P--SKAESLVRSAKHIALARKVARESVVLLKNENDLLPLRK-KVNSIAVIGPNADNTYNQ 463

Query: 438 LGNYAGPSCRS--ITPLQALQNYVENTV---YYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P      +T L+ ++N V   V   Y  GC     + + I KA  +A  +D  V
Sbjct: 464 LGDYTAPQPNENVVTVLEGIKNKVGKDVRVNYVKGCAIRDTTQSEIGKAASLAARSDVAV 523

Query: 493 LMMG---------------------------LDQTQEKEELDRVDLVLPGRQQELITRVA 525
           +++G                           +   +  E  DR+ L L G Q +L+  V 
Sbjct: 524 VVLGGSSARDFDTEYEETAAAKVSEAEEGQVISDMESGEGFDRMTLDLLGDQLKLVQAV- 582

Query: 526 EAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTV 585
           +A   PV++VL+ G P+++ +   D ++ +I+ A YPG+ G  A+A+V+FGD+NP     
Sbjct: 583 QATGTPVVVVLIKGRPLNLNW--IDEHVPAIVDAWYPGQEGGNAIADVLFGDYNPSGRLT 640

Query: 586 IAV 588
           I+V
Sbjct: 641 ISV 643


>gi|293371041|ref|ZP_06617583.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292633971|gb|EFF52518.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 791

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 166/528 (31%), Positives = 265/528 (50%), Gaps = 61/528 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G            T FP  I  AA++   L   +GQ I  
Sbjct: 138 RLGIPMF-LAEEAPHGHMAIG-----------ITVFPTGIGMAATWSPELVKEVGQVIAK 185

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R+        G   + P +++ RDPRW R +ET GEDP+++G    + V G+      
Sbjct: 186 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGAAMVDGLIN---- 235

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            G +  K    A  KHF AY +            A V M++L + + PPF+  +  G A 
Sbjct: 236 -GNISRKNSTIATLKHFLAYAVPEGGQNGN---QALVGMRELHENFLPPFKKAIDAG-AL 290

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++GIP  A+  LL++  R +W F G++ SD  ++  IY++   A S EDA +
Sbjct: 291 SVMTSYNSIDGIPCTANSYLLNQLLRNEWKFRGFVVSDLYSIEGIYESHYTASSIEDAAI 350

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             + AG+DV+ G     +   AVK+K+L E+ ID  +  +  ++  +GLF  NP + P  
Sbjct: 351 QAVSAGVDVDLGGEAYTNIYRAVKEKRLSEAIIDEVVCRVLRLKFEMGLFE-NPYVDP-- 407

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           +I  + V +  H   A + AQ  + LLKN H +LPL K+    +A+IGPNA++   +LG+
Sbjct: 408 QIAIERVRNANHIANARRMAQASVTLLKNRHDILPLSKNIR-KVAVIGPNADNCYNMLGD 466

Query: 441 YAGP----SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P    + +++      +  +    Y  GC     ++  I KAV+ A  AD V+ ++G
Sbjct: 467 YTAPQKDENIKTVLDGIISKLSLSRVEYVRGCAIRDTTNNEIAKAVEAANRADVVIAVVG 526

Query: 497 ------------------LDQTQEK-----EELDRVDLVLPGRQQELITRVAEAAKKPVI 533
                              D++Q       E  DR  L L G+Q EL+  + ++ +KP+I
Sbjct: 527 GSSARDFKTTYKETGAAIADKSQISDMECGEGFDRATLSLLGKQLELLESL-KSTRKPLI 585

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +V + G P++  +A    +  ++L A YPG+ G  A+A+V+FGD+NP 
Sbjct: 586 VVYIEGRPLNKNWAA--EHADALLTAYYPGQEGGDAIADVLFGDYNPA 631


>gi|379721503|ref|YP_005313634.1| beta-glucosidase [Paenibacillus mucilaginosus 3016]
 gi|378570175|gb|AFC30485.1| beta-glucosidase [Paenibacillus mucilaginosus 3016]
          Length = 951

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 151/426 (35%), Positives = 224/426 (52%), Gaps = 32/426 (7%)

Query: 59  LVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQ 118
           ++++LTLDEKI  +     AI RLGI AY+  +EA HG+A +G+          ATS+PQ
Sbjct: 1   MIAQLTLDEKIELMPQYQIAIERLGIQAYKHGTEAAHGMAWLGE----------ATSYPQ 50

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAPNINIFRDPRWGRGQETPG 177
            I  A ++DS L  RIG AIG EAR  Y       G+T WAP +++ RDPRWGR +E  G
Sbjct: 51  PIGLACTWDSELLKRIGSAIGDEARGFYKQNPNFNGLTLWAPTVDLERDPRWGRTEEAYG 110

Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
           EDP++ GK A +  +G+QGD          L+A A  KHF   + +  +G T    D R 
Sbjct: 111 EDPVLAGKLAAALTQGIQGDH------PFYLKAVATLKHFIGNNNEAGRGDTSVSLDPR- 163

Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
              +L + Y   FE   K+G A  +M AYN VNG+P+     L+    + +WG +G++ S
Sbjct: 164 ---NLREYYLKVFEITFKEGGAQSMMTAYNAVNGVPANLS-ELVIDIVKGEWGMNGFVVS 219

Query: 298 DCDAVSIIYDAEGYAKSPEDAVV-DVLKAGMD--VNCGSFLQKHTKAAVKQKKLPESEID 354
           D   V+       Y ++ ++AV   + + G+D   +  + + +    A+K   L E+++D
Sbjct: 220 DAFDVTGTMRDHHYVETLKEAVARSIREGGIDSITDDAAVVTEAIHEALKDGLLTENDLD 279

Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
            AL N+F VR RLG F+  P   P+  I    +  P H  LA +A+Q  +VLLKN    L
Sbjct: 280 TALRNMFRVRFRLGEFD-PPERNPYAAIDESAILHPEHAKLAREASQKAVVLLKNDGKTL 338

Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVACS 474
           PL   K   +A+IGP A+        Y+G    ++TPLQ ++  +       G D V   
Sbjct: 339 PLQADKLSKVAVIGPLADVVYQDW--YSGSLPYAVTPLQGIRERLAA----EGKDAVTSY 392

Query: 475 SASIDK 480
           +A  D+
Sbjct: 393 AAGTDR 398



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 55/111 (49%), Gaps = 4/111 (3%)

Query: 471 VACSSASIDKAVDIAKGADHVVLMMGLDQ-TQEKEELDRVDLVLPGRQQELITRVAEAAK 529
           + C S+ +D AV  A+GAD  ++ +G       KE +DR D+ L   Q++L   V E   
Sbjct: 544 IECVSSGLDAAVASARGADAAIVFVGNHPLINGKETMDRPDITLAESQEKLALAVIEKNP 603

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             +I+V+   G              +I++  + G+    A+A+V+FG+ NP
Sbjct: 604 NTIIVVV---GSYPFALNALQEKAKAIIYTSHAGQELGRAVADVLFGNVNP 651


>gi|255693560|ref|ZP_05417235.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260620625|gb|EEX43496.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 770

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 167/493 (33%), Positives = 255/493 (51%), Gaps = 39/493 (7%)

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
            I  NG  RG+T +P  I  AA+FD  L  R+ +   +E RA       +  TF  PN+ 
Sbjct: 136 AIHGNGLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA-----SGMHWTF-TPNVE 189

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT--AY 220
           + RD RWGR  ET GEDP + G+   + VRG Q   F G       +  AC KH    + 
Sbjct: 190 VARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGND-----KVIACAKHLVGGSQ 244

Query: 221 DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNL 280
             +   G       A ++ + L + + PPF+ C++ G  + +M A+N +NGIP   ++ L
Sbjct: 245 PANGINGAP-----AELSERTLQEVFFPPFKDCLEAGVFT-VMTAHNELNGIPCHGNKYL 298

Query: 281 LSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHT 339
           +++  R QW F G++ SD   +  ++D    A++ +DA    + AGM ++  G    +  
Sbjct: 299 MTEVLRNQWKFDGFVVSDWMDIERMHDYHNVAETLKDAYQISVDAGMGMHMHGPEFYEAI 358

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
              VK+  +PE +ID A+  +  V+ RLGLF  NP +    K   ++V +  HQ  AL+ 
Sbjct: 359 IECVKEGSIPEKQIDAAVSKILEVKFRLGLFE-NPFIDL--KKKDEIVFNEKHQQTALEG 415

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQN 457
           A+  IVLLKN   +LPL  SK   + + G NAN+ +++LG++A   P     T L+ L+ 
Sbjct: 416 ARKSIVLLKNEGNMLPLDASKYKKVFVTGHNANN-QSILGDWAMEQPEEHVTTVLKGLKA 474

Query: 458 YVENTVYY---PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQ----EK---EELD 507
               T Y     G +    S   I +AV  A+ +D  +L++G +  +    EK   E  D
Sbjct: 475 ISPETNYNFLDLGWNVRLLSDNQIKEAVQQARSSDLAILVVGENSMRYHWNEKTCGENSD 534

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R +L LPGRQQEL+  VA A   P +++L+ G P  +T    D N+  I+ A  PG AG 
Sbjct: 535 RYELSLPGRQQELVEAVA-ATGVPTVVILVNGRP--LTTEWIDENMPCIIEAWEPGVAGG 591

Query: 568 VALAEVIFGDHNP 580
            ALAE+++G  NP
Sbjct: 592 QALAEILYGKVNP 604


>gi|333380553|ref|ZP_08472244.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826548|gb|EGJ99377.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 957

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 182/558 (32%), Positives = 276/558 (49%), Gaps = 63/558 (11%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNS--APAIPRLGIPAYEWWSEALHGVAGVGK 102
           +    LP+  R  DL+S +T+++K+  L      P IP LG+PA     EA+HG      
Sbjct: 170 YMNPNLPLESRVEDLLSVMTVEDKMELLREGWGIPGIPHLGVPAIHK-VEAIHG------ 222

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
             F  G+  GAT FPQ I   A+++  L      AIG E  +  NA QA     W+P ++
Sbjct: 223 --FSYGS--GATIFPQSIGMGATWNKRLIEAAAMAIGDETVSA-NAVQA-----WSPVLD 272

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
           + +D RWGR +ET GEDP++  +   ++++G Q         KG +      KHF A+  
Sbjct: 273 VAQDARWGRCEETYGEDPVLVTEIGGAWIKGYQS--------KGLMTTP---KHFAAH-- 319

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
                  R   D  ++ +++ + +  PF    K+ +   IM +Y+   G+P    + LL 
Sbjct: 320 -GAPLGGRDSHDIGLSEREMREIHLVPFRDIYKKYKYQSIMMSYSDFLGVPVAKSKELLK 378

Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGY-AKSPEDAVVDVLKAGMDVNCG-SFLQKHTK 340
              R +WGF G+I SDC A+  +   + Y A    +A    L AG+  NCG ++      
Sbjct: 379 GILRDEWGFDGFIVSDCGAIGNLTARKHYTAVDKVEAARQALAAGIATNCGDTYNDPDVI 438

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQP--FGKI--GADVVCSPAHQVLA 396
           AA K+ +L   ++D     L     R GLF  NP  +P  + KI  G +   SP HQ LA
Sbjct: 439 AAAKRGELNMDDLDFTCKTLLRTLFRNGLFENNPC-KPLDWNKIYPGWN---SPEHQALA 494

Query: 397 LQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-----GPSCRSITP 451
            + AQ+ IVLL+N   +LPL KS   ++A+IGP A++ +   G+Y      G     +T 
Sbjct: 495 RKTAQESIVLLENKGNILPLSKSLK-TIAVIGPGADNLQP--GDYTSKPQPGQLKSVLTG 551

Query: 452 LQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK-------- 503
           ++A  N     +Y  GC  +      I KAV  A+ AD  VL++G   T E         
Sbjct: 552 IKAAVNSSTKVLYEEGCRFIGTEGTDIAKAVKAAENADVAVLVLGDCSTSEALKGITNTS 611

Query: 504 -EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
            E  D   L+LPG QQ+L+  V +   KPV+L+L  G P ++++A    N  ++L    P
Sbjct: 612 GENHDLATLILPGEQQKLLEAVCKTG-KPVVLILQAGRPYNLSYAA--ENCQAVLVNWLP 668

Query: 563 GEAGAVALAEVIFGDHNP 580
           G+ G  A A+V+FGD+NP
Sbjct: 669 GQEGGYATADVLFGDYNP 686


>gi|116621797|ref|YP_823953.1| glycoside hydrolase family protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116224959|gb|ABJ83668.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 765

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 179/533 (33%), Positives = 269/533 (50%), Gaps = 67/533 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP   +  E LHG A +G            TSFPQ I   A+FD  L   +      
Sbjct: 104 RLGIPVI-FHEECLHGHAAIG-----------GTSFPQPIGLGATFDPELVESLFAMTAA 151

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EARA     QA+      P +++ R+PRWGR +ET GEDP +  +  ++ VRG QGD   
Sbjct: 152 EARA-RGTHQAL-----TPVVDVAREPRWGRVEETYGEDPFLVSRMGIAAVRGFQGD--- 202

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
               + K +  A  KHF A+      GT     +  V+M+ L +T+  PF+  + +G A 
Sbjct: 203 -ATFRDKTRVIATLKHFAAHGQPE-SGTNCAPVN--VSMRVLRETFLFPFKEALDKGCAI 258

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV-SIIYDAEGY----AKSP 315
            +M +YN ++G+PS A R LL    R++WGF G++ SD  A+  + Y  E +    AK  
Sbjct: 259 SVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSDYYAIYELSYRPESHGHFVAKDK 318

Query: 316 EDAVVDVLKAGMDVNCGS---FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNG 372
            +A    ++AG+++       +L  H    V +  L ES++D  +  +   + ++GLF+ 
Sbjct: 319 REACALAVQAGVNIELPEPDCYL--HLVDLVHKGVLQESQLDELVEPMLRWKFQMGLFD- 375

Query: 373 NPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN 432
           +P + P       +    AH+ LA+QAA++ I LLKN   ++PL  S   ++A+IGPNAN
Sbjct: 376 DPYVDP--AEAERIAGCDAHRELAMQAARETITLLKNDGPVVPLDLSAIKTIAVIGPNAN 433

Query: 433 SAKTLLGNYAGPSCRSITPLQALQNYV---ENTVYYPGC----------DTVACSSASID 479
             ++LLG Y+G     +T L  ++  V      VY  GC          D V  S  + D
Sbjct: 434 --RSLLGGYSGVPKHDVTVLDGIRERVGSRAKVVYAEGCKITIGGSWVQDEVTPSDPAED 491

Query: 480 K-----AVDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAEAA 528
           +     AV +AK AD +VL +G ++   +E        DR  L L GRQ+EL+ R   A 
Sbjct: 492 RRQIAEAVKVAKRADVIVLAIGGNEQTSREAWSPKHLGDRPSLDLVGRQEELV-RAMVAT 550

Query: 529 KKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            KPVI  L  G P+ I +    +++ +I    Y G+    A+AEV+FGD NPG
Sbjct: 551 GKPVIAFLFNGRPISINYLA--QSVPAIFECWYLGQETGRAVAEVLFGDTNPG 601


>gi|296270377|ref|YP_003653009.1| family 3 glycoside hydrolase domain-containing protein
           [Thermobispora bispora DSM 43833]
 gi|296093164|gb|ADG89116.1| glycoside hydrolase family 3 domain protein [Thermobispora bispora
           DSM 43833]
          Length = 988

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 167/481 (34%), Positives = 245/481 (50%), Gaps = 36/481 (7%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
            PF   TLP++QR  DL+ RLTLDEKIS L    P I RLGI +++  +EALHGVA    
Sbjct: 39  LPFRDPTLPLAQRIDDLLGRLTLDEKISLLHQYQPPIERLGIKSFKTGTEALHGVAW-ST 97

Query: 103 GIFFNGTIRGA--TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAP 159
            +  NG +  A  T FPQ +  A+++D  L  R+G A+G EAR  +     + G+  WAP
Sbjct: 98  DVTDNGAVVTANGTVFPQAVGLASTWDPELIKRVGTAVGEEARGFHAQNPVVWGLNLWAP 157

Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
            +N+ RDPRWGR +E   EDPL+TG  A++Y  G+QGD  +       L+A+   KH+ A
Sbjct: 158 VVNLLRDPRWGRNEEGYSEDPLLTGAIAIAYGSGIQGDDPD------HLRAAPTLKHYLA 211

Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
            + +  + TT      RV  +     Y+ PF + +  G A+G+M AYN VNG P+  + +
Sbjct: 212 NNNEIRRDTTSSNLPPRVKHE----YYEAPFRAAITAGAATGVMTAYNLVNGRPATVNPD 267

Query: 280 LLSKTARRQWGFHGYIT-SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
           L      R W     +  +D  A + +  ++ Y  +  +A    LKAG+D    SF    
Sbjct: 268 LNDTV--RTWTDRDLLNVTDAGAPNNLVGSQAYFATLAEADAAALKAGID----SFTTDE 321

Query: 339 T---------KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           T         K A+ Q  L E +ID A+ ++  +R RLG F  +P   P+ KI  DV+ S
Sbjct: 322 TNSAPTITAIKTALSQGLLTEQDIDTAVRHILGIRFRLGEF--DPDGGPYAKITPDVIDS 379

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
           PAH+ LA + A   +VLLKN  G LPL     + +A++GP A+   T    Y+G     +
Sbjct: 380 PAHRRLARETAAKAMVLLKNERGTLPL--DPGMKVAVVGPLADVLYTDW--YSGRPTYQV 435

Query: 450 TPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
           TPL  ++    +     G D VA    +  + +    G +   L +      E E+ D  
Sbjct: 436 TPLDGIRERAASVTSSEGVDRVAFKDLATGRYITAGSGPEGADLRLSATTIGETEQFDVF 495

Query: 510 D 510
           D
Sbjct: 496 D 496



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 4/106 (3%)

Query: 476 ASIDKAVDIAKGADHVVLMMG-LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVIL 534
           + ID AV  AK AD  V+++G +     +E+ DR D+ L   Q+ L+  V  A  + V+ 
Sbjct: 593 SGIDDAVAKAKEADVAVVVVGSMPFINGREDHDRTDMNLAEGQEALVKAVFNANPRTVV- 651

Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           VL    P  I +   D ++ +ILW  + G     ALA+V++GD NP
Sbjct: 652 VLENSYPTTINW--IDEHVPAILWTTHAGAETGNALADVLYGDVNP 695


>gi|383115540|ref|ZP_09936296.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
 gi|313695055|gb|EFS31890.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
          Length = 770

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 167/493 (33%), Positives = 255/493 (51%), Gaps = 39/493 (7%)

Query: 103 GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
            I  NG  RG+T +P  I  AA+FD  L  R+ +   +E RA       +  TF  PN+ 
Sbjct: 136 AIHGNGLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA-----SGMHWTF-TPNVE 189

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT--AY 220
           + RD RWGR  ET GEDP + G+   + VRG Q   F G       +  AC KH    + 
Sbjct: 190 VARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGND-----KVIACAKHLVGGSQ 244

Query: 221 DLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNL 280
             +   G       A ++ + L + + PPF+ C++ G  + +M A+N +NGIP   ++ L
Sbjct: 245 PANGINGAP-----AELSERTLQEVFFPPFKDCLEAGVFT-VMTAHNELNGIPCHGNKYL 298

Query: 281 LSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHT 339
           +++  R QW F G++ SD   +  ++D    A++ +DA    + AGM ++  G    +  
Sbjct: 299 MTEVLRNQWKFDGFVVSDWMDIERMHDYHNVAETLKDAYRISVDAGMGMHMHGPEFYEAI 358

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQA 399
              VK+  +PE +ID A+  +  V+ RLGLF  NP +    K   ++V +  HQ  AL+ 
Sbjct: 359 IECVKEGSIPEKQIDAAVSKILEVKFRLGLFE-NPFIDL--KKKDEIVFNEKHQQTALEG 415

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQN 457
           A+  IVLLKN   +LPL  SK   + + G NAN+ +++LG++A   P     T L+ L+ 
Sbjct: 416 ARKSIVLLKNEGNMLPLDASKYKKVFVTGHNANN-QSILGDWAMEQPEEHVTTVLKGLKA 474

Query: 458 YVENTVYY---PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQ----EK---EELD 507
               T Y     G +    S   I +AV  A+ +D  +L++G +  +    EK   E  D
Sbjct: 475 ISPETNYNFLDLGWNVRLLSDNQIKEAVQQARNSDLAILVVGENSMRYHWNEKTCGENSD 534

Query: 508 RVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGA 567
           R +L LPGRQQEL+  VA A   P +++L+ G P  +T    D N+  I+ A  PG AG 
Sbjct: 535 RYELSLPGRQQELVKAVA-ATGVPTVVILVNGRP--LTTEWIDENMPCIIEAWEPGVAGG 591

Query: 568 VALAEVIFGDHNP 580
            ALAE+++G  NP
Sbjct: 592 QALAEILYGKVNP 604


>gi|294647557|ref|ZP_06725134.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294807095|ref|ZP_06765914.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345508184|ref|ZP_08787819.1| periplasmic beta-glucosidase [Bacteroides sp. D1]
 gi|292637099|gb|EFF55540.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294445794|gb|EFG14442.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345455214|gb|EEO50370.2| periplasmic beta-glucosidase [Bacteroides sp. D1]
          Length = 783

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 164/527 (31%), Positives = 261/527 (49%), Gaps = 62/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT FP  I  AA++   L   +G+AIG 
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R         G   + P +++ RDPRW R +ET GEDP++TG+   + V G+      
Sbjct: 179 EIRL------QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGL-----G 227

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
           GG L       A  KHF AY +          F     +++L + + PPF   +  G A 
Sbjct: 228 GGDLSHPYSTLATLKHFLAYGISESGQNGNPSFAG---IRELHENFLPPFRQAIDAG-AL 283

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G+P  A+ +LL++  R +W F G + SD  ++  I+ +   A + E+A +
Sbjct: 284 SVMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLYSIEGIHQSHFVAPTMEEAAI 343

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             L AG+DV+ G     +   AV   ++ ++ +D ++  +  ++  +GLF  NP + P  
Sbjct: 344 LALSAGVDVDLGGDAYMNLMNAVNTGRISKTALDASVARVLRLKFEMGLFE-NPYVDP-- 400

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           +     V S     LA + AQ  I LLKN H LLPL K++ V  ALIGPNA++   +LG+
Sbjct: 401 EKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKNRKV--ALIGPNADNRYNMLGD 458

Query: 441 YAGPSCRS--ITPLQALQNYVENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P       T L  ++  + ++   Y  GC      +  I++AV  A+ ++ ++ ++G
Sbjct: 459 YTAPQEEENIKTVLDGIRAKLSSSQVEYVKGCSIRDTVTTDIEQAVAAAQRSEVIIAVVG 518

Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
                                  +   +  E  DR  L L G+QQEL+ +  +A  KP+I
Sbjct: 519 GSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-KALKATGKPLI 577

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P+D  +A    N  ++L A YPG+ G +A+A+V+FGD NP
Sbjct: 578 VVYIEGRPLDKNWA--SENADAVLTAYYPGQEGGIAIADVLFGDFNP 622


>gi|354580734|ref|ZP_08999639.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353203165|gb|EHB68614.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 766

 Score =  233 bits (593), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 166/533 (31%), Positives = 265/533 (49%), Gaps = 67/533 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP   +  E  HG   +G           AT FP  +   ++++  L+  + +A+  
Sbjct: 104 RLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTWNPELFRSMCRAVAA 151

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R+        G   ++P +++ RDPRWGR +ET GEDP +  ++AV+ V+G+QGD   
Sbjct: 152 ETRS------QGGAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFAVAAVQGLQGD--- 202

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
             +L  +    A  KHF  Y         R      + +++L +    PF   V+ G A 
Sbjct: 203 --RLDAEDSLLATLKHFAGYGASEGG---RNGAPVHMGLRELHEIDLLPFRKAVEAG-AQ 256

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M AYN ++G+P  + R LL    R  WGF G++ +DC A+ ++      A S E+A  
Sbjct: 257 SVMTAYNEIDGVPCTSSRYLLHDVLREAWGFDGFVITDCGAIDMLKSGHNTAASGEEAAA 316

Query: 321 DVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             L AG+D+   GS  + + + A++Q  + E +++ A+  + +++ RLGLF+  P   P 
Sbjct: 317 QALTAGVDMEMSGSMFRVYLRQALEQGHITEDDLNTAVGRVLAMKFRLGLFD-RPYTDP- 374

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
            +    V+    H  LA + A +GIVLLKN   +LPL   K+  +A+IGPNAN+    LG
Sbjct: 375 -ERAEKVIGCEEHIELARRVAAEGIVLLKNEGNVLPL-NPKTGKIAVIGPNANAPYNQLG 432

Query: 440 NYAGPS--CRSITPLQALQNYV-----ENTVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           +Y  P    + IT L+ ++ ++        +Y PGC     S   +  A+  A  AD +V
Sbjct: 433 DYTSPQPPGQIITVLEGIRRHIGEDADTRVLYAPGCRIQGDSREGLSHALACAAEADVIV 492

Query: 493 LMM-----------------------GLDQTQEK--EELDRVDLVLPGRQQELITRVAEA 527
           + +                       GL Q+  +  E +DR  L L G Q EL+  + + 
Sbjct: 493 MAIGGSSARDFGEGTIDLRTGASVVTGLAQSDMECGEGIDRSTLHLMGVQLELLQEIHKL 552

Query: 528 AKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
             KPV++V + G P  IT    D +I +IL A YPG+ G  A+A+++FGD NP
Sbjct: 553 G-KPVVVVYINGRP--ITEPWIDEHIPAILEAWYPGQEGGSAIADILFGDVNP 602


>gi|227831319|ref|YP_002833099.1| glycoside hydrolase family protein [Sulfolobus islandicus L.S.2.15]
 gi|227457767|gb|ACP36454.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           L.S.2.15]
          Length = 754

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 176/568 (30%), Positives = 278/568 (48%), Gaps = 77/568 (13%)

Query: 50  LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
           L I Q  R   SRL L  K +++LVN          ++ IPA     E L G+ G     
Sbjct: 45  LGIGQITRVAGSRLGLKPKEVAKLVNKIQKFLVENTKMKIPAI-IHEECLSGLMGYS--- 100

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
                   +T+FPQ I  A++++  L   I   I  + R L    Q +     +P +++ 
Sbjct: 101 --------STAFPQAIGLASTWNPELVMDIASVIRSQGR-LVGVNQCL-----SPVLDVC 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           +DPRWGR +ET GEDP +     ++Y+ G+QGD           Q  A  KHF A+    
Sbjct: 147 KDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----------QLVATAKHFAAH---G 193

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           +    R      V  ++L +T+  PFE  VK G+   IM AY+ ++GIP   +  LL+  
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNI 253

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
            R++WGF G + SD D +  +      A +  +A +  L++G+D+   +     +    A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYGEPLVNA 313

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
           +K+  +PES IDRA+  +  ++ RLGL +      PF    +  + +     + LAL+ A
Sbjct: 314 LKEGLVPESLIDRAVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTA 368

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
           ++ IVLLKN + +LPL K+ +  +A+IGPNAN  + +LG+Y              +T LQ
Sbjct: 369 RESIVLLKNENNILPLSKNVN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427

Query: 454 ALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQ--------TQEK 503
            +   V     +Y  GCD  + S     +A++IA+ AD ++ +MG              K
Sbjct: 428 GIVKKVGESKVLYAKGCDIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSK 487

Query: 504 EEL-----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
           EE            DR  L LPG Q+EL+  + +   KP+ILVL+ G P  +  +     
Sbjct: 488 EEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTG-KPIILVLINGRP--LVLSSIINY 544

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + +++ A +PGE G  A+A+VIFGD+NP
Sbjct: 545 VKAVIEAWFPGEEGGNAIADVIFGDYNP 572


>gi|262405113|ref|ZP_06081663.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
 gi|262355988|gb|EEZ05078.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
          Length = 769

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 164/527 (31%), Positives = 261/527 (49%), Gaps = 62/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT FP  I  AA++   L   +G+AIG 
Sbjct: 117 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 164

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R         G   + P +++ RDPRW R +ET GEDP++TG+   + V G+      
Sbjct: 165 EIRL------QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGL-----G 213

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
           GG L       A  KHF AY +          F     +++L + + PPF   +  G A 
Sbjct: 214 GGDLSHPYSTLATLKHFLAYGISESGQNGNPSF---AGIRELHENFLPPFRQAIDAG-AL 269

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G+P  A+ +LL++  R +W F G + SD  ++  I+ +   A + E+A +
Sbjct: 270 SVMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLYSIEGIHQSHFVAPTMEEAAI 329

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             L AG+DV+ G     +   AV   ++ ++ +D ++  +  ++  +GLF  NP + P  
Sbjct: 330 LALSAGVDVDLGGDAYMNLMNAVNTGRISKTALDASVARVLRLKFEMGLFE-NPYVDP-- 386

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           +     V S     LA + AQ  I LLKN H LLPL K++ V  ALIGPNA++   +LG+
Sbjct: 387 EKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKNRKV--ALIGPNADNRYNMLGD 444

Query: 441 YAGPSCRS--ITPLQALQNYVENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P       T L  ++  + ++   Y  GC      +  I++AV  A+ ++ ++ ++G
Sbjct: 445 YTAPQEEENIKTVLDGIRAKLSSSQVEYVKGCSIRDTVTTDIEQAVAAAQRSEVIIAVVG 504

Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
                                  +   +  E  DR  L L G+QQEL+ +  +A  KP+I
Sbjct: 505 GSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-KALKATGKPLI 563

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P+D  +A    N  ++L A YPG+ G +A+A+V+FGD NP
Sbjct: 564 VVYIEGRPLDKNWA--SENADAVLTAYYPGQEGGIAIADVLFGDFNP 608


>gi|423346097|ref|ZP_17323785.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
           CL03T12C32]
 gi|409220895|gb|EKN13848.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
           CL03T12C32]
          Length = 955

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 174/609 (28%), Positives = 290/609 (47%), Gaps = 99/609 (16%)

Query: 49  TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALH 95
           T+PI  R  DL+S++ ++EK  Q+V       + +  +P  +W    W        E L+
Sbjct: 65  TVPIDARVEDLLSQMNVEEKTCQMVTLYGYKRVLKDDLPTSDWKKQLWKDGIGAIDEHLN 124

Query: 96  GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
           G    G                                GI   F N  IRG     AT+F
Sbjct: 125 GFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYIATNF 184

Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
           P  +    +++  L +++G   G E R L       G T  +AP +++ RD RWGR +E 
Sbjct: 185 PTQLGLGHTWNRNLVHKVGYITGREGRLL-------GYTNVYAPILDVGRDQRWGRYEEV 237

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
            GE P +  +  +   +G+Q D           Q +A  KH+ AY  +        + D 
Sbjct: 238 YGESPYLVAELGIEMAKGMQTDH----------QVAATSKHYIAYSNNKGGREGMARVDP 287

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
           +++ +++   +  P++  +K+    G+M +YN  +G P  +    L+   R ++GF GY+
Sbjct: 288 QMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYV 347

Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
            SD DAV  ++   G A   +++V+  + AG+++ C +F    +     +  + +  +P 
Sbjct: 348 VSDSDAVEYLFSKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGAIPM 406

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
           S ID  + ++  V+  +GLF+ +P      +   +V C+   QV ALQA+++ +VLLKN 
Sbjct: 407 STIDDRVRDILRVKFLVGLFD-HPYQIDLKETDKEVNCAENQQV-ALQASKESLVLLKNQ 464

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPG 467
             +LPL  +K   +A+ GPNA+     L +Y   +    T L+ ++N V+   N ++  G
Sbjct: 465 DAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTNVLFTKG 524

Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
           CD V               A   + IDKAV+ AK +D  V+++G       E   R  L 
Sbjct: 525 CDLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSDRTCGENKSRSSLD 584

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           LPGRQ +L+  V  A  KPV+L+L+ G P+ I +A  D+ + +IL A YPG  G  A+A+
Sbjct: 585 LPGRQLDLLQAVV-ATGKPVVLILINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIAD 641

Query: 573 VIFGDHNPG 581
            +FGD+NPG
Sbjct: 642 ALFGDYNPG 650


>gi|284998833|ref|YP_003420601.1| glycoside hydrolase family protein [Sulfolobus islandicus L.D.8.5]
 gi|284446729|gb|ADB88231.1| glycoside hydrolase, family 3 domain protein [Sulfolobus islandicus
           L.D.8.5]
          Length = 754

 Score =  232 bits (592), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 176/568 (30%), Positives = 280/568 (49%), Gaps = 77/568 (13%)

Query: 50  LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
           L I Q  R   SRL L  K +++LVN          ++ IPA     E L G+ G     
Sbjct: 45  LGIGQITRVAGSRLGLKPKEVAKLVNKIQKFLVENTKMKIPAI-IHEECLSGLMGYS--- 100

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
                   +T+FPQ I  A++++  L   I   I  + R L    Q +     +P +++ 
Sbjct: 101 --------STAFPQAIGLASTWNPELVMDIASVIRSQGR-LVGVNQCL-----SPVLDVC 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           +DPRWGR +ET GEDP +     ++Y+ G+QGD           Q  A  KHF A+    
Sbjct: 147 KDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----------QLVATAKHFAAH---G 193

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           +    R      V  ++L +T+  PFE  VK G+   IM AY+ ++GIP   +  LL+  
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNI 253

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
            R++WGF G + SD D +  +      A +  +A +  L++G+D+   +     +    A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYGEPLVNA 313

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
           +K+  +PES IDRA+  +  ++ RLGL +      PF    +  + +     + LAL+ A
Sbjct: 314 LKEGLVPESLIDRAVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTA 368

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
           ++ IVLLKN + +LPL K+ +  +A+IGPNAN  + +LG+Y              +T LQ
Sbjct: 369 RESIVLLKNENNILPLSKNVN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427

Query: 454 ALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQE 502
            +   V     +Y  GCD  + S     +A++IA+ AD ++ +MG         +D   E
Sbjct: 428 GIVKKVGESKVLYAKGCDIASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSE 487

Query: 503 KEEL----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
           +E            DR  L LPG Q+EL+  + +   KP+ILVL+ G P  +  +     
Sbjct: 488 EEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTG-KPIILVLINGRP--LVLSSIINY 544

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + +++ A +PGE G  A+A+VIFGD+NP
Sbjct: 545 VKAVIEAWFPGEEGGNAIADVIFGDYNP 572


>gi|227828570|ref|YP_002830350.1| glycoside hydrolase [Sulfolobus islandicus M.14.25]
 gi|229585800|ref|YP_002844302.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.27]
 gi|227460366|gb|ACP39052.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.14.25]
 gi|228020850|gb|ACP56257.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.16.27]
          Length = 755

 Score =  232 bits (592), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 162/503 (32%), Positives = 260/503 (51%), Gaps = 60/503 (11%)

Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRW 169
           ++ AT+FPQ I  A+++D  L   +   I  +A+ L    Q +     +P +++ RDPRW
Sbjct: 98  VKTATAFPQAIGLASTWDPDLIREVSSTIRYQAK-LIGTNQCL-----SPVLDVCRDPRW 151

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
           GR +ET GED  +     ++YV+G+QG+           +  A  KHF A+    +    
Sbjct: 152 GRCEETYGEDQYLVASIGLAYVKGLQGEN----------ELIATVKHFAAH---GFPEGG 198

Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
           R      V  ++L + +  PFE  +K G+A  +M AY+ ++GIP  ++  LL+K  R++W
Sbjct: 199 RNIAPVHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQEW 258

Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV---NCGSFLQKHTKAAVKQK 346
           GF G + SD DA+  +      + + ++A +  L+AG+D    N   F +   + AVK+ 
Sbjct: 259 GFEGIVVSDYDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDCFGEPLLE-AVKEG 317

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            + ES IDRA+  +  ++ +LGLFN +   +    +   +  S + + LAL  A+  IVL
Sbjct: 318 LISESIIDRAVERVLRIKEKLGLFNNHYINE--NNVPEKLDNSKSRE-LALDVARKSIVL 374

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY-------AGPSCRSITPLQALQNYV 459
           LKN + +LPL K+   ++A+IGPNAN  + LLG+Y       A      +T L+ +   V
Sbjct: 375 LKNDN-ILPLNKNIG-TIAVIGPNANEPRNLLGDYTYTGHLNADGGIEVVTVLEGIMRKV 432

Query: 460 ENT---VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD------------------ 498
            N    +Y  GCD  A S     +A++IAK  D ++ +MG                    
Sbjct: 433 SNNTNVLYAKGCDIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGKDEFEK 492

Query: 499 -QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
            Q    E  DR  L LPG Q+EL+  + +   KP+ILVL+ G P  +  +     + +I+
Sbjct: 493 YQAVTGEGNDRTSLRLPGVQEELLKELHKTG-KPIILVLVNGRP--LALSSIFNEVNAII 549

Query: 558 WAGYPGEAGAVALAEVIFGDHNP 580
            A +PGE G  A+A+VIFGD+NP
Sbjct: 550 DAWFPGEEGGNAIADVIFGDYNP 572


>gi|224538725|ref|ZP_03679264.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224519667|gb|EEF88772.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 942

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 165/507 (32%), Positives = 257/507 (50%), Gaps = 54/507 (10%)

Query: 105 FFNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWA 158
           F N  IRG     AT+FP  +    +++  L  +IG   G EAR L       G T  +A
Sbjct: 160 FTNEGIRGVESYRATNFPTQLGLGHTWNRELIRQIGLITGREARML-------GYTNVYA 212

Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
           P +++ RD RWGR +E  GE P +  +  +  VRG+Q +           Q +A  KHF 
Sbjct: 213 PILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH----------QVAATGKHFV 262

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
           AY  +        + D +++ +++   +  PF+  +K+    G+M +YN  +G+P     
Sbjct: 263 AYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSY 322

Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
             L+   R + GF GY+ SD DAV  +Y     AK  ++AV   ++AG++V C +F    
Sbjct: 323 YWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPD 381

Query: 339 T-----KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA-H 392
           +     +  VK+  L E  I+  + ++  V+  +GLF+  P        GAD     A +
Sbjct: 382 SYVLPLRELVKEGGLSEEVINDRVRDILRVKFLVGLFD-TPYQTDLA--GADKEVEKAEN 438

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + LALQA+++ +VLLKN + +LPL  +    +A+ GPNA+     L +Y   +    T L
Sbjct: 439 ESLALQASRESLVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVL 498

Query: 453 QALQNYVE---NTVYYPGCDTVACS---------------SASIDKAVDIAKGADHVVLM 494
           + ++   E     +Y  GCD V  +                A IDKAV+ A+ AD  V++
Sbjct: 499 EGIRQKAEGKAEVLYTKGCDLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVV 558

Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
           +G  Q    E   R  L LPGRQ +L+  V +A  KPV+LVL+ G P+ I +A  D+ + 
Sbjct: 559 LGGGQRTCGENKSRSSLELPGRQLKLLQAV-QATGKPVVLVLINGRPLSINWA--DKFVP 615

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPG 581
           +IL A YPG  G  A+A+V+FGD+NPG
Sbjct: 616 AILEAWYPGSKGGTAVADVLFGDYNPG 642


>gi|261405721|ref|YP_003241962.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261282184|gb|ACX64155.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 765

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 169/552 (30%), Positives = 269/552 (48%), Gaps = 67/552 (12%)

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
           E ++ +   A    RLGIP      E  HG   +G            T FP  +   +++
Sbjct: 89  EAVNHIQRYAVEQSRLGIPIL-IGEECSHGHMAIG-----------GTVFPVPLSIGSTW 136

Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
           +  L+  + +A+ LE R+        G   ++P +++ RDPRWGR +E  GEDP +  +Y
Sbjct: 137 NVDLYRDMCRAVALETRS------QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEY 190

Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
           AV+ V G+QG++     L      +A  KHF  Y         R      +  ++L +  
Sbjct: 191 AVASVEGLQGES-----LDSPSSVAATLKHFVGYGSSE---GGRNAGPVHMGTRELMEVD 242

Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
             PF+  V+ G AS IM AYN ++G+P   +  LL    R++WGF G + +DC A+ ++ 
Sbjct: 243 MLPFKKAVEAGAAS-IMPAYNEIDGVPCTVNTELLDGILRKEWGFDGMVITDCGAIDMLA 301

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRM 365
                A+   DA V  ++AG+D+   G    KH + AV+  KL  S +D A+  + +++ 
Sbjct: 302 SGHDTAEDGMDAAVQAIRAGIDMEMSGEMFGKHLQKAVESNKLEVSVLDEAVRRVLTLKF 361

Query: 366 RLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLA 425
           +LGLF  NP + P  +   +V+ S  H  LA Q A +GIVLLKN    LPL K   V +A
Sbjct: 362 KLGLFE-NPYVDP--QTAENVIGSEQHVGLARQLAAEGIVLLKNEAKALPLSKEGGV-IA 417

Query: 426 LIGPNANSAKTLLGNYAGPS-CRSITPL-----QALQNYVENTVYYPGCDTVACSSASID 479
           +IGPNA+     LG+Y  P    ++T +       L    +  +Y PGC     S    +
Sbjct: 418 VIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPGCRIKDDSREGFE 477

Query: 480 KAVDIAKGADHVVLMMGLDQTQE-------------------------KEELDRVDLVLP 514
            A+  A+ AD VV+++G    ++                          E +DR+ L L 
Sbjct: 478 FALTCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSDMDCGEGIDRMTLQLS 537

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           G Q EL+  + +  K+ +I+V + G P+   +   D +  +IL A YPG+ G  A+A+++
Sbjct: 538 GVQLELVQEIHKLGKR-MIVVYINGRPIAEPW--IDEHADAILEAWYPGQEGGHAVADIL 594

Query: 575 FGDHNP-GEFTV 585
           FGD NP G+ T+
Sbjct: 595 FGDVNPSGKLTM 606


>gi|282878201|ref|ZP_06286997.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
 gi|281299619|gb|EFA91992.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
          Length = 947

 Score =  232 bits (591), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 179/609 (29%), Positives = 290/609 (47%), Gaps = 99/609 (16%)

Query: 49  TLPISQRARDLVSRLTLDEKISQLV----------------NSAPAIPRLGIPAY----- 87
           T  I +R  DL+S++TL+EK  Q+V                +    + + GI A      
Sbjct: 63  TAGIDERINDLISQMTLEEKTCQMVTLYGYKRVLQDDLPNESWKQKLWKDGIGAIDEHLN 122

Query: 88  ---EW----------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSF 116
              EW          W  + H           V     GI   F N  IRG     AT+F
Sbjct: 123 GFQEWGLPPSDNPWVWPASKHAWAINEVQRFFVEQTRLGIPVDFTNEGIRGVESFKATNF 182

Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
           P  +    +++  L +++G   G EAR L       G T  +AP +++ RD RWGR +E 
Sbjct: 183 PTQLGLGTTWNRKLIHQVGYITGREARLL-------GYTNVYAPILDVGRDQRWGRYEEV 235

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
            GE P +  +  +   RG+Q +           Q ++  KHF AY  +        + D 
Sbjct: 236 YGESPFLVAELGIQMTRGLQTN----------YQVASTGKHFAAYSNNKGAREGMARVDP 285

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
           +++ +++ + +  P+   V++    G M +YN  +G+P     + L++  R+Q+GF GY+
Sbjct: 286 QMSPREVQNIHLYPWGRVVREAGLLGAMSSYNDYDGVPIQGSFHWLTEVLRQQFGFKGYV 345

Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPE 350
            SD DA+  ++     A + ++AV   + AG++V C      SF+    +  VK+ ++P 
Sbjct: 346 VSDSDALEYLFSKHRTAANMKEAVYKAVMAGLNVRCTFRSPDSFVLP-LRELVKEGRIPM 404

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
             ID  L ++  V+  +G+F+    M    +     V   +HQ +ALQA+++ IVLLKN 
Sbjct: 405 KVIDERLRDILRVKFMVGIFDRPYQMNL--QAADKEVDGKSHQQVALQASRESIVLLKNQ 462

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPG 467
           +  LPL K+    +A+ GPNAN A   L +Y   +    T  + ++N V + V   Y  G
Sbjct: 463 NNTLPLNKASIKKIAVCGPNANDAAYALTHYGPLAVEVTTVFEGIRNKVGSDVEVTYTKG 522

Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
           CD V               A     IDKAV+  + +D  V+++G +     E   R  L 
Sbjct: 523 CDLVDAHWPESELVDYPMTADEQNEIDKAVEQVRQSDVAVVVLGGNSRTCGENKSRSSLE 582

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           LPGRQ +L+ +  +A  K VILVL+ G P+ + +A  D+ + +I+ A YPG  G  A+A+
Sbjct: 583 LPGRQLQLL-KAVQATGKTVILVLINGRPLSVNWA--DKFVPAIVEAWYPGSQGGTAVAD 639

Query: 573 VIFGDHNPG 581
           V+FGD+NPG
Sbjct: 640 VLFGDYNPG 648


>gi|423722678|ref|ZP_17696831.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
           CL09T00C40]
 gi|409241951|gb|EKN34716.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
           CL09T00C40]
          Length = 955

 Score =  232 bits (591), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 174/609 (28%), Positives = 292/609 (47%), Gaps = 99/609 (16%)

Query: 49  TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALH 95
           T+PI  R  DL+S++ ++EK  Q+V       + +  +P  +W    W        E L+
Sbjct: 65  TVPIDARVEDLLSQMNVEEKTCQMVTLYGYKRVLKDDLPTSDWKKQLWKDGIGAIDEHLN 124

Query: 96  GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
           G    G                                GI   F N  IRG     AT+F
Sbjct: 125 GFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYIATNF 184

Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
           P  +    +++  L +++G   G E R L       G T  +AP +++ RD RWGR +E 
Sbjct: 185 PTQLGLGHTWNRNLVHKVGYITGREGRLL-------GYTNVYAPILDVGRDQRWGRYEEV 237

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
            GE P +  +  V   +G+Q D           Q +A  KH+ AY  +        + D 
Sbjct: 238 YGESPYLVAELGVEMAKGMQTD----------YQVAATSKHYIAYSNNKGGREGMARVDP 287

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
           +++ +++   +  P++  +K+    G+M +YN  +G P  +    L+   R ++GF GY+
Sbjct: 288 QMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYV 347

Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
            SD DAV  ++   G A   +++V+  + AG+++ C +F    +     +  + +  +P 
Sbjct: 348 VSDSDAVEYLFSKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGAIPM 406

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
           S ID  + ++  V+  +GLF+ +P      +   +V C+  +Q++ALQA+++ +VLLKN 
Sbjct: 407 STIDDRVRDILRVKFLVGLFD-HPYQIDLKETDKEVNCA-ENQLVALQASKESLVLLKNQ 464

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPG 467
             +LPL  +K   +A+ GPNA+     L +Y   +    T L+ ++N V+   + ++  G
Sbjct: 465 DAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTDVLFTKG 524

Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
           CD V               A   + IDKAV+ AK +D  V+++G       E   R  L 
Sbjct: 525 CDLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENKSRSSLD 584

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           LPGRQ +L+  V  A  KPV+LVL+ G P+ I +A  D+ + +IL A YPG  G  A+A+
Sbjct: 585 LPGRQLDLLQAVV-ATGKPVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIAD 641

Query: 573 VIFGDHNPG 581
            +FGD+NPG
Sbjct: 642 ALFGDYNPG 650


>gi|238620766|ref|YP_002915592.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.4]
 gi|238381836|gb|ACR42924.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.16.4]
          Length = 755

 Score =  231 bits (590), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 162/503 (32%), Positives = 260/503 (51%), Gaps = 60/503 (11%)

Query: 110 IRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRW 169
           ++ AT+FPQ I  A+++D  L   +   I  +A+ L    Q +     +P +++ RDPRW
Sbjct: 98  VKTATAFPQAIGLASTWDPDLIREVSSTIRYQAK-LIGTNQCL-----SPVLDVCRDPRW 151

Query: 170 GRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTT 229
           GR +ET GED  +     ++YV+G+QG+           +  A  KHF A+    +    
Sbjct: 152 GRCEETYGEDQYLVASIGLAYVKGLQGEN----------ELIATVKHFAAH---GFPEGG 198

Query: 230 RYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQW 289
           R      V  ++L + +  PFE  +K G+A  +M AY+ ++GIP  ++  LL+K  R++W
Sbjct: 199 RNIAPVHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKILRQEW 258

Query: 290 GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV---NCGSFLQKHTKAAVKQK 346
           GF G + SD DA+  +      + + ++A +  L+AG+D    N   F +   + AVK+ 
Sbjct: 259 GFEGIVVSDYDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDCFGEPLLE-AVKEG 317

Query: 347 KLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVL 406
            + ES IDRA+  +  ++ +LGLFN +   +    +   +  S + + LAL  A+  IVL
Sbjct: 318 LISESIIDRAVERVLRIKEKLGLFNDHYINE--NNVPEKLDNSKSRE-LALDVARKSIVL 374

Query: 407 LKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNY-------AGPSCRSITPLQALQNYV 459
           LKN + +LPL K+   ++A+IGPNAN  + LLG+Y       A      +T L+ +   V
Sbjct: 375 LKNDN-ILPLNKNIG-TIAVIGPNANEPRNLLGDYTYTGHLNADVGIEVVTVLEGIMRKV 432

Query: 460 ENT---VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD------------------ 498
            N    +Y  GCD  A S     +A++IAK  D ++ +MG                    
Sbjct: 433 SNNTNVLYAKGCDIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPGKDEFEK 492

Query: 499 -QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSIL 557
            Q    E  DR  L LPG Q+EL+  + +   KP+ILVL+ G P  +  +     + +I+
Sbjct: 493 YQAVTGEGNDRTSLRLPGVQEELLKELHKTG-KPIILVLVNGRP--LALSSIFNEVNAII 549

Query: 558 WAGYPGEAGAVALAEVIFGDHNP 580
            A +PGE G  A+A+VIFGD+NP
Sbjct: 550 DAWFPGEEGGNAIADVIFGDYNP 572


>gi|251795943|ref|YP_003010674.1| beta-glucosidase [Paenibacillus sp. JDR-2]
 gi|247543569|gb|ACT00588.1| Beta-glucosidase [Paenibacillus sp. JDR-2]
          Length = 940

 Score =  231 bits (590), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 148/424 (34%), Positives = 223/424 (52%), Gaps = 28/424 (6%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           ++ S + +PF    LP+ QR  DLV+R TL+EKI+ +      I  LG+ AY+  +EA H
Sbjct: 2   THNSEQKYPFQNPDLPLEQRVNDLVNRFTLEEKINLMCQYQDEITHLGVKAYKHGTEAAH 61

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GM 154
           G+A +G+          AT+FPQ I  A ++D  L  +IG  IG EAR  Y+      G+
Sbjct: 62  GMAWLGE----------ATTFPQPIGLACTWDKDLMKKIGSVIGDEARGFYSQNPTHNGL 111

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           T WAP +++ RDPRWGR +E  GEDP +TGK + +  +G+QGD          ++A A  
Sbjct: 112 TLWAPTVDMERDPRWGRTEEAYGEDPHLTGKLSAALTQGIQGDH------PFYVKAVASL 165

Query: 215 KHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPS 274
           KHF   + +  +G      D R    +  + Y   FE   K+G A  +M AYN VNG+P+
Sbjct: 166 KHFIGNNNEINRGECSVSLDPR----NKNEYYLKAFEIPFKEGGALSMMTAYNSVNGVPA 221

Query: 275 CADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCG 332
             + + ++   +R WG +G++ SD   V    +   Y  + ++AV   +KAG+D   +  
Sbjct: 222 NINPD-VNNIVKRDWGMNGFVVSDAGDVLGTVNDHHYVDTYKEAVALTIKAGVDSITDDH 280

Query: 333 SFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAH 392
              ++  + A+ +  L E+++D AL N F VR RLG F+      P+  I   V+  P H
Sbjct: 281 PISKQAIRDALAEGMLTENDLDIALRNTFRVRFRLGEFDP-ADRNPYAAIDESVIMKPEH 339

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
             LAL A +  IVLLKN  G+LPL   K    A+IGP A         Y+G    SI+P 
Sbjct: 340 AELALDAVRKSIVLLKND-GILPLSAEKLSKAAVIGPLA--GIVYRDWYSGLMPYSISPF 396

Query: 453 QALQ 456
           + ++
Sbjct: 397 EGIR 400



 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 46/79 (58%), Gaps = 5/79 (6%)

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGG-PVDITFAKYDRNIGSILWAGY 561
           KE +DR DL L   Q++LI  V   A  P  +V++ G  P  + + +   NI +I++  +
Sbjct: 575 KETIDRPDLTLAAYQEKLIREVY--AVNPNTVVVVVGSYPFAMPWVQ--ENIPAIVYLSH 630

Query: 562 PGEAGAVALAEVIFGDHNP 580
            G+    ALA+V+FGD+NP
Sbjct: 631 AGQELGHALADVLFGDYNP 649


>gi|319643197|ref|ZP_07997825.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
 gi|345520511|ref|ZP_08799899.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|254835034|gb|EET15343.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|317385101|gb|EFV66052.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
          Length = 788

 Score =  231 bits (590), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 176/601 (29%), Positives = 279/601 (46%), Gaps = 95/601 (15%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
           P+  R +DL+S++TL+EK  Q+     +  + +  +P   W    W + +  +     G+
Sbjct: 49  PLEDRVQDLLSQMTLEEKTCQMATLYGSGRVLKDALPQDNWKTEVWKDGIGNIDEEHNGL 108

Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
                                               F N  IRG     AT FP      
Sbjct: 109 GAFKSEYSFPYAKHVDAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATYFPAQCGQG 168

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L  RIG+    EA AL       G T  ++P ++I +DPRWGR  ET GEDP +
Sbjct: 169 ATWNKKLIARIGEVEAKEAVAL-------GYTNIYSPILDIAQDPRWGRCVETYGEDPYL 221

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            G+     +  +Q           K    A  KHF  Y +       + + D  V  +++
Sbjct: 222 VGELGKQMITSLQ-----------KYNLVATPKHFAVYSIPIGGRDGKTRTDPHVAPREM 270

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
              Y  PF    ++  A G+M +YN  +G P     + L++  R++WGF GY+ SD +AV
Sbjct: 271 RTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKGYVVSDSEAV 330

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRAL 357
             I +    A + ED +   + AG+++         F+    K AV   K+ +  +D+ +
Sbjct: 331 EFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFILPLRK-AVDDGKISQETLDKRV 389

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
             +  ++  LGLF+ NP  +  GK    +V S  HQ ++L+AA+  +VLLKN   LLPL 
Sbjct: 390 AEILRIKFWLGLFD-NP-YRGNGKQAEQIVHSKEHQAVSLEAARQSLVLLKNETHLLPLS 447

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN--TVYYPGCDTV---- 471
           KS   S+A+IGPNA+    L+  Y   +    T  Q ++  + +   +Y  GCD +    
Sbjct: 448 KSIR-SIAVIGPNADEQTQLICRYGPANAPIKTVYQGIKELLPHAEVIYKKGCDIIDPHF 506

Query: 472 -----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
                      A     + + +  AK A+ VV+++G ++   +E+  R  L LPGRQ+EL
Sbjct: 507 PESEILDFPKTAEEVRLMQEVIRAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQEEL 566

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  V  A  KPVILV+L G    I +A    ++ +IL A +PGE    A+AE +FGD+NP
Sbjct: 567 LKAVC-ATGKPVILVMLDGRASSINYAA--AHVPAILHAWFPGEFCGQAVAEALFGDYNP 623

Query: 581 G 581
           G
Sbjct: 624 G 624


>gi|317503000|ref|ZP_07961085.1| beta-glucosidase, partial [Prevotella salivae DSM 15606]
 gi|315665888|gb|EFV05470.1| beta-glucosidase [Prevotella salivae DSM 15606]
          Length = 770

 Score =  231 bits (589), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 174/597 (29%), Positives = 287/597 (48%), Gaps = 95/597 (15%)

Query: 52  ISQRARDLVSRLTLDEKISQL----------------------VNSAPAI-PRLGIPAYE 88
           ++QR  DL+ R+TL+EK+ Q+                       N+A A  P + +   E
Sbjct: 33  VAQRVDDLLRRMTLEEKVGQMNQLVGIEHFKTNSITMSAEELATNTATAFYPGVTVSEIE 92

Query: 89  WW------SEALHG----------------------VAGVGKGIFFNGTIRGATSFPQVI 120
           +W      S  LH                       + G+   I  N   +  T +P  I
Sbjct: 93  YWVRRGWVSSFLHVLTLEEANYLQKLSMQSRLQIPLIIGI-DAIHGNAKCKNNTVYPTNI 151

Query: 121 LTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDP 180
             A+SFD  L Y+I +    E RA+      +   F  PN+ + RD RWGR  ET GEDP
Sbjct: 152 GLASSFDVDLAYKIARQTAEEMRAM-----NMHWNF-NPNVEVARDGRWGRCGETFGEDP 205

Query: 181 LVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT--AYDLDNWKGTTRYKFDARVT 238
            +  +  V+  +G Q +  N   + G      C KHF   +Y ++   G         V+
Sbjct: 206 YLVMQMGVATNKGYQRNLDNTSDVLG------CVKHFVGGSYSINGTNGAP-----CDVS 254

Query: 239 MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSD 298
            + L + + PPF++ ++QG    +M ++N +NGIP   +R L++   R++WGF G+I SD
Sbjct: 255 ERTLREVFFPPFKATLQQGGDWNVMMSHNELNGIPCHTNRWLMTDVLRKEWGFQGFIVSD 314

Query: 299 CDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEIDRAL 357
              +    D    AK  ++A    + AGMD++  G   QK     V++ ++PES ID ++
Sbjct: 315 WMDIEHCVDQHHTAKDNKEAFYQSIMAGMDMHMHGPEWQKDVVELVREGRIPESRIDESV 374

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAAQDGIVLLKNSHGLLP 415
             + +V+ RLGLF       P+  +     V+  P H+  AL A+++ IVLLKN   LLP
Sbjct: 375 RRILTVKFRLGLFE-----HPYSDVKTRDRVINDPVHKQTALDASRESIVLLKNEKQLLP 429

Query: 416 LPKSKSVSLALIGPNANSAKTLLGNYA--GPSCRSITPLQALQNYVENTVYY---PGCDT 470
           L + K   + + G NAN  + ++G+++   P  +  T L+ L+    +T +     G D 
Sbjct: 430 LDEQKYKKVLVTGINAND-QNIMGDWSELQPEDKVWTVLKGLKLVSPHTDFRFVDQGWDP 488

Query: 471 VACSSASIDKAVDIAKGADHVVLMMG-------LDQTQEKEELDRVDLVLPGRQQELITR 523
              S + +D AV+ AK +D  ++  G        ++    E+ DR +L L G Q++LI R
Sbjct: 489 RNMSQSQVDAAVEAAKESDLNIVCCGEYMMRFRWNERTSGEDTDRDNLELVGLQEQLIRR 548

Query: 524 VAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + E   KP IL+++ G P+ + +A    ++ +I+ A  PG+ G  A+AE+++G  NP
Sbjct: 549 LNETG-KPTILIIISGRPLSVRYAA--DHVPAIVNAWEPGQYGGQAIAEILYGKINP 602


>gi|225872720|ref|YP_002754177.1| xylan 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
 gi|225793233|gb|ACO33323.1| xylann 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
          Length = 721

 Score =  231 bits (589), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 175/561 (31%), Positives = 268/561 (47%), Gaps = 51/561 (9%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
           +PF    L   QR  DL+SR+TL EKI Q +   P +PRLGIP      E LHG A +G 
Sbjct: 24  YPFQNPALSPDQRIDDLLSRMTLQEKI-QALGDDPGVPRLGIPG-ALTEEGLHG-AAIGG 80

Query: 103 GIFFNGTIRG---ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAP 159
              + G  R     T FPQ      ++D  L  +       E R   N     G+   AP
Sbjct: 81  PAHWEGRGRAVVPTTQFPQNHGLGQTWDPALLQKAANVEAYETRWAVNKYHDGGLIVRAP 140

Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
           N N+ RDPRWGR +E+ GEDP + G  AV++++G+QG+           + +A  KHF A
Sbjct: 141 NANLSRDPRWGRTEESYGEDPYLVGTLAVAWIKGLQGNN------PRYWETAALMKHFDA 194

Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
           Y  +  +  +   F  R+      + Y  PF   ++QG +   M +YN  NGIP  A+  
Sbjct: 195 YSNEANRDGSSSNFGKRL----FYEYYSVPFRMGIEQGHSDAFMTSYNAWNGIPMTANP- 249

Query: 280 LLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT 339
           +L     ++WGF+G I +D  A+S +     Y K+  +A    + AG++     + Q+  
Sbjct: 250 VLKSVVMKKWGFNGIICTDAGALSNMVTHFHYYKTMPEAAAGAVHAGINQFLDRY-QQPV 308

Query: 340 KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIG--------ADVVCSPA 391
           + A++QK L E +ID+ L  ++ V +RLGL + + +M P+  IG         D    P+
Sbjct: 309 EEALQQKLLTEQQIDQDLKGVYRVVLRLGLMDPS-SMSPYSMIGLTNDNPAKGDPWDWPS 367

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP 451
           H  L  +   + IVLLKN +  LPL   K  S+A+IGP AN     L  Y+G     +TP
Sbjct: 368 HIALDRKVTDESIVLLKNQNHALPLDAKKLHSIAVIGPWANIVA--LDWYSGTPPFGVTP 425

Query: 452 LQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQT----------- 500
           ++ ++  V      P         +++  A  +AK +D  ++++G   T           
Sbjct: 426 VEGIRQRVG-----PDVKVTFNDGSNLQAAAALAKQSDEAIVIIGNHPTCDAGWGKCALP 480

Query: 501 -QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWA 559
            + KE  DR  L LP    E I +   AA    ++VL    P    + +   +I +IL  
Sbjct: 481 SEGKEAFDRTALNLP---DESIAKAVYAANPHTVVVLQTSFPYTTDWTQ--AHIPAILEM 535

Query: 560 GYPGEAGAVALAEVIFGDHNP 580
            +  E    ALA+V+FGD++P
Sbjct: 536 AHNSEEQGTALADVLFGDYDP 556


>gi|385776908|ref|YP_005649476.1| glycoside hydrolase family protein [Sulfolobus islandicus REY15A]
 gi|323475656|gb|ADX86262.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           REY15A]
          Length = 754

 Score =  231 bits (589), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 178/568 (31%), Positives = 280/568 (49%), Gaps = 77/568 (13%)

Query: 50  LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
           L I Q  R   SRL L  K +++LVN          R+ IPA     E L G+ G     
Sbjct: 45  LGIGQITRVAGSRLGLKPKEVAKLVNKIQKFLVENTRMKIPAI-IHEECLSGLMGYS--- 100

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
                   +T+FPQ I  A++++  L   I   I  +AR L    Q +     +P +++ 
Sbjct: 101 --------STAFPQAIGLASTWNLELVMDIASVIRSQAR-LVGVNQCL-----SPVLDVC 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           +DPRWGR +ET GEDP +     ++Y+ G+QGD           Q  A  KHF A+    
Sbjct: 147 KDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----------QLVATAKHFAAH---G 193

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           +    R      V  ++L +T+  PFE  VK G+   IM AY+ ++GIP   +  LL+  
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNI 253

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
            R++WGF G + SD D +  +      A +  +A +  L++G+D+   +     +    A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYSEPLVNA 313

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
           + +  +PES IDRA+  +  ++ RLGL +      PF    +  + +     + LAL+ A
Sbjct: 314 LTEGLVPESLIDRAVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTA 368

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
           ++ IVLLKN + +LPL K+ +  +A+IGPNAN  + +LG+Y              +T LQ
Sbjct: 369 RESIVLLKNENNILPLSKNVN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427

Query: 454 ALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG------LDQTQ--EK 503
            +   V     +Y  GCD  + S     +A++IA+ AD ++ +MG      L  T    +
Sbjct: 428 GVVKKVGESKVLYAKGCDIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSE 487

Query: 504 EEL-----------DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
           EE            DR  L LPG Q+EL+  + +   KP+ILVL+ G P  +  +     
Sbjct: 488 EEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTG-KPIILVLINGRP--LVLSPIINY 544

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + +++ A +PGE G  A+A+VIFGD+NP
Sbjct: 545 VKAVIEAWFPGEEGGNAIADVIFGDYNP 572


>gi|154493932|ref|ZP_02033252.1| hypothetical protein PARMER_03276 [Parabacteroides merdae ATCC
           43184]
 gi|154086192|gb|EDN85237.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Parabacteroides merdae ATCC 43184]
          Length = 955

 Score =  231 bits (589), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 173/609 (28%), Positives = 292/609 (47%), Gaps = 99/609 (16%)

Query: 49  TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALH 95
           T+PI  R  DL+S++ ++EK  Q+V       + +  +P  +W    W        E L+
Sbjct: 65  TVPIDARVEDLLSQMNVEEKTCQMVTLYGYKRVLKDDLPTSDWKKQLWKDGIGAIDEHLN 124

Query: 96  GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
           G    G                                GI   F N  IRG     AT+F
Sbjct: 125 GFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYIATNF 184

Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
           P  +    +++  L +++G   G E R L       G T  +AP +++ RD RWGR +E 
Sbjct: 185 PTQLGLGHTWNRNLVHKVGYITGREGRLL-------GYTNVYAPILDVGRDQRWGRYEEV 237

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
            GE P +  +  +   +G+Q D           Q +A  KH+ AY  +        + D 
Sbjct: 238 YGESPYLVAELGIEMAKGMQTDH----------QVAATSKHYIAYSNNKGGREGMARVDP 287

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
           +++ +++   +  P++  +K+    G+M +YN  +G P  +    L+   R ++GF GY+
Sbjct: 288 QMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYV 347

Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
            SD DAV  ++   G A   +++V+  + AG+++ C +F    +     +  + +  +P 
Sbjct: 348 VSDSDAVEYLFSKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGAIPM 406

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
           S ID  + ++  V+  +GLF+ +P      +   +V C+  +Q++ALQA+++ +VLLKN 
Sbjct: 407 STIDDRVRDILRVKFLVGLFD-HPYQIDLKETDKEVNCA-ENQLVALQASKESLVLLKNQ 464

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPG 467
             +LPL  +K   +A+ GPNA+     L +Y   +    T L+ ++N V+   + ++  G
Sbjct: 465 DAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTDVLFTKG 524

Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
           CD V               A   + IDKAV+ AK +D  V+++G       E   R  L 
Sbjct: 525 CDLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENKSRSSLD 584

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           LPGRQ +L+  V  A  KPV+LVL+ G P+ I +A  D+ + +IL A YPG  G  A+A+
Sbjct: 585 LPGRQLDLLQAVV-ATGKPVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIAD 641

Query: 573 VIFGDHNPG 581
            +FGD+NPG
Sbjct: 642 ALFGDYNPG 650


>gi|293371677|ref|ZP_06618088.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292633374|gb|EFF51944.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 783

 Score =  231 bits (588), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 163/527 (30%), Positives = 261/527 (49%), Gaps = 62/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT FP  I  AA++   L   +G+AIG 
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R         G   + P +++ RDPRW R +ET GEDP++TG+   + V G+      
Sbjct: 179 EIRL------QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAIVEGL-----G 227

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
           GG L       A  KHF AY +          F     +++L + + PPF   +  G A 
Sbjct: 228 GGDLSRPYSTLATLKHFLAYGISESGQNGNPSFAG---IRELHENFLPPFRQAIDAG-AL 283

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G+P  A+ +LL++  R +W F G + SD  ++  I+ +   A + E+A V
Sbjct: 284 SVMTSYNSMDGVPCTANHSLLTELLRNEWKFSGIVVSDLYSIEGIHQSHFVAPTMEEAAV 343

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             L AG+DV+ G     +   AV   ++ ++ +D ++  +  ++  +GLF  NP + P  
Sbjct: 344 LALSAGVDVDLGGDAYMNLMNAVNTGRIGKTALDASVARVLRLKFEMGLFE-NPYVDP-- 400

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           +     V S     LA + AQ  I LLKN H LLPL K++ V  ALIGPNA++   +LG+
Sbjct: 401 EKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKNRKV--ALIGPNADNRYNMLGD 458

Query: 441 YAGPSCRS--ITPLQALQNYVENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P   +   T L  ++  + ++   Y  GC      +  I++AV  A+ ++ ++ ++G
Sbjct: 459 YTAPQEEANIKTVLDGIRTKLSSSQVEYVKGCSIRDTVTTDIEQAVAAAQRSEIIIAVVG 518

Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
                                  +   +  E  DR  L L G+QQEL+ +  +   KP++
Sbjct: 519 GSSARDFKTSYKETGAAIANEKTISDMECGEGFDRATLSLLGKQQELL-KALKTTGKPLV 577

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P+D  +A    N  ++L A YPG+ G +A+A+V+FGD NP
Sbjct: 578 VVYIEGRPLDKNWA--SENADAVLTAYYPGQEGGIAIADVLFGDFNP 622


>gi|393781221|ref|ZP_10369422.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
           CL02T12C01]
 gi|392677556|gb|EIY70973.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
           CL02T12C01]
          Length = 946

 Score =  231 bits (588), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 184/610 (30%), Positives = 283/610 (46%), Gaps = 101/610 (16%)

Query: 49  TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----------------- 89
           T PI  R  DL+S++ L+EK  Q+V       + +  +P  EW                 
Sbjct: 57  TAPIDARIEDLLSQMNLNEKTCQMVTLYGYKRVLKDDLPTPEWKQMLWKDGMGAIDEHLN 116

Query: 90  ---------------WSEALHG----------VAGVGKGI---FFNGTIRG-----ATSF 116
                          W  + H           V     GI   F N  IRG     AT+F
Sbjct: 117 GFQQWGLPPSDNEYVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESYKATNF 176

Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
           P  +    +++  L ++IG   G EAR L       G T  +AP +++ RD RWGR +E 
Sbjct: 177 PTQLGLGHTWNRKLIHQIGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEV 229

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
            GE P +  +  +  VRG+Q +           Q +A  KHF AY  +        + D 
Sbjct: 230 YGESPYLVAELGIEMVRGMQHNH----------QVAATGKHFIAYSNNKGAREGMARVDP 279

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
           +++ +++   +  PF+  +++    G+M +YN  +G P  +    L+   R Q GF GY+
Sbjct: 280 QMSPREVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGQMGFRGYV 339

Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
            SD DAV  +Y   G AK  ++AV   ++AG++V C +F    +     +  V++  L E
Sbjct: 340 VSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVQEGGLSE 398

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
             I+  + ++  V+  +GLF+     Q   K   D V    ++ +ALQA+++ IVLLKN 
Sbjct: 399 EVINDRVRDILRVKFLVGLFDA--PYQTDLKGADDEVEKEENEAVALQASRESIVLLKNE 456

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITP----LQALQNYVENTVYYP 466
           +  LPL  +    +A+ GPNA      L +Y GP    +T     L+   N     +Y  
Sbjct: 457 NNTLPLDITSVKKIAVCGPNAAEKAYALTHY-GPLAVEVTTVVDGLREKLNGKAEVLYTK 515

Query: 467 GCDTVAC---------------SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDL 511
           GCD V                   + IDKAV  A+ AD  V+++G  Q    E   R  L
Sbjct: 516 GCDLVDAHWPESEIIDYPLSKDEQSEIDKAVAQAQEADVAVVVLGGGQRTCGENKSRSSL 575

Query: 512 VLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALA 571
            LPGRQ +L+ +  +A  KPVILVL+ G P+ + +A  D+ + +IL A YPG  G  A+A
Sbjct: 576 DLPGRQLDLL-KAVQATGKPVILVLINGRPLSVNWA--DKFVPAILEAWYPGSKGGTAIA 632

Query: 572 EVIFGDHNPG 581
           +V+FGD+NPG
Sbjct: 633 DVLFGDYNPG 642


>gi|385774250|ref|YP_005646817.1| glycoside hydrolase family protein [Sulfolobus islandicus HVE10/4]
 gi|323478365|gb|ADX83603.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           HVE10/4]
          Length = 754

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 175/569 (30%), Positives = 277/569 (48%), Gaps = 77/569 (13%)

Query: 50  LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
           L I Q  R   SRL L  K +++LVN          ++ IPA     E L G+ G     
Sbjct: 45  LGIGQITRVAGSRLGLKPKDVAKLVNKIQKFLVENTKMKIPAI-IHEECLSGLMGYS--- 100

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
                   +T+FPQ I  A++++  L   I   I  + R L    Q +     +P +++ 
Sbjct: 101 --------STAFPQAIGLASTWNLELVMDIASVIRSQGR-LVGVNQCL-----SPVLDVC 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           +DPRWGR +ET GEDP +     ++Y+ G+QGD           Q  A  KHF A+    
Sbjct: 147 KDPRWGRCEETYGEDPYLVASMGLAYITGLQGDN----------QLVATAKHFAAH---G 193

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           +    R      V  ++L +T+  PFE  VK G+   IM AY+ ++GIP   +  LL+  
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNI 253

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
            R++WGF G + SD D +  +      A +  +A +  L++G+D+   +     +    A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEFPTIDCYSEPLVNA 313

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
           + +  +PES IDRA+  +  ++ RLGL +      PF    +  + +     + LAL+ A
Sbjct: 314 LTEGLVPESLIDRAVERVLRIKDRLGLLD-----NPFVNENSVPEKLDDHKSRELALKTA 368

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
           ++ IVLLKN + +LPL K+ +  +A+IGPNAN  + +LG+Y              +T LQ
Sbjct: 369 RESIVLLKNENNILPLSKNVN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427

Query: 454 ALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD------------- 498
            +   V     +Y  GCD  + S     +A++IA+ AD ++ +MG               
Sbjct: 428 GVVKKVGESKVLYAKGCDIASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSE 487

Query: 499 ------QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
                 Q    E  DR  L LPG Q+EL+  + +   KP+ILVL+ G P  +  +     
Sbjct: 488 EEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTG-KPIILVLINGRP--LVLSPIINY 544

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           + +++ A +PGE G  A+A+VIFGD+NPG
Sbjct: 545 VKAVIEAWFPGEEGGNAIADVIFGDYNPG 573


>gi|189467715|ref|ZP_03016500.1| hypothetical protein BACINT_04107 [Bacteroides intestinalis DSM
           17393]
 gi|189435979|gb|EDV04964.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 943

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 164/507 (32%), Positives = 257/507 (50%), Gaps = 54/507 (10%)

Query: 105 FFNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWA 158
           F N  IRG     AT+FP  +    +++  L  ++G   G EAR L       G T  +A
Sbjct: 160 FTNEGIRGIESYRATNFPTQLGLGHTWNRELIRQVGLITGREARIL-------GYTNVYA 212

Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
           P +++ RD RWGR +E  GE P +  +  +  VRG+Q +           Q +A  KHF 
Sbjct: 213 PILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH----------QVAATGKHFV 262

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
           AY  +        + D +++ +++   +  PF+  +K+    G+M +YN  +G+P     
Sbjct: 263 AYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSY 322

Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
             L+   R + GF GY+ SD DAV  +Y     AK  ++AV   ++AG++V C +F    
Sbjct: 323 YWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPD 381

Query: 339 T-----KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA-H 392
           +     +  VK+  L E  I+  + ++  V+  +GLF+  P        GAD     A +
Sbjct: 382 SYVLPLRELVKEGGLSEEVINDRVRDILRVKFLIGLFDA-PYQTDLA--GADREVEKAEN 438

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + LALQA+++ +VLLKN + +LPL  +    +A+ GPNA+     L +Y   +    T L
Sbjct: 439 ESLALQASRESLVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVL 498

Query: 453 QALQNYVE---NTVYYPGCDTVACS---------------SASIDKAVDIAKGADHVVLM 494
           + ++   E     +Y  GCD V  +                A IDKAV+ A+ AD  V++
Sbjct: 499 EGIRQKSEGKAEVLYTKGCDLVDANWPESELIDYPMTDNEQAEIDKAVENARQADVAVVV 558

Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
           +G  Q    E   R  L LPGRQ +L+  V +A  KPV+LVL+ G P+ I +A  D+ + 
Sbjct: 559 LGGGQRTCGENKSRSSLDLPGRQLKLLQAV-QATGKPVVLVLINGRPLSINWA--DKFVP 615

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPG 581
           +IL A YPG  G  A+A+V+FGD+NPG
Sbjct: 616 AILEAWYPGSKGGTAVADVLFGDYNPG 642


>gi|383110724|ref|ZP_09931543.1| hypothetical protein BSGG_1833 [Bacteroides sp. D2]
 gi|382949470|gb|EFS31133.2| hypothetical protein BSGG_1833 [Bacteroides sp. D2]
          Length = 783

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 165/527 (31%), Positives = 260/527 (49%), Gaps = 62/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT FP  I  AA++   L   +G+AIG 
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R         G   + P +++ RDPRW R +ET GEDP++TG+   + V G+      
Sbjct: 179 EIRL------QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGL-----G 227

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            G L       A  KHF AY +          F     +++L + + PPF   +  G A 
Sbjct: 228 SGDLSHPYSTLATLKHFLAYGISESGQNGNPSFAG---IRELHENFLPPFRQAIDAG-AL 283

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++GIP  A+ +LL++  R +W F G + SD  ++  I+ +   A + E A +
Sbjct: 284 SVMTSYNSMDGIPCTANHSLLTELLRNEWKFSGIVVSDLYSIEGIHQSHFVAPTMEAAAI 343

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             L AG+DV+ G     +   AV   ++ ++ +D ++  +  ++  +GLF  NP + P  
Sbjct: 344 LALSAGVDVDLGGDAYMNLMNAVNTGRISKTALDASVARVLRLKFEMGLFE-NPYVDP-- 400

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           +     V S     LA + AQ  I LLKN H LLPL K++ V  ALIGPNA++   +LG+
Sbjct: 401 EKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLNKNRKV--ALIGPNADNRYNMLGD 458

Query: 441 YAGPSCRS--ITPLQALQNYVENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P       T L  ++  + ++   Y  GC      +  I++AV  A+ ++ ++ ++G
Sbjct: 459 YTAPQEEENIKTVLDGIRTKLSSSQVEYVKGCSIRDTVTTDIEQAVAAAQRSEVIIAVVG 518

Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
                                  +   +  E  DR  L L G+QQEL+ +  +A  KP+I
Sbjct: 519 GSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-KALKATGKPLI 577

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P+D T+A    N  ++L A YPG+ G  A+A+V+FGD+NP
Sbjct: 578 VVYIEGRPLDKTWA--SENADAVLTAYYPGQEGGNAIADVLFGDYNP 622


>gi|284174578|ref|ZP_06388547.1| Beta-xylosidase [Sulfolobus solfataricus 98/2]
 gi|356934752|gb|AET42953.1| beta-xylosidase-like protein [Sulfolobus solfataricus 98/2]
          Length = 754

 Score =  230 bits (587), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 178/568 (31%), Positives = 276/568 (48%), Gaps = 77/568 (13%)

Query: 50  LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
           L I Q  R   SRL L  K + +LVN          RL IPA     E L G+ G     
Sbjct: 45  LGIGQITRVAGSRLDLKPKEVVKLVNKVQKFLVENTRLKIPAI-IHEECLSGLMGYS--- 100

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
                   +T+FPQ I  A++++  L   +   I  + R L    Q +     +P +++ 
Sbjct: 101 --------STAFPQAIGLASTWNPELLTNVASTIRSQGR-LIGVNQCL-----SPVLDVC 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGR +ET GEDP +     ++Y+ G+QG+T          Q  A  KHF A+    
Sbjct: 147 RDPRWGRCEETYGEDPYLVASMGLAYITGLQGET----------QLVATAKHFAAH---G 193

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           +    R      V  ++L +T+  PFE  VK G+   IM AY+ ++G+P   +  LL+  
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNI 253

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
            R++WGF G + SD D +  +      A +  +A +  L++G+D+   +     +    A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLEAIHKVASNKMEAAILALESGVDIEFPTIDCYGEPLVTA 313

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
           +K+  + E+ IDRA+  +  ++ RLGL +      PF    A  + +     + LAL+AA
Sbjct: 314 IKEGLVSEAIIDRAVERVLRIKERLGLLD-----NPFVDESAVPERLDDRKSRELALKAA 368

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
           ++ IVLLKN + +LPL K+ +  +A+IGPNAN  + +LG+Y              +T LQ
Sbjct: 369 RESIVLLKNENNMLPLSKNIN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427

Query: 454 ALQNYV-ENTVYY-PGCDTVACSSASIDKAVDIAKGADHVVLMMGLD------------- 498
            +   V E  V Y  GCD    S     +A++IAK AD ++ +MG               
Sbjct: 428 GIAKKVGEGKVLYAKGCDIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSE 487

Query: 499 ------QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
                 Q    E  DR  L L G Q+EL+  + +   KP+ILVL+ G P  +  +     
Sbjct: 488 EEFKKYQAVTGEGNDRASLRLLGVQEELLKELYKTG-KPIILVLINGRP--LVLSPIINY 544

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + +I+ A +PGE G  A+A++IFGD+NP
Sbjct: 545 VKAIIEAWFPGEEGGNAIADIIFGDYNP 572


>gi|423342899|ref|ZP_17320613.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409217154|gb|EKN10133.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 955

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 177/612 (28%), Positives = 288/612 (47%), Gaps = 105/612 (17%)

Query: 49  TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALH 95
           T PI  R  DL+S++ ++EK  Q+V       + +  +P  +W    W        E L+
Sbjct: 65  TAPIDARVEDLLSQMNVEEKTCQMVTLYGYKRVLKDDLPTPDWKNQLWKDGMGAIDEHLN 124

Query: 96  GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
           G    G                                GI   F N  IRG     AT+F
Sbjct: 125 GFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYIATNF 184

Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
           P  +    +++  L +++G   G E R L       G T  +AP +++ RD RWGR +E 
Sbjct: 185 PTQLGLGHTWNRDLVHKVGYITGREGRLL-------GYTNVYAPILDVGRDQRWGRYEEV 237

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
            GE P +  +  V   +G+Q D           Q +A  KH+ AY  +        + D 
Sbjct: 238 YGESPYLVAELGVEMAKGMQTD----------YQVAATSKHYIAYSNNKGGREGMARVDP 287

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
           +++ +++   +  P++  +K+    G+M +YN  +G P  +    L+   R ++GF GY+
Sbjct: 288 QMSPREVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYV 347

Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
            SD DAV  ++   G A   +++V+  + AG+++ C +F    +     +  + +  LP 
Sbjct: 348 VSDSDAVEYLFSKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGALPM 406

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFG---KIGADVVCSPAHQVLALQAAQDGIVLL 407
           S ID  + ++  V+  +GLF+     QP+    K     V S  +Q +ALQA+++ +VLL
Sbjct: 407 STIDDRVRDILRVKFLVGLFD-----QPYQIDLKQADKEVNSAENQQVALQASKESLVLL 461

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVY 464
           KN   +LPL  +K   +A+ GPNA+     L +Y   +    T L+ +QN V+     ++
Sbjct: 462 KNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEVLF 521

Query: 465 YPGCDTVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
             GCD V  +                + IDKAV+ AK +D  V+++G       E   R 
Sbjct: 522 TKGCDLVDANWPESELIRYPLTSEEQSEIDKAVENAKKSDVAVVVLGGSNRTCGENKSRS 581

Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
            L LPGRQ +L+  V  A  KPV+LVL+ G P+ I +A  D+ + +IL A YPG  G  A
Sbjct: 582 SLELPGRQLDLLQAVV-ATGKPVVLVLINGRPISINWA--DKYVPAILEAWYPGSQGGTA 638

Query: 570 LAEVIFGDHNPG 581
           +A+ +FGD+NPG
Sbjct: 639 IADALFGDYNPG 650


>gi|336415919|ref|ZP_08596257.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
           3_8_47FAA]
 gi|335939822|gb|EGN01694.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
           3_8_47FAA]
          Length = 782

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 166/527 (31%), Positives = 263/527 (49%), Gaps = 61/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT FP  I  AA++   L   +GQ I  
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R+        G   + P +++ RDPRW R +ET GEDP+++G    S V     D   
Sbjct: 177 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMV-----DGLG 225

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
           GG L  K    A  KHF AY +        Y   A V ++DL   + PPF   +  G A 
Sbjct: 226 GGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG-AL 281

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++GIP  ++ NLL++  R +W F G++ SD  ++  I+++   A + E+A +
Sbjct: 282 SVMTSYNSIDGIPCTSNHNLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAI 341

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             + AG+DV+ G     +   AV+  ++ ++ ID A+  +  ++  +GLF  +P + P  
Sbjct: 342 QSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRVLRMKFEMGLFE-HPYVDP-- 398

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           KI A  V    H  LA + AQ  I LLKN + +LPL K+ +  +A+IGPNA++   +LG+
Sbjct: 399 KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSKTIN-KVAVIGPNADNRYNMLGD 457

Query: 441 YAGPSCRS--ITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P   S   T L  +   +      Y  GC     +   I++A++ A+ ++ V++++G
Sbjct: 458 YTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVG 517

Query: 497 LDQTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
               ++                        E  DR  L L GRQQEL+  + +   KP+I
Sbjct: 518 GSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTG-KPLI 576

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P++  +A       ++L A YPG+ G  A+A+V+FGD+NP
Sbjct: 577 VVYIEGRPLEKNWA--SEYADALLTAYYPGQEGGNAIADVLFGDYNP 621


>gi|307109345|gb|EFN57583.1| hypothetical protein CHLNCDRAFT_34794, partial [Chlorella
           variabilis]
          Length = 377

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 134/355 (37%), Positives = 196/355 (55%), Gaps = 14/355 (3%)

Query: 112 GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTF-----WAPNINIFRD 166
           G T +P  I  AA+FD  L  R    +G E RA YN     G        + P+++I RD
Sbjct: 13  GGTIYPINIAWAATFDDGLALRAASQVGDEMRAQYNRRARAGGPLAYSNCFGPHVHIVRD 72

Query: 167 PRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWK 226
           PRWGR  ET GEDPL+    AV++VRG+QG    G      ++  A CKHF   DL+ W+
Sbjct: 73  PRWGRMAETFGEDPLLQSNMAVAHVRGLQG----GAGTDTYIKTVATCKHFIGNDLEGWE 128

Query: 227 GTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTAR 286
           G TR+ FDA +T +DL D++ PPFE+CV++G A  +MC+YN +NG+P+C ++ LL+   R
Sbjct: 129 GVTRHTFDANITERDLRDSFLPPFEACVREGGALAVMCSYNSLNGLPACVNKPLLTGLLR 188

Query: 287 RQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKA-GMDVNCGSFLQKHTKAAVKQ 345
            + GF G + +DC A++ I       K P  A  D  +A  + V  G+ +  H    ++ 
Sbjct: 189 GELGFAGMVVTDCTALTRIVKPR--PKGPHFAGGDKRRASALAVQAGTDMACHLFDMLQP 246

Query: 346 KKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIV 405
             + + ++D A+  +   R+R G FN    + PF  +   V    AH+  A + A  G V
Sbjct: 247 GDVSQRDLDAAVRRVLHNRVRQGHFNPLSEL-PFDHLDGSVFGRAAHRDTAREIAAKGTV 305

Query: 406 LLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYV 459
           LLKN+ G LPL  +    +A+IGP A+    +LG Y G +   I TPLQA+Q+ V
Sbjct: 306 LLKNAEGTLPLRPAAMKQVAVIGPFADQPTYILGKYYGATAGPITTPLQAIQDAV 360


>gi|15899739|ref|NP_344344.1| Beta-xylosidase [Sulfolobus solfataricus P2]
 gi|13816430|gb|AAK43134.1| Beta-xylosidase [Sulfolobus solfataricus P2]
          Length = 754

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 178/568 (31%), Positives = 276/568 (48%), Gaps = 77/568 (13%)

Query: 50  LPISQRARDLVSRLTLDEK-ISQLVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGI 104
           L I Q  R   SRL L  K + +LVN          RL IPA     E L G+ G     
Sbjct: 45  LGIGQITRVAGSRLGLKPKEVVKLVNKVQKFLVENTRLKIPAI-IHEECLSGLMGYS--- 100

Query: 105 FFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIF 164
                   +T+FPQ I  A++++  L   +   I  + R L    Q +     +P +++ 
Sbjct: 101 --------STAFPQAIGLASTWNPELLTNVASTIRSQGR-LIGVNQCL-----SPVLDVC 146

Query: 165 RDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDN 224
           RDPRWGR +ET GEDP +     ++Y+ G+QG+T          Q  A  KHF A+    
Sbjct: 147 RDPRWGRCEETYGEDPYLVASMGLAYITGLQGET----------QLVATAKHFAAH---G 193

Query: 225 WKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKT 284
           +    R      V  ++L +T+  PFE  VK G+   IM AY+ ++G+P   +  LL+  
Sbjct: 194 FPEGGRNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNI 253

Query: 285 ARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSF--LQKHTKAA 342
            R++WGF G + SD D +  +      A +  +A +  L++G+D+   +     +    A
Sbjct: 254 LRQEWGFDGIVVSDYDGIRQLEAIHKVASNKMEAAILALESGVDIEFPTIDCYGEPLVTA 313

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGA--DVVCSPAHQVLALQAA 400
           +K+  + E+ IDRA+  +  ++ RLGL +      PF    A  + +     + LAL+AA
Sbjct: 314 IKEGLVSEAIIDRAVERVLRIKERLGLLD-----NPFVDESAVPERLDDRKSRELALKAA 368

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA-------GPSCRSITPLQ 453
           ++ IVLLKN + +LPL K+ +  +A+IGPNAN  + +LG+Y              +T LQ
Sbjct: 369 RESIVLLKNENNMLPLSKNIN-KIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQ 427

Query: 454 ALQNYV-ENTVYY-PGCDTVACSSASIDKAVDIAKGADHVVLMMGLD------------- 498
            +   V E  V Y  GCD    S     +A++IAK AD ++ +MG               
Sbjct: 428 GIAKKVGEGKVLYAKGCDIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSE 487

Query: 499 ------QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRN 552
                 Q    E  DR  L L G Q+EL+  + +   KP+ILVL+ G P  +  +     
Sbjct: 488 EEFKKYQAVTGEGNDRASLRLLGVQEELLKELYKTG-KPIILVLINGRP--LVLSPIINY 544

Query: 553 IGSILWAGYPGEAGAVALAEVIFGDHNP 580
           + +I+ A +PGE G  A+A++IFGD+NP
Sbjct: 545 VKAIIEAWFPGEEGGNAIADIIFGDYNP 572


>gi|398386387|ref|ZP_10544389.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
 gi|397718418|gb|EJK79007.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
          Length = 791

 Score =  229 bits (585), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 178/526 (33%), Positives = 264/526 (50%), Gaps = 59/526 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP   +  E LHG A VG           ATSFPQ I  A+S+D  +  ++ Q IG 
Sbjct: 138 RLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPTMLRQVNQVIGR 185

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E RA     + + M   +P ++I RDPRWGR +ET GEDP + G+  V+ V G+QG+  +
Sbjct: 186 EIRA-----RGVPMVL-SPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQGEGRS 239

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
             +L       A  KH T +      GT      A V+ ++L + + PPFE  VK+    
Sbjct: 240 --RLLRPGHVFATLKHLTGHGQPE-SGTN--VGPAPVSERELRENFFPPFEQVVKRTGIE 294

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G+PS A+R LL    R++WGF G + SD  AV  +      A + E+A +
Sbjct: 295 AVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQLMSIHHIAANLEEAAM 354

Query: 321 DVLKAGMDVNCGSFLQKHTKAA-VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             L AG+D +    L   T    V++ K+ E+++D A+  +  ++ R GLF  NP     
Sbjct: 355 RALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLFE-NPYAD-- 411

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
               A +  +   + LA  AAQ  I LLKN  G+LPL    ++  A+IGP+A  A+  LG
Sbjct: 412 ANAAAAITNNDEARALARTAAQRSITLLKND-GMLPLKPEGTI--AVIGPSAAVAR--LG 466

Query: 440 NYAGPSCRSITPLQALQNYV---ENTVYYPGC----------DTVACSSAS-----IDKA 481
            Y G    S++ L+ ++  V    N V+  G           D V  S  +     I +A
Sbjct: 467 GYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITENDDWWEDKVVKSDPAENRKLIAQA 526

Query: 482 VDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           V+ A+  D ++L +G  +   +E        DR  L L G QQEL   + +A  KP+ +V
Sbjct: 527 VEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDAL-KALGKPITVV 585

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           L+ G P   +  K      +IL   Y GE G  A+A+++FGD NPG
Sbjct: 586 LINGRPA--STVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPG 629


>gi|336430645|ref|ZP_08610589.1| hypothetical protein HMPREF0994_06595 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336017554|gb|EGN47314.1| hypothetical protein HMPREF0994_06595 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 962

 Score =  229 bits (584), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 132/392 (33%), Positives = 211/392 (53%), Gaps = 16/392 (4%)

Query: 44  PFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           P     LP+ +R   L+  +T +EKI+ L    P I RLGI A     EA HG+      
Sbjct: 8   PLWNNRLPVEERLDYLIGEMTTEEKIACLTTGCPDISRLGIRASYMGGEAAHGIEARHDQ 67

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINI 163
            F  G     TSF Q I  +ASFD  L    G+ +G EARAL+    + G+  WAP +++
Sbjct: 68  AFNKGEPEPTTSFTQPIGMSASFDRELIRECGRCVGEEARALFTRNGSGGLCRWAPTVDM 127

Query: 164 FRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLD 223
            RDPRWGR +E  GEDP +TG+ A SY++G++GD          ++  A  KHF A +++
Sbjct: 128 ERDPRWGRTEEAYGEDPYLTGEMASSYIQGMKGDD------PFYIRCGATLKHFYANNVE 181

Query: 224 NWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSK 283
                 R    + +  ++  + Y  PF   + +G A  +M +YN +NGIP+  +  + + 
Sbjct: 182 K----DRISISSSLDRRNKYEYYLEPFRKAIVEGGAEAVMTSYNEINGIPAIVNEEVRT- 236

Query: 284 TARRQWGFHGYITSD-CDAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGSFLQKHTK 340
             +  WG  G++  D  D    +YD + Y K+ E+ V   LKAG+D   +  S + +  +
Sbjct: 237 ILKEAWGLPGHVVCDGGDMQQTVYDHK-YFKTHEETVAYGLKAGVDCFTDDKSVVMEAAR 295

Query: 341 AAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAA 400
            A+++  + E +IDR++ N F  R+RLG F+G+    P+  +G + V +  H+ + ++ A
Sbjct: 296 KALEKGMITEEDIDRSIRNSFRTRIRLGFFDGDGEC-PYTGMGEEYVNNQEHRDICVKMA 354

Query: 401 QDGIVLLKNSHGLLPLPKSKSVSLALIGPNAN 432
           ++ +VLLKN   +LP    K+ SLA+IGP A+
Sbjct: 355 EESVVLLKNEKNILPFLPEKTESLAIIGPLAD 386



 Score = 45.8 bits (107), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 34/116 (29%), Positives = 54/116 (46%), Gaps = 4/116 (3%)

Query: 466 PGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQT-QEKEELDRVDLVLPGRQQELITRV 524
           P   T+   +  I +AV  A+ A   VL++G +     KEE+DR  L LP  QQ L   V
Sbjct: 542 PAVFTIEVEADGIKEAVHAAENAQRAVLVLGCNPVINSKEEIDRSTLALPPFQQNLADAV 601

Query: 525 AEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            +A  +  I+VLL   P  I   +    +  I+W+    +     +A ++ G  +P
Sbjct: 602 RKANPE-TIVVLLSNYPYSIN--RLQEEMPGIIWSASGSQELGTGVASILSGKVSP 654


>gi|160882312|ref|ZP_02063315.1| hypothetical protein BACOVA_00260 [Bacteroides ovatus ATCC 8483]
 gi|423303026|ref|ZP_17281047.1| hypothetical protein HMPREF1057_04188 [Bacteroides finegoldii
           CL09T03C10]
 gi|156112320|gb|EDO14065.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
 gi|408470355|gb|EKJ88890.1| hypothetical protein HMPREF1057_04188 [Bacteroides finegoldii
           CL09T03C10]
          Length = 842

 Score =  229 bits (584), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 150/423 (35%), Positives = 222/423 (52%), Gaps = 39/423 (9%)

Query: 49  TLPI-SQRARDLVSR----LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           TLP+ +Q  R+ V +    L++++KI  L    P +P+ G+  Y+WWSE LHGVA  GK 
Sbjct: 17  TLPLWAQPHREGVQQKLQSLSVEQKIDLLCARYPGVPQEGLAKYDWWSECLHGVARAGK- 75

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN--------AGQAIGMT 155
                    AT FP+ I   +++D+ L +RIG AIG EARA ++        + +  G+T
Sbjct: 76  ---------ATVFPKPIGMGSTWDAELIHRIGIAIGDEARAKHHQDVREHGYSDRHYGLT 126

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           F++P +NI RDPRWGR  E   EDPL+T    VS+++G+QGD          L+  A  K
Sbjct: 127 FFSPTLNIARDPRWGRTTECFSEDPLLTSDMGVSFIKGMQGDD------PYYLKTVATAK 180

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           HF A + +N     R    A V    L + Y P F   V +G    IM AYN +NGIP C
Sbjct: 181 HFVANNEEN----RRLGGSAIVDEVSLREYYFPAFRDAVVRGYVKSIMGAYNALNGIPCC 236

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
           A+  LL+   R++WGF G + SD  A+  +Y    Y  + E+     L+AG D++     
Sbjct: 237 ANSMLLTDVLRKEWGFEGVVISDGSAIDKLYTHHKYVPNLEEGAALALRAGCDMSLRDEY 296

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
           +   K A  +  + E +++ A+  +  +R RLGL  G     P+      VV    HQ L
Sbjct: 297 RPGLKKAYHKGLITEHDLNVAVARVLDLRARLGLDKGTDQGNPYKNTPYSVVECERHQQL 356

Query: 396 ALQAAQDGIVLLKNSHGLLPLP---KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           AL+AA+  ++LLKN  G+LPL      + +++ALIG    S     G+Y+G    + T L
Sbjct: 357 ALEAAEKSMILLKNE-GVLPLKPDASKRKLNIALIGDAFKS--VYYGDYSGMPEYNRTLL 413

Query: 453 QAL 455
           +A 
Sbjct: 414 EAF 416


>gi|329923020|ref|ZP_08278536.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328941793|gb|EGG38078.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 763

 Score =  229 bits (584), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 171/544 (31%), Positives = 270/544 (49%), Gaps = 64/544 (11%)

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
           E ++ +   A    RLGIP   +  E  HG   +G           AT FP  +   +++
Sbjct: 90  EAVNAIQRYAMEHSRLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTW 137

Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
           ++ L+  I +A+  E RA   A        ++P +++ RDPRWGR +ET GEDP +  ++
Sbjct: 138 NTELFRSISRAVAAETRAQGGAAT------YSPVLDVVRDPRWGRTEETFGEDPHLVAEF 191

Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
           AV+ V+G+QG+     +L       A  KHF  Y         R      + +++L +  
Sbjct: 192 AVAAVQGLQGE-----RLDSHTSLLATLKHFAGYGASE---GGRNGAPVHMGLRELHEVD 243

Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
             PF   V+ G  S IM AYN ++G+P  + R LL    R  WGF G++ +DC A+ ++ 
Sbjct: 244 LLPFRKAVESGALS-IMTAYNEIDGVPCTSSRYLLQNVLREAWGFDGFVITDCGAIHMLA 302

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRM 365
                A S  +A    LKAG+D+   G+  + H + A++Q  + E +++RA   +  ++ 
Sbjct: 303 CGHNTAGSGVEAATQSLKAGVDMEMSGTMFRAHLQQALEQGLITEDDLNRAAGRVLELKF 362

Query: 366 RLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLA 425
           RLGLF+  P + P       V+    H  LA QAA +GIVLLKN   LLPL  S S ++A
Sbjct: 363 RLGLFD-RPYVDP--AWAEQVIGCKEHIALAYQAAAEGIVLLKNEGNLLPL-DSSSGTIA 418

Query: 426 LIGPNANSAKTLLGNYAGPS--CRSITPLQALQNYVENT--VYYPGCDTVACSSASIDKA 481
           +IGPNA++    LG+Y  P    + +T L  ++  + ++  +Y PGC     S     +A
Sbjct: 419 VIGPNAHTPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVLYAPGCRIQGDSREGFPRA 478

Query: 482 VDIAKGADHVVLMMGLDQTQE-------------------------KEELDRVDLVLPGR 516
           +  A+ AD +V+++G    ++                          E +DR  L L G 
Sbjct: 479 LACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGDAKSDMECGEGIDRSTLTLMGV 538

Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           Q EL+  + +   KPVI+V + G P  IT    D  I +I+ A YPG+ G  A+A+++FG
Sbjct: 539 QLELLQELQKLG-KPVIVVYINGRP--ITEPWIDEFIPAIIEAWYPGQEGGGAIADMLFG 595

Query: 577 DHNP 580
           D NP
Sbjct: 596 DINP 599


>gi|423222018|ref|ZP_17208488.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392644204|gb|EIY37946.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 942

 Score =  229 bits (583), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 164/507 (32%), Positives = 255/507 (50%), Gaps = 54/507 (10%)

Query: 105 FFNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWA 158
           F N  IRG     AT+FP  +    +++  L  ++G   G EAR L       G T  +A
Sbjct: 160 FTNEGIRGVESYRATNFPTQLGLGHTWNRELIRQVGLITGREARML-------GYTNVYA 212

Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
           P +++ RD RWGR +E  GE P +  +  +  VRG+Q             Q +A  KHF 
Sbjct: 213 PILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHSH----------QVAATGKHFV 262

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
           AY  +        + D +++ +++   +  PF+  +K+    G+M +YN  +G+P     
Sbjct: 263 AYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSY 322

Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
             L+   R + GF GY+ SD DAV  +Y     AK  ++AV   ++AG++V C +F    
Sbjct: 323 YWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRC-TFRSPD 381

Query: 339 T-----KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA-H 392
           +     +  VK+  L E  I+  + ++  V+  +GLF+  P        GAD     A +
Sbjct: 382 SYVLPLRELVKEGGLSEEVINDRVRDILRVKFLVGLFD-TPYQTDLA--GADKEVEKAEN 438

Query: 393 QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           + LALQA+++ +VLLKN + +LPL  +    +A+ GPNA+     L +Y   +    T L
Sbjct: 439 ESLALQASRESLVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVL 498

Query: 453 QALQNYVE---NTVYYPGCDTVACS---------------SASIDKAVDIAKGADHVVLM 494
           + ++   E     +Y  GCD V  +                A IDKAV+ A+ AD  V++
Sbjct: 499 EGIRQKAEGKAEVLYTKGCDLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVV 558

Query: 495 MGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIG 554
           +G  Q    E   R  L LPGRQ +L+  V +A  KPV+LVL+ G P+ I +A  D+ + 
Sbjct: 559 LGGGQRTCGENKSRSSLDLPGRQLKLLQAV-QATGKPVVLVLINGRPLSINWA--DKFVP 615

Query: 555 SILWAGYPGEAGAVALAEVIFGDHNPG 581
            IL A YPG  G  A+A+V+FGD+NPG
Sbjct: 616 VILEAWYPGSKGGTAVADVLFGDYNPG 642


>gi|374374543|ref|ZP_09632202.1| Beta-glucosidase [Niabella soli DSM 19437]
 gi|373233985|gb|EHP53779.1| Beta-glucosidase [Niabella soli DSM 19437]
          Length = 799

 Score =  229 bits (583), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 155/524 (29%), Positives = 260/524 (49%), Gaps = 58/524 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP  ++ +E +HG+               AT+FP  I   ++++  L +++GQ IG 
Sbjct: 138 RLGIPV-DFTNEGIHGL-----------NQDHATAFPAPIGIGSTWNKELVHQMGQIIGR 185

Query: 141 EARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
           EA+AL       G T  +AP +++ RD RWGR  ET GEDP +      +   G+Q    
Sbjct: 186 EAKAL-------GYTNVYAPILDVARDQRWGRVVETYGEDPFLVAGLGTALAGGIQE--- 235

Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
           NG         ++  KHF  Y +         + D  V  +++   +  PF   ++    
Sbjct: 236 NG--------VASTLKHFAVYSVPKGGRDGNARTDPHVAPREMQQLFLYPFRKVIQNVHP 287

Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
            G+M +YN  +G+P  A    L++  R+Q+GF GY+ SD  AV  +Y+    AK  ++AV
Sbjct: 288 LGVMSSYNDWDGMPVTASNYFLTQLLRQQFGFDGYVVSDSRAVEFVYEKHHVAKDYKEAV 347

Query: 320 VDVLKAGMDV----NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPT 375
             V++AG++V    N  S      +  +K+  L    +++ +  + SV+ RLGLF+    
Sbjct: 348 KMVMEAGLNVRTEFNAPSNFILPLRQLIKEGGLSMETLNQRVGEVLSVKFRLGLFDAPYV 407

Query: 376 MQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAK 435
             P  K    +V + A + +ALQ  ++ +VLLKN   +LPL   +  ++ + GP A+  +
Sbjct: 408 KDP--KAADKIVATEASEAVALQMNRESLVLLKNDKNILPLSLGQYRNILVTGPLADEKE 465

Query: 436 TLLGNYAGPSCRSITPLQALQNYVENTV---YYPGCDTVACS---------------SAS 477
             +  Y   + + I+ L+ ++++        Y  GC+    +                A 
Sbjct: 466 HAISRYGPSNKKVISVLEGIRHFAAKKATINYIKGCEAADATWPESEIIDTPPTPQEIAE 525

Query: 478 IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLL 537
           ++KAV+ AK  D ++ +MG +  Q  E L R  L LPGRQ  L+  + +   KP++L+L+
Sbjct: 526 MNKAVEAAKQNDIIIAVMGENDKQVGESLSRTGLNLPGRQLRLLEELKKTG-KPMVLILI 584

Query: 538 CGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
            G P+ I +   +R + +IL   +PG AG  A+AE IFG +NPG
Sbjct: 585 NGQPLTINWE--NRYLDAILETWFPGPAGGTAVAEAIFGAYNPG 626


>gi|51507369|emb|CAH18932.1| beta-xylosidase [Pyrus communis]
          Length = 238

 Score =  229 bits (583), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 110/239 (46%), Positives = 164/239 (68%), Gaps = 2/239 (0%)

Query: 343 VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQD 402
           ++  ++ E +I+ AL N  +V+MRLG+F+G P+ Q +G +G   VC P+   LAL+AA+ 
Sbjct: 1   MRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQRYGNLGLADVCKPSSNELALEAARQ 60

Query: 403 GIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT 462
           GIVLL+N    LPL   +  ++A+IGPN++  +T++GNYAG +C   TPLQ +  Y   T
Sbjct: 61  GIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMIGNYAGVACGYTTPLQGIARYTR-T 119

Query: 463 VYYPGCDTVACSSASIDKAVDIA-KGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
           ++  GC  V C+   +  A ++A + AD  VL++GLDQ+ E E  DR +L+LPG QQEL+
Sbjct: 120 IHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLDQSIEAEFRDRTNLLLPGHQQELV 179

Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +RVA A++ P ILV++ GGP+D+ FAK D  IG+I+W GYPG+AG  A+A+V+FG  NP
Sbjct: 180 SRVARASRGPTILVIMSGGPIDVMFAKNDPRIGAIIWVGYPGQAGGTAIADVLFGTTNP 238


>gi|329922637|ref|ZP_08278189.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328941979|gb|EGG38262.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 765

 Score =  229 bits (583), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 168/552 (30%), Positives = 268/552 (48%), Gaps = 67/552 (12%)

Query: 67  EKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASF 126
           E ++ +   A    RLGIP      E  HG   +G            T FP  +   +++
Sbjct: 89  EAVNHIQRYAIEQSRLGIPIL-IGEECSHGHMAIG-----------GTVFPVPLSIGSTW 136

Query: 127 DSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKY 186
           +  L+  + +A+ LE R+        G   ++P +++ RDPRWGR +E  GEDP +  +Y
Sbjct: 137 NLDLYRDMCRAVALETRS------QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEY 190

Query: 187 AVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTY 246
           AV+ V G+QG++     L      +A  KHF  Y         R      +  ++L +  
Sbjct: 191 AVASVEGLQGES-----LDSPSSVAATLKHFVGYGSSE---GGRNAGPVHMGTRELMEVD 242

Query: 247 QPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIY 306
             PF+  V+ G AS IM AYN ++G+P   +  LL    R++WGF G + +DC A+ ++ 
Sbjct: 243 MLPFKKAVEAGAAS-IMPAYNEIDGVPCTVNTELLDGILRKEWGFDGMVITDCGAIDMLA 301

Query: 307 DAEGYAKSPEDAVVDVLKAGMDVN-CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRM 365
                A+   DA V  ++AG+D+   G    KH + AV+  KL  S +D A+  + +++ 
Sbjct: 302 SGHDTAEDGMDAAVQAIRAGIDLEMSGEMFGKHLQKAVESNKLEVSVLDEAVRRVLTLKF 361

Query: 366 RLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLA 425
           +LGLF  NP + P  +   +V+ S  H  LA Q A +GIVLLKN    LPL K   V +A
Sbjct: 362 KLGLFE-NPYVDP--QTAENVIGSGQHIGLARQLAAEGIVLLKNEAKALPLSKEGGV-IA 417

Query: 426 LIGPNANSAKTLLGNYAGPS-CRSITPL-----QALQNYVENTVYYPGCDTVACSSASID 479
           +IGPNA+     LG+Y  P    ++T +       L    +  +Y PGC     S    +
Sbjct: 418 VIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPGCRIKDDSREGFE 477

Query: 480 KAVDIAKGADHVVLMMGLDQTQE-------------------------KEELDRVDLVLP 514
            A+  A+ AD VV+++G    ++                          E +DR+ L L 
Sbjct: 478 FALSCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSDMDCGEGIDRMTLQLS 537

Query: 515 GRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVI 574
           G Q +L   + +  K+ +I+V + G P+   +   D +  +IL A YPG+ G  A+A+++
Sbjct: 538 GVQLDLAQEIHKLGKR-MIVVYINGRPIAEPW--IDEHADAILEAWYPGQEGGHAIADIL 594

Query: 575 FGDHNP-GEFTV 585
           FGD NP G+ T+
Sbjct: 595 FGDVNPSGKLTM 606


>gi|167765093|ref|ZP_02437206.1| hypothetical protein BACSTE_03479 [Bacteroides stercoris ATCC
           43183]
 gi|167696721|gb|EDS13300.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           stercoris ATCC 43183]
          Length = 944

 Score =  228 bits (582), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 159/506 (31%), Positives = 252/506 (49%), Gaps = 52/506 (10%)

Query: 105 FFNGTIRG-----ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWA 158
           F N  IRG     AT+FP  +    +++  L  ++G   G EAR L       G T  +A
Sbjct: 160 FTNEGIRGVESYKATNFPTQLGLGHTWNRELIRQVGLITGREARML-------GYTNVYA 212

Query: 159 PNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT 218
           P +++ RD RWGR +E  GE P +  +  +  VRG+Q +           Q +A  KHF 
Sbjct: 213 PILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH----------QVAATAKHFA 262

Query: 219 AYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADR 278
           AY  +        + D ++  +++ + +  PF+  +++    G+M +YN  +GIP     
Sbjct: 263 AYSNNKGAREGMARVDPQMPPREVENIHIYPFKRVIREAGLLGVMSSYNDYDGIPIQGSY 322

Query: 279 NLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-----GS 333
             L+   R++ GF GY+ SD DAV  +Y     AK  ++AV   ++AG++V C      S
Sbjct: 323 YWLTTRLRKEMGFRGYVVSDSDAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRCTFRSPDS 382

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
           F+    +  VK+  L E  I+  + ++  V+  +GLF+        G    D V   A++
Sbjct: 383 FVLP-LRELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAG--ADDEVEKEANE 439

Query: 394 VLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQ 453
            +ALQA+++ IVLLKN+   LPL   K   +A+ GPNA+     L +Y   +    T L+
Sbjct: 440 AVALQASRESIVLLKNTDNTLPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTVLE 499

Query: 454 ALQNYVE---NTVYYPGCDTVAC---------------SSASIDKAVDIAKGADHVVLMM 495
            ++   +     +Y  GCD V                   A ID+AV  A+ AD  V+++
Sbjct: 500 GIREKAQGKAEVLYTKGCDLVDAHWPESEIMEYPLTPDEQAEIDRAVANARQADVAVVVL 559

Query: 496 GLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGS 555
           G  Q    E   R  L LPG Q +L+  V +A  KPVIL+L+ G P+ + +A  D+ + +
Sbjct: 560 GGGQRTCGENKSRTSLELPGHQLKLLQAV-QATGKPVILILINGRPLSVNWA--DKFVPA 616

Query: 556 ILWAGYPGEAGAVALAEVIFGDHNPG 581
           IL A YPG  G   +A+++FGD+NPG
Sbjct: 617 ILEAWYPGSKGGTVVADILFGDYNPG 642


>gi|60682370|ref|YP_212514.1| hydrolase [Bacteroides fragilis NCTC 9343]
 gi|60493804|emb|CAH08594.1| putative exported hydrolase [Bacteroides fragilis NCTC 9343]
          Length = 859

 Score =  228 bits (582), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 180/604 (29%), Positives = 279/604 (46%), Gaps = 108/604 (17%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------ 71
            F +   +LP+  R +DL+SR+TL+EKI+Q                              
Sbjct: 23  NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82

Query: 72  --------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
                         L+N          RLGIP +   +E+LHG               G+
Sbjct: 83  FIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGS 130

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
           T FPQ I   ++F+  L Y +  AI  E  A     Q I  +   P I++ RD RWGR +
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSL-TPVIDVCRDLRWGRVE 184

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           E  GEDP +  +  VS VRG   +           Q S   KHF A+      G  +   
Sbjct: 185 ECFGEDPFLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGL 227

Query: 234 DARVTM---QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           +    +   ++L   Y   FE+ VK+ +   +M +YN  N  P+ +   L+++  R +W 
Sbjct: 228 NLASVLCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWD 287

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
           F GY+ SD  A+ ++      A++  +A +  L AG+D           +  V+   L  
Sbjct: 288 FQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDV 347

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
             ID+A+  + + +  +GLF   P   P  K    VV +PAH  LA + A++ IVLL+N 
Sbjct: 348 KYIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNK 404

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNY--AGPSCRSITPLQALQNYVENTV---YY 465
           + +LPL  +K  S+A+IGPNA+  +   G+Y  +  +   +T L+AL+  V N +   Y 
Sbjct: 405 NNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKERVGNQLTLNYA 462

Query: 466 PGCDTVACSSASIDKAVDIAKGADHVVLMMG-------LDQTQEK--EELDRVDLVLPGR 516
            GCD V    +   +AVD+AK +D  ++++G        D +     E  D  DL L G 
Sbjct: 463 KGCDLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGV 522

Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           Q++L+  +  A  KPVI+VLL G P  +++ K   NI  I+   YPGE G +ALA+++ G
Sbjct: 523 QEDLVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLG 579

Query: 577 DHNP 580
             NP
Sbjct: 580 KVNP 583


>gi|423271149|ref|ZP_17250120.1| hypothetical protein HMPREF1079_03202 [Bacteroides fragilis
           CL05T00C42]
 gi|423274973|ref|ZP_17253919.1| hypothetical protein HMPREF1080_02572 [Bacteroides fragilis
           CL05T12C13]
 gi|392699073|gb|EIY92255.1| hypothetical protein HMPREF1079_03202 [Bacteroides fragilis
           CL05T00C42]
 gi|392704252|gb|EIY97391.1| hypothetical protein HMPREF1080_02572 [Bacteroides fragilis
           CL05T12C13]
          Length = 859

 Score =  228 bits (581), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 182/604 (30%), Positives = 281/604 (46%), Gaps = 108/604 (17%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------ 71
            F +   +LP+  R +DL+SR+TL+EKI+Q                              
Sbjct: 23  NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82

Query: 72  --------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
                         L+N          RLGIP +   +E+LHG               G+
Sbjct: 83  FIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGS 130

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
           T FPQ I   ++F+  L Y +  AI  E  A     Q I  +   P I++ RD RWGR +
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSL-TPVIDVCRDLRWGRVE 184

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           E  GEDP +  +  VS VRG   +           Q S   KHF A+      G  +   
Sbjct: 185 ECFGEDPFLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGL 227

Query: 234 D-ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           + A V+   ++L   Y   FE+ VK+ +   +M +YN  N  P+ +   L+++  R +W 
Sbjct: 228 NLASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWD 287

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
           F GY+ SD  A+ ++      A++  +A +  L AG+D           +  V+   L  
Sbjct: 288 FQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDV 347

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
             ID+A+  + + +  +GLF   P   P  K    VV +PAH  LA + A++ IVLL+N 
Sbjct: 348 KYIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNE 404

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNY--AGPSCRSITPLQALQNYVENTV---YY 465
           + +LPL  +K  S+A+IGPNA+  +   G+Y  +  +   +T L+AL+  V N +   Y 
Sbjct: 405 NNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKERVGNQLTLNYA 462

Query: 466 PGCDTVACSSASIDKAVDIAKGADHVVLMMG-------LDQTQEK--EELDRVDLVLPGR 516
            GCD V    +   +AVD+AK +D  ++++G        D +     E  D  DL L G 
Sbjct: 463 KGCDLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGV 522

Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           Q++L+  +  A  KPVI+VLL G P  +++ K   NI  I+   YPGE G +ALA+++ G
Sbjct: 523 QEDLVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLG 579

Query: 577 DHNP 580
             NP
Sbjct: 580 KVNP 583


>gi|329963878|ref|ZP_08301220.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328527131|gb|EGF54137.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 766

 Score =  228 bits (581), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 182/609 (29%), Positives = 289/609 (47%), Gaps = 103/609 (16%)

Query: 51  PISQRARDLVSRLTLDEKISQL----------VNSA----------------PAIPRLGI 84
           P+ +R  DL+ R+TL+EK+ Q+           NSA                P I    +
Sbjct: 31  PVKERVEDLLGRMTLEEKVGQMNQFVGLEHIKANSAVMTEEELKNNTANAFYPGITDKEV 90

Query: 85  PAYEWWSEALHG---------------------------VAGVGKGIFFNGTIRGATSFP 117
            A  W  + L G                           + G+   I  N    G T +P
Sbjct: 91  AA--WTEQGLIGSFLHVLTIEEANYLQSLAMKSRLQIPIIFGI-DAIHGNANAPGNTVYP 147

Query: 118 QVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPG 177
             I  A SFD+ + YRI +    E RA+      +  TF  PN+ + RD RWGR  ET G
Sbjct: 148 TNINLACSFDTLMAYRIARETAKEMRAM-----NMHWTF-NPNVEVARDARWGRVGETFG 201

Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFT--AYDLDNWKGTTRYKFDA 235
           EDP +  +  V  V+G QG       L  K    AC KHF   +  ++   G+      A
Sbjct: 202 EDPYLVTRMGVQSVKGYQG------SLDSKEDVLACIKHFVGGSEPINGTNGSP-----A 250

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
            ++ + L + + PPFE+ VK G A  +M A+N +NG+P  ++  L++   R +W F G++
Sbjct: 251 DLSERTLREVFFPPFEAGVKAG-AMSLMTAHNELNGVPCHSNEWLMADVLRGEWNFPGFV 309

Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKAAVKQKKLPESEID 354
            SD   +   +D    A++ ++A    + +GMD++  G    +     VK+ ++PES ID
Sbjct: 310 VSDWMDIEHTHDLHATAENLKEAFYQSIMSGMDMHMHGIHWNEMVVELVKEGRIPESRID 369

Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADV---VCSPAHQVLALQAAQDGIVLLKNSH 411
            ++  +  ++ RLGLF      QP+  +   +   +C   H+  AL+AA++GIVLLKN  
Sbjct: 370 ESVRRILDIKFRLGLFE-----QPYADVEETMKIRLCG-EHRATALEAARNGIVLLKNE- 422

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS--ITPLQALQNYVENTVY---YP 466
           G+LPL  SK   + + G NA+  + +LG+++ P       T L+ L+    +T +     
Sbjct: 423 GVLPLDPSKYKKIMVTGINADD-QNILGDWSAPEKEENVTTILEGLRMIAPDTQFDFVDQ 481

Query: 467 GCDTVACSSASIDKAVDIAKGADHVVLMMG-------LDQTQEKEELDRVDLVLPGRQQE 519
           G D        +D+A   AK AD  +++ G        +   + E+ DR DL L G Q+E
Sbjct: 482 GWDPRNMDPKKVDEAAAHAKNADLNIVVAGEYMMRFRWNDRTDGEDTDRSDLDLVGLQEE 541

Query: 520 LITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHN 579
           LI +VA A+ KP +LVL+ G P+ + +A    N+ +I+ A  PG  G  A+AE+++G  N
Sbjct: 542 LIEKVA-ASGKPTVLVLVNGRPLSVRWAA--ENLPAIVEAWAPGMQGGQAVAEILYGKVN 598

Query: 580 PGEFTVIAV 588
           P     I +
Sbjct: 599 PSAKLAITI 607


>gi|53714352|ref|YP_100344.1| beta-glucosidase [Bacteroides fragilis YCH46]
 gi|52217217|dbj|BAD49810.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
          Length = 859

 Score =  228 bits (581), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 182/603 (30%), Positives = 282/603 (46%), Gaps = 108/603 (17%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------- 71
           F +   +LP+  R +DL+SR+TL+EKI+Q                               
Sbjct: 24  FKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYGF 83

Query: 72  -------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
                        L+N          RLGIP +   +E+LHG               G+T
Sbjct: 84  IEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGST 131

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQE 174
            FPQ I   ++F+  L Y +  AI  E  A     Q I  +   P I++ RD RWGR +E
Sbjct: 132 IFPQAIALGSTFNPILAYEMTSAIAKELTA-----QGITQSL-TPVIDVCRDLRWGRVEE 185

Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
             GEDP +  +  VS VRG   +           Q S   KHF A+      G  +   +
Sbjct: 186 CFGEDPYLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGLN 228

Query: 235 -ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF 291
            A V+   ++L   Y   FE+ VK+ +   +M +YN  N  P+ +   L+++  R +W F
Sbjct: 229 LASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWDF 288

Query: 292 HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPES 351
            GY+ SD  A+ ++      A++  +A +  L AG+D           +  V+   L   
Sbjct: 289 QGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDVK 348

Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
            ID+A+  + + +  +GLF   P   P  K    VV +PAH  LA + A++ IVLL+N +
Sbjct: 349 YIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNY--AGPSCRSITPLQALQNYVENTV---YYP 466
            +LPL  +K  S+A+IGPNA+  +   G+Y  +  +   +T L+AL+  V N +   Y  
Sbjct: 406 NILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKERVGNQLTLNYAK 463

Query: 467 GCDTVACSSASIDKAVDIAKGADHVVLMMG-------LDQTQEK--EELDRVDLVLPGRQ 517
           GCD V    +   +AVD+AK +D  ++++G        D +     E  D  DL L G Q
Sbjct: 464 GCDLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQ 523

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
           ++L+  +  A  KPVI+VLL G P+ +++ K   NI  I+   YPGE G +ALA+++ G 
Sbjct: 524 EDLVEAI-HATGKPVIVVLLSGKPLAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLGK 580

Query: 578 HNP 580
            NP
Sbjct: 581 VNP 583


>gi|218258058|ref|ZP_03474485.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
           DSM 18315]
 gi|218225777|gb|EEC98427.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
           DSM 18315]
          Length = 955

 Score =  228 bits (581), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 176/612 (28%), Positives = 288/612 (47%), Gaps = 105/612 (17%)

Query: 49  TLPISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALH 95
           T PI  R  DL+S++ ++EK  Q+V       + +  +P  +W    W        E L+
Sbjct: 65  TAPIDARVEDLLSQMNVEEKTCQMVTLYGYKRVLKDDLPTPDWKNQLWKDGMGAIDEHLN 124

Query: 96  GVAGVGK-------------------------------GI---FFNGTIRG-----ATSF 116
           G    G                                GI   F N  IRG     AT+F
Sbjct: 125 GFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYIATNF 184

Query: 117 PQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQET 175
           P  +    +++  L +++G   G E R L       G T  +AP +++ RD RWGR +E 
Sbjct: 185 PTQLGLGHTWNRDLVHKVGYITGREGRLL-------GYTNVYAPILDVGRDQRWGRYEEV 237

Query: 176 PGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDA 235
            GE P +  +  V   +G+Q D           Q +A  KH+ AY  +        + D 
Sbjct: 238 YGESPYLVAELGVEMAKGMQTD----------YQVAATSKHYIAYSNNKGGREGMARVDP 287

Query: 236 RVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYI 295
           +++ +++   +  P++  +K+    G+M +YN  +G P  +    L+   R ++GF GY+
Sbjct: 288 QMSPREVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYV 347

Query: 296 TSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPE 350
            SD DAV  ++   G A   +++V+  + AG+++ C +F    +     +  + +  LP 
Sbjct: 348 VSDSDAVEYLFSKHGTAADMKESVLQSVLAGLNIRC-TFRSPDSYVLPLRELIAEGALPM 406

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFG---KIGADVVCSPAHQVLALQAAQDGIVLL 407
           S ID  + ++  V+  +GLF+     QP+    K     V S  +Q +ALQA+++ +VLL
Sbjct: 407 STIDDRVRDILRVKFLVGLFD-----QPYQIDLKQADKEVNSAENQQVALQASKESLVLL 461

Query: 408 KNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVE---NTVY 464
           KN   +LPL  +K   +A+ GPNA+     L +Y   +    T L+ +QN V+     ++
Sbjct: 462 KNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEVLF 521

Query: 465 YPGCDTVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRV 509
             GCD V  +                + I+KAV+ AK +D  V+++G       E   R 
Sbjct: 522 TKGCDLVDANWPESELIRYPLTSEEQSEINKAVENAKKSDVAVVVLGGSNRTCGENKSRS 581

Query: 510 DLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVA 569
            L LPGRQ +L+  V  A  KPV+LVL+ G P+ I +A  D+ + +IL A YPG  G  A
Sbjct: 582 SLELPGRQLDLLQAVV-ATGKPVVLVLINGRPISINWA--DKYVPAILEAWYPGSQGGTA 638

Query: 570 LAEVIFGDHNPG 581
           +A+ +FGD+NPG
Sbjct: 639 IADALFGDYNPG 650


>gi|423301682|ref|ZP_17279705.1| hypothetical protein HMPREF1057_02846 [Bacteroides finegoldii
            CL09T03C10]
 gi|408471675|gb|EKJ90206.1| hypothetical protein HMPREF1057_02846 [Bacteroides finegoldii
            CL09T03C10]
          Length = 1365

 Score =  228 bits (581), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 183/614 (29%), Positives = 280/614 (45%), Gaps = 110/614 (17%)

Query: 31   FSCDPSNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQL------------------ 72
            FS   ++P     P+ +  LPI +R +DL+ R+T +EK++Q+                  
Sbjct: 525  FSSTTASPK---LPYQRADLPIEERVKDLLQRMTPEEKLAQIRHIHSWEIFNGQALDERK 581

Query: 73   ------------VNSAPAIP------------------RLGIPAYEWWSEALHGVAGVGK 102
                        V   P                     RLGIP +   +E+LHGV     
Sbjct: 582  LEEKAQGMSWGFVEGFPLTAENCAKNMLAIQRFMVEKTRLGIPIFTV-AESLHGVVH--- 637

Query: 103  GIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM-TFWAPNI 161
                     GAT FPQ I   ++FD+ L YR    I  E         A+GM    +P I
Sbjct: 638  --------EGATVFPQNIALGSTFDTDLAYRKTSMIADEL-------HAVGMRQVLSPCI 682

Query: 162  NIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYD 221
            ++ RD RWGR +E+ GEDP + G++ ++ V+G      NG         S   KH+  + 
Sbjct: 683  DVVRDLRWGRVEESFGEDPYLCGRFGIAEVKGYMD---NG--------ISPMLKHYGPHG 731

Query: 222  LDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLL 281
             +   G      +   +++DL + Y  PFE  +KQ     +M AYN  N IP+ A   LL
Sbjct: 732  -NPLSGLNLASVET--SIRDLHEVYLKPFEMVMKQAPTLAVMSAYNSWNRIPNSASHYLL 788

Query: 282  SKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKA 341
            +   R++WGF GY+ SD  A+ ++ +    A++ E+A +  L AG+DV   S        
Sbjct: 789  TDVLRKEWGFKGYVYSDWGAIEMLKNFHFTARNSEEAALQALTAGLDVEASSDCYPAIPG 848

Query: 342  AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG-KIGADVVCSPAHQVLALQAA 400
             +++ +L    +D A+  +   + R+GLF+      P+G K     + S     L+ + A
Sbjct: 849  LIERGELNREIVDEAVRRVLYAKFRIGLFD-----DPYGEKFAKGAIHSGKAIALSKKIA 903

Query: 401  QDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS--ITPLQALQNY 458
             +  VLLKN   LLPL   K  S+A+IGPNA+  +   G+Y         +TPLQ ++ +
Sbjct: 904  DESTVLLKNDRQLLPLSIGKLKSIAVIGPNADQIQ--FGDYTWTRDNRFGVTPLQGIRKW 961

Query: 459  VENTV---YYPGCDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEEL 506
                V   Y  GC  V+   + I +AV+ A+ +D  VL  G            +   E  
Sbjct: 962  AGTNVKVNYVKGCSLVSMDESGIRQAVEAAEQSDVCVLFCGSASAALARDYKSSTCGEGF 1021

Query: 507  DRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAG 566
            D  DL L G Q  LI +  +A  KPVILVL+ G P  I + K  +NI +IL   Y GE  
Sbjct: 1022 DLNDLTLTGAQPALI-KAVQATGKPVILVLVTGKPFAIPWEK--KNIPAILVQWYAGEQS 1078

Query: 567  AVALAEVIFGDHNP 580
              ++A+++FG  +P
Sbjct: 1079 GNSIADILFGKVSP 1092


>gi|265766195|ref|ZP_06094236.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
 gi|263253863|gb|EEZ25328.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
          Length = 859

 Score =  228 bits (581), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 182/603 (30%), Positives = 281/603 (46%), Gaps = 108/603 (17%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------- 71
           F +   +LP+  R +DL+SR+TL+EKI+Q                               
Sbjct: 24  FKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYGF 83

Query: 72  -------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
                        L+N          RLGIP +   +E+LHG               G+T
Sbjct: 84  IEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGST 131

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQE 174
            FPQ I   ++F+  L Y +  AI  E  A     Q I  +   P I++ RD RWGR +E
Sbjct: 132 IFPQAIALGSTFNPILAYEMTSAIAKELTA-----QGITQSL-TPVIDVCRDLRWGRVEE 185

Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
             GEDP +  +  VS VRG   +           Q S   KHF A+      G  +   +
Sbjct: 186 CFGEDPYLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGLN 228

Query: 235 -ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF 291
            A V+   ++L   Y   FE+ VK+ +   +M +YN  N  P+ +   L+++  R +W F
Sbjct: 229 LASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWDF 288

Query: 292 HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPES 351
            GY+ SD  A+ ++      A++  +A +  L AG+D           +  V+   L   
Sbjct: 289 QGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDVK 348

Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
            ID+A+  + + +  +GLF   P   P  K    VV +PAH  LA + A++ IVLL+N +
Sbjct: 349 YIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNY--AGPSCRSITPLQALQNYVENTV---YYP 466
            +LPL  +K  S+A+IGPNA+  +   G+Y  +  +   +T L+AL+  V N +   Y  
Sbjct: 406 NILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKERVSNQLTLNYAK 463

Query: 467 GCDTVACSSASIDKAVDIAKGADHVVLMMG-------LDQTQEK--EELDRVDLVLPGRQ 517
           GCD V    +   +AVD+AK +D  ++++G        D +     E  D  DL L G Q
Sbjct: 464 GCDLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQ 523

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
           ++L+  +  A  KPVI+VLL G P  +++ K   NI  I+   YPGE G +ALA+++ G 
Sbjct: 524 EDLVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLGK 580

Query: 578 HNP 580
            NP
Sbjct: 581 VNP 583


>gi|336430623|ref|ZP_08610567.1| hypothetical protein HMPREF0994_06573 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336017532|gb|EGN47292.1| hypothetical protein HMPREF0994_06573 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 973

 Score =  228 bits (581), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 134/401 (33%), Positives = 223/401 (55%), Gaps = 17/401 (4%)

Query: 36  SNPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALH 95
           +N  T+T PF   +LP  +R   L+S +T+DEK+S +    P + RLGIP +    EA H
Sbjct: 3   NNVKTDT-PFWDPSLPFEERLDWLLSEMTMDEKLSCMATRVPDLERLGIPGFSVGGEAAH 61

Query: 96  GVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT 155
           GV          G     TSFPQ I  +A++D  L  + G+  G EAR +Y+     G++
Sbjct: 62  GVEARNDQNDL-GAAEPTTSFPQPIGMSATWDPDLIKKAGEVTGTEARVIYHRHPDRGLS 120

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
            WAP +++ RDPRWGR +E  GEDP++TG  + +Y++G+QGD          L+ +A  K
Sbjct: 121 RWAPTVDLERDPRWGRTEEGYGEDPVLTGVMSSAYIQGMQGDD------PKYLRVAATLK 174

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           HF A + +  +G      D R   Q+L   Y  PF   +  GRA G+M AYN++NGIP  
Sbjct: 175 HFYANNTEAGRGWKSSCVDPR-NRQEL---YLEPFRRAIMNGRAEGVMTAYNKINGIPGM 230

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD--VNCGS 333
            +   + +  ++++G  G++  D  A+ ++ +   Y     + + + +KAG+D   +   
Sbjct: 231 LNPE-VREILKKEYGLKGHVVCDGGAMELVANLHHYYGLHAETLANAVKAGVDGMSDRPE 289

Query: 334 FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQ 393
            + +  + A +   + E EID AL N+F  ++RLG+++  P   P+ ++  + + SP ++
Sbjct: 290 AVAQAAREAWELGLITEEEIDEALRNIFRTKLRLGIYDREPC-NPYDRVTEEDLNSPLNK 348

Query: 394 VLALQAAQDGIVLLKNSHGLLPL-PKSKSVSLALIGPNANS 433
            + LQ +++ IVLLKN + +LPL P   + ++AL+GP A+S
Sbjct: 349 QICLQLSREAIVLLKNENKMLPLDPAVPAETMALVGPLADS 389



 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 53/104 (50%), Gaps = 4/104 (3%)

Query: 478 IDKAVDIAKGADHVVLMMGLDQT-QEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVL 536
           ++KA  + +G + V+L +G +     KEE+DR  + LP  Q+ L+  +       VILVL
Sbjct: 554 LEKACSLVQGKEKVILALGCNSMINAKEEVDRTTIALPPAQEHLLEEIYRINPN-VILVL 612

Query: 537 LCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
               P  I  A+    + ++LW+    +    A+AE +FG + P
Sbjct: 613 FSNYPYSINMAQ--EKLPAVLWSATGSQDMGTAMAETLFGKNAP 654


>gi|380694609|ref|ZP_09859468.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
           [Bacteroides faecis MAJ27]
          Length = 804

 Score =  228 bits (581), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 160/527 (30%), Positives = 263/527 (49%), Gaps = 61/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G            T FP  I  +A++   L   +G+AI  
Sbjct: 142 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGMSATWSPTLIEEVGKAIAK 189

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R+    G  I    + P +++ RDPRW R +ET GEDP+++G+   + V G+      
Sbjct: 190 EIRS---QGAHIS---YGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVTGL-----G 238

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            G L  +    A  KHF AY +        Y   A V  +DL + + PPF   ++ G A 
Sbjct: 239 SGDLSREHATIATLKHFLAYAVPEGGQNGNY---ASVGARDLHENFLPPFREAIEAG-AL 294

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++GIP  A+  LL++  R +W F G++ SD  ++  I+++   A + E+A V
Sbjct: 295 SVMTSYNSIDGIPCTANHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVASTMEEAAV 354

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             L AG+D++ G     +   AV+  KL E++I+ A+  +  ++  +GLF  +P + P  
Sbjct: 355 QALSAGVDIDLGGDAFMNLLQAVRSGKLDETQINAAVDRILRMKFEMGLFE-HPYVNP-- 411

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           K    +V +  H  LA + AQ  +VLL+N + +LPL K K   +A++GPNA++   +LG+
Sbjct: 412 KTTTKMVRNKEHVKLARKVAQSSVVLLENKNSILPLSK-KIKRVAVVGPNADNRYNMLGD 470

Query: 441 YAGPS----CRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P      R++      +       Y  GC     +   I +AV+ A  ++ ++ ++G
Sbjct: 471 YTAPQEDKDIRTVLDGVISKLSPSRVEYVRGCAIRDTTVNEIAEAVEAAHRSEVIIAVVG 530

Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
                                  +   +  E  DR  L L G+QQ+L+  + +   KP+I
Sbjct: 531 GSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLNAL-KTTGKPLI 589

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P+D  +A       ++L A YPG+AG  A+A+V+FGD+NP
Sbjct: 590 VVYIEGRPLDKVWA--SECADALLTASYPGQAGGDAIADVLFGDYNP 634


>gi|427387354|ref|ZP_18883410.1| hypothetical protein HMPREF9447_04443 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725515|gb|EKU88386.1| hypothetical protein HMPREF9447_04443 [Bacteroides oleiciplenus YIT
           12058]
          Length = 786

 Score =  228 bits (580), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 173/600 (28%), Positives = 284/600 (47%), Gaps = 93/600 (15%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
           P+  R +DL+S++T++EK  Q+     +  + +  +P  +W    W + +  +     G+
Sbjct: 48  PLEARVKDLLSQMTMEEKTCQMATLYGSGRVLKDSLPTEQWKNEIWKDGIANIDEQANGL 107

Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
           GK                             GI   F N  IRG     AT FP      
Sbjct: 108 GKFGSSLSYPYVNSVENRQAIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 167

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L   I +    EA+AL       G T  ++P ++I +DPRWGR  E  GEDP +
Sbjct: 168 ATWNKELISEIAKVTAEEAKAL-------GYTNIYSPILDIAQDPRWGRVVECYGEDPFL 220

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            G+     ++G+Q +              +  KHF  Y +         + D  V  +++
Sbjct: 221 VGELGKRMIKGLQAEGL-----------VSTPKHFAVYSIPVGGRDAGTRTDPHVAPREM 269

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
              Y  PF     +  A G+M +YN  +G P     + L++  R +WGF GY+ SD +AV
Sbjct: 270 RTLYIEPFRKAFCEAGALGVMSSYNDYDGEPITGSYHFLTEILRHEWGFKGYVVSDSEAV 329

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESEIDRALH 358
             +Y     A +  D    V+ AG++V     L ++     + A+ + K+ E  ID  + 
Sbjct: 330 EFLYSKHNVAANAVDGAAQVINAGLNVRTNFTLPENFIRPLRQAISEGKVSEQTIDSRVA 389

Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
           ++  V+  +GLF+ NP  +   K    VV S  HQ ++++AA + IVLLKN + +LPL K
Sbjct: 390 DVLRVKFMMGLFD-NP-YKGDAKKPEKVVHSKEHQAVSMRAALESIVLLKNENNILPLSK 447

Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV--ENTVYYPGCDTV----- 471
           S +  +A+IGPNA     L+  Y   +    T  Q +++Y+   +  Y  G D +     
Sbjct: 448 S-TKKVAVIGPNAAEVDNLICRYGPANAPIKTVYQGIKDYLPDADVRYAKGADIIDKYFP 506

Query: 472 ----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
                         A ID+AV +AK +D  ++++G ++   +EE  R +L L GRQ++L+
Sbjct: 507 ESELYDVPLDKDEQAMIDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDLCGRQEKLL 566

Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             V  A  KPV+L+L+ G    I +A++   I  I+ A +PGE    A+A+V+FGD+NPG
Sbjct: 567 QAVY-ATGKPVVLLLVDGRAATINWAEH--YIPGIVHAWFPGEFMGDAVAKVLFGDYNPG 623


>gi|423250669|ref|ZP_17231684.1| hypothetical protein HMPREF1066_02694 [Bacteroides fragilis
           CL03T00C08]
 gi|423253995|ref|ZP_17234925.1| hypothetical protein HMPREF1067_01569 [Bacteroides fragilis
           CL03T12C07]
 gi|392651626|gb|EIY45288.1| hypothetical protein HMPREF1066_02694 [Bacteroides fragilis
           CL03T00C08]
 gi|392654553|gb|EIY48200.1| hypothetical protein HMPREF1067_01569 [Bacteroides fragilis
           CL03T12C07]
          Length = 859

 Score =  228 bits (580), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 182/604 (30%), Positives = 281/604 (46%), Gaps = 108/604 (17%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------ 71
            F +   +LP+  R +DL+SR+TL+EKI+Q                              
Sbjct: 23  NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82

Query: 72  --------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
                         L+N          RLGIP +   +E+LHG               G+
Sbjct: 83  FIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGS 130

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
           T FPQ I   ++F+  L Y +  AI  E  A     Q I  +   P I++ RD RWGR +
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSL-TPVIDVCRDLRWGRVE 184

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           E  GEDP +  +  VS VRG   +           Q S   KHF A+      G  +   
Sbjct: 185 ECFGEDPYLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGL 227

Query: 234 D-ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           + A V+   ++L   Y   FE+ VK+ +   +M +YN  N  P+ +   L+++  R +W 
Sbjct: 228 NLASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWD 287

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
           F GY+ SD  A+ ++      A++  +A +  L AG+D           +  V+   L  
Sbjct: 288 FQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDV 347

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
             ID+A+  + + +  +GLF   P   P  K    VV +PAH  LA + A++ IVLL+N 
Sbjct: 348 KYIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNE 404

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNY--AGPSCRSITPLQALQNYVENTV---YY 465
           + +LPL  +K  S+A+IGPNA+  +   G+Y  +  +   +T L+AL+  V N +   Y 
Sbjct: 405 NNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKERVGNQLTLNYA 462

Query: 466 PGCDTVACSSASIDKAVDIAKGADHVVLMMG-------LDQTQEK--EELDRVDLVLPGR 516
            GCD V    +   +AVD+AK +D  ++++G        D +     E  D  DL L G 
Sbjct: 463 KGCDLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGV 522

Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           Q++L+  +  A  KPVI+VLL G P  +++ K   NI  I+   YPGE G +ALA+++ G
Sbjct: 523 QEDLVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLG 579

Query: 577 DHNP 580
             NP
Sbjct: 580 KVNP 583


>gi|329956938|ref|ZP_08297506.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523695|gb|EGF50787.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 944

 Score =  228 bits (580), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 164/526 (31%), Positives = 263/526 (50%), Gaps = 61/526 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP  ++ +E + GV               AT+FP  +    +++  L  ++G   G 
Sbjct: 153 RLGIPV-DFTNEGIRGVESYK-----------ATNFPTQLGLGHTWNRELIRQVGLITGR 200

Query: 141 EARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTF 199
           EAR L       G T  +AP +++ RD RWGR +E  GE P +  +  +  VRG+Q +  
Sbjct: 201 EARML-------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH- 252

Query: 200 NGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRA 259
                    Q +A  KHF AY  +        + D +++ +++ + +  PF+  +++   
Sbjct: 253 ---------QVAATAKHFAAYSNNKGAREGMSRVDPQMSPREVENIHIYPFKRVIRETGL 303

Query: 260 SGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAV 319
            GIM +YN  +GIP       L+   R++ GF GY+ SD DAV  +Y     AK  ++AV
Sbjct: 304 LGIMSSYNDYDGIPVQGSYYWLTTRLRQEMGFRGYVVSDSDAVEYLYTKHNTAKDMKEAV 363

Query: 320 VDVLKAGMDVNC-----GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNP 374
              ++AG++V C      SF+    +  VK+  L E  I+  + ++  V+  +GLF+ +P
Sbjct: 364 RQSVEAGLNVRCTFRSPDSFVLP-LRELVKEGGLSEEVINDRVRDILRVKFLIGLFD-SP 421

Query: 375 TMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANS 433
                   GAD  V   A++ +ALQA+++ +VLLKN+   LPL   K   +A+ GPNA+ 
Sbjct: 422 YQTDLA--GADNEVEKAANEAVALQASRESVVLLKNADNTLPLNIDKIKKIAVCGPNADE 479

Query: 434 AKTLLGNYAGPSCRSITPLQALQNYVE---NTVYYPGCDTVAC---------------SS 475
               L +Y   +    T L+ ++   +     +Y  GCD V                   
Sbjct: 480 EGYALTHYGPLAVEVTTVLEGIREKAQGKAEVLYTKGCDLVDAHWPESEIIEYPLTPDEQ 539

Query: 476 ASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILV 535
           A ID+A   A+ AD  V+++G  Q    E   R  L LPG Q +L+  V +A  KPV+LV
Sbjct: 540 AEIDRAAANARQADVAVVVLGGGQRTCGENKSRTSLDLPGHQLKLLQAV-QATGKPVVLV 598

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           L+ G P+ + +A  D+ + +IL A YPG  G  A+A+++FGD+NPG
Sbjct: 599 LINGRPLSVNWA--DKFVPAILEAWYPGSKGGTAVADILFGDYNPG 642


>gi|423260853|ref|ZP_17241755.1| hypothetical protein HMPREF1055_04032 [Bacteroides fragilis
           CL07T00C01]
 gi|423266988|ref|ZP_17245970.1| hypothetical protein HMPREF1056_03657 [Bacteroides fragilis
           CL07T12C05]
 gi|387774614|gb|EIK36724.1| hypothetical protein HMPREF1055_04032 [Bacteroides fragilis
           CL07T00C01]
 gi|392697691|gb|EIY90874.1| hypothetical protein HMPREF1056_03657 [Bacteroides fragilis
           CL07T12C05]
          Length = 859

 Score =  228 bits (580), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 182/604 (30%), Positives = 281/604 (46%), Gaps = 108/604 (17%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------ 71
            F +   +LP+  R +DL+SR+TL+EKI+Q                              
Sbjct: 23  NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82

Query: 72  --------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
                         L+N          RLGIP +   +E+LHG               G+
Sbjct: 83  FIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGS 130

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
           T FPQ I   ++F+  L Y +  AI  E  A     Q I  +   P I++ RD RWGR +
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSL-TPVIDVCRDLRWGRVE 184

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           E  GEDP +  +  VS VRG   +           Q S   KHF A+      G  +   
Sbjct: 185 ECFGEDPYLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGL 227

Query: 234 D-ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           + A V+   ++L   Y   FE+ VK+ +   +M +YN  N  P+ +   L+++  R +W 
Sbjct: 228 NLASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWD 287

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
           F GY+ SD  A+ ++      A++  +A +  L AG+D           +  V+   L  
Sbjct: 288 FQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDV 347

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
             ID+A+  + + +  +GLF   P   P  K    VV +PAH  LA + A++ IVLL+N 
Sbjct: 348 KYIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNE 404

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNY--AGPSCRSITPLQALQNYVENTV---YY 465
           + +LPL  +K  S+A+IGPNA+  +   G+Y  +  +   +T L+AL+  V N +   Y 
Sbjct: 405 NNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKERVGNQLTLNYA 462

Query: 466 PGCDTVACSSASIDKAVDIAKGADHVVLMMG-------LDQTQEK--EELDRVDLVLPGR 516
            GCD V    +   +AVD+AK +D  ++++G        D +     E  D  DL L G 
Sbjct: 463 KGCDLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGV 522

Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           Q++L+  +  A  KPVI+VLL G P  +++ K   NI  I+   YPGE G +ALA+++ G
Sbjct: 523 QEDLVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLG 579

Query: 577 DHNP 580
             NP
Sbjct: 580 KVNP 583


>gi|53712134|ref|YP_098126.1| beta-glucosidase [Bacteroides fragilis YCH46]
 gi|52214999|dbj|BAD47592.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
          Length = 812

 Score =  228 bits (580), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 174/543 (32%), Positives = 261/543 (48%), Gaps = 78/543 (14%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  ++G+ I +
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GEDP + G    + VRG QG+T N
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETLN 250

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            GK        A  KHF +Y    W         A +  ++L +   PPF   V  G A 
Sbjct: 251 DGK-----SVIATLKHFASY---GWTEGGHNGGTAHIGERELEEAIFPPFREAVGAG-AL 301

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P    R LL+   + +W F G++ SD  AV  + +  G A +  +A +
Sbjct: 302 SVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLRE-HGVAGNDYEAAI 360

Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+    +   AAVK+  +  + ID+A+  + S++ ++GLF+      PF
Sbjct: 361 KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFD-----DPF 415

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              K    +V S  H  LA + A+  IVLLKN   LLPL K    +LA+IGPNA++   +
Sbjct: 416 VDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIR-TLAVIGPNADNVYNM 474

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P      +T L  ++  V      +Y  GC     S      A++ A+ AD VV
Sbjct: 475 LGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADAVV 534

Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
           ++MG     D + E EE                    DR  L L GRQ EL+  ++    
Sbjct: 535 MVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLG- 593

Query: 530 KPVILV----LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTV 585
           KPV+L+    LL  G +        +   +I+ A YPG  G  A+A+V+FGD+NP     
Sbjct: 594 KPVVLIKGRPLLMEGAI--------QEAEAIVDAWYPGMQGGNAVADVLFGDYNPAGRLT 645

Query: 586 IAV 588
           ++V
Sbjct: 646 LSV 648


>gi|423301451|ref|ZP_17279475.1| hypothetical protein HMPREF1057_02616 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472052|gb|EKJ90581.1| hypothetical protein HMPREF1057_02616 [Bacteroides finegoldii
           CL09T03C10]
          Length = 781

 Score =  228 bits (580), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 166/527 (31%), Positives = 260/527 (49%), Gaps = 62/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G  +F  G    AT  PQ+I             +G+AIG 
Sbjct: 129 RLGIPLF-LAEEAPHGHMAIGTTVFPTGIGMAATWSPQLI-----------NEVGKAIGK 176

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R         G   + P +++ RDPRW R +ET GEDP++TG+   + V G+      
Sbjct: 177 EIRL------QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVAGL-----G 225

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            G L       A  KHF AY +          F     M++L + + PPF   +  G A 
Sbjct: 226 SGDLSRPYSTLATLKHFLAYGISESGQNGNPSFAG---MRELHENFLPPFGQAINAG-AL 281

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P  A+  LL++  R  W F G + SD  ++  I+ +   A + ++A V
Sbjct: 282 SVMTSYNSMDGTPCTANHYLLTELLRDDWKFKGVVVSDLYSIEGIHQSHFVASTMKEAAV 341

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             L AG+D++ G     +   AV +K++ +  +D A+  +  ++  +GLF  NP + P G
Sbjct: 342 MALSAGVDIDLGGDAYMNLMDAVNRKEISKEILDAAVSRVLRLKFEMGLFE-NPYVDP-G 399

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           K   + V S  +  LA Q AQ  I LLKN H LLPL   +S+ +ALIGPNA++   +LG+
Sbjct: 400 KAKKE-VRSKEYVALARQVAQASITLLKNEHSLLPL--DRSMKVALIGPNADNRYNMLGD 456

Query: 441 YAGPSCRS--ITPLQALQNYVENT--VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P       T L  ++  + ++   Y  GC      ++ I++AV  A+ ++ V+ ++G
Sbjct: 457 YTAPQEEENVKTVLDGIRAKLSSSQVEYVKGCSIRDTVTSDIEQAVAAARRSEVVIAVVG 516

Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
                                  +   +  E  DR  L L G+QQEL+ +  +A  KP+I
Sbjct: 517 GSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELL-KALKATGKPLI 575

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P+D  +A    N  ++L A YPG+ G  A+A+V+FG+ NP
Sbjct: 576 VVYIEGRPLDKNWA--SENADALLTAYYPGQEGGNAIADVLFGEFNP 620



 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 16/31 (51%), Positives = 23/31 (74%)

Query: 43 FPFCKTTLPISQRARDLVSRLTLDEKISQLV 73
          FP+   TLP+  R  DL+SR+TL+EK+ QL+
Sbjct: 27 FPYKNPTLPVETRVADLLSRMTLEEKVGQLL 57


>gi|423302093|ref|ZP_17280116.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
           CL09T03C10]
 gi|408471184|gb|EKJ89716.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
           CL09T03C10]
          Length = 1039

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 182/608 (29%), Positives = 286/608 (47%), Gaps = 101/608 (16%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALHGV 97
           PI  R  DL+S++TL+EK  Q+V       + +  +P  EW    W        E L+G 
Sbjct: 151 PIDARIEDLLSQMTLEEKTCQMVTLYGYKRVLKDDLPTPEWKNQLWKDGIGAIDEHLNGF 210

Query: 98  AGVGK-------------------------------GI---FFNGTIRG-----ATSFPQ 118
              G                                GI   F N  IRG     AT+FP 
Sbjct: 211 QQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNFPT 270

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
            +    +++  L  +IG   G EAR L       G T  +AP +++ RD RWGR +E  G
Sbjct: 271 QLGLGHTWNRQLLRQIGLITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYG 323

Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
           E P +  +  +  V+G+Q +           Q +A  KHF AY  +        + D ++
Sbjct: 324 ESPYLVAELGIEMVKGMQHNH----------QVAATGKHFIAYSNNKGAREGMARVDPQM 373

Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
           + +++   +  PF+  +++    G+M +YN  +G P  +    L+   R   GF GY+ S
Sbjct: 374 SPREVEMIHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGDMGFRGYVVS 433

Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPESE 352
           D DAV  +Y   G AK  ++AV   ++AG+++ C +F    +     +  VK+ +L E  
Sbjct: 434 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNIRC-TFRSPDSYVLPLRELVKEGELSEEI 492

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKNSH 411
           I+  + ++  V+  +GLF+ +P        GAD  V   +++ +ALQA+++ IVLLKN  
Sbjct: 493 INDRVRDILRVKFLVGLFD-HPYQTDLK--GADEEVEKASNEEIALQASRESIVLLKNDK 549

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYV---ENTVYYPGC 468
            +LPL  S    +A+ GPNA+     L +Y   +    + L+ +Q  +      +Y  GC
Sbjct: 550 NVLPLNASTIKKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKLGGKAEVLYTKGC 609

Query: 469 DTVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           + V  +                  I+KAV   K AD  V+++G  Q    E   R  L L
Sbjct: 610 ELVDANWPESELMEYPLSENEQEEIEKAVSQTKQADVAVVVLGGGQRTCGENKSRSSLAL 669

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PGRQ +L+  V  A  KPV+LVL+ G P+ I +A  D+ + +IL A YPG  G  A+A+V
Sbjct: 670 PGRQLDLLKAVV-ATGKPVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGKAVADV 726

Query: 574 IFGDHNPG 581
           +FGD+NPG
Sbjct: 727 LFGDYNPG 734


>gi|383115541|ref|ZP_09936297.1| hypothetical protein BSGG_2589 [Bacteroides sp. D2]
 gi|313695054|gb|EFS31889.1| hypothetical protein BSGG_2589 [Bacteroides sp. D2]
          Length = 800

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 185/609 (30%), Positives = 290/609 (47%), Gaps = 111/609 (18%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
           PI  R  DL+S++TL+EK  Q+     +  + +   P   W    W + +  +     G+
Sbjct: 62  PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTDGWSTEIWKDGIGNIDEQANGL 121

Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
           GK                             GI   F N  IRG     AT FP      
Sbjct: 122 GKFGSEISYPYANSVKNRHTVQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 181

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L   I +    EA+AL       G T  +AP ++I +DPRWGR  E+ GEDP +
Sbjct: 182 ATWNKKLIREIAKVTADEAKAL-------GYTNIYAPILDIAQDPRWGRVVESYGEDPYL 234

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
            G+     + G+Q +              A  KHF  Y +      G TR   D  V  +
Sbjct: 235 VGELGKQMILGLQSEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 281

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           ++   Y  PF   +++  A G+M +YN  +G P     + L++  R+QWGF GY+ SD +
Sbjct: 282 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDSE 341

Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
           AV  ++       + E+    V+ AG+++         F+    + A+ + K+    +D+
Sbjct: 342 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAISEGKVSLHTLDQ 400

Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-----VVCSPAHQVLALQAAQDGIVLLKNS 410
            +  +  V+  +GLF+ NP        G D     VV + AHQ ++++AA + IVLLKN 
Sbjct: 401 RVGEILRVKFMMGLFD-NPYP------GDDRRPEVVVHNAAHQDVSMRAALESIVLLKNE 453

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVEN--TVYYPG 467
             +LPL KS S  +A+IGPNA   K L   Y GP+  SI T  Q ++ Y+ N    Y  G
Sbjct: 454 KEMLPLSKSFS-KIAVIGPNAEEVKELTCRY-GPANASIKTVYQGIKEYLPNAEVRYAKG 511

Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
           CD +                   A I++AV++AK +D  +L++G ++   +EE  R +L 
Sbjct: 512 CDIIDKYFPESELYNVPLDTQEQAMINEAVELAKASDVAILVLGGNEKTVREEFSRTNLD 571

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           L GRQQ+L+  V  A  KPV+LV++ G    I +A  ++ + +I+ A +PGE    A+A+
Sbjct: 572 LCGRQQQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIAK 628

Query: 573 VIFGDHNPG 581
           V+FGD+NPG
Sbjct: 629 VLFGDYNPG 637


>gi|375359159|ref|YP_005111931.1| putative exported hydrolase [Bacteroides fragilis 638R]
 gi|423283738|ref|ZP_17262622.1| hypothetical protein HMPREF1204_02160 [Bacteroides fragilis HMW
           615]
 gi|301163840|emb|CBW23395.1| putative exported hydrolase [Bacteroides fragilis 638R]
 gi|404580776|gb|EKA85484.1| hypothetical protein HMPREF1204_02160 [Bacteroides fragilis HMW
           615]
          Length = 859

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 181/603 (30%), Positives = 279/603 (46%), Gaps = 108/603 (17%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------- 71
           F +   +LP+  R +DL+SR+TL+EKI+Q                               
Sbjct: 24  FKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYGF 83

Query: 72  -------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
                        L+N          RLGIP +   +E+LHG               G+T
Sbjct: 84  IEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGST 131

Query: 115 SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQE 174
            FPQ I   ++F+  L Y +  AI  E  A     Q I  +   P I++ RD RWGR +E
Sbjct: 132 IFPQAIALGSTFNPILAYEMTSAIAKELTA-----QGITQSL-TPVIDVCRDLRWGRVEE 185

Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
             GEDP +  +  VS VRG   +           Q S   KHF A+      G  +   +
Sbjct: 186 CFGEDPYLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGLN 228

Query: 235 -ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGF 291
            A V+   ++L   Y   FE+ VK+ +   +M +YN  N  P+ +   L+++  R +W F
Sbjct: 229 LASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWDF 288

Query: 292 HGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPES 351
            GY+ SD  A+ ++      A++  +A +  L AG+D           +  V+   L   
Sbjct: 289 QGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDVK 348

Query: 352 EIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
            ID+A+  + + +  +GLF   P   P  K    VV +PAH  LA + A++ IVLL+N +
Sbjct: 349 YIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNY--AGPSCRSITPLQALQNYVENTV---YYP 466
            +LPL  +K  S+A+IGPNA+  +   G+Y  +  +   +T L+AL+  V N +   Y  
Sbjct: 406 NILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKERVGNQLTLNYAK 463

Query: 467 GCDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQ 517
           GCD V    +   +AVD+AK +D  ++++G                E  D  DL L G Q
Sbjct: 464 GCDLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQ 523

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
           ++L+  +  A  KPVI+VLL G P  +++ K   NI  I+   YPGE G +ALA+++ G 
Sbjct: 524 EDLVEAI-HATGKPVIVVLLSGKPFAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLGK 580

Query: 578 HNP 580
            NP
Sbjct: 581 VNP 583


>gi|336411808|ref|ZP_08592268.1| hypothetical protein HMPREF1018_04286 [Bacteroides sp. 2_1_56FAA]
 gi|335940152|gb|EGN02020.1| hypothetical protein HMPREF1018_04286 [Bacteroides sp. 2_1_56FAA]
          Length = 859

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 180/604 (29%), Positives = 279/604 (46%), Gaps = 108/604 (17%)

Query: 42  TFPFCKTTLPISQRARDLVSRLTLDEKISQ------------------------------ 71
            F +   +LP+  R +DL+SR+TL+EKI+Q                              
Sbjct: 23  NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82

Query: 72  --------------LVNSAPAI----PRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGA 113
                         L+N          RLGIP +   +E+LHG               G+
Sbjct: 83  FIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDGS 130

Query: 114 TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWGRGQ 173
           T FPQ I   ++F+  L Y +  AI  E  A     Q I  +   P I++ RD RWGR +
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSL-TPVIDVCRDLRWGRVE 184

Query: 174 ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
           E  GEDP +  +  VS VRG   +           Q S   KHF A+      G  +   
Sbjct: 185 ECFGEDPFLVSRMGVSQVRGYLDN-----------QVSPMIKHFGAH------GAPQGGL 227

Query: 234 D-ARVT--MQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
           + A V+   ++L   Y   FE+ VK+ +   +M +YN  N  P+ +   L+++  R +W 
Sbjct: 228 NLASVSCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWD 287

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPE 350
           F GY+ SD  A+ ++      A++  +A +  L AG+D           +  V+   L  
Sbjct: 288 FQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDV 347

Query: 351 SEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
             ID+A+  + + +  +GLF   P   P  K    VV +PAH  LA + A++ IVLL+N 
Sbjct: 348 KYIDQAVARILTAKFNMGLFE-YPL--PMEKNYDKVVHAPAHVSLARKIAEESIVLLQNE 404

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNY--AGPSCRSITPLQALQNYVENTV---YY 465
           + +LPL  +K  S+A+IGPNA+  +   G+Y  +  +   +T L+AL+    N +   Y 
Sbjct: 405 NNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKERAGNQLTLNYA 462

Query: 466 PGCDTVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGR 516
            GCD V    +   +AVD+AK +D  ++++G                E  D  DL L G 
Sbjct: 463 KGCDLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGV 522

Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           Q++L+  +  A  KPVI+VLL G P+ +++ K   NI  I+   YPGE G +ALA+++ G
Sbjct: 523 QEDLVEAI-HATGKPVIVVLLSGKPLAMSWIK--ENIPGIVVQWYPGEQGGLALADMLLG 579

Query: 577 DHNP 580
             NP
Sbjct: 580 KVNP 583


>gi|423212854|ref|ZP_17199383.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392694712|gb|EIY87939.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 782

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 165/527 (31%), Positives = 262/527 (49%), Gaps = 61/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT FP  I  AA++   L   +GQ I  
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R+        G   + P +++ RDPRW R +ET GEDP+++G    S V     D   
Sbjct: 177 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMV-----DGLG 225

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
           GG L  K    A  KHF AY +        Y   A V ++DL   + PPF   +  G A 
Sbjct: 226 GGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG-AL 281

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++GIP  ++  LL++  R +W F G++ SD  ++  I+++   A + E+A +
Sbjct: 282 SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAI 341

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             + AG+DV+ G     +   AV+  ++ ++ ID A+  +  ++  +GLF  +P + P  
Sbjct: 342 QSVTAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRVLRMKFEMGLFE-HPYVDP-- 398

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           KI A  V    H  LA + AQ  I LLKN + +LPL K+ +  +A+IGPNA++   +LG+
Sbjct: 399 KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSKTIN-KVAVIGPNADNRYNMLGD 457

Query: 441 YAGPSCRS--ITPLQALQNYVE--NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P   S   T L  +   +      Y  GC     +   I++A++ A+ ++ V++++G
Sbjct: 458 YTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVG 517

Query: 497 LDQTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
               ++                        E  DR  L L GRQQEL+  + +   KP+I
Sbjct: 518 GSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTG-KPLI 576

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P++  +A       ++L A YPG+ G  A+A+V+FGD+NP
Sbjct: 577 VVYIEGRPLEKNWA--SEYADALLTAYYPGQEGGNAIADVLFGDYNP 621


>gi|153807033|ref|ZP_01959701.1| hypothetical protein BACCAC_01310 [Bacteroides caccae ATCC 43185]
 gi|423219984|ref|ZP_17206480.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
           CL03T12C61]
 gi|149130153|gb|EDM21363.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           caccae ATCC 43185]
 gi|392624247|gb|EIY18340.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
           CL03T12C61]
          Length = 786

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 182/604 (30%), Positives = 290/604 (48%), Gaps = 101/604 (16%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
           PI  R  DL+S++TL+EK  Q+     +  + +   P  EW    W + +  +     G+
Sbjct: 48  PIEARVADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTAEWSKEIWKDGIGNIDEQANGL 107

Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
           GK                             GI   F N  IRG     AT FP      
Sbjct: 108 GKFGSELSYPYANSVKNRHEIQRWFVEQTRLGIPVDFTNEGIRGLCHNRATMFPAQCGQG 167

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L   I +    EA+AL       G T  +AP ++I +DPRWGR  E+ GEDP +
Sbjct: 168 ATWNKKLIREIAKVTADEAKAL-------GYTNIYAPILDIAQDPRWGRVVESYGEDPYL 220

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
            G+     + G+Q +             +A  KHF  Y +      G TR   D  V  +
Sbjct: 221 AGELGKQMILGLQAEGL-----------AATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 267

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           ++   Y  PF   +++  A G+M +YN  +G P     + L++  R+QWGF GY+ SD +
Sbjct: 268 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDSE 327

Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
           AV  ++       + E+    V+ AG+++         F+    + A+ + K+    +D+
Sbjct: 328 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAISEGKISLHTLDQ 386

Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLP 415
            +  +  V+  LGLF+ NP      +    VV + AHQ ++++AA + IVLLKN + +LP
Sbjct: 387 RVGEILRVKFMLGLFD-NPYPGD-DRHPETVVHNAAHQEVSMKAALESIVLLKNENQMLP 444

Query: 416 LPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVEN--TVYYPGCDTV- 471
           L KS +  +A+IGPNA   K L   Y GP+   I T  Q ++ Y+ N    Y  GC+ + 
Sbjct: 445 LSKSLN-KIAVIGPNAEEVKELTCRY-GPAHAPIKTVYQGIKEYLPNAEVSYAKGCNIID 502

Query: 472 --------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQ 517
                             A I++AV++AK +D  +L++G ++   +EE  R  L L GRQ
Sbjct: 503 KYFPESELYNVPLDTQEQAMINEAVELAKVSDIAILVLGGNEKTVREEFSRTSLDLCGRQ 562

Query: 518 QELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGD 577
           Q+L+  V  A  KPV+LV++ G    I +A  ++ + +I+ A +PGE    A+A+V+FGD
Sbjct: 563 QQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYVPAIVHAWFPGEFMGNAIAKVLFGD 619

Query: 578 HNPG 581
           +NPG
Sbjct: 620 YNPG 623


>gi|424661946|ref|ZP_18098983.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
           616]
 gi|404578257|gb|EKA82992.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
           616]
          Length = 814

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 176/539 (32%), Positives = 264/539 (48%), Gaps = 68/539 (12%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    E  HG   +G            T FP  I  A++++  L  ++G+ I +
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           EA A    G  IG   + P +++ RDPRW R +ET GEDP + G    + VRG QGDT  
Sbjct: 197 EASA---QGAHIG---YGPVLDLARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDT-- 248

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
              L+G+    A  KHF +Y    W         A +  ++L +   PPF   V  G A 
Sbjct: 249 ---LRGRKSVIATLKHFASY---GWTEGGHNGGTAHLGERELEEAIFPPFREAVGAG-AL 301

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P    R LL+   + +W F G++ SD  A+  + +  G A S  +A V
Sbjct: 302 SVMSSYNEIDGNPCTGSRYLLTDILKDRWLFKGFVVSDLYAIGGLRE-HGVAGSDYEAAV 360

Query: 321 DVLKAGMDVNCGS-FLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             + AG+D + G+    +   AAV++  +    +D+A+  + S++  +GLF+      PF
Sbjct: 361 KAVNAGVDSDLGTNVYAEQLVAAVRKGDVAMETVDKAVRRILSLKFHMGLFDA-----PF 415

Query: 380 --GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTL 437
              K  A +V SP H  LA + A+  IVLLKN   LLPL K    +LA+IGPNA++   +
Sbjct: 416 VDDKRPAQLVASPEHIGLAREVARQSIVLLKNEDKLLPLKKDIR-TLAVIGPNADNGYNM 474

Query: 438 LGNYAGPSCRS--ITPLQALQNYVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVV 492
           LG+Y  P      +T L+ ++  V      +Y  GC     S      A++ A+ AD VV
Sbjct: 475 LGDYTAPQADGSVVTVLEGIRQKVSKDTRVLYAKGCAVRDSSRTGFADAIEAARSADVVV 534

Query: 493 LMMG----LDQTQEKEE-------------------LDRVDLVLPGRQQELITRVAEAAK 529
           +++G     D + E EE                    DR  L L GRQ EL+  V +   
Sbjct: 535 MVVGGSSARDFSSEYEETGAAKVSANRVSDMESGEGYDRATLHLMGRQLELLEEVRKLG- 593

Query: 530 KPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
           KP++LVL+ G P  +      +   +IL A YPG  G  A+A+V+FGD+NP     ++V
Sbjct: 594 KPMVLVLIKGRP--LLMEGVIQEADAILDAWYPGMQGGNAVADVLFGDYNPAGRLTLSV 650


>gi|383114908|ref|ZP_09935668.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
 gi|382948422|gb|EIC71783.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
          Length = 782

 Score =  227 bits (578), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 165/527 (31%), Positives = 262/527 (49%), Gaps = 61/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT FP  I  AA++   L   +GQ I  
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R+        G   + P +++ RDPRW R +ET GEDP+++G    S V     D   
Sbjct: 177 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMV-----DGLG 225

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
           GG L  K    A  KHF AY +        Y   A V ++DL   + PPF   +  G A 
Sbjct: 226 GGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDSG-AL 281

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++GIP  ++  LL++  R +W F G++ SD  ++  I+++   A + E+A +
Sbjct: 282 SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVALTKENAAI 341

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             + AG+DV+ G     +   AV+  ++ ++ ID A+  +  ++  +GLF  +P + P  
Sbjct: 342 QSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRVLRMKFEMGLFE-HPYVDP-- 398

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           KI A  V    H  LA + AQ  I LLKN + +LPL K+ +  +A+IGPNA++   +LG+
Sbjct: 399 KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSKTIN-KVAVIGPNADNRYNMLGD 457

Query: 441 YAGPSCRS--ITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P   S   T L  +   +      Y  GC     +   I++A++ A+ ++ V++++G
Sbjct: 458 YTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVG 517

Query: 497 LDQTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
               ++                        E  DR  L L GRQQEL+  + +   KP+I
Sbjct: 518 GSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTG-KPLI 576

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P++  +A       ++L A YPG+ G  A+A+V+FGD+NP
Sbjct: 577 VVYIEGRPLEKNWA--SEYADALLTAYYPGQEGGNAIADVLFGDYNP 621


>gi|383641936|ref|ZP_09954342.1| glycoside hydrolase family protein [Streptomyces chartreusis NRRL
           12338]
          Length = 981

 Score =  227 bits (578), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 165/460 (35%), Positives = 237/460 (51%), Gaps = 48/460 (10%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
            P T   PF    LP+++R  DL+SRLT+DE+I  L   +P +PRLG+  ++  +E LHG
Sbjct: 9   EPPTTDLPFRDPALPLARRIDDLISRLTVDERIELLYQYSPGVPRLGLAPFKTGTEGLHG 68

Query: 97  VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNA----GQAI 152
           V+ +G           AT FPQ +   A++D  L + +  A+G EARA +      G   
Sbjct: 69  VSWLGP----------ATVFPQAVGLGATWDEELLHEVATAVGTEARAFHQRPAADGHRP 118

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
           G+  WAP +N+ RDPRWGR +E   EDPL+TG+ +VSY RG+ GD          L+ + 
Sbjct: 119 GLQVWAPVVNLLRDPRWGRNEEGYAEDPLLTGRLSVSYCRGLAGDH------PVYLRTAP 172

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQ-DLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
             KHF AY+ ++ +  T      RV  + DLA      F   +  G A+G+M AYN +NG
Sbjct: 173 LLKHFLAYNNEDERDITSSVVPPRVLHEYDLA-----AFRPAIASGAATGVMPAYNLING 227

Query: 272 IPSCADRNLLSKTARRQW----GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
            P C    LL    RR W    G   ++ SD  AV+ + ++E Y      +    ++AG+
Sbjct: 228 RP-CHVSPLLETEVRR-WAEPTGHELFVCSDAGAVTNLAESEHYFDDHPTSHAAAVRAGV 285

Query: 328 DV-----NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           D        GS      +AAV +  L E +IDRA     S+R RLG F  +P + P+  I
Sbjct: 286 DSITELDEDGSITLGRLRAAVDRGLLTEDDIDRAARRHLSIRFRLGEF--DPDLDPYAGI 343

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN-Y 441
             D V SPAH+ LAL+AA + +VLLKN  G LPL  +    +AL+GP A+   TL  + Y
Sbjct: 344 RDDAVDSPAHRALALRAATESVVLLKND-GALPLSPAPGRRVALVGPLAD---TLFEDWY 399

Query: 442 AGPSCRSITPLQALQNYVE----NTVYYPGCDTVACSSAS 477
           +G     IT    L+  ++      V   G D +   +AS
Sbjct: 400 SGTMPYRITIAAGLETALKERGAEAVRAEGVDRITLRAAS 439


>gi|386847378|ref|YP_006265391.1| beta-glucosidase [Actinoplanes sp. SE50/110]
 gi|359834882|gb|AEV83323.1| beta-glucosidase [Actinoplanes sp. SE50/110]
          Length = 966

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 163/455 (35%), Positives = 238/455 (52%), Gaps = 32/455 (7%)

Query: 41  ETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGV 100
           E++PF   TL +  R  DLV RLTLDEK+S L    PAIPRLGI  ++  +EALHGVA  
Sbjct: 28  ESYPFRDPTLALPARVDDLVGRLTLDEKLSLLHQYQPAIPRLGIAVFKSGTEALHGVAWS 87

Query: 101 GKGIFFNG---TIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTF 156
               + NG   T   AT FPQ +  A+S+D  L  ++G A+G EAR L+     + G+  
Sbjct: 88  TD--YTNGGAKTDATATVFPQAVGLASSWDPALVRQVGAAVGDEARGLHAQNPTVWGLNL 145

Query: 157 WAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKH 216
           WAP +N+ RDPRWGR +E   EDPL++G  A +Y RG+QGD  +       L  +   KH
Sbjct: 146 WAPVVNLLRDPRWGRNEEGYSEDPLLSGTVATAYGRGLQGDDPD------HLLTAPTLKH 199

Query: 217 FTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCA 276
           + AY+ +    T R +  + V  + L +  +  FE  ++   A+G+M +YN VNG P+  
Sbjct: 200 YAAYNNE----TGRDRTSSNVPQRVLNEYDRKTFEIPLRADAATGVMSSYNLVNGRPATV 255

Query: 277 DRNLLSKTARRQWGFHG-YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG-----MDVN 330
           D +L      R W     +  +D  A + +  ++GY  +  +A   ++KAG     +D  
Sbjct: 256 DPDLGGLV--RGWSDRRLFNVTDAGAPTNLTGSQGYFATQAEADAAIVKAGGDSFTVDDT 313

Query: 331 CGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSP 390
            G+      + A+ Q  L   ++D A+ +  S+R RLG F  +P   P+  I A  V SP
Sbjct: 314 DGAPTVTALRQAIDQGLLSVRQVDTAVGDALSIRFRLGEF--DPDGGPYAGIPASDVDSP 371

Query: 391 AHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSIT 450
           AH+ LA +AA   +VLLKN+ G+LPL    SV  A+ GP  ++  T    Y G     +T
Sbjct: 372 AHRQLAREAAASSMVLLKNT-GVLPLAAGGSV--AVSGPLGDTVYTDW--YGGQLPYRVT 426

Query: 451 PLQALQNYVENTVYYPGCDTVACSSASIDKAVDIA 485
           PL  +      T Y PG D +A    +  + V  A
Sbjct: 427 PLDGISAKSPAT-YAPGADRIALKDVATGRYVSAA 460



 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 43/79 (54%), Gaps = 3/79 (3%)

Query: 503 KEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYP 562
           +E  DR  L L  RQ+ LI  V  AA    ++V+    P  IT+A+   ++  I+W  + 
Sbjct: 606 REAHDRTGLGLGARQEALIEAV-RAANPRTVVVVQSSYPEAITWAQA--HVPGIVWTTHA 662

Query: 563 GEAGAVALAEVIFGDHNPG 581
           G     ALA+V++GD NPG
Sbjct: 663 GAETGHALADVLYGDVNPG 681


>gi|255690486|ref|ZP_05414161.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260623937|gb|EEX46808.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            finegoldii DSM 17565]
          Length = 1365

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 178/600 (29%), Positives = 274/600 (45%), Gaps = 103/600 (17%)

Query: 43   FPFCKTTLPISQRARDLVSRLTLDEKISQL------------------------------ 72
             P+ +  LPI +R +DL+ R+T +EK++Q+                              
Sbjct: 534  LPYQRADLPIEERVKDLLQRMTPEEKLAQIRHIHSWEIFNGQALDERKLEEKAQGMSWGF 593

Query: 73   VNSAPAIP------------------RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGAT 114
            V   P                     RLGIP +   +E+LHGV              GAT
Sbjct: 594  VEGFPLTAENCAKNMLAIQRFMVEKTRLGIPIFTV-AESLHGVVH-----------EGAT 641

Query: 115  SFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM-TFWAPNINIFRDPRWGRGQ 173
             FPQ I   ++FD+ L YR    I  E         A+GM    +P I++ RD RWGR +
Sbjct: 642  VFPQNIALGSTFDTDLAYRKTSMIADEL-------HAVGMRQVLSPCIDVVRDLRWGRVE 694

Query: 174  ETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKF 233
            E+ GEDP + G++ ++ V+G      NG         S   KH+  +  +   G      
Sbjct: 695  ESFGEDPYLCGRFGIAEVKGYMD---NG--------ISPMLKHYGPHG-NPLSGLNLASV 742

Query: 234  DARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHG 293
            +   +++DL + Y  PFE  +KQ     +M AYN  N IP+ A   LL+   R++WGF G
Sbjct: 743  E--TSIRDLHEVYLKPFEMVMKQAPTLAVMSAYNSWNRIPNSASHYLLTDVLRKEWGFKG 800

Query: 294  YITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEI 353
            Y+ SD  A+ ++ +    A++ E+A +  L AG+DV   S         +++ +L    +
Sbjct: 801  YVYSDWGAIEMLKNFHFTARNSEEAALQALTAGLDVEASSDCYPAIPGLIERGELNREIV 860

Query: 354  DRALHNLFSVRMRLGLFNGNPTMQPFG-KIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
            D A+  +   + R+GLF+      P+G K     + S     L+ + A +  VLLKN   
Sbjct: 861  DEAVRRVLYAKFRIGLFD-----DPYGEKFAKGAIHSGKAIALSKKIADESTVLLKNERQ 915

Query: 413  LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPGCD 469
            LLPL   K  S+A+IGPNA+  +     +   +   +TPLQ ++ +    V   Y  GC 
Sbjct: 916  LLPLSIGKLKSIAVIGPNADQIQFGDYTWTRDNRFGVTPLQGIRKWAGTNVKVNYAKGCS 975

Query: 470  TVACSSASIDKAVDIAKGADHVVLMMG---------LDQTQEKEELDRVDLVLPGRQQEL 520
             V+   + I +AV+ A+ +D  VL  G            +   E  D  DL L G Q  L
Sbjct: 976  LVSMDESGIRQAVEAAEQSDVCVLFCGSASAALARDYKSSTCGEGFDLNDLTLTGAQPAL 1035

Query: 521  ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            I +  +A  KPVILVL+ G P  I + K  +NI +IL   Y GE    ++A+++FG  +P
Sbjct: 1036 I-KAVQATGKPVILVLVTGKPFAIPWEK--KNIPAILVQWYAGEQSGNSIADILFGKVSP 1092


>gi|423295566|ref|ZP_17273693.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
           CL03T12C18]
 gi|392672275|gb|EIY65744.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
           CL03T12C18]
          Length = 782

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 165/527 (31%), Positives = 261/527 (49%), Gaps = 61/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT FP  I  AA++   L   +GQ I  
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R+        G   + P +++ RDPRW R +ET GEDP+++G    S V     D   
Sbjct: 177 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGASMV-----DGLG 225

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
           GG L  K    A  KHF AY +        Y   A V ++DL   + PPF   +  G A 
Sbjct: 226 GGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG-AL 281

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++GIP  ++  LL++  R +W F G++ SD  ++  I+++   A + E+A +
Sbjct: 282 SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAI 341

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             + AG+DV+ G     +   AV+  ++ ++ ID A+  +  ++  +GLF  +P + P  
Sbjct: 342 QSVMAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRVLRMKFEMGLFE-HPYVDP-- 398

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           KI A  V    H  LA + AQ  I LLKN + +LPL K  +  +A+IGPNA++   +LG+
Sbjct: 399 KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSKMIN-KVAVIGPNADNRYNMLGD 457

Query: 441 YAGPSCRS--ITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P   S   T L  +   +      Y  GC     +   I++A++ A+ ++ V++++G
Sbjct: 458 YTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVG 517

Query: 497 LDQTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
               ++                        E  DR  L L GRQQEL+  + +   KP+I
Sbjct: 518 GSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTG-KPLI 576

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P++  +A       ++L A YPG+ G  A+A+V+FGD+NP
Sbjct: 577 VVYIEGRPLEKNWA--SEYADALLTAYYPGQEGGNAIADVLFGDYNP 621


>gi|359473580|ref|XP_003631325.1| PREDICTED: protein BRASSINOSTEROID INSENSITIVE 1-like [Vitis
           vinifera]
          Length = 785

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 121/289 (41%), Positives = 180/289 (62%), Gaps = 8/289 (2%)

Query: 294 YITSDCDAVSIIYDAEGYA-KSPEDAVVDVLKAGMDVNCGSFLQKHTKAAVKQKKLPESE 352
           YI SDC  + +I D + Y  +S  DAV   L+AG+D+ CG +       +V   K+ + E
Sbjct: 10  YIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYE 69

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHG 412
           +DRAL N++ + MR+G F+G P  +    +G   +C+  H  LA +AA+ GIVLLKN + 
Sbjct: 70  LDRALKNIYVLLMRVGYFDGIPAYE---SLGLKDICAADHIELAREAARQGIVLLKNDYE 126

Query: 413 LLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTVYYPGCDTVA 472
           +LPL   K   L L+GP+AN+ + ++GNYAG   + ++PL+A    + N  Y  GC   +
Sbjct: 127 VLPLKPGKK--LVLVGPHANATEVMIGNYAGLPYKYVSPLEAFS-AIGNVTYATGCLDAS 183

Query: 473 CSSAS-IDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKP 531
           CS+ +   +A + AK A+  ++ +G D + E E +DRVD +LPG Q ELI +VAE +  P
Sbjct: 184 CSNDTYFSEAKEAAKFAEVTIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGP 243

Query: 532 VILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           VILV+L G  +DITFAK +  I +ILW G+PGE G  A+A+V+FG +NP
Sbjct: 244 VILVVLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNP 292


>gi|255693561|ref|ZP_05417236.1| periplasmic beta-glucosidase(Cellobiase) [Bacteroides finegoldii
           DSM 17565]
 gi|260620626|gb|EEX43497.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 800

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 185/609 (30%), Positives = 290/609 (47%), Gaps = 111/609 (18%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
           PI  R  DL+S++TL+EK  Q+     +  + +   P   W    W + +  +     G+
Sbjct: 62  PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTDGWSTEIWKDGIGNIDEQANGL 121

Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
           GK                             GI   F N  IRG     AT FP      
Sbjct: 122 GKFGSEISYPYANSVKNRHTVQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 181

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L   I +    EA+AL       G T  +AP ++I +DPRWGR  E+ GEDP +
Sbjct: 182 ATWNKKLIREIAKVTADEAKAL-------GYTNIYAPILDIAQDPRWGRVVESYGEDPYL 234

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
            G+     + G+Q +              A  KHF  Y +      G TR   D  V  +
Sbjct: 235 VGELGKQMILGLQSEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 281

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           ++   Y  PF   +++  A G+M +YN  +G P     + L++  R+QWGF GY+ SD +
Sbjct: 282 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDSE 341

Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
           AV  ++       + E+    V+ AG+++         F+    + A+ + K+    +D+
Sbjct: 342 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAISEGKVSLHTLDQ 400

Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-----VVCSPAHQVLALQAAQDGIVLLKNS 410
            +  +  V+  +GLF+ NP        G D     VV + AHQ ++++AA + IVLLKN 
Sbjct: 401 RVGEILRVKFMMGLFD-NPYP------GDDRRPEVVVHNAAHQDVSMRAALESIVLLKNE 453

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVEN--TVYYPG 467
             +LPL KS S  +A+IGPNA   K L   Y GP+  SI T  Q ++ Y+ N    Y  G
Sbjct: 454 KEMLPLSKSFS-KIAVIGPNAEEVKELTCRY-GPANASIKTVYQGIKEYLPNAEVRYAKG 511

Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
           CD +                   A I++AV++AK +D  +L++G ++   +EE  R +L 
Sbjct: 512 CDIIDKYFPESELYNVPLDTQEQAMINEAVELAKASDVAILVLGGNEKTVREEFSRTNLD 571

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           L GRQQ+L+  V  A  KPV+LV++ G    I +A  ++ + +I+ A +PGE    A+A+
Sbjct: 572 LCGRQQQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIAK 628

Query: 573 VIFGDHNPG 581
           V+FGD+NPG
Sbjct: 629 VLFGDYNPG 637


>gi|313204584|ref|YP_004043241.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443900|gb|ADQ80256.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 727

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 177/570 (31%), Positives = 284/570 (49%), Gaps = 58/570 (10%)

Query: 39  STETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA 98
           S  TFPF  T LP ++R  +L+S +TLDEK++ L  +   +PRLGI      SE LHG+A
Sbjct: 21  SQTTFPFQNTGLPDNERLDNLLSLMTLDEKVNALSTNL-GVPRLGI-RNTGHSEGLHGMA 78

Query: 99  GVGKGIFFNGTIRG------ATSFPQVILTAASFDSYLWYRIGQAIGLEARAL-YNAG-Q 150
             G G  + G+ RG       T FPQ      ++D+ L  ++      E R    NA  Q
Sbjct: 79  LGGPG-NWGGSERGVAKTYPTTIFPQAYGLGETWDTELIQKVADIEATEIRFYAQNANLQ 137

Query: 151 AIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQA 210
             GM   APN ++ RDPRWGR +E+ GED  +  +  V++V+G+QG+           ++
Sbjct: 138 KGGMVMRAPNADLARDPRWGRTEESYGEDAFLGSRLTVAFVKGLQGND------PKYWKS 191

Query: 211 SACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVN 270
           ++  KHF A   ++ + +T   FD R+      + Y  PF   + +G +   M +YN  N
Sbjct: 192 ASLMKHFLANSNEDGRDSTSSNFDERL----FREYYSFPFYKGITEGGSRAFMASYNAWN 247

Query: 271 GIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVN 330
           G+P   +  +L K AR +WG +G I +D  A+S++ +A     +  +    V+KA    +
Sbjct: 248 GVPMTVNP-ILKKIARDEWGNNGIICTDGGALSLLVNAHHAFPTLTEGAAAVVKA----S 302

Query: 331 CGSFL---QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVV 387
            G FL   + +   A+K+  L E  ID  +   F V ++LGL + + +  P+  IG    
Sbjct: 303 VGQFLDNFRSYIYEALKKGLLTEKNIDNVIRGNFYVALKLGLLDADQSKVPYTGIGVTDT 362

Query: 388 CSPAH----QVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG 443
            SP +    +    +     +VLLKN+ GLLPL KSK  S+A+IGP AN  + LL  Y+G
Sbjct: 363 VSPWNKQDTKAFVRKVTAKSVVLLKNTAGLLPLNKSKIKSIAVIGPRAN--EVLLDWYSG 420

Query: 444 PSCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLD----- 498
               +++ LQ ++N V   +     +     S  +DKA   A+ AD  ++ +G       
Sbjct: 421 TPPYAVSILQGIKNAVGKDI-----EVFYAPSDEMDKATLAARKADVAIVCVGNHPYGTD 475

Query: 499 --------QTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYD 550
                    +  +E +DR  + L   Q++L+  V +A  K  ++VL+   P  I +++  
Sbjct: 476 ARWKISPVPSDGREAVDRKSITL--EQEDLVKLVMQANPK-TVMVLVSNFPFAINWSQ-- 530

Query: 551 RNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
            N+ +IL      +     LA+VIFGD +P
Sbjct: 531 ENVPAILHVTNNSQELGNGLADVIFGDVSP 560


>gi|423279982|ref|ZP_17258895.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
           610]
 gi|404584318|gb|EKA88983.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
           610]
          Length = 812

 Score =  226 bits (576), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 190/637 (29%), Positives = 293/637 (45%), Gaps = 112/637 (17%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA--PAIPRLG----------- 83
           N + +   +   + P+ +R   L+S++TL+EK+ Q++ S   P   R+G           
Sbjct: 39  NKNGQMDTYENPSAPVEERVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEEIRLTARLEK 98

Query: 84  -IPAYE------------WWSEALHGVAGVGKGIFFNGTIR------------------- 111
            I  Y             W    LH   G+   +    + R                   
Sbjct: 99  EISEYHIGALWGFMRADPWTQRTLH--TGLNPSLAARASNRLQAFVMEHSRLGIPLFLAE 156

Query: 112 ---------GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
                    G T FP  I  A++++  L  ++G+ I +EA A    G  IG   + P ++
Sbjct: 157 ECPHGHMAIGTTVFPTSIGQASTWNPELIRQMGRVIAIEASA---QGAHIG---YGPVLD 210

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
           + RDPRW R +ET GEDP + G    + VRG QGDT     L+G+    A  KHF +Y  
Sbjct: 211 LARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDT-----LRGRKSVIATLKHFASY-- 263

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
             W         A +  ++L +   PPF   V  G A  +M +YN ++G P    R LL+
Sbjct: 264 -GWTEGGHNGGTAHLGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLT 321

Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKA 341
                +W F G++ SD  A+  + +  G A S  +A V  + AG+D + G+    +   A
Sbjct: 322 DILEDRWLFKGFVVSDLYAIGGLRE-HGVAGSDYEAAVKAVNAGVDSDLGTNVYAEQLVA 380

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF--GKIGADVVCSPAHQVLALQA 399
           AV++  +    +D+A+  + S++  +GLF+      PF   K  A +V SP H  LA + 
Sbjct: 381 AVRKGDVAMETVDKAVRRILSLKFHMGLFDA-----PFVDDKRPAQLVASPEHIGLAREV 435

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS--ITPLQALQN 457
           A+  IVLLKN   LLPL K    +LA+IGPNA++   +LG+Y  P      +T L+ ++ 
Sbjct: 436 ARQSIVLLKNEDKLLPLKKDIR-TLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQ 494

Query: 458 YV--ENTVYY-PGCDTVACSSASIDKAVDIAKGADHVVLMMG----LDQTQEKEE----- 505
            V  +  V+Y  GC     S      A++ A+ AD VV+++G     D + E EE     
Sbjct: 495 KVSKDTRVFYAKGCAVRDSSRTGFADAIESARSADVVVMVVGGSSARDFSSEYEETGAAK 554

Query: 506 --------------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
                          DR  L L GRQ EL+  V +   KP++LVL+ G P  +      +
Sbjct: 555 VSANRVSDMESGEGYDRATLHLMGRQLELLEEVRKLG-KPMVLVLIKGRP--LLMEGVIQ 611

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
              +IL A YPG  G  A+A+V+FGD+NP     ++V
Sbjct: 612 EADAILDAWYPGMQGGNAVADVLFGDYNPAGRLTLSV 648


>gi|237718534|ref|ZP_04549015.1| beta-xylosidase [Bacteroides sp. 2_2_4]
 gi|229452241|gb|EEO58032.1| beta-xylosidase [Bacteroides sp. 2_2_4]
          Length = 835

 Score =  226 bits (576), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 149/423 (35%), Positives = 223/423 (52%), Gaps = 39/423 (9%)

Query: 49  TLPI-SQRARDLVSR----LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKG 103
           TLP+ +Q  R+ V +    L++++KI  L    P +P+ G+  Y+WWSE LHGVA  GK 
Sbjct: 10  TLPLWAQPHREGVQQKLQSLSVEQKIDLLCARYPGVPQEGLAKYDWWSECLHGVARAGK- 68

Query: 104 IFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN--------AGQAIGMT 155
                    AT FP+ I   +++D+ L + IG AIG EARA ++        + +  G+T
Sbjct: 69  ---------ATVFPKPIGMGSTWDAELIHCIGIAIGDEARAKHHQDVREHGYSDRHYGLT 119

Query: 156 FWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCK 215
           F++P +NI RDPRWGR  E   EDPL+T    VS+++G+QGD          L+  A  K
Sbjct: 120 FFSPTLNIARDPRWGRTTECFSEDPLLTSDMGVSFIKGMQGDD------PYYLKTVATAK 173

Query: 216 HFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSC 275
           HF A + +N     R    A V    L + Y P F+  V +G    IM AYN +NGIP C
Sbjct: 174 HFVANNEEN----RRLGGSAIVDEVSLREYYFPAFKDAVVRGYVKSIMGAYNALNGIPCC 229

Query: 276 ADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFL 335
           A+  LL+   R++WGF G + SD  A+  +Y    Y  + E+     L+AG D++     
Sbjct: 230 ANSMLLTDVLRKEWGFEGVVISDGSAIDKLYTHHKYVPNLEEGAALALRAGCDMSLRDEY 289

Query: 336 QKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVL 395
           +   K A  +  + E +++ A+  +  +R RLGL  G     P+      VV    HQ L
Sbjct: 290 RPGLKKAYHKGLITEHDLNVAVARVLDLRARLGLDKGTDQGNPYKNTPYSVVECERHQQL 349

Query: 396 ALQAAQDGIVLLKNSHGLLPLP---KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPL 452
           AL+AA+  ++LLKN  G+LPL     ++ +++ALIG    S     G+Y+G    + T L
Sbjct: 350 ALEAAEKSMILLKNE-GVLPLKPDASNRKLNIALIGDAFKS--VYYGDYSGMPEYNRTLL 406

Query: 453 QAL 455
           +A 
Sbjct: 407 EAF 409


>gi|393787408|ref|ZP_10375540.1| hypothetical protein HMPREF1068_01820 [Bacteroides nordii
           CL02T12C05]
 gi|392658643|gb|EIY52273.1| hypothetical protein HMPREF1068_01820 [Bacteroides nordii
           CL02T12C05]
          Length = 764

 Score =  226 bits (576), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 170/488 (34%), Positives = 254/488 (52%), Gaps = 45/488 (9%)

Query: 111 RGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNINIFRDPRWG 170
           RG T +P  I  A+SF+  L   IG+   +E R+       +  TF  PNI + RD RWG
Sbjct: 138 RGCTVYPTSIGMASSFNPVLLKEIGRQTAVEMRS-----SGVHWTF-NPNIELARDARWG 191

Query: 171 RGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTR 230
           R  ET GED  +  +   + + G+QG+  NG    G L   AC KHF     +   G   
Sbjct: 192 RVGETFGEDTYLVTQMGTALILGLQGE--NGFDGSGVL---ACAKHFVGGG-EPAGGINA 245

Query: 231 YKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWG 290
              D  ++ Q L D Y  PF   + +   + +M A+N +NG+P  A+  LL +  R + G
Sbjct: 246 APMD--MSEQKLRDLYLSPFAEAINKAYVATVMPAHNELNGVPCHANHYLLQEILRNELG 303

Query: 291 FHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNC-GSFLQKHTKAAVKQKKLP 349
           F G++ SD   +  +++   YA S E+A    +KAG+D++  G    +    AV+ K +P
Sbjct: 304 FQGFVISDWMDIERLHEMHHYAPSQEEAFRMAVKAGVDMHMQGDGFLEAIVEAVRNKYIP 363

Query: 350 ESEIDRALHNLFSVRMRLGLFNGNPTMQ-PFGKIGADVVCSPAHQVLALQAAQDGIVLLK 408
           E+ ID A++ +   + RLGLF  NP +  P  +    ++ +  HQ  AL+AA+  IVLLK
Sbjct: 364 ETRIDLAVYKILEAKFRLGLFE-NPLVDIPASR---SLIYTEDHQATALEAARQSIVLLK 419

Query: 409 NSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAG--PSCRSITPLQALQNYVENTVYYP 466
           N + LLPL + +   + + GPNANS  T++G++    P    IT L  +Q  V + V   
Sbjct: 420 NDNYLLPLKQGRYKKILVTGPNANSP-TIMGDWTTRQPEENVITVLAGIQQQVPDAV--- 475

Query: 467 GCDTVACSSASIDK--------AVDIAKGADHVVLMMGLDQTQEK------EELDRVDLV 512
             DTV C S  I K        A   A  AD  ++++G +  +        E  DR +L 
Sbjct: 476 -IDTV-CFSNKIRKMDRSLIKTAAQKAVEADINIVVVGENSERYNSDRTCGENCDRDNLE 533

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           LP  QQEL+  V  A+ KPVILVLL G P+ +T+A+  ++I +I+ A  PG  G  A+AE
Sbjct: 534 LPTHQQELLEAVY-ASGKPVILVLLNGRPLSVTWAQ--QHIPAIVEAWEPGGMGGRAIAE 590

Query: 573 VIFGDHNP 580
           ++FG  NP
Sbjct: 591 ILFGKVNP 598


>gi|383115617|ref|ZP_09936373.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
 gi|313694979|gb|EFS31814.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
          Length = 946

 Score =  226 bits (576), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 184/608 (30%), Positives = 285/608 (46%), Gaps = 101/608 (16%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WS-------EALHGV 97
           P+  R  DL+ ++TL+EK  Q+V       + +  +P  EW    W        E L+G 
Sbjct: 59  PVDARIEDLLKQMTLEEKTCQMVTLYGYKRVLKDDLPTPEWKNQLWKDGIGAIDEHLNGF 118

Query: 98  AGVGK-------------------------------GI---FFNGTIRG-----ATSFPQ 118
              G                                GI   F N  IRG     AT+FP 
Sbjct: 119 QQWGLPPSDNEYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNFPT 178

Query: 119 VILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPG 177
            +    +++  L  ++G   G EAR L       G T  +AP +++ RD RWGR +E  G
Sbjct: 179 QLGLGHTWNRELIRQVGVITGREARML-------GYTNVYAPILDVGRDQRWGRYEEVYG 231

Query: 178 EDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARV 237
           E P +  +  +  VRG+Q D           Q +A  KHF AY  +        + D ++
Sbjct: 232 ESPYLVAELGIEMVRGMQQD----------YQVAATGKHFIAYSNNKGGREGMSRVDPQM 281

Query: 238 TMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITS 297
           + +++   +  PF+  +++    G+M +YN  +G P  +    L+   R + GF GY+ S
Sbjct: 282 SPREVEMVHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVS 341

Query: 298 DCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPESE 352
           D DAV  +Y     AK  ++AV   ++AG++V C +F    +     +  VK+  L E  
Sbjct: 342 DSDAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRC-TFRSPDSYVLPLRELVKEGGLSEEV 400

Query: 353 IDRALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-VVCSPAHQVLALQAAQDGIVLLKNSH 411
           I+  + ++  V+  +GLF+ +P        GAD  V    ++ +ALQA+++ IVLLKN  
Sbjct: 401 INDRVRDILRVKFLVGLFD-HPYQTDLK--GADEEVEKAENEEVALQASRESIVLLKNDQ 457

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENTV---YYPGC 468
            +LPL  S    +A+ GPNA+     LG+Y   +    + L+ +Q   +  V   Y  GC
Sbjct: 458 DVLPLDISGIKKIAVCGPNADECSYALGHYGPLAVEVTSVLKGIQEKTDGKVEVLYSKGC 517

Query: 469 DTVACS---------------SASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           + V  +                  ID+AV  AK AD  V+++G  Q    E   R  L L
Sbjct: 518 ELVDANWPESELIDFPLTEEEQKEIDRAVSQAKEADVAVVVLGGGQRTCGENKSRSSLDL 577

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
           PGRQ +L+  V  A  KPV+LVL+ G P+ I +A  D+ + +IL A YPG  G  A+A+V
Sbjct: 578 PGRQLDLLKAVV-ATGKPVVLVLINGRPLSINWA--DKFVPAILEAWYPGAKGGKAVADV 634

Query: 574 IFGDHNPG 581
           +FGD+NPG
Sbjct: 635 LFGDYNPG 642


>gi|255618512|ref|XP_002539944.1| beta-glucosidase, putative [Ricinus communis]
 gi|223500859|gb|EEF22439.1| beta-glucosidase, putative [Ricinus communis]
          Length = 256

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 122/274 (44%), Positives = 158/274 (57%), Gaps = 27/274 (9%)

Query: 63  LTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILT 122
           +TLDEKI QL+N APAIPRL IPAY WW+E+LHG  G              T+FP+ I  
Sbjct: 1   MTLDEKIDQLLNVAPAIPRLNIPAYNWWTESLHGALGP----------LPTTNFPEPIGL 50

Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQ--------AIGMTFWAPNINIFRDPRWGRGQE 174
           AASFD+ L  ++  AI  E RAL+  G+          G+  W+PNINIFRDPRWGRGQE
Sbjct: 51  AASFDAPLVRQVAAAISTEVRALHTLGRETGRLGRIGTGLDTWSPNINIFRDPRWGRGQE 110

Query: 175 TPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFD 234
           T GEDP +T    V+++ G+QG      +L   +   A  KHF  +   +   +TR+  D
Sbjct: 111 TYGEDPHLTAALGVAFIHGIQG---GNPELPDVI---ATPKHFAVH---SGPESTRHVAD 161

Query: 235 ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGY 294
             V+  DL DTY P F + +   +A  IMCAYNR+NG P+C    L+ +  R  WGF GY
Sbjct: 162 VFVSAHDLEDTYLPAFRAAIVDAQAGSIMCAYNRINGQPACGSELLMKQHLRGAWGFKGY 221

Query: 295 ITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMD 328
           + SDCDAV+ I +   YA  P  AV   L+ G D
Sbjct: 222 VVSDCDAVTDISEQHKYATDPAAAVAVALRTGTD 255


>gi|313145353|ref|ZP_07807546.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
 gi|313134120|gb|EFR51480.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
          Length = 802

 Score =  226 bits (575), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 188/637 (29%), Positives = 291/637 (45%), Gaps = 112/637 (17%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSA--PAIPRLG----------- 83
           N + +   +   ++P+ +R   L+S++TL+EK+ Q++ S   P   R+G           
Sbjct: 29  NKNGQMDTYENPSVPVEERVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEEIRLTARLEK 88

Query: 84  -IPAYE------------WWSEALHGVAGVGKGIFFNGTIR------------------- 111
            I  Y             W    LH   G+   +    + R                   
Sbjct: 89  EISEYHIGALWGFMRADPWTQRTLH--TGLNPSLAARASNRLQAFVMEHSRLGIPLFLAE 146

Query: 112 ---------GATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGMTFWAPNIN 162
                    G T FP  I  A++++  L  ++G+ I  EA A    G  IG   + P ++
Sbjct: 147 ECPHGHMAIGTTVFPTSIGQASTWNPELIRQMGRVIATEASA---QGAHIG---YGPVLD 200

Query: 163 IFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL 222
           + RDPRW R +ET GEDP + G    + VRG QGDT     L+G+    A  KHF +Y  
Sbjct: 201 LARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDT-----LRGRKSVIATLKHFASY-- 253

Query: 223 DNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLS 282
             W         A +  ++L +   PPF   V  G A  +M +YN ++G P    R LL+
Sbjct: 254 -GWTEGGHNGGTAHLGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLT 311

Query: 283 KTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGS-FLQKHTKA 341
              + +W F G++ SD  A+  + +  G A S  +A V  + AG+D + G+    +   A
Sbjct: 312 DILKDRWQFKGFVVSDLYAIGGLRE-HGVAGSDYEAAVKAVNAGVDSDLGTNVYAEQLVA 370

Query: 342 AVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF--GKIGADVVCSPAHQVLALQA 399
           AV++  +    +D+A+  +  ++  +GLF+      PF   K  A +V SP H  LA + 
Sbjct: 371 AVRKGDVAMETVDKAVRRILFLKFHMGLFDA-----PFVDDKRPAQLVASPEHIGLAREV 425

Query: 400 AQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRS--ITPLQALQN 457
           A+  IVLLKN   LLPL K    +LA+IGPNA++   +LG+Y  P      +T L+ ++ 
Sbjct: 426 ARQSIVLLKNEDKLLPLKKDIR-TLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQ 484

Query: 458 YVEN---TVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG----LDQTQEKEE----- 505
            V      +Y  GC     S      A++ A+ AD VV+++G     D + E EE     
Sbjct: 485 KVSKDTRVLYAKGCAVRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAK 544

Query: 506 --------------LDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDR 551
                          DR  L L GRQ EL+  V +   KP++LVL+ G P  +      +
Sbjct: 545 VSANRVSDMESGEGYDRATLHLMGRQLELLEEVRKLG-KPMVLVLIKGRP--LLMEGVIQ 601

Query: 552 NIGSILWAGYPGEAGAVALAEVIFGDHNPGEFTVIAV 588
              +IL A YPG  G  A+A+V+FGD+NP     ++V
Sbjct: 602 EADAILDAWYPGMQGGNAVADVLFGDYNPAGRLTLSV 638


>gi|189468349|ref|ZP_03017134.1| hypothetical protein BACINT_04746 [Bacteroides intestinalis DSM
           17393]
 gi|189436613|gb|EDV05598.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 786

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 175/600 (29%), Positives = 281/600 (46%), Gaps = 93/600 (15%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
           PI  R +DL+S++TL+EK  Q+     +  + +  +P  +W    W + +  +     G+
Sbjct: 48  PIEARVQDLLSQMTLEEKTCQMATLYGSGRVLKDSLPTEKWKDEIWKDGIANIDEQANGL 107

Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
           G+                             GI   F N  IRG     AT FP      
Sbjct: 108 GRFGSSLSYPYVNSVENRQTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 167

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L   I Q    EA+AL       G T  ++P ++I +DPRWGR  E  GEDP +
Sbjct: 168 ATWNKELISEIAQVTAEEAKAL-------GYTNIYSPILDIAQDPRWGRVVECYGEDPFL 220

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            G+     ++G+Q +              A  KHF  Y +         + D  V  +++
Sbjct: 221 VGELGKRMIKGLQQEGL-----------VATPKHFAVYSIPVGGRDAGTRTDPHVAPREM 269

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
              Y  PF     +  A G+M +YN  +G P     + L++  R +WGF GY+ SD +AV
Sbjct: 270 RTLYIEPFRKAFCEAGALGVMSSYNDYDGEPITGSYHFLTEILRHEWGFKGYVVSDSEAV 329

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH----TKAAVKQKKLPESEIDRALH 358
             +Y     A    D    V+ AG++V     L ++     + A+ + K+    ID  + 
Sbjct: 330 EFLYSKHQVAVDAVDGAAQVVNAGLNVRTNFTLPENFIRPLRQAISEGKVSMQTIDSRVA 389

Query: 359 NLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPK 418
           ++  V+  +GLF+ NP  +   K    VV S  HQ ++++AA + IVLLKN + +LPL K
Sbjct: 390 DVLRVKFGMGLFD-NP-YKGDAKHPEKVVHSKEHQAVSMRAALESIVLLKNENNILPLSK 447

Query: 419 SKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVEN--TVYYPGCDTV----- 471
                +A+IGPNAN  + L+  Y   +    T  Q ++ Y+ +    Y  G D +     
Sbjct: 448 DLK-KIAVIGPNANEVQNLICRYGPANAPIKTVYQGIKEYLPDAEVRYAKGTDIIDKYFP 506

Query: 472 ----------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQELI 521
                         A +D+AV +A+ +D  ++++G ++   +EE  R +L L GRQ++L+
Sbjct: 507 ESELYEVPLDQEEQAMMDEAVTLAEESDVAIMVLGGNEKTVREEYSRTNLDLCGRQEKLL 566

Query: 522 TRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
             V  A  KPVIL+L+ G    I +A  +R I  I+ A +PGE    A+A+V+FGD+NPG
Sbjct: 567 QAVY-ATGKPVILLLVDGRVATINWA--ERYIPGIVHAWFPGEFMGDAVAQVLFGDYNPG 623


>gi|423300893|ref|ZP_17278917.1| hypothetical protein HMPREF1057_02058 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472228|gb|EKJ90756.1| hypothetical protein HMPREF1057_02058 [Bacteroides finegoldii
           CL09T03C10]
          Length = 798

 Score =  225 bits (574), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 184/609 (30%), Positives = 290/609 (47%), Gaps = 111/609 (18%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
           PI  R  DL+S++TL+EK  Q+     +  + +   P   W    W + +  +     G+
Sbjct: 60  PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTDGWSTEIWKDGIGNIDEQANGL 119

Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
           GK                             GI   F N  IRG     AT FP      
Sbjct: 120 GKFGSEISYPYANSAKNRHTVQRWFVEKTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 179

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L   I +    EA+AL       G T  +AP ++I +DPRWGR  E+ GEDP +
Sbjct: 180 ATWNKKLIREIAKVTADEAKAL-------GYTNIYAPILDIAQDPRWGRVVESYGEDPYL 232

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
            G+     + G+Q +              A  KHF  Y +      G TR   D  V  +
Sbjct: 233 VGELGKQMILGLQNEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 279

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           ++   Y  PF   +++  A G+M +YN  +G P     + L++  R+QWGF GY+ SD +
Sbjct: 280 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDSE 339

Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
           AV  ++       + E+    V+ AG+++         F+    + A+ + K+    +D+
Sbjct: 340 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAISEGKVSLHTLDQ 398

Query: 356 ALHNLFSVRMRLGLFNGNPTMQPFGKIGAD-----VVCSPAHQVLALQAAQDGIVLLKNS 410
            +  +  V+  +GLF+ NP        G D     VV + AHQ ++++AA + IVLLKN 
Sbjct: 399 RVGEILRVKFMMGLFD-NPYP------GDDRRPEVVVHNAAHQDVSMRAALESIVLLKNE 451

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVEN--TVYYPG 467
             +LPL KS +  +A+IGPNA   K L   Y GP+  SI T  Q ++ Y+ N    Y  G
Sbjct: 452 KEMLPLSKSFN-KIAVIGPNAEEVKELTCRY-GPANASIKTVYQGIKEYLPNAEVRYAKG 509

Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
           CD +                   A I++AV++AK +D  +L++G ++   +EE  R +L 
Sbjct: 510 CDIIDKYFPESELYNVPLDTQEKAMINEAVELAKASDVAILVLGGNEKTVREEFSRTNLD 569

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           L GRQQ+L+  V  A  KPV+LV++ G    I +A  ++ + +I+ A +PGE    A+A+
Sbjct: 570 LCGRQQQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIAK 626

Query: 573 VIFGDHNPG 581
           V+FGD+NPG
Sbjct: 627 VLFGDYNPG 635


>gi|299146513|ref|ZP_07039581.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298517004|gb|EFI40885.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 736

 Score =  225 bits (574), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 165/527 (31%), Positives = 260/527 (49%), Gaps = 61/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G            T FP  I  AA++   L   +GQ I  
Sbjct: 83  RLGIPMF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPELVKEVGQVIAK 130

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R+        G   + P +++ RDPRW R +ET GEDP+++G    S V     D   
Sbjct: 131 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMV-----DGLG 179

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
           GG L  K    A  KHF AY +        Y   A V ++DL   + PPF   +  G A 
Sbjct: 180 GGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG-AL 235

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++GIP  ++  LL+K  R +W F G++ SD  ++  I+++   A + E+A +
Sbjct: 236 SVMTSYNSIDGIPCTSNHYLLTKLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAI 295

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             + AG+DV+ G     +   AV+  ++ ++ ID A+  +  ++  +GLF  +P + P  
Sbjct: 296 QSVMAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRVLRMKFEMGLFE-HPYVDP-- 352

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           KI A  V    H  LA + AQ  I LLKN + +LPL K  +  +A+IGPNA++   +LG+
Sbjct: 353 KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSKMIN-KVAVIGPNADNRYNMLGD 411

Query: 441 YAGPSCRS--ITPLQALQNYV--ENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P   S   T L  +   +      Y  GC     +   I++A++ A+ ++ V++++G
Sbjct: 412 YTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDTTVNEIEQAIEAARRSEVVIVVVG 471

Query: 497 LDQTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
               ++                        E  DR  L L GRQQEL+  + +   KP+I
Sbjct: 472 GSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTG-KPLI 530

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P++  +A       ++L A YPG+ G  A+A+V+FGD+NP
Sbjct: 531 VVYIEGRPLEKNWA--SEYADALLTAYYPGQEGGNAIADVLFGDYNP 575


>gi|427411073|ref|ZP_18901275.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425710258|gb|EKU73280.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 791

 Score =  225 bits (573), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 176/526 (33%), Positives = 261/526 (49%), Gaps = 59/526 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP   +  E LHG A VG           ATSFPQ I  A+S+D  +  ++ Q I  
Sbjct: 138 RLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPAMLRQVNQVIAR 185

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E RA     + + M   +P ++I RDPRWGR +ET GEDP + G+  V+ V G+QG    
Sbjct: 186 EIRA-----RGVPMVL-SPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQG--VG 237

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
             +        A  KH T +      GT      A V+ ++L + + PPFE  VK+    
Sbjct: 238 RSRTLQPNHVFATLKHLTGHGQPE-SGTN--IGPAPVSERELRENFFPPFEQVVKRTGIE 294

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G+PS A+R LL    R++WGF G + SD  AV  +      A + E+A +
Sbjct: 295 AVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQLMSIHHIAANLEEAAM 354

Query: 321 DVLKAGMDVNCGSFLQKHTKAA-VKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPF 379
             L AG+D +    L   T    V++ K+ E+++D A+  +  ++ R GLF  NP     
Sbjct: 355 RALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLFE-NPYAD-- 411

Query: 380 GKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLG 439
               A +  +   + LA  AAQ  I LLKN  G+LPL    ++  A+IGP+A  A+  LG
Sbjct: 412 ANAAAAITNNEDARALARTAAQRSITLLKND-GMLPLKPEGTI--AVIGPSAAVAR--LG 466

Query: 440 NYAGPSCRSITPLQALQNYV---ENTVYYPG----------CDTVACSSAS-----IDKA 481
            Y G    S++ L+ ++  V    N V+  G           D+V  S  +     I +A
Sbjct: 467 GYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEDDDWWADSVTKSDPAENRKLIAQA 526

Query: 482 VDIAKGADHVVLMMGLDQTQEKEEL------DRVDLVLPGRQQELITRVAEAAKKPVILV 535
           V+ A+  D ++L +G  +   +E        DR  L L G QQEL   + +A  KP+ +V
Sbjct: 527 VEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDAL-KALGKPITVV 585

Query: 536 LLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           L+ G P   +  K      +IL   Y GE G  A+A+++FGD NPG
Sbjct: 586 LINGRPA--STVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPG 629


>gi|383123909|ref|ZP_09944579.1| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
 gi|382983834|gb|EES66944.2| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
          Length = 815

 Score =  225 bits (573), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 160/528 (30%), Positives = 261/528 (49%), Gaps = 61/528 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G            T FP  I  AA++   L   +G  I  
Sbjct: 160 RLGIPLF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPVLIEEVGNVIAK 207

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R+    G  I    + P +++ RDPRW R +ET GEDP+++G+   + V G+      
Sbjct: 208 EIRS---QGAHIS---YGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVIGL-----G 256

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
            G L  +    A  KHF AY +        Y   A V  +DL + + PPF+  +  G A 
Sbjct: 257 SGDLSREYATIATLKHFLAYAVPEGGQNGNY---ASVGTRDLHENFLPPFQEAIDAG-AL 312

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++GIP  A+  LL++  R +W F G++ SD  ++  ++++   A + E+A +
Sbjct: 313 SVMTSYNSIDGIPCTANYYLLTQLLRNEWRFRGFVVSDLYSIEGVHESHFVAPTIEEAAM 372

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
            V+ AG+D++ G     +   AV+  K+ E+ ID A+  +  ++  +GLF  +P + P  
Sbjct: 373 QVVSAGVDIDLGGNAFMNLTHAVQSGKISEAVIDTAVCRVLRMKFEMGLFE-HPYVNP-- 429

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           K    VV S  H  LA + AQ  IVLLKN + +LPL K K   +A++GPNA++   +LG+
Sbjct: 430 KSATKVVRSEEHIRLAHKVAQSSIVLLKNKNSILPLNK-KIKKVAVVGPNADNRYNMLGD 488

Query: 441 YAGP----SCRSITPLQALQNYVENTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P    + +++      +       Y  GC     +   I + V+ A  ++ ++ ++G
Sbjct: 489 YTAPQEDENIKTVLDGVISKLSPSKVEYVRGCAIRDTTVNEIAEVVEAASRSEVIIAVVG 548

Query: 497 -----------------------LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
                                  +   +  E  DR  L L G+QQ+L+  + +A  KP+I
Sbjct: 549 GSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLNAL-KATGKPLI 607

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNPG 581
           +V + G P+D  +A       ++L A YPG+ G  A+A+V+FGD+NP 
Sbjct: 608 VVYIEGRPLDKVWA--SEYADALLTASYPGQEGGYAIADVLFGDYNPA 653


>gi|149826|gb|AAB36835.1| glucan-glucohydrolase [Thermobispora bispora]
          Length = 986

 Score =  225 bits (573), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 170/484 (35%), Positives = 247/484 (51%), Gaps = 43/484 (8%)

Query: 43  FPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVAGVGK 102
            PF   TLP++QR  DL+ RLTLDEKIS L    P I RLGI +++  +EALHGVA    
Sbjct: 39  LPFRDPTLPLAQRIDDLLGRLTLDEKISLLHQYQPPIERLGIKSFKTGTEALHGVAW-ST 97

Query: 103 GIFFNGTIRGA--TSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFWAP 159
            +  NG +  A  T FPQ +  A+++D  L  R+G  +G EAR  +     + G+  WAP
Sbjct: 98  DVTDNGAVVTANGTVFPQAVGLASTWDPELNQRVG-TVGEEARGFHAQNPVVWGLNLWAP 156

Query: 160 NINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTA 219
            +N+ RDPRWGR +E   EDPL+TG  A++Y  G+QGD  +       L+A+   KH+ A
Sbjct: 157 VVNLLRDPRWGRNEEGYSEDPLLTGAIAIAYGSGIQGDDPD------HLRAAPTLKHYLA 210

Query: 220 YDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRN 279
            + +  + TT      RV  +     Y+ PF + +  G A+G+M AYN VNG P+  + +
Sbjct: 211 NNNEIRRDTTSSNLPPRVKHE----YYEAPFRAAITAGAATGVMTAYNLVNGRPATVNPD 266

Query: 280 LLSKTARRQWGFHGYIT-SDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKH 338
           L      R W     +  +D  A + +  ++ Y  +  +A    LKAG+D    SF    
Sbjct: 267 LNDTV--RTWTDRDLLNVTDAGAPNNLVGSQAYFATLAEADAAALKAGID----SFTTDE 320

Query: 339 T---------KAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           T         K A+ Q  L E +ID A+ ++  +R RLG F  +P   P+ KI  DV+ S
Sbjct: 321 TNSAPTITAIKTALSQGLLTEQDIDTAVRHILGIRFRLGEF--DPDGGPYAKITPDVIDS 378

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI 449
           PAH+ LA + A   +VLLKN  G LPL   K V  A++GP A+   T    Y+G     +
Sbjct: 379 PAHRRLARETAAKAMVLLKNERGTLPLDPGKKV--AVVGPLADVLYTDW--YSGRPTYQV 434

Query: 450 TPLQALQNYVENTVYYPGCDTVA---CSSASIDKAVDIAKGADHVVLMMGLDQTQEKEEL 506
           TPL  ++    +     G D VA    ++  +    +  +GAD   L +      E E+ 
Sbjct: 435 TPLDGIRERAASVTSSEGVDRVAFKDLATGQVHHRGERPEGAD---LRLSATTIGETEQF 491

Query: 507 DRVD 510
           D  D
Sbjct: 492 DVFD 495



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 57/106 (53%), Gaps = 4/106 (3%)

Query: 476 ASIDKAVDIAKGADHVVLMMG-LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVIL 534
           + ID AV  AK AD  V+++G +     +E+ DR D+ L   Q+ L+  V  A  + V+ 
Sbjct: 592 SGIDDAVAKAKEADVAVVVVGSMPFINGREDHDRTDMNLAEGQEALVKAVFNANPRTVV- 650

Query: 535 VLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           VL    P  I +   D ++ +ILW  + G     ALA+V++GD NP
Sbjct: 651 VLENSYPTTINW--IDEHVPAILWTTHAGAETGNALADVLYGDVNP 694


>gi|160884749|ref|ZP_02065752.1| hypothetical protein BACOVA_02738 [Bacteroides ovatus ATCC 8483]
 gi|156109784|gb|EDO11529.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 800

 Score =  225 bits (573), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 183/608 (30%), Positives = 290/608 (47%), Gaps = 109/608 (17%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
           PI  R  DL+S++TL+EK  Q+     +  + +   P   W    W + +  +     G+
Sbjct: 62  PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTTGWSTEIWKDGIGNIDEQANGL 121

Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
           GK                             GI   F N  IRG     AT FP      
Sbjct: 122 GKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 181

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L   I +    EA+AL       G T  ++P ++I +DPRWGR  E+ GEDP +
Sbjct: 182 ATWNKKLIGEIAKVTADEAKAL-------GYTNIYSPILDIAQDPRWGRVVESYGEDPYL 234

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
            G+     + G+Q +              A  KHF  Y +      G TR   D  V  +
Sbjct: 235 VGELGKQMILGLQNEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 281

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           ++   Y  PF   +++  A G+M +YN  +G P     + L++  R+QWGF GYI SD +
Sbjct: 282 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYIVSDSE 341

Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
           AV  ++       + E+    V+ AG+++         F+    + A+ + K+    +D+
Sbjct: 342 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAINEGKVSLHTLDQ 400

Query: 356 ALHNLFSVRMRLGLFNGNP----TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
            +  +  V+  +GLF+ NP      +P       VV + AH+ ++++AA + IVLLKN +
Sbjct: 401 RVGEILRVKFMMGLFD-NPYPGDDRRP-----ETVVHNDAHKAVSMKAALESIVLLKNEN 454

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVENTV--YYPGC 468
            +LPL K+ S  +A+IGPN    K L   Y GP+  SI T  Q ++ Y+ N+   Y  GC
Sbjct: 455 QMLPLSKNFS-KIAVIGPNGEEVKELTCRY-GPANASIKTVYQGIKEYLPNSEVRYAKGC 512

Query: 469 DTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           D +                   A I +AV++AK +D  +L++G ++   +EE  R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 572

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
            GRQQ+L+  V  A  KPVILV++ G    I +A  ++ I +I+ A +PGE    A+A+V
Sbjct: 573 CGRQQQLLEAVY-ATGKPVILVMVDGRAATINWA--NKYIPAIIHAWFPGEFMGDAIAKV 629

Query: 574 IFGDHNPG 581
           +FGD+NPG
Sbjct: 630 LFGDYNPG 637


>gi|423293673|ref|ZP_17271800.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
           CL03T12C18]
 gi|392677631|gb|EIY71047.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
           CL03T12C18]
          Length = 800

 Score =  225 bits (573), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 183/608 (30%), Positives = 290/608 (47%), Gaps = 109/608 (17%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGV----AGV 100
           PI  R  DL+S++TL+EK  Q+     +  + +   P   W    W + +  +     G+
Sbjct: 62  PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTTGWSTEIWKDGIGNIDEQANGL 121

Query: 101 GK-----------------------------GI---FFNGTIRG-----ATSFPQVILTA 123
           GK                             GI   F N  IRG     AT FP      
Sbjct: 122 GKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQG 181

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           A+++  L   I +    EA+AL       G T  ++P ++I +DPRWGR  E+ GEDP +
Sbjct: 182 ATWNKKLIGEIAKVTADEAKAL-------GYTNIYSPILDIAQDPRWGRVVESYGEDPYL 234

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTMQ 240
            G+     + G+Q +              A  KHF  Y +      G TR   D  V  +
Sbjct: 235 VGELGKQMILGLQNEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAPR 281

Query: 241 DLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCD 300
           ++   Y  PF   +++  A G+M +YN  +G P     + L++  R+QWGF GYI SD +
Sbjct: 282 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYIVSDSE 341

Query: 301 AVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEIDR 355
           AV  ++       + E+    V+ AG+++         F+    + A+ + K+    +D+
Sbjct: 342 AVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAINEGKVSLHTLDQ 400

Query: 356 ALHNLFSVRMRLGLFNGNP----TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSH 411
            +  +  V+  +GLF+ NP      +P       VV + AH+ ++++AA + IVLLKN +
Sbjct: 401 RVGEILRVKFMMGLFD-NPYPGDDRRP-----ETVVHNDAHKAVSMKAALESIVLLKNEN 454

Query: 412 GLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVENTV--YYPGC 468
            +LPL K+ S  +A+IGPN    K L   Y GP+  SI T  Q ++ Y+ N+   Y  GC
Sbjct: 455 QMLPLSKNFS-KIAVIGPNGEEVKELTCRY-GPANASIKTVYQGIKEYLPNSEVRYAKGC 512

Query: 469 DTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVL 513
           D +                   A I +AV++AK +D  +L++G ++   +EE  R +L L
Sbjct: 513 DIIDKYFPESELNNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 572

Query: 514 PGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEV 573
            GRQQ+L+  V  A  KPVILV++ G    I +A  ++ I +I+ A +PGE    A+A+V
Sbjct: 573 CGRQQQLLEAVY-ATGKPVILVMVDGRAATINWA--NKYIPAIIHAWFPGEFMGDAIAKV 629

Query: 574 IFGDHNPG 581
           +FGD+NPG
Sbjct: 630 LFGDYNPG 637


>gi|423287910|ref|ZP_17266761.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
           CL02T12C04]
 gi|392671925|gb|EIY65396.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
           CL02T12C04]
          Length = 782

 Score =  224 bits (572), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 165/527 (31%), Positives = 261/527 (49%), Gaps = 61/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT FP  I  AA++   L   +GQ I  
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSLELVKEVGQVIAK 176

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R+        G   + P +++ RDPRW R +ET GEDP+++G    S V     D   
Sbjct: 177 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMV-----DGLG 225

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
           GG L  K    A  KHF AY +        Y   A V ++DL   + PPF   +  G A 
Sbjct: 226 GGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDSG-AL 281

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++GIP  ++  LL++  R +W F G++ SD  ++  I+++   A + E+A +
Sbjct: 282 SVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFCGFVVSDLYSIEGIHESHFVALTKENAAI 341

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             + AG+DV+ G     +   AV+  ++ ++ ID A+  +  ++  +GLF  +P + P  
Sbjct: 342 QSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRVLRMKFEMGLFE-HPYVDP-- 398

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           KI A  V    H  LA + AQ  I LLKN + +LPL K+ +  +A+IGPNA++   +LG+
Sbjct: 399 KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSKTIN-KVAVIGPNADNRYNMLGD 457

Query: 441 YAGPSCRS--ITPLQALQNYVE--NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P   S   T L  +   +      Y  GC     +   I++A+  A+ ++ V++++G
Sbjct: 458 YTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDTTVNEIEQAIKAARRSEVVIVVVG 517

Query: 497 LDQTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
               ++                        E  DR  L L GRQQEL+  + +   KP+I
Sbjct: 518 GSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTG-KPLI 576

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P++  +A       ++L A YPG+ G  A+A+V+FGD+NP
Sbjct: 577 VVYIEGRPLEKNWA--SEYADALLTAYYPGQEGGNAIADVLFGDYNP 621


>gi|299144988|ref|ZP_07038056.1| xylosidase [Bacteroides sp. 3_1_23]
 gi|298515479|gb|EFI39360.1| xylosidase [Bacteroides sp. 3_1_23]
          Length = 800

 Score =  224 bits (571), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 183/605 (30%), Positives = 290/605 (47%), Gaps = 103/605 (17%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEALHGV----AG 99
           PI  R  DL+S++TL+EK  Q+            A P  G  A E W + +  +     G
Sbjct: 62  PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTAGWSA-EIWKDGIGNIDEQANG 120

Query: 100 VGK-----------------------------GI---FFNGTIRG-----ATSFPQVILT 122
           +GK                             GI   F N  IRG     AT FP     
Sbjct: 121 LGKFGSEISYPYANSVKNRHTIQRWFMEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQ 180

Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPL 181
            A+++  L   I +    EA+AL       G T  ++P ++I +DPRWGR  E+ GEDP 
Sbjct: 181 GATWNKKLIREIAKVTADEAKAL-------GYTNIYSPILDIAQDPRWGRVVESYGEDPY 233

Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTM 239
           + G+     + G+Q +              A  KHF  Y +      G TR   D  V  
Sbjct: 234 LVGELGKQMILGLQNEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAP 280

Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
           +++   Y  PF   +++  A G+M +YN  +G P     + L++  R+QWGF GY+ SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340

Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEID 354
           +AV  ++       + E+    V+ AG+++         F+    + A+ + K+    +D
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAINEGKVSLHTLD 399

Query: 355 RALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLL 414
           + +  +  V+  +GLF+ NP      +  A VV + AH+ ++++AA + IVLLKN + +L
Sbjct: 400 QRVGEILRVKFMMGLFD-NPYPGDDRRPEA-VVHNDAHKAVSMKAALESIVLLKNENQML 457

Query: 415 PLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVENTV--YYPGCDTV 471
           PL K+ S  +A+IGPN    K L   Y GP+  SI T  Q ++ Y+ N+   Y  GCD +
Sbjct: 458 PLSKNFS-KIAVIGPNGEEVKELTCRY-GPANASIKTVYQGIKEYLPNSEVRYVKGCDII 515

Query: 472 ---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGR 516
                              A I +AV++AK +D  +L++G ++   +EE  R +L L GR
Sbjct: 516 DKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFSRTNLDLCGR 575

Query: 517 QQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFG 576
           QQ+L+  V  A  KPV+LV++ G    I +A  ++ + +I+ A +PGE    A+A+V+FG
Sbjct: 576 QQQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIAKVLFG 632

Query: 577 DHNPG 581
           D+NPG
Sbjct: 633 DYNPG 637


>gi|300786232|ref|YP_003766523.1| beta-glucosidase [Amycolatopsis mediterranei U32]
 gi|384149550|ref|YP_005532366.1| beta-glucosidase [Amycolatopsis mediterranei S699]
 gi|399538115|ref|YP_006550777.1| beta-glucosidase [Amycolatopsis mediterranei S699]
 gi|299795746|gb|ADJ46121.1| beta-glucosidase [Amycolatopsis mediterranei U32]
 gi|340527704|gb|AEK42909.1| beta-glucosidase [Amycolatopsis mediterranei S699]
 gi|398318885|gb|AFO77832.1| beta-glucosidase [Amycolatopsis mediterranei S699]
          Length = 975

 Score =  224 bits (571), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 154/450 (34%), Positives = 237/450 (52%), Gaps = 42/450 (9%)

Query: 45  FCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVA------ 98
           F   +LP++ R  DL+SRLT DEKIS L    PAIPRLGI  ++  +EALHGVA      
Sbjct: 33  FRDPSLPLATRIDDLLSRLTADEKISLLHQYEPAIPRLGIGVFKTGTEALHGVAWSTDYD 92

Query: 99  GVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAI-GMTFW 157
             G  +  +GT+     FPQ I  A++++  L  ++G A+G EAR        + G+  W
Sbjct: 93  NKGAVVKADGTV-----FPQAIGLASTWNPALVKQVGAAVGQEARGFNVRNPTLWGLNLW 147

Query: 158 APNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHF 217
           AP +N+ RDPRWGR +E   EDP +TG++A +Y RG+QGD          LQA+   KHF
Sbjct: 148 APVVNLLRDPRWGRNEEGYSEDPYLTGEFATAYGRGMQGDD------PRYLQAAPTLKHF 201

Query: 218 TAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCAD 277
            AY+ +      R   ++ V  + L D  +  F+  ++ G A+ +M +YN VNG P+   
Sbjct: 202 LAYNNE----VNRDTSNSSVPPKILHDYDEQAFKIPLQNGAANAVMPSYNLVNGRPNHVS 257

Query: 278 RNLLSKTARRQWGFHGY-ITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGMDV-----NC 331
            +L  K   R+W      + SD  A S + ++E Y  +  +     +KAG+D        
Sbjct: 258 PDLDGKL--RKWAPQDIAVVSDAGAPSNLVNSEKYYATKAEGDAAAIKAGLDSFTDNDTD 315

Query: 332 GSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPA 391
           GS      K A+ +  L  ++++ A  +L S+R RLG F+  P   P+ KI   V+ SP 
Sbjct: 316 GSITVAAVKEALSKGLLTMADVENADRHLLSLRFRLGEFD-PPGGNPYAKITPAVIGSPE 374

Query: 392 HQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN-YAGPSCRSIT 450
           HQ LA + A + IV+L+N+   LPL  +++  +A++GP +N   TL  + Y+G     +T
Sbjct: 375 HQALARKTADEQIVMLRNNGNALPLNAARNKKIAVVGPLSN---TLYEDWYSGAMQYKVT 431

Query: 451 PLQALQNYVENTVYYPGCDTVACSSASIDK 480
           P+Q ++  +       G      SS  +D+
Sbjct: 432 PVQGIKERL-------GSAGTVASSEGVDR 454



 Score = 38.5 bits (88), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 52/108 (48%), Gaps = 7/108 (6%)

Query: 475 SASIDKAVDIAKGADHVVLMMG-LDQTQEKEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
           ++ +  AV  AK AD  V+++G +     +E  DR    L   Q+ LI  V +A    V+
Sbjct: 589 TSGVGSAVAAAKDADTAVVVVGSMPFINGREANDRTRTELAPAQRALIEAVQKANPHTVV 648

Query: 534 LVLLCGGPVDITFAKYDR-NIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V             +D  ++  ILW  + G+    A+A+V+FGD +P
Sbjct: 649 VVENS-----YPTTGWDTLSVPGILWTSHAGQETGHAVADVLFGDQDP 691


>gi|160887545|ref|ZP_02068548.1| hypothetical protein BACOVA_05565 [Bacteroides ovatus ATCC 8483]
 gi|156107956|gb|EDO09701.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 736

 Score =  224 bits (571), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 164/527 (31%), Positives = 260/527 (49%), Gaps = 61/527 (11%)

Query: 81  RLGIPAYEWWSEALHGVAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGL 140
           RLGIP +    EA HG   +G           AT FP  I  AA++   L   +GQ I  
Sbjct: 83  RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 130

Query: 141 EARALYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFN 200
           E R+        G   + P +++ RDPRW R +ET GEDP+++G    S V     D   
Sbjct: 131 EIRS------QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMV-----DGLG 179

Query: 201 GGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDLADTYQPPFESCVKQGRAS 260
           GG L  K    A  KHF AY +        Y   A V ++DL   + PPF   +  G A 
Sbjct: 180 GGNLSQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG-AL 235

Query: 261 GIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVV 320
            +M +YN ++G P  ++  LL++  R +W F G++ SD  ++  I+++   A + E+A +
Sbjct: 236 SVMTSYNSIDGTPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESHFVAPTKENAAI 295

Query: 321 DVLKAGMDVNCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFG 380
             + AG+DV+ G     +   AV+  ++ ++ ID A+  +  ++  +GLF  +P + P  
Sbjct: 296 QSVMAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRVLRMKFEMGLFE-HPYVDP-- 352

Query: 381 KIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN 440
           KI A  V    H  LA + AQ  I LLKN + +LPL K+ +  +A+IGPNA++   +LG+
Sbjct: 353 KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLSKTIN-KVAVIGPNADNRYNMLGD 411

Query: 441 YAGPSCRS--ITPLQALQNYVE--NTVYYPGCDTVACSSASIDKAVDIAKGADHVVLMMG 496
           Y  P   S   T L  +   +      Y  GC     +   I++A+  A+ ++ V++++G
Sbjct: 412 YTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDTTVNEIEQAIKAARRSEVVIVVVG 471

Query: 497 LDQTQE-----------------------KEELDRVDLVLPGRQQELITRVAEAAKKPVI 533
               ++                        E  DR  L L GRQQEL+  + +   KP+I
Sbjct: 472 GSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESLQKTG-KPLI 530

Query: 534 LVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +V + G P++  +A       ++L A YPG+ G  A+A+V+FGD+NP
Sbjct: 531 VVYIEGRPLEKNWA--SEYADALLTAYYPGQEGGNAIADVLFGDYNP 575


>gi|443623426|ref|ZP_21107926.1| putative Glycoside hydrolase family 3 domain protein [Streptomyces
           viridochromogenes Tue57]
 gi|443343044|gb|ELS57186.1| putative Glycoside hydrolase family 3 domain protein [Streptomyces
           viridochromogenes Tue57]
          Length = 980

 Score =  224 bits (571), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 164/460 (35%), Positives = 234/460 (50%), Gaps = 48/460 (10%)

Query: 37  NPSTETFPFCKTTLPISQRARDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHG 96
            P T   PF    LP+ +R  DL+SRLTLDE+I  L   +P +PRLG+  ++  +E LHG
Sbjct: 9   EPPTTDLPFRDPALPLGRRVDDLISRLTLDERIELLYQYSPGVPRLGLAPFKTGTEGLHG 68

Query: 97  VAGVGKGIFFNGTIRGATSFPQVILTAASFDSYLWYRIGQAIGLEARALYN----AGQAI 152
           V+ +G           AT FPQ +   A++D  L + +  A+G E RA +      G   
Sbjct: 69  VSWLGP----------ATVFPQAVGLGATWDEELLHEVATAVGTETRAFHQRPAAGGHRH 118

Query: 153 GMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASA 212
            +  WAP +N+ RDPRWGR +E   EDPL+TG+ +VS+ RG+ GD          L+ + 
Sbjct: 119 SLQVWAPVVNLLRDPRWGRNEEGYAEDPLLTGRLSVSFCRGLAGDH------PAYLRTAP 172

Query: 213 CCKHFTAYDLDNWKGTTRYKFDARVTMQ-DLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
             KHF AY+ ++ +  T      RV  + DLA      F   +  G A+G+M AYN VNG
Sbjct: 173 LLKHFLAYNNEDERDITSSVVPPRVLHEYDLA-----AFRPAIASGAATGVMPAYNLVNG 227

Query: 272 IPSCADRNLLSKTARRQW----GFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAGM 327
            P C    LL    RR W    G   ++ SD  AV+ + ++E Y      +    ++AG+
Sbjct: 228 RP-CHVSPLLETEVRR-WAEPTGHELFVCSDAGAVTNLAESEHYFDDHPTSHAAAVRAGV 285

Query: 328 DV-----NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKI 382
           D        GS      + AV +  L E +IDRA     S+R RLG F  +P + P+  I
Sbjct: 286 DSITELDEDGSITLGRLRTAVDRGLLTEDDIDRAARRHLSIRFRLGEF--DPDLDPYAGI 343

Query: 383 GADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGN-Y 441
             DVV SPAH+ LAL+AA + +VLLKN  G LPL  +    +AL+GP A+   TL  + Y
Sbjct: 344 RDDVVDSPAHRALALRAATESVVLLKND-GALPLSPAPGGRVALVGPLAD---TLFEDWY 399

Query: 442 AGPSCRSITPLQALQNYVE----NTVYYPGCDTVACSSAS 477
           +G     IT    L+  ++      V   G D +   +AS
Sbjct: 400 SGTMPYRITIAAGLEAALKERGAEAVRAEGVDRITLRAAS 439


>gi|293373755|ref|ZP_06620101.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292631245|gb|EFF49877.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 800

 Score =  224 bits (570), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 183/609 (30%), Positives = 290/609 (47%), Gaps = 111/609 (18%)

Query: 51  PISQRARDLVSRLTLDEKISQLVNS-------APAIPRLGIPAYEWWSEALHGV----AG 99
           PI  R  DL+S++TL+EK  Q+            A P  G  A E W + +  +     G
Sbjct: 62  PIEARIADLLSQMTLEEKTCQMATLYGSGRVLKDAWPTAGWSA-EIWKDGIGNIDEQANG 120

Query: 100 VGK-----------------------------GI---FFNGTIRG-----ATSFPQVILT 122
           +GK                             GI   F N  IRG     AT FP     
Sbjct: 121 LGKFGSEISYPYANSVKNRHTIQRWFMEQTRLGIPVDFTNEGIRGLCHDRATMFPAQCGQ 180

Query: 123 AASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPL 181
            A+++  L   I +    EA+AL       G T  ++P ++I +DPRWGR  E+ GEDP 
Sbjct: 181 GATWNKKLIREIAKVTADEAKAL-------GYTNIYSPILDIAQDPRWGRVVESYGEDPY 233

Query: 182 VTGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDL--DNWKGTTRYKFDARVTM 239
           + G+     + G+Q +              A  KHF  Y +      G TR   D  V  
Sbjct: 234 LVGELGKQMILGLQNEGI-----------VATPKHFAVYSIPVGGRDGGTRT--DPHVAP 280

Query: 240 QDLADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDC 299
           +++   Y  PF   +++  A G+M +YN  +G P     + L++  R+QWGF GY+ SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340

Query: 300 DAVSIIYDAEGYAKSPEDAVVDVLKAGMDVNCG-----SFLQKHTKAAVKQKKLPESEID 354
           +AV  ++       + E+    V+ AG+++         F+    + A+ + K+    +D
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILP-LRRAINEGKVSLHTLD 399

Query: 355 RALHNLFSVRMRLGLFNGNP----TMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNS 410
           + +  +  V+  +GLF+ NP      +P       VV + AH+ ++++AA + IVLLKN 
Sbjct: 400 QRVGEILRVKFMMGLFD-NPYPGDDRRP-----ETVVHNDAHKAVSMKAALESIVLLKNE 453

Query: 411 HGLLPLPKSKSVSLALIGPNANSAKTLLGNYAGPSCRSI-TPLQALQNYVENTV--YYPG 467
           + +LPL K+ S  +A+IGPN    K L   Y GP+  SI T  Q ++ Y+ N+   Y  G
Sbjct: 454 NQMLPLSKNFS-KIAVIGPNGEEVKELTCRY-GPANASIKTVYQGIKEYLPNSEVRYVKG 511

Query: 468 CDTV---------------ACSSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLV 512
           CD +                   A I +AV++AK +D  +L++G ++   +EE  R +L 
Sbjct: 512 CDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFSRTNLD 571

Query: 513 LPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAE 572
           L GRQQ+L+  V  A  KPV+LV++ G    I +A  ++ + +I+ A +PGE    A+A+
Sbjct: 572 LCGRQQQLLEAVY-ATGKPVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIAK 628

Query: 573 VIFGDHNPG 581
           V+FGD+NPG
Sbjct: 629 VLFGDYNPG 637


>gi|224536087|ref|ZP_03676626.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522306|gb|EEF91411.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 791

 Score =  224 bits (570), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 171/601 (28%), Positives = 277/601 (46%), Gaps = 95/601 (15%)

Query: 51  PISQRARDLVSRLTLDEKISQLVN--SAPAIPRLGIPAYEW----WSEALHGVAGVGKGI 104
           P+ +R  DL+S++TL+EKI Q+     +  +    +P   W    W + +  +     G+
Sbjct: 53  PMEERVNDLLSQMTLEEKICQMATLYGSGRVLEDALPEEHWKQALWKDGIGNIDEEHNGL 112

Query: 105 ------------------------------------FFNGTIRG-----ATSFPQVILTA 123
                                               F N  IRG     AT FP      
Sbjct: 113 GTFGSEYSFPYNKHVKAKHEIQRWFVEETRLGIPVDFTNEGIRGLCHDRATFFPSQSGQG 172

Query: 124 ASFDSYLWYRIGQAIGLEARALYNAGQAIGMT-FWAPNINIFRDPRWGRGQETPGEDPLV 182
           ++++  L  RIG+    EA AL       G T  ++P ++I +DPRWGR  E  GEDP +
Sbjct: 173 STWNKELIARIGEVEAKEAIAL-------GYTNIYSPILDICQDPRWGRSVECYGEDPYL 225

Query: 183 TGKYAVSYVRGVQGDTFNGGKLKGKLQASACCKHFTAYDLDNWKGTTRYKFDARVTMQDL 242
            G+     ++ +Q           K +  +  KHF  Y +       + + D  V+ +++
Sbjct: 226 VGQLGKQMIQSLQ-----------KHRLVSTVKHFAVYSIPVGGRDGKTRTDPHVSPREM 274

Query: 243 ADTYQPPFESCVKQGRASGIMCAYNRVNGIPSCADRNLLSKTARRQWGFHGYITSDCDAV 302
              Y  PF     +  A G+M +YN  +G P  +  + L++  R+++GF GY+ SD +AV
Sbjct: 275 RTLYLEPFRRAFCEAGALGVMSSYNDYDGEPITSSHHFLTEILRQEYGFKGYVVSDSEAV 334

Query: 303 SIIYDAEGYAKSPEDAVVDVLKAGMDVNCGSFLQKHT-----KAAVKQKKLPESEIDRAL 357
             I        +  + V   + AG+++    F +        + A+K+ K+    I+  +
Sbjct: 335 EFITTKHHVVSNEVEGVAQAVNAGLNIRT-HFTKPEDFVLPLRQAIKEGKVSPETINSRV 393

Query: 358 HNLFSVRMRLGLFNGNPTMQPFGKIGADVVCSPAHQVLALQAAQDGIVLLKNSHGLLPLP 417
            ++  ++  LGLF+ NP  +   K    +V    HQ +AL+AA+  +VLLKN + LLPL 
Sbjct: 394 ADILRIKFWLGLFD-NP-YRGDEKQEEKIVHCKEHQQVALEAARQSLVLLKNENQLLPLK 451

Query: 418 KSKSVSLALIGPNANSAKTLLGNYAGPSCRSITPLQALQNYVENT--VYYPGCDTVAC-- 473
           K+   S+A+IGPNAN    L+  Y   +    T  Q ++  +  T  VY  GC+ +    
Sbjct: 452 KTVK-SVAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPETEVVYRKGCEIIDSHF 510

Query: 474 -------------SSASIDKAVDIAKGADHVVLMMGLDQTQEKEELDRVDLVLPGRQQEL 520
                            +D+AV  A+ A+ VVL++G  +   +E+  R  L LPG QQEL
Sbjct: 511 PESEILPFEKTTEEQQMLDEAVAAARNAEVVVLVLGGSELTVREDRSRTSLDLPGHQQEL 570

Query: 521 ITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPGEAGAVALAEVIFGDHNP 580
           +  +  A  KP +LVLL G    I +A  ++ I +IL A +PGE    A+AE +FGD+NP
Sbjct: 571 MQAI-HATGKPTVLVLLDGRAATINYA--NQYIPAILHAWFPGEFAGTAVAEALFGDYNP 627

Query: 581 G 581
           G
Sbjct: 628 G 628


>gi|429745624|ref|ZP_19279029.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           380 str. F0488]
 gi|429168470|gb|EKY10301.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           380 str. F0488]
          Length = 770

 Score =  224 bits (570), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 188/623 (30%), Positives = 296/623 (47%), Gaps = 82/623 (13%)

Query: 7   SLVFPLLCLCFTSLLTRVDS---TQPPFSCDPSNPSTETFPFCKTTLPISQRARDLVSRL 63
           +L+  L+  C T  + + D+   TQ  FS D                 I QR   ++  +
Sbjct: 10  TLISLLMIGCATPAILKSDNEKFTQKSFSGDKH---------------IEQRVDSVLRLM 54

Query: 64  TLDEKISQLVN-------SAPAIPRLGIPAYEWWSEALHG-------VAGVGK------- 102
           TL+EKI Q+         + P +     P  E   + L G       VAG+ K       
Sbjct: 55  TLEEKIGQMTQFSADWSVTGPVMADKYQPYLE---KGLVGSIFNATSVAGIRKLQKIAVE 111

Query: 103 ----GI---FFNGTIRG-ATSFPQVILTAASFDSYLWYRIGQAIGLEARALYNAGQAIGM 154
               GI   F    I G  T FP  +  + S+D  L  +  +    EA A       I  
Sbjct: 112 QTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREASA-----DGINW 166

Query: 155 TFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDTFNGGKLKGKLQASACC 214
           TF AP ++I RD RWGR  E  GEDP +    A + V+G QG   N   L       AC 
Sbjct: 167 TF-APMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQGGD-NWQMLSSPHTLLACG 224

Query: 215 KHFTAYDLDNWKGTTRYKFD---ARVTMQDLADTYQPPFESCVKQGRASGIMCAYNRVNG 271
           KHF  Y      G      D   A ++M  L + Y PP+E+ +  G  S IM + N +NG
Sbjct: 225 KHFAGY------GAAESGKDYNTAELSMHTLRNVYLPPYEATLNAGVGS-IMASLNEING 277

Query: 272 IPSCADRNLLSKTARRQWGFHGYITSDCDAVSIIYDAEGYAKSPEDAVVDVLKAG--MDV 329
           +P+ AD+ LL++  R++WGF+G + SD   ++ +    G AK  + A      AG  MD+
Sbjct: 278 VPATADKWLLTEVLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQAANLSANAGIEMDM 336

Query: 330 NCGSFLQKHTKAAVKQKKLPESEIDRALHNLFSVRMRLGLFNGNPTMQPFGKIGADVVCS 389
           N  +F+ K+  A VK+ K+ E++ID+A+ ++  ++  LGLF+ +P          +   +
Sbjct: 337 NGATFI-KYLSALVKEGKVTEAQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFT 394

Query: 390 PAHQVLALQAAQDGIVLLKNSHGLLPLPKSKSVSLALIGPNANSAKTLLGNYA--GPSCR 447
             +  +A QA    +VLLKN   +LP+ K    ++A+IGP  N+   + G++   G   +
Sbjct: 395 EEYLKVARQAVASSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCLGDGKQ 454

Query: 448 SITPLQALQNYVENT----VYYPGCDTVACSSASIDKAVDIAKGADHVVLMMGLDQTQEK 503
           S++ L  L    + T    +Y  GC     S+  + +AV IA+ AD V++ +G   +   
Sbjct: 455 SVSLLTGLTEKYKGTNVKLLYAEGCGFTTISTEQLKEAVAIARKADRVLVAVGEQSSWAG 514

Query: 504 EELDRVDLVLPGRQQELITRVAEAAKKPVILVLLCGGPVDITFAKYDRNIGSILWAGYPG 563
           E   R D+ LP  Q++L+  + +A  KP+ +V   G P+D+++   + N+ +IL A +PG
Sbjct: 515 ESAVRTDIRLPQAQRQLLEAL-KAINKPIAIVTFSGRPLDLSWE--NENVQAILQAWFPG 571

Query: 564 EAGAVALAEVIFGDHNP-GEFTV 585
             G   +A+VI GD NP G  T+
Sbjct: 572 TQGGNGIADVIAGDVNPSGHLTM 594


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.136    0.418 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,460,816,974
Number of Sequences: 23463169
Number of extensions: 403839698
Number of successful extensions: 1008527
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5479
Number of HSP's successfully gapped in prelim test: 2102
Number of HSP's that attempted gapping in prelim test: 967214
Number of HSP's gapped (non-prelim): 13114
length of query: 591
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 443
effective length of database: 8,886,646,355
effective search space: 3936784335265
effective search space used: 3936784335265
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 80 (35.4 bits)