BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 003606
         (808 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225469218|ref|XP_002264031.1| PREDICTED: probable beta-D-xylosidase 6-like [Vitis vinifera]
          Length = 789

 Score = 1197 bits (3097), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 569/797 (71%), Positives = 670/797 (84%), Gaps = 17/797 (2%)

Query: 13  RILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLL 72
           R LF  L  + V+ + + S S      P FPC PP    YPFCNTSL ISTRA+SL+SLL
Sbjct: 6   RFLFICL-FLQVLPLFSISES----THPQFPCMPPTNSDYPFCNTSLPISTRAQSLVSLL 60

Query: 73  TLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
           TL EKIQQLSD A+AIPRL IPAYEWWSESLHGIA+NGPGV+FNGTVS+ TSFPQVL++A
Sbjct: 61  TLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLLTA 120

Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVS 192
           ASFNRSLW +IGSA+AVEARAMYN+GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMV S
Sbjct: 121 ASFNRSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVAS 180

Query: 193 AYAVEFVKSFQGENWKSDD---GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAY 249
           AYAVEFV+ FQG NWK  D   G +G    +KRVL+G   +SD GD LMLSACCKHL AY
Sbjct: 181 AYAVEFVRGFQGGNWKGGDEIRGAVG----KKRVLRG---DSD-GDGLMLSACCKHLTAY 232

Query: 250 DLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDL 309
           DLEKWGNFSRYSF+A+++ QD EDT+QPPFRSC++QGKASC+MCSYN+VNGVPAC R DL
Sbjct: 233 DLEKWGNFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDL 292

Query: 310 FQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
           FQKA+ EWGFKGYITSDCDAVAT++EYQ+Y  + ED+ A VLKAG DINCG+ MLRHTQS
Sbjct: 293 FQKAKTEWGFKGYITSDCDAVATVYEYQHYANSPEDAVADVLKAGTDINCGSYMLRHTQS 352

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
           AID+GKV+E+DIDRAL NLFSVQ+RLGLF+GDP  G YG LGP DVCT EH+ LAL+AAR
Sbjct: 353 AIDQGKVKEEDIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAAR 412

Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK 489
           QGIVLLKNDKKFLPL+K+ +SSLAIIGP  +    +GGGYTGIPC P+SL+ GL+ YV K
Sbjct: 413 QGIVLLKNDKKFLPLDKSRISSLAIIGPQADQ-PFLGGGYTGIPCKPESLVEGLKTYVEK 471

Query: 490 THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSL 549
           T +A+GC DVPC SD GF EAV IA+KAD V+VVAGLDL+QETED DRVSLLLPG+QM+L
Sbjct: 472 TSFAAGCVDVPCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMAL 531

Query: 550 VTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
           ++SVA   ++P++LVLTGGGPLDVSFAE D +I+SILWIGYPGEAGAKALAEIIFGDFNP
Sbjct: 532 ISSVASAIQKPLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNP 591

Query: 610 GGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF 669
           GGRLPMTWYPESFT+VPMNDMNMRAD  R YPGR+YRFY G +VYGFG GLSYT ++Y+F
Sbjct: 592 GGRLPMTWYPESFTRVPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQF 651

Query: 670 LSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGS 729
           +SAP++L +  S    S KN+ +Q    ++Y HI+E+ +C SLRFHV+ISVTN GD+DGS
Sbjct: 652 VSAPNKLNLLRSSDTVSSKNLPRQRREEVNYFHIEELDTCDSLRFHVEISVTNVGDMDGS 711

Query: 730 HVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL 789
           HVVMLF+RVPK+ +GTPEKQLIGF RVHTV++ S E S  VDPCE  SIAN+ G+RI+PL
Sbjct: 712 HVVMLFSRVPKIVKGTPEKQLIGFSRVHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPL 771

Query: 790 GNHVLMVGELRHSLTIE 806
           G+H +M+G++ HS+++E
Sbjct: 772 GDHTIMLGDVVHSVSVE 788


>gi|296084630|emb|CBI25718.3| unnamed protein product [Vitis vinifera]
          Length = 768

 Score = 1176 bits (3041), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 556/794 (70%), Positives = 655/794 (82%), Gaps = 32/794 (4%)

Query: 13  RILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLL 72
           R LF  L  + V+ + + S S      P FPC PP    YPFCNTSL ISTRA+SL+SLL
Sbjct: 6   RFLFICL-FLQVLPLFSISES----THPQFPCMPPTNSDYPFCNTSLPISTRAQSLVSLL 60

Query: 73  TLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
           TL EKIQQLSD A+AIPRL IPAYEWWSESLHGIA+NGPGV+FNGTVS+ TSFPQVL++A
Sbjct: 61  TLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQVLLTA 120

Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVS 192
           ASFNRSLW +IGSA+AVEARAMYN+GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMV S
Sbjct: 121 ASFNRSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVAS 180

Query: 193 AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLE 252
           AYAVEFV+ FQG++                           GD LMLSACCKHL AYDLE
Sbjct: 181 AYAVEFVRGFQGDS--------------------------DGDGLMLSACCKHLTAYDLE 214

Query: 253 KWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
           KWGNFSRYSF+A+++ QD EDT+QPPFRSC++QGKASC+MCSYN+VNGVPAC R DLFQK
Sbjct: 215 KWGNFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDLFQK 274

Query: 313 ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAID 372
           A+ EWGFKGYITSDCDAVAT++EYQ+Y  + ED+ A VLKAG DINCG+ MLRHTQSAID
Sbjct: 275 AKTEWGFKGYITSDCDAVATVYEYQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAID 334

Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
           +GKV+E+DIDRAL NLFSVQ+RLGLF+GDP  G YG LGP DVCT EH+ LAL+AARQGI
Sbjct: 335 QGKVKEEDIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGI 394

Query: 433 VLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHY 492
           VLLKNDKKFLPL+K+ +SSLAIIGP  +    +GGGYTGIPC P+SL+ GL+ YV KT +
Sbjct: 395 VLLKNDKKFLPLDKSRISSLAIIGPQADQ-PFLGGGYTGIPCKPESLVEGLKTYVEKTSF 453

Query: 493 ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTS 552
           A+GC DVPC SD GF EAV IA+KAD V+VVAGLDL+QETED DRVSLLLPG+QM+L++S
Sbjct: 454 AAGCVDVPCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISS 513

Query: 553 VARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGR 612
           VA   ++P++LVLTGGGPLDVSFAE D +I+SILWIGYPGEAGAKALAEIIFGDFNPGGR
Sbjct: 514 VASAIQKPLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPGGR 573

Query: 613 LPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
           LPMTWYPESFT+VPMNDMNMRAD  R YPGR+YRFY G +VYGFG GLSYT ++Y+F+SA
Sbjct: 574 LPMTWYPESFTRVPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQFVSA 633

Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
           P++L +  S    S KN+ +Q    ++Y HI+E+ +C SLRFHV+ISVTN GD+DGSHVV
Sbjct: 634 PNKLNLLRSSDTVSSKNLPRQRREEVNYFHIEELDTCDSLRFHVEISVTNVGDMDGSHVV 693

Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
           MLF+RVPK+ +GTPEKQLIGF RVHTV++ S E S  VDPCE  SIAN+ G+RI+PLG+H
Sbjct: 694 MLFSRVPKIVKGTPEKQLIGFSRVHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPLGDH 753

Query: 793 VLMVGELRHSLTIE 806
            +M+G++ HS+++E
Sbjct: 754 TIMLGDVVHSVSVE 767


>gi|224058158|ref|XP_002299457.1| predicted protein [Populus trichocarpa]
 gi|222846715|gb|EEE84262.1| predicted protein [Populus trichocarpa]
          Length = 780

 Score = 1155 bits (2989), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 564/780 (72%), Positives = 654/780 (83%), Gaps = 21/780 (2%)

Query: 31  SNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPR 90
           SNSK V N P FPCKPP  ++Y FCN SL I+ RA+SLIS LTLQEKIQQLSDNAS IPR
Sbjct: 20  SNSKSVAN-PQFPCKPPTHNTYSFCNKSLPITRRAQSLISHLTLQEKIQQLSDNASGIPR 78

Query: 91  LGIPAYEWWSESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVA 148
           LGIP YEWWSESLHGI+ NGPGV+F   G V+S T FPQV+VSAASFNR+LW  IGSA+A
Sbjct: 79  LGIPHYEWWSESLHGISINGPGVSFKNGGPVTSATGFPQVIVSAASFNRTLWFLIGSAIA 138

Query: 149 VEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           +EARAMYN+GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMV SAYA+EFVK FQG +WK
Sbjct: 139 IEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAIEFVKGFQGGHWK 198

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
           ++DG I                    D+LMLSACCKH  AYDLEKWGNFSRYSFNA++TE
Sbjct: 199 NEDGEI------------------NDDKLMLSACCKHSTAYDLEKWGNFSRYSFNAVVTE 240

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCD 328
           QD EDT+QPPFRSCI++GKASC+MCSYN+VNGVPAC R DL QK R EWGFKGYITSDCD
Sbjct: 241 QDMEDTYQPPFRSCIQKGKASCLMCSYNEVNGVPACAREDLLQKPRTEWGFKGYITSDCD 300

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           AVATIFEYQNY+K+ ED+ A  LKAGMDINCGT +LR+ QSA++KGK+QE+DIDRAL NL
Sbjct: 301 AVATIFEYQNYSKSPEDAVAIALKAGMDINCGTYVLRNAQSAVEKGKLQEEDIDRALHNL 360

Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
           FSVQLRLGLF+GDPRKG++GKLGP +VCT EHK LAL+AARQGIVLLKNDKK LPLNK A
Sbjct: 361 FSVQLRLGLFDGDPRKGQFGKLGPKNVCTKEHKTLALEAARQGIVLLKNDKKLLPLNKKA 420

Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFH 508
           VSSLAIIGPL N  + +GG YTG PC P+SL  GL+AYV KT YA GC DV C SD  FH
Sbjct: 421 VSSLAIIGPLANMANSLGGDYTGYPCDPQSLFEGLKAYVKKTSYAIGCLDVACVSDTQFH 480

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
           +A+ +AK+ADFVI+VAGLDL+QETE+ DRVSLLLPG+QMSLV+SVA  SK+PVILVLTGG
Sbjct: 481 KAIIVAKRADFVIIVAGLDLSQETEEHDRVSLLLPGKQMSLVSSVAAASKKPVILVLTGG 540

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMN 628
           GPLDVSFA+ D +I+SILWIGYPGEAGAKALAEIIFG++NPGGRLPMTWYPESFT+V M 
Sbjct: 541 GPLDVSFAKGDPRIASILWIGYPGEAGAKALAEIIFGEYNPGGRLPMTWYPESFTEVSMT 600

Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
           DMNMR + SR YPGR+YRFYTG +VYGFG GLSYTN++YK LSAPS+L++S SL + S K
Sbjct: 601 DMNMRPNPSRGYPGRTYRFYTGNRVYGFGGGLSYTNFTYKILSAPSKLSLSGSLSSNSRK 660

Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
            ILQQ G RL Y++I+E+TSC SLRF++QI V N G++DG HVVMLF+RVP V +G PEK
Sbjct: 661 RILQQGGERLSYININEITSCDSLRFYMQILVENVGNMDGGHVVMLFSRVPTVFRGAPEK 720

Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
           QL+GFDRVHT++  S E+S  VDPCE LS+AN+ G++I+ LG H LM+G+L H +TI+ Y
Sbjct: 721 QLVGFDRVHTISHRSTEMSILVDPCEHLSVANEQGKKIMLLGGHGLMLGDLEHFVTIQIY 780


>gi|357485313|ref|XP_003612944.1| Beta-D-xylosidase [Medicago truncatula]
 gi|355514279|gb|AES95902.1| Beta-D-xylosidase [Medicago truncatula]
          Length = 783

 Score = 1107 bits (2863), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 528/807 (65%), Positives = 634/807 (78%), Gaps = 27/807 (3%)

Query: 5   FMSIQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTR 64
            M +Q R  I+  +  L+++       +       PD+PCKPPH   YPFCN SL ISTR
Sbjct: 1   MMFLQQRSTIIIFLFSLLLI-------HLPKFFTTPDYPCKPPH-SHYPFCNISLPISTR 52

Query: 65  AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
             SLISLLTL +KI QLS+ AS+I  LGIP+Y+WWSE+LHGIA+NGPGVNFNG+V S T+
Sbjct: 53  TTSLISLLTLSDKINQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATN 112

Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETP 184
           FPQV+VSAA+FNRSLW  IG AV VE RAM+N+GQAGL+FWAPN+N+FRDPRWGRGQETP
Sbjct: 113 FPQVIVSAAAFNRSLWFLIGYAVGVEGRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETP 172

Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
           GEDPMV SAYAVEFV+  QG +      GI      K+VL    +     D LM+SACCK
Sbjct: 173 GEDPMVGSAYAVEFVRGIQGVD------GI------KKVLN---DHDSDDDGLMVSACCK 217

Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
           H  AYDLEKWG FSRY+FNA++T+QD EDT+QPPFR C++QGKASC+MCSYN+VNGVPAC
Sbjct: 218 HFTAYDLEKWGEFSRYNFNAVVTQQDLEDTYQPPFRGCVQQGKASCLMCSYNEVNGVPAC 277

Query: 305 LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML 364
              DL    RN+WGF+GYI SDCDAVAT+FEYQ Y K+ ED+ A VLKAGMDINCGT ML
Sbjct: 278 ASKDLLGLVRNKWGFEGYIASDCDAVATVFEYQKYAKSAEDAVADVLKAGMDINCGTFML 337

Query: 365 RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLA 424
           RHT+SAI++G V+E+D+DRAL NLFSVQ+RLGLFNGDP KGK+GKLGP DVCT EHKKLA
Sbjct: 338 RHTESAIEQGLVKEEDLDRALFNLFSVQMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLA 397

Query: 425 LDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLE 484
           L+AARQGIVLLKND KFLPL+K    SLAIIGP+    S++GGGY+GIPCSP+SL  GL+
Sbjct: 398 LEAARQGIVLLKNDNKFLPLDKKDRVSLAIIGPMATT-SELGGGYSGIPCSPRSLYDGLK 456

Query: 485 AYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
            YV    YA GC DV C+SD GF  A+ IAK+ADFV++VAGLD T ETED DRVSLLLPG
Sbjct: 457 EYVKTISYAFGCSDVKCDSDDGFAVAIDIAKQADFVVIVAGLDTTLETEDLDRVSLLLPG 516

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +QM LV+ VA  SKRPVILVLTGGGPLDVSFAE++  I+SILWIGYPGEAG KALAEIIF
Sbjct: 517 KQMDLVSRVAAASKRPVILVLTGGGPLDVSFAESNQLITSILWIGYPGEAGGKALAEIIF 576

Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTN 664
           G+FNP GRLPMTWYPESFT VPMNDM MRAD SR YPGR+YRFYTG+++YGFGHGLSY++
Sbjct: 577 GEFNPAGRLPMTWYPESFTNVPMNDMGMRADPSRGYPGRTYRFYTGSRIYGFGHGLSYSD 636

Query: 665 YSYKFLSAPSELTISASLKAGSDKNILQQTGS---RLDYVHIDEVTSCTSLRFHVQISVT 721
           +SY+ LSAPS+L++S +   G  +++L +       +D+VH+DE+ +C SL F V ISV 
Sbjct: 637 FSYRVLSAPSKLSLSKTTNGGLRRSLLNKVEKDVFEVDHVHVDELQNCNSLSFSVHISVM 696

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N GD+DGSHVVMLF++ PK  QG+PE QL+G  R+HTV+  S E S   DPCE  S A++
Sbjct: 697 NVGDMDGSHVVMLFSKWPKNIQGSPESQLVGPSRLHTVSNKSIETSILADPCEHFSFADE 756

Query: 782 HGRRILPLGNHVLMVGELRHSLTIETY 808
            G+RILPLGNH+L VG++ H ++IE Y
Sbjct: 757 QGKRILPLGNHILNVGDVEHIVSIEIY 783


>gi|297811163|ref|XP_002873465.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319302|gb|EFH49724.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 796

 Score = 1106 bits (2860), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 531/803 (66%), Positives = 636/803 (79%), Gaps = 28/803 (3%)

Query: 18  VLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEK 77
           ++ L+   + IA    K + + P FPCKPPHF SYPFCN SLSI  RA SL+SLLTL EK
Sbjct: 7   LISLVFFTSAIA-ETFKNLDSHPQFPCKPPHFSSYPFCNVSLSIKQRAISLVSLLTLPEK 65

Query: 78  IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNR 137
           I QLS  A+++PRLGIP YEWWSESLHG+A NGPGV+FNG++S+ TSFPQV+VSAASFNR
Sbjct: 66  IGQLSTTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAASFNR 125

Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
           +LW  IGSAVAVEARAMYN GQAGLTFWAPNIN+FRDPRWGRGQETPGEDP VVS Y VE
Sbjct: 126 TLWYEIGSAVAVEARAMYNGGQAGLTFWAPNINLFRDPRWGRGQETPGEDPKVVSEYGVE 185

Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKG-FGEES---------DRGDELMLSACCKHLI 247
           FV+ FQ +             ++++VLK  FG ++         D   +LMLSACCKH  
Sbjct: 186 FVRGFQEK-------------KKRKVLKTRFGSDNVDDDARYDDDADGKLMLSACCKHFT 232

Query: 248 AYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRG 307
           AYDLEKWGNF+RY FNA++TEQD EDT+QPPF +CI+ GKASC+MCSYN VNGVPAC +G
Sbjct: 233 AYDLEKWGNFTRYDFNAVVTEQDMEDTYQPPFETCIKDGKASCLMCSYNAVNGVPACAQG 292

Query: 308 DLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHT 367
           DL QKAR EWGF GYITSDCDAVATIFEYQ YTK+ E++ A  +KAG+DINCGT MLR+T
Sbjct: 293 DLLQKARVEWGFDGYITSDCDAVATIFEYQGYTKSPEEAVADAIKAGVDINCGTYMLRNT 352

Query: 368 QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDA 427
           QSAI++GKV E+ +DRALLNLF+VQLRLGLF+GDPR G YGKLG +D+C+S+H+KLAL+A
Sbjct: 353 QSAIEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRGGHYGKLGSNDICSSDHRKLALEA 412

Query: 428 ARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV 487
           ARQGIVLLKND K LPLNKN VSSLAI+GP+ NNIS MGG YTG PC  K+L   L  YV
Sbjct: 413 ARQGIVLLKNDYKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYV 472

Query: 488 SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQM 547
            KT YASGC DV C SD GF EAV IAK ADFVIVVAGLDL+QETED+DR SL LPG+Q 
Sbjct: 473 KKTSYASGCSDVSCVSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRFSLSLPGKQK 532

Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
            LV+SVA  SK+PVILVLTGGGP+DV+FA+ D +I SI+WIGYPGE G +ALAEIIFGDF
Sbjct: 533 DLVSSVAAVSKKPVILVLTGGGPVDVTFAKTDPRIGSIIWIGYPGETGGQALAEIIFGDF 592

Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
           NPGGRLP+TWYPESF  VPM+DM+MRADSSR YPGR+YRFYTG QVY FG GLSYT + Y
Sbjct: 593 NPGGRLPITWYPESFADVPMSDMHMRADSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFDY 652

Query: 668 KFLSAPSELTISASL--KAGSDKNILQQTGSRLDYVHIDE--VTSCTSLRFHVQISVTNA 723
           K +SAP  L++S  L  ++   K +LQ    +L Y+ +D+  V SC SLRF+V+++V N 
Sbjct: 653 KIISAPIRLSLSELLPQQSSHKKQLLQHGEEQLQYIQLDDVMVNSCESLRFNVRVNVRNT 712

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G++DGSHV+MLF+++ +V  G PEKQLIGFDRVH  +    E  F +DPC+ LS+AN  G
Sbjct: 713 GEIDGSHVLMLFSKMARVLSGVPEKQLIGFDRVHIRSNEMMETVFVIDPCKYLSVANDVG 772

Query: 784 RRILPLGNHVLMVGELRHSLTIE 806
           +R++PLG H L +G+L+HSL++E
Sbjct: 773 KRVIPLGIHALFLGDLQHSLSVE 795


>gi|15238197|ref|NP_196618.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
 gi|75264319|sp|Q9LXA8.1|BXL6_ARATH RecName: Full=Probable beta-D-xylosidase 6; Short=AtBXL6; Flags:
           Precursor
 gi|7671447|emb|CAB89387.1| beta-xylosidase-like protein [Arabidopsis thaliana]
 gi|15982753|gb|AAL09717.1| AT5g10560/F12B17_90 [Arabidopsis thaliana]
 gi|332004180|gb|AED91563.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
          Length = 792

 Score = 1104 bits (2856), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 532/803 (66%), Positives = 631/803 (78%), Gaps = 14/803 (1%)

Query: 6   MSIQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRA 65
           M++QL       ++ L+   + IA    K + + P FPCKPPHF SYPFCN SLSI  RA
Sbjct: 1   MNLQLT------LISLLFFTSAIA-ETFKNLDSHPQFPCKPPHFSSYPFCNVSLSIKQRA 53

Query: 66  KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSF 125
            SL+SLL L EKI QLS+ A+++PRLGIP YEWWSESLHG+A NGPGV+FNG++S+ TSF
Sbjct: 54  ISLVSLLMLPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSF 113

Query: 126 PQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPG 185
           PQV+VSAASFNR+LW  IGSAVAVE RAMYN GQAGLTFWAPNIN+FRDPRWGRGQETPG
Sbjct: 114 PQVIVSAASFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPG 173

Query: 186 EDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKH 245
           EDP VVS Y VEFV+ FQ E  K       F             + D   +LMLSACCKH
Sbjct: 174 EDPKVVSEYGVEFVRGFQ-EKKKRKVLKRRFSDDVDDDRH----DDDADGKLMLSACCKH 228

Query: 246 LIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL 305
             AYDLEKWGNF+RY FNA++TEQD EDT+QPPF +CI  GKASC+MCSYN VNGVPAC 
Sbjct: 229 FTAYDLEKWGNFTRYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACA 288

Query: 306 RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLR 365
           +GDL QKAR EWGF+GYITSDCDAVATIF YQ YTK+ E++ A  +KAG+DINCGT MLR
Sbjct: 289 QGDLLQKARVEWGFEGYITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLR 348

Query: 366 HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLAL 425
           HTQSAI++GKV E+ +DRALLNLF+VQLRLGLF+GDPR+G+YGKLG +D+C+S+H+KLAL
Sbjct: 349 HTQSAIEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLAL 408

Query: 426 DAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEA 485
           +A RQGIVLLKND K LPLNKN VSSLAI+GP+ NNIS MGG YTG PC  K+L   L  
Sbjct: 409 EATRQGIVLLKNDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLE 468

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
           YV KT YASGC DV C+SD GF EAV IAK ADFVIVVAGLDL+QETED+DRVSL LPG+
Sbjct: 469 YVKKTSYASGCSDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGK 528

Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
           Q  LV+ VA  SK+PVILVLTGGGP+DV+FA+ D +I SI+WIGYPGE G +ALAEIIFG
Sbjct: 529 QKDLVSHVAAVSKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFG 588

Query: 606 DFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNY 665
           DFNPGGRLP TWYPESFT V M+DM+MRA+SSR YPGR+YRFYTG QVY FG GLSYT +
Sbjct: 589 DFNPGGRLPTTWYPESFTDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKF 648

Query: 666 SYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE--VTSCTSLRFHVQISVTNA 723
            YK LSAP  L++S  L   S      Q G  L Y+ +D+  V SC SLRF+V++ V+N 
Sbjct: 649 EYKILSAPIRLSLSELLPQQSSHKKQLQHGEELRYLQLDDVIVNSCESLRFNVRVHVSNT 708

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G++DGSHVVMLF+++P V  G PEKQLIG+DRVH  +    E  F +DPC+QLS+AN  G
Sbjct: 709 GEIDGSHVVMLFSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVG 768

Query: 784 RRILPLGNHVLMVGELRHSLTIE 806
           +R++PLG+HVL +G+L+HSL++E
Sbjct: 769 KRVIPLGSHVLFLGDLQHSLSVE 791


>gi|449451581|ref|XP_004143540.1| PREDICTED: probable beta-D-xylosidase 6-like [Cucumis sativus]
          Length = 777

 Score = 1073 bits (2774), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 495/785 (63%), Positives = 623/785 (79%), Gaps = 14/785 (1%)

Query: 28  IAFSNSKPVLNKP---DFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN 84
           +A+ N K + N      FPC      SYPFCN SLS + RA+SL+SLLTL EKIQQLS+N
Sbjct: 3   VAWYNLKQLCNSSSDFQFPCNSSLHPSYPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNN 62

Query: 85  ASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG 144
           AS+IPRLGIP+Y+WWSE LHGIA+NGPGV+FNG+++S TSFPQVLV+AASFNR+LW  IG
Sbjct: 63  ASSIPRLGIPSYQWWSEGLHGIATNGPGVSFNGSITSATSFPQVLVTAASFNRTLWFLIG 122

Query: 145 SAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG 204
           SA+AVEARAM+N+GQ GLT WAPNINIFRDPRWGRGQETPGEDPMV SAY+++FV+  Q 
Sbjct: 123 SAIAVEARAMFNVGQCGLTIWAPNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQS 182

Query: 205 ENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA 264
            NW           +E  +     EE +    LM+SACCKH  AYDLEKW NF+RY+F++
Sbjct: 183 GNW----------MKEHEIRNEVLEEDNGMGSLMVSACCKHFTAYDLEKWNNFTRYTFDS 232

Query: 265 MITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYIT 324
           ++TEQD  DT+QPPFRSCI+QGKASC+MCSYN VNGVPAC   DL +KARN+WG KGYIT
Sbjct: 233 VVTEQDLGDTYQPPFRSCIQQGKASCLMCSYNAVNGVPACANPDLLKKARNDWGLKGYIT 292

Query: 325 SDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRA 384
           SDCDAVAT++EYQ YT T ED+ A VLKAGMDINCGT MLR T+SAID+GKV+E+++D A
Sbjct: 293 SDCDAVATVYEYQKYTDTPEDAIADVLKAGMDINCGTFMLRGTKSAIDQGKVREEELDSA 352

Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
           L+NLFSVQ RLG F+G+PR+GK+G+LG  DVCT++HK LAL+AARQGIVLLKN+ KFLPL
Sbjct: 353 LINLFSVQARLGFFDGNPREGKFGELGAQDVCTAQHKTLALEAARQGIVLLKNENKFLPL 412

Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSD 504
           +KNA+SSL +IG L N+ S++ GGY G+PCSP SL+ G + Y     +ASGC DVPC SD
Sbjct: 413 DKNAISSLTVIGSLANDSSKLLGGYAGVPCSPMSLVEGFQEYAETIFFASGCLDVPCASD 472

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
             F +A+ IAKKADFVI VAGLD +QETED DRVSLLLPG+QM LV+SVA  SK+P+ILV
Sbjct: 473 NRFEDAILIAKKADFVIAVAGLDASQETEDLDRVSLLLPGKQMDLVSSVASVSKKPIILV 532

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
           L GGGPLD+SFA+ DS+++SILWIG PGEAG KALAE+IFGD+NPGGRLP+TWYP+SFT 
Sbjct: 533 LIGGGPLDISFAKKDSRVASILWIGNPGEAGGKALAEVIFGDYNPGGRLPVTWYPQSFTN 592

Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
           VPMNDM+MR + SR YPGR+YRFYTG ++YGFG GLSYT++ Y+ LSAP ++ +    + 
Sbjct: 593 VPMNDMHMRPNPSRGYPGRTYRFYTGDRIYGFGEGLSYTSFKYRLLSAPKKVNLLGKAET 652

Query: 685 GSDKNILQ-QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
              + I Q + G  + Y+ ++EV SC  LRF V++SV+N G+ DGSHVVM+F+  PKV  
Sbjct: 653 SRRRIIPQVRDGVNMSYMEVEEVESCDLLRFEVKLSVSNIGEFDGSHVVMMFSEFPKVLT 712

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
           GTP++QLIGFDR++     S E S  VDPC  +S+A+++G+R++PLG+H + +G+L H +
Sbjct: 713 GTPQRQLIGFDRLYVKRNQSAESSIMVDPCNHVSLADEYGKRVIPLGDHTISLGDLEHVI 772

Query: 804 TIETY 808
           +I+ +
Sbjct: 773 SIQVF 777


>gi|449496501|ref|XP_004160150.1| PREDICTED: probable beta-D-xylosidase 6-like, partial [Cucumis
           sativus]
          Length = 767

 Score = 1072 bits (2772), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 491/768 (63%), Positives = 616/768 (80%), Gaps = 11/768 (1%)

Query: 42  FPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSE 101
           FPC      SYPFCN SLS + RA+SL+SLLTL EKIQQLS+NAS+IPRLGIP+Y+WWSE
Sbjct: 10  FPCNSSLHPSYPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSE 69

Query: 102 SLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG 161
            LHGIA+NGPGV+FNG+++S TSFPQVLV+AASFNR+LW  IGSA+AVEARAM+N+GQ G
Sbjct: 70  GLHGIATNGPGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCG 129

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LT WAPNINIFRDPRWGRGQETPGEDPMV SAY+++FV+  Q  NW           +E 
Sbjct: 130 LTIWAPNINIFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNW----------MKEH 179

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
            +     EE +    LM+SACCKH  AYDLEKW NF+RY+F++++TEQD  DT+QPPFRS
Sbjct: 180 EIRNEVLEEDNGMGSLMVSACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRS 239

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           CI+QGKASC+MCSYN VNGVPAC   DL +KARN+WG KGYITSDCDAVAT++EYQ YT 
Sbjct: 240 CIQQGKASCLMCSYNAVNGVPACANPDLLKKARNDWGLKGYITSDCDAVATVYEYQKYTD 299

Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
           T ED+ A VLKAGMDINCGT MLR T+SAID+GKV+E+++D AL+NLFSVQ RLG F+G+
Sbjct: 300 TPEDAIADVLKAGMDINCGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGN 359

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           PR+GK+G+LG  DVCT++HK LAL+AARQGIVLLKN+ KFLPL+KNA+SSL +IG L N+
Sbjct: 360 PREGKFGELGAQDVCTAQHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLAND 419

Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
            S++ GGY G+PCSP SL+ G + Y     +ASGC DVPC SD  F +A+ IAKKADFVI
Sbjct: 420 SSKLLGGYAGVPCSPMSLVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVI 479

Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
            VAGLD +QETED DRVSLLLPG+QM LV+SVA  SK+P+ILVL GGGPLD+SFA+ DS+
Sbjct: 480 AVAGLDASQETEDLDRVSLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSR 539

Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYP 641
           ++SILWIG PGEAG KALAE+IFGD+NPGGRLP+TWYP+SFT VPMNDM+MR + SR YP
Sbjct: 540 VASILWIGNPGEAGGKALAEVIFGDYNPGGRLPVTWYPQSFTNVPMNDMHMRPNPSRGYP 599

Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ-QTGSRLDY 700
           GR+YRFYTG ++YGFG GLSYT++ Y+ LSAP ++ +    +    + I Q + G  + Y
Sbjct: 600 GRTYRFYTGDRIYGFGEGLSYTSFKYRLLSAPKKVNLLGKAETSRRRIIPQVRDGVNMSY 659

Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
           + ++EV SC  LRF V++SV+N G+ DGSHVVM+F+  PKV  GTP++QLIGFDR++   
Sbjct: 660 MEVEEVESCDLLRFEVKLSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKR 719

Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
             S E S  VDPC  +S+A+++G+R++PLG+H + +G+L H ++I+ +
Sbjct: 720 NQSAESSIMVDPCNHVSLADEYGKRVIPLGDHTISLGDLEHVISIQVF 767


>gi|356531391|ref|XP_003534261.1| PREDICTED: probable beta-D-xylosidase 6-like [Glycine max]
          Length = 780

 Score = 1056 bits (2731), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 509/773 (65%), Positives = 618/773 (79%), Gaps = 27/773 (3%)

Query: 41  DFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           D+ CK     S+PFC+TSL   TRA+SL+SLLTL EKI  LS+NAS+IPRLGIPAY+WWS
Sbjct: 30  DYACK--FSQSHPFCDTSLPTLTRARSLVSLLTLPEKILLLSNNASSIPRLGIPAYQWWS 87

Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           ESLHG+A NGPGV+F G V S TSFPQV++SAASFNRSLW    +A+A EARAM+N+GQA
Sbjct: 88  ESLHGLALNGPGVSFAGAVPSATSFPQVILSAASFNRSLWLRTAAAIAREARAMFNVGQA 147

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GLTFWAPNIN+FRDPRWGRGQETPGEDPM+ SAYAVE+V+  QG                
Sbjct: 148 GLTFWAPNINLFRDPRWGRGQETPGEDPMLASAYAVEYVRGLQG---------------- 191

Query: 221 KRVLKGF--GEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
              L G       D  D LM+SACCKH  AYDL+ WG FSRY+FNA++++QD EDT+QPP
Sbjct: 192 ---LSGIQDAVVVDDDDTLMVSACCKHFTAYDLDMWGQFSRYNFNAVVSQQDLEDTYQPP 248

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQN 338
           FRSCI+QGKASC+MCSYN+VNGVPAC   +L   AR++WGFKGYITSDCDAVAT++EYQ 
Sbjct: 249 FRSCIQQGKASCLMCSYNEVNGVPACASEELLGLARDKWGFKGYITSDCDAVATVYEYQK 308

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           Y K+ ED+ A VLKAGMDINCGT MLRHT+SAI++GKV+E+D+DRALLNLFSVQLRLGLF
Sbjct: 309 YAKSQEDAVADVLKAGMDINCGTFMLRHTESAIEQGKVKEEDLDRALLNLFSVQLRLGLF 368

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +GDP +G++GKLGP DVCT EHK LALDAARQGIVLLKNDKKFLPL+++  +SLA+IGPL
Sbjct: 369 DGDPIRGRFGKLGPKDVCTQEHKTLALDAARQGIVLLKNDKKFLPLDRDIGASLAVIGPL 428

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
               +++GGGY+GIPCS  SL  GL  +  +  YA GC+DVPC+SD GF EA+  AK+AD
Sbjct: 429 ATT-TKLGGGYSGIPCSSSSLYEGLGEFAERISYAFGCYDVPCDSDDGFAEAIDTAKQAD 487

Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
           FV++VAGLD TQETED DRVSLLLPG+QM+LV+SVA  SK PVILVL GGGPLDVSFAE 
Sbjct: 488 FVVIVAGLDATQETEDHDRVSLLLPGKQMNLVSSVADASKNPVILVLIGGGPLDVSFAEK 547

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSR 638
           + QI+SI+W+GYPGEAG KALAEIIFG+FNP GRLPMTWYPE+FT VPMN+M+MRAD SR
Sbjct: 548 NPQIASIIWLGYPGEAGGKALAEIIFGEFNPAGRLPMTWYPEAFTNVPMNEMSMRADPSR 607

Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR- 697
            YPGR+YRFYTG +VYGFGHGLS++++SY FLSAPS++++S ++K GS K +L Q  +  
Sbjct: 608 GYPGRTYRFYTGGRVYGFGHGLSFSDFSYNFLSAPSKISLSRTIKDGSRKRLLYQVENEV 667

Query: 698 --LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
             +DYV ++++ +C  L F V ISV N G +DGSHVVMLF++ PKV  G+PE QL+GF R
Sbjct: 668 YGVDYVPVNQLQNCNKLSFSVHISVMNLGGLDGSHVVMLFSKGPKVVDGSPETQLVGFSR 727

Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
           +HT++    E S  V PCE LS A+K G+RILPLG H L VG+L H ++IE Y
Sbjct: 728 LHTISSKPTETSILVHPCEHLSFADKQGKRILPLGPHTLSVGDLEHVVSIEIY 780


>gi|26449574|dbj|BAC41913.1| putative beta-xylosidase [Arabidopsis thaliana]
          Length = 732

 Score = 1046 bits (2705), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 499/735 (67%), Positives = 589/735 (80%), Gaps = 7/735 (0%)

Query: 74  LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAA 133
           L EKI QLS+ A+++PRLGIP YEWWSESLHG+A NGPGV+FNG++S+ TSFPQV+VSAA
Sbjct: 2   LPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAA 61

Query: 134 SFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           SFNR+LW  IGSAVAVE RAMYN GQAGLTFWAPNIN+FRDPRWGRGQETPGEDP VVS 
Sbjct: 62  SFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGEDPKVVSE 121

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
           Y VEFV+ FQ E  K       F             + D   +LMLSACCKH  AYDLEK
Sbjct: 122 YGVEFVRGFQ-EKKKRKVLKRRFSDDVDDDRH----DDDADGKLMLSACCKHFTAYDLEK 176

Query: 254 WGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA 313
           WGNF+RY FNA++TEQD EDT+QPPF +CI  GKASC+MCSYN VNGVPAC +GDL QKA
Sbjct: 177 WGNFTRYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLLQKA 236

Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
           R EWGF+GYITSDCDAVATIF YQ YTK+ E++ A  +KAG+DINCGT MLRHTQSAI++
Sbjct: 237 RVEWGFEGYITSDCDAVATIFAYQGYTKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQ 296

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
           GKV E+ +DRALLNLF+VQLRLGLF+GDPR+G+YGKLG +D+C+S+H+KLAL+A RQGIV
Sbjct: 297 GKVSEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIV 356

Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYA 493
           LLKND K LPLNKN VSSLAI+GP+ NNIS MGG YTG PC  K+L   L  YV KT YA
Sbjct: 357 LLKNDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYA 416

Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
           SGC DV C+SD GF EAV IAK ADFVIVVAGLDL+QETED+DRVSL LPG+Q  LV+ V
Sbjct: 417 SGCSDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHV 476

Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
           A  SK+PVILVLTGGGP+DV+FA+ D +I SI+WIGYPGE G +ALAEIIFGDFNPGGRL
Sbjct: 477 AAVSKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRL 536

Query: 614 PMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
           P TWYPESFT V M+DM+MRA+SSR YPGR+YRFYTG QVY FG GLSYT + YK LSAP
Sbjct: 537 PTTWYPESFTDVAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKILSAP 596

Query: 674 SELTISASLKAGSDKNILQQTGSRLDYVHIDE--VTSCTSLRFHVQISVTNAGDVDGSHV 731
             L++S  L   S      Q G  L Y+ +D+  V SC SLRF+V++ V+N G++DGSHV
Sbjct: 597 IRLSLSELLPQQSSHKKQLQHGEELRYLQLDDVIVNSCESLRFNVRVHVSNTGEIDGSHV 656

Query: 732 VMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGN 791
           VMLF+++P V  G PEKQLIG+DRVH  +    E  F +DPC+QLS+AN  G+R++PLG+
Sbjct: 657 VMLFSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRVIPLGS 716

Query: 792 HVLMVGELRHSLTIE 806
           HVL +G+L+HSL++E
Sbjct: 717 HVLFLGDLQHSLSVE 731


>gi|384872601|gb|AFI25186.1| putative beta-D-xylosidase [Nicotiana tabacum]
          Length = 791

 Score = 1043 bits (2697), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 494/756 (65%), Positives = 600/756 (79%), Gaps = 7/756 (0%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           Y FCN +L ISTR +SLISLLT+ EKI  LSDN ++IPRLG+PAYEWWSESLHGIA+NGP
Sbjct: 41  YTFCNKNLPISTRVQSLISLLTIDEKILHLSDNTTSIPRLGLPAYEWWSESLHGIATNGP 100

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
            VNFNG +  VTSFPQV+++AA+FNR+LW +I +A+AVEARAMYNLGQAGLTFWAPNINI
Sbjct: 101 AVNFNGQIKGVTSFPQVILTAAAFNRTLWHSIATAIAVEARAMYNLGQAGLTFWAPNINI 160

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI-GFGFREKRVLKGFGEE 230
            RDPRWGRGQETPGEDPMVVSAYA+E+V  FQG N K+  G   G+G +++RVLK   E+
Sbjct: 161 LRDPRWGRGQETPGEDPMVVSAYAIEYVTGFQGLNPKAKKGNRNGYG-KKRRVLK---ED 216

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
            + G+ LMLSACCKH  AYDLEKWG+ +RY FNA++T+QD EDTFQ PFRSCI+QGKASC
Sbjct: 217 DNDGERLMLSACCKHFTAYDLEKWGDATRYDFNAVVTKQDMEDTFQAPFRSCIQQGKASC 276

Query: 291 IMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
           +MCSYN VNGVPAC   +L  K R +WGF GYITSDCDAVATI+E Q YTKT ED+ A  
Sbjct: 277 LMCSYNSVNGVPACADKELLDKVRTDWGFDGYITSDCDAVATIYENQKYTKTPEDAVAVA 336

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
           LKAG +INCGT MLRH +SA  +G V E+D+DRAL  LFSVQ RLGLF+G+P  G++   
Sbjct: 337 LKAGTNINCGTYMLRHMKSAFQQGSVLEEDLDRALQYLFSVQFRLGLFDGNPADGQFANF 396

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
           G  DVCTS H  LALDAARQGIVLLKND+KFLPL+K +VS+LAI+GP+  N+S  GG Y+
Sbjct: 397 GAQDVCTSNHLNLALDAARQGIVLLKNDQKFLPLDKTSVSTLAIVGPMA-NVSSPGGTYS 455

Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
           G+PC  KS+  G   ++++T YA+GC DV CNS AGF +A+ I K+AD+VIVVAG DL++
Sbjct: 456 GVPCKLKSIREGFHRHINRTLYAAGCLDVGCNSTAGFQDAISIVKEADYVIVVAGSDLSE 515

Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
           ETED DR SLLLPGQQ +LVT++A  SK+P+ILVLTGGGP+DVSFAE D +I+SILW+ Y
Sbjct: 516 ETEDHDRYSLLLPGQQTNLVTTLAAASKKPIILVLTGGGPVDVSFAEKDPRIASILWVAY 575

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
           PGE G KAL+EIIFG  NPGG+LPMTWY ESFTKVPM DMNMRAD S  YPGR+YRFYTG
Sbjct: 576 PGETGGKALSEIIFGYQNPGGKLPMTWYLESFTKVPMTDMNMRADPSNGYPGRTYRFYTG 635

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
             +YGFGHGLSYT++S + LSAPS L++S + K+   ++IL +  SRL Y+H+DEV SC 
Sbjct: 636 DVLYGFGHGLSYTSFSSQLLSAPSRLSLSLA-KSNRKRSILAKGRSRLGYIHVDEVESCH 694

Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
           S +F V ISVTN GD+DGSHV+MLF+RV +  QG P+KQL+GFDRVH  A+   E S  V
Sbjct: 695 SSKFFVHISVTNDGDMDGSHVLMLFSRVLQNFQGAPQKQLVGFDRVHVPARKYVETSLLV 754

Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           DPCE  S AN  G RIL LG H  ++ ++ H + +E
Sbjct: 755 DPCELFSFANDQGNRILALGEHTFILDDIEHVVFVE 790


>gi|163889365|gb|ABY48135.1| beta-D-xylosidase [Medicago truncatula]
          Length = 776

 Score =  983 bits (2540), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 489/824 (59%), Positives = 598/824 (72%), Gaps = 69/824 (8%)

Query: 6   MSIQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRA 65
           M +Q R  I+  +  L+++       +       PD+PCKPPH   YPFCN SL ISTR 
Sbjct: 1   MFLQQRSTIIIFLFSLLLI-------HLPKFFTTPDYPCKPPH-SHYPFCNISLPISTRT 52

Query: 66  KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSF 125
            SLISLLTL +KI QLS+ AS+I  LGIP+Y+WWSE+LHGIA+NGPGVNFNG+V S T+F
Sbjct: 53  TSLISLLTLSDKINQLSNTASSISHLGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNF 112

Query: 126 PQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPG 185
           PQV+VSAA+FNRSLW  IG AV VE RAM+N+GQAGL+FWAPN+N+FRDPRWGRGQETPG
Sbjct: 113 PQVIVSAAAFNRSLWFLIGYAVGVEGRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETPG 172

Query: 186 EDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKH 245
           EDPMV SAYAVEFV+  QG +      GI      K+VL    +     D LM+SACCKH
Sbjct: 173 EDPMVGSAYAVEFVRGIQGVD------GI------KKVLN---DHDSDDDGLMVSACCKH 217

Query: 246 LIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL 305
             AYDLEKWG FSRY+FNA++      +T+QPPFR C++QGKASC+MCSYN+VNGVPAC 
Sbjct: 218 FTAYDLEKWGEFSRYNFNAVV------NTYQPPFRGCVQQGKASCLMCSYNEVNGVPACA 271

Query: 306 RGDLFQKARNEWGFKGY------------------ITSDCDAVATIFEYQNYTKTHEDSA 347
             DL    RN+WGF+G                   + +    +  +F  Q +    E+  
Sbjct: 272 SKDLLGLVRNKWGFEGVGILPQTVMLWLLFLSIKSMQNLPKMLLLMFLKQVFFYVFEN-- 329

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
              L   MDINCGT MLRHT+SAI++G V+E+D+DRAL NLFSVQ+RLGLFNGDP KGK+
Sbjct: 330 ---LWFCMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSVQMRLGLFNGDPEKGKF 386

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
           GKLGP DVCT EHKKLAL+AARQGIVLLKND KFLPL+K    SLAIIGP+    S++GG
Sbjct: 387 GKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVSLAIIGPMATT-SELGG 445

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
           GY+GIPCSP+SL  GL+ YV    YA GC DV C+SD GF  A+ IAK+ADFV++VAGLD
Sbjct: 446 GYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDGFAVAIDIAKQADFVVIVAGLD 505

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T ETED DRVSLLLPG+QM LV+ VA  SKRPVILVLTGGGPLDVSFAE++  I+SILW
Sbjct: 506 TTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPLDVSFAESNQLITSILW 565

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRF 647
           IGYP              DF+  GRLPMTWYPESFT VPMNDM MRAD SR YPGR+YRF
Sbjct: 566 IGYP-------------VDFDAAGRLPMTWYPESFTNVPMNDMGMRADPSRGYPGRTYRF 612

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS---RLDYVHID 704
           YTG+++YGFGHGLSY+++SY+ LSAPS+L++S +   G  +++L +       +D+VH+D
Sbjct: 613 YTGSRIYGFGHGLSYSDFSYRVLSAPSKLSLSKTTNGGLRRSLLNKVEKDVFEVDHVHVD 672

Query: 705 EVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK 764
           E+ +C SL F V ISV N GD+DGSHVVMLF++ PK  QG+PE QL+G  R+HTV+  S 
Sbjct: 673 ELQNCNSLSFSVHISVMNVGDMDGSHVVMLFSKWPKNIQGSPESQLVGPSRLHTVSNKSI 732

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
           E S   DPCE  S A++ G+RILPLGNH+L VG++ H ++IE Y
Sbjct: 733 ETSILADPCEHFSFADEQGKRILPLGNHILNVGDVEHIVSIEIY 776


>gi|357164885|ref|XP_003580200.1| PREDICTED: probable beta-D-xylosidase 6-like [Brachypodium
           distachyon]
          Length = 771

 Score =  968 bits (2503), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 453/759 (59%), Positives = 580/759 (76%), Gaps = 29/759 (3%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPFC+ SL    RA++L+SLLTL EKI QLS+ A+ +PRLGIP YEWWSESLHG+A NGP
Sbjct: 37  YPFCDASLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGIPPYEWWSESLHGLADNGP 96

Query: 112 GVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
           GVNF+ G V + T FPQV++SAASFNRSLW  +  AVAVEARAM+N GQAGLT+WAPNIN
Sbjct: 97  GVNFSSGPVGAATIFPQVILSAASFNRSLWRAVAEAVAVEARAMHNAGQAGLTYWAPNIN 156

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           +FRDPRWGRGQETPGEDP V++AY+VE+VK FQGE     +G                  
Sbjct: 157 VFRDPRWGRGQETPGEDPAVIAAYSVEYVKGFQGEYGDGKEG------------------ 198

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                 +MLSACCKH +AYDLEKWGNF+RY+FNA + EQD EDT++PPF+SCI++G+ASC
Sbjct: 199 -----RMMLSACCKHYVAYDLEKWGNFTRYTFNAKVNEQDFEDTYEPPFKSCIQEGRASC 253

Query: 291 IMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
           +MCSYNQVNGVPAC R DL QK R+EWGF+GY+ SDCDAV  I+ YQNYT + EDS A V
Sbjct: 254 LMCSYNQVNGVPACARKDLLQKVRDEWGFQGYVVSDCDAVGIIYGYQNYTNSDEDSIAIV 313

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
           LKAGMDINCG+ ++RHT+SAI KGK+ E+DI+ AL NLFSVQLRLGLF+       + +L
Sbjct: 314 LKAGMDINCGSFLIRHTKSAIQKGKITEEDINHALFNLFSVQLRLGLFDKTSGNQWFTQL 373

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
           GP ++CT EH++LA +AARQG VLLKND  FLPL ++ VS +AIIGP+ N+   MGG YT
Sbjct: 374 GPSNICTKEHRELAAEAARQGTVLLKNDNSFLPLKRSEVSHIAIIGPVANDAYIMGGDYT 433

Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
           G+PC+P + L+G++A V +T  A+GC D+ CNS  GF EA+ +AK+AD V+++AGL+LTQ
Sbjct: 434 GVPCNPTTFLKGMQAVVPQTTIAAGCKDISCNSTDGFGEAIEVAKRADIVVLIAGLNLTQ 493

Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
           ETED DRVSLLLPG+QM L+ S+A  +K+P++LV+TGGGP+DVSFA+ D +I+S+LWIGY
Sbjct: 494 ETEDLDRVSLLLPGKQMDLINSIASVTKKPLVLVITGGGPVDVSFAKQDKRIASVLWIGY 553

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
           PGE G + L EI+FG++NPGG+LP+TWYPESFT VPMNDMNMRAD SR YPGR+YRFYTG
Sbjct: 554 PGEVGGQVLPEILFGEYNPGGKLPITWYPESFTAVPMNDMNMRADPSRSYPGRTYRFYTG 613

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR---LDYVHIDEVT 707
             VYGFG+GLSY+ YSY  + AP+++++S S  +  D    ++  +R   LDYV ++++ 
Sbjct: 614 DVVYGFGYGLSYSKYSYNIIQAPTKISLSRS--SAVDFISTKRAHTRRDGLDYVQVEDIA 671

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           SC S++F V ISV N G +DGSH V+LF R      G P KQL+GF+R++  A  +  + 
Sbjct: 672 SCESIKFSVHISVANDGAMDGSHAVLLFTRSKSSVPGFPLKQLVGFERLYAAAGKATNVE 731

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             VDPC+ +S AN  GRR+L LG+H+LMVG+  H   +E
Sbjct: 732 ITVDPCKLMSSANTEGRRVLLLGSHLLMVGDEEHEFFME 770


>gi|115459584|ref|NP_001053392.1| Os04g0530700 [Oryza sativa Japonica Group]
 gi|38346629|emb|CAD41212.2| OSJNBa0074L08.23 [Oryza sativa Japonica Group]
 gi|38346760|emb|CAE03865.2| OSJNBa0081C01.11 [Oryza sativa Japonica Group]
 gi|113564963|dbj|BAF15306.1| Os04g0530700 [Oryza sativa Japonica Group]
 gi|218195263|gb|EEC77690.1| hypothetical protein OsI_16749 [Oryza sativa Indica Group]
          Length = 770

 Score =  966 bits (2497), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 459/766 (59%), Positives = 577/766 (75%), Gaps = 27/766 (3%)

Query: 43  PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSES 102
           PC  P   +YPFCN +L    RA++L+SLLTL EKI QLS+ A+  PRLG+P +EWWSES
Sbjct: 29  PCASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSES 88

Query: 103 LHGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG 161
           LHG+  NGPGVNF+ G V S T FPQV++SAA+FNRSLW     A+AVEARAM+N GQAG
Sbjct: 89  LHGVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAG 148

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LTFWAPNIN+FRDPRWGRGQETPGEDP VVSAY+VE+VK FQ                  
Sbjct: 149 LTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQ------------------ 190

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
              + +GEE      +MLSACCKH IAYDLEKW  F+RY+FNA +  QD EDT+QPPF+S
Sbjct: 191 ---RDYGEEG----RMMLSACCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKS 243

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           CI++G+ASC+MCSYNQVNGVPAC R D+ Q+AR+EWGF+GYITSDCDAVA I E Q YT 
Sbjct: 244 CIQEGRASCLMCSYNQVNGVPACARKDILQRARDEWGFQGYITSDCDAVAIIHENQTYTA 303

Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
           + EDS A VLKAGMDINCG+ ++RHT+SAI+KGKVQE+DI+ AL NLFSVQLRLG F+  
Sbjct: 304 SDEDSIAVVLKAGMDINCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKT 363

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
                + +LGP++VCT+EH++LA +A RQG VLLKND  FLPL ++ V  +A+IGP  N+
Sbjct: 364 NENQWFTQLGPNNVCTTEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAAND 423

Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
              +GG YTG+PC   + ++G++AYV KT +A+GC DVPCNS  GF EA+  AK+AD V+
Sbjct: 424 PYILGGDYTGVPCHSTTFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVV 483

Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
           ++AGL+LT+ETED DRVSLLLPG+QM L+ +VA  +K+PV+LVL GGGP+DVSFA+ D +
Sbjct: 484 LIAGLNLTEETEDHDRVSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPR 543

Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYP 641
           I+SILWIGYPGE G   L EI+FG +NPGG+LP+TWYPESFT VPM+DMNMRAD+SR YP
Sbjct: 544 IASILWIGYPGEVGGNVLPEILFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYP 603

Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDY 700
           GR+YRFYTG  VYGFG+GLSY+ YSY  L AP ++++S +S+     +         +DY
Sbjct: 604 GRTYRFYTGDVVYGFGYGLSYSKYSYSILQAPKKISLSRSSVPDLISRKPAYTRRDGVDY 663

Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
           V ++++ SC +L+F V ISV+N G +DGSH V+LFA       G+P KQL+GF+RVHT A
Sbjct: 664 VQVEDIASCEALQFPVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAA 723

Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             S ++   VDPC+ +S AN  G R+L LG HVLMVG+  H L IE
Sbjct: 724 GRSTDVEITVDPCKLMSFANTEGTRVLFLGTHVLMVGDEEHELLIE 769


>gi|326491679|dbj|BAJ94317.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 772

 Score =  956 bits (2471), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 460/767 (59%), Positives = 576/767 (75%), Gaps = 27/767 (3%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C     +SY FC+ SL    RA++L+SLLTL EKI QLS+ A+ +PRLG+P YEWWSESL
Sbjct: 30  CASAEANSYAFCDGSLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGVPPYEWWSESL 89

Query: 104 HGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGL 162
           HG+A NGPGVNF+ G V++ T FPQV++SAA+FNRSLW  +  AVAVEARAM+N GQAGL
Sbjct: 90  HGLADNGPGVNFSSGPVAAATIFPQVILSAAAFNRSLWRAVAEAVAVEARAMHNAGQAGL 149

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           T+WAPNIN+FRDPRWGRGQETPGEDP +++AY+VE+VK FQGE     DG      RE R
Sbjct: 150 TYWAPNINVFRDPRWGRGQETPGEDPAMIAAYSVEYVKGFQGE---YGDG------REGR 200

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
                         +MLSACCKH IAYDLEKWG F+RY+FNA +  QD EDT++PPF+SC
Sbjct: 201 --------------MMLSACCKHYIAYDLEKWGKFARYTFNAEVNAQDFEDTYEPPFKSC 246

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           I++G+ASC+MCSYNQVNGVPAC R DL QK R+EWGFKGYI SDCDAVA I E Q YT +
Sbjct: 247 IQEGRASCLMCSYNQVNGVPACARKDLLQKIRDEWGFKGYIVSDCDAVAIIHENQTYTSS 306

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
            EDS A VLKAGMD+NCG+ ++RHT+SAI+KGK+QE+DI+ AL NLFSVQLRLGLF    
Sbjct: 307 DEDSVAIVLKAGMDVNCGSFLIRHTKSAIEKGKIQEEDINHALYNLFSVQLRLGLFEKAN 366

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
               + +LGP +VCT EH++LA +A RQG VLLKND  FLPL ++ VS +A+IG   N+ 
Sbjct: 367 ENQWFTRLGPSNVCTKEHRELAAEAVRQGTVLLKNDNSFLPLKRSKVSHIALIGAAANDA 426

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
             MGG YTG+PC P + L+G++A+V +T  A+GC DV C+S  GF EA+  AK+AD V+V
Sbjct: 427 YIMGGDYTGVPCDPITFLKGMQAFVPQTTVAAGCKDVSCDSPDGFGEAIEAAKRADIVVV 486

Query: 523 VAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           +AGL+LTQE+ED DRV+LLLPG+Q  LV  +A  +K+P++LV+TGGGP+DV+FA+ D +I
Sbjct: 487 IAGLNLTQESEDLDRVTLLLPGRQQDLVNIIASVTKKPIVLVITGGGPVDVAFAKQDPRI 546

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
           +S+LWIGYPGE G + L EI+FG++NPGG+LPMTWYPESFT VPMNDMNMRAD SR YPG
Sbjct: 547 ASVLWIGYPGEVGGQVLPEILFGEYNPGGKLPMTWYPESFTAVPMNDMNMRADPSRGYPG 606

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG--SDKNILQQTGSRLDY 700
           R+YRFYTG  VYGFG+GLSY+ YSY  + AP  +++S S   G  S K    +    LDY
Sbjct: 607 RTYRFYTGEVVYGFGYGLSYSKYSYNIVQAPQRISLSHSPVPGLISRKPAYTRRDG-LDY 665

Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
           V ++++ SC SL F V ISV N G +DGSH V+LFAR      G P KQL+GF+RV+T A
Sbjct: 666 VQVEDIASCESLVFSVHISVANDGAMDGSHAVLLFARSKSSVPGFPLKQLVGFERVYTAA 725

Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
             SK ++  VDPC+ +S AN  GRR+L LG+H LMVG+  H   IE 
Sbjct: 726 GSSKNVAITVDPCKYMSAANTEGRRVLLLGSHHLMVGDEVHEFVIEA 772


>gi|242076578|ref|XP_002448225.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
 gi|241939408|gb|EES12553.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
          Length = 766

 Score =  946 bits (2444), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 454/770 (58%), Positives = 574/770 (74%), Gaps = 33/770 (4%)

Query: 43  PCKPPHFDS--YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           PC  P+  S  YPFC+ SLSI  RA++L+SLLTL EKI QLS+ A  +PRLGIP Y+WWS
Sbjct: 23  PCASPYSGSSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWS 82

Query: 101 ESLHGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           ESLHG+A NGPGVNF+ G V + T+FPQV++S A+FNRSLW  +  AVA EA  M+N GQ
Sbjct: 83  ESLHGLADNGPGVNFSSGPVRAATTFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQ 142

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           AGLT+WAPNINIFRDPRWGRGQET GEDP V +AY++E+VK FQGE              
Sbjct: 143 AGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEQ------------- 189

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
                   GEE      + LSACCKH  AYD+EKW  FSRY+FNA +  QD EDT+QPPF
Sbjct: 190 --------GEEG----RIRLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPF 237

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNY 339
           ++CI++ +ASC+MC+YNQVNGVP C   DL QK R+EWGF+GYITSDCDAVA I E Q Y
Sbjct: 238 KTCIQEARASCLMCAYNQVNGVPMCANKDLLQKTRDEWGFQGYITSDCDAVAIIHENQTY 297

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           TK+ EDS A VLKAGMDINCG+ ++RHT+SA++KGKVQE+DIDRAL NLFSVQLRLG+F+
Sbjct: 298 TKSDEDSIAIVLKAGMDINCGSFLVRHTKSAVEKGKVQEQDIDRALFNLFSVQLRLGIFD 357

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
                    +LGP++VCT EH++LA +A RQG VLLKND  FLPL ++ V  +AIIGP  
Sbjct: 358 KPNNNQWSTQLGPNNVCTKEHRELAAEAVRQGAVLLKNDHSFLPLKRSEVRHVAIIGPSA 417

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
           N++  MGG YTG+ C+P + L+G++AY ++T +A+GC DV CNS   F EA+  AK+AD 
Sbjct: 418 NDVYAMGGDYTGVACNPTTFLKGIQAYATQTTFAAGCKDVSCNSTELFGEAIAAAKRADI 477

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+VVAGL+LT+E ED DRVSLLLPG+QMSL+ +VA  +K+P++LVL GGGP+DVSFA+ D
Sbjct: 478 VVVVAGLNLTEEREDFDRVSLLLPGKQMSLIHAVASVAKKPLVLVLLGGGPVDVSFAKQD 537

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQ 639
            +I+SILW+GYPGE G + L EI+FG++NPGG+L MTWYPESFT +PM DMNMRAD SR 
Sbjct: 538 PRIASILWLGYPGEVGGQVLPEILFGEYNPGGKLAMTWYPESFTAIPMTDMNMRADPSRG 597

Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS--LKAGSDK-NILQQTGS 696
           YPGR+YRFYTG  VYGFG+GLSY+ YSY  LSAP ++T+S S  L   S K + +++ G 
Sbjct: 598 YPGRTYRFYTGDVVYGFGYGLSYSKYSYSILSAPKKITMSRSSVLDIISRKPSYIRRDG- 656

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
            LD+V  +++ SC +L F V ++V+N G +DGSH V+LFAR      G P KQL+GF+RV
Sbjct: 657 -LDFVKTEDIASCEALAFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFERV 715

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           HT A  +  +   VDPC+ +S AN  G+R+L LG+HVL VG+    L IE
Sbjct: 716 HTAAGSASNVEISVDPCKHMSAANPEGKRVLLLGDHVLTVGDEEFELFIE 765


>gi|414586138|tpg|DAA36709.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 769

 Score =  913 bits (2360), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 445/771 (57%), Positives = 562/771 (72%), Gaps = 36/771 (4%)

Query: 43  PCKPPHFDS-YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSE 101
           PC  P+  S YPFC+ SLSI  RA++L+SLLTL EKI QLS+ A  +PRLGIP Y+WWSE
Sbjct: 27  PCASPYSTSAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSE 86

Query: 102 SLHGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           SLHG+A NGPGVNF+ G V + T FPQV++S A+FNRSLW  +  AVA EA  M+N GQA
Sbjct: 87  SLHGLADNGPGVNFSSGPVRAATDFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQA 146

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GLT+WAPNINIFRDPRWGRGQET GEDP V +AY++E+VK FQ                 
Sbjct: 147 GLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQ----------------- 189

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
                    E      + LSACCKH  AYD+EKW  FSRY+FNA +  QD EDT+QPPF+
Sbjct: 190 --------GEEGEEGRIRLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFK 241

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYT 340
           +CI++ +ASC+MC+YNQVNGVP C   DL QK R+EWGF+GYITSDCDAVA I E Q YT
Sbjct: 242 TCIQEARASCLMCAYNQVNGVPMCAHKDLLQKTRDEWGFQGYITSDCDAVAIIHENQTYT 301

Query: 341 KTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
           K+ EDS A VLKAGMDINCG+ ++RHT+SAI+KGK+QE+DIDRAL NLFSVQLRLG+F+ 
Sbjct: 302 KSGEDSIAIVLKAGMDINCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDK 361

Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
                 + +LGP+ VCT EH++LA +A RQG VLLKND  FLPL ++ V  +AIIGP  N
Sbjct: 362 PSNNQWFSQLGPNSVCTKEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSAN 421

Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFV 520
           +   MGG YTG+PC+P + L+G++AY ++T +A GC D  CNS   F EAV  AK+AD V
Sbjct: 422 DAYAMGGDYTGVPCNPTTFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIV 481

Query: 521 IVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADS 580
           +V+AGL+LT+E ED DRVSLLLPG+QM L+ ++A  +K+P++LVL GGGP+DVSFA+ D 
Sbjct: 482 VVIAGLNLTEEREDFDRVSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDP 541

Query: 581 QISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQY 640
           +I+SILW+GYPGE G + L EI+FG++NPGG+LP+TWYPESFT +PM DMNMRAD SR Y
Sbjct: 542 RIASILWLGYPGEVGGQVLPEILFGEYNPGGKLPITWYPESFTAIPMTDMNMRADPSRGY 601

Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR--- 697
           PGR+YRFYTG  VYGFG+GLSY+ YSY   SAP ++T+S S    SD  I+ +  +    
Sbjct: 602 PGRTYRFYTGDVVYGFGYGLSYSKYSYSISSAPKKITVSRS----SDLGIISRKPAYTRR 657

Query: 698 --LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
             L  V  +++ SC +L F V ++V+N G +DGSH V+LFAR      G P KQL+GF+ 
Sbjct: 658 DGLGSVKTEDIASCEALVFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFES 717

Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           VHT A  +  +   VDPC+Q+S AN  G+R+L LG HVL VG+    L+IE
Sbjct: 718 VHTAAGSASNVEITVDPCKQMSAANPEGKRVLLLGAHVLTVGDEEFELSIE 768


>gi|222629257|gb|EEE61389.1| hypothetical protein OsJ_15562 [Oryza sativa Japonica Group]
          Length = 771

 Score =  907 bits (2343), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 443/775 (57%), Positives = 561/775 (72%), Gaps = 44/775 (5%)

Query: 43  PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPR---------LGI 93
           PC  P   +YPFCN +L    RA++L+SLLTL EKI QL  +    P          +G+
Sbjct: 29  PCASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLLQHRRGRPPPRRPALRVVVGV 88

Query: 94  PAYEWWSESLHGIASNGPG-VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR 152
           P+           A+ GPG  +  G V S T FPQV++SAA+FNRSLW     A+AVEAR
Sbjct: 89  PS--------TASATTGPGSTSPRGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEAR 140

Query: 153 AMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDG 212
           AM+N GQAGLTFWAPNIN+FRDPRWGRGQETPGEDP VVSAY+VE+VK FQ         
Sbjct: 141 AMHNAGQAGLTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQ--------- 191

Query: 213 GIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTE 272
                       + +GEE      +MLSACCKH IAYDLEKW  F+RY+FNA +  QD E
Sbjct: 192 ------------RDYGEEG----RMMLSACCKHYIAYDLEKWRGFTRYTFNAKVNAQDME 235

Query: 273 DTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVAT 332
           DT+QPPF+SCI++G+ASC+MCSYNQVNGVPAC R D+ Q+AR+EWGF+GYITSDCDAVA 
Sbjct: 236 DTYQPPFKSCIQEGRASCLMCSYNQVNGVPACARKDILQRARDEWGFQGYITSDCDAVAI 295

Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
           I E Q YT + EDS A VLKAGMDINCG+ ++RHT+SAI+KGKVQE+DI+ AL NLFSVQ
Sbjct: 296 IHENQTYTASDEDSIAVVLKAGMDINCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQ 355

Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
           LRLG F+       + +LGP++VCT+EH++LA +A RQG VLLKND  FLPL ++ V  +
Sbjct: 356 LRLGFFDKTNENQWFTQLGPNNVCTTEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHI 415

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
           A+IGP  N+   +GG YTG+PC   + ++G++AYV KT +A+GC DVPCNS  GF EA+ 
Sbjct: 416 ALIGPAANDPYILGGDYTGVPCHSTTFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIE 475

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
            AK+AD V+++AGL+LT+ETED DRVSLLLPG+QM L+ +VA  +K+PV+LVL GGGP+D
Sbjct: 476 AAKRADVVVLIAGLNLTEETEDHDRVSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVD 535

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
           VSFA+ D +I+SILWIGYPGE G   L EI+FG +NPGG+LP+TWYPESFT VPM+DMNM
Sbjct: 536 VSFAKHDPRIASILWIGYPGEVGGNVLPEILFGKYNPGGKLPITWYPESFTAVPMDDMNM 595

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNIL 691
           RAD+SR YPGR+YRFYTG  VYGFG+GLSY+ YSY  L AP ++++S +S+     +   
Sbjct: 596 RADASRGYPGRTYRFYTGDVVYGFGYGLSYSKYSYSILQAPKKISLSRSSVPDLISRKPA 655

Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
                 +DYV ++++ SC +L+F V ISV+N G +DGSH V+LFA       G+P KQL+
Sbjct: 656 YTRRDGVDYVQVEDIASCEALQFPVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLV 715

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           GF+RVHT A  S ++   VDPC+ +S AN  G R+L LG HVLMVG+  H L IE
Sbjct: 716 GFERVHTAAGRSTDVEITVDPCKLMSFANTEGTRVLFLGTHVLMVGDEEHELLIE 770


>gi|90399376|emb|CAJ86207.1| B1011H02.4 [Oryza sativa Indica Group]
          Length = 738

 Score =  894 bits (2311), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 437/766 (57%), Positives = 549/766 (71%), Gaps = 59/766 (7%)

Query: 43  PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSES 102
           PC  P   +YPFCN +L    RA++L+SLLTL EKI QLS+ A+  PRLG+P +EWWSES
Sbjct: 29  PCASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSES 88

Query: 103 LHGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG 161
           LHG+  NGPGVNF+ G V S T FPQV++SAA+FNRSLW     A+AVEARAM+N GQAG
Sbjct: 89  LHGVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAG 148

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LTFWAPNIN+FRDPRWGRGQETPGEDP VVSAY+VE+VK FQ                  
Sbjct: 149 LTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQ------------------ 190

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
              + +GEE      +MLSACCKH IAYDLEKW  F+RY+FNA                 
Sbjct: 191 ---RDYGEEG----RMMLSACCKHYIAYDLEKWRGFTRYTFNA----------------- 226

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
                          +VNGVPAC R D+ Q+AR+EWGF+GYITSDCDAVA I E Q YT 
Sbjct: 227 ---------------KVNGVPACARKDILQRARDEWGFQGYITSDCDAVAIIHENQTYTA 271

Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
           + EDS A VLKAGMDINCG+ ++RHT+SAI+KGKVQE+DI+ AL NLFSVQLRLG F+  
Sbjct: 272 SDEDSIAVVLKAGMDINCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKT 331

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
                + +LGP++VCT+EH++LA +A RQG VLLKND  FLPL ++ V  +A+IGP  N+
Sbjct: 332 NENQWFTQLGPNNVCTTEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAAND 391

Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
              +GG YTG+PC   + ++G++AYV KT +A+GC DVPCNS  GF EA+  AK+AD V+
Sbjct: 392 PYILGGDYTGVPCHSTTFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVV 451

Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
           ++AGL+LT+ETED DRVSLLLPG+QM L+ +VA  +K+PV+LVL GGGP+DVSFA+ D +
Sbjct: 452 LIAGLNLTEETEDHDRVSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPR 511

Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYP 641
           I+SILWIGYPGE G   L EI+FG +NPGG+LP+TWYPESFT VPM+DMNMRAD+SR YP
Sbjct: 512 IASILWIGYPGEVGGNVLPEILFGKYNPGGKLPITWYPESFTAVPMDDMNMRADASRGYP 571

Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDY 700
           GR+YRFYTG  VYGFG+GLSY+ YSY  L AP ++++S +S+     +         +DY
Sbjct: 572 GRTYRFYTGDVVYGFGYGLSYSKYSYSILQAPKKISLSRSSVPDLISRKPAYTRRDGVDY 631

Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
           V ++++ SC +L+F V ISV+N G +DGSH V+LFA       G+P KQL+GF+RVHT A
Sbjct: 632 VQVEDIASCEALQFPVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAA 691

Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             S ++   VDPC+ +S AN  G R+L LG HVLMVG+  H L IE
Sbjct: 692 GRSTDVEITVDPCKLMSFANTEGTRVLFLGTHVLMVGDEEHELLIE 737


>gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum lycopersicum]
          Length = 775

 Score =  810 bits (2093), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 398/774 (51%), Positives = 520/774 (67%), Gaps = 33/774 (4%)

Query: 39  KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           +P F C    P   S  FC T L IS R   L+S LTL EKI QL ++A AIPRLGIPAY
Sbjct: 26  QPPFSCDSSNPQTKSLKFCQTGLPISVRVLDLVSRLTLDEKISQLVNSAPAIPRLGIPAY 85

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           EWWSESLHG+ S G G+ FNG+++  TSFPQV+++AA+F+ +LW  IG  + VEAR +YN
Sbjct: 86  EWWSESLHGVGSAGKGIFFNGSIAGATSFPQVILTAATFDENLWYRIGQVIGVEARGVYN 145

Query: 157 LGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
            GQA G+TFWAPNINIFRDPRWGRGQETPGEDP++   YA+ +V+  QG+++   +GG  
Sbjct: 146 AGQAIGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGKYAIRYVRGVQGDSF---NGG-- 200

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                 ++ KG          L  SACCKH  AYDL++W N  R+SFNA++T QD  DTF
Sbjct: 201 ------QLKKG---------HLQASACCKHFTAYDLDQWKNLDRFSFNAIVTPQDMADTF 245

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIF 334
           QPPF+ CI++ +AS IMCSYN VNG+P+C   +L  K AR +WGF GYITSDCDAV  + 
Sbjct: 246 QPPFQDCIQKAQASGIMCSYNSVNGIPSCANYNLLTKTARQQWGFHGYITSDCDAVQVMH 305

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           +   Y  T EDS A  LKAGMDI+CG  + ++T+SA+ K KV +  IDRAL NLFS+++R
Sbjct: 306 DNHRYGNTPEDSTAFALKAGMDIDCGDYLKKYTKSAVMKKKVSQVHIDRALHNLFSIRMR 365

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LGLFNGDPRK  YG + P  VC  +H++LAL+AAR GIVLLKN  K LPL+K   +SLA+
Sbjct: 366 LGLFNGDPRKQLYGNISPSQVCAPQHQQLALEAARNGIVLLKNTGKLLPLSKAKTNSLAV 425

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IG   NN   + G Y G PC    +L+ L  Y     Y  GC+   C S A   +AV IA
Sbjct: 426 IGHNANNAYILRGNYDGPPCKYIEILKALVGYAKSVQYQQGCNAANCTS-ANIDQAVNIA 484

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           + AD+V+++ GLD TQE E  DR  L+LPGQQ +L+ SVA+ +K+PVILV+  GGP+D+S
Sbjct: 485 RNADYVVLIMGLDQTQEREQFDRDDLVLPGQQENLINSVAKAAKKPVILVILSGGPVDIS 544

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           FA+ + +I SILW GYPGEAG  ALAEIIFG+ NPGG+LP+TWYP++F K+PM DM MR 
Sbjct: 545 FAKYNPKIGSILWAGYPGEAGGIALAEIIFGEHNPGGKLPVTWYPQAFVKIPMTDMRMRP 604

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDKNILQQ 693
           D    YPGR+YRFY G +VY FG+GLSYT YSY F SA P+ + ++  L   + +N    
Sbjct: 605 DPKTGYPGRTYRFYKGPKVYEFGYGLSYTTYSYGFHSATPNTIQLNQLLSVKTVEN---- 660

Query: 694 TGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
               + Y  +DE+ S  C   +F   +SV N+G++DG H V+LF +  K   G+P KQL+
Sbjct: 661 -SDSIRYTFVDEIGSDNCEKAKFSAHVSVENSGEMDGKHPVLLFVKQDKARNGSPIKQLV 719

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           GF  V   A  + ++ F + PCE LS AN+ G  ++  G+  L+VG+  H + I
Sbjct: 720 GFQSVSLKAGENSQLVFEISPCEHLSSANEDGLMMIEEGSRYLVVGDAEHPINI 773


>gi|302141935|emb|CBI19138.3| unnamed protein product [Vitis vinifera]
          Length = 1411

 Score =  805 bits (2078), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 407/793 (51%), Positives = 523/793 (65%), Gaps = 62/793 (7%)

Query: 18   VLQLIVVVNVIAFSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQ 75
            VL  +  +   A S  K + + P F C    P   SY FCNT+L IS RA  LIS LTL 
Sbjct: 674  VLLFVRRLKASAGSPIKQLSSSPPFACDSSDPLTKSYAFCNTTLRISQRASDLISRLTLD 733

Query: 76   EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
            EKI QL  +A++IPRLGIPAYEWWSE+LHGI     G+ FNGT+ S TSFPQV+++AASF
Sbjct: 734  EKISQLISSAASIPRLGIPAYEWWSEALHGIRDRH-GIRFNGTIRSATSFPQVILTAASF 792

Query: 136  NRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
            +  LW  IG A+ +E RAMYN GQA G+TFWAPNINIFRDPRWGRGQETPGEDP+V   Y
Sbjct: 793  DAHLWYRIGQAIGIETRAMYNAGQAMGMTFWAPNINIFRDPRWGRGQETPGEDPVVAGKY 852

Query: 195  AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
            AV +V+  QG+ +                      E  + D L  SACCKH  AYDL+ W
Sbjct: 853  AVSYVRGLQGDTF----------------------EGGKVDVLQASACCKHFTAYDLDNW 890

Query: 255  GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-A 313
             +  RY+F+A +T QD  DT+QPPFRSCIE+G+AS +MC+YN VNGVP C   +L  K A
Sbjct: 891  TSIDRYTFDARVTMQDLADTYQPPFRSCIEEGRASGLMCAYNLVNGVPNCADFNLLSKTA 950

Query: 314  RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
            R +WGF GYI SDCDAV+ + + Q Y K+ ED+ A VL AGMD+ CG  + +H +SA+ +
Sbjct: 951  RGQWGFDGYIVSDCDAVSLVHDVQGYAKSPEDAVAIVLTAGMDVACGGYLQKHAKSAVSQ 1010

Query: 374  GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
             K+ E +IDRALLNLF+V++RLGLFNG+PRK  +G +GPD VC++EH+ LAL+AAR GIV
Sbjct: 1011 KKLTESEIDRALLNLFTVRMRLGLFNGNPRKLPFGNIGPDQVCSTEHQTLALEAARSGIV 1070

Query: 434  LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYA 493
            LLKN  + LPL+K    SLA+IGP  N    + G Y G PC   S L+GL++YV+ T Y 
Sbjct: 1071 LLKNSDRLLPLSKGETLSLAVIGPNANATDTLLGNYAGPPCKFISPLQGLQSYVNNTMYH 1130

Query: 494  SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
            +GC+DV C+S A    AV +AK+AD+V++V GLD TQE E  DR+ L+LPG+Q  L+T V
Sbjct: 1131 AGCNDVACSS-ASIENAVDVAKQADYVVLVMGLDQTQEREKYDRLDLVLPGKQEQLITGV 1189

Query: 554  ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
            A+ +K+PV+LVL  GGP+D+SFA+  S I SILW GYPGEAG  A+AE IFGD NPGGRL
Sbjct: 1190 AKAAKKPVVLVLLCGGPVDISFAKGSSNIGSILWAGYPGEAGGAAIAETIFGDHNPGGRL 1249

Query: 614  PMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-A 672
            P+TWYP+ F K+PM DM MR +    YPGR++RFYTG  V+ FG+GLSY+ YSY+FLS  
Sbjct: 1250 PVTWYPKDFIKIPMTDMRMRPEPQSGYPGRTHRFYTGKTVFEFGNGLSYSPYSYEFLSVT 1309

Query: 673  PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
            P++L                       Y++    T       HV   V N+G + G H V
Sbjct: 1310 PNKL-----------------------YLNQPSTT-------HV---VENSGKMAGKHPV 1336

Query: 733  MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
            +LF +  K   G+P KQL+GF  V   A  S  + F + PCE LS ANK G  ++  G H
Sbjct: 1337 LLFVKQAKAGNGSPMKQLVGFQNVFLDAGESSNVEFILSPCEHLSRANKDGLMVMEQGIH 1396

Query: 793  VLMVGELRHSLTI 805
            +L+VG+  + + I
Sbjct: 1397 LLVVGDKEYPIAI 1409



 Score =  729 bits (1881), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/747 (50%), Positives = 492/747 (65%), Gaps = 59/747 (7%)

Query: 8   IQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRA 65
           ++L   +L  ++ + V++  +  + S P      F C    P   SY FC T+L I  R 
Sbjct: 1   MKLHSLLLINLIYVTVILVGVESTQSPP------FSCDSSNPSTKSYHFCKTTLPIPDRV 54

Query: 66  KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSF 125
           + L+S LTL EKI QL ++A AIPRLGIPAYEWWSE+LHG+A  GPG+ FNGT+ S TSF
Sbjct: 55  RDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSF 114

Query: 126 PQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETP 184
           PQV+++AASF+  LW  IG A+ VEARA+YN GQ  G+TFWAPNINIFRDPRWGRGQETP
Sbjct: 115 PQVILTAASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETP 174

Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
           GEDP+V  +YAV +V+  QG+                  L+G      R  EL  SACCK
Sbjct: 175 GEDPLVTGSYAVSYVRGVQGD-----------------CLRGL----KRCGELQASACCK 213

Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
           H  AYDL+ W    R+ F+A +T QD  DT+QPPF  CIE+G+AS IMC+YN+VNGVP+C
Sbjct: 214 HFTAYDLDDWKGIDRFKFDARVTMQDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSC 273

Query: 305 LRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM 363
              +L    AR  W F+GYITSDCDAV+ I +   + KT ED+   VLKAGMD+NCGT +
Sbjct: 274 ADFNLLTNTARKRWNFQGYITSDCDAVSLIHDSYGFAKTPEDAVVDVLKAGMDVNCGTYL 333

Query: 364 LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKL 423
           L HT+SA+ + K+ E ++DRAL NLF+V++RLGLFNG+P+   YG +GP+ VC+ EH+ L
Sbjct: 334 LNHTKSAVMQKKLPESELDRALENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTL 393

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGL 483
           ALDAAR GIVLLKN ++ LPL K    SLA+IGP  N+   + G Y G PC   + L+ L
Sbjct: 394 ALDAARDGIVLLKNSQRLLPLPKGKTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQAL 453

Query: 484 EAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
           ++YV  T Y  GC  V C+S +   +AV IA+KAD+V++V GLD TQE E  DR+ L+LP
Sbjct: 454 QSYVKSTMYHPGCDAVACSSPS-IEKAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLP 512

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
           G+Q  L+  VA  +K+PV+LVL  GGP+D+SFA+  + I SILW GYPG AG  A+AE I
Sbjct: 513 GKQQQLIICVANAAKKPVVLVLLSGGPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETI 572

Query: 604 FGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           FGD NPGGRLP+TWYP+ FTK+PM DM MR +S+  YPGR+YRFYTG +V+ FG+GLSY+
Sbjct: 573 FGDHNPGGRLPVTWYPQDFTKIPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGYGLSYS 632

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            YS +            ++    +K    Q+ +     H+ E T   S+R+         
Sbjct: 633 TYSCE------------TIPVTRNKLYFNQSST----AHVYENTD--SIRYT-------- 666

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQL 750
             + G H V+LF R  K S G+P KQL
Sbjct: 667 -SMAGKHSVLLFVRRLKASAGSPIKQL 692


>gi|224082152|ref|XP_002306583.1| predicted protein [Populus trichocarpa]
 gi|222856032|gb|EEE93579.1| predicted protein [Populus trichocarpa]
          Length = 745

 Score =  803 bits (2075), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 393/771 (50%), Positives = 513/771 (66%), Gaps = 56/771 (7%)

Query: 39  KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           +P F C    P   ++PFC T+L IS RA  L+S LTL+EKI QL ++A  IPRLGIP Y
Sbjct: 25  QPPFSCDSSNPSTKTFPFCKTTLPISQRANDLVSRLTLEEKISQLVNSAQPIPRLGIPGY 84

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           +WWSE+LHG+A  GPG+ FNGT+   TSFPQV++SAASF+ + W  I  A+  EARA+YN
Sbjct: 85  QWWSEALHGVAYAGPGIRFNGTIKRATSFPQVILSAASFDANQWYRISQAIGKEARALYN 144

Query: 157 LGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
            GQA G+TFWAPNINIFRDPRWGRGQETPGEDP++   YAV +V+  QG+++K  +    
Sbjct: 145 AGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLMTGKYAVSYVRGLQGDSFKGGE---- 200

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                   +KG          L  SACCKH  AYDLE W   SRY F+A +T QD  DT+
Sbjct: 201 --------IKG---------PLQASACCKHFTAYDLENWNGTSRYVFDAYVTAQDLADTY 243

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIF 334
           QPPF+SC+E+G+AS IMC+YN+VNG+P C   +   + AR +WGF GYI SDCDAV+ I 
Sbjct: 244 QPPFKSCVEEGRASGIMCAYNRVNGIPNCADSNFLSRTARAQWGFDGYIASDCDAVSIIH 303

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           + Q Y KT ED+   VLKAGMD+NCG+ + +HT++A+D+ K+   +IDRAL NLFSV++R
Sbjct: 304 DAQGYAKTPEDAVVAVLKAGMDVNCGSYLQQHTKAAVDQKKLTISEIDRALHNLFSVRMR 363

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LGLFNG+P   ++G +GPD VC+ E++ LALDAAR GIVLLKN    LPL+K+   SLA+
Sbjct: 364 LGLFNGNPTGQQFGNIGPDQVCSQENQILALDAARNGIVLLKNSAGLLPLSKSKTMSLAV 423

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IGP  N++  + G Y G PC   + L+ L++Y+  T    GC  V C+S A    AV +A
Sbjct: 424 IGPNANSVQTLLGNYAGPPCKLVTPLQALQSYIKHTIPYPGCDSVQCSS-ASIVGAVNVA 482

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           K AD V+++ GLD TQE E  DR  L+LPG+Q  L+ SVA+ +K PV+LVL  GGP+D+S
Sbjct: 483 KGADHVVLIMGLDDTQEKEGLDRRDLVLPGKQQELIISVAKAAKNPVVLVLLSGGPVDIS 542

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           FA+ D  I SILW GYPGEAGA ALAEIIFGD NPGG+LPMTWYP+ F KVPM DM MR 
Sbjct: 543 FAKNDKNIGSILWAGYPGEAGAIALAEIIFGDHNPGGKLPMTWYPQEFVKVPMTDMRMRP 602

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           ++S  YPGR+YRFY G  V+ FG+GLSY+ Y+Y+                          
Sbjct: 603 ETSSGYPGRTYRFYKGPTVFEFGYGLSYSKYTYE-------------------------- 636

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
              L  ++I E   C +++F V +SV N G + G H V+LFAR  K  +G P K+L+GF 
Sbjct: 637 ---LRAIYIGE-EQCENIKFKVTVSVKNEGQMAGKHPVLLFARHAKPGKGRPIKKLVGFQ 692

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
            V   A    EI + + PCE LS AN+ G  ++  G+ +L+VG+  H +TI
Sbjct: 693 TVKLGAGEKTEIEYELSPCEHLSSANEDGVMVMEEGSQILLVGDKEHPVTI 743


>gi|255545664|ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 774

 Score =  802 bits (2072), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 398/774 (51%), Positives = 524/774 (67%), Gaps = 34/774 (4%)

Query: 39  KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           +P F C P  P   S+ FC TSL IS R + L+S LTL EKI QL  +A +IPRLGIPAY
Sbjct: 26  EPPFSCDPSNPSTSSFLFCKTSLPISQRVRDLVSRLTLDEKISQLVSSAPSIPRLGIPAY 85

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           EWWSE+LHG+A+ G G++F G + + TSFPQV+++AASF+   W  IG  +  EARA+YN
Sbjct: 86  EWWSEALHGVANVGRGIHFEGAIKAATSFPQVILTAASFDAYQWYRIGQVIGREARAVYN 145

Query: 157 LGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
            GQA G+TFWAPNINIFRDPRWGRGQETPGEDP+V   YAV +V+  QG++++   GG  
Sbjct: 146 AGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDSFQ---GG-- 200

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                   LKG          L  SACCKH  AYDL+ W   +R+ F+A +T QD  DT+
Sbjct: 201 -------KLKG---------HLQASACCKHFTAYDLDNWKGVNRFVFDARVTMQDLADTY 244

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIF 334
           QPPF+SC++QGKAS IMC+YN+VNG+P+C   +L  + AR +W F GYI SDCDAV+ I+
Sbjct: 245 QPPFQSCVQQGKASGIMCAYNRVNGIPSCADFNLLSRTARGQWDFHGYIASDCDAVSIIY 304

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           + Q Y K+ ED+   VLKAGMD+NCG+ + +HT++A+++ K+ E  IDRAL NLFSV++R
Sbjct: 305 DNQGYAKSPEDAVVDVLKAGMDVNCGSYLQKHTKAAVEQKKLPEASIDRALHNLFSVRMR 364

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LGLFNG+P +  +  +GPD VC+ EH+ LAL+AAR GIVLLKN  + LPL K+   SLA+
Sbjct: 365 LGLFNGNPTEQPFSNIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLQKSKTVSLAV 424

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IGP  N++  + G Y G PC   + L+ L+ YV  T Y SGC  V C+S A   +AV IA
Sbjct: 425 IGPNANSVQTLLGNYAGPPCKTVTPLQALQYYVKNTIYYSGCDTVKCSS-ASIDKAVDIA 483

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           K  D V+++ GLD TQE E+ DR+ L+LPG+Q  L+T+VA+++K P++LVL  GGP+D+S
Sbjct: 484 KGVDRVVMIMGLDQTQEREELDRLDLVLPGKQQELITNVAKSAKNPIVLVLLSGGPVDIS 543

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           FA+ D  I SILW GYPGEAG  ALAEIIFGD NPGG+LPMTWYP+ F KVPM DM MR 
Sbjct: 544 FAKYDENIGSILWAGYPGEAGGIALAEIIFGDHNPGGKLPMTWYPQEFVKVPMTDMRMRP 603

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY--KFLSAPSELTISASLKAGSDKNILQ 692
           D S  YPGR+YRFY G  V+ FG+GLSY+ YSY  K++S  ++L ++ S    S   I+ 
Sbjct: 604 DPSSGYPGRTYRFYKGRNVFEFGYGLSYSKYSYELKYVSQ-TKLYLNQS----STMRIID 658

Query: 693 QTGS-RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
            +   R   V       C   +F V++ V N G++ G H V+LFAR  +   G P +QLI
Sbjct: 659 NSDPVRATLVAQLGAEFCKESKFSVKVGVENQGEMAGKHPVLLFARHARHGNGRPRRQLI 718

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           GF  V   A    EI F + PCE  S AN+ G R++  G H LMVG  ++ +++
Sbjct: 719 GFKSVILNAGEKAEIEFELSPCEHFSRANEDGLRVMEEGTHFLMVGGDKYPISV 772


>gi|224066931|ref|XP_002302285.1| predicted protein [Populus trichocarpa]
 gi|222844011|gb|EEE81558.1| predicted protein [Populus trichocarpa]
          Length = 773

 Score =  799 bits (2063), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 399/764 (52%), Positives = 518/764 (67%), Gaps = 32/764 (4%)

Query: 39  KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           +P F C    P   ++PFC T+L IS RA+ L+S LTL EKI QL ++A  IPRLGIP Y
Sbjct: 25  QPPFSCDSSNPSTKAFPFCETTLPISQRARDLVSRLTLDEKISQLVNSAPPIPRLGIPGY 84

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           EWWSE+LHG+++ GPG++FN  +   TSFPQV+++AASF+   W  IG A+  EARA+YN
Sbjct: 85  EWWSEALHGVSNAGPGIHFNDNIKGATSFPQVILTAASFDAYQWYRIGQAIGKEARALYN 144

Query: 157 LGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
            GQA G+TFWAPNINIFRDPRWGRGQETPGEDP+V   YA  +VK  QG+++   +GG  
Sbjct: 145 AGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGLYAASYVKGVQGDSF---EGG-- 199

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                   +KG          L  SACCKH  AYDL+ W   +R+ F+A +T QD  DT+
Sbjct: 200 -------KIKG---------HLQASACCKHFTAYDLDNWKGMNRFVFDARVTMQDLADTY 243

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIF 334
           QPPF+SC+EQG+AS IMC+YN+VNGVP+C   +L  K AR +WGF+GYITSDCDAV+ I 
Sbjct: 244 QPPFKSCVEQGRASGIMCAYNKVNGVPSCADSNLLSKTARAQWGFRGYITSDCDAVSIIH 303

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           + Q Y K+ ED+   VLKAGMD+NCG+ +L+H + A+++ K+ E DID+AL NLFSV++R
Sbjct: 304 DDQGYAKSPEDAVVDVLKAGMDVNCGSYLLKHAKVAVEQKKLSESDIDKALHNLFSVRMR 363

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LGLFNG P    +G +GPD VC+ EH+ LAL+AAR GIVLLKN  + LPL+K+   SLA+
Sbjct: 364 LGLFNGRPEGQLFGNIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLSKSKTKSLAV 423

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IGP  N+   + G Y G PC   + L+ L++Y+ +T Y   C  V C+S A    AV +A
Sbjct: 424 IGPNANSGQMLLGNYAGPPCRFVTPLQALQSYIKQTVYHPACDTVQCSS-ASVDRAVDVA 482

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           K AD V+++ GLD TQE E+ DR  LLLPG+Q  L+ +VA+ +K PV+LVL  GGP+D+S
Sbjct: 483 KGADNVVLMMGLDQTQEREELDRTDLLLPGKQQELIIAVAKAAKNPVVLVLFSGGPVDIS 542

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           FA+ D  I SILW GYPGE GA ALAEI+FGD NPGGRLPMTWYP+ F KVPM DM MR 
Sbjct: 543 FAKNDKNIGSILWAGYPGEGGAIALAEIVFGDHNPGGRLPMTWYPQEFVKVPMTDMGMRP 602

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           ++S  YPGR+YRFY G  V+ FG+G+SY+ YSY+ L+A S+ T+   L   S  +I+   
Sbjct: 603 EASSGYPGRTYRFYRGRSVFEFGYGISYSKYSYE-LTAVSQNTL--YLNQSSTMHIINDF 659

Query: 695 GSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
            S +    I E+ +  C   +   +I V N G++ G H V+LFAR  K   G P KQLIG
Sbjct: 660 DS-VRSTLISELGTEFCEQNKCRARIGVKNHGEMAGKHPVLLFARQEKHGNGRPRKQLIG 718

Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           F  V   A    EI F V PCE LS AN+ G  ++  G H L+V
Sbjct: 719 FQSVVLGAGERAEIEFEVSPCEHLSRANEDGLMVMEEGRHFLVV 762


>gi|242062502|ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
 gi|241932371|gb|EES05516.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
          Length = 784

 Score =  795 bits (2053), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 391/768 (50%), Positives = 518/768 (67%), Gaps = 35/768 (4%)

Query: 38  NKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           ++P + C      + PFC+T+L I  R   L+S LT+ EKI QL D + AIPRLG+PAY+
Sbjct: 36  SEPPYTCGAGAPPNIPFCDTALPIDRRVDDLVSRLTVAEKISQLGDESPAIPRLGVPAYK 95

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG+A+ G G++ +G + + TSFPQV+++AASFN  LW  IG  + VEARA+YN 
Sbjct: 96  WWSEALHGVANAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEARAVYNN 155

Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           GQA GLTFWAPNIN+FRDPRWGRGQETPGEDP +   YA  FV+  QG           +
Sbjct: 156 GQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQG-----------Y 204

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
           G      + G    +D    L  SACCKH  AYDLE W   +RY ++A +T QD EDT+ 
Sbjct: 205 G------VAGPVNSTD----LEASACCKHFTAYDLENWKGITRYVYDAKVTAQDLEDTYN 254

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
           PPF+SC+E G AS IMCSYN+VNGVP C   +L  K AR  WGF GYITSDCDAV+ I +
Sbjct: 255 PPFKSCVEDGHASGIMCSYNRVNGVPTCADYNLLSKTARQSWGFYGYITSDCDAVSIIHD 314

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q Y KT ED+ A VLKAGMD+NCG  + ++  SA+ +GK+ E+DI+RAL NLF+V++RL
Sbjct: 315 AQGYAKTSEDAVADVLKAGMDVNCGGYVQKYGASALQQGKITEQDINRALHNLFTVRMRL 374

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           GLFNGDPR+ +YG +GPD VCT EH+ LAL+AA+ GIVLLKND   LPL+K+ V+SLA+I
Sbjct: 375 GLFNGDPRRNRYGNIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGALPLSKSGVASLAVI 434

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           G   NN + + G Y G PC   + L+ L+ YV  T + +GC+   CN      EAV+ A 
Sbjct: 435 GFNANNATSLLGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTT-IPEAVQAAS 493

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD V++  GLD  QE E+ DR+ L LPGQQ +L+ SVA  +K+PVILVL  GGP+DVSF
Sbjct: 494 SADSVVLFMGLDQNQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSF 553

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
           A+ + +I +ILW GYPGEAG  A+A+++FG+ NPGGRLP+TWYP+ FTKVPM DM MRAD
Sbjct: 554 AKTNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTKVPMTDMRMRAD 613

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQT 694
            +  YPGR+YRFY G  V+ FG+GLSY+ YS++F++  P  ++  A LKA      L  T
Sbjct: 614 PATGYPGRTYRFYRGPTVFNFGYGLSYSKYSHRFVTKPPPSMSNVAGLKA------LATT 667

Query: 695 GSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQL 750
              +    ++ + S  C  L+F   + V N G +DG H V++F R P  + G+  P +QL
Sbjct: 668 AGGVATYDVEAIGSETCDRLKFPAVVRVQNHGPMDGKHPVLVFLRWPNATDGSGRPARQL 727

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           IGF  +H  A  +  + F V PC+  S A + GR+++  G+H +MVG+
Sbjct: 728 IGFQSLHLRATQTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGD 775


>gi|449465962|ref|XP_004150696.1| PREDICTED: probable beta-D-xylosidase 7-like [Cucumis sativus]
          Length = 783

 Score =  795 bits (2052), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 393/775 (50%), Positives = 520/775 (67%), Gaps = 34/775 (4%)

Query: 38  NKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
           ++P + C    P   + PFC T L I  RA+ L+S LTL EK+ QL +    IPRLGIPA
Sbjct: 34  SQPPYACDSSNPLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPRLGIPA 93

Query: 96  YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           YEWWSE+LHG+A+ G G+  NGT+++ TSFPQV+++AASF+ +LW  IG A+  EARA+Y
Sbjct: 94  YEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTEARAVY 153

Query: 156 NLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
           N GQA G+TFW PNINIFRDPRWGRGQETPGEDP++   Y+V +V+  QG+   + +GG 
Sbjct: 154 NAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGD---AIEGG- 209

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
                              G++L  SACCKH  AYDL++W   +RY F+A +T QD  DT
Sbjct: 210 -----------------KLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADT 252

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATI 333
           +QPPF SC+E+GKAS IMC+YN+VNGVP+C    L    AR +W F GYITSDCDAV+ I
Sbjct: 253 YQPPFESCVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSII 312

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
            + Q Y K  ED+ A VL+AGMD+NCGT +  HT+SA++  KV    IDRAL NLFSV++
Sbjct: 313 HDAQGYAKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRM 372

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           RLGLF+G+P K  +G++G D VC+ +H+ LAL AAR+GIVLLKN  K LPL+K+   SLA
Sbjct: 373 RLGLFDGNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLA 432

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
           +IG   N+   + G Y GIPC   +  +GL  YV  T Y  GC+   C ++A  ++AV+I
Sbjct: 433 VIGHNGNDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANC-TEATIYQAVKI 491

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           AK  D+V++V GLD TQE ED DR  L LPG+Q  L+  VA+ +KRPVILV+  GGP+D+
Sbjct: 492 AKSVDYVVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKRPVILVILSGGPVDI 551

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
           S A+ + +I SILW GYPG+AG  A+AEIIFGD NPGGRLP+TWYP  F K PM DM MR
Sbjct: 552 SSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRMR 611

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQ 692
           ADSS  YPGR+YRFY G +VY FG+GLSY+N+ Y+F S + S+L +S    +   KN   
Sbjct: 612 ADSSTGYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKN--- 668

Query: 693 QTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
                + Y  + E+    C S   +V + V N G++ G H V+LF +  K   G+P KQL
Sbjct: 669 --SDLVSYRLVSELDKKFCESKTVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQL 726

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           +GF +V   A   +EI F V PC+ +S A++ G  I+  G++ L+VG++ H L I
Sbjct: 727 VGFKKVEINAGERREIEFLVSPCDHISKASEEGLMIIEEGSYSLVVGDVEHPLDI 781


>gi|449508468|ref|XP_004163321.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 7-like
           [Cucumis sativus]
          Length = 783

 Score =  792 bits (2045), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 392/775 (50%), Positives = 519/775 (66%), Gaps = 34/775 (4%)

Query: 38  NKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
           ++P + C    P   + PFC T L I  RA+ L+S LTL EK+ QL +    IPRLGIPA
Sbjct: 34  SQPPYACDSSNPLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVPPIPRLGIPA 93

Query: 96  YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           YEWWSE+LHG+A+ G G+  NGT+++ TSFPQV+++AASF+ +LW  IG A+  EARA+Y
Sbjct: 94  YEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQAIGTEARAVY 153

Query: 156 NLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
           N GQA G+TFW PNINIFRDPRWGRGQETPGEDP++   Y+V +V+  QG+   + +GG 
Sbjct: 154 NAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGD---AIEGG- 209

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
                              G++L  SACCKH  AYDL++W   +RY F+A +T QD  DT
Sbjct: 210 -----------------KLGNQLKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADT 252

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATI 333
           +QPPF SC+E+GKAS IMC+YN+VNGVP+C    L    AR +W F GYITSDCDAV+ I
Sbjct: 253 YQPPFESCVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSII 312

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
            + Q Y K  ED+ A VL+AGMD+NCGT +  HT+SA++  KV    IDRAL NLFSV++
Sbjct: 313 HDAQGYAKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRM 372

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           RLGLF+G+P K  +G++G D VC+ +H+ LAL AAR+GIVLLKN  K LPL+K+   SLA
Sbjct: 373 RLGLFDGNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLA 432

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
           +IG   N+   + G Y GIPC   +  +GL  YV  T Y  GC+   C ++A  ++AV+I
Sbjct: 433 VIGHNGNDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANC-TEATIYQAVKI 491

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           AK  D+V++V GLD TQE ED DR  L LPG+Q  L+  VA+ +K PVILV+  GGP+D+
Sbjct: 492 AKSVDYVVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKXPVILVILSGGPVDI 551

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
           S A+ + +I SILW GYPG+AG  A+AEIIFGD NPGGRLP+TWYP  F K PM DM MR
Sbjct: 552 SSAKYNEKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIKFPMTDMRMR 611

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQ 692
           ADSS  YPGR+YRFY G +VY FG+GLSY+N+ Y+F S + S+L +S    +   KN   
Sbjct: 612 ADSSTGYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKN--- 668

Query: 693 QTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
                + Y  + E+    C S   +V + V N G++ G H V+LF +  K   G+P KQL
Sbjct: 669 --SDLVSYRLVSELDKKFCESKTVNVTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQL 726

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           +GF +V   A   +EI F V PC+ +S A++ G  I+  G++ L+VG++ H L I
Sbjct: 727 VGFKKVEINAGERREIEFLVSPCDHISKASEEGLMIIEEGSYSLVVGDVEHPLDI 781


>gi|212275712|ref|NP_001130324.1| uncharacterized protein LOC100191418 precursor [Zea mays]
 gi|194688848|gb|ACF78508.1| unknown [Zea mays]
 gi|413938927|gb|AFW73478.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 780

 Score =  785 bits (2028), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/771 (50%), Positives = 507/771 (65%), Gaps = 36/771 (4%)

Query: 36  VLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
           V ++P + C      + PFC+  L I  R   L+S +T+ EKI QL D + AIPRLG+PA
Sbjct: 29  VASEPPYTCGAGAPPNIPFCDAGLPIDRRVDDLVSRMTVAEKISQLGDQSPAIPRLGVPA 88

Query: 96  YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           Y+WWSE+LHGI++ G G++ +G + + TSFPQV+++AASFN  LW  IG  + VEARA+Y
Sbjct: 89  YKWWSEALHGISNQGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGVEARAVY 148

Query: 156 NLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
           N GQA GLTFWAPNIN+FRDPRWGRGQETPGEDP +   YA  FV+  QG          
Sbjct: 149 NNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQGYGLAGPVNST 208

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
           G                     L  SACCKH  AYDLE W   +RY F+A +T QD  DT
Sbjct: 209 G---------------------LEASACCKHFTAYDLENWKGVTRYVFDAKVTAQDLADT 247

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATI 333
           + PPF+SC+E G AS IMCSYN+VNGVP C   +L    AR +WGF GYITSDCDAVA I
Sbjct: 248 YNPPFKSCVEDGHASGIMCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAII 307

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
            + Q Y KT ED+ A VLKAGMD+NCG+ +  H  SA+ +GK+ E+DI+RAL NLF+V++
Sbjct: 308 HDAQGYAKTAEDAVADVLKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRM 367

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND--KKFLPLNKNAVSS 451
           RLGLFNGDPR+  YG +GPD VCT EH+ LAL+AA+ GIVLLKND     LPL+K  V+S
Sbjct: 368 RLGLFNGDPRRNLYGDIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVAS 427

Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAV 511
           LA+IG   N+  ++ G Y G PC   + L+ L+ YV  T + +GC+   CN      EAV
Sbjct: 428 LAVIGFNANDAIRLRGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTT-IPEAV 486

Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
           + A  AD V++  GLD  QE E+ DR+ L LPGQQ +L+ SVA  +K+PVILVL  GGP+
Sbjct: 487 QAASSADSVVLFMGLDQDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPV 546

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
           DVSFA+ + +I +ILW GYPGEAG  A+A+++FG+ NPGGRLP+TWYP+ FT+VPM DM 
Sbjct: 547 DVSFAKTNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMR 606

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
           MRAD +  YPGR+YRFY G  V+ FG+GLSY+ YS++F + P   +  A LKA      +
Sbjct: 607 MRADPATGYPGRTYRFYRGPTVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKA------V 660

Query: 692 QQTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PE 747
           + T   +    ++ + S  C  L+F   + V N G +DG H V++F R P  + G+  P 
Sbjct: 661 EATAGGMASYDVEAIGSETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPA 720

Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
            QLIGF  +H  A  +  + F V PC+  S A + GR+++  G+H +MVGE
Sbjct: 721 SQLIGFQSLHLRATQTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGE 771


>gi|225459350|ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
          Length = 774

 Score =  783 bits (2022), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 400/804 (49%), Positives = 527/804 (65%), Gaps = 38/804 (4%)

Query: 8   IQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRA 65
           ++L   +L  ++ + V++  +  + S P      F C    P   SY FC T+L I  R 
Sbjct: 1   MKLHSLLLINLIYVTVILVGVESTQSPP------FSCDSSNPSTKSYHFCKTTLPIPDRV 54

Query: 66  KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSF 125
           + L+S LTL EKI QL ++A AIPRLGIPAYEWWSE+LHG+A  GPG+ FNGT+ S TSF
Sbjct: 55  RDLVSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSF 114

Query: 126 PQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETP 184
           PQV+++AASF+  LW  IG A+ VEARA+YN GQ  G+TFWAPNINIFRDPRWGRGQETP
Sbjct: 115 PQVILTAASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETP 174

Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
           GEDP+V  +YAV +V+  QG+                  L+G      R  EL  SACCK
Sbjct: 175 GEDPLVTGSYAVSYVRGVQGD-----------------CLRGL----KRCGELQASACCK 213

Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
           H  AYDL+ W    R+ F+A +T QD  DT+QPPF  CIE+G+AS IMC+YN+VNGVP+C
Sbjct: 214 HFTAYDLDDWKGIDRFKFDARVTMQDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSC 273

Query: 305 LRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM 363
              +L    AR  W F+GYITSDCDAV+ I +   + KT ED+   VLKAGMD+NCGT +
Sbjct: 274 ADFNLLTNTARKRWNFQGYITSDCDAVSLIHDSYGFAKTPEDAVVDVLKAGMDVNCGTYL 333

Query: 364 LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKL 423
           L HT+SA+ + K+ E ++DRAL NLF+V++RLGLFNG+P+   YG +GP+ VC+ EH+ L
Sbjct: 334 LNHTKSAVMQKKLPESELDRALENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTL 393

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGL 483
           ALDAAR GIVLLKN ++ LPL K    SLA+IGP  N+   + G Y G PC   + L+ L
Sbjct: 394 ALDAARDGIVLLKNSQRLLPLPKGKTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQAL 453

Query: 484 EAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
           ++YV  T Y  GC  V C+S +   +AV IA+KAD+V++V GLD TQE E  DR+ L+LP
Sbjct: 454 QSYVKSTMYHPGCDAVACSSPS-IEKAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLP 512

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
           G+Q  L+  VA  +K+PV+LVL  GGP+D+SFA+  + I SILW GYPG AG  A+AE I
Sbjct: 513 GKQQQLIICVANAAKKPVVLVLLSGGPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETI 572

Query: 604 FGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           FGD NPGGRLP+TWYP+ FTK+PM DM MR +S+  YPGR+YRFYTG +V+ FG+GLSY+
Sbjct: 573 FGDHNPGGRLPVTWYPQDFTKIPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGYGLSYS 632

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVT 721
            YS + +               S  ++ + T S + Y  + E+    C S    + I V 
Sbjct: 633 TYSCETIPVTRN---KLYFNQSSTAHVYENTDS-IRYTSVAELGKELCDSNNISISIRVR 688

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G++ G H V+LF R  K S G+P KQL+ F  VH     S ++ F ++PCE  S  NK
Sbjct: 689 NDGEMAGKHSVLLFVRRLKASAGSPIKQLVAFQSVHLNGGESADVGFLLNPCEHFSGPNK 748

Query: 782 HGRRILPLGNHVLMVGELRHSLTI 805
            G  ++  G H L+VG+  H +T+
Sbjct: 749 DGLMVIEEGTHFLVVGDQEHPVTV 772


>gi|293336530|ref|NP_001167905.1| uncharacterized protein LOC100381616 [Zea mays]
 gi|223944757|gb|ACN26462.1| unknown [Zea mays]
          Length = 630

 Score =  783 bits (2022), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/658 (56%), Positives = 477/658 (72%), Gaps = 34/658 (5%)

Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
           M+N GQAGLT+WAPNINIFRDPRWGRGQET GEDP V +AY++E+VK FQ          
Sbjct: 1   MHNAGQAGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQ---------- 50

Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
                           E      + LSACCKH  AYD+EKW  FSRY+FNA +  QD ED
Sbjct: 51  ---------------GEEGEEGRIRLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLED 95

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATI 333
           T+QPPF++CI++ +ASC+MC+YNQVNGVP C   DL QK R+EWGF+GYITSDCDAVA I
Sbjct: 96  TYQPPFKTCIQEARASCLMCAYNQVNGVPMCAHKDLLQKTRDEWGFQGYITSDCDAVAII 155

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
            E Q YTK+ EDS A VLKAGMDINCG+ ++RHT+SAI+KGK+QE+DIDRAL NLFSVQL
Sbjct: 156 HENQTYTKSGEDSIAIVLKAGMDINCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQL 215

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           RLG+F+       + +LGP+ VCT EH++LA +A RQG VLLKND  FLPL ++ V  +A
Sbjct: 216 RLGIFDKPSNNQWFSQLGPNSVCTKEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVA 275

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
           IIGP  N+   MGG YTG+PC+P + L+G++AY ++T +A GC D  CNS   F EAV  
Sbjct: 276 IIGPSANDAYAMGGDYTGVPCNPTTFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEA 335

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           AK+AD V+V+AGL+LT+E ED DRVSLLLPG+QM L+ ++A  +K+P++LVL GGGP+DV
Sbjct: 336 AKRADIVVVIAGLNLTEEREDFDRVSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDV 395

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
           SFA+ D +I+SILW+GYPGE G + L EI+FG++NPGG+LP+TWYPESFT +PM DMNMR
Sbjct: 396 SFAKQDPRIASILWLGYPGEVGGQVLPEILFGEYNPGGKLPITWYPESFTAIPMTDMNMR 455

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
           AD SR YPGR+YRFYTG  VYGFG+GLSY+ YSY   SAP ++T+S S    SD  I+ +
Sbjct: 456 ADPSRGYPGRTYRFYTGDVVYGFGYGLSYSKYSYSISSAPKKITVSRS----SDLGIISR 511

Query: 694 TGSR-----LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
             +      L  V  +++ SC +L F V ++V+N G +DGSH V+LFAR      G P K
Sbjct: 512 KPAYTRRDGLGSVKTEDIASCEALVFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIK 571

Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           QL+GF+ VHT A  +  +   VDPC+Q+S AN  G+R+L LG HVL VG+    L+IE
Sbjct: 572 QLVGFESVHTAAGSASNVEITVDPCKQMSAANPEGKRVLLLGAHVLTVGDEEFELSIE 629


>gi|371917284|dbj|BAL44718.1| SlArf/Xyl3 [Solanum lycopersicum]
          Length = 777

 Score =  782 bits (2019), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/794 (48%), Positives = 518/794 (65%), Gaps = 34/794 (4%)

Query: 18  VLQLIVVVNVIAFSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQ 75
            +  I V+ V+ F  ++    KP F C    P+  SYPFCN +L I  R   L+S LT+ 
Sbjct: 10  TVMFIFVILVLLFRRTEST--KPPFSCDSSNPNTSSYPFCNAALPIPQRVNDLVSRLTVD 67

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           EKI QL + A  IPRLGI AYEWWSE LHGI+ +G G  FNGT+ + T FPQ++++A+SF
Sbjct: 68  EKILQLVNGAPEIPRLGISAYEWWSEGLHGISRHGKGTLFNGTIKAATQFPQIILTASSF 127

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQ-AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
           + +LW  I  A+  EARA+YN GQ  G+T WAPNINI RDPRWGRGQETPGEDPM+V  Y
Sbjct: 128 DENLWYRIAQAIGREARAVYNAGQLKGITLWAPNINILRDPRWGRGQETPGEDPMMVGKY 187

Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
            V +V+  QG+++   +GG                   +   L  SACCKH IA D++ W
Sbjct: 188 GVAYVRGLQGDSF---EGG-----------------KLKDGHLQTSACCKHFIAQDMDNW 227

Query: 255 GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-A 313
            NFSRY+F+A + +QD  D+++PPF+ C+EQGKAS +MC+YN VNG+P C   DL    A
Sbjct: 228 HNFSRYTFDAQVLKQDLADSYEPPFKDCVEQGKASSVMCAYNLVNGIPNCANFDLLTTTA 287

Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
           R +WG +GYI SDCDAV  ++  Q+Y K  ED+ A  LKAGMD+NCG+ +  +T+SA++K
Sbjct: 288 RGKWGLQGYIVSDCDAVDKMYSEQHYAKEPEDAVAATLKAGMDVNCGSHLKTYTKSALEK 347

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
            KV+E DIDRAL NLFSV++RLGLFNGDP K +YG +   +VC+ EH+ LA++AAR G V
Sbjct: 348 QKVKESDIDRALHNLFSVRMRLGLFNGDPSKLEYGDISAAEVCSEEHRALAVEAARSGSV 407

Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYA 493
           LLKN  + LPL+K   +SLA+IGP  N+   + G Y G  C   +L +GL+ YV+ T Y 
Sbjct: 408 LLKNSNRLLPLSKMKTASLAVIGPKANDSEVLLGNYEGFSCKNVTLFQGLQGYVANTMYH 467

Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
            GC  + C S A   EAV IAKKAD+V++V GLD T E E  DR  L LPG Q  L+TS+
Sbjct: 468 PGCDFINCTSPA-IDEAVNIAKKADYVVLVMGLDQTLEREKFDRTELGLPGMQEKLITSI 526

Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
           A  + +PVILVL  GGP+DV+FA+ + +I  ILW+GYPGE GA ALA+I+FG+ NPGGR 
Sbjct: 527 AEAASKPVILVLMCGGPVDVTFAKDNPKIGGILWVGYPGEGGAAALAQILFGEHNPGGRS 586

Query: 614 PMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
           P+TWYP+ F KV MNDM MR +SS  YPGR+YRFY G +V+ FG+GLSYTNYSY F S  
Sbjct: 587 PVTWYPKEFNKVAMNDMRMRPESSSGYPGRTYRFYNGPKVFEFGYGLSYTNYSYTFASVS 646

Query: 674 SELTISASLKAGSDKNILQQTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHV 731
               +  + K     N   + GS L+ + + +V    C S    V+++V N G++ G H 
Sbjct: 647 KNQLLFKNPKI----NQSTEKGSVLN-IAVSDVGPEVCNSAMITVKVAVKNQGEMAGKHP 701

Query: 732 VMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGN 791
           V+LF +        P+K LIGF  V+  A  + +++F V PCE  + AN+ G  ++  G 
Sbjct: 702 VLLFLKHSSTVDEVPKKTLIGFKSVNLEAGANTQVTFDVKPCEHFTRANRDGTLVIDEGK 761

Query: 792 HVLMVGELRHSLTI 805
           H L++G+  + + +
Sbjct: 762 HFLLLGDQEYPIPV 775


>gi|18025342|gb|AAK38482.1| beta-D-xylosidase [Hordeum vulgare]
          Length = 777

 Score =  780 bits (2014), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 391/775 (50%), Positives = 505/775 (65%), Gaps = 32/775 (4%)

Query: 36  VLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
           V   P F C  P   S  FC+  L I  RA  L+S LTL+EKI QL D + A+ RLG+PA
Sbjct: 28  VAADPPFSCGAP--SSAAFCDRRLPIEQRAADLVSKLTLEEKISQLGDESPAVDRLGVPA 85

Query: 96  YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           Y+WWSE+LHG+A+ G GV+ +G + + TSFPQV+++AASFN  LW  IG  +  EAR +Y
Sbjct: 86  YKWWSEALHGVANAGRGVHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARGVY 145

Query: 156 NLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
           N GQA GLTFWAPNIN+FRDPRWGRGQETPGEDP +   YA  FV+  QG          
Sbjct: 146 NNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQG---------- 195

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
            +G      + G    SD    L  SACCKH  AYDLE W   +R++F+A +TEQD  DT
Sbjct: 196 -YG------MSGAINSSD----LEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADT 244

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATI 333
           + PPF+SC+E G AS IMCSYN+VNGVP C   +L  K AR +W F GYITSDCDAVA I
Sbjct: 245 YNPPFKSCVEDGGASGIMCSYNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAII 304

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
            + Q Y K  ED+ A VLKAGMD+NCG  +  H  SA  +GK+  +DIDRAL NLF++++
Sbjct: 305 HDVQGYAKAPEDAVADVLKAGMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRM 364

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           RLGLF+G+P+  +YG +G D VC+ EH+ LAL AAR GIVLLKND   LPL+K+ VSSLA
Sbjct: 365 RLGLFDGNPKYNRYGNIGADQVCSKEHQDLALQAARDGIVLLKNDGAALPLSKSKVSSLA 424

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
           +IGP  NN S + G Y G PC   + L+ L+ YV    +  GC+   CN  +   EAV  
Sbjct: 425 VIGPNGNNASLLLGNYFGPPCISVTPLQALQGYVKDARFVQGCNAAVCNV-SNIGEAVHA 483

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           A  AD+V++  GLD  QE E+ DR+ L LPG Q SLV SVA  +K+PVILVL  GGP+DV
Sbjct: 484 AGSADYVVLFMGLDQNQEREEVDRLELGLPGMQESLVNSVADAAKKPVILVLLCGGPVDV 543

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
           +FA+ + +I +I+W GYPG+AG  A+A+++FGD NPGGRLP+TWYP+ FT VPM DM MR
Sbjct: 544 TFAKNNPKIGAIVWAGYPGQAGGIAIAQVLFGDHNPGGRLPVTWYPKEFTAVPMTDMRMR 603

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS---APSELTISASLKAGSDKNI 690
           AD S  YPGR+YRFY G  VY FG+GLSY+ YS++F S    P  ++    LKA +  + 
Sbjct: 604 ADPSTGYPGRTYRFYKGKTVYNFGYGLSYSKYSHRFASKGTKPPSMSGIEGLKATARASA 663

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
                  ++ +  +   +C  LRF   + V N G +DG H+V+LF R P  + G P  QL
Sbjct: 664 AGTVSYDVEEMGAE---ACDRLRFPAVVRVQNHGPMDGGHLVLLFLRWPNATDGRPASQL 720

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           IGF  VH  A  +  + F V PC+ LS A + GR+++  G+H + VG+    L+ 
Sbjct: 721 IGFQSVHLRADEAAHVEFEVSPCKHLSRAAEDGRKVIDQGSHFVRVGDDEFELSF 775


>gi|413925162|gb|AFW65094.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 774

 Score =  777 bits (2007), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/776 (48%), Positives = 504/776 (64%), Gaps = 32/776 (4%)

Query: 36  VLNKPDFPCKPPHFDS---YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLG 92
           V   P F C P   ++     FC+ +L+ + RA  L+S LT  EKI QL D A  +PRLG
Sbjct: 25  VAGDPPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLG 84

Query: 93  IPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR 152
           +P Y+WW+E+LHG+A++G G++F+  V + TSFPQVL++AA+F+  LW  IG A+  EAR
Sbjct: 85  VPGYKWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREAR 144

Query: 153 AMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
           A++N+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V S YAV FV+  QG +  S  
Sbjct: 145 ALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSS-- 202

Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
                                    L  SACCKH  AYDLE W   +RYSF A +TEQD 
Sbjct: 203 ------------------------LLQTSACCKHATAYDLEDWNGVARYSFVARVTEQDL 238

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAV 330
           EDTF PPFRSC+ + KASC+MC+Y  +NGVPAC   DL     R +WG  GY+ SDCDAV
Sbjct: 239 EDTFNPPFRSCVVEAKASCVMCAYTAINGVPACANSDLLTGTVRGDWGLDGYVASDCDAV 298

Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
           A + + Q Y  T ED+ A  LKAG+DI+CG+ + +H  +AI +GK+ E+DID+AL NL++
Sbjct: 299 AIMRDAQRYAPTPEDAVAVSLKAGLDIDCGSYVQQHAAAAIQQGKLTEQDIDKALTNLYA 358

Query: 391 VQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVS 450
           V++RLG F+GDPRK  YG LG  D+CT EH+ LAL+AA+ GIVLLKND   LPL+++ V+
Sbjct: 359 VRMRLGHFDGDPRKNMYGVLGAADICTPEHRNLALEAAQDGIVLLKNDGGILPLDRSTVT 418

Query: 451 SLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEA 510
           S A+IGP  N+   +   Y G PC   + L+GL++YV+   + +GC+   C+  A   +A
Sbjct: 419 SAAVIGPNANDGMALIANYFGPPCESTTPLKGLQSYVNDVRFLAGCNSAACDV-AATDQA 477

Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           V +A   D+V +  GL   QE+E +DR SLLLPG Q SL+T+VA  SKRPVILVL  GGP
Sbjct: 478 VALAGSEDYVFLFMGLSQKQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGP 537

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDM 630
           +D++FA+++ +I +ILW GYPG+AG  A+A+++FGD NP GRLP+TWYPE FTKVPM DM
Sbjct: 538 VDITFAQSNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDM 597

Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
            MRAD +  YPGRSYRFY G  VY FG+GLSY+ +S + +   S   +S++L  G  + +
Sbjct: 598 RMRADPTSGYPGRSYRFYQGNTVYKFGYGLSYSTFSRRLVHGTSVPALSSTLLTGLRETM 657

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
             Q G R  +V       C  L+F   + V N G +DG H V++F R P   QG P  QL
Sbjct: 658 TPQDGDRSYHVDAIGTEGCEQLKFPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQL 717

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           IGF   H  A  + ++ F + PC+  S     GR+++ +G+H LMV      +  E
Sbjct: 718 IGFRSQHLKAGETAKLRFDISPCKHFSRVRADGRKVIDIGSHFLMVDNHEMEIRFE 773


>gi|357489431|ref|XP_003615003.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516338|gb|AES97961.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 780

 Score =  775 bits (2002), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/776 (49%), Positives = 519/776 (66%), Gaps = 35/776 (4%)

Query: 38  NKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
           N P + C    P   S+PFCN +L+I+ RAK ++S LTL EKI QL + A AIPRLGIP+
Sbjct: 30  NVPPYSCDTSNPLTKSFPFCNLNLTITQRAKDIVSRLTLDEKISQLVNTAPAIPRLGIPS 89

Query: 96  YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           Y+WW+E+LHG++  G G+  NG++++ TSFPQ+++ AASF+  LW  I   +  EAR +Y
Sbjct: 90  YQWWNEALHGVSYVGKGIRLNGSITAATSFPQIILIAASFDPKLWYRISKVIGTEARGVY 149

Query: 156 NLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
           N GQA G+TFWAPNINIFRDPRWGRGQET GEDP+V S Y V +V+  QG+++   +GG 
Sbjct: 150 NAGQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSF---EGGK 206

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
             G R                 L  SACCKH  AYDLE W   +RY F+A +T QD  DT
Sbjct: 207 LIGGR-----------------LKASACCKHFTAYDLENWKGVNRYVFDAKVTLQDLADT 249

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATI 333
           +QP F SC+ QG++S IMC+YN+VNGVP C   +L    AR +W F GYI SDCDAV  I
Sbjct: 250 YQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLLTNTARKKWNFNGYIASDCDAVRFI 309

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
           +E Q Y KT ED  A VL+AGMD+ CG  M +H +SA+ + K+    IDRAL NLF++++
Sbjct: 310 YEKQGYAKTPEDVVADVLRAGMDVECGNYMTKHAKSAVLQKKIPISQIDRALHNLFTIRI 369

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           RLGLF+G+P K +YG++GP+ VC+ E+  LAL+AAR GIVLLKN    LPL +  V++L 
Sbjct: 370 RLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTASILPLPR--VNTLG 427

Query: 454 IIGPLVNNIS-QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHD-VPCNSDAGFHEAV 511
           +IGP  N  S  + G Y G PC   S+L+G   Y S+THY SGC D V C S A    AV
Sbjct: 428 VIGPNANKSSIVLLGNYFGQPCKQVSILKGFYTYASQTHYRSGCTDGVKCAS-AEIDRAV 486

Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
            +AK +D+VI+V GLD +QETE  DR  L LPG+Q  L+ SVA+ SK+PVILV+  GGP+
Sbjct: 487 EVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQKLINSVAKASKKPVILVILCGGPV 546

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
           D++FA+ + +I  I+W GYPGE G +ALA+++FGD+NPGGRLPMTWYP+ F K+PM DM 
Sbjct: 547 DITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWYPKDFIKIPMTDMR 606

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
           MRAD S  YPGR+YRFYTG +VY FG+GLSY+NYSY F+S  +    +  +   +  +IL
Sbjct: 607 MRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFISVKNN---NIHINQSTTHSIL 663

Query: 692 QQTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
           + + + + Y  + E+   +C ++   V + +TN G + G H V+LF +  K   G P KQ
Sbjct: 664 ENSET-IRYKLVSELGKKACKTMSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQ 722

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           L+GF+ V     G  E+ F V  CE LS AN+ G +++  G ++ +VGEL +S+ I
Sbjct: 723 LVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGYLFLVGELEYSINI 778


>gi|218191593|gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
          Length = 774

 Score =  775 bits (2001), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/765 (49%), Positives = 508/765 (66%), Gaps = 33/765 (4%)

Query: 40  PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           P F C  P   S  FCN  L I  RA  L+S LTL+EKI QL D + A+ RLG+PAY+WW
Sbjct: 28  PPFSCGAP--SSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKWW 85

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE+LHG+++ G G++ +G + + TSFPQV+++AASFN  LW  IG  +  EARA+YN GQ
Sbjct: 86  SEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNGQ 145

Query: 160 A-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           A GLTFWAPNIN+FRDPRWGRGQETPGEDP V   YA  FV+  QG              
Sbjct: 146 AEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQG-------------- 191

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                L G    +D    L  SACCKH  AYDLE W   +RY+F+A +T QD  DT+ PP
Sbjct: 192 ---YALAGAINSTD----LEASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPP 244

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQ 337
           FRSC+E G AS IMCSYN+VNGVP C   +L  K AR +W F GYITSDCDAV+ I + Q
Sbjct: 245 FRSCVEDGGASGIMCSYNRVNGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQ 304

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
            Y KT ED+ A VLKAGMD+NCG+ +  H  SAI +GK+ E+DI+RAL NLF+V++RLGL
Sbjct: 305 GYAKTAEDAVADVLKAGMDVNCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGL 364

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           FNG+P+  +YG +GPD VCT EH+ LAL+AA+ G+VLLKND   LPL+K+ VSS+A+IG 
Sbjct: 365 FNGNPKYNRYGNIGPDQVCTQEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGH 424

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
             N+ +++ G Y G PC   + L+ L+ YV  T + +GC+   CN  +   EA ++A   
Sbjct: 425 NANDATRLLGNYFGPPCISVTPLQVLQGYVKDTRFLAGCNSAACNVSS-IGEAAQLASSV 483

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D+V++  GLD  QE E+ DR+ L LPG Q +L+ +VA  +K+PVILVL  GGP+DV+FA+
Sbjct: 484 DYVVLFMGLDQDQEREEVDRLELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAK 543

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
            + +I +ILW GYPGEAG  A+A+++FG+ NPGGRLP+TWYP+ FT VPM DM MRAD S
Sbjct: 544 YNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPKEFTSVPMTDMRMRADPS 603

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
             YPGR+YRFY G  VY FG+GLSY+ YS+ F++  ++L   +S+     K +       
Sbjct: 604 TGYPGRTYRFYRGNTVYKFGYGLSYSKYSHHFVANGTKLPSLSSIDG--LKAMATAAAGT 661

Query: 698 LDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK--VSQGTPEKQLIGF 753
           + Y  ++E+   +C  L+F   + V N G +DG H V+LF R P      G P  QLIGF
Sbjct: 662 VSY-DVEEIGTETCDKLKFPALVRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGF 720

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
             +H  +  +  + F V PC+  S A + G++++  G+H +MVG+
Sbjct: 721 QSLHLKSMQTVHVEFEVSPCKHFSRATEDGKKVIDHGSHFMMVGD 765


>gi|115448721|ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
 gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|46390225|dbj|BAD15656.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa Japonica Group]
 gi|125583710|gb|EAZ24641.1| hypothetical protein OsJ_08409 [Oryza sativa Japonica Group]
          Length = 780

 Score =  775 bits (2001), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/765 (49%), Positives = 508/765 (66%), Gaps = 33/765 (4%)

Query: 40  PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           P F C  P   S  FCN  L I  RA  L+S LTL+EKI QL D + A+ RLG+PAY+WW
Sbjct: 34  PPFSCGAP--SSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKWW 91

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE+LHG+++ G G++ +G + + TSFPQV+++AASFN  LW  IG  +  EARA+YN GQ
Sbjct: 92  SEALHGVSNAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNGQ 151

Query: 160 A-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           A GLTFWAPNIN+FRDPRWGRGQETPGEDP V   YA  FV+  QG              
Sbjct: 152 AEGLTFWAPNINVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQG-------------- 197

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                L G    +D    L  SACCKH  AYDLE W   +RY+F+A +T QD  DT+ PP
Sbjct: 198 ---YALAGAINSTD----LEASACCKHFTAYDLENWKGVTRYAFDAKVTAQDLADTYNPP 250

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQ 337
           FRSC+E G AS IMCSYN+VNGVP C   +L  K AR +W F GYITSDCDAV+ I + Q
Sbjct: 251 FRSCVEDGGASGIMCSYNRVNGVPTCADYNLLSKTARGDWRFYGYITSDCDAVSIIHDVQ 310

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
            Y KT ED+ A VLKAGMD+NCG+ +  H  SAI +GK+ E+DI+RAL NLF+V++RLGL
Sbjct: 311 GYAKTAEDAVADVLKAGMDVNCGSYVQEHGLSAIQQGKITEQDINRALHNLFAVRMRLGL 370

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           FNG+P+  +YG +GPD VCT EH+ LAL+AA+ G+VLLKND   LPL+K+ VSS+A+IG 
Sbjct: 371 FNGNPKYNRYGNIGPDQVCTQEHQNLALEAAQHGVVLLKNDANALPLSKSQVSSIAVIGH 430

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
             N+ +++ G Y G PC   + L+ L+ YV  T + +GC+   CN  +   EA ++A   
Sbjct: 431 NANDATRLLGNYFGPPCISVTPLQVLQGYVKDTRFLAGCNSAACNVSS-IGEAAQLASSV 489

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D+V++  GLD  QE E+ DR+ L LPG Q +L+ +VA  +K+PVILVL  GGP+DV+FA+
Sbjct: 490 DYVVLFMGLDQDQEREEVDRLELSLPGMQENLINTVANAAKKPVILVLLCGGPVDVTFAK 549

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
            + +I +ILW GYPGEAG  A+A+++FG+ NPGGRLP+TWYP+ FT VPM DM MRAD S
Sbjct: 550 YNPKIGAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPKEFTSVPMTDMRMRADPS 609

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
             YPGR+YRFY G  VY FG+GLSY+ YS+ F++  ++L   +S+     K +       
Sbjct: 610 TGYPGRTYRFYRGNTVYKFGYGLSYSKYSHHFVANGTKLPSLSSIDG--LKAMATAAAGT 667

Query: 698 LDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK--VSQGTPEKQLIGF 753
           + Y  ++E+   +C  L+F   + V N G +DG H V+LF R P      G P  QLIGF
Sbjct: 668 VSY-DVEEIGPETCDKLKFPALVRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGF 726

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
             +H  +  +  + F V PC+  S A + G++++  G+H +MVG+
Sbjct: 727 QSLHLKSMQTVHVEFEVSPCKHFSRATEDGKKVIDHGSHFMMVGD 771


>gi|356515806|ref|XP_003526589.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 772

 Score =  773 bits (1995), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 392/789 (49%), Positives = 513/789 (65%), Gaps = 42/789 (5%)

Query: 27  VIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN 84
            +   +SKP   +  F C    P   SYPFCN  L I  R K L+S LTL EK+ QL + 
Sbjct: 14  TVTVQSSKP---EAPFACDFSNPSSRSYPFCNPKLPIPQRTKDLLSRLTLDEKLSQLVNT 70

Query: 85  ASAIPRLGIPAYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSN 142
           A  IPRLGIPAY+WWSE+LHG++  GPG+ F  N T+SS TSFPQV+++AASF+  LW  
Sbjct: 71  APPIPRLGIPAYQWWSEALHGVSGVGPGILFDNNSTISSATSFPQVILTAASFDSRLWYR 130

Query: 143 IGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKS 201
           IG A+ +EARA++N GQA GLTFWAPNINIFRDPRWGRGQET GEDP++ S YAV FV+ 
Sbjct: 131 IGHAIGIEARAIFNAGQANGLTFWAPNINIFRDPRWGRGQETAGEDPLLTSRYAVSFVRG 190

Query: 202 FQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYS 261
            QG+++K                         G  L+ SACCKH  AYDL+ W    R+ 
Sbjct: 191 LQGDSFK-------------------------GAHLLASACCKHFTAYDLDNWKGVDRFV 225

Query: 262 FNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFK 320
           F+A ++ QD  DT+QPPF+SC++QG+AS IMC+YN+VNGVP C   G L Q ARN+W F 
Sbjct: 226 FDARVSLQDLADTYQPPFQSCVQQGRASGIMCAYNRVNGVPNCADYGLLTQTARNQWDFN 285

Query: 321 GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKD 380
           GYITSDC AV  I + Q Y K+ ED  A VL+AGMD+ CG+ +  H +SA+ + K+   +
Sbjct: 286 GYITSDCGAVGFIHDRQRYAKSPEDVVADVLRAGMDLECGSYLTYHAKSAVLQKKLGMSE 345

Query: 381 IDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
           IDRAL NLFS+++RLGLF+G+P +  +G +G + VC+ EH+ LAL+AAR GIVLLKN   
Sbjct: 346 IDRALQNLFSIRMRLGLFDGNPTRLSFGLIGSNHVCSKEHQYLALEAARNGIVLLKNSPT 405

Query: 441 FLPLNKNAVS-SLAIIGPLVNNIS-QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHD 498
            LPL K + S SLA+IGP  N+    + G Y G PC   ++L+G   YV    Y  GC  
Sbjct: 406 LLPLPKTSPSISLAVIGPNANSSPLTLLGNYAGPPCKYVTILQGFRHYVKNAFYHPGCDG 465

Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK 558
            P  S A   +AV +AKK D+V++V GLD ++E E+RDRV L LPG+Q+ L+  VA  SK
Sbjct: 466 GPKCSSAQIDQAVEVAKKVDYVVLVMGLDQSEEREERDRVHLDLPGKQLELINGVAEASK 525

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
           +PVILVL  GGPLD++ A+ + +I  ILW GYPGE G  ALA+IIFGD NPGGRLP TWY
Sbjct: 526 KPVILVLLSGGPLDITSAKYNHKIGGILWAGYPGELGGIALAQIIFGDHNPGGRLPTTWY 585

Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
           P+ + KVPM DM MRAD S  YPGR+YRFY G +VY FG+GLSY+ YSY+F+S    +T 
Sbjct: 586 PKDYIKVPMTDMRMRADPSTGYPGRTYRFYKGPKVYEFGYGLSYSKYSYEFVS----VTH 641

Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                  S  +++ +    + Y  + E+   +C S+   V + V N G + G H V+LF 
Sbjct: 642 DKLHFNQSSTHLMVENSETISYKLVSELDEQTCQSMSLSVTVRVQNHGSMVGKHPVLLFI 701

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           R  +   G+P KQL+GF+ V   A     + F V PCE LS AN+ G  I+  G+H+L+V
Sbjct: 702 RPKRQKSGSPVKQLVGFESVMLDAGEMAHVEFEVSPCEHLSRANEAGAMIIEEGSHMLLV 761

Query: 797 GELRHSLTI 805
            +L H + I
Sbjct: 762 DDLEHPIDI 770


>gi|302786474|ref|XP_002975008.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
 gi|300157167|gb|EFJ23793.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
          Length = 772

 Score =  771 bits (1990), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/769 (48%), Positives = 508/769 (66%), Gaps = 32/769 (4%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C      S+PFC+ SL +  R   L+  + L EKI Q+  NAS IPRLGIP Y+WW E+L
Sbjct: 25  CTDSRSSSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEAL 84

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
           HG+A + PGV F   V S TSFPQV+++ ASFN SLW+ I  A+++EA AMYN G++GLT
Sbjct: 85  HGVAES-PGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSGLT 143

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+PNINIFRDPRWGRGQETPGEDP++ S YA  FV+  Q  ++   D G      ++R 
Sbjct: 144 FWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDY---DEGTAISTMQRRP 200

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
            +           L +S+CCKH  AYD+EK      + FNA +T QD +DTF PPFRSCI
Sbjct: 201 TR-----------LKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCI 249

Query: 284 EQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
             G+AS +MCSYN+VNGVP+C     L +  RN WGF+GYI SDCDAVA ++EY NYT T
Sbjct: 250 VDGQASGLMCSYNRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINYTTT 309

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
            ED+ A VL AGMD+NCGT +LRHT +AI++GKV E  +DRAL N+ +V++RLGLF+G+ 
Sbjct: 310 AEDAVADVLSAGMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGNS 369

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
            +  Y  +GPD VCT EH++L+L+AA QGIVLLKN    LP  +N + ++A+IGP  N  
Sbjct: 370 GE-TYNSIGPDAVCTREHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNAT 428

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
             M G Y G+PC   +  +GL+ Y     +  GC D+ CN    F  AVR A+ +D V++
Sbjct: 429 ETMLGNYAGVPCQYITPFQGLQEYTKGVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVI 488

Query: 523 VAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           V GLD  QE E  DR SLLLPG Q  LV  V++ +K PVILV+  GGP+DV+FA+ + +I
Sbjct: 489 VVGLDKDQEREGLDRTSLLLPGYQQDLVLEVSKVAKGPVILVVMSGGPIDVTFAKGNCKI 548

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYP 641
           SS+LW+GYPGEAG KA+A +IFGD NP GRLPMTWYP++F + V + +M++R ++S  +P
Sbjct: 549 SSVLWVGYPGEAGGKAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFP 608

Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR---L 698
           GR+YRFYTG  VY FGHGLSYTN++Y   SAPS +T   ++     +  L++ G+R   +
Sbjct: 609 GRTYRFYTGENVYEFGHGLSYTNFTYTNFSAPSNITARNTVAI---RTPLREDGARHFPI 665

Query: 699 DYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQLIGFDRV 756
           DY      T C +L F V   ++N G  D  H+ +L+A  P  S     P KQLI F R 
Sbjct: 666 DY------TGCEALAFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQ 719

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           H +A    ++ F VD C+ L + N+ G ++L  G++ L +G++ H +++
Sbjct: 720 HLIAGRCAKVEFDVDTCKDLGLTNEAGTKVLVHGDYKLSLGDIEHVISL 768


>gi|253761872|ref|XP_002489310.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
 gi|241946958|gb|EES20103.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
          Length = 772

 Score =  770 bits (1988), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/776 (48%), Positives = 507/776 (65%), Gaps = 41/776 (5%)

Query: 40  PDFPCKPPHFDS---YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           P F C P   ++     FC+ +LS + RA  L+S LT  EKI QL D A+ +PRLG+P Y
Sbjct: 26  PPFSCGPTSAEASEGLAFCDVTLSPAQRAADLVSRLTPAEKIAQLGDQATGVPRLGVPGY 85

Query: 97  EWWSESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
           +WW+E+LHG+A++G G++F+  G V + TSFPQVL++AA+F+  LW  IG A+  EARA+
Sbjct: 86  KWWNEALHGLATSGKGLHFDVVGGVRAATSFPQVLLTAAAFDDDLWFRIGQAIGREARAL 145

Query: 155 YNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
           +N+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V S YAV FV+  QG +  S    
Sbjct: 146 FNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSS---- 201

Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
                                  L  SACCKH  AYDLE W   +RYSF A +T QD ED
Sbjct: 202 ----------------------LLQTSACCKHATAYDLEDWNGVARYSFVARVTAQDLED 239

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVAT 332
           TF PPFRSC+ +GKASCIMC+Y  +NGVPAC   DL     R +WG  GY+ SDCDAVA 
Sbjct: 240 TFNPPFRSCVVEGKASCIMCAYTAINGVPACANTDLLTGTVRGDWGLDGYVASDCDAVAI 299

Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
           + + Q Y  T ED+ A  LKAG+DI+CG+ + +H  +AI +GK+ E DID+AL+NLF+V+
Sbjct: 300 MRDAQRYAPTPEDAVAVSLKAGLDIDCGSYIQQHATAAIQQGKLTELDIDKALVNLFAVR 359

Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
           +RLG F+GDPRK  YG L   D+CT EH+ LAL+AA+ GIVLLKND   LPL+++ V+S 
Sbjct: 360 MRLGHFDGDPRKNMYGALSAADICTPEHRSLALEAAQDGIVLLKNDGGILPLDRSTVTSA 419

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
           A+IGP  N+   +   Y G PC   + L+GL++YV+   + +GC    C+  A   +AV 
Sbjct: 420 AVIGPNSNDGMALIANYFGPPCESTTPLQGLQSYVNNVRFLAGCSSAACDV-AVTDQAVV 478

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
           ++   D+V +  GL   QE+E +DR SLLLPG Q SL+T+VA  SKRPVILVL  GGP+D
Sbjct: 479 LSGSEDYVFLFMGLSQQQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVD 538

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
           ++FA+++ +I +ILW GYPG+AG  A+A+++FGD NP GRLPMTWYPE FTKVPM DM M
Sbjct: 539 ITFAQSNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPMTWYPEDFTKVPMTDMRM 598

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           RAD +  YPGRSYRFY G  VY FG+GLSY+ +S + L   S   +S+++ AG  + + +
Sbjct: 599 RADPTSGYPGRSYRFYQGNAVYKFGYGLSYSTFSSRLLYGTSMPALSSTVLAGLRETVTE 658

Query: 693 QTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
           + G R    HID++ +  C  L+F   + V N G +DG H  ++F R P  + G P  QL
Sbjct: 659 E-GDR--SYHIDDIGTDGCEQLKFPAMVEVQNHGPMDGKHSALMFLRWPNTNGGRPASQL 715

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           IGF   H  A  +  + F + PCE  S     G +++ +G+H L V    H++ I 
Sbjct: 716 IGFMSQHLKAGETANLRFDISPCEHFSRVRADGMKVIDIGSHFLTVDN--HAIEIR 769


>gi|15218202|ref|NP_177929.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
 gi|259585708|sp|Q9SGZ5.2|BXL7_ARATH RecName: Full=Probable beta-D-xylosidase 7; Short=AtBXL7; Flags:
           Precursor
 gi|18086336|gb|AAL57631.1| At1g78060/F28K19_32 [Arabidopsis thaliana]
 gi|332197942|gb|AEE36063.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
          Length = 767

 Score =  770 bits (1987), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 393/776 (50%), Positives = 516/776 (66%), Gaps = 42/776 (5%)

Query: 40  PDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P   C P  P    Y FC T L I  RA+ L+S LT+ EKI QL + A  IPRLG+PAYE
Sbjct: 22  PPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVPAYE 81

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG+A  GPG+ FNGTV + TSFPQV+++AASF+   W  I   +  EAR +YN 
Sbjct: 82  WWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNA 141

Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           GQA G+TFWAPNINIFRDPRWGRGQETPGEDPM+   YAV +V+  QG++          
Sbjct: 142 GQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDS---------- 191

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
            F  ++ L          + L  SACCKH  AYDL++W   +RY FNA ++  D  +T+Q
Sbjct: 192 -FDGRKTLS---------NHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQ 241

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
           PPF+ CIE+G+AS IMC+YN+VNG+P+C   +L  + AR +W F+GYITSDCDAV+ I++
Sbjct: 242 PPFKKCIEEGRASGIMCAYNRVNGIPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYD 301

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q Y K+ ED+ A VLKAGMD+NCG+ + +HT+SA+ + KV E DIDRALLNLFSV++RL
Sbjct: 302 AQGYAKSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRL 361

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           GLFNGDP K  YG + P++VC+  H+ LALDAAR GIVLLKN+ K LP +K +VSSLA+I
Sbjct: 362 GLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVI 421

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           GP  + +  + G Y G PC   + L  L +YV    Y  GC  V C S+A   +AV IAK
Sbjct: 422 GPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVAC-SNAAIDQAVAIAK 480

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD V+++ GLD TQE ED DRV L LPG+Q  L+TSVA  +K+PV+LVL  GGP+D+SF
Sbjct: 481 NADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISF 540

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
           A  +++I SI+W GYPGEAG  A++EIIFGD NPGGRLP+TWYP+SF  + M DM MR  
Sbjct: 541 AANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVNIQMTDMRMR-- 598

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQT 694
           S+  YPGR+Y+FY G +VY FGHGLSY+ YSY+F + A + L ++ S KA       Q  
Sbjct: 599 SATGYPGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQS-KA-------QTN 650

Query: 695 GSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG--TPEKQL 750
              + Y  + E+    C   +  V + V N G++ G H V++FAR  +  +     EKQL
Sbjct: 651 SDSVRYTLVSEMGKEGCDVAKTKVTVEVENQGEMAGKHPVLMFARHERGGEDGKRAEKQL 710

Query: 751 IGFDRVHTVAKGSK-EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           +GF  +  ++ G K E+ F +  CE LS AN+ G  +L  G + L VG+    L +
Sbjct: 711 VGFKSI-VLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDSELPLIV 765


>gi|356552866|ref|XP_003544783.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 776

 Score =  769 bits (1985), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 380/761 (49%), Positives = 508/761 (66%), Gaps = 34/761 (4%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPFCNT L IS RA+ L+S LTL EK+ QL + A AIPRLGIP+Y+WWSE+LHG+A  G 
Sbjct: 41  YPFCNTRLPISKRAQDLVSRLTLDEKLAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGF 100

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNIN 170
           G+ FNGT+ S TSFPQV+++AASF+ +LW  I   +  EARA+YN GQA G+TFWAPNIN
Sbjct: 101 GIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGKEARAVYNAGQATGMTFWAPNIN 160

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           +FRDPRWGRGQET GEDP++ + Y V +V+  QG+++                     E 
Sbjct: 161 VFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQGDSF---------------------EG 199

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
              G+ L  SACCKH  AYDL+ W    R+ ++A +T QD  DT+QPPF+SCIEQG+AS 
Sbjct: 200 GKLGERLQASACCKHFTAYDLDHWKGLDRFVYDARVTSQDLADTYQPPFQSCIEQGRASG 259

Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           IMC+YN+VNGVP C   +L  K AR +W F GYITSDC AV+ I + Q Y KT ED+ A 
Sbjct: 260 IMCAYNRVNGVPNCANFNLLTKTARQQWKFDGYITSDCGAVSIIHDEQGYAKTAEDAIAD 319

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
           V +AGMD+ CG  + +H +SA+ + K+    IDRAL NLFS+++RLGL +G+P K  +G 
Sbjct: 320 VFRAGMDVECGDYITKHGKSAVSQKKLPISQIDRALQNLFSIRIRLGLLDGNPTKLPFGT 379

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG-GG 468
           +GPD VC+ +  +LAL+AAR GIVLLKN    LPL K    ++A+IGP  N  S++  G 
Sbjct: 380 IGPDQVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKTN-PTIALIGPNANASSKVFLGN 438

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G PC+  +LL+G E Y   T Y  GC D P  + A    AV +AKK D+V++V GLD 
Sbjct: 439 YYGRPCNLVTLLQGFEGYAKDTVYHPGCDDGPQCAYAQIEGAVEVAKKVDYVVLVMGLDQ 498

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           +QE E  DR  L LPG+Q  L+ SVAR SKRPV+LVL  GGP+D++ A+ D ++  ILW 
Sbjct: 499 SQERESHDREYLGLPGKQEELIKSVARASKRPVVLVLLCGGPVDITSAKFDDKVGGILWA 558

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
           GYPGE G  ALA+++FGD NPGG+LP+TWYP+ F KVPM DM MRAD +  YPGR+YRFY
Sbjct: 559 GYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFIKVPMTDMRMRADPASGYPGRTYRFY 618

Query: 649 TGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
           TG +VY FG+GLSYT YSYK LS + + L I+      S  ++  Q    + Y  + E+ 
Sbjct: 619 TGPKVYEFGYGLSYTKYSYKLLSLSHNTLHINQ-----SSTHLTTQNSETIRYKLVSELA 673

Query: 708 --SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV-SQGTPEKQLIGFDRVHTVAKGSK 764
             +C ++   + + VTN G++ G H V+LF R  KV + G P KQL+GF  V   A  + 
Sbjct: 674 EETCQTMLLSIALGVTNHGNMAGKHPVLLFVRQGKVRNNGNPVKQLVGFQSVKLNAGETV 733

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           ++ F + PCE LS+AN+ G  ++  G+++L+VG+  + + I
Sbjct: 734 QVGFELSPCEHLSVANEAGSMVIEEGSYLLLVGDQEYPIEI 774


>gi|357489441|ref|XP_003615008.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516343|gb|AES97966.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 798

 Score =  768 bits (1982), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/789 (48%), Positives = 520/789 (65%), Gaps = 49/789 (6%)

Query: 40  PDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P + C    P   S PFCN +L+I+ RAK ++S LTL EKI QL + A +IPRLGIP+Y+
Sbjct: 34  PPYSCDTTNPLTKSLPFCNLNLTITQRAKDIVSRLTLDEKISQLVNTAPSIPRLGIPSYQ 93

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WW E+LHG+A+ G G+  NG+V+  TSFPQV+++AASF+  LW  I   +  EAR +YN 
Sbjct: 94  WWDEALHGVANAGKGIRLNGSVAGATSFPQVILTAASFDSKLWYQISKVIGTEARGVYNA 153

Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           GQA G+TFWAPNINIFRDPRWGRGQET GEDP+V S Y V +V+  QG+++   +GG   
Sbjct: 154 GQAQGMTFWAPNINIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSF---EGG--- 207

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA------------ 264
               K +          GD L  SACCKH  AYDL+ W    R+ F+A            
Sbjct: 208 ----KLI----------GDRLKASACCKHFTAYDLDNWKGLDRFDFDAKVSFLFSMAYSP 253

Query: 265 ----MITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF 319
                +T QD  DT+QPPF SCI QG++S IMC+YN+VNGVP C   +L  K AR +W F
Sbjct: 254 WMINYVTLQDLADTYQPPFHSCIVQGRSSGIMCAYNRVNGVPNCADYNLLTKTARQKWNF 313

Query: 320 KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEK 379
            GYITSDC+AV  I++ Q Y KT ED+ A VL+AGMD+ CG  + +H ++A+ + KV   
Sbjct: 314 NGYITSDCEAVRIIYDNQGYAKTPEDAVADVLQAGMDVECGDYLTKHAKAAVLQKKVPIS 373

Query: 380 DIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDK 439
            IDRAL NLF++++RLGLF+G+P K +YG++GP+ VC+ E+  LAL+AAR GIVLLKN  
Sbjct: 374 QIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTA 433

Query: 440 KFLPLNKNAVSSLAIIGPLVNNISQMG-GGYTGIPCSPKSLLRGLEAYVSKTHYASGCHD 498
             LPL +  V++L +IGP  N  S++  G Y G PC    +L+G   Y S+THY SGC D
Sbjct: 434 SILPLPR--VNTLGVIGPNANKSSKVVLGNYFGRPCRLVPILKGFYTYASQTHYRSGCLD 491

Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK 558
               + A    AV +AK +D+VI+V GLD +QE E RDR  L LPG+Q  L+ SVA+ SK
Sbjct: 492 GTKCASAEIDRAVEVAKISDYVILVMGLDQSQERESRDRDDLELPGKQQELINSVAKASK 551

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
           +PVILVL  GGP+D++FA+ + +I  I+W GYPGE G +ALA+++FGD+NPGGRLPMTWY
Sbjct: 552 KPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWY 611

Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
           P+ F K+PM DM MRAD S  YPGR+YRFYTG +VY FG+GLSY+NYSY F+S  +    
Sbjct: 612 PKDFIKIPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFISVKNN--- 668

Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
           +  +   +  +IL+ + + + Y  + E+   +C ++   V + +TN G + G H V+LF 
Sbjct: 669 NLHINQSTTHSILENSET-IYYKLVSELGEETCKTMSISVTLGITNTGSMAGKHPVLLFV 727

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           +  K   G P KQL+GF+ V     G  E+ F V  CE LS AN+ G +++  G H+L+V
Sbjct: 728 KPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGHLLVV 787

Query: 797 GELRHSLTI 805
           GE  +S+ I
Sbjct: 788 GEEEYSINI 796


>gi|414588273|tpg|DAA38844.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 775

 Score =  767 bits (1981), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/777 (49%), Positives = 500/777 (64%), Gaps = 32/777 (4%)

Query: 36  VLNKPDFPCKPPHFD-SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
           V + P F C P     +YPFC+ SL  + RA  L+S LT+ EK+ QL D A+ +PRLG+P
Sbjct: 24  VASDPMFSCGPSSASRAYPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVPRLGVP 83

Query: 95  AYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
            Y+WWSE LHG+A  G G+ FNGTVS+VTSFPQVL++ ASF+ SLW  IG A+  EARA+
Sbjct: 84  PYKWWSEGLHGLAFWGHGMRFNGTVSAVTSFPQVLLTTASFDESLWFRIGQAIGREARAL 143

Query: 155 YNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
           YNLGQA GLT W+PN+NIFRDPRWGRGQETPGEDP V S YAV FV+  QG N       
Sbjct: 144 YNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQGSN------- 196

Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
                                  L  SACCKH  AYDLE W   +RY+F+A +T QD  D
Sbjct: 197 ---------------PAGAAAAPLQASACCKHATAYDLEDWNGVARYNFDARVTLQDLAD 241

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
           TF PPF+SC+  GKASC+MC+Y  +NGVPAC   DL  K  R  WG  GY++SDCDAVA 
Sbjct: 242 TFNPPFQSCVVDGKASCVMCAYTVINGVPACASSDLLTKTFRGAWGLDGYVSSDCDAVAI 301

Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
           + + Q Y  T ED+ A  LKAG+D+NCGT   +H  +AI +GK+ EKD+D+AL NLF+V+
Sbjct: 302 MRDAQRYEPTPEDTVAVALKAGLDLNCGTYTQQHGMAAIQQGKMTEKDVDKALTNLFAVR 361

Query: 393 LRLGLFNGDPR-KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
           +RLG F+GDPR    YG+LG  DVCT++HK LAL+AA+ GIVLLKND   LPL+++AV S
Sbjct: 362 MRLGHFDGDPRGNALYGRLGAADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAVGS 421

Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAV 511
            A+IG   N+   + G Y G  C   + L GL++YV    + +GC    C   A   +A 
Sbjct: 422 AAVIGHNANDPLVLSGNYFGPACETTTPLEGLQSYVRNVRFLAGCSSAACGYAA-TGQAA 480

Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
            +A  A++V +  GL   QE E  DR SLLLPG+Q SLVT+VA  +KRPV+LVL  GGP+
Sbjct: 481 ALASSAEYVFLFMGLSQDQEKEGLDRTSLLLPGKQQSLVTAVASAAKRPVVLVLLTGGPV 540

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
           D++FA+++ +I +ILW GYPG+AG  A+A ++FGD NP GRLP+TWY E FTKVPM DM 
Sbjct: 541 DITFAQSNPKIGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYTEDFTKVPMTDMR 600

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
           MRAD +  YPGR+YRFY G  +Y FG+GLSY+ +S + ++    L  + SL A       
Sbjct: 601 MRADPATGYPGRTYRFYRGKTIYKFGYGLSYSKFSRQLVTGDKNLAPNTSLLAHLSAKTQ 660

Query: 692 QQTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
               S   Y H+D++ +  C  L+F  ++ V N G +DG H V++F R P  + G P +Q
Sbjct: 661 HAATS---YYHVDDIGTVGCEQLKFPAEVEVLNHGPMDGKHSVLMFLRWPNATDGRPVRQ 717

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           LIGF   H  A     + F V PCE  S     G++++  G+H LMVG+    ++ E
Sbjct: 718 LIGFRSQHIKAGEKANVRFHVSPCEHFSRTRADGKKVIDRGSHFLMVGKEELEISFE 774


>gi|356548162|ref|XP_003542472.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 778

 Score =  765 bits (1975), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/763 (49%), Positives = 516/763 (67%), Gaps = 37/763 (4%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           Y FCNT L I+ RA+ L+S LTL EK+ QL + A AIPRLGIP+Y+WWSE+LHG+A  G 
Sbjct: 42  YSFCNTKLPITKRAQDLVSRLTLDEKLAQLVNTAPAIPRLGIPSYQWWSEALHGVADAGF 101

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNIN 170
           G+ FNGT+ S TSFPQV+++AASF+ +LW  I   +  EARA+YN GQA G+TFWAPNIN
Sbjct: 102 GIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGREARAVYNAGQATGMTFWAPNIN 161

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           +FRDPRWGRGQET GEDP++ + Y V +V+  QG++++                   G+ 
Sbjct: 162 VFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQGDSFEG------------------GKL 203

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
           ++R   L  SACCKH  AYDL++W    R+ F+A +T QD  DT+QPPF+SCIEQG+AS 
Sbjct: 204 AER---LQASACCKHFTAYDLDQWKGLDRFVFDARVTSQDLADTYQPPFQSCIEQGRASG 260

Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           IMC+YN+VNGVP C   +L  K AR +W F GYITSDC AV+ I E Q Y KT ED+ A 
Sbjct: 261 IMCAYNRVNGVPNCADFNLLTKTARQQWKFDGYITSDCGAVSIIHEKQGYAKTAEDAIAD 320

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
           V +AGMD+ CG  + +H +SA+ + K+    IDRAL NLFS+++RLGLF+G+P K  +G 
Sbjct: 321 VFRAGMDVECGDYITKHAKSAVFQKKLPISQIDRALQNLFSIRIRLGLFDGNPTKLPFGT 380

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG-GG 468
           +GP++VC+ +  +LAL+AAR GIVLLKN    LPL K    ++A+IGP  N  S++  G 
Sbjct: 381 IGPNEVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKTN-PTIALIGPNANASSKVFLGN 439

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G PC+  +LL+G E Y +KT Y  GC D P  + A   EAV +AKK D+V++V GLD 
Sbjct: 440 YYGRPCNLVTLLQGFEGY-AKTVYHPGCDDGPQCAYAQIEEAVEVAKKVDYVVLVMGLDQ 498

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           +QE E  DR  L LPG+Q  L+ SVAR +KRPV++VL  GGP+D++ A+ D ++  ILW 
Sbjct: 499 SQERESHDREYLGLPGKQEELIKSVARAAKRPVVVVLLCGGPVDITSAKFDDKVGGILWA 558

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
           GYPGE G  ALA+++FGD NPGG+LP+TWYP+ F KVPM DM MRAD +  YPGR+YRFY
Sbjct: 559 GYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFIKVPMTDMRMRADPASGYPGRTYRFY 618

Query: 649 TGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
           TG +VY FG+GLSYT YSYK LS + S L I+      S  +++ Q    + Y  + E+ 
Sbjct: 619 TGPKVYEFGYGLSYTKYSYKLLSLSHSTLHINQ-----SSTHLMTQNSETIRYKLVSELA 673

Query: 708 --SCTSLRFHVQISVTNAGDVDGSHVVMLFAR---VPKVSQGTPEKQLIGFDRVHTVAKG 762
             +C ++   + + VTN G++ G H V+LF R   V  ++ G P KQL+GF  V   A  
Sbjct: 674 EETCQTMLLSIALGVTNRGNLAGKHPVLLFVRQGKVRNINNGNPVKQLVGFQSVKVNAGE 733

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           + ++ F + PCE LS+AN+ G  ++  G+++ +VG+  + + +
Sbjct: 734 TVQVGFELSPCEHLSVANEAGSMVIEEGSYLFIVGDQEYPIEV 776


>gi|358349509|ref|XP_003638778.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355504713|gb|AES85916.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 776

 Score =  764 bits (1974), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/773 (49%), Positives = 510/773 (65%), Gaps = 33/773 (4%)

Query: 40  PDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P F C    P   SYPFCN  L I+ R K L+S LTL EK+ QL ++A  IPRLGIPAYE
Sbjct: 28  PPFACDYSNPSTRSYPFCNPKLPITQRTKDLVSRLTLDEKLAQLVNSAPPIPRLGIPAYE 87

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHGI + G G+ FNG+++S TSFPQV+++AASF+  LW  IG A+ VEARA+YN 
Sbjct: 88  WWSEALHGIGNVGRGIFFNGSITSATSFPQVILTAASFDSHLWYRIGQAIGVEARAIYNG 147

Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           GQA G+TFWAPNINIFRDPRWGRGQET GEDPM+ S YAV +V+  QG++++   GG   
Sbjct: 148 GQAMGMTFWAPNINIFRDPRWGRGQETAGEDPMMTSNYAVSYVRGLQGDSFQ---GG--- 201

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                  L+G          L  SACCKH  AYDL+ W   +R+ F+A ++ QD  DT+Q
Sbjct: 202 ------KLRG---------HLQASACCKHFTAYDLDNWKGVNRFHFDARVSLQDLADTYQ 246

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFE 335
           PPFRSCIEQG+AS IMC+YN+VNG+P+C   +L     R +W F GYI SDC AV  I +
Sbjct: 247 PPFRSCIEQGRASGIMCAYNRVNGIPSCADFNLLTNTVRKQWEFHGYIVSDCGAVGIIHD 306

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q Y K+ ED+ A VL AGMD+ CG+ +  H +SA+ + K+    IDRAL NLFS+++RL
Sbjct: 307 EQGYAKSAEDAVADVLHAGMDLECGSYLTDHAKSAVQQKKLPIVRIDRALHNLFSIRIRL 366

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G F+G+P K  +G +GP+ VC+  H  LAL+AAR GIVLLKN    LPL K ++S LA+I
Sbjct: 367 GQFDGNPAKLPFGMIGPNHVCSENHLYLALEAARNGIVLLKNTASLLPLPKTSIS-LAVI 425

Query: 456 GPLVN-NISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           GP  N +   + G Y G PC   ++L+G + YV    +  GC   P  + A   +AV++A
Sbjct: 426 GPNANASPLTLLGNYAGPPCKSITILQGFQHYVKNAVFHPGCDGGPKCASAPIDKAVKVA 485

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           K AD+V++V GLD + E E+RDRV L LPG+Q+ L+ SVA+ SKRPVILVL  GGP+D+S
Sbjct: 486 KNADYVVLVMGLDQSVEREERDRVHLDLPGKQLELINSVAKASKRPVILVLLCGGPIDIS 545

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
            A+ + +I  I+W GYPGE G  ALA+IIFGD NPGGRLP+TWYP+ + KVPM DM MRA
Sbjct: 546 SAKNNDKIGGIIWAGYPGELGGIALAQIIFGDHNPGGRLPITWYPKDYIKVPMTDMRMRA 605

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D +  YPGR+YRFY G  VY FGHGLSYT YSY+F+S    +T        S  +++ + 
Sbjct: 606 DPTTGYPGRTYRFYKGPTVYEFGHGLSYTKYSYEFVS----VTHDKLHFNQSSTHLMTEN 661

Query: 695 GSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
              + Y  + E+   +C S+   V + V N G++ G H ++LF R  K    +P KQL+G
Sbjct: 662 SETIRYKLVSELDEETCKSMSVSVTVGVKNHGNIVGRHPILLFMRPQKHRTRSPMKQLVG 721

Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           F  +   A     + F + PCE LS AN+ G +I+  G+H+L VGE  + + I
Sbjct: 722 FHSLLLDAGEMSHVGFELSPCEHLSRANEAGLKIIEEGSHLLHVGEEEYLIDI 774


>gi|224066929|ref|XP_002302284.1| predicted protein [Populus trichocarpa]
 gi|222844010|gb|EEE81557.1| predicted protein [Populus trichocarpa]
          Length = 742

 Score =  764 bits (1972), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 394/806 (48%), Positives = 517/806 (64%), Gaps = 81/806 (10%)

Query: 12  ERILFRVLQLIVV--VNVIAFSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKS 67
           + +  R+L LI +   ++  +  S     +P + C    P    YPFC T L IS R + 
Sbjct: 4   QNLCLRILILIAIHTTSLHLYVES----TQPPYSCDSSDPSTKLYPFCQTKLPISQRVED 59

Query: 68  LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA---SNGPGVNFNGTVSSVTS 124
           L+S LTL EK+ QL D A AIPRLGIPAYEWWSE+LHG+A   +   G+ FNGT+   TS
Sbjct: 60  LVSRLTLDEKVSQLVDTAPAIPRLGIPAYEWWSEALHGVALQTTVRQGIRFNGTIRFATS 119

Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQET 183
           FPQV+++AASF+  LW  IG  +  EAR +YN GQA G+TFWAPNINIFRDPRWGRGQET
Sbjct: 120 FPQVILTAASFDAHLWYRIGQVIGKEARGIYNAGQATGMTFWAPNINIFRDPRWGRGQET 179

Query: 184 PGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACC 243
           PGEDP+V   YAV +V+  QG+++    GG                    G++L  SACC
Sbjct: 180 PGEDPLVAGKYAVSYVRGVQGDSF----GG-----------------GTLGEQLQASACC 218

Query: 244 KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA 303
           KH  AYDL+KW   +R+ F+A    QD  DT+QPPF+SCI++GKAS IMC+YN+VNGVP 
Sbjct: 219 KHFTAYDLDKWKGMNRFVFDA----QDLADTYQPPFQSCIQEGKASGIMCAYNRVNGVPN 274

Query: 304 CLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC 362
           C   +L  +KAR +WGF GYITSDCDAVA I + Q Y K+ ED+ A VLKAGMD+NCG  
Sbjct: 275 CADYNLLSKKARGQWGFYGYITSDCDAVAIIHDDQGYAKSPEDAVADVLKAGMDVNCGDY 334

Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKK 422
           +  +T+SA+ K K+ E +IDRAL NLFS+++RLGLFNG+P K  YG + PD VC+ EH+ 
Sbjct: 335 LKNYTKSAVKKKKLPESEIDRALHNLFSIRMRLGLFNGNPTKQPYGNIAPDQVCSQEHQA 394

Query: 423 LALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG 482
           LAL AA+ GIVLLKN  K LPL+K    SLA+IGP  NN +++ G Y G PC   + L+G
Sbjct: 395 LALKAAQDGIVLLKNPDKLLPLSKLETKSLAVIGPNANNSTKLLGNYFGPPCKTVTPLQG 454

Query: 483 LEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
           L+ Y+  T Y  GC  V C+S A  ++AV+IAK AD VI+V GLD TQE E++DRV L+L
Sbjct: 455 LQNYIKNTRYHPGCSRVACSS-ASINQAVKIAKGADQVILVMGLDQTQEKEEQDRVDLVL 513

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
           PG+Q  L+T+VA+ +K+PV+LVL  GGP+DVSFA+ D  I SI+W GYPGEAG  ALA+I
Sbjct: 514 PGKQRELITAVAKAAKKPVVLVLFCGGPVDVSFAKYDQNIGSIIWAGYPGEAGGTALAQI 573

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           IFGD NPGGRLPMTWYP+ FTKVPM DM MR   S  YPGR+YRFY G +V+ FG+GLSY
Sbjct: 574 IFGDHNPGGRLPMTWYPQDFTKVPMTDMRMRPQLSSGYPGRTYRFYNGKKVFEFGYGLSY 633

Query: 663 TNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQIS 719
           +NYSY+  S   ++L + AS       N + +  + + +  I  +    C   +F V + 
Sbjct: 634 SNYSYELASDTQNKLYLRAS------SNQITKNSNTIRHKLISNIGKELCEKTKFTVTVR 687

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           V N G++ G +                                  EI + + PCE LS  
Sbjct: 688 VKNHGEMAGENA---------------------------------EIQYELSPCEHLSSP 714

Query: 780 NKHGRRILPLGNHVLMVGELRHSLTI 805
           +  G  ++  G+  L++G+  + +TI
Sbjct: 715 DDRGMMVMEEGSQFLLIGDKEYPITI 740


>gi|302791321|ref|XP_002977427.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
 gi|300154797|gb|EFJ21431.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
          Length = 772

 Score =  763 bits (1970), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/769 (48%), Positives = 505/769 (65%), Gaps = 32/769 (4%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C      S+PFC+ SL +  R   L+  + L EKI Q+  NAS IPRLGIP Y+WW E+L
Sbjct: 25  CTDSRSSSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEAL 84

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
           HG+A + PGV F   V S TSFPQV+++ ASFN SLW+ I  A+++EA AMYN G++GLT
Sbjct: 85  HGVAES-PGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSGLT 143

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+PNINIFRDPRWGRGQETPGEDP++ S YA  FV+  Q  ++   D G      +   
Sbjct: 144 FWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDY---DEGTAISTMQGSP 200

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
            +           L +S+CCKH  AYD+EK      + FNA +T QD +DTF PPFRSCI
Sbjct: 201 TR-----------LKVSSCCKHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCI 249

Query: 284 EQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
             G+AS +MCSYN+VNGVP+C     L +  RN WGF+GYI SDCDAVA ++EY NYT T
Sbjct: 250 VDGQASGLMCSYNRVNGVPSCADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINYTTT 309

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
            ED+ A VL AGMD+NCGT +LRHT +AI++GKV E  +DRAL N+ +V++RLGLF+G+ 
Sbjct: 310 AEDAVADVLSAGMDLNCGTFLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGNS 369

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
            +  Y  +GPD VCT EH++L+L+AA QGIVLLKN    LP  +N + ++A+IGP  N  
Sbjct: 370 GE-TYNSIGPDAVCTPEHRQLSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNAT 428

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
             M G Y G+PC   +  +GL+ Y     +  GC D+ CN    F  AVR A+ +D V++
Sbjct: 429 ETMLGNYAGVPCQYITPFQGLQEYTKCVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVI 488

Query: 523 VAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           V GLD  QE E  DR SLLLPG Q  LV  V++ +K PVILV+  GGP+DV+FA+ + +I
Sbjct: 489 VVGLDKDQEREGLDRTSLLLPGNQQGLVLEVSKVAKGPVILVVMSGGPIDVTFAKENCKI 548

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYP 641
           S++LW+GYPGEAG KA+A +IFGD NP GRLPMTWYP++F + V + +M++R ++S  +P
Sbjct: 549 SNVLWVGYPGEAGGKAIARVIFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFP 608

Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR---L 698
           GR+YRFYTG  VY FGHGLSYTN++Y    APS +T   ++     +  L++ G+R   +
Sbjct: 609 GRTYRFYTGENVYEFGHGLSYTNFTYTNFCAPSNITARNTVAI---RTPLREDGARQFPI 665

Query: 699 DYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQLIGFDRV 756
           DY      T C +L F V   ++N G  D  H+ +L+A  P  S     P KQLI F R 
Sbjct: 666 DY------TGCEALAFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQ 719

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           H +A    ++ F VD C+ L + N+ G ++L  G++ L +G++ H +++
Sbjct: 720 HLIAGRCAKVEFDVDTCKDLGLTNEAGTKVLVHGDYKLSLGDIEHVISL 768


>gi|297842585|ref|XP_002889174.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335015|gb|EFH65433.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 766

 Score =  762 bits (1968), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 393/780 (50%), Positives = 513/780 (65%), Gaps = 50/780 (6%)

Query: 40  PDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P   C P  P    Y FC T L IS RA+ L+S L + EKI QL + A  IPRLG+PAYE
Sbjct: 21  PPHSCDPSNPTTKLYQFCRTDLPISQRARDLVSRLNIDEKISQLGNTAPGIPRLGVPAYE 80

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG+A  GPG+ FNGTV + TSFPQV+++AASF+   W  I   +  EAR +YN 
Sbjct: 81  WWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNA 140

Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           GQA G+TFWAPNINIFRDPRWGRGQETPGEDP++   YAV +V+  QG++          
Sbjct: 141 GQAQGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGTYAVAYVRGLQGDS---------- 190

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
            F  ++ L            L  SACCKH  AYDL++W   +RY FNA ++  D  +T+Q
Sbjct: 191 -FDGRKTLS---------IHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQ 240

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
           PPF+ CIE+G+AS IMC+YN+VNG+P+C   +L  + AR  W F+GYITSDCDAV+ I +
Sbjct: 241 PPFKKCIEEGRASGIMCAYNRVNGIPSCADPNLLTRTARGLWRFRGYITSDCDAVSIIHD 300

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q Y KT ED+ A VLKAGMD+NCG+ + +HT+SA+ + KV E DIDRALLNLFSV++RL
Sbjct: 301 AQGYAKTPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRL 360

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           GLFNGDP K  YG + P+DVC+  H+ LAL+AAR GIVLLKN+ K LP +K +VSSLA+I
Sbjct: 361 GLFNGDPTKLPYGNISPNDVCSPAHQALALEAARNGIVLLKNNLKLLPFSKRSVSSLAVI 420

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           GP  +    + G Y G PC   + L  L +YV    Y +GC  V C S+A   +AV IA+
Sbjct: 421 GPNAHVAKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHNGCDSVAC-SNAAIDQAVAIAR 479

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD V+++ GLD TQE ED DRV L LPG+Q  L+TSVA  +K+PV+LVL  GGP+D+SF
Sbjct: 480 NADHVVLIMGLDQTQEKEDMDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISF 539

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
           A  + +I SI+W GYPGEAG  ALAEIIFGD NPGGRLP+TWYP+SF  V M DM MR  
Sbjct: 540 ATNNDKIGSIMWAGYPGEAGGIALAEIIFGDHNPGGRLPVTWYPQSFVNVQMTDMRMR-- 597

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
           S+  YPGR+Y+FY G +V+ FGHGLSY+ YSY+F +             G+    L Q+ 
Sbjct: 598 SATGYPGRTYKFYKGPKVFEFGHGLSYSTYSYRFKT------------LGATNLYLNQSK 645

Query: 696 SRLD-----YVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG--TP 746
           ++L+     Y  + E+    C   +  V ++V N G++ G H V++FAR  +  +     
Sbjct: 646 AQLNSDSVRYTLVSEMGEEGCNIAKTKVIVTVENQGEMAGKHPVLMFARHERGGENGKRA 705

Query: 747 EKQLIGFDRVHTVAKGSK-EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           EKQL+GF  +  ++ G K E+ F +  CE LS AN+ G  ++  G + L VG+    LTI
Sbjct: 706 EKQLVGFKSI-VLSNGEKAEMEFEIGLCEHLSRANEVGVMVVEEGKYFLTVGDSELPLTI 764


>gi|356525896|ref|XP_003531557.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Glycine max]
          Length = 776

 Score =  761 bits (1964), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/769 (48%), Positives = 507/769 (65%), Gaps = 40/769 (5%)

Query: 42  FPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           F C   K P    Y FC+ SLS+  R   L+  LTLQEKI  L ++A+++ RLGIP YEW
Sbjct: 41  FACDVAKNPALAGYGFCDKSLSLEDRVADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEW 100

Query: 99  WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           WSE+LHG+++ GPG +F+  V   TSFP  +++AASFN SL+  IG  V+ EARAMYN+G
Sbjct: 101 WSEALHGVSNVGPGTHFSSLVPGATSFPMPILTAASFNASLFEAIGRVVSTEARAMYNVG 160

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
            AGLT+W+PNINIFRDPRWGRGQETPGEDP++ S YA  +VK  Q    ++DDG      
Sbjct: 161 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYATGYVKGLQ----QTDDG------ 210

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                           ++L ++ACCKH  AYDL+ W    RY+FNA++T+QD +DTFQPP
Sbjct: 211 --------------DSNKLKVAACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPP 256

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
           F+SC+  G  + +MCSYNQVNG P C   DL +   R EW   GYI SDCD+V  +F+ Q
Sbjct: 257 FKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGEWKLNGYIVSDCDSVEVLFKDQ 316

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           +YTKT E++AA  + AG+D+NCG  + ++T+ A+ +G + E  I+ A+ N F+  +RLG 
Sbjct: 317 HYTKTPEEAAAETILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGF 376

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           F+GDP K  YG LGP+DVCTSE+++LA +AARQGIVLLKN    LPLN  A+ SLA+IGP
Sbjct: 377 FDGDPSKQTYGNLGPNDVCTSENRELAREAARQGIVLLKNSLGSLPLNAKAIKSLAVIGP 436

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
             N    M G Y GIPC+  S L+ L A V  T YA+GC +V C ++A   +A +IA  A
Sbjct: 437 NANATRVMIGNYEGIPCNYISPLQALTALV-PTSYAAGCPNVQC-ANAELDDATQIAASA 494

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D  ++V G  L  E E  DR+++LLPGQQ  LV+ VA  SK PVILV+  GG +DVSFA+
Sbjct: 495 DATVIVVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAK 554

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADS 636
           ++ +I+SILW+GYPGEAG  A+A++IFG +NP GRLPMTWYP+S+  KVPM +MNMRAD 
Sbjct: 555 SNDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVNKVPMTNMNMRADP 614

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
           +  YPGR+YRFY G  V+ FG G+S++N  +K + AP  +++  +     D         
Sbjct: 615 ATGYPGRTYRFYKGETVFSFGDGISFSNIEHKIVKAPQLVSVPLA----EDHECRSSECM 670

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
            LD    DE   C +L F + + V N G +  SHVV+LF   P V    P+K L+GF++V
Sbjct: 671 SLDVA--DE--HCQNLAFDIHLGVKNMGKMSSSHVVLLFFTPPDV-HNAPQKHLLGFEKV 725

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           H   K   ++ F VD C+ LS+ ++ G R +PLG H+L VG L+H L++
Sbjct: 726 HLPGKSEAQVRFKVDICKDLSVVDELGNRKVPLGQHLLHVGNLKHQLSV 774


>gi|357152329|ref|XP_003576084.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 779

 Score =  761 bits (1964), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/776 (48%), Positives = 497/776 (64%), Gaps = 38/776 (4%)

Query: 40  PDFPCKP-PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           P F C P      Y FC+ +L +  RA  L+S LTL EK+ QL D A A+PRLG+PAY+W
Sbjct: 34  PPFSCGPGSATQGYAFCDKALPVERRAADLVSRLTLAEKVSQLGDEADAVPRLGVPAYKW 93

Query: 99  WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           WSE LHG++  G G++F+G V ++TSFPQVL++AASF++ +W  IG A+  EARA+YNLG
Sbjct: 94  WSEGLHGLSFWGHGMHFDGAVRAITSFPQVLLTAASFDQDIWYRIGQAIGTEARALYNLG 153

Query: 159 QA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
           QA GLT W+PN+NI+RDPRWGRGQETPGEDP   S YAV FVK  QG +  +        
Sbjct: 154 QAQGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGTSATT-------- 205

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                              L  SACCKH  AYDLE W    RY+FNA +T QD  DTF P
Sbjct: 206 -------------------LQTSACCKHATAYDLEDWNGVVRYNFNAKVTLQDLADTFNP 246

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
           PF+SC+E+GKA+C+MC+Y  +NGVPAC   DL  K  + +WG  GY++SDCDAVA + + 
Sbjct: 247 PFKSCVEEGKATCVMCAYTNINGVPACASSDLITKTFKGDWGLNGYVSSDCDAVALLRDA 306

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           Q Y  T ED+ A  LKAG+D+NCG     H  SA+ +GK+ E+D+D AL NLF+V++RLG
Sbjct: 307 QRYRATPEDTVAVALKAGLDLNCGNYTQVHGMSALQQGKMTEQDVDNALKNLFAVRMRLG 366

Query: 397 LFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
            F+GDPR    YG LG  DVC+  HK LAL+AA+ GIVLLKND   LPL+ +AV+S A I
Sbjct: 367 HFDGDPRTSALYGSLGAADVCSPAHKNLALEAAQSGIVLLKNDAGILPLDPSAVASAAAI 426

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           G   N+ + + G Y G PC   + L+GL+ YV    + +GC    C   A   +AV +A 
Sbjct: 427 GHNANDPAALNGNYFGPPCETTTPLQGLQGYVKNVKFLAGCDSAACGF-AATGQAVTLAS 485

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            +D+VI+  GL   +E E  DR SLLLPG+Q +L+T+VA  SKRPVILVL  GG +D++F
Sbjct: 486 SSDYVILFMGLSQKEEQEGIDRTSLLLPGKQQNLITAVASASKRPVILVLLTGGSVDITF 545

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
           A+++ +I +ILW GYPG+AG  A+A ++FGD NP GRLP+TWYPE FTKVPM DM MRAD
Sbjct: 546 AKSNPKIGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRAD 605

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI-SASLKAGSDKNILQQT 694
            +  YPGRSYRFY G  VY FG GLSY+ +S + +S+ +   + + +L  G         
Sbjct: 606 PATGYPGRSYRFYQGKTVYKFGDGLSYSKFSRQLVSSTNTHQVPNTNLLTGLTARTATDG 665

Query: 695 GSRLDYVHIDE--VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK-VSQGTPEKQLI 751
           G  + Y H++E  V  C  L+F   + V N G +DG H VM+F R P     G P  QL+
Sbjct: 666 G--MSYYHVEEIGVEGCDKLKFPAVVEVQNHGPMDGKHSVMMFLRWPNSTGTGRPVSQLV 723

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
           GF   H  A     ++F V PCE  + A + G++++  G+H L+VG+    ++  +
Sbjct: 724 GFRSQHLKAGEKASLTFDVSPCEHFARAREDGKKVIDRGSHFLVVGKDEREISFHS 779


>gi|359485890|ref|XP_002264183.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Vitis vinifera]
          Length = 774

 Score =  759 bits (1961), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/772 (49%), Positives = 505/772 (65%), Gaps = 40/772 (5%)

Query: 40  PDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           P F C     P    + FCNTSL  + R   L+  LTL+EKI  L ++A+++ RLGIP Y
Sbjct: 37  PVFACDVENNPTLGQFGFCNTSLETAARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKY 96

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           EWWSE+LHG++  GPG +FN  V   TSFPQV+++AASFN SL+  IG AV+ EARAMYN
Sbjct: 97  EWWSEALHGVSYVGPGTHFNSVVPGATSFPQVILTAASFNASLFEAIGKAVSTEARAMYN 156

Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           +G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ S YA  +V+  Q    +SDDG    
Sbjct: 157 VGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQ----QSDDG---- 208

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                             D L ++ACCKH  AYDL+ W    R+ FNA++T+QD +DTFQ
Sbjct: 209 ----------------SPDRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQ 252

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
           PPF+SC+  G  + +MCSYNQVNG PAC   DL     R EW   GYI SDCD+V   + 
Sbjct: 253 PPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYN 312

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q+YTKT E++AA  + AG+D+NCG+ + +HT++A+  G V E  +D+A+ N F+  +RL
Sbjct: 313 SQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRL 372

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G F+G+P K  YGKLGP DVCTSEH++LA +AARQGIVLLKN K  LPL+  A+ +LA+I
Sbjct: 373 GFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAVI 432

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           GP  N    M G Y G PC   + L+GL A V+ T Y  GC +V C + A   EA +IA 
Sbjct: 433 GPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATT-YLPGCSNVACGT-AQIDEAKKIAA 490

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD  +++ G+D + E E RDRV++ LPGQQ  L+T VA+ SK  VILV+  GG  D+SF
Sbjct: 491 AADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISF 550

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
           A+ D +I+SILW+GYPGEAG  A+A++IFG +NP GRLPMTWYP+S+  KVPM +MNMR 
Sbjct: 551 AKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRP 610

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D +  YPGR+YRFYTG  +Y FG GLSYT +++  + AP     S S+      +     
Sbjct: 611 DPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPK----SVSIPIEEGHSCHSSK 666

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
              +D V      SC +L F + + V NAG++ GSH V LF+  P V   +P+K L+GF+
Sbjct: 667 CKSVDAVQ----ESCQNLVFDIHLRVNNAGNISGSHTVFLFSSPPSV-HNSPQKHLLGFE 721

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           +V   AK    + F VD C+ LSI ++ G R + LG HVL VG L+HSL + 
Sbjct: 722 KVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNVR 773


>gi|115485165|ref|NP_001067726.1| Os11g0297800 [Oryza sativa Japonica Group]
 gi|62734696|gb|AAX96805.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|77549999|gb|ABA92796.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
           expressed [Oryza sativa Japonica Group]
 gi|113644948|dbj|BAF28089.1| Os11g0297800 [Oryza sativa Japonica Group]
 gi|125534139|gb|EAY80687.1| hypothetical protein OsI_35869 [Oryza sativa Indica Group]
 gi|215766717|dbj|BAG98945.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 782

 Score =  758 bits (1956), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/755 (49%), Positives = 491/755 (65%), Gaps = 47/755 (6%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           FC+ +L    RA  L++ LT  EK+ QL D A+ +PRLG+PAY+WWSE+LHG+A++G G+
Sbjct: 52  FCDATLPAEQRAADLVARLTAAEKVAQLGDQAAGVPRLGVPAYKWWSEALHGLATSGRGL 111

Query: 114 NFNGTVSSV---TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNI 169
           +F+   S+    TSFPQVL++AA+F+  LW  IG A+  EARA+YN+GQA GLT W+PN+
Sbjct: 112 HFDAPGSAARAATSFPQVLLTAAAFDDDLWFRIGQAIGTEARALYNIGQAEGLTMWSPNV 171

Query: 170 NIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGE 229
           NIFRDPRWGRGQETPGEDP + S YAV FVK  QG +                       
Sbjct: 172 NIFRDPRWGRGQETPGEDPTMASKYAVAFVKGMQGNS----------------------- 208

Query: 230 ESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKAS 289
                  L  SACCKH+ AYDLE W    RY+FNA +T QD EDT+ PPFRSC+   KA+
Sbjct: 209 ----SAILQTSACCKHVTAYDLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKAT 264

Query: 290 CIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           CIMC+Y  +NGVPAC   DL  K  R +WG  GYI SDCDAVA + + Q YT+T ED+ A
Sbjct: 265 CIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASDCDAVAIMRDAQRYTQTPEDAVA 324

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
             LKAG+D+NCGT M +H  +AI +GK+ E+DID+AL NLF++++RLG F+GDPR    Y
Sbjct: 325 VALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSVY 384

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
           G LG  D+CT EH+ LAL+AA  GIVLLKND   LPL++ AV+S A+IGP  N+   + G
Sbjct: 385 GGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALIG 444

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G PC   + L G+  Y+    + +GC+   C+  A   +A  +A  +D+V +  GL 
Sbjct: 445 NYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVAA-TDQAAAVASSSDYVFLFMGLS 503

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
             QE+E RDR SLLLPG+Q SL+T+VA  +KRPVILVL  GGP+DV+FA+ + +I +ILW
Sbjct: 504 QKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILW 563

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRF 647
            GYPG+AG  A+A ++FGD NPGGRLP+TWYPE FTKVPM DM MRAD +  YPGRSYRF
Sbjct: 564 AGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRF 623

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLS----APSELTISASLKAGSDKNILQQTGSRLDYVHI 703
           Y G  VY FG+GLSY++YS + +S    A S   + ASL+  +       T    +  HI
Sbjct: 624 YQGKTVYKFGYGLSYSSYSRQLVSGGKPAESYTNLLASLRTTT-------TSEGDESYHI 676

Query: 704 DEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
           +E+ +  C  L+F   + V N G +DG H V+++ R P    G P  QLIGF   H    
Sbjct: 677 EEIGTDGCEQLKFPAVVEVQNHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGFRSQHLKVG 736

Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
               I F + PCE  S   K G++++  G+H LMV
Sbjct: 737 EKANIRFDISPCEHFSRVRKDGKKVIDRGSHYLMV 771


>gi|302786124|ref|XP_002974833.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
 gi|300157728|gb|EFJ24353.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
          Length = 784

 Score =  755 bits (1950), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/760 (48%), Positives = 502/760 (66%), Gaps = 31/760 (4%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           S+PFC+T L I  R + L+S LTL EK+ ++ + A  IPRLG+P+Y+WW E+LHG+AS+ 
Sbjct: 48  SFPFCDTKLGIDVRVQDLVSRLTLDEKVDEMVNAAQGIPRLGVPSYQWWQEALHGVASS- 106

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
           PGV F G   + TSFP  + +AASFN +L+ +IG AV+ EARA++NLG+AGLTFW+PN+N
Sbjct: 107 PGVQFGGLAPAATSFPMPIATAASFNSTLFYSIGEAVSSEARALHNLGRAGLTFWSPNVN 166

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           IFRDPRWGRGQETPGEDP++ S +A  +V+  QG            G  E     GF   
Sbjct: 167 IFRDPRWGRGQETPGEDPLLASKFASLYVRGLQG------------GAYEGSASDGF--- 211

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                 L +SACCKHL AYD++ W    RY FNA ++EQD  DT+ PPF+SCIE G+ S 
Sbjct: 212 ------LKVSACCKHLTAYDVDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSS 265

Query: 291 IMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MCSYN+VNGVP C  R  L +  RN WGF GYI SDCDA+  +FE   Y  + ED+ A 
Sbjct: 266 VMCSYNRVNGVPTCADRNLLTETVRNSWGFNGYIVSDCDALQVLFEDTTYAPSAEDAVAD 325

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            + AG+D+NCGT + +H +SA+  GK+ E D+D A+ NL   ++RLGLF+GDP    Y  
Sbjct: 326 SILAGLDLNCGTFLGKHAKSALQAGKITEADLDHAVSNLMRTRMRLGLFDGDPNSQPYSS 385

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  D+C+++H++LALDAA QG+VLLKND   LPL+  A+ ++A+IGP  N    M G Y
Sbjct: 386 LGATDICSNDHQQLALDAALQGVVLLKNDGS-LPLS-TALKTVALIGPNANATYTMLGNY 443

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
            GIPC   S L+G++ Y S   Y+ GC +V CN       AV +A KAD V++V GLD +
Sbjct: 444 EGIPCKYISPLQGMQIYSSNILYSPGCRNVACNEGDLVASAVEVATKADAVVLVVGLDQS 503

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
           QE E  DR SLLLPG Q  LV+++A     P++LV+   GP+D+S  + +S+ISS++W+G
Sbjct: 504 QERETFDRTSLLLPGMQSQLVSNIANAVTSPIVLVIMSAGPVDISTFKDNSRISSVIWLG 563

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
           YPG++G  ALA ++FG +NPGGRLP TWY E FT V M DM MR +    YPGRSYRFYT
Sbjct: 564 YPGQSGGAALAHVVFGAYNPGGRLPNTWYHEEFTNVSMLDMQMRPNPLSGYPGRSYRFYT 623

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI--LQQTGSRLDYVHI--DE 705
           GT +Y FG GLSY+ Y YKFL AP++L+   S   G+ +    + ++ ++    H+  D+
Sbjct: 624 GTPLYNFGDGLSYSTYFYKFLLAPTKLSFFKS-NTGNSRGCPAVNRSKAKSGCFHLPADD 682

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
           + +C S+ F V + V+N G   GSH V++F+  P V +G P KQLI F +VH  +  ++ 
Sbjct: 683 LETCNSILFQVSVEVSNLGPRSGSHSVLIFSAPPPV-EGAPLKQLIAFQKVHLESDTTQR 741

Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           + FG+DPC+ LS   ++G+R L  G H L++G   H L I
Sbjct: 742 LIFGIDPCKHLSSVRRNGKRFLHSGRHKLLIGNAVHILDI 781


>gi|255545293|ref|XP_002513707.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223547158|gb|EEF48654.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 777

 Score =  754 bits (1948), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 381/785 (48%), Positives = 513/785 (65%), Gaps = 39/785 (4%)

Query: 27  VIAFSNSKPVLNKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSD 83
           V++  N     + P F C     P   S+ FCN SL IS R   L++ LTLQEKI  L +
Sbjct: 26  VLSSRNQVLAQSSPVFACDVKSNPSLASFGFCNVSLGISDRVTDLVNRLTLQEKIGFLVN 85

Query: 84  NASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNI 143
           +A ++ RLGIP YEWWSE+LHG++  GPG +F+  V   TSFPQV+++AASFN SL+  I
Sbjct: 86  SAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSNIVPGATSFPQVILTAASFNASLFEAI 145

Query: 144 GSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ 203
           G  V+ EARAMYN+G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S Y   +V+  Q
Sbjct: 146 GKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVRGLQ 205

Query: 204 GENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN 263
               ++D+G                 +S+R   L ++ACCKH  AYDL+ W    RY FN
Sbjct: 206 ----QTDNG-----------------DSER---LKVAACCKHYTAYDLDNWKGTDRYHFN 241

Query: 264 AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGY 322
           A++T+QD +DTFQPPF+SC+  G  + +MCSYNQVNG P C   DL     R EW   GY
Sbjct: 242 AVVTKQDLDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLAGIIRGEWKLNGY 301

Query: 323 ITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDID 382
           I SDCD+V  I+  Q+YTKT E++AA  + AG+D+NCG+ + +HT++A++ G +    +D
Sbjct: 302 IVSDCDSVDVIYNSQHYTKTPEEAAAITILAGLDLNCGSFLGKHTEAAVNAGLLNVSAVD 361

Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
           +A+ N F+  +RLG F+GDP K  YGKLGP DVCT+ +++LA +AARQGIVLLKN    L
Sbjct: 362 KAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAVNQELAREAARQGIVLLKNSPGSL 421

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN 502
           PL+  A+ +LA+IGP  N    M G Y G PC   + L+GL A V+ T Y +GC +V C 
Sbjct: 422 PLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLTASVATT-YLAGCSNVACA 480

Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVI 562
           + A   +A ++A  AD  ++V G D + E E RDRV +LLPGQQ  L+T VA  SK PVI
Sbjct: 481 A-AQVDDAKKLAASADATVLVMGADQSIEAESRDRVDVLLPGQQQLLITQVANVSKGPVI 539

Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 622
           LV+  GG +DVSFA+ + +I+SILW+GYPGEAG  A+A++IFG +NP GRLPMTWYP+++
Sbjct: 540 LVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGYYNPSGRLPMTWYPQAY 599

Query: 623 T-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
             KVPM +MNMR D S  YPGR+YRFYTG  VY FG GLSY+ Y ++ + AP  ++I   
Sbjct: 600 VDKVPMTNMNMRPDPSSGYPGRTYRFYTGETVYSFGDGLSYSEYKHQLVQAPQLVSIPL- 658

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
                + + + ++ S+   V   E  +C  L F++ + V N G V G+H V LF   P V
Sbjct: 659 -----EDDHVCRSSSKCISVDAGE-QNCQGLAFNIDLKVRNIGKVRGTHTVFLFFTPPSV 712

Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
              +P+K L+ F++V   AK    +SF VD C+ LS+ ++ G R + LG HVL VG L H
Sbjct: 713 -HNSPQKHLVDFEKVSLDAKTYGMVSFKVDVCKHLSVVDEFGSRKVALGGHVLHVGNLEH 771

Query: 802 SLTIE 806
           SLT+ 
Sbjct: 772 SLTVR 776


>gi|226531269|ref|NP_001145980.1| uncharacterized protein LOC100279508 precursor [Zea mays]
 gi|219885199|gb|ACL52974.1| unknown [Zea mays]
 gi|413920228|gb|AFW60160.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 794

 Score =  754 bits (1946), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/789 (46%), Positives = 507/789 (64%), Gaps = 53/789 (6%)

Query: 42  FPCKPPH-FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           F C P     S PFC  SL +  RA+ L+S LT  EK++ L +NA+ +PRLG+  YEWWS
Sbjct: 27  FACAPGGPAASLPFCRQSLPLRARARDLVSRLTRAEKVRLLVNNAAGVPRLGVAGYEWWS 86

Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           E+LHG++  GPGV F G     T+FPQV+ +AAS N +LW  +G AV+ EARAMYN G+A
Sbjct: 87  EALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASLNATLWELVGRAVSDEARAMYNGGRA 146

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GLTFW+PN+NIFRDPRWGRGQETPGEDP V + YA  +V+  Q + + + +GG       
Sbjct: 147 GLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQ-QPYAAPNGG------- 198

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
                         + L L+ACCKH  AYDL+KWG   R+ FNA++  QD EDTF  PFR
Sbjct: 199 ------------HRNRLKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFR 246

Query: 281 SCIEQGKASCIMCSYNQVNGVPAC-----LRGDLFQKARNEWGFKGYITSDCDAVATIFE 335
           +C+E G+A+ +MCSYNQVNGVP C     LRG +    R  WG  GYI SDCD+V   F 
Sbjct: 247 ACVEDGRAASVMCSYNQVNGVPTCADAAFLRGTI----RGRWGLDGYIVSDCDSVDVFFR 302

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q+YT+T ED+AA  L+AG+D++CG  +  +  SA+  GKV + D+D ALLN  +VQ+RL
Sbjct: 303 DQHYTRTPEDAAAATLRAGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRL 362

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN------DKKFLPLNKNAV 449
           G+F+GDP  G +G+LGP DVCT EH+ LALDAARQG+VLLKN      ++  LPL   A 
Sbjct: 363 GMFDGDPAAGPFGRLGPADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAH 422

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
             +A++GP  +    M G Y G PC   + L+G+ AY ++  + +GC DV C  +     
Sbjct: 423 RVVAVVGPHADATVAMIGNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAA 482

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
           AV  A++AD  +VVAGLD   E E  DR +LLLPG+Q  L+++VA+ SK PVILVL  GG
Sbjct: 483 AVEAARQADATVVVAGLDQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGG 542

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMN 628
           P+D++FA+ D +I  ILW+GYPG+AG +A+A++IFG  NPG +LP+TWY + +  KVPM 
Sbjct: 543 PIDIAFAQNDPRIDGILWVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMT 602

Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI-------SAS 681
           +M MRA+ +R YPGR+YRFYTG  +Y FGHGLSYT +++    AP++LT+       SA+
Sbjct: 603 NMAMRANPARGYPGRTYRFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAA 662

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
             A      L +    +   H      C  L   V + V+N GD DG+H V+++   P  
Sbjct: 663 SAASLLNATLARPVRAVRVAH----ARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSP 718

Query: 742 SQG-----TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           S        P +QL+ F++VH  A G   +   +  C++LS+A+++G R +P+G H LM+
Sbjct: 719 SHAAPGADAPARQLVAFEKVHVPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMI 778

Query: 797 GELRHSLTI 805
           GEL HS+++
Sbjct: 779 GELTHSVSL 787


>gi|356558612|ref|XP_003547598.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Glycine max]
          Length = 776

 Score =  754 bits (1946), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/769 (48%), Positives = 505/769 (65%), Gaps = 40/769 (5%)

Query: 42  FPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           F C   K P    Y FC+ SLS+  R   L+  LTLQEKI  L ++A+++ RLGIP YEW
Sbjct: 41  FACDVAKNPALAGYGFCDKSLSVEDRVADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEW 100

Query: 99  WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           WSE+LHG+++ GPG +F+  V   TSFP  +++AASFN SL+  IG  V+ EARAMYN+G
Sbjct: 101 WSEALHGVSNVGPGTHFSSLVPGATSFPMPILTAASFNASLFEAIGRVVSTEARAMYNVG 160

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
            AGLT+W+PNINIFRDPRWGRGQETPGEDP++ S YA  +VK  Q    ++DDG      
Sbjct: 161 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYATGYVKGLQ----QTDDG------ 210

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                           ++L ++ACCKH  AYDL+ W    RY+FNA++T+QD +DTFQPP
Sbjct: 211 --------------DSNKLKVAACCKHYTAYDLDNWKGIQRYTFNAVVTQQDMDDTFQPP 256

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQ 337
           F+SC+  G  + +MCSYNQVNG P C   DL +   R EW   GYI SDCD+V  +F+ Q
Sbjct: 257 FKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGIIRGEWKLNGYIVSDCDSVEVLFKDQ 316

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           +YTKT E++AA  + AG+D+NCG  + ++T+ A+ +G + E  I+ A+ N F+  +RLG 
Sbjct: 317 HYTKTPEEAAAQTILAGLDLNCGNYLGQYTEGAVKQGLLDEASINNAVSNNFATLMRLGF 376

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           F+GDP K  YG LGP DVCTSE+++LA +AARQGIVLLKN    LPLN   + SLA+IGP
Sbjct: 377 FDGDPSKQPYGNLGPKDVCTSENRELAREAARQGIVLLKNSPGSLPLNAKTIKSLAVIGP 436

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
             N    M G Y GIPC+  S L+ L A V  T YA+GC +V C ++A   +A +IA  A
Sbjct: 437 NANATRVMIGNYEGIPCNYISPLQTLTALV-PTSYAAGCPNVQC-ANAELDDATQIAASA 494

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D  +++ G  L  E E  DR+++LLPGQQ  LV+ VA  SK PVILV+  GG +DVSFA+
Sbjct: 495 DATVIIVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAK 554

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADS 636
           ++ +I+SILW+GYPGEAG  A+A++IFG +NP GRLPMTWYP+++  KVPM +MNMRAD 
Sbjct: 555 SNDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQAYVNKVPMTNMNMRADP 614

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
           +  YPGR+YRFY G  V+ FG G+S+++  +K + AP  +++  +     D         
Sbjct: 615 ATGYPGRTYRFYKGETVFSFGDGISFSSIEHKIVKAPQLVSVPLA----EDHECRSSECM 670

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
            LD    DE   C +L F + + V N G +  SHVV+LF   P V    P+K L+GF++V
Sbjct: 671 SLDIA--DE--HCQNLAFDIHLGVKNTGKMSTSHVVLLFFTPPDV-HNAPQKHLLGFEKV 725

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           H   K   ++ F VD C+ LS+ ++ G R +PLG H+L VG L+H L++
Sbjct: 726 HLPGKSEAQVRFKVDVCKDLSVVDELGNRKVPLGQHLLHVGNLKHPLSL 774


>gi|326517420|dbj|BAK00077.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 781

 Score =  753 bits (1945), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/775 (48%), Positives = 502/775 (64%), Gaps = 38/775 (4%)

Query: 40  PDFPCKPPHFDS---YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           P F C P    +   Y FC+ +L ++ RA  L++ LT  EK+ QL D A+ +PRLG+PAY
Sbjct: 36  PPFSCGPSSTAATQGYAFCDATLPVAQRAADLVARLTTAEKVAQLGDEAAGVPRLGVPAY 95

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           +WW+E+LHG+A++G G++FNG V S TSFPQV ++AA+F+  LW  IG A+  EARA+YN
Sbjct: 96  KWWNEALHGLATSGKGLHFNGAVRSATSFPQVSLTAAAFDDDLWLRIGQAIGREARALYN 155

Query: 157 LGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           +GQA GLT W+PN+NI+RDPRWGRGQETPGEDP   S Y V FVK  QG +  S      
Sbjct: 156 VGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYGVAFVKGLQGNSTSSS----- 210

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                                L  SACCKH  AYDLE WG  +RY+F+A +T QD EDT+
Sbjct: 211 --------------------LLQTSACCKHATAYDLEDWGGVARYNFDARVTAQDLEDTY 250

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLR-GDLFQKARNEWGFKGYITSDCDAVATIF 334
            PPFRSC+  GKASC+MC+Y  +NGVPAC   G L    R +WG  GY+ SDCDAVA + 
Sbjct: 251 NPPFRSCVVDGKASCVMCAYTAINGVPACANSGLLTNTVRADWGLDGYVASDCDAVAIMR 310

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           + Q Y  T ED+ A  LKAG+DI+CGT M +H  +A+ +GK+ E D+D+AL NLF++++R
Sbjct: 311 DAQRYAPTPEDAVALALKAGLDIDCGTYMQQHAPAALQQGKITEDDVDKALKNLFAIRMR 370

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG F+GDPR   YG L    +CT EH+ LAL+AA+ GIVLLKND   LPL++ A++S A+
Sbjct: 371 LGHFDGDPRANIYGGLNAAHICTPEHRSLALEAAQDGIVLLKNDAGILPLDRAAIASAAV 430

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IGP  NN   + G Y G PC   + L+G++ YV    + +GC    C+  A   +A  +A
Sbjct: 431 IGPNANNPGLLIGNYFGPPCESVTPLKGVQGYVKDVRFMAGCGSAACDV-ADTDQAATLA 489

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
             +D+V++  GL   QE+E RDR SLLLPGQQ SL+T+VA  +KRPVILVL  GGP+DV+
Sbjct: 490 GSSDYVLLFMGLSQQQESEGRDRTSLLLPGQQQSLITAVADAAKRPVILVLLTGGPVDVT 549

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           FA+ + +I +ILW GYPG+AG  A+A ++FGD NPGGRLP+TWYPE FTKVPM DM MRA
Sbjct: 550 FAKNNPKIGAILWAGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEFTKVPMTDMRMRA 609

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG-SDKNILQQ 693
           D +  YPGRSYRFY G  VY FG+GLSY++YS + LS+    T +  L AG S      +
Sbjct: 610 DPATGYPGRSYRFYQGETVYKFGYGLSYSSYSRRLLSSG---TPNTDLLAGLSTMPTPAE 666

Query: 694 TGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
            G    Y H++ + +  C  L+F   + V N G +DG H V+++ R    + G P KQLI
Sbjct: 667 EGGVASY-HVEHIGARGCEQLKFPAVVEVENHGPMDGKHSVLMYLRWANATAGRPAKQLI 725

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           GF R H  A     ++F + PCE  S   K G +++  G+H LMV      +T E
Sbjct: 726 GFRRQHLKAGEKASLTFDISPCEHFSRVRKDGNKVVDRGSHFLMVDMHEMEITFE 780


>gi|357489463|ref|XP_003615019.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
 gi|355516354|gb|AES97977.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
          Length = 785

 Score =  753 bits (1944), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 381/775 (49%), Positives = 505/775 (65%), Gaps = 34/775 (4%)

Query: 40  PDFPCK--PPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P + C    P   SY FCN +L+   RAK ++S LTL EK+ QL + A AIPRLGI +Y+
Sbjct: 34  PPYSCDITNPLTKSYTFCNLNLTTIQRAKDIVSRLTLDEKLAQLVNTAPAIPRLGIHSYQ 93

Query: 98  WWSESLHGIASNGPGVNFNG--TVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           WWSE+LHG+A  G G+  NG  T+ + T FPQV+++AASF+  LW  I   +  EARA+Y
Sbjct: 94  WWSEALHGVADYGKGIRLNGNVTIKAATIFPQVILTAASFDSKLWYRISKVIGTEARAVY 153

Query: 156 NLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
           N GQA G+TFWAPNINIFRDPRWGRGQET GEDP+V + YAV FV+  QG++++      
Sbjct: 154 NAGQAEGMTFWAPNINIFRDPRWGRGQETAGEDPLVSAKYAVSFVRGLQGDSFE------ 207

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
           G    E              D L  SACCKH  AYDL+ W    R+ F+A +T QD  DT
Sbjct: 208 GGKLNE--------------DRLKASACCKHFTAYDLDNWKGVDRFDFDANVTLQDLADT 253

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATI 333
           +QPPF SCI QG++S IMC+YN+VNG+P C   +L    AR +W F GYITSDC AV  I
Sbjct: 254 YQPPFHSCIVQGRSSGIMCAYNRVNGIPNCADYNLLTNTARKKWNFNGYITSDCSAVDII 313

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
            + Q Y K  ED+ A VL+AGMD+ CG     H++SA+ + KV    IDRAL NLFS+++
Sbjct: 314 HDRQGYAKAPEDAVADVLQAGMDVECGDYFTSHSKSAVLQKKVPISQIDRALHNLFSIRI 373

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           RLGLF+G P K KYGK+GP+ VC+ ++  +AL+AAR GIVLLKN    LPL K +  S+ 
Sbjct: 374 RLGLFDGHPTKLKYGKIGPNRVCSKQNLNIALEAARSGIVLLKNAASILPLPK-STDSIV 432

Query: 454 IIGPLVNNISQMG-GGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
           +IGP  N+ SQ+  G Y G PC+  ++L+G E Y     Y  GC D      A    AV 
Sbjct: 433 VIGPNANSSSQVVLGNYFGRPCNLVTILQGFENYSDNLLYHPGCSDGTKCVSAEIDRAVE 492

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
           +AK  D+V++V GLD +QE+E  DR  L LPG+Q  L+ SVA+ SKRPVILVL  GGP+D
Sbjct: 493 VAKVVDYVVLVMGLDQSQESEGHDRDDLELPGKQQELINSVAKASKRPVILVLFCGGPVD 552

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
           +SFA+ D +I  ILW GYPGE G  ALA+++FGD+NPGGRLPMTWYP+ F K+PM DM M
Sbjct: 553 ISFAKVDDKIGGILWAGYPGELGGMALAQVVFGDYNPGGRLPMTWYPKDFIKIPMTDMRM 612

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           RAD S  YPGR+YRFYTG +VY FG+GLSY+NYSY F+S  +    +  +   +  +IL+
Sbjct: 613 RADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFISVKNN---NLHINQSTTYSILE 669

Query: 693 QTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
           ++ + + Y  + E+   +C ++   V + +TN G + G H V+LF +  K   G P KQL
Sbjct: 670 KSQT-IHYKLVSELGKKACKTMSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQL 728

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           +GF+ V     G  E+ F V  CE LS AN+ G +++  G ++ +VGEL +S+ I
Sbjct: 729 VGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEEGGYLFLVGELEYSINI 783


>gi|359481045|ref|XP_002268626.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Vitis vinifera]
 gi|296089342|emb|CBI39114.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  752 bits (1942), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/783 (48%), Positives = 507/783 (64%), Gaps = 42/783 (5%)

Query: 30  FSNSKPVLNK--PDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN 84
            S+ K VL +  P F C     P    + FCNTSL  + R   L+  LTL+EKI  L ++
Sbjct: 25  LSSPKWVLGQSSPVFACDVENNPTLGQFGFCNTSLETAARVADLVKRLTLEEKIGFLVNS 84

Query: 85  ASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG 144
           A+++ RLGIP YEWWSE+LHG++  GPG +FN  V   TSFPQV+++AASFN SL+  IG
Sbjct: 85  AASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSIVPGATSFPQVILTAASFNASLFEAIG 144

Query: 145 SAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG 204
             V+ EARAMYN+G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ S YA  +V+  Q 
Sbjct: 145 KVVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASAYVRGLQ- 203

Query: 205 ENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA 264
              + DDG                      D L ++ACCKH  AYDL+ W    R  FNA
Sbjct: 204 ---QGDDG--------------------SPDRLKVAACCKHYTAYDLDNWKGVDRLHFNA 240

Query: 265 MITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYI 323
           ++T+QD +DTFQPPF+SC+  G  + +MCS+NQVNG P C   DL     R EW   GYI
Sbjct: 241 VVTKQDMDDTFQPPFKSCVIDGNVASVMCSFNQVNGKPTCADPDLLSGIVRGEWKLNGYI 300

Query: 324 TSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDR 383
            SDCD+V   +  Q+YTKT E++AA  + AG+D+NCG+ + +HT++A+  G V E  +D+
Sbjct: 301 VSDCDSVDVFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDK 360

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
           A+ N F+  +RLG F+G+P K  YGKLGP DVCTSEH+++A +AARQGIVLLKN K  LP
Sbjct: 361 AVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQEMAREAARQGIVLLKNSKGSLP 420

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNS 503
           L+  A+ +LAIIGP  N    M G Y G PC   + L+GL A V+ T Y  GC +V C +
Sbjct: 421 LSPTAIKTLAIIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATT-YLPGCSNVACGT 479

Query: 504 DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
            A   EA +IA  AD  +++ G+D + E E RDRVS+ LPGQQ  L+T VA+ SK  VIL
Sbjct: 480 -AQIDEAKKIAAAADATVLIVGIDQSIEAEGRDRVSIQLPGQQPLLITEVAKASKGNVIL 538

Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
           V+  GG  D+SFA+ D +I+SILW+GYPGEAG  A+A++IFG +NP GRLPMTWYP+S+ 
Sbjct: 539 VVMSGGGFDISFAKNDDKIASILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYV 598

Query: 624 -KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASL 682
            KVPM +MNMR D +  YPGR+YRFYTG  +Y FG GLSYT +++  + AP     S S+
Sbjct: 599 DKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPK----SVSI 654

Query: 683 KAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
                 +        +D V      SC +L F + + V NAG++ GSH V LF+  P V 
Sbjct: 655 PIEEGHSCHSSKCKSVDAVQ----ESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSV- 709

Query: 743 QGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS 802
             +P+K L+GF++V   AK    + F VD C+ LSI ++ G + + LG HVL VG L+HS
Sbjct: 710 HNSPQKHLLGFEKVFVTAKAEALVRFKVDVCKDLSIVDELGTQKVALGLHVLHVGSLKHS 769

Query: 803 LTI 805
           L +
Sbjct: 770 LNV 772


>gi|357156904|ref|XP_003577615.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 767

 Score =  752 bits (1941), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 380/777 (48%), Positives = 504/777 (64%), Gaps = 49/777 (6%)

Query: 40  PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           P F C      SY FC+ +L ++ RA  L+S LT  EK+ QL D A+ +PRLG+P Y+WW
Sbjct: 25  PPFSCGQAS-SSYAFCDAALPVAQRAADLVSRLTAAEKVAQLGDEAAGVPRLGVPGYKWW 83

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           +E+LHG+A++G G++F+G V S TSFPQV ++AA+F+  LW  IG A+  EARA+YNLGQ
Sbjct: 84  NEALHGLATSGKGLHFDGAVRSATSFPQVCLTAAAFDDDLWFRIGQAIGREARALYNLGQ 143

Query: 160 A-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           A GLT W+PN+NI+RDPRWGRGQETPGEDP   S YAV FV+  QG +            
Sbjct: 144 AEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYAVAFVRGMQGNSTSL--------- 194

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                             L  SACCKH  AYDLE W   +RY+F+A +T QD EDTF PP
Sbjct: 195 ------------------LQASACCKHATAYDLEDWNGVARYNFDAKVTAQDLEDTFNPP 236

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
           FRSC+  GKASC+MC+Y  +NGVPAC   DL  K  R +WG  GY  SDCDAVA + + Q
Sbjct: 237 FRSCVVDGKASCVMCAYTGINGVPACANADLLTKTVRGDWGLDGYTASDCDAVAIMRDAQ 296

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
            Y ++ ED+ A  LKAG+DI+CGT M +H  +AI +GK+ E+DID+AL NLF++++RLG 
Sbjct: 297 RYAQSPEDAVALALKAGLDIDCGTYMQQHAAAAIQQGKITEEDIDKALKNLFAIRMRLGH 356

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           F+GDPR   YG LG  D+CT+EH+ LALDAA+ GIVLLKND   LPL++ AV+S A+IGP
Sbjct: 357 FDGDPRTNMYGGLGAADICTAEHRSLALDAAQDGIVLLKNDAGILPLDRAAVASTAVIGP 416

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
             NN   +   Y G PC   + L+G++ YV    + +GC    C+  A   +A  +A  +
Sbjct: 417 NANNPGALIANYFGPPCESTTPLKGIQGYVKDARFLAGCSSTACDV-ATTDQAAALASTS 475

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D+V +  GL   QE+E RDR SLLLPG+Q SL+T+VA  ++RPVILVL  GGP+DV+FA+
Sbjct: 476 DYVFLFMGLGQRQESEGRDRTSLLLPGKQQSLITAVADAAQRPVILVLLSGGPVDVTFAQ 535

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
            + +I +ILW GYPG+AG  A+A ++FGD NP GRLP+TWYPE FT VPM DM MRAD +
Sbjct: 536 TNPKIGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYPEEFTNVPMTDMRMRADPA 595

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF----LSAP---SELTISASLKAGSDKNI 690
             YPGRSYRFY G  VY FG+GLSY++YS +      S P   ++L  S +    S +NI
Sbjct: 596 NGYPGRSYRFYQGKTVYKFGYGLSYSSYSRRLLSSGTSTPAPNADLLASLTTTMPSAENI 655

Query: 691 LQQTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
           L   GS     H++++ +  C  L+F   + V N G +DG   V+++ R P  + G PE+
Sbjct: 656 L---GS----YHVEQIGAQGCEMLKFPAVVEVQNHGPMDGKQSVLMYLRWPNATAGRPER 708

Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           QLIGF + H  A     I F + PCE LS   + G +++  G+H L V   +H L I
Sbjct: 709 QLIGFKKEHLKAGEKAHIKFEIRPCEHLSRVREDGNKVIDRGSHFLRVD--KHELEI 763


>gi|302760655|ref|XP_002963750.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
 gi|300169018|gb|EFJ35621.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
          Length = 785

 Score =  750 bits (1936), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/783 (46%), Positives = 506/783 (64%), Gaps = 32/783 (4%)

Query: 30  FSNSKPVLNKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNAS 86
            SN++    +P + C         S+PFC+T L +  R + L+S LTL EK+ ++ + A 
Sbjct: 25  LSNARVSTAQPRYACDVSSNASLGSFPFCDTKLGVDVRVQDLVSRLTLDEKVDEMVNAAQ 84

Query: 87  AIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSA 146
            IPRLG+P+Y+WW E+LHG+AS+ PGV F G   + TSFP  +  AASFN +L+ +IG A
Sbjct: 85  GIPRLGVPSYQWWQEALHGVASS-PGVQFGGLAPAATSFPMPIAMAASFNSTLFYSIGEA 143

Query: 147 VAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN 206
           V+ EARA++NLG+AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ S +A  +V+  QG  
Sbjct: 144 VSSEARALHNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGA 203

Query: 207 WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMI 266
           +    G    GF                  L +SACCKHL AYD++ W    RY FNA +
Sbjct: 204 YG---GSASDGF------------------LKVSACCKHLTAYDMDNWKGMDRYHFNAEV 242

Query: 267 TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITS 325
           +EQD  DT+ PPF+SCIE G+ S +MCSYN+VNGVP C  R  L +  RN WGF GYI S
Sbjct: 243 SEQDLVDTYNPPFQSCIEDGRVSSVMCSYNRVNGVPTCADRSLLTETVRNSWGFNGYIVS 302

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRAL 385
           DCDA+  +FE   Y  + ED+ A  + AG+D+NCGT + +H +SA+  GKV E D+D A+
Sbjct: 303 DCDALQVLFEDTTYAPSAEDAVADSILAGLDLNCGTFLGKHAKSALQAGKVTEADLDHAI 362

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
            NL   ++RLGLF+GD     Y  LG  D+C+++H++LALDAA QG+VLLKND   LPL+
Sbjct: 363 SNLMRTRMRLGLFDGDLNTRPYSSLGATDICSNDHQQLALDAALQGVVLLKNDGS-LPLS 421

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
             A+ ++A+IGP  N    M G Y GIPC   S L+G++ Y +   Y+ GC DV C+   
Sbjct: 422 -TALKTVALIGPNANATYTMLGNYEGIPCKYVSPLQGMQIYNNNILYSPGCRDVACSEGD 480

Query: 506 GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
               AV +A KAD V++V GLD +QE E  DR SLLLPG Q  LV+++A     P++LV+
Sbjct: 481 LVASAVEVATKADAVVLVVGLDQSQERETFDRTSLLLPGMQSQLVSNIANAVTCPIVLVI 540

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
              GP+D+S  + +S+ISS++WIGYPG++G  ALA ++FG +NPGGRLP TWY E FT V
Sbjct: 541 MSAGPVDISTFKDNSRISSVIWIGYPGQSGGAALAHVVFGAYNPGGRLPNTWYHEEFTNV 600

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT-ISASLKA 684
            M DM MR +    YPGRSYRFYTGT +Y FG GLSY+ Y YKFL AP+ L+   ++ + 
Sbjct: 601 SMLDMRMRPNPPSGYPGRSYRFYTGTPLYNFGDGLSYSTYLYKFLLAPTRLSFFKSNTRN 660

Query: 685 GSDKNILQQTGSRLDYVHI--DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
             D   + ++ +     H+  D++ +C S+ F V + V+N G   GSH V++F+  P V 
Sbjct: 661 SRDCPTVNRSEAEFGCFHLPADDLETCNSILFQVSVEVSNLGPRSGSHSVLIFSAPPPV- 719

Query: 743 QGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS 802
           +G P KQLI F +VH  +  ++ + FG+DPC+ LS   ++G+R L  G H L++G   H 
Sbjct: 720 EGAPLKQLIAFQKVHLESDTTQRLIFGIDPCKHLSSVRRNGKRFLHSGRHKLLIGNAVHI 779

Query: 803 LTI 805
           L +
Sbjct: 780 LDV 782


>gi|125534137|gb|EAY80685.1| hypothetical protein OsI_35867 [Oryza sativa Indica Group]
          Length = 779

 Score =  750 bits (1936), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/767 (47%), Positives = 485/767 (63%), Gaps = 39/767 (5%)

Query: 40  PDFPCKPPHFDS-YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           P F C P      + FCN +L    RA  L++ LT  EK+ QL D A  +PRLGIP Y+W
Sbjct: 35  PGFTCGPASAQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKW 94

Query: 99  WSESLHGIASNGPGVNF-NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WSE+LHG+A +G G++F NG   + TSFPQV+ +AA+F+  LW  IG A+  E RA YNL
Sbjct: 95  WSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNL 154

Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           GQA GL  W+PN+NIFRDPRWGRGQETPGEDP   S Y   FVK  QG +  +       
Sbjct: 155 GQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTN------- 207

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                               L  SACCKH+ AYD+E+W   SRY+FNA +T QD  DT+ 
Sbjct: 208 --------------------LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYN 247

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
           PPFRSC+  GKASCIMC+Y  +NGVPAC   DL  K  R EW   GY  SDCDAVA + +
Sbjct: 248 PPFRSCVVDGKASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHK 307

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            +++T+T E++ A  LKAG+DINCG  M ++  SA+ +GK+ EKD+D+AL NLF++++RL
Sbjct: 308 SEHFTRTAEEAVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRL 367

Query: 396 GLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           G F+GDPR  K YG+LG  DVCT  HK LAL+AAR+G+VLLKND + LPL    VSS A+
Sbjct: 368 GHFDGDPRGNKLYGRLGAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVSSAAV 427

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IG   N+I  + G Y G+PC   +   G++ YV    +  GC    C+  A   +A  +A
Sbjct: 428 IGHNANDILALLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALA 486

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           K +D+V +V GL   QE E  DR SLLLPG+Q +L+T+VA  SKRPVIL+L  GGP+D++
Sbjct: 487 KSSDYVFLVMGLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDIT 546

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           FA+ + +I +ILW GYPG+AG +A+A+++FG+FNP G+LP+TWYPE FTK  M DM MR 
Sbjct: 547 FAQTNPKIGAILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRP 606

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D +  YPGRSYRFY G  VY FG+GLSY+ ++ + +S       S+S    +   +   T
Sbjct: 607 DPATGYPGRSYRFYKGKTVYKFGYGLSYSKFACRIVSGAGN---SSSYGKAALAGLRAAT 663

Query: 695 GSRLDYVH-IDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
               D V+ +DE+    C  LRF V + V N G +DG H V++F R      G P +QLI
Sbjct: 664 TPEGDAVYRVDEIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLI 723

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           GF   H      K++   + PCE LS A   G +++  G+H LMV E
Sbjct: 724 GFRNQHLKVGEKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMVEE 770


>gi|62701898|gb|AAX92971.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|62733926|gb|AAX96035.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|77550045|gb|ABA92842.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
           expressed [Oryza sativa Japonica Group]
 gi|125576900|gb|EAZ18122.1| hypothetical protein OsJ_33667 [Oryza sativa Japonica Group]
          Length = 771

 Score =  750 bits (1936), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/768 (48%), Positives = 494/768 (64%), Gaps = 39/768 (5%)

Query: 39  KPDFPCKPPHFDS-YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           +P + C P    S Y FC+  L  + RA  L+S LT  EK+ QL D A  +PRLG+P Y+
Sbjct: 24  EPPYSCGPRSPSSGYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVPRLGVPPYK 83

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE LHG++  G G++FNG V+++TSFPQVL++AA+F+  LW  IG A+  EARA+YNL
Sbjct: 84  WWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYNL 143

Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           GQA GLT W+PN+NI+RDPRWGRGQETPGEDP   S YAV FVK  QG    +       
Sbjct: 144 GQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGSTPGT------- 196

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                               L  SACCKH  AYDLE+W   +RY+FNA +T QD  DTF 
Sbjct: 197 --------------------LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFN 236

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
           PPF+SC+   KASC+MC+Y  +NGVPAC   DL  K  R +WG  GY++SDCDAVA + +
Sbjct: 237 PPFKSCVVDAKASCVMCAYTDINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRD 296

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q Y  T ED+ A  +KAG+D+NCG     H  +A+ +GK++E D+DRAL NLF+V++RL
Sbjct: 297 AQRYAPTPEDTVAVAIKAGLDLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRL 356

Query: 396 GLFNGDPR-KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           G F+GDPR    YG LG  DVCT  H+ LAL+AA+ GIVLLKND   LPL++  V S A+
Sbjct: 357 GHFDGDPRSNAAYGHLGAADVCTQAHRDLALEAAQDGIVLLKNDAGALPLDRATVRSAAV 416

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IGP  N+ + + G Y G PC   + L+G++ Y+S   + +GC D P    A   +A  +A
Sbjct: 417 IGPNANDPAALNGNYFGPPCETTTPLQGVQRYISSVRFLAGC-DSPACGFAATGQAAALA 475

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
             +D VI+  GL   QE E  DR SLLLPG+Q SL+T+VA  ++RPVILVL  GGP+DV+
Sbjct: 476 SSSDQVIMFMGLSQDQEKEGLDRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVT 535

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           FA+ + +I +ILW GYPG+AG  A+A+++FGD NP GRLP+TWYPE FT++PM DM MRA
Sbjct: 536 FAKNNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTRIPMTDMRMRA 595

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D +  YPGRSYRFY G  VY FG+GLSY+ +S + ++A      + +L AG    I +  
Sbjct: 596 DPATGYPGRSYRFYQGNPVYKFGYGLSYSKFSRRLVAAAKPRRPNRNLLAGV---IPKPA 652

Query: 695 GSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQL 750
           G   +  H++E+    C  L+F   + V N G +DG H V++F R P  + G   P +QL
Sbjct: 653 GDGGESYHVEEIGEEGCERLKFPATVEVHNHGPMDGKHSVLVFVRWPNATAGASRPARQL 712

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           +GF   H  A     ++  ++PCE LS A + G +++  G+H L VGE
Sbjct: 713 VGFSSQHVRAGEKARLTMEINPCEHLSRAREDGTKVIDRGSHFLKVGE 760


>gi|297745522|emb|CBI40687.3| unnamed protein product [Vitis vinifera]
          Length = 751

 Score =  750 bits (1936), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/773 (49%), Positives = 500/773 (64%), Gaps = 65/773 (8%)

Query: 40  PDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           P F C     P    + FCNTSL  + R   L+  LTL+EKI  L ++A+++ RLGIP Y
Sbjct: 37  PVFACDVENNPTLGQFGFCNTSLETAARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKY 96

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           EWWSE+LHG++  GPG +FN  V   TSFPQV+++AASFN SL+  IG AV+ EARAMYN
Sbjct: 97  EWWSEALHGVSYVGPGTHFNSVVPGATSFPQVILTAASFNASLFEAIGKAVSTEARAMYN 156

Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           +G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ S YA  +V+  Q    +SDDG    
Sbjct: 157 VGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQ----QSDDG---- 208

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                             D L ++ACCKH  AYDL+ W    R+ FNA++T+QD +DTFQ
Sbjct: 209 ----------------SPDRLKVAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQ 252

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
           PPF+SC+  G  + +MCSYNQVNG PAC   DL     R EW   GYI SDCD+V   + 
Sbjct: 253 PPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYN 312

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q+YTKT E++AA  + AG+D+NCG+ + +HT++A+  G V E  +D+A+ N F+  +RL
Sbjct: 313 SQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRL 372

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G F+G+P K  YGKLGP DVCTSEH++LA +AARQGIVLLKN K  LPL+  A+ +LA+I
Sbjct: 373 GFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKNSKGSLPLSPTAIKTLAVI 432

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           GP  N    M G Y G PC   + L+GL A V+ T Y  GC +V C + A   EA +IA 
Sbjct: 433 GPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATT-YLPGCSNVACGT-AQIDEAKKIAA 490

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD  +++ G+D + E E RDRV++ LPGQQ  L+T VA+ SK  VILV+  GG  D+SF
Sbjct: 491 AADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISF 550

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
           A+ D +I+SILW+GYPGEAG  A+A++IFG +NP GRLPMTWYP+S+  KVPM +MNMR 
Sbjct: 551 AKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQSYVDKVPMTNMNMRP 610

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D +  YPGR+YRFYTG  +Y FG GLSYT +++                           
Sbjct: 611 DPASGYPGRTYRFYTGETIYTFGDGLSYTQFNH--------------------------- 643

Query: 695 GSRLDYVHIDEVT-SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                ++ +D V  SC +L F + + V NAG++ GSH V LF+  P V   +P+K L+GF
Sbjct: 644 -----HLSVDAVQESCQNLVFDIHLRVNNAGNISGSHTVFLFSSPPSV-HNSPQKHLLGF 697

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           ++V   AK    + F VD C+ LSI ++ G R + LG HVL VG L+HSL + 
Sbjct: 698 EKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNVR 750


>gi|224111912|ref|XP_002316021.1| predicted protein [Populus trichocarpa]
 gi|222865061|gb|EEF02192.1| predicted protein [Populus trichocarpa]
          Length = 768

 Score =  750 bits (1936), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/776 (46%), Positives = 503/776 (64%), Gaps = 42/776 (5%)

Query: 36  VLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGI 93
           V ++  F C P      S  FC  +L I  R + LI  LTLQEKI+ L +NA+A+PRLGI
Sbjct: 22  VESREPFACDPKLGLTRSLKFCRVNLPIHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGI 81

Query: 94  PAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA 153
             YEWWSE+LHG+++ GPG  F G     T+FPQV+ +AASFN SLW  IG  V+ EARA
Sbjct: 82  QGYEWWSEALHGVSNVGPGTKFGGAFPGATAFPQVITTAASFNESLWEEIGRVVSDEARA 141

Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
           MYN G AGLT+W+PN+N+FRDPRWGRGQETPGEDP+V   YA  +V+  QG N       
Sbjct: 142 MYNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNN------- 194

Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
                               G  L ++ACCKH  AYDL+ W    RY FNA +++QD ED
Sbjct: 195 --------------------GLRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLED 234

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
           T+  PF+SC+  GK + +MCSYNQVNG P C    L +   R EWG  GYI SDCD+V  
Sbjct: 235 TYNVPFKSCVVAGKVASVMCSYNQVNGKPTCADPYLLKNTIRGEWGLNGYIVSDCDSVGV 294

Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
           +F+ Q+YT T E++AA  ++AG+D++CG  +  HT++A+  G ++E+D++ AL N  +VQ
Sbjct: 295 LFDTQHYTATPEEAAASTIRAGLDLDCGPFLAIHTENAVKGGLLKEEDVNMALANTITVQ 354

Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
           +RLG+F+G+P    +G LGP DVCT  H++LAL AARQGIVLL+N  + LPL++  + ++
Sbjct: 355 MRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAARQGIVLLQNRGRTLPLSRT-LQTV 413

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
           A+IGP  +    M G Y G+ C   + L+G+  Y    H+  GC+DV CN +  F+ A  
Sbjct: 414 AVIGPNSDVTVTMIGNYAGVACGYTTPLQGIRRYAKTVHH-PGCNDVFCNGNQQFNAAEV 472

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
            A+ AD  I+V GLD + E E RDR  LLLPG Q  LV+ VAR S+ P ILVL  GGP+D
Sbjct: 473 AARHADATILVMGLDQSIEAEFRDRKGLLLPGYQQELVSIVARASRGPTILVLMSGGPID 532

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMN 631
           VSFA+ D +I +ILW+GYPG+AG  A+A+++FG  NPGG+LPMTWYP ++  KVPM +M 
Sbjct: 533 VSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGTANPGGKLPMTWYPHNYLAKVPMTNMG 592

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNI 690
           MRAD SR YPGR+YRFY G  V+ FGHG+SYT +++  + AP E+++  ASL      ++
Sbjct: 593 MRADPSRGYPGRTYRFYKGPVVFPFGHGMSYTTFAHSLVQAPREVSVPLASL------HV 646

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
            + T    + + +    +C +L   V I V N GD+DG+H +++F+  P     T +KQL
Sbjct: 647 SRNTTGASNAIRVSH-ANCEALALGVHIDVKNTGDMDGTHTLLVFSSPPGGKWST-QKQL 704

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           IGF++VH V    K +   +  C+ LS+ ++ G R +P+G H L +G+L+HS++++
Sbjct: 705 IGFEKVHLVTGSQKRVKIDIHVCKHLSVVDRFGIRRIPIGEHDLYIGDLKHSISLQ 760


>gi|224054312|ref|XP_002298197.1| predicted protein [Populus trichocarpa]
 gi|222845455|gb|EEE83002.1| predicted protein [Populus trichocarpa]
          Length = 741

 Score =  749 bits (1935), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/774 (49%), Positives = 505/774 (65%), Gaps = 42/774 (5%)

Query: 40  PDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           P F C     P   S+ FCNTSL +S R   L+  LTLQEKI  L ++A ++ RLGIP Y
Sbjct: 2   PVFACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRLGIPKY 61

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           EWWSE+LHG++  GPG +F+  V   TSFPQV+++AASFN SL+  IG  V+ EARAMYN
Sbjct: 62  EWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVVSTEARAMYN 121

Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           +G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S Y   +VK  Q    + DDG    
Sbjct: 122 VGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQ----QRDDG---- 173

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                             D L ++ACCKH  AYDL+ W    RY FNA++T+QD +DTFQ
Sbjct: 174 ----------------NPDGLKVAACCKHYTAYDLDNWKGVDRYHFNAVVTKQDMDDTFQ 217

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
           PPF+SC+  G  + +MCSYN+VNG+P C   DL     R EW   GYI +DCD++   + 
Sbjct: 218 PPFKSCVVDGNVASVMCSYNKVNGIPTCADPDLLSGVIRGEWKLNGYIVTDCDSIDVFYN 277

Query: 336 YQNYTKTHEDSAAGVLKAG--MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
            Q+YTKT E++AA  + AG  +D+NCG+ + +HT++A+  G V E  IDRA+ N F+  +
Sbjct: 278 SQHYTKTPEEAAAKAILAGIRLDLNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLM 337

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           RLG F+GDP K  YGKLGP DVCT+E+++LA +AARQGIVLLKN    LPL+  A+ +LA
Sbjct: 338 RLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLA 397

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
           +IGP  N    M G Y G PC   + L+GL A V+ T Y  GC +V C S A   +A +I
Sbjct: 398 VIGPNANVTKTMIGNYEGTPCKYTTPLQGLAALVATT-YLPGCSNVAC-STAQVDDAKKI 455

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           A  AD  ++V G DL+ E E RDRV +LLPGQQ  L+T+VA  S  PVILV+  GG +DV
Sbjct: 456 AAAADATVLVMGADLSIEAESRDRVDILLPGQQQLLITAVANASTGPVILVIMSGGGMDV 515

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNM 632
           SFA+ + +I+SILW+GYPGEAG  A+A+IIFG +NP GRLPMTWYP+S+  KVPM +MNM
Sbjct: 516 SFAKTNDKITSILWVGYPGEAGGAAIADIIFGSYNPSGRLPMTWYPQSYVDKVPMTNMNM 575

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           R D S  YPGR+YRFYTG  VY FG GLSY+ +S++   AP  +++        ++N + 
Sbjct: 576 RPDPSNGYPGRTYRFYTGETVYSFGDGLSYSEFSHELTQAPGLVSVPL------EENHVC 629

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
            + S    V   E T C +L F V + + N G   GSH V LF+  P V   +P+K L+G
Sbjct: 630 YS-SECKSVAAAEQT-CQNLTFDVHLRIKNTGTTSGSHTVFLFSTPPSV-HNSPQKHLVG 686

Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           F++V   A+    + F VD C+ LS+ ++ G + + LG HVL +G L+HS+T+ 
Sbjct: 687 FEKVFLHAQTDSHVGFKVDVCKDLSVVDELGSKKVALGEHVLHIGSLKHSMTVR 740


>gi|297611657|ref|NP_001067709.2| Os11g0291000 [Oryza sativa Japonica Group]
 gi|255680005|dbj|BAF28072.2| Os11g0291000 [Oryza sativa Japonica Group]
          Length = 764

 Score =  748 bits (1930), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/752 (49%), Positives = 488/752 (64%), Gaps = 38/752 (5%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           FC+  L+   RA  L++ LTL EK+ QL D A+ + RLG+PAYEWWSE LHG++  G G+
Sbjct: 31  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIF 172
            FNGTV +VTSFPQV+++AA+F+  LW  +G AV  EARA+YNLGQA GLT W+PN+NIF
Sbjct: 91  RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRGQETPGEDP+  S YAV FV   QG                       G E+ 
Sbjct: 151 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQG----------------------IGGEA- 187

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                  SACCKH  AYDL+ W N  RY++++ +T QD EDT+ PPF+SC+ +GKA+CIM
Sbjct: 188 -------SACCKHATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIM 240

Query: 293 CSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
           C YN +NGVPAC   DL  +K R EWG  GY+ SDCDAVATI +  +YT + ED+ A  +
Sbjct: 241 CGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSI 300

Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKL 410
           K GMD+NCG     H  +A+ KG + EKDIDRAL+NLF+V++RLG F+GDPR    YG L
Sbjct: 301 KVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHL 360

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
           G  DVC+  HK LAL+AA+ GIVLLKND   LPL  +AV+SLA+IGP  +N+  + G Y 
Sbjct: 361 GAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYF 420

Query: 471 GIPCSPKSLLRGLEAYVS-KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
           G PC   + L+G++ Y+  +  + +GC D P  + A  +EA  +A  +D V++  GL   
Sbjct: 421 GPPCETTTPLQGIKGYLGDRARFLAGC-DSPACAVAATNEAAALASSSDHVVLFMGLSQK 479

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
           QE +  DR SLLLPG+Q  L+T+VA  ++RPVILVL  GGP+DV+FA+ + +I +ILW G
Sbjct: 480 QEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAG 539

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
           YPG+AG  A+A+++FGD NP GRLP+TWYPE FTKVPM DM MRAD +  YPGRSYRFY 
Sbjct: 540 YPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQ 599

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTI-SASLKAGSDKNILQQTGSRLDYVHIDE--V 706
           G  VY FG+GLSY+ +S +  S+ S     + SL AG         G  +    + E  V
Sbjct: 600 GNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMARRAGDDGGGMSSYLVKEIGV 659

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
             C+ L F   + V N G +DG H V+++ R P  S G P +QLIGF   H        +
Sbjct: 660 ERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMV 719

Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           SF V PCE  S   + G R++  G H LMVG+
Sbjct: 720 SFEVSPCEHFSWVGEDGERVIDGGAHFLMVGD 751


>gi|62734691|gb|AAX96800.1| Glycosyl hydrolase family 3 C terminal domain, putative [Oryza
           sativa Japonica Group]
 gi|77549994|gb|ABA92791.1| beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 853

 Score =  747 bits (1929), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/767 (47%), Positives = 484/767 (63%), Gaps = 39/767 (5%)

Query: 40  PDFPCKPPHFDS-YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           P F C P      + FCN +L    RA  L++ LT  EK+ QL D A  +PRLGIP Y+W
Sbjct: 109 PGFTCGPASAQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKW 168

Query: 99  WSESLHGIASNGPGVNF-NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WSE+LHG+A +G G++F NG   + TSFPQV+ +AA+F+  LW  IG A+  E RA YNL
Sbjct: 169 WSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNL 228

Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           GQA GL  W+PN+NIFRDPRWGRGQETPGEDP   S Y   FVK  QG +  +       
Sbjct: 229 GQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTN------- 281

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                               L  SACCKH+ AYD+E+W   SRY+FNA +T QD  DT+ 
Sbjct: 282 --------------------LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYN 321

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
           PPFRSC+  GKASCIMC+Y  +NGVPAC   DL  K  R EW   GY  SDCDAVA + +
Sbjct: 322 PPFRSCVVDGKASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHK 381

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            +++T+T E++ A  LKAG+DINCG  M ++  SA+ +GK+ EKD+D+AL NLF++++RL
Sbjct: 382 SEHFTRTAEEAVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRL 441

Query: 396 GLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           G F+GDPR  K YG+L   DVCT  HK LAL+AAR+G+VLLKND + LPL    V+S A+
Sbjct: 442 GHFDGDPRGNKLYGRLSAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAV 501

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IG   N+I  + G Y G+PC   +   G++ YV    +  GC    C+  A   +A  +A
Sbjct: 502 IGHNANDILALLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALA 560

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           K +D+V +V GL   QE E  DR SLLLPG+Q +L+T+VA  SKRPVIL+L  GGP+D++
Sbjct: 561 KSSDYVFLVMGLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDIT 620

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           FA+ + +I +ILW GYPG+AG +A+A+++FG+FNP G+LP+TWYPE FTK  M DM MR 
Sbjct: 621 FAQTNPKIGAILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRP 680

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D +  YPGRSYRFY G  VY FG+GLSY+ ++ + +S       S+S    +   +   T
Sbjct: 681 DPATGYPGRSYRFYKGKTVYKFGYGLSYSKFACRIVSGAGN---SSSYGKAALAGLRAAT 737

Query: 695 GSRLDYVH-IDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
               D V+ +DE+    C  LRF V + V N G +DG H V++F R      G P +QLI
Sbjct: 738 TPEGDAVYRVDEIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLI 797

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           GF   H      K++   + PCE LS A   G +++  G+H LMV E
Sbjct: 798 GFRNQHLKVGEKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMVEE 844


>gi|225437531|ref|XP_002270249.1| PREDICTED: probable beta-D-xylosidase 2 [Vitis vinifera]
 gi|297743965|emb|CBI36935.3| unnamed protein product [Vitis vinifera]
          Length = 768

 Score =  747 bits (1929), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/793 (45%), Positives = 503/793 (63%), Gaps = 48/793 (6%)

Query: 18  VLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDS--YPFCNTSLSISTRAKSLISLLTLQ 75
           +L  +VV+ V++     P      F C P    +  +PFC  S+ I  R K LI  LTL+
Sbjct: 11  LLIFLVVLAVVSGEARDP------FACDPKDGANAGFPFCRKSIGIGERVKDLIGRLTLE 64

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           EK++ L +NA+ +PRLGI  YEWWSE+LHG+++ GPG  F+G     TSFPQV+ +AASF
Sbjct: 65  EKVRLLVNNAAGVPRLGIKGYEWWSEALHGVSNVGPGTKFSGDFPGATSFPQVITTAASF 124

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
           N SLW  IG  V+ EARAMYN G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++   YA
Sbjct: 125 NSSLWEAIGQVVSDEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAGKYA 184

Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
             +V+  QG                           + GD L ++ACCKH  AYDL+ W 
Sbjct: 185 ARYVRGLQG---------------------------NAGDRLKVAACCKHFTAYDLDNWN 217

Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
              R+ F+A +++Q+ EDTF  PFRSC+ +GK + +MCSYNQVNGVP C   +L +   R
Sbjct: 218 GVDRFHFDARVSKQEMEDTFDVPFRSCVVEGKVASVMCSYNQVNGVPTCADPNLLRNTVR 277

Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKG 374
            +W   GY+ SDCD+V   ++ Q+YT T E++AA  +KAG+D++CG  +  HTQ AI KG
Sbjct: 278 KQWHLNGYVVSDCDSVGVFYDNQHYTNTPEEAAADAIKAGLDLDCGPFLAVHTQDAIKKG 337

Query: 375 KVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVL 434
            V E D+D AL+N  +VQ+RLG+F+G+P    +G LGP DVC+  H++LA++AARQGIVL
Sbjct: 338 LVSEADVDSALVNTVTVQMRLGMFDGEPSAQPFGDLGPKDVCSPAHQELAIEAARQGIVL 397

Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
           LKN    LPL+  +  S+A+IGP  +    M G Y GIPC   + L+G+  Y S+T +  
Sbjct: 398 LKNHGHSLPLSTRSHRSIAVIGPNSDANVTMIGNYAGIPCEYTTPLQGIGRY-SRTIHQK 456

Query: 495 GCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVA 554
           GC DV C+ D  F  A+  A +AD  ++V GLD + E E +DR  LLLPG+Q  LV+ VA
Sbjct: 457 GCADVACSEDQLFAGAIDAASQADATVLVMGLDQSIEAEAKDRADLLLPGRQQELVSKVA 516

Query: 555 RTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLP 614
             S+ P +LVL  GGP+DVSFA+ D +I++I+W GYPG+AG  A+A+I+FG  NPGG+LP
Sbjct: 517 MASRGPTVLVLMSGGPVDVSFAKKDPRIAAIVWAGYPGQAGGAAIADILFGVANPGGKLP 576

Query: 615 MTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
           MTWYP+ + +KVPM  M MRA  S+ YPGR+YRFY G  VY FGHGLSYTN+ +    AP
Sbjct: 577 MTWYPQEYLSKVPMTTMAMRAIPSKAYPGRTYRFYKGPVVYRFGHGLSYTNFVHTIAQAP 636

Query: 674 SELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVM 733
           + + I   L    +  +   +G  +   H      C  L   + + V N G+ DGSH ++
Sbjct: 637 TAVAI--PLHGHHNTTV---SGKAIRVTH----AKCNRLSIALHLDVKNVGNKDGSHTLL 687

Query: 734 LFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHV 793
           +F++ P      P KQL+ F++VH  A+  + +   +  C+ LS+ ++ G R +P+G H 
Sbjct: 688 VFSK-PPAGHWAPHKQLVAFEKVHVAARTQQRVQINIHVCKYLSVVDRSGIRRIPMGQHG 746

Query: 794 LMVGELRHSLTIE 806
           L +G+ +H ++++
Sbjct: 747 LHIGDTKHLVSLQ 759


>gi|350534908|ref|NP_001233910.1| beta-D-xylosidase 1 precursor [Solanum lycopersicum]
 gi|37359706|dbj|BAC98298.1| LEXYL1 [Solanum lycopersicum]
          Length = 770

 Score =  747 bits (1929), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/774 (48%), Positives = 496/774 (64%), Gaps = 40/774 (5%)

Query: 38  NKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
           N P F C     P   +  FC+ SL++  R   L++ LTL EKI  L   A  + RLGIP
Sbjct: 31  NSPVFACDVTSNPALGNLTFCDASLAVENRVNDLVNRLTLGEKIGFLVSGAGGVSRLGIP 90

Query: 95  AYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
            YEWWSE+LHG+A  GPGV+F   V   TSFPQV+++AASFN +L+  IG  V+ EARAM
Sbjct: 91  KYEWWSEALHGVAYTGPGVHFTSLVPGATSFPQVILTAASFNVTLFQTIGKVVSTEARAM 150

Query: 155 YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
           YN+G AGLT+W+PN+NIFRDPRWGRGQETPGEDP + S Y V +V+  Q    ++DDG  
Sbjct: 151 YNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLTSKYGVAYVEGLQ----QTDDGST 206

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
                               ++L ++ACCKH  AYD++ W    RYSFNA++ +QD +DT
Sbjct: 207 --------------------NKLKVAACCKHYTAYDVDNWKGIERYSFNAVVRQQDLDDT 246

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATI 333
           FQPPFRSC+ +G  + +MCSYNQVNG P C   +L     R EW   GYI +DCD++  I
Sbjct: 247 FQPPFRSCVLEGAVASVMCSYNQVNGKPTCGDPNLLAGIVRGEWKLNGYIVTDCDSLQVI 306

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
           F+ QNYTKT E++AA  L +G+D+NCG+ +  +TQ A+++  V E  IDRA+ N F+  +
Sbjct: 307 FKSQNYTKTPEEAAALGLNSGVDLNCGSWLSTYTQGAVNQKLVNESVIDRAISNNFATLM 366

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           RLG F+G+P+   YG LGP DVCT E+++LA +AARQGIVLLKN    LPL   A+ SLA
Sbjct: 367 RLGFFDGNPKSRIYGNLGPKDVCTPENQELAREAARQGIVLLKNTAGSLPLTPTAIKSLA 426

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
           +IGP  N    M G Y GIPC   + L+GL A V+ T Y  GC DV CN+ A   +A +I
Sbjct: 427 VIGPNANVTKTMIGNYEGIPCKYTTPLQGLTASVA-TIYKPGCADVSCNT-AQIDDAKQI 484

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           A  AD V++V G D + E E  DR S+ LPGQQ  LV  VA+ +K PVILV+  GG +DV
Sbjct: 485 ATTADAVVLVMGSDQSIEKESLDRTSITLPGQQSILVAEVAKVAKGPVILVIMSGGGMDV 544

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNM 632
            FA  + +I+SILW+G+PGEAG  ALA++IFG +NP GRLPMTWYP+S+   VPM DMNM
Sbjct: 545 QFAVDNPKITSILWVGFPGEAGGAALADVIFGYYNPSGRLPMTWYPQSYADVVPMTDMNM 604

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           R + +  YPGR+YRFYTG  V+ FGHGLSY+ + +    AP  +    SL  G       
Sbjct: 605 RPNPATNYPGRTYRFYTGPTVFTFGHGLSYSQFKHHLDKAPQFV----SLPLGEKHTCRL 660

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
                +D V      SC+++ F + + V N G + GSH++ LF   P V    P+K L+G
Sbjct: 661 SKCKTVDAVG----QSCSNMGFDIHLRVKNVGKISGSHIIFLFTSPPSV-HNAPKKHLLG 715

Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           F++VH   +G   + F V+ C+ LS+ ++ G R + LG HVL +G+L+HSLT+ 
Sbjct: 716 FEKVHLTPQGEGVVKFNVNVCKHLSVHDELGNRKVALGPHVLHIGDLKHSLTVR 769


>gi|115485163|ref|NP_001067725.1| Os11g0297300 [Oryza sativa Japonica Group]
 gi|113644947|dbj|BAF28088.1| Os11g0297300 [Oryza sativa Japonica Group]
          Length = 779

 Score =  746 bits (1926), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/767 (47%), Positives = 484/767 (63%), Gaps = 39/767 (5%)

Query: 40  PDFPCKPPHFDS-YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           P F C P      + FCN +L    RA  L++ LT  EK+ QL D A  +PRLGIP Y+W
Sbjct: 35  PGFTCGPASAQKGFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRLGIPVYKW 94

Query: 99  WSESLHGIASNGPGVNF-NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WSE+LHG+A +G G++F NG   + TSFPQV+ +AA+F+  LW  IG A+  E RA YNL
Sbjct: 95  WSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKEGRAFYNL 154

Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           GQA GL  W+PN+NIFRDPRWGRGQETPGEDP   S Y   FVK  QG +  +       
Sbjct: 155 GQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTN------- 207

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                               L  SACCKH+ AYD+E+W   SRY+FNA +T QD  DT+ 
Sbjct: 208 --------------------LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYN 247

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
           PPFRSC+  GKASCIMC+Y  +NGVPAC   DL  K  R EW   GY  SDCDAVA + +
Sbjct: 248 PPFRSCVVDGKASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHK 307

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            +++T+T E++ A  LKAG+DINCG  M ++  SA+ +GK+ EKD+D+AL NLF++++RL
Sbjct: 308 SEHFTRTAEEAVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRL 367

Query: 396 GLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           G F+GDPR  K YG+L   DVCT  HK LAL+AAR+G+VLLKND + LPL    V+S A+
Sbjct: 368 GHFDGDPRGNKLYGRLSAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAV 427

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IG   N+I  + G Y G+PC   +   G++ YV    +  GC    C+  A   +A  +A
Sbjct: 428 IGHNANDILALLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALA 486

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           K +D+V +V GL   QE E  DR SLLLPG+Q +L+T+VA  SKRPVIL+L  GGP+D++
Sbjct: 487 KSSDYVFLVMGLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDIT 546

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           FA+ + +I +ILW GYPG+AG +A+A+++FG+FNP G+LP+TWYPE FTK  M DM MR 
Sbjct: 547 FAQTNPKIGAILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRP 606

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D +  YPGRSYRFY G  VY FG+GLSY+ ++ + +S       S+S    +   +   T
Sbjct: 607 DPATGYPGRSYRFYKGKTVYKFGYGLSYSKFACRIVSGAGN---SSSYGKAALAGLRAAT 663

Query: 695 GSRLDYVH-IDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
               D V+ +DE+    C  LRF V + V N G +DG H V++F R      G P +QLI
Sbjct: 664 TPEGDAVYRVDEIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLI 723

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           GF   H      K++   + PCE LS A   G +++  G+H LMV E
Sbjct: 724 GFRNQHLKVGEKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMVEE 770


>gi|356524862|ref|XP_003531047.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Glycine max]
          Length = 765

 Score =  744 bits (1921), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/771 (47%), Positives = 497/771 (64%), Gaps = 44/771 (5%)

Query: 42  FPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           F C   K P    Y FC+ SL +  R K L+  LTLQEKI  L ++A  + RLGIP YEW
Sbjct: 30  FACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAVDVSRLGIPKYEW 89

Query: 99  WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           WSE+LHG+++ GPG  F+  +   TSFP  +++AASFN SL+  IG  V+ EARAMYN+G
Sbjct: 90  WSEALHGVSNVGPGTRFSNVIPGATSFPMPILTAASFNTSLFEVIGRVVSTEARAMYNVG 149

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
            AGLT+W+PNINIFRDPRWGRG ETPGEDP++ S YA  +VK  Q               
Sbjct: 150 LAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQ--------------- 194

Query: 219 REKRVLKGFGEESDRGD--ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                      ++D GD  +L ++ACCKH  AYD++ W    RY+FNA++T+QD EDTFQ
Sbjct: 195 -----------QTDGGDPNKLKVAACCKHYTAYDVDNWKGIQRYTFNAVVTKQDMEDTFQ 243

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
           PPF+SC+  G  + +MCSYN+VNG P C   DL +   R EW   GYI SDCD+V  +++
Sbjct: 244 PPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVVRGEWKLNGYIVSDCDSVEVLYK 303

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q+YTKT E++AA  + AG+D+NCG  + ++T+ A+ +G + E  I+ A+ N F+  +RL
Sbjct: 304 DQHYTKTPEEAAAISILAGLDLNCGRFLGQYTEGAVKQGLIDEASINNAVTNNFATLMRL 363

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G F+GDPRK  YG LGP DVCT E+++LA +AARQGIVLLKN    LPLN  A+ SLA+I
Sbjct: 364 GFFDGDPRKQPYGNLGPKDVCTQENQELAREAARQGIVLLKNSPASLPLNAKAIKSLAVI 423

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           GP  N    M G Y GIPC   S L+GL A+ + T YA+GC DV C +     +A +IA 
Sbjct: 424 GPNANATRVMIGNYEGIPCKYISPLQGLTAF-APTSYAAGCLDVRCPNPV-LDDAKKIAA 481

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD  ++V G  L  E E  DRV++LLPGQQ  LV+ VA  SK PVILV+  GG +DVSF
Sbjct: 482 SADATVIVVGASLAIEAESLDRVNILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSF 541

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
           A+ +++I+SILW+GYPGEAG  A+A++IFG  NP GRLPMTWYP+S+  KVPM +MNMR 
Sbjct: 542 AKNNNKITSILWVGYPGEAGGAAIADVIFGFHNPSGRLPMTWYPQSYVDKVPMTNMNMRP 601

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D +  YPGR+YRFY G  V+ FG GLSY++  +K + AP  +    S++   D       
Sbjct: 602 DPATGYPGRTYRFYKGETVFAFGDGLSYSSIVHKLVKAPQLV----SVQLAEDHVCRSSE 657

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
              +D V       C +L F + + + N G +  +H V LF+  P V    P+K L+GF+
Sbjct: 658 CKSIDVVG----EHCQNLVFDIHLRIKNKGKMSSAHTVFLFSTPPAV-HNAPQKHLLGFE 712

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           +VH + K    +SF VD C+ LSI ++ G R + LG H+L VG+L+H L++
Sbjct: 713 KVHLIGKSEALVSFKVDVCKDLSIVDELGNRKVALGQHLLHVGDLKHPLSV 763


>gi|253761874|ref|XP_002489311.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
 gi|241946959|gb|EES20104.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
          Length = 791

 Score =  744 bits (1921), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/775 (47%), Positives = 483/775 (62%), Gaps = 35/775 (4%)

Query: 36  VLNKPDFPCKPPH-FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
           V   P F C P       PFCN  L  S RA  L+S +T  EK  QL D A+ +PRLG+P
Sbjct: 43  VAGAPPFSCGPSSPSKGLPFCNMKLPASQRAADLVSRMTPAEKASQLGDIANGVPRLGVP 102

Query: 95  AYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
           +Y+WW+E+LHG+A +G G++ N  V S TSFPQVL +AASFN +LW  IG A   EARA 
Sbjct: 103 SYKWWNEALHGVAISGKGIHMNQGVRSATSFPQVLHTAASFNDNLWFRIGQATGKEARAF 162

Query: 155 YNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
           YN+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V S Y   FV+  QG +       
Sbjct: 163 YNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYGAAFVRGLQGSS------- 215

Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
                                  L  SACCKH  AYDLE W   SRYSF A +T QD  D
Sbjct: 216 --------------SNTKSVPPVLQTSACCKHATAYDLEDWKGVSRYSFKATVTIQDLAD 261

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
           TF PPFRSC+  GKASC+MC+Y  VNGVP+C  GDL  K  R  WG  GY+ +DCDAVA 
Sbjct: 262 TFNPPFRSCVVDGKASCVMCAYTIVNGVPSCANGDLLTKTFRGSWGLDGYVAADCDAVAI 321

Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
           +   Q Y  T ED+ A  LKAG+DI+CG  + ++  +AI KGK+ ++D+D+A+ NL + +
Sbjct: 322 MRNSQFYRPTAEDTVAATLKAGLDIDCGPYIQQYAMAAIQKGKLTQQDVDKAVKNLLTTR 381

Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
           +RLG F+GDP+   YG LG   +CT+EHK LAL+AA  GIVLLKN    LPL +  V+S 
Sbjct: 382 MRLGHFDGDPKTNVYGNLGAGHICTAEHKNLALEAALDGIVLLKNSAGVLPLKRGTVNSA 441

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
           A+IG   N++  + G Y G PC+P + L+G++ YV    + +GC+   CN  A   +A  
Sbjct: 442 AVIGHNANDVLALLGNYWGPPCAPTTPLQGIQGYVKNVKFLAGCNKAACNV-AATPQATA 500

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
           +A  +D VI+  GL   QE+E +DR +LLLPG Q SL+ +VA  +KRPVILVL  GGP+D
Sbjct: 501 LASSSDAVILFMGLSQEQESEGKDRTTLLLPGNQQSLINAVANAAKRPVILVLLTGGPVD 560

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
           ++FA+A+ +I +ILW GYPG+AG  A+A+++FG+ NP G+LP TWYPE FT++PM DM M
Sbjct: 561 ITFAQANPKIGAILWAGYPGQAGGLAIAKVLFGEKNPSGKLPNTWYPEEFTRIPMTDMRM 620

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           RA  S  YPGR+YRFY G  +Y FG+GLSY+ +S++ ++       + SL A     +  
Sbjct: 621 RAAGS--YPGRTYRFYNGKTIYKFGYGLSYSKFSHRVVTGRKNPAHNTSLLAAG---LAA 675

Query: 693 QTGSRLDY--VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
            T   L Y   HI +V  C  L+F   + V N G +DG H  ++F R P  + G P +QL
Sbjct: 676 MTEDNLSYHVEHIGDVV-CDQLKFLAVVKVQNHGPIDGKHTALMFLRWPSATDGRPTRQL 734

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           IGF   H  A     + F V PCE  S   + GR+++  G+H L VG  +H L I
Sbjct: 735 IGFQSQHIKAGEKANLRFEVSPCEHFSRVRQDGRKVIDKGSHFLKVG--KHELEI 787


>gi|255548487|ref|XP_002515300.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223545780|gb|EEF47284.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 768

 Score =  743 bits (1919), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/756 (46%), Positives = 487/756 (64%), Gaps = 39/756 (5%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           PFC   L I  R K LI  LTL EK+  L +NA A+ RLGI  YEWWSE+LHG+++ GPG
Sbjct: 41  PFCQVKLPIQDRVKDLIGRLTLAEKVGLLVNNAGAVSRLGIKGYEWWSEALHGVSNVGPG 100

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
             F G+    TSFPQV+ +AASFN +LW  IG  V+ EARAMYN G AGLT+W+PN+NI 
Sbjct: 101 TKFGGSFPGATSFPQVITTAASFNSTLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIL 160

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRGQETPGEDP++V  YA  +VK  QG +                          
Sbjct: 161 RDPRWGRGQETPGEDPLLVGKYAASYVKGLQGND-------------------------- 194

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
            G+ L ++ACCKH  AYDL+ W    R+ FNA +++QD +DTF  PFR C+++GK + +M
Sbjct: 195 -GERLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMKDTFDVPFRMCVKEGKVASVM 253

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
           CSYNQVNG+P C   +L +K  R +WG  GYI SDCD+V   ++ Q+YT T E++AA  +
Sbjct: 254 CSYNQVNGIPTCADPNLLRKTVRTQWGLNGYIVSDCDSVGVFYDKQHYTSTPEEAAADAI 313

Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
           KAG+D++CG  +  HTQ A+ +G + E D++ AL N  +VQ+RLG+F+G+P    YG LG
Sbjct: 314 KAGLDLDCGPFLAVHTQDAVKRGLISEADVNGALFNTLTVQMRLGMFDGEPSAQPYGNLG 373

Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
           P DVCT  H++LAL+A RQGIVLLKN    LPL+     ++AIIGP  N    M G Y G
Sbjct: 374 PKDVCTPAHQELALEAGRQGIVLLKNHGPSLPLSPRRHRTVAIIGPNSNVTVTMIGNYAG 433

Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
           + C   + L+G+ +Y +KT +  GC DV C +D  F  A+  A++AD  ++V GLD + E
Sbjct: 434 VACQYTTPLQGIGSY-AKTIHQQGCADVGCVTDQLFSGAIDAARQADATVLVMGLDQSIE 492

Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
            E RDR  LLLPG+Q  LV+ VA  SK P ILVL  GGP+DVSFA+ D +I++ILW GYP
Sbjct: 493 AEFRDRTGLLLPGRQQELVSKVAMASKGPTILVLMSGGPIDVSFAKKDPKIAAILWAGYP 552

Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTG 650
           G+AG  A+A+++FG  NPGG+LPMTWYP+ + T +PM +M MR+  S+ YPGR+YRFY G
Sbjct: 553 GQAGGAAIADVLFGTINPGGKLPMTWYPQEYITNLPMTEMAMRSSQSKGYPGRTYRFYQG 612

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
             VY FGHG+SYT++ +   SAP+ +++      G+       +G  +   H    T C 
Sbjct: 613 KVVYPFGHGMSYTHFVHNIASAPTMVSVPLDGHRGNT----SISGKAIRVTH----TKCN 664

Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
            L   +Q+ V N G  DG+H +++++  P   + +P KQL+ F+RVH  A   + +   +
Sbjct: 665 KLSLGIQVDVKNVGSKDGTHTLLVYS-APPAGRWSPHKQLVAFERVHVSAGTQERVGISI 723

Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             C+ LS+ ++ G R +P+G H + +G ++HS++++
Sbjct: 724 HVCKLLSVVDRSGIRRIPIGEHSIHIGNVKHSVSLQ 759


>gi|125534112|gb|EAY80660.1| hypothetical protein OsI_35838 [Oryza sativa Indica Group]
          Length = 771

 Score =  743 bits (1918), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/769 (47%), Positives = 492/769 (63%), Gaps = 41/769 (5%)

Query: 39  KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           +P + C P  P    Y FC+  L  + RA  L+S LT  EK+ QL D A  + RLG+P Y
Sbjct: 24  EPPYSCGPRSPSL-GYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVARLGVPPY 82

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           +WWSE LHG++  G G++FNG V+++TSFPQVL++AA+F+  LW  IG A+  EARA+YN
Sbjct: 83  KWWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEARALYN 142

Query: 157 LGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           LGQA GLT W+PN+NI+RDPRWGRGQETPGEDP   S YAV FVK  QG    +      
Sbjct: 143 LGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQGSTPGT------ 196

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                                L  SACCKH  AYDLE+W   +RY+FNA +T QD  DTF
Sbjct: 197 ---------------------LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTF 235

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
            PPF+SC+   KASC+MC+Y  +NGVPAC   DL  K  R +WG  GY++SDCDAVA + 
Sbjct: 236 NPPFKSCVVDAKASCVMCAYTDINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLR 295

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           + Q Y  T ED+ A  +KAG+D+NCG     H  +A+ +GK++E D+DRAL NLF+V++R
Sbjct: 296 DAQRYAPTPEDTVAVAIKAGLDLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMR 355

Query: 395 LGLFNGDPR-KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           LG F+GDPR    YG LG  DVCT  H+ LAL+AA+ GIVLLKND   LPL++  V S A
Sbjct: 356 LGHFDGDPRSNAAYGHLGAADVCTQAHRDLALEAAQNGIVLLKNDAGALPLDRATVRSAA 415

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
           +IGP  N+ + + G Y G PC   + L+G++ Y+S   + +GC D P    A   +A  +
Sbjct: 416 VIGPNANDPAALNGNYFGPPCETTTPLQGVQRYISSVRFLAGC-DSPACGFAATGQAAAL 474

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           A  +D VI+  GL   QE E  DR SLLLPG+Q SL+T+VA  ++RPVILVL  GGP+DV
Sbjct: 475 ASSSDQVIMFMGLSQDQEKEGLDRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDV 534

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
           +FA+ + +I +ILW GYPG+AG  A+A+++FGD NP GRLP+TWYPE FT++PM DM MR
Sbjct: 535 TFAKNNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTRIPMTDMRMR 594

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
           AD +  YPGRSYRFY G  VY FG+GLSY+ ++ + ++A      + +L AG    I + 
Sbjct: 595 ADPATGYPGRSYRFYQGNPVYKFGYGLSYSKFTRRLVAAAKPRRPNRNLLAGV---IPKP 651

Query: 694 TGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQ 749
            G   +  H++E+    C  L+F   + V N G +DG H V++F + P  + G   P +Q
Sbjct: 652 AGDGGESYHVEEIGEEGCERLKFPATVEVHNHGPMDGKHSVLVFVQWPNATAGASRPARQ 711

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           L+GF   H  A     ++  ++PCE LS A   G +++  G+H L VGE
Sbjct: 712 LVGFSSQHVRAGEKARLTMEINPCEHLSRARDDGTKVIDRGSHFLKVGE 760


>gi|413925164|gb|AFW65096.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 829

 Score =  742 bits (1916), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/803 (46%), Positives = 501/803 (62%), Gaps = 46/803 (5%)

Query: 14  ILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLT 73
           +L  ++ L+V   ++    +      P F C        PFCNT L  + RA  L+S +T
Sbjct: 58  LLAILMHLLVPALLLMRVVAVVAAGAPPFSCGGGPSLGLPFCNTKLPAAQRAADLVSRMT 117

Query: 74  LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN-GTVSSVTSFPQVLVSA 132
             EK  QL D A+ +PRLG+P+Y+WW+E+LHG+A +G G++ + G V S TSFPQVL++A
Sbjct: 118 PAEKASQLGDVANGVPRLGVPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTA 177

Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
           ASFN +LW  IG A   EARA YN+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V 
Sbjct: 178 ASFNDNLWFRIGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVA 237

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S YA  FV+  QG +                              L+ SACCKH  AYDL
Sbjct: 238 SRYAAAFVRGLQGSS---------------------SNTKSVPPVLLTSACCKHATAYDL 276

Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
           E W   +RYSF A +T QD  DTF PPFRSC+  GKASC+MC+Y  VNGVP+C   DL  
Sbjct: 277 EDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYTSVNGVPSCANADLLT 336

Query: 312 KA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSA 370
           K  R  WG  GY+ +DCDAV+ +   Q Y  T ED+ A  LKAG+DI+CG  + +H  +A
Sbjct: 337 KTFRGSWGLDGYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAGLDIDCGPYVQQHAMAA 396

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
           I KGK+ ++D+D+A+ NLF+ ++RLG F+GDP+   YG LG   +CT EHK LAL+AA  
Sbjct: 397 IQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAHICTQEHKNLALEAALD 456

Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKT 490
           GIVLLKN    LPL + +V+S A+IG   N++  + G Y G PC+P + L+G++ YV   
Sbjct: 457 GIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPCAPTTPLQGIQGYVKNV 516

Query: 491 HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLV 550
            + +GCH   CN  A   +A  +A  +D VI+  GL   QE+E +DR +LLLPG Q SL+
Sbjct: 517 RFLAGCHKAACNVAA-TPQAAALASTSDSVILFMGLSQEQESEGKDRTTLLLPGNQQSLI 575

Query: 551 TSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPG 610
           T+VA  +KRPVILVL  GGP+D++FA+A+ +I +ILW GYPG+AG  A+A+++FG+ NP 
Sbjct: 576 TAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAGGLAIAKVLFGEKNPS 635

Query: 611 GRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL 670
           GRLP+TWYPE FTKVPM DM MR+  S  YPGRSYRFY G  +Y FG+GLSY+ +S++ +
Sbjct: 636 GRLPVTWYPEEFTKVPMTDMRMRSAGS--YPGRSYRFYKGKTIYKFGYGLSYSKFSHRVV 693

Query: 671 SA------PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVTN 722
           +A       + L ++A   A ++ N        L Y H+D +    C  L+F   + V N
Sbjct: 694 TARNNPAHNTTLLLAAGHAATTEDN--------LSY-HVDHIGDELCRQLKFLAVVKVQN 744

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G +DG H  ++F R P  + G P +QL+GF   H  A     + F V PCE  S     
Sbjct: 745 HGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAGEKAHLRFEVSPCEDFSRVRDD 804

Query: 783 GRRILPLGNHVLMVGELRHSLTI 805
           GR+++  G+H L VG  +H L I
Sbjct: 805 GRKVIDKGSHFLKVG--KHELEI 825


>gi|297797477|ref|XP_002866623.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
 gi|297312458|gb|EFH42882.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
          Length = 784

 Score =  742 bits (1915), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/779 (47%), Positives = 499/779 (64%), Gaps = 45/779 (5%)

Query: 38  NKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
           + P F C     P   +Y FCNT L I  R   L++ LTLQEKI  L   A+ + RLGIP
Sbjct: 40  SSPVFACDVAANPSLAAYGFCNTVLKIEYRVADLVARLTLQEKIGFLVSKANGVTRLGIP 99

Query: 95  AYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
            YEWWSE+LHG++  GPG +F+  V   TSFPQV+++AASFN SL+  IG  V+ EARAM
Sbjct: 100 TYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQVILTAASFNVSLFQAIGKVVSTEARAM 159

Query: 155 YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
           YN+G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ S YA  +VK  Q           
Sbjct: 160 YNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLASKYASGYVKGLQ----------- 208

Query: 215 GFGFREKRVLKGFGEESDRGD--ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTE 272
                          E+D GD   L ++ACCKH  AYD++ W    RYSFNA++T+QD +
Sbjct: 209 ---------------ETDGGDSNRLKVAACCKHYTAYDVDNWKGVERYSFNAVVTQQDMD 253

Query: 273 DTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVA 331
           DT+QPPF+SC+  G  + +MCSYNQVNG P C   DL     R EW   GYI SDCD+V 
Sbjct: 254 DTYQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVD 313

Query: 332 TIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
            +++ Q+YTKT  ++AA  + AG+D+NCG+ + +HT+ A+  G V E  ID+A+ N F  
Sbjct: 314 VLYKNQHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLT 373

Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
            +RLG F+G+P+   YG LGP DVCTS +++LA DAARQGIVLLKN   FLPL+  ++ +
Sbjct: 374 LMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKN-TGFLPLSPKSIKT 432

Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAV 511
           LA+IGP  N    M G Y G PC   + L+GL   VS T Y  GC +V C + A    A 
Sbjct: 433 LAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGAVSTT-YLPGCSNVAC-AVADVAGAT 490

Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
           ++A  AD  +++ G D + E E RDRV L LPGQQ  LV  VA+ +K PV+LV+  GG  
Sbjct: 491 KLAATADVTVLLIGADQSIEAESRDRVDLNLPGQQQELVIQVAKAAKGPVLLVIMSGGGF 550

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDM 630
           D++FA+ D +I+ ILW+GYPGEAG  A+A+IIFG +NP GRLPMTWYP+S+  KVPM  M
Sbjct: 551 DITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGRLPMTWYPQSYVEKVPMTIM 610

Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
           NMR D S+ YPGR+YRFYTG  VY FG GLSYT +S+  + APS +++S       + ++
Sbjct: 611 NMRPDKSKGYPGRTYRFYTGETVYAFGDGLSYTKFSHSLVKAPSLVSLSL-----EENHV 665

Query: 691 LQQTGSR-LDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
            + +  + LD +  H +   S     F VQI V N GD +G H V LF   P +  G+P 
Sbjct: 666 CRSSECQSLDAIGPHCENAVSGGGSAFEVQIKVRNGGDREGIHTVFLFTTPPAI-HGSPR 724

Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           K L+GF+++         + F V+ C+ LS+ ++ G+R + LG H+L VG+L+HSL+I 
Sbjct: 725 KHLLGFEKIRLGKMEEAVVRFKVEVCKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSIR 783


>gi|292630923|sp|A5JTQ3.1|XYL2_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 2;
           AltName: Full=Xylan
           1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 2;
           Short=MsXyl2; Includes: RecName: Full=Beta-xylosidase;
           AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
           Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
           Full=Alpha-N-arabinofuranosidase; AltName:
           Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
           Flags: Precursor
 gi|146762263|gb|ABQ45228.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
           varia]
          Length = 774

 Score =  741 bits (1913), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/769 (47%), Positives = 495/769 (64%), Gaps = 40/769 (5%)

Query: 42  FPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           F C   K P   +Y FCN  LS+  R K L+  LTLQEK+  L ++A  + RLGIP YEW
Sbjct: 39  FACDVAKNPALANYGFCNKKLSVDARVKDLVRRLTLQEKVGNLVNSAVDVSRLGIPKYEW 98

Query: 99  WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           WSE+LHG+++ GPG +F+  +   TSFP  ++ AASFN SL+  IG  V+ EARAM+N+G
Sbjct: 99  WSEALHGVSNIGPGTHFSNVIPGATSFPMPILIAASFNASLFQTIGKVVSTEARAMHNVG 158

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
            AGLT+W+PNINIFRDPRWGRGQETPGEDP++ S YA  +VK  Q    ++DDG      
Sbjct: 159 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLASKYAAGYVKGLQ----QTDDG------ 208

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                           ++L ++ACCKH  AYD++ W    RY+FNA++T+QD +DT+QPP
Sbjct: 209 --------------DSNKLKVAACCKHYTAYDVDDWKGVQRYTFNAVVTQQDLDDTYQPP 254

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
           F+SC+  G  + +MCSYNQVNG P C   DL +   R +W   GYI SDCD+V  +F+ Q
Sbjct: 255 FKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVDVLFKNQ 314

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           +YTKT E++AA  + AG+D+NCG+ + R+T+ A+ +G + E  I+ A+ N F+  +RLG 
Sbjct: 315 HYTKTPEEAAAKSILAGLDLNCGSFLGRYTEGAVKQGLIGEASINNAVYNNFATLMRLGF 374

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           F+GDP K  YG LGP DVCTS +++LA +AARQGIVLLKN    LPLN  A+ SLA+IGP
Sbjct: 375 FDGDPSKQPYGNLGPKDVCTSANQELAREAARQGIVLLKNCAGSLPLNAKAIKSLAVIGP 434

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
             N    M G Y GIPC   S L+GL A V  T +A+GC DV C ++A   +A +IA  A
Sbjct: 435 NANATRAMIGNYEGIPCKYTSPLQGLTALVP-TSFAAGCPDVQC-TNAALDDAKKIAASA 492

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D  ++V G +L  E E  DR+++LLPGQQ  LVT VA  +K PVIL +  GG +DVSFA+
Sbjct: 493 DATVIVVGANLAIEAESHDRINILLPGQQQQLVTEVANVAKGPVILAIMSGGGMDVSFAK 552

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADS 636
            + +I+SILW+GYPGEAG  A+A++IFG  NP GRLPMTWYP+S+  KVPM +MNMR D 
Sbjct: 553 TNKKITSILWVGYPGEAGGAAIADVIFGYHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDP 612

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
           +  YPGR+YRFY G  V+ FG G+SY+ + +K + AP  +++  +     D         
Sbjct: 613 ATGYPGRTYRFYKGETVFSFGDGISYSTFEHKLVKAPQLVSVPLA----EDHVCRSSKCK 668

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
            LD V       C +L F + + + N G +  S  V LF+  P V    P+K L+ F++V
Sbjct: 669 SLDVVG----EHCQNLAFDIHLRIKNKGKMSSSQTVFLFSTPPAV-HNAPQKHLLAFEKV 723

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
               K    +SF VD C+ L + ++ G R + LG H+L VG+L+H L++
Sbjct: 724 LLTGKSEALVSFKVDVCKDLGLVDELGNRKVALGKHMLHVGDLKHPLSV 772


>gi|253761860|ref|XP_002489304.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
 gi|241946952|gb|EES20097.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
          Length = 750

 Score =  740 bits (1910), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/792 (47%), Positives = 499/792 (63%), Gaps = 48/792 (6%)

Query: 22  IVVVNVIAFSNSKPVLNKPDFPCKPPHFD-SYPFCNTSLSISTRAKSLISLLTLQEKIQQ 80
           ++++  + FS++  V + P F C P     +YPFC+ SL  + RA  L+S LT+ EK+ Q
Sbjct: 1   MLLLAAVFFSSA--VASDPLFSCGPSSPSRAYPFCDRSLPAARRAADLVSRLTVAEKVSQ 58

Query: 81  LSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLW 140
           L D A+ +PRLG+P Y+WWSE LHG+A  G G+ FNGTV+ VTSFPQVL++ ASF+  LW
Sbjct: 59  LGDEAAGVPRLGVPPYKWWSEGLHGLAFWGHGMRFNGTVTGVTSFPQVLLTTASFDDGLW 118

Query: 141 SNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFV 199
             IG A+  EARA+YNLGQA GLT W+PN+NIFRDPRWGRGQETPGEDP V S YAV FV
Sbjct: 119 FRIGQAIGREARALYNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFV 178

Query: 200 KSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSR 259
           +  QG                          +     L  SACCKH  AYDLE W   +R
Sbjct: 179 RGIQGS-----------------------SAAGAAAPLQASACCKHATAYDLEDWNGVAR 215

Query: 260 YSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWG 318
           Y+F+A +T QD  DTF PPF+SC+  GKA+C+MC+Y  +NGVPAC   DL  K  R  WG
Sbjct: 216 YNFDARVTAQDLADTFNPPFQSCVVDGKATCVMCAYTGINGVPACASSDLLTKTFRGAWG 275

Query: 319 FKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQE 378
             GY++SDCDAVA + + Q Y  T ED+ A  LK             H  +AI +GK+ E
Sbjct: 276 HDGYVSSDCDAVAIMHDAQRYVPTPEDTVAVALK------------EHGMAAIQQGKMTE 323

Query: 379 KDIDRALLNLFSVQLRLGLFNGDPR-KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
           KD+D+AL NLF+V++RLG F+GDPR    YG LG  DVCT++HK LAL+AA+ GIVLLKN
Sbjct: 324 KDVDKALTNLFAVRMRLGHFDGDPRGNALYGHLGAADVCTADHKNLALEAAQDGIVLLKN 383

Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
           D   LPL+++A+ S A+IG   N+   + G Y G  C   + L+G+++YVS   + +GC 
Sbjct: 384 DAGILPLDRSAMGSAAVIGHNANDALVLRGNYFGPACETTTPLQGVQSYVSNVRFLAGCS 443

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
              C   A   +A  +A  +++V +  GL   QE E  DR SLLLPG+Q SL+T+VA  +
Sbjct: 444 SAACGYAA-TGQAAALASSSEYVFLFMGLSQDQEKEGLDRTSLLLPGKQQSLITAVASAA 502

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           KRPVILVL  GGP+D++FA+++ +I +ILW GYPG+AG  A+A ++FGD NP GRLP+TW
Sbjct: 503 KRPVILVLLTGGPVDITFAQSNPKIGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTW 562

Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
           YPE FTKVPM DM MRAD +  YPGRSYRFY G  +Y FG+GLSY+ +S + ++      
Sbjct: 563 YPEEFTKVPMTDMRMRADPANGYPGRSYRFYRGNTIYKFGYGLSYSKFSRQLVTGGKNQL 622

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLF 735
             ASL AG          +   Y H+D++ +  C  LRF  ++ V N G +DG H V++F
Sbjct: 623 --ASLLAGLSATTKDDDAT--SYYHVDDIGADGCEQLRFPAEVEVQNHGPMDGKHSVLMF 678

Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
            R P  + G P  QLIGF   H  A     + F V PCE  S A   G++++  G+H LM
Sbjct: 679 LRWPNATDGRPVSQLIGFTSQHIKAGEKANVRFDVRPCEHFSRARADGKKVIDRGSHFLM 738

Query: 796 VGELRHSLTIET 807
           VG+    ++ E 
Sbjct: 739 VGKEEVEVSFEA 750


>gi|242077366|ref|XP_002448619.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
 gi|241939802|gb|EES12947.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
          Length = 767

 Score =  739 bits (1908), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/770 (48%), Positives = 503/770 (65%), Gaps = 40/770 (5%)

Query: 40  PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P F C   +    SY FCN S S S RA  L+S LTL EK+  L D  +A+PRLGIP YE
Sbjct: 32  PVFACDASNATLASYGFCNRSASASARAADLVSRLTLAEKVGFLVDKQAALPRLGIPLYE 91

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG++  GPG  F+  V + TSFPQ +++AASFN +L+  IG  V+ EARAM+N+
Sbjct: 92  WWSEALHGVSYVGPGTRFSSLVPAATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNV 151

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
           G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V   Q       D G G G
Sbjct: 152 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQ-------DAGSGSG 204

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                              L ++ACCKH  AYD++ W    RY+FNA++++QD +DTFQP
Sbjct: 205 ------------------SLKVAACCKHYTAYDVDNWKGVERYTFNAVVSQQDLDDTFQP 246

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
           PF+SC+  G  + +MCSYNQVNG P C   DL     R +W   GYI+SDCD+V  ++  
Sbjct: 247 PFKSCVVDGNVASVMCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNN 306

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           Q+YTKT ED+AA  +KAG+D+NCG  + +HT +A+  GK+ E D+DRA+ N F   +RLG
Sbjct: 307 QHYTKTPEDAAAISIKAGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFITLMRLG 366

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
            F+GDPRK  +G LGP DVCTS +++LA +AARQGIVLLKN    LPL+ +++ SLA+IG
Sbjct: 367 FFDGDPRKLPFGNLGPSDVCTSSNQELAREAARQGIVLLKNSGA-LPLSASSIKSLAVIG 425

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
           P  N    M G Y G PC   + L+GL A V+ T Y  GC +V C+ ++     A + A 
Sbjct: 426 PNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAA 484

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD  ++V G D + E E  DR SLLLPGQQ  LV++VA  S+ P ILV+  GGP D+SF
Sbjct: 485 SADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASRGPCILVIMSGGPFDISF 544

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
           A++  +I++ILW+GYPGEAG  A+A+++FG  NP GRLP+TWYPESFTKVPM DM MR D
Sbjct: 545 AKSSDKIAAILWVGYPGEAGGAAIADVLFGHHNPSGRLPVTWYPESFTKVPMIDMRMRPD 604

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
           +S  YPGR+YRFYTG  VY FG GLSYT++++  +SAP ++    +L+       L +  
Sbjct: 605 ASTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQV----ALQLAEGHTCLTEQC 660

Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
             ++     E   C  L F V + V NAGD+ G+H V LF+  P V    P K L+GF++
Sbjct: 661 PSVEA----EGAHCEGLAFDVHLRVRNAGDMSGAHTVFLFSSPPAV-HNAPAKHLLGFEK 715

Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           V      +  ++F VD C+ LS+ ++ G R + LGNH L VG+L+H+L++
Sbjct: 716 VSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGNHTLHVGDLKHTLSL 765


>gi|356501877|ref|XP_003519750.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 772

 Score =  738 bits (1906), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/772 (46%), Positives = 490/772 (63%), Gaps = 43/772 (5%)

Query: 42  FPCKPPHF--DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           F C P +    + PFC  SL+   R K LI  LTLQEK+  L +NA+A+PRLGI  YEWW
Sbjct: 28  FACDPKNTATKNLPFCKASLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEWW 87

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE+LHG+++ GPG  F G   + TSFPQV+ +AASFN SLW  IG   + EARAMYN G 
Sbjct: 88  SEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGGT 147

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           AGLT+W+PN+NIFRDPRWGRGQETPGEDP++   YA  +V+  QG +             
Sbjct: 148 AGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTD------------- 194

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
                         G+ L ++A CKH  AYDL+ W    R+ FNA +++QD EDTF  PF
Sbjct: 195 --------------GNRLKVAASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPF 240

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           R C+++GK + +MCSYNQVNGVP C    L ++  R +WG  GYI SDCD+V   +  Q+
Sbjct: 241 RMCVKEGKVASVMCSYNQVNGVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQH 300

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           YT T E++AA  +KAG+D++CG  + +HTQ+A+ KG + E D++ ALLN  +VQ+RLG++
Sbjct: 301 YTSTPEEAAADAIKAGLDLDCGPFLGQHTQNAVKKGLISEADVNGALLNTLTVQMRLGMY 360

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +G+P    Y  LGP DVCT  H++LAL+AARQGIVLLKN    LPL+     ++A+IGP 
Sbjct: 361 DGEPSSHPYNNLGPRDVCTQSHQELALEAARQGIVLLKNKGPSLPLSTRRGRTVAVIGPN 420

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
            N    M G Y GI C   S L+G+  Y +KT Y  GC +V C  D  F  A+  A++AD
Sbjct: 421 SNVTFTMIGNYAGIACGYTSPLQGIGTY-TKTIYEHGCANVACTDDKQFGRAINAAQQAD 479

Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
             ++V GLD + E E  DR SLLLPG Q  LV+ VA  SK P ILV+  GGP+D++FA+ 
Sbjct: 480 ATVLVMGLDQSIEAETVDRASLLLPGHQQDLVSKVAAASKGPTILVIMSGGPVDITFAKN 539

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSS 637
           D +I  ILW GYPG+AG  A+A+I+FG  NPGG+LPMTWYP+ + K +PM +M MRA  S
Sbjct: 540 DPRIQGILWAGYPGQAGGAAIADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRS 599

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLKAGSDKNILQQTGS 696
           + YPGR+YRFY G  VY FG+GLSYT++ +   SAP  ++I     + G+  NI  +   
Sbjct: 600 KGYPGRTYRFYNGPVVYPFGYGLSYTHFVHTLTSAPKLVSIPVDGHRHGNSSNIANKA-- 657

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG--TPEKQLIGFD 754
            +   H      C  L  ++ + V N G  DG H +++F+  P    G   P KQL+ F+
Sbjct: 658 -IKVTH----ARCGKLSINLHVDVKNVGSKDGIHTLLVFS-APPAGNGHWAPHKQLVAFE 711

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           +VH  AK  + +   +  C+ LS+ ++ G R +P+G H L +G+++HS++++
Sbjct: 712 KVHIPAKAQQRVRVKIHVCKLLSVVDRSGTRRIPMGLHSLHIGDVKHSVSLQ 763


>gi|356556038|ref|XP_003546334.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
          Length = 775

 Score =  738 bits (1905), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/770 (45%), Positives = 500/770 (64%), Gaps = 37/770 (4%)

Query: 42  FPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           F C P +     + FCNT + I  R + LI+ LTL EKI+ + +NA A+PRLGI  YEWW
Sbjct: 37  FACDPRNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGYEWW 96

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE+LHG+++ GPG  F G     T FPQV+ +AASFN+SLW  IG  V+ EARAMYN GQ
Sbjct: 97  SEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYNGGQ 156

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           AGLT+W+PN+NIFRDPRWGRGQETPGEDP + + YA  +VK  QG++             
Sbjct: 157 AGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQGDS------------- 203

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
                         G+ L ++ACCKH  AYDL+ W    R+ FNA +++QD EDT+  PF
Sbjct: 204 -------------AGNHLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPF 250

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           ++C+ +G+ + +MCSYNQVNG P C   DL +   R +W   GYI SDCD+V   F+ Q+
Sbjct: 251 KACVLEGQVASVMCSYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVFFDNQH 310

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           YTKT E++AA  +KAG+D++CG  +  HT SAI KG + E D++ AL NL SVQ+RLG+F
Sbjct: 311 YTKTPEEAAAEAIKAGLDLDCGPFLAIHTDSAIRKGLISENDLNLALANLISVQMRLGMF 370

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +G+P    YG LGP DVCTS H++LAL+AAR+ IVLL+N    LPL+ + + ++ ++GP 
Sbjct: 371 DGEPSTQPYGNLGPRDVCTSAHQQLALEAARESIVLLQNKGNSLPLSPSRLRTIGVVGPN 430

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
            +    M G Y G+ C   + L+G+  YV KT +  GC  V C  +  F  A  IA++AD
Sbjct: 431 ADATVTMIGNYAGVACGYTTPLQGIARYV-KTAHQVGCRGVACRGNELFGAAETIARQAD 489

Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
            +++V GLD T E E RDRV LLLPG Q  LVT VAR +K PVIL++  GGP+D+SFA+ 
Sbjct: 490 AIVLVMGLDQTVEAETRDRVGLLLPGLQQELVTRVARAAKGPVILLIMSGGPVDISFAKN 549

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSS 637
           D +IS+ILW+GYPG+AG  A+A++IFG  NPGGRLPMTWYP+ +  KVPM +M+MR + +
Sbjct: 550 DPKISAILWVGYPGQAGGTAIADVIFGTTNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPT 609

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
             YPGR+YRFY G  V+ FGHGLSY+ +S+    AP ++++          + L     +
Sbjct: 610 TGYPGRTYRFYKGPVVFPFGHGLSYSRFSHSLALAPKQVSVPIMSLQALTNSTLSSKAVK 669

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
           + + + D+     SL     + V N G +DG+H +++F++ P   + +  KQL+GF + H
Sbjct: 670 VSHANCDD-----SLEMEFHVDVKNEGSMDGTHTLLIFSQPPH-GKWSQIKQLVGFHKTH 723

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
            +A   + +  GV  C+ LS+ ++ G R +P G H L +G+++HS++++T
Sbjct: 724 VLAGSKQRVKVGVHVCKHLSVVDQFGVRRIPTGEHELHIGDVKHSISVQT 773


>gi|413925166|gb|AFW65098.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 830

 Score =  738 bits (1904), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/804 (46%), Positives = 501/804 (62%), Gaps = 47/804 (5%)

Query: 14  ILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLT 73
           +L  ++ L+V   ++    +      P F C        PFCNT L  + RA  L+S +T
Sbjct: 58  LLAILMHLLVPALLLMRVVAVVAAGAPPFSCGGGPSLGLPFCNTKLPAAQRAADLVSRMT 117

Query: 74  LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN-GTVSSVTSFPQVLVSA 132
             EK  QL D A+ +PRLG+P+Y+WW+E+LHG+A +G G++ + G V S TSFPQVL++A
Sbjct: 118 PAEKASQLGDVANGVPRLGVPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTA 177

Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
           ASFN +LW  IG A   EARA YN+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V 
Sbjct: 178 ASFNDNLWFRIGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVA 237

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S YA  FV+  QG +                              L+ SACCKH  AYDL
Sbjct: 238 SRYAAAFVRGLQGSS---------------------SNTKSVPPVLLTSACCKHATAYDL 276

Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
           E W   +RYSF A +T QD  DTF PPFRSC+  GKASC+MC+Y  VNGVP+C   DL  
Sbjct: 277 EDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYTSVNGVPSCANADLLT 336

Query: 312 KA-RNEWGFKG-YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
           K  R  WG  G Y+ +DCDAV+ +   Q Y  T ED+ A  LKAG+DI+CG  + +H  +
Sbjct: 337 KTFRGSWGLDGRYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAGLDIDCGPYVQQHAMA 396

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
           AI KGK+ ++D+D+A+ NLF+ ++RLG F+GDP+   YG LG   +CT EHK LAL+AA 
Sbjct: 397 AIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKAHVYGNLGAAHICTQEHKNLALEAAL 456

Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK 489
            GIVLLKN    LPL + +V+S A+IG   N++  + G Y G PC+P + L+G++ YV  
Sbjct: 457 DGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLALLGNYWGPPCAPTTPLQGIQGYVKN 516

Query: 490 THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSL 549
             + +GCH   CN  A   +A  +A  +D VI+  GL   QE+E +DR +LLLPG Q SL
Sbjct: 517 VRFLAGCHKAACNVAA-TPQAAALASTSDSVILFMGLSQEQESEGKDRTTLLLPGNQQSL 575

Query: 550 VTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
           +T+VA  +KRPVILVL  GGP+D++FA+A+ +I +ILW GYPG+AG  A+A+++FG+ NP
Sbjct: 576 ITAVANAAKRPVILVLLTGGPVDITFAQANPKIGAILWAGYPGQAGGLAIAKVLFGEKNP 635

Query: 610 GGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF 669
            GRLP+TWYPE FTKVPM DM MR+  S  YPGRSYRFY G  +Y FG+GLSY+ +S++ 
Sbjct: 636 SGRLPVTWYPEEFTKVPMTDMRMRSAGS--YPGRSYRFYKGKTIYKFGYGLSYSKFSHRV 693

Query: 670 LSA------PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVT 721
           ++A       + L ++A   A ++ N        L Y H+D +    C  L+F   + V 
Sbjct: 694 VTARNNPAHNTTLLLAAGHAATTEDN--------LSY-HVDHIGDELCRQLKFLAVVKVQ 744

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G +DG H  ++F R P  + G P +QL+GF   H  A     + F V PCE  S    
Sbjct: 745 NHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAGEKAHLRFEVSPCEDFSRVRD 804

Query: 782 HGRRILPLGNHVLMVGELRHSLTI 805
            GR+++  G+H L VG  +H L I
Sbjct: 805 DGRKVIDKGSHFLKVG--KHELEI 826


>gi|15237736|ref|NP_201262.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
 gi|75262663|sp|Q9FLG1.1|BXL4_ARATH RecName: Full=Beta-D-xylosidase 4; Short=AtBXL4; Flags: Precursor
 gi|10178060|dbj|BAB11424.1| beta-xylosidase [Arabidopsis thaliana]
 gi|332010539|gb|AED97922.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
          Length = 784

 Score =  736 bits (1900), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/797 (47%), Positives = 503/797 (63%), Gaps = 47/797 (5%)

Query: 21  LIVVVNVIAFSNSKPVLNKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEK 77
           LI +   + F N     + P F C     P   +Y FCNT L I  R   L++ LTLQEK
Sbjct: 23  LIFLCFFLYFLNFSNAQSSPVFACDVAANPSLAAYGFCNTVLKIEYRVADLVARLTLQEK 82

Query: 78  IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNR 137
           I  L   A+ + RLGIP YEWWSE+LHG++  GPG +F+  V   TSFPQV+++AASFN 
Sbjct: 83  IGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQVILTAASFNV 142

Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
           SL+  IG  V+ EARAMYN+G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ S YA  
Sbjct: 143 SLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLASKYASG 202

Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD--ELMLSACCKHLIAYDLEKWG 255
           +VK  Q                          E+D GD   L ++ACCKH  AYD++ W 
Sbjct: 203 YVKGLQ--------------------------ETDGGDSNRLKVAACCKHYTAYDVDNWK 236

Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
              RYSFNA++T+QD +DT+QPPF+SC+  G  + +MCSYNQVNG P C   DL     R
Sbjct: 237 GVERYSFNAVVTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIR 296

Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKG 374
            EW   GYI SDCD+V  +++ Q+YTKT  ++AA  + AG+D+NCG+ + +HT+ A+  G
Sbjct: 297 GEWKLNGYIVSDCDSVDVLYKNQHYTKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSG 356

Query: 375 KVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVL 434
            V E  ID+A+ N F   +RLG F+G+P+   YG LGP DVCTS +++LA DAARQGIVL
Sbjct: 357 LVNEAAIDKAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVL 416

Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
           LKN    LPL+  ++ +LA+IGP  N    M G Y G PC   + L+GL   VS T Y  
Sbjct: 417 LKN-TGCLPLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGTVSTT-YLP 474

Query: 495 GCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVA 554
           GC +V C + A    A ++A  AD  ++V G D + E E RDRV L LPGQQ  LV  VA
Sbjct: 475 GCSNVAC-AVADVAGATKLAATADVSVLVIGADQSIEAESRDRVDLHLPGQQQELVIQVA 533

Query: 555 RTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLP 614
           + +K PV+LV+  GG  D++FA+ D +I+ ILW+GYPGEAG  A+A+IIFG +NP G+LP
Sbjct: 534 KAAKGPVLLVIMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLP 593

Query: 615 MTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
           MTWYP+S+  KVPM  MNMR D +  YPGR+YRFYTG  VY FG GLSYT +S+  + AP
Sbjct: 594 MTWYPQSYVEKVPMTIMNMRPDKASGYPGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAP 653

Query: 674 SELTISASLKAGSDKNILQQTGS--RLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGS 729
           S +++      G ++N + ++     LD +  H +   S     F V I V N GD +G 
Sbjct: 654 SLVSL------GLEENHVCRSSECQSLDAIGPHCENAVSGGGSAFEVHIKVRNGGDREGI 707

Query: 730 HVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL 789
           H V LF   P +  G+P K L+GF+++    +    + F V+ C+ LS+ ++ G+R + L
Sbjct: 708 HTVFLFTTPPAI-HGSPRKHLVGFEKIRLGKREEAVVRFKVEICKDLSVVDEIGKRKIGL 766

Query: 790 GNHVLMVGELRHSLTIE 806
           G H+L VG+L+HSL+I 
Sbjct: 767 GKHLLHVGDLKHSLSIR 783


>gi|356503923|ref|XP_003520749.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 775

 Score =  734 bits (1896), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/796 (44%), Positives = 500/796 (62%), Gaps = 39/796 (4%)

Query: 16  FRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLT 73
           F  L  ++ V ++ F        +  F C P +   ++ PFC  SL+I  R K L+  LT
Sbjct: 5   FSPLLNLIAVFLLLFLVRHTCEARDPFACDPKNGATENMPFCKASLAIPERVKDLVGRLT 64

Query: 74  LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAA 133
           LQEK++ L +NA+A+PRLG+  YEWWSE+LHG+++ GPGV FN      TSFPQV+ +AA
Sbjct: 65  LQEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNVGPGVKFNAQFPGATSFPQVITTAA 124

Query: 134 SFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           SFN SLW  IG  V+ EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++   
Sbjct: 125 SFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGT 184

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
           YA  +V+  QG +                           G+ L ++ACCKH  AYDL+ 
Sbjct: 185 YAASYVRGLQGTD---------------------------GNRLKVAACCKHFTAYDLDN 217

Query: 254 WGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK- 312
           W    R+ FNA +++QD E+TF  PFR C+ +GK + +MCSYNQVNGVP C   +L +K 
Sbjct: 218 WNGMDRFHFNAQVSKQDIEETFDVPFRMCVSEGKVASVMCSYNQVNGVPTCADPNLLKKT 277

Query: 313 ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAID 372
            R  W   GYI SDCD+V   ++ Q+YT T E++AA  +KAG+D++CG  +  HTQ+A++
Sbjct: 278 VRGLWQLDGYIVSDCDSVGVFYDNQHYTPTPEEAAADAIKAGLDLDCGPFLAVHTQNAVE 337

Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
           KG + E D++ AL+N  +VQ+RLG+F+G+P    YGKLGP DVC   H++LAL+AARQGI
Sbjct: 338 KGLLSEADVNGALVNTLTVQMRLGMFDGEPSAHAYGKLGPKDVCKPAHQELALEAARQGI 397

Query: 433 VLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHY 492
           VLLKN    LPL+     ++A+IGP       M G Y G+ C   + L+G+  Y +KT +
Sbjct: 398 VLLKNTGPVLPLSPQRHHTVAVIGPNSKATVTMIGNYAGVACGYTNPLQGIGRY-AKTIH 456

Query: 493 ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTS 552
             GC +V C +D  F  A+  A++AD  ++V GLD + E E  DR  LLLPG+Q  LV+ 
Sbjct: 457 QLGCENVACKNDKLFGSAINAARQADATVLVMGLDQSIEAETVDRTGLLLPGRQQDLVSK 516

Query: 553 VARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGR 612
           VA  SK P ILV+  GG +D++FA+ + +I  ILW GYPG+AG  A+A+I+FG  NPGG+
Sbjct: 517 VAAASKGPTILVIMSGGSVDITFAKNNPRIVGILWAGYPGQAGGAAIADILFGTTNPGGK 576

Query: 613 LPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS 671
           LP+TWYP+ + TK+PM +M MR   S  YPGR+YRFY G  VY FGHGL+YT++ +   S
Sbjct: 577 LPVTWYPQEYLTKLPMTNMAMRGSKSAGYPGRTYRFYNGPVVYPFGHGLTYTHFVHTLAS 636

Query: 672 APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
           AP+   +S  L      N+   +   +   H      C  L   +++ + N G  DG+H 
Sbjct: 637 APT--VVSVPLNGHRRANVTNISNRAIRVTH----ARCDKLSISLEVDIKNVGSRDGTHT 690

Query: 732 VMLFARVPK-VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLG 790
           +++F+  P        EKQL+ F+++H  AKG + +   +  C+ LS+ +K G R +PLG
Sbjct: 691 LLVFSAPPAGFGHWALEKQLVAFEKIHVPAKGLQRVGVNIHVCKLLSVVDKSGIRRIPLG 750

Query: 791 NHVLMVGELRHSLTIE 806
            H   +G+++HS++++
Sbjct: 751 EHSFNIGDVKHSVSLQ 766


>gi|9294427|dbj|BAB02547.1| beta-1,4-xylosidase [Arabidopsis thaliana]
          Length = 876

 Score =  734 bits (1896), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/806 (45%), Positives = 507/806 (62%), Gaps = 54/806 (6%)

Query: 16  FRVLQLIVVVNVIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLT 73
           F  L L+++  V +   S+      +F C    P    Y FCN SLS   RAK L+S L+
Sbjct: 6   FVRLSLLIIALVSSLCESQK-----NFACDISAPATAKYGFCNVSLSYEARAKDLVSRLS 60

Query: 74  LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAA 133
           L+EK+QQL + A+ +PRLG+P YEWWSE+LHG++  GPGV+FNGTV   TSFP  +++AA
Sbjct: 61  LKEKVQQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAA 120

Query: 134 SFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           SFN SLW  +G  V+ EARAM+N+G AGLT+W+PN+N+FRDPRWGRGQETPGEDP+VVS 
Sbjct: 121 SFNTSLWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSK 180

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
           YAV +VK  Q  +                       ++ +   L +S+CCKH  AYDL+ 
Sbjct: 181 YAVNYVKGLQDVH-----------------------DAGKSRRLKVSSCCKHYTAYDLDN 217

Query: 254 WGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA 313
           W    R+ F+A +T+QD EDT+Q PF+SC+E+G  S +MCSYN+VNG+P C   +L +  
Sbjct: 218 WKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGV 277

Query: 314 -RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAID 372
            R +W   GYI SDCD++   F   +YTKT ED+ A  LKAG+++NCG  + ++T++A+ 
Sbjct: 278 IRGQWRLDGYIVSDCDSIQVYFNDIHYTKTREDAVALALKAGLNMNCGDFLGKYTENAVK 337

Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
             K+   D+D AL+  + V +RLG F+GDP+   +G LGP DVC+ +H+ LAL+AA+QGI
Sbjct: 338 LKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDVCSKDHQMLALEAAKQGI 397

Query: 433 VLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV-SKTH 491
           VLL+N +  LPL K  V  LA+IGP  N    M   Y G+PC   S ++GL+ YV  K  
Sbjct: 398 VLLEN-RGDLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCKYTSPIQGLQKYVPEKIV 456

Query: 492 YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
           Y  GC DV C        AV+   +AD  ++V GLD T E E  DRV+L LPG Q  LV 
Sbjct: 457 YEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVR 516

Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
            VA  +K+ V+LV+   GP+D+SFA+  S I ++LW+GYPGEAG  A+A++IFGD+NP G
Sbjct: 517 DVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEAGGDAIAQVIFGDYNPSG 576

Query: 612 RLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL 670
           RLP TWYP+ F  KV M DMNMR +S+  +PGRSYRFYTG  +Y FG+GLSY+++S   L
Sbjct: 577 RLPETWYPQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFGYGLSYSSFSTFVL 636

Query: 671 SAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSH 730
           SAPS + I    K     N+ + T   +  V+      C  L+  + I V N G   GSH
Sbjct: 637 SAPSIIHI----KTNPIMNLNKTTSVDISTVN------CHDLKIRIVIGVKNHGLRSGSH 686

Query: 731 VVMLFARVPKVSQ-----GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR 785
           VV++F + PK S+     G P  QL+GF+RV      +++ +   D C+ LS+ + HG+R
Sbjct: 687 VVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVGRSMTEKFTVDFDVCKALSLVDTHGKR 746

Query: 786 ILPLGNHVLMVG-----ELRHSLTIE 806
            L  G+H L++G     ++ H L + 
Sbjct: 747 KLVTGHHKLVIGSNSDQQIYHHLNVR 772


>gi|15230897|ref|NP_188596.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
 gi|259585724|sp|Q9LJN4.2|BXL5_ARATH RecName: Full=Probable beta-D-xylosidase 5; Short=AtBXL5; Flags:
           Precursor
 gi|332642747|gb|AEE76268.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
          Length = 781

 Score =  734 bits (1896), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/806 (45%), Positives = 507/806 (62%), Gaps = 54/806 (6%)

Query: 16  FRVLQLIVVVNVIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLT 73
           F  L L+++  V +   S+      +F C    P    Y FCN SLS   RAK L+S L+
Sbjct: 6   FVRLSLLIIALVSSLCESQK-----NFACDISAPATAKYGFCNVSLSYEARAKDLVSRLS 60

Query: 74  LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAA 133
           L+EK+QQL + A+ +PRLG+P YEWWSE+LHG++  GPGV+FNGTV   TSFP  +++AA
Sbjct: 61  LKEKVQQLVNKATGVPRLGVPPYEWWSEALHGVSDVGPGVHFNGTVPGATSFPATILTAA 120

Query: 134 SFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           SFN SLW  +G  V+ EARAM+N+G AGLT+W+PN+N+FRDPRWGRGQETPGEDP+VVS 
Sbjct: 121 SFNTSLWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSK 180

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
           YAV +VK  Q  +                       ++ +   L +S+CCKH  AYDL+ 
Sbjct: 181 YAVNYVKGLQDVH-----------------------DAGKSRRLKVSSCCKHYTAYDLDN 217

Query: 254 WGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA 313
           W    R+ F+A +T+QD EDT+Q PF+SC+E+G  S +MCSYN+VNG+P C   +L +  
Sbjct: 218 WKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGV 277

Query: 314 -RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAID 372
            R +W   GYI SDCD++   F   +YTKT ED+ A  LKAG+++NCG  + ++T++A+ 
Sbjct: 278 IRGQWRLDGYIVSDCDSIQVYFNDIHYTKTREDAVALALKAGLNMNCGDFLGKYTENAVK 337

Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
             K+   D+D AL+  + V +RLG F+GDP+   +G LGP DVC+ +H+ LAL+AA+QGI
Sbjct: 338 LKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDVCSKDHQMLALEAAKQGI 397

Query: 433 VLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV-SKTH 491
           VLL+N +  LPL K  V  LA+IGP  N    M   Y G+PC   S ++GL+ YV  K  
Sbjct: 398 VLLEN-RGDLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCKYTSPIQGLQKYVPEKIV 456

Query: 492 YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
           Y  GC DV C        AV+   +AD  ++V GLD T E E  DRV+L LPG Q  LV 
Sbjct: 457 YEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVR 516

Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
            VA  +K+ V+LV+   GP+D+SFA+  S I ++LW+GYPGEAG  A+A++IFGD+NP G
Sbjct: 517 DVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEAGGDAIAQVIFGDYNPSG 576

Query: 612 RLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL 670
           RLP TWYP+ F  KV M DMNMR +S+  +PGRSYRFYTG  +Y FG+GLSY+++S   L
Sbjct: 577 RLPETWYPQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFGYGLSYSSFSTFVL 636

Query: 671 SAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSH 730
           SAPS + I    K     N+ + T   +  V+      C  L+  + I V N G   GSH
Sbjct: 637 SAPSIIHI----KTNPIMNLNKTTSVDISTVN------CHDLKIRIVIGVKNHGLRSGSH 686

Query: 731 VVMLFARVPKVSQ-----GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR 785
           VV++F + PK S+     G P  QL+GF+RV      +++ +   D C+ LS+ + HG+R
Sbjct: 687 VVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVGRSMTEKFTVDFDVCKALSLVDTHGKR 746

Query: 786 ILPLGNHVLMVG-----ELRHSLTIE 806
            L  G+H L++G     ++ H L + 
Sbjct: 747 KLVTGHHKLVIGSNSDQQIYHHLNVR 772


>gi|371917282|dbj|BAL44717.1| SlArf/Xyl2 [Solanum lycopersicum]
          Length = 774

 Score =  734 bits (1896), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/772 (45%), Positives = 495/772 (64%), Gaps = 38/772 (4%)

Query: 39  KPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           +P F C  K   F ++PFC T+L I  R + LI  LTLQEK++ L +NA+A+PRLGI  Y
Sbjct: 28  RPPFACDQKNRAFRNFPFCQTNLPIGDRVRDLIGRLTLQEKVKLLGNNAAAVPRLGIKGY 87

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           EWWSE+LHG+++ GPG  F G     TSFPQV+ +AASFN SLW  IG  V+ EARAMYN
Sbjct: 88  EWWSEALHGVSNVGPGTKFGGEFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYN 147

Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
               GLT+W+PN+NIFRDPRWGRGQETPGEDP+V + YA  +V+  QG            
Sbjct: 148 GEMGGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAALYAERYVRGLQG------------ 195

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                         ++ GD L ++ACCKH  AYDL+ WG   R+ FNA +T+QD EDTF 
Sbjct: 196 --------------NEDGDSLKVAACCKHYTAYDLDNWGGVDRFHFNAKVTKQDIEDTFD 241

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
            PFRSC++QGK + IMCSYNQVNG+P C    L +K  R  WG  GYI SDCD+V   ++
Sbjct: 242 VPFRSCVKQGKVASIMCSYNQVNGIPTCADPQLLRKTIRGGWGLNGYIVSDCDSVGVFYD 301

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q+YT T E++AA  +KAG+D++CG  + +HT++A+  G ++E  ID  L N  +VQ+RL
Sbjct: 302 TQHYTSTPEEAAAAAIKAGLDLDCGPFLSQHTENAVHIGILKEAAIDTNLANTVAVQMRL 361

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G+F+G+P   +YG LGP DVC+  H++LA++AARQGIVLLKN    LPL+     ++A+I
Sbjct: 362 GMFDGEPSAQQYGHLGPRDVCSPAHQELAVEAARQGIVLLKNHGPALPLSPRRHRTVAVI 421

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           GP  +    M G Y G+ C   S L+G+  Y +KT +  GC DV C+ D  F  AV  A+
Sbjct: 422 GPNSDVTVTMIGNYAGVACGYTSPLQGISKY-AKTIHEKGCGDVACSDDKLFAGAVNAAR 480

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           +AD  ++V GLD + E E RDR  LLLPG Q  L++ V++ S+ PV+LVL  GGP+DV+F
Sbjct: 481 QADATVLVMGLDQSIEAEFRDRTGLLLPGFQQELISEVSKASRGPVVLVLMSGGPVDVTF 540

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRA 634
           A  D +I +I+W GYPG+ G  A+A+++FG  NPGG+LPMTWYP+ +   +PM  M+MR+
Sbjct: 541 ANNDPRIGAIVWAGYPGQGGGAAIADVLFGAHNPGGKLPMTWYPQEYLNNLPMTTMDMRS 600

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           + ++ YPGR+YRFY G  VY FGHGLSYT +      AP  L I    +   + + +   
Sbjct: 601 NLAKGYPGRTYRFYKGPLVYPFGHGLSYTKFITTIFEAPKTLAIPIDGRHTYNSSTISNK 660

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
             R+ +        C+ +   + + V N G  DGSH +++F++ P V    P KQL+ F 
Sbjct: 661 SIRVTH------AKCSKISVQIHVDVKNVGPKDGSHTLLVFSK-PPVDIWVPHKQLVAFQ 713

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           +V+  A+  + ++  +  C+ LS+ ++ G R +P+G H + +G+ +HSL+++
Sbjct: 714 KVYVPARSKQRVAINIHVCKYLSVVDRAGVRRIPIGEHSIHIGDAKHSLSLQ 765


>gi|449438167|ref|XP_004136861.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Cucumis sativus]
          Length = 782

 Score =  734 bits (1894), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/764 (48%), Positives = 494/764 (64%), Gaps = 41/764 (5%)

Query: 47  PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
           P    + FC++SL    R + L+  LTLQEKI  L +NA  + RLGIP YEWWSE+LHG+
Sbjct: 55  PSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRLGIPKYEWWSEALHGV 114

Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWA 166
           +  GPG  F+  V   TSFPQV+++AASFN SL+  IG  V+ EARAMYN+G AGLT+W+
Sbjct: 115 SYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGLTYWS 174

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           PN+NIFRDPRWGRGQETPGEDP++ S YA  +V+  Q    + DDG              
Sbjct: 175 PNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQ----QRDDG-------------- 216

Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
                   D L ++ACCKH  AYDL+ W    RY FNA+++ QD EDTFQPPF+SC+  G
Sbjct: 217 ------DPDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDG 270

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
             + +MCSYNQVNG P C   DL     R +W   GYI SDCD+V  ++  Q+YTK+ E+
Sbjct: 271 NVASVMCSYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEE 330

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           +AA  + AG+D++CG  + +HT++A+  G V E  I +A+ N     +RLG F+G+P K 
Sbjct: 331 AAAKTILAGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQ 390

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            YGKLGP DVCT EH++LA +AARQGIVLLKN  K LPL+ +A+ SLA+IGP  N    M
Sbjct: 391 LYGKLGPKDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTM 450

Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
            G Y G PC   + L+GL A VS T +  GC +V C S A   EA +IA  AD  ++V G
Sbjct: 451 IGNYEGTPCKYTTPLQGLSAVVS-TSFQPGCANVACTS-AQLDEAKKIAASADATVLVVG 508

Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
            D + E E RDRV L LPGQQ  L+T VA+ SK PVILV+  GG +D++FA+ D +I+SI
Sbjct: 509 SDQSIEAESRDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSI 568

Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRS 644
           LW+G+PGEAG  A+A++IFG FNP GRLPMTWYP+S+  KVPM DM MR  +S  +PGR+
Sbjct: 569 LWVGFPGEAGGAAIADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRT 628

Query: 645 YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID 704
           YRFYTG  +Y FG GLSY+++ +  + AP  ++I   L+ G   +I   +       H  
Sbjct: 629 YRFYTGETIYSFGDGLSYSDFKHHLVKAPKLVSI--PLEEG---HICHSS-----KCHSL 678

Query: 705 EVT--SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           EV   SC +L F V + V N G   GSH V L++  P V   +P+K L+GF++V     G
Sbjct: 679 EVVQESCQNLGFDVHLRVKNVGQRSGSHTVFLYSTPPSV-HNSPQKHLLGFEKVSLGRGG 737

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
              + F VD C+ LS+A++ G R + LG H+L VG L+HSL ++
Sbjct: 738 ETVVRFKVDVCKDLSVADEVGSRKVALGLHILHVGTLKHSLNVK 781


>gi|357445735|ref|XP_003593145.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
 gi|355482193|gb|AES63396.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
          Length = 775

 Score =  733 bits (1893), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/769 (48%), Positives = 501/769 (65%), Gaps = 40/769 (5%)

Query: 42  FPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           F C   K  +  SY FC+ SLS+  R   L+  LTLQEKI  L ++A  + RLGIP YEW
Sbjct: 40  FACDVAKNTNVSSYGFCDKSLSVEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEW 99

Query: 99  WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           WSE+LHG+++ GPG +F+  V   TSFP  +++AASFN SL+  IGS V+ EARAMYN+G
Sbjct: 100 WSEALHGVSNIGPGTHFSSLVPGATSFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVG 159

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
            AGLT+W+PNINIFRDPRWGRGQETPGEDP++ S YA  +VK  Q    ++DDG      
Sbjct: 160 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQ----QTDDG------ 209

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                           D+L ++ACCKH  AYD++ W    RY+F+A++++QD +DTFQPP
Sbjct: 210 --------------DSDKLKVAACCKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPP 255

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
           F+SC+  G  + +MCSYN+VNG P C   DL +   R +W   GYI SDCD+V  +F+ Q
Sbjct: 256 FKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLFKDQ 315

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           +YTKT E++AA  + +G+D++CG+ + ++T  A+ +G V E  I+ A+ N F+  +RLG 
Sbjct: 316 HYTKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDEASINNAVSNNFATLMRLGF 375

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           F+GDP K  YG LGP DVCT E+++LA +AARQGIVLLKN    LPL+  A+ SLA+IGP
Sbjct: 376 FDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNSPGSLPLSSKAIKSLAVIGP 435

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
             N    M G Y GIPC   S L+GL A+V  T YA GC DV C ++A   +A +IA  A
Sbjct: 436 NANATRVMIGNYEGIPCKYTSPLQGLTAFV-PTSYAPGCPDVQC-ANAQIDDAAKIAASA 493

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D  I+V G +L  E E  DRV++LLPGQQ  LV  VA  SK PVILV+  GG +DVSFA+
Sbjct: 494 DATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAK 553

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADS 636
            + +I+SILW+GYPGEAG  A+A++IFG +NP GRLPMTWYP+S+  K+PM +MNMR+D 
Sbjct: 554 TNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYPQSYVEKIPMTNMNMRSDP 613

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
           +  YPGR+YRFY G  V+ FG G+S+    +K + AP  +++  +     D         
Sbjct: 614 ATGYPGRTYRFYKGETVFSFGDGMSFGTVEHKIVKAPQLVSVPLA----EDHECRSLECK 669

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
            LD    DE   C +L F + +SV N G +  SH V+LF   P V    P+K L+GF++V
Sbjct: 670 SLDVA--DE--HCQNLAFDIHLSVKNMGKMSSSHSVLLFFTPPNV-HNAPQKHLLGFEKV 724

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
               K    + F VD C  LS+ ++ G R +PLG+H+L VG L+HSL++
Sbjct: 725 QLAGKSEGMVRFKVDVCNDLSVVDELGNRKVPLGDHMLHVGNLKHSLSV 773


>gi|62701894|gb|AAX92967.1| beta-xylosidase, putative [Oryza sativa Japonica Group]
 gi|77550041|gb|ABA92838.1| Glycosyl hydrolase family 3 C terminal domain containing protein
           [Oryza sativa Japonica Group]
          Length = 793

 Score =  733 bits (1893), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/780 (48%), Positives = 488/780 (62%), Gaps = 66/780 (8%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           FC+  L+   RA  L++ LTL EK+ QL D A+ + RLG+PAYEWWSE LHG++  G G+
Sbjct: 32  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 91

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIF 172
            FNGTV +VTSFPQV+++AA+F+  LW  +G AV  EARA+YNLGQA GLT W+PN+NIF
Sbjct: 92  RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 151

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRGQETPGEDP+  S YAV FV   QG                       G E+ 
Sbjct: 152 RDPRWGRGQETPGEDPVTASRYAVAFVTGLQG----------------------IGGEA- 188

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNA---------------------------- 264
                  SACCKH  AYDL+ W N  RY++++                            
Sbjct: 189 -------SACCKHATAYDLDYWNNVVRYNYDSKDGASTGKSGETSSQVEKKHGPYEKGYF 241

Query: 265 MITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYI 323
            +T QD EDT+ PPF+SC+ +GKA+CIMC YN +NGVPAC   DL  +K R EWG  GY+
Sbjct: 242 AVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQEWGMNGYV 301

Query: 324 TSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDR 383
            SDCDAVATI +  +YT + ED+ A  +K GMD+NCG     H  +A+ KG + EKDIDR
Sbjct: 302 ASDCDAVATIRDAHHYTLSPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDR 361

Query: 384 ALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
           AL+NLF+V++RLG F+GDPR    YG LG  DVC+  HK LAL+AA+ GIVLLKND   L
Sbjct: 362 ALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAGAL 421

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS-KTHYASGCHDVPC 501
           PL  +AV+SLA+IGP  +N+  + G Y G PC   + L+G++ Y+  +  + +GC D P 
Sbjct: 422 PLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGC-DSPA 480

Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
            + A  +EA  +A  +D V++  GL   QE +  DR SLLLPG+Q  L+T+VA  ++RPV
Sbjct: 481 CAVAATNEAAALASSSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVANAARRPV 540

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           ILVL  GGP+DV+FA+ + +I +ILW GYPG+AG  A+A+++FGD NP GRLP+TWYPE 
Sbjct: 541 ILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEE 600

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI-SA 680
           FTKVPM DM MRAD +  YPGRSYRFY G  VY FG+GLSY+ +S +  S+ S     + 
Sbjct: 601 FTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNL 660

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDE--VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
           SL AG         G  +    + E  V  C+ L F   + V N G +DG H V+++ R 
Sbjct: 661 SLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRW 720

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           P  S G P +QLIGF   H        +SF V PCE  S   + G R++  G H LMVG+
Sbjct: 721 PTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAHFLMVGD 780


>gi|292630922|sp|A5JTQ2.1|XYL1_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 1;
           AltName: Full=Xylan
           1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 1;
           Short=MsXyl1; Includes: RecName: Full=Beta-xylosidase;
           AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
           Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
           Full=Alpha-N-arabinofuranosidase; AltName:
           Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
           Flags: Precursor
 gi|146762261|gb|ABQ45227.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
           varia]
          Length = 774

 Score =  733 bits (1892), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/769 (47%), Positives = 499/769 (64%), Gaps = 40/769 (5%)

Query: 42  FPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           F C   K  +  SY FC+ SLS+  R   L+  LTLQEKI  L ++A  + RLGIP YEW
Sbjct: 39  FACDVAKNTNVSSYGFCDNSLSVEDRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEW 98

Query: 99  WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           WSE+LHG+++ GPG +F+  V   T+FP  +++AASFN SL+  IGS V+ EARAMYN+G
Sbjct: 99  WSEALHGVSNIGPGTHFSSLVPGATNFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVG 158

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
            AGLT+W+PNINIFRDPRWGRGQETPGEDP++ S YA  +VK  Q    ++DDG      
Sbjct: 159 LAGLTYWSPNINIFRDPRWGRGQETPGEDPLLSSKYAAGYVKGLQ----QTDDG------ 208

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                           D+L ++ACCKH  AYD++ W    RY+F+A++++QD +DTFQPP
Sbjct: 209 --------------DSDKLKVAACCKHYTAYDVDNWKGVQRYTFDAVVSQQDLDDTFQPP 254

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
           F+SC+  G  + +MCSYN+VNG P C   DL +   R +W   GYI SDCD+V  +++ Q
Sbjct: 255 FKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKLNGYIVSDCDSVEVLYKDQ 314

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           +YTKT E++AA  + +G+D++CG+ + ++T  A+ +G V E  I  A+ N F+  +RLG 
Sbjct: 315 HYTKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDEASITNAVSNNFATLMRLGF 374

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           F+GDP K  YG LGP DVCT E+++LA +AARQGIVLLKN  + LPL+  A+ SLA+IGP
Sbjct: 375 FDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNSPRSLPLSSKAIKSLAVIGP 434

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
             N    M G Y GIPC   S L+GL A+V  T YA GC DV C ++A   +A +IA  A
Sbjct: 435 NANATRVMIGNYEGIPCKYTSPLQGLTAFV-PTSYAPGCPDVQC-ANAQIDDAAKIAASA 492

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D  I+V G +L  E E  DRV++LLPGQQ  LV  VA  SK PVILV+  GG +DVSFA+
Sbjct: 493 DATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKGPVILVIMSGGGMDVSFAK 552

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADS 636
            + +I+SILW+GYPGEAG  A+A++IFG +NP GRLPMTWYP+S+  KVPM +MNMRAD 
Sbjct: 553 TNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYPQSYVEKVPMTNMNMRADP 612

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
           +  YPGR+YRFY G  V+ FG G+S+    +K + AP  +++  +     D         
Sbjct: 613 ATGYPGRTYRFYKGETVFSFGDGMSFGTVEHKIVKAPQLVSVPLA----EDHECRSLECK 668

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
            LD         C +L F + +SV N G +  SH V+LF   P V    P+K L+GF++V
Sbjct: 669 SLDVAD----KHCQNLAFDIHLSVKNMGKMSSSHSVLLFFTPPNV-HNAPQKHLLGFEKV 723

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
               K    + F VD C  LS+ ++ G R +PLG+H+L VG L+HSL++
Sbjct: 724 QLAGKSEGMVRFKVDVCNDLSVVDELGNRKVPLGDHMLHVGNLKHSLSV 772


>gi|357156390|ref|XP_003577440.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 755

 Score =  732 bits (1890), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/784 (46%), Positives = 492/784 (62%), Gaps = 56/784 (7%)

Query: 40  PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           P F C PP    Y FCN +L    RA  L++ LTL+EK+ QL D A  +PR G+P Y WW
Sbjct: 12  PAFSCGPPQQAQYAFCNRALPAEQRAADLVAKLTLEEKVSQLGDQAPGVPRFGVPGYNWW 71

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE LHG++  G G++FNG V  VT+FPQVL++ ASF+ S+W  IG A+  EARAM+NLGQ
Sbjct: 72  SEGLHGVSMWGHGMHFNGAVRGVTTFPQVLLTTASFDDSIWYRIGQAIGTEARAMFNLGQ 131

Query: 160 A-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           A GLT W+PN+NI+RDPRWGRGQETPGEDP   S YAV FV+  QG +  +         
Sbjct: 132 ADGLTIWSPNVNIYRDPRWGRGQETPGEDPATASKYAVAFVRGLQGTSTTT--------- 182

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                             L  SACCKH  AYDL+ W    RY+FNA +T QD E+TF PP
Sbjct: 183 ------------------LQTSACCKHATAYDLDDWNRIGRYNFNAKVTAQDLEETFNPP 224

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
           F+SC+ +GKA+C+MC+Y  VNG+PAC    L  K  + EWG  GYI+SDCDAVA ++  +
Sbjct: 225 FKSCVVEGKATCVMCAYTSVNGIPACADSGLLTKTIKGEWGMNGYISSDCDAVALLYGTR 284

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
            Y+ T ED+ A  +KAG+D+NCG     H  +A+ + K+ E+D+D+AL NLF++++RLG 
Sbjct: 285 -YSGTPEDAVAAAIKAGLDMNCGNFSQVHGMAALQQRKMSEQDVDKALRNLFAIRMRLGH 343

Query: 398 FNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK--NAVSSLAI 454
           F+GDP +   YG+LG  DVC+  HK LAL+AA+ GIVLLKND   LPL++   A +S A+
Sbjct: 344 FDGDPLQSPLYGRLGAQDVCSPAHKDLALEAAQNGIVLLKNDAATLPLSRPTAASASFAV 403

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK-THYASGCHDVPCNSDAGFHEAVRI 513
           IGP  N    + G Y G PC   + L+ L+ + SK   +  GC    CN  A  ++A  +
Sbjct: 404 IGPNANEPGALLGNYFGPPCETTTPLQALQKFYSKNVRFVPGCDSAACNV-ADTYQASGL 462

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           A  +D+ I+  GL   QE E  DR SLLLPG+Q SL+T+VA  +KRP+ILVL  GGP+D+
Sbjct: 463 AATSDYTILFMGLSQKQEQEGLDRTSLLLPGKQESLITAVAAAAKRPIILVLLTGGPVDI 522

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
           +FA+ + +I +ILW GYPG+AG  A+A+++FG+ NP GRLP+TWYPE +TKVPM+DM MR
Sbjct: 523 TFAKFNPKIGAILWAGYPGQAGGLAIAKVLFGEHNPSGRLPVTWYPEEYTKVPMDDMRMR 582

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL-------SAP-SELTISASLKAG 685
           AD +  YPGRSYRFY G  VY FG+GLSY+ +S + +        AP +EL  +A++  G
Sbjct: 583 ADPATGYPGRSYRFYKGNAVYKFGYGLSYSKFSRQLVRNSSSNNRAPNTELLAAAAVDCG 642

Query: 686 SDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
           + +           Y  ++E+    C  L+F   + V N G +DG   V+LF R P  ++
Sbjct: 643 ASR-----------YYLVEEIGGEVCERLKFPAVVEVENHGPMDGKQSVLLFLRWPTATE 691

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
           G P  QL+GF      A     +SF + PCE  S     G +++  G+H LMV E    +
Sbjct: 692 GRPASQLVGFRSQDLRAGEKASVSFDISPCEHFSRTTVDGTKVIDRGSHFLMVDEDEMEI 751

Query: 804 TIET 807
           + ++
Sbjct: 752 SFDS 755


>gi|449479116|ref|XP_004155509.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Cucumis sativus]
          Length = 809

 Score =  731 bits (1888), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/764 (48%), Positives = 494/764 (64%), Gaps = 41/764 (5%)

Query: 47  PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
           P    + FC++SL    R + L+  LTLQEKI  L +NA  + RLGIP YEWWSE+LHG+
Sbjct: 82  PSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARNVTRLGIPKYEWWSEALHGV 141

Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWA 166
           +  GPG  F+  V   TSFPQV+++AASFN SL+  IG  V+ EARAMYN+G AGLT+W+
Sbjct: 142 SYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGLTYWS 201

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           PN+NIFRDPRWGRGQETPGEDP++ S YA  +V+  Q    + DDG              
Sbjct: 202 PNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQ----QRDDG-------------- 243

Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
                   D L ++ACCKH  AYDL+ W    RY FNA+++ QD EDTFQPPF+SC+  G
Sbjct: 244 ------DPDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVIDG 297

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
             + +MCSYNQVNG P C   DL     R +W   GYI SDCD+V  ++  Q+YTK+ E+
Sbjct: 298 NVASVMCSYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHYTKSPEE 357

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           +AA  + AG+D++CG  + +HT++A+  G V E  I +A+ N     +RLG F+G+P K 
Sbjct: 358 AAAKTILAGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPSKQ 417

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            YGKLGP DVCT EH++LA +AARQGIVLLKN  K LPL+ +A+ SLA+IGP  N    M
Sbjct: 418 LYGKLGPKDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTKTM 477

Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
            G Y G PC   + L+GL A VS T +  GC +V C S A   EA +IA  AD  ++V G
Sbjct: 478 IGNYEGTPCKYTTPLQGLSAVVS-TSFQPGCANVACTS-AQLDEAKKIAASADATVLVVG 535

Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
            D + E E RDRV L LPGQQ  L+T VA+ SK PVILV+  GG +D++FA+ D +I+SI
Sbjct: 536 SDQSIEAESRDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITSI 595

Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRS 644
           LW+G+PGEAG  A+A++IFG FNP GRLPMTWYP+S+  KVPM DM MR  +S  +PGR+
Sbjct: 596 LWVGFPGEAGGAAIADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGRT 655

Query: 645 YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID 704
           YRFYTG  +Y FG GLSY+++ +  + AP  ++I   L+ G   +I   +       H  
Sbjct: 656 YRFYTGETIYSFGDGLSYSDFKHHLVKAPKLVSI--PLEEG---HICHSSK-----CHSL 705

Query: 705 EVT--SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           EV   SC +L F V + V N G   GSH V L++  P V   +P+K L+GF++V     G
Sbjct: 706 EVVQESCQNLGFDVHLRVKNVGQRSGSHTVFLYSTPPSV-HNSPQKHLLGFEKVSLGRGG 764

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
              + F VD C+ LS+A++ G R + LG H+L VG L+HSL ++
Sbjct: 765 ETVVRFKVDVCKDLSVADEVGSRKVALGLHILHVGTLKHSLNVK 808


>gi|74355968|dbj|BAE44362.1| alpha-L-arabinofuranosidase [Raphanus sativus]
          Length = 780

 Score =  731 bits (1887), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/807 (46%), Positives = 500/807 (61%), Gaps = 48/807 (5%)

Query: 11  RERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCK---PPHFDSYPFCNTSLSISTRAKS 67
           R R    V  L ++   +   +S    + P F C     P   +Y FCNT++ I  R   
Sbjct: 10  RNRAPSSVFSLSLIF--LCLLDSSNAQSTPVFACDVAGNPSLAAYGFCNTAIKIEYRVAD 67

Query: 68  LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQ 127
           L++ LTLQEKI  L+     + RLGIP YEWWSE+LHG++  GPG  F+G V   TSFPQ
Sbjct: 68  LVARLTLQEKIGVLTSKLHGVARLGIPTYEWWSEALHGVSYVGPGTRFSGQVPGATSFPQ 127

Query: 128 VLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGED 187
           V+++AASFN SL+  IG  V+ EARAMYN+G AGLT+W+PN+NIFRDPRWGRGQETPGED
Sbjct: 128 VILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGED 187

Query: 188 PMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD--ELMLSACCKH 245
           P++ S YA  +VK  Q                          E+D  D   L ++ACCKH
Sbjct: 188 PLLSSKYASGYVKGLQ--------------------------ETDSSDANRLKVAACCKH 221

Query: 246 LIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL 305
             AYD++ W    RYSFNA++ +QD +DT+QPPF+SC+  G  + +MCSYN+VNG P C 
Sbjct: 222 YTAYDVDNWKGVERYSFNAVVNQQDLDDTYQPPFKSCVVDGNVASVMCSYNKVNGKPTCA 281

Query: 306 RGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML 364
             DL     R EW   GYI SDCD+V  +++ Q+YTKT E++AA  + AG+D+NCG  + 
Sbjct: 282 DPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTKTPEEAAAISINAGLDLNCGYFLG 341

Query: 365 RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLA 424
            HT++A+  G V+E  ID+A+ N F   +RLG F+GDP+K  YG LGP DVCT  +++LA
Sbjct: 342 DHTEAAVKAGLVKEAAIDKAITNNFLTLMRLGFFDGDPKKQIYGGLGPKDVCTPANQELA 401

Query: 425 LDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLE 484
            +AARQGIVLLKN    LPL+   + +LA+IGP  N    M G Y G PC   + L+GL 
Sbjct: 402 AEAARQGIVLLKNTGA-LPLSPKTIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLA 460

Query: 485 AYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
             V  T Y  GC +V C + A    + ++A  +D  ++V G D + E E RDRV L LPG
Sbjct: 461 GTVHTT-YLPGCSNVAC-AVADVAGSTKLAAASDATVLVIGADQSIEAESRDRVDLNLPG 518

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           QQ  LVT VA+ +K PV LV+  GG  D++FA+ D++I+ ILW+GYPGEAG  A A++IF
Sbjct: 519 QQQELVTQVAKAAKGPVFLVIMSGGGFDITFAKNDAKIAGILWVGYPGEAGGIATADVIF 578

Query: 605 GDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           G +NP GRLPMTWYP+S+  KVPM +MNMR D S  YPGR+YRFYTG  VY FG GLSYT
Sbjct: 579 GRYNPSGRLPMTWYPQSYVEKVPMTNMNMRPDKSNGYPGRTYRFYTGETVYAFGDGLSYT 638

Query: 664 NYSYKFLSAPSELTISAS---LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR-FHVQIS 719
            +S+  + AP  +++S     +   S+   L   G      H D   S T  + F V I 
Sbjct: 639 KFSHSLVKAPRLVSLSLEENHVCRSSECQSLNAIGP-----HCDNAVSGTGGKAFEVHIK 693

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           V N GD +G H V LF   P V  G+P K L+GF+++         + F VD C+ LS+ 
Sbjct: 694 VQNGGDREGIHTVFLFTTPPAV-HGSPRKHLLGFEKIRLGKMEEAVVKFKVDVCKDLSVV 752

Query: 780 NKHGRRILPLGNHVLMVGELRHSLTIE 806
           ++ G+R + LG H+L VG+++HSL+I 
Sbjct: 753 DEVGKRKIGLGQHLLHVGDVKHSLSIR 779


>gi|356534827|ref|XP_003535953.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 771

 Score =  731 bits (1886), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/770 (45%), Positives = 492/770 (63%), Gaps = 39/770 (5%)

Query: 42  FPCKPPHF--DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           F C P +    + PFC   L+   R K LI  LTLQEK+  L +NA+A+PRLGI  YEWW
Sbjct: 27  FACDPKNTATKNLPFCKAWLATGARVKDLIGRLTLQEKVNLLVNNAAAVPRLGIKGYEWW 86

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE+LHG+++ GPG  F G   + TSFPQV+ +AASFN SLW  IG   + EARAMYN G 
Sbjct: 87  SEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVASDEARAMYNGGT 146

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           AGLT+W+PN+NIFRDPRWGRGQETPGEDP++   YA  +V+  Q                
Sbjct: 147 AGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQ---------------- 190

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
                     E+D G+ L ++A CKH  AYDL+ W    R+ FNA +++QD EDTF  PF
Sbjct: 191 ----------ETD-GNRLKVAASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPF 239

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           R C+++GK + +MCSYNQVNGVP C    L ++  R +WG  GYI SDCD+V   +  Q+
Sbjct: 240 RMCVKEGKVASVMCSYNQVNGVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQH 299

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           YT T E++AA  +KAG+D++CG  + +HTQ+A+ KG + E D++ ALLN  +VQ+RLG++
Sbjct: 300 YTSTPEEAAADAIKAGLDLDCGPFLGQHTQNAVKKGLISETDVNGALLNTLTVQMRLGMY 359

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +G+P    YGKLGP DVCT  H++LAL+AARQGIVLLKN    LPL+     ++A+IGP 
Sbjct: 360 DGEPSSHPYGKLGPRDVCTPSHQELALEAARQGIVLLKNKGPSLPLSTRRHPTVAVIGPN 419

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
            N    M G Y GI C   S L G+  Y +KT +  GC +V C +D  F  A+ +A++AD
Sbjct: 420 SNVTVTMIGNYAGIACGYTSPLEGIGRY-TKTIHELGCANVACTNDKQFGRAINVAQQAD 478

Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
             ++V GLD + E E  DR  LLLPG+Q  LV+ VA  SK P ILV+  GGP+D++FA+ 
Sbjct: 479 ATVLVMGLDQSIEAETVDRAGLLLPGRQQDLVSKVAAASKGPTILVIMSGGPVDITFAKN 538

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSS 637
           + +I +ILW GYPG+AG  A+A+I+FG  NPGG+LPMTWYP+ + K +PM +M MRA  S
Sbjct: 539 NPRIQAILWAGYPGQAGGAAIADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRS 598

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
           + YPGR+YRFY G  VY FG+GLSYT++ +   SAP  ++I        + + +     +
Sbjct: 599 KGYPGRTYRFYNGPVVYPFGYGLSYTHFVHTLASAPKLVSIPVDGHRHGNSSSIANKAIK 658

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRV 756
           + +        C  L   +Q+ V N G  DG+H +++F+  P  +    P KQL+ F ++
Sbjct: 659 VTHAR------CGKLSISLQVDVKNVGSKDGTHTLLVFSAPPAGNGHWAPHKQLVAFQKL 712

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           H  +K  + ++  +  C+ LS+ ++ G R +P+G H L +G+++H ++++
Sbjct: 713 HIPSKAQQRVNVNIHVCKLLSVVDRSGTRRVPMGLHSLHIGDVKHYVSLQ 762


>gi|255573163|ref|XP_002527511.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223533151|gb|EEF34909.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 810

 Score =  731 bits (1886), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/787 (47%), Positives = 504/787 (64%), Gaps = 48/787 (6%)

Query: 41  DFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           +F C    P  + Y FCNTSLS   RAK LIS LTLQEK+QQ+ ++A+ IPRLGIPAYEW
Sbjct: 24  NFACDKNSPQTNDYSFCNTSLSYQDRAKDLISRLTLQEKVQQVVNHAAGIPRLGIPAYEW 83

Query: 99  WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           WSE+LHG+++ G GV FNGTV   TSFP +++SAASFN +LW  +G  V+ EAR M+++G
Sbjct: 84  WSEALHGVSNVGFGVRFNGTVPGATSFPAMILSAASFNETLWLKMGQVVSTEARTMHSVG 143

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
            AGLT+W+PN+N+FRDPRWGRGQETPGEDP+VVS YAV +V                   
Sbjct: 144 LAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYV------------------- 184

Query: 219 REKRVLKGFGEESDR-GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
              R L+  G+E +   D+L +S+CCKH  AYDL+KW    R+ F+A +T+QD EDT+QP
Sbjct: 185 ---RGLQEVGDEGNSTADKLKVSSCCKHYTAYDLDKWKGVDRFHFDAKVTKQDLEDTYQP 241

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEY 336
           PFRSC+E+   S +MCSYN+VNG+P C   DL +   R EW   GYI SDCD++   ++ 
Sbjct: 242 PFRSCVEEAHVSSVMCSYNRVNGIPTCADPDLLKGIIRGEWNLDGYIVSDCDSIEVYYDS 301

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
            NYT T ED+ A  LKAG+++NCG  + ++T  A+   KV+E  +D+AL+  F V +RLG
Sbjct: 302 INYTATPEDAVALALKAGLNMNCGEFLGKYTVDAVKLNKVEESVVDQALIYNFIVLMRLG 361

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
            F+GDP+   +G LGP DVC+  H+KLALDAARQGIVLL N K  LPL+KN   +LA+IG
Sbjct: 362 FFDGDPKSLLFGNLGPSDVCSDGHQKLALDAARQGIVLLYN-KGALPLSKNNTRNLAVIG 420

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKK 516
           P  N  + M   Y GIPC   + L+GL+ YVS   YA+GC  V C+ D     A + A  
Sbjct: 421 PNANVTTTMISNYAGIPCKYTTPLQGLQKYVSTVTYAAGCKSVSCSDDTLIDAATQAAAA 480

Query: 517 ADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
           AD V+++ GLD + E E  DR +L LPG Q  LV  V   +   V+LV+    P+DVSFA
Sbjct: 481 ADAVVLLVGLDQSIEREGLDRENLTLPGFQEKLVVDVVNATNGTVVLVVMSSSPIDVSFA 540

Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRAD 635
              S+I  ILW+GYPG+AG  A+A+++FGD+NP GR P TWYP+ +  +VPM DMNMRA+
Sbjct: 541 VNKSKIKGILWVGYPGQAGGDAVAQVMFGDYNPAGRSPFTWYPQEYAHQVPMTDMNMRAN 600

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN---ILQ 692
           S+  +PGR+YRFY G  +Y FGHGLSY+ +S   +S PS L     LK  SD     IL 
Sbjct: 601 STANFPGRTYRFYAGNTLYKFGHGLSYSTFSNFIISGPSTLL----LKTNSDLKPDIILS 656

Query: 693 QTGSRLDYVHID----EVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ--G 744
              S  ++  I+    ++T+  CT+    + + V N G V G HVV++F + P  S+  G
Sbjct: 657 THNSTEEHPFINSQAMDITTLNCTNSLLSLILGVRNNGPVSGDHVVLVFWKPPNSSEVTG 716

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-----EL 799
               QL+GF RV      ++ ++  +D C++LS+ +  G+R L  G H+  +G     ++
Sbjct: 717 AANVQLVGFSRVEVNRGKTQNVTLEIDVCKRLSLVDSEGKRKLVTGQHIFTIGSDTEHQV 776

Query: 800 RHSLTIE 806
           RH L + 
Sbjct: 777 RHHLDVR 783


>gi|255556320|ref|XP_002519194.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223541509|gb|EEF43058.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 782

 Score =  730 bits (1885), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/775 (45%), Positives = 505/775 (65%), Gaps = 40/775 (5%)

Query: 36  VLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGI 93
           V ++  F C P +    +  FC  +L I  R + LIS LTLQEKI+ L +NA+A+PRLGI
Sbjct: 36  VESRAPFACDPRNGVTRNLKFCRANLPIHVRVRDLISRLTLQEKIRLLVNNAAAVPRLGI 95

Query: 94  PAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA 153
             YEWWSE+LHG+++ GPGV F G     TSFPQV+ +AASFN+SLW  IG  V+ EARA
Sbjct: 96  QGYEWWSEALHGVSNVGPGVKFGGAFPGATSFPQVITTAASFNQSLWEQIGRVVSDEARA 155

Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
           MYN G AGLT+W+PN+N+FRDPRWGRGQETPGEDP++   YA  +V+  Q          
Sbjct: 156 MYNGGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQ---------- 205

Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
                            S  G +L ++ACCKH  AYDL+ W    RY FNA +++QD ED
Sbjct: 206 -----------------SSTGLKLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLED 248

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
           T+  PF++C+ +GK + +MCSYNQVNG P C    L +   R +WG  GYI SDCD+V  
Sbjct: 249 TYDVPFKACVVEGKVASVMCSYNQVNGKPTCADPILLKNTIRGQWGLNGYIVSDCDSVGV 308

Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
           +++ Q+YT T E++AA  +KAG+D++CG  +  HT++A+ KG + E+D++ AL N  +VQ
Sbjct: 309 LYDNQHYTSTPEEAAAATIKAGLDLDCGPFLAIHTENAVKKGLLVEEDVNLALANTITVQ 368

Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
           +RLG+F+G+P    YG LGP DVCT  H++LAL+AARQGIVLL+N  + LPL+ +   ++
Sbjct: 369 MRLGMFDGEPSAHPYGNLGPRDVCTPAHQELALEAARQGIVLLENRGQALPLSSSRHHTI 428

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
           A+IGP  +    M G Y GI C   S L+G+  Y +KT + +GC DV C+S+  F  A  
Sbjct: 429 AVIGPNSDVTVTMIGNYAGIACKYTSPLQGISRY-AKTLHQNGCGDVACHSNQQFGAAEA 487

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
            A++AD  ++V GLD + E E RDRV LLLPG Q  LV+ VAR S+ P ILVL  GGP+D
Sbjct: 488 AARQADATVLVMGLDQSIEAEFRDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPID 547

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMN 631
           VSFA+ D ++ +ILW GYPG+AG  A+A+++FG  NPGG+LPMTWYP+ +  KVPM +M 
Sbjct: 548 VSFAKNDPRVGAILWAGYPGQAGGAAIADVLFGTTNPGGKLPMTWYPQGYLAKVPMTNMG 607

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
           MR D +  YPGR+YRFY G  V+ FGHG+SYT++S+    AP E+++  +     +  I 
Sbjct: 608 MRPDPATGYPGRTYRFYKGNVVFPFGHGMSYTSFSHSLTQAPKEVSLPITNLYALNTTIS 667

Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
            +       + +  +   TSL   + I+V N G +DG+H +++F+  P   + +  KQLI
Sbjct: 668 SKA------IRVSHINCQTSLG--IDINVKNTGTMDGTHTLLVFSSPPSGEKESSNKQLI 719

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           GF++V  VA    ++   +  C+ LS  ++ G R +P+G+H + +G+L+HS++++
Sbjct: 720 GFEKVDLVAGSQIQVKIDIHVCKHLSAVDRFGIRRIPIGDHHIYIGDLKHSISLQ 774


>gi|225428983|ref|XP_002264114.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 818

 Score =  730 bits (1884), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/766 (46%), Positives = 498/766 (65%), Gaps = 43/766 (5%)

Query: 47  PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
           P+   +PFCNTSL    RA  L+S LTLQEK +QL ++A+ I RLG+P YEWWSE+LHG+
Sbjct: 58  PNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGV 117

Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWA 166
           +++G GV+F+  + +VT FP V++SAASFN SLW  +G  V+ E RAMYN+GQAGLT+W+
Sbjct: 118 SNSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWS 177

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           PN+NIFRDPRWGRGQETPGEDP+VVS YAV +V                      R L+ 
Sbjct: 178 PNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYV----------------------RGLQE 215

Query: 227 FGEESD-RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
            G+E +   D L +S+CCKH  AYD++KW    R+ F+A +T QD EDT+QPPF+SC+E+
Sbjct: 216 VGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEE 275

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           G  S +MCSYN+VNGVP C   +L +   R++WG  GYI SDCD++    E  NYT+T E
Sbjct: 276 GHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNYTETPE 335

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           D+ A  LKAG+++NCG+ +  +T++A++ GKV+E  +++AL+  + V +RLG F+GDP  
Sbjct: 336 DAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTM 395

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             +GK+GP DVCT +H+ LALDAA+QGIVLL N+   LPL+ N   +LA+IGP  +  + 
Sbjct: 396 LPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNNGA-LPLSPNTTKTLAVIGPNADATNT 454

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
           M   Y G+PC   S L+GL+ YVS   Y  GC +V C+ +     A  IA  AD  +VV 
Sbjct: 455 MLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVV 514

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           GLDL  E ED DRV+L LPG Q  LV   A+ +   VILV+   GP+D+SF +  S+I  
Sbjct: 515 GLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGG 574

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGR 643
           ILW+GYPG+AG  A++++IFGD+NPGGR P TWYP+ +  +VPM DMNMR +++  +PGR
Sbjct: 575 ILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATSNFPGR 634

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL----------QQ 693
           +YRFYTG  +Y FGHGLSY+ + YKF+ +     +   L      NI             
Sbjct: 635 TYRFYTGKSLYQFGHGLSYSTF-YKFIKSAPTTVLVHLLPQMDMPNIFSSNYPTMPNPNT 693

Query: 694 TGSRLDYVHIDEVTSCTSL-RFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLI 751
            G  +D   ID    C +L    + I V NAG++DG+HVV+ F + P+   +G P  +L+
Sbjct: 694 NGQAIDISAID----CRNLSNIDIVIGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELV 749

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           GF+RV      ++ +   +D C ++S  ++ G+R L +G H L+VG
Sbjct: 750 GFERVEVKRGKTEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVG 795


>gi|356572781|ref|XP_003554544.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 771

 Score =  729 bits (1883), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/770 (45%), Positives = 487/770 (63%), Gaps = 39/770 (5%)

Query: 42  FPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           F C P +       FC  SL+I+ R K LI  LTL+EK++ L +NA+A+PRLG+  YEWW
Sbjct: 27  FACDPKNGGTKKMAFCKVSLAIAERVKDLIGRLTLEEKVRLLVNNAAAVPRLGMKGYEWW 86

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE+LHG+++ GP V FN    + TSFPQV+ +AASFN SLW  IG  V+ EARAMYN G 
Sbjct: 87  SEALHGVSNLGPAVKFNAQFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGGT 146

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           AGLT+W+PN+NIFRDPRWGRGQETPGEDP++   YA  +V+  QG +             
Sbjct: 147 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGTYAATYVRGLQGTH------------- 193

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
                          + L ++ACCKH  AYDL+ W    R+ FNA +++QD EDTF  PF
Sbjct: 194 --------------ANRLKVAACCKHFTAYDLDNWNGMDRFHFNAQVSKQDIEDTFDVPF 239

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQN 338
           + C+ +GK + +MCSYNQVNGVP C   +L +K  R  W   GYI SDCD+V   ++ Q+
Sbjct: 240 KMCVSEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQH 299

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           YT T E++AA  +KAG+D++CG  +  HTQ+A+ KG + E D++ AL+N  +VQ+RLG+F
Sbjct: 300 YTPTPEEAAADAIKAGLDLDCGPFLAVHTQNAVKKGLLSEADVNGALVNTLTVQMRLGMF 359

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +G+P    YG LGP DVC   H++LAL+AARQGIVLLKN    LPL+     ++A+IGP 
Sbjct: 360 DGEPTAHPYGHLGPKDVCKPAHQELALEAARQGIVLLKNTGPVLPLSSQLHRTVAVIGPN 419

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
                 M G Y G+ C   + L+G+  Y ++T +  GC +V C +D  F  A+  A++AD
Sbjct: 420 SKATITMIGNYAGVACGYTNPLQGIGRY-ARTVHQLGCQNVACKNDKLFGPAINAARQAD 478

Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
             ++V GLD + E E  DR  LLLPG+Q  LV+ VA  SK P ILVL  GGP+D++FA+ 
Sbjct: 479 ATVLVMGLDQSIEAETVDRTGLLLPGRQPDLVSKVAAASKGPTILVLMSGGPVDITFAKN 538

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSS 637
           + +I  ILW GYPG+AG  A+A+I+FG  NPGG+LP+TWYPE + TK+PM +M MRA  S
Sbjct: 539 NPRIVGILWAGYPGQAGGAAIADILFGTANPGGKLPVTWYPEEYLTKLPMTNMAMRATKS 598

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
             YPGR+YRFY G  VY FGHGL+YT++ +   SAP+   +S  L      N+   +   
Sbjct: 599 AGYPGRTYRFYNGPVVYPFGHGLTYTHFVHTLASAPT--VVSVPLNGHRRANVTNISNRA 656

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK-VSQGTPEKQLIGFDRV 756
           +   H      C  L   +Q+ + N G  DG+H +++F+  P        EKQL+ F++V
Sbjct: 657 IRVTH----ARCDKLSITLQVDIKNVGSRDGTHTLLVFSAPPAGFGHWALEKQLVAFEKV 712

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           H  AKG   +   +  C+ LS+ ++ G R +PLG H   +G+++HS++++
Sbjct: 713 HVPAKGQHRVGVNIHVCKLLSVVDRSGIRRIPLGEHSFNIGDVKHSVSLQ 762


>gi|298364130|gb|ADI79208.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Malus x domestica]
          Length = 774

 Score =  729 bits (1882), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/773 (46%), Positives = 490/773 (63%), Gaps = 40/773 (5%)

Query: 39  KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           +P F C P  P   +  FC   + I  R + LI  LTLQEKI  L +NA A+PRLGI  Y
Sbjct: 29  RPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQGY 88

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           EWWSE+LHG+++ GPG  F GT    TSFPQV+ +AASFN SLW  IG  V+ EARAMYN
Sbjct: 89  EWWSEALHGVSNVGPGTKF-GTFLGATSFPQVITTAASFNESLWEEIGRVVSDEARAMYN 147

Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
            G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ + Y   +VK  QG      DG    
Sbjct: 148 GGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPILAAKYGARYVKGLQG------DGA--- 198

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                            G+ L ++ACCKH  AYDL+ W    R+ FNA +++QD EDT+ 
Sbjct: 199 -----------------GNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYN 241

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
            PFR+C+  G  + +MCSYNQVNG P C   +L +   R +W   GYI SDCD+V   ++
Sbjct: 242 VPFRACVVDGNVASVMCSYNQVNGKPTCADPELLKGTIRGQWKLNGYIVSDCDSVGVYYD 301

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q+YTKT E++AA  +KAG+D++CG  +  HT++A+  G+V E DI+ AL N  +VQ+RL
Sbjct: 302 NQHYTKTPEEAAAYAIKAGLDLDCGPFLGIHTEAAVRFGQVNEIDINYALANTITVQMRL 361

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G+F+G+P   +YG LG  DVC     +LAL+AARQGIVLL+N    LPL+     ++A+I
Sbjct: 362 GMFDGEPSAQRYGNLGLADVCKPSSNELALEAARQGIVLLENRGNSLPLSTMRHRTVAVI 421

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           GP  +    M G Y GI C   + L+G+  Y    H A GC DV CN +     A   A+
Sbjct: 422 GPNSDVTETMIGNYAGIACGYTTPLQGIARYTRTIHQA-GCTDVHCNGNQLIGAAEVAAR 480

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           +AD  ++V GLD + E E RDR  LLLPG Q  LV+ VAR S+ P ILV+  GGP+DV+F
Sbjct: 481 QADATVLVIGLDQSIEAEFRDRTDLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTF 540

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
           A+ D +I +I+W+GYPG+AG  A+A+++FG  NP G+LPMTWYP+++   +PM DM MRA
Sbjct: 541 AKNDPRIGAIIWVGYPGQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRA 600

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQ 693
           D +R YPGR+YRFY G  V+ FG GLSYT +S+     P+ +++   SL A  +  +L  
Sbjct: 601 DPARGYPGRTYRFYKGPVVFPFGLGLSYTRFSHSLAQGPTLVSVPFTSLVASKNTTMLGN 660

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
              R+ +      T+C SL   V I + N+G +DG+H +++FA  P   +  P KQL+GF
Sbjct: 661 HDIRVSH------TNCDSLSLDVHIDIKNSGTMDGTHTLLVFA-TPPTGKWAPNKQLVGF 713

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            +VH VA   + +  GV  C+ LS+ ++ G R +PLG H L +G+L+H +++E
Sbjct: 714 HKVHIVAGSERRVRVGVQVCKHLSVVDELGIRRIPLGQHKLEIGDLQHHVSVE 766


>gi|357444469|ref|XP_003592512.1| Xylosidase [Medicago truncatula]
 gi|355481560|gb|AES62763.1| Xylosidase [Medicago truncatula]
          Length = 781

 Score =  729 bits (1881), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/755 (48%), Positives = 495/755 (65%), Gaps = 30/755 (3%)

Query: 47  PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
           P   ++PFCNTSLS  TRAK L+S LTLQEK QQL + ++ I RLG+PAYEWWSE+LHG+
Sbjct: 31  PKTSNFPFCNTSLSYETRAKDLVSRLTLQEKAQQLVNPSTGISRLGVPAYEWWSEALHGV 90

Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWA 166
           ++ GPG  F+  V   TSFP V++SAASFN +LW  +G  V+ EARAMYN+  AGLTFW+
Sbjct: 91  SNVGPGTRFDSRVPGATSFPAVILSAASFNETLWYTMGQVVSNEARAMYNVDLAGLTFWS 150

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           PN+N+FRDPRWGRGQETPGEDP+VVS YAV +V+  Q          +G           
Sbjct: 151 PNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQE---------VG----------- 190

Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
             E S +GD L +S+CCKH  AYD++ W    R+ F+A +T+QD EDT+QPPF+SC+ +G
Sbjct: 191 -DEASAKGDRLKVSSCCKHYTAYDVDNWKGVDRFHFDAKVTKQDLEDTYQPPFKSCVLEG 249

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
             S +MCSYN+VNG+P C   DL Q   R +WG  GYI SDCD+V   +   +YTKT ED
Sbjct: 250 HVSSVMCSYNRVNGIPTCADPDLLQGVIRGQWGLDGYIVSDCDSVEVYYNSIHYTKTPED 309

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           + A  LKAG+++NCG  + ++T +A++  KV    +D+AL+  + V +RLG F  +P+  
Sbjct: 310 AVALALKAGLNMNCGDFLKKYTANAVNLKKVDVSIVDQALVYNYIVLMRLGFFE-NPKSL 368

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            +  LGP DVCT E+++LAL+AA+QGIVLL+N+K  LPL+K  + +LA+IGP  N  + M
Sbjct: 369 PFANLGPSDVCTKENQQLALEAAKQGIVLLENNKGALPLSKTKIKNLAVIGPNANATTVM 428

Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
              Y GIPC   S L+GL+ Y+S   YA GC DV C++   F  AV+ A  AD V++V G
Sbjct: 429 ISNYAGIPCRYSSPLQGLQKYISSVTYARGCSDVKCSNQNLFAAAVKAAASADAVVLVVG 488

Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
           LD + E E  DRV+L LPG Q  LV  VA  +K  +ILV+   GP+D+SF ++ S I  I
Sbjct: 489 LDQSIEAEGLDRVNLTLPGFQEKLVKDVAAATKGTLILVIMAAGPIDISFTKSVSNIGGI 548

Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRS 644
           LW+GYPG+ G  A+A++IFGD+NPGGR P TWYP+S+  +VPM DMNMRA+SSR +PGR+
Sbjct: 549 LWVGYPGQDGGNAIAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANSSRNFPGRT 608

Query: 645 YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID 704
           YRFY G  +Y FG+GLSY+ +S    SAPS + +  +       N +      +D   I 
Sbjct: 609 YRFYNGKSLYEFGYGLSYSTFSTHIASAPSTIMLQKNTSISKPLNNIFLDDQVIDISTI- 667

Query: 705 EVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP--KVSQGTPEKQLIGFDRVHTVAKG 762
              SC +L F + I V N G  DGSHVV++F   P  +   G P KQLIGF+R       
Sbjct: 668 ---SCFNLTFSLVIGVKNNGPFDGSHVVLVFLEPPSSEAVSGVPLKQLIGFERAQVKVGK 724

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           ++ ++  +D C+ LS  +  G+R L +G H ++VG
Sbjct: 725 TEFVTVKIDICKMLSNVDSDGKRKLVIGQHNILVG 759


>gi|357130854|ref|XP_003567059.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
           distachyon]
          Length = 779

 Score =  729 bits (1881), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/778 (45%), Positives = 491/778 (63%), Gaps = 38/778 (4%)

Query: 38  NKPDFPCKPPHFDS-YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
            +P F C P    +  PFC  +L    RA+ L++ LT  EK++ L +NA+ +PRLG+  Y
Sbjct: 23  TRPPFACAPGGPSTRLPFCRQALPPRARARDLVARLTRAEKVRLLVNNAAGVPRLGVEGY 82

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           EWWSE+LHG++  GPGV F G     T+FPQV+ +AASFN SLW  IG AV+ E RA+YN
Sbjct: 83  EWWSEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASFNASLWELIGRAVSDEGRAIYN 142

Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
             QAGLTFW+PN+NIFRDPRWGRGQETPGEDP V   YA  +V+  Q ++          
Sbjct: 143 GRQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSGRYAAAYVRGLQQQH---------- 192

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                               L  +ACCKH  AYDL++W    R+ FNA++T QD EDTF 
Sbjct: 193 -----------------AGRLKTAACCKHFTAYDLDRWSGADRFHFNAIVTPQDLEDTFN 235

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFE 335
            PFR+C+ +G+A+ +MCSYNQVNGVP C  +G L    R +W   GYI SDCD+V   + 
Sbjct: 236 APFRACVVEGRAAAVMCSYNQVNGVPTCADQGFLRGTIRGKWKLDGYIVSDCDSVDVFYR 295

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q+YT+T ED+ A  L+AG+D++CG  + ++T++A+ +GKV+E DID A++N  +VQ+RL
Sbjct: 296 EQHYTRTREDAVAATLRAGLDLDCGPFLAQYTEAAVAQGKVKEADIDAAVVNTVTVQMRL 355

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN---DKKFLPLN-KNAVSS 451
           G+F+GD     +G LGP  VCT  H++LAL+AA Q IVLLKN   +   LPL+  +   +
Sbjct: 356 GMFDGDVAAQPFGHLGPQHVCTPAHRELALEAACQSIVLLKNGGGNNMRLPLSSHHRRGT 415

Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN-SDAGFHEA 510
           +A++GP       M G Y G PC+  + L+G+  Y   T + +GC DV C  S      A
Sbjct: 416 VAVVGPHSEATVAMIGNYAGKPCAYTTPLQGVGRYARATVHQAGCTDVACQGSGQPIDAA 475

Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           V  A+ AD  +VV GLD + E E  DR +LLLPG+Q  LV++VAR SK PVILVL  GGP
Sbjct: 476 VDAARHADATVVVVGLDQSVEAEGLDRTTLLLPGRQAELVSAVARASKGPVILVLMSGGP 535

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMND 629
           +D++FA+ D  +++ILW GYPG+AG +A+A++IFG  NPGG+LP+TWYPE +  K PM +
Sbjct: 536 VDIAFAQNDRNVAAILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPEDYLRKAPMTN 595

Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
           M MRAD +R YPGR+YRFY G  ++ FGHGLSYT +++    AP+ LT+  +    +   
Sbjct: 596 MAMRADPARGYPGRTYRFYAGPTIHPFGHGLSYTKFAHTLAHAPAHLTVRRAAGHRTTAA 655

Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEK 748
           I   T S L+ V +     C  L   V + V N G  DG+H V ++A  P  +  G P +
Sbjct: 656 INTTTASHLNDVRVAH-AQCEGLSVSVHVDVKNVGSRDGAHTVFVYASPPIAAIHGAPVR 714

Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-ELRHSLTI 805
           QL+ F++VH  A     +  GVD C  LSIA++ G R +P+G H LM+G EL H++ +
Sbjct: 715 QLVAFEKVHVAAGAVARVKMGVDVCGSLSIADQEGVRRIPIGEHRLMIGEELTHAVML 772


>gi|224070626|ref|XP_002303181.1| predicted protein [Populus trichocarpa]
 gi|222840613|gb|EEE78160.1| predicted protein [Populus trichocarpa]
          Length = 773

 Score =  728 bits (1880), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/791 (47%), Positives = 505/791 (63%), Gaps = 40/791 (5%)

Query: 21  LIVVVNVIAFSNSKPVLNKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEK 77
           L+  + ++  S      + P F C     P   S  FCNTS+ I+ R   L+  LTLQEK
Sbjct: 17  LLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGINDRVVDLVKRLTLQEK 76

Query: 78  IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNR 137
           I  L ++A  + RLGIP YEWWSE+LHG++  GPG +F+  V+  TSFPQV+++AASFN 
Sbjct: 77  IVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVILTAASFNT 136

Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
           SL+  IG  V+ EARAMYN+G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S Y   
Sbjct: 137 SLFEAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSC 196

Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNF 257
           +VK  Q    + DDG                      D+L ++ACCKH  AYDL+ W   
Sbjct: 197 YVKGLQ----QRDDG--------------------DPDKLKVAACCKHYTAYDLDNWKGS 232

Query: 258 SRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNE 316
            RY FNA++T+QD +DTFQPPF+SC+  G  + +MCSYNQVNG P C   DL     R E
Sbjct: 233 DRYHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGE 292

Query: 317 WGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKV 376
           W   GYI +DCD++   ++ QNYTKT E++AA  + AG+D+NCG+ + +HT++A+  G V
Sbjct: 293 WNLNGYIVTDCDSLDVFYKSQNYTKTPEEAAAAAILAGVDLNCGSFLGQHTEAAVKGGLV 352

Query: 377 QEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLK 436
            E  ID A+ N F+  +RLG F+GDP K  YGKLGP DVCT+E+++LA +AARQGIVLLK
Sbjct: 353 NEHAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLK 412

Query: 437 NDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
           N    LPL+  A+ +LA+IGP  N    M G Y G PC   + L+GL A V+ T Y  GC
Sbjct: 413 NTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAASVATT-YLPGC 471

Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
            +V C S A   +A ++A  AD  ++V G DL+ E E RDRV +LLPGQQ  L+T+VA  
Sbjct: 472 SNVAC-STAQVDDAKKLAAAADATVLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANV 530

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
           S  PVILV+  GG +DVSFA  + +I+SILW+GYPGEAG  A+A+IIFG +NP GRLPMT
Sbjct: 531 SCGPVILVIMSGGGMDVSFARTNDKITSILWVGYPGEAGGAAIADIIFGYYNPSGRLPMT 590

Query: 617 WYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
           WYP+S+  KVPM +MNMR D S  YPGR+YRFYTG  VY FG GLSY+ ++++ + AP  
Sbjct: 591 WYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSQFTHELIQAPQL 650

Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
           + +          +  Q          +    +C +  F + + V N G + GSH V LF
Sbjct: 651 VYVPLEESHVCHSSECQSV--------VASEQTCQNSTFDMLLRVKNEGTISGSHTVFLF 702

Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
           +  P V   +P+K L+GF++V   A+  + + F VD C+ LS+ ++ G + + LG HVL 
Sbjct: 703 SSPPAV-HNSPQKHLVGFEKVFLNAQTGRHVRFKVDICKDLSVVDELGSKKVALGEHVLH 761

Query: 796 VGELRHSLTIE 806
           VG L+H L++ 
Sbjct: 762 VGSLKHFLSVR 772


>gi|147844622|emb|CAN82161.1| hypothetical protein VITISV_035506 [Vitis vinifera]
          Length = 925

 Score =  728 bits (1879), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/766 (46%), Positives = 497/766 (64%), Gaps = 43/766 (5%)

Query: 47  PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
           P+   +PFCNTSL    RA  L+S LTLQEK +QL ++A+ I RLG+P YEWWSE+LHG+
Sbjct: 34  PNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGV 93

Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWA 166
           +++G GV+F+  + +VT FP V++SAASFN SLW  +G  V+ E RAMYN+GQAGLT+W+
Sbjct: 94  SNSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWS 153

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           PN+NIFRDPRWGRGQETPGEDP+VVS YAV +V                      R L+ 
Sbjct: 154 PNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYV----------------------RGLQE 191

Query: 227 FGEESD-RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
            G+E +   D L +S+CCKH  AYD++KW    R+ F+A +T QD EDT+QPPF+ C+E+
Sbjct: 192 VGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKXCVEE 251

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           G  S +MCSYN+VNGVP C   +L +   R++WG  GYI SDCD++    E  NYT+T E
Sbjct: 252 GHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNYTETPE 311

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           D+ A  LKAG+++NCG+ +  +T++A++ GKV+E  +B+AL+  + V +RLG F+GDP  
Sbjct: 312 DAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVBQALIYNYIVLMRLGFFDGDPTM 371

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             +GK+GP DVCT +H+ LALDAA+QGIVLL N+   LPL+ N   +LA+IGP  +  + 
Sbjct: 372 LPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNNGA-LPLSPNTTKTLAVIGPNADATNT 430

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
           M   Y G+PC   S L+GL+ YVS   Y  GC +V C+ +     A  IA  AD  +VV 
Sbjct: 431 MLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVV 490

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           GLDL  E ED DRV+L LPG Q  LV   A+ +   VILV+   GP+D+SF +  S+I  
Sbjct: 491 GLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGG 550

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGR 643
           ILW+GYPG+AG  A++++IFGD+NPGGR P TWYP+ +  +VPM DMNMR +++  +PGR
Sbjct: 551 ILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATXNFPGR 610

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL----------QQ 693
           +YRFYTG  +Y FGHGLSY+ + YKF+ +     +   L      NI             
Sbjct: 611 TYRFYTGKSLYQFGHGLSYSTF-YKFIKSAPXTVLVHLLPQMDMPNIFSSNYPTMPNPNT 669

Query: 694 TGSRLDYVHIDEVTSCTSL-RFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLI 751
            G  +D   ID    C +L    + I V NAG++DG+HVV+ F + P+   +G P  +L+
Sbjct: 670 NGQAIDISAID----CRNLSNIDIVIGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELV 725

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           GF+RV      ++ +   +D C ++S  ++ G+R L +G H L+VG
Sbjct: 726 GFERVEVKRGKTEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVG 771


>gi|449436749|ref|XP_004136155.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 772

 Score =  727 bits (1877), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/796 (44%), Positives = 505/796 (63%), Gaps = 45/796 (5%)

Query: 17  RVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTL 74
           R++ ++++++ I F +      +  F C P       YPFC  +L I  R K LI  LTL
Sbjct: 7   RIIPILIILSAI-FRHGGGA--REPFACDPKDAALSRYPFCRVALPIPERVKDLIGRLTL 63

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           QEK++ L +NA+A+PRLGI  YEWWSE+LHG+++ GPG  F G     TSFPQV+ + AS
Sbjct: 64  QEKVRLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVAS 123

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
           FN SLW  IG  V+ EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP+V   Y
Sbjct: 124 FNVSLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEY 183

Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
           A  ++K  QG +                           GD L ++ACCKH  AYDL+ W
Sbjct: 184 AARYIKGLQGND---------------------------GDRLKVAACCKHFTAYDLDNW 216

Query: 255 GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA- 313
               R+ FNA +T QD  DTF+ PFR C+++GK + +MCSYNQVNGVP C   +L +   
Sbjct: 217 NGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVASVMCSYNQVNGVPTCADPNLLKGTI 276

Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
           RN+WG  GYI SDCD+V   ++ Q+YT T E++AA  +KAG+D++CG  +  HT+ A+ K
Sbjct: 277 RNQWGLNGYIVSDCDSVGVFYDNQHYTSTAEEAAADAIKAGLDLDCGPFLAVHTEDAVKK 336

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
           G + +  I+ AL N  +VQ+RLG+F+G P    YGKLGP +VC+  H++LALDAARQGIV
Sbjct: 337 GLLTQTHINNALANTITVQMRLGMFDGAPSSHAYGKLGPKNVCSPSHQQLALDAARQGIV 396

Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYA 493
           LLKN    LPL+ +   ++A+IGP  +    M G Y G+ C   + L G++ Y +  H  
Sbjct: 397 LLKNRLPGLPLSADHHRTVAVIGPNSDVNVTMIGNYAGVACGYVTPLEGIKRYTTVVH-R 455

Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
            GC +V C +D  F +A+  A  AD  ++V GLD + E E +DR  LLLPG+Q  LV  V
Sbjct: 456 KGCDNVACATDYSFTDALAAASTADATVLVMGLDQSVEAETKDRDGLLLPGRQQELVLKV 515

Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
           A  S+ P +++L  GGP+DVSFA+ D +IS+ILW+GYPG+AG  A+A+++FG  NPGG+L
Sbjct: 516 AAASRGPTVVILMSGGPIDVSFADNDPRISAILWVGYPGQAGGAAIADVLFGTTNPGGKL 575

Query: 614 PMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
           PMTWYP+S+ + +PM +M MR+ SS  YPGR+YRFY G  VY FGHGLSYTN+ +  + A
Sbjct: 576 PMTWYPQSYLSNLPMTNMAMRSTSS--YPGRTYRFYAGPVVYEFGHGLSYTNFIHTIVKA 633

Query: 673 PSELTISAS-LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
           P+ ++IS S  +     + L     R+ +        C  L   + + V N GD DG H 
Sbjct: 634 PTIVSISLSGHRQTHSASTLSSKAIRVTH------AKCQKLSLVIHVDVENKGDRDGFHT 687

Query: 732 VMLFARVP-KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLG 790
           +++F+  P   +   P KQL+ F+++H  ++  + +   V  C+ LS+ +K G R +PLG
Sbjct: 688 MLVFSTPPANGATWVPRKQLVAFEKLHLASREKRRLQVHVHVCKYLSVVDKLGVRRIPLG 747

Query: 791 NHVLMVGELRHSLTIE 806
           +H + +G ++H+++++
Sbjct: 748 DHYIHIGNVKHTVSLQ 763


>gi|413919688|gb|AFW59620.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 773

 Score =  725 bits (1872), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/770 (47%), Positives = 496/770 (64%), Gaps = 40/770 (5%)

Query: 40  PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P F C   +    SY FCN S + + RA  L+S LTL EK+  L D  +A+PRLG+P YE
Sbjct: 38  PAFACDASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYE 97

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG++  GPG  F+  V   TSFPQ +++AASFN +L+  IG  V+ EARAM+N+
Sbjct: 98  WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNV 157

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
           G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V   QG             
Sbjct: 158 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQG------------- 204

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                 + G G        L ++ACCKH  AYD++ W    RY+F+A++++QD +DTFQP
Sbjct: 205 -----AVSGAGA-------LKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQP 252

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
           PF+SC+  G  + +MCSYNQVNG P C   DL     R +W   GYI+SDCD+V  ++  
Sbjct: 253 PFKSCVVDGNVASVMCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNN 312

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           Q+YTKT ED+AA  +KAG+D+NCGT + +HT +A+  GK+ E D+DRA+ N     +RLG
Sbjct: 313 QHYTKTPEDAAAISIKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLG 372

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
            F+GDPR+  +G LGP DVCT  +++LA +AARQGIVLLKN  K LPL+  ++ S+A+IG
Sbjct: 373 FFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSMAVIG 431

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
           P  N    M G Y G PC   + L+GL A V+ T Y  GC +V C+ ++     A + A 
Sbjct: 432 PNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAA 490

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD  ++V G D + E E  DR SLLLPGQQ  LV++VA  S  P ILV+  GGP D+SF
Sbjct: 491 SADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISF 550

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
           A++  +I++ILW+GYPGEAG  A+A+++FG  NP GRLP+TWYPESFTKVPM DM MR D
Sbjct: 551 AKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPD 610

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
            S  YPGR+YRFYTG  VY FG GLSYT++++  +SAP +L    +L+       L +  
Sbjct: 611 PSTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQL----ALQLAEGHACLTEQC 666

Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
             ++     E   C  L F V + V NAG+  G H V LF+  P V    P K L+GF++
Sbjct: 667 PSVEA----EGAHCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAV-HNAPAKHLLGFEK 721

Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           V      +  ++F VD C+ LS+ ++ G R + LG+H L VG+L+H+L +
Sbjct: 722 VSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNL 771


>gi|15242492|ref|NP_196535.1| beta-xylosidase 3 [Arabidopsis thaliana]
 gi|75264323|sp|Q9LXD6.1|BXL3_ARATH RecName: Full=Beta-D-xylosidase 3; Short=AtBXL3; AltName:
           Full=Alpha-L-arabinofuranosidase; Flags: Precursor
 gi|7671416|emb|CAB89357.1| beta-xylosidase-like protein [Arabidopsis thaliana]
 gi|9759004|dbj|BAB09531.1| beta-xylosidase [Arabidopsis thaliana]
 gi|15450735|gb|AAK96639.1| AT5g09730/F17I14_80 [Arabidopsis thaliana]
 gi|332004056|gb|AED91439.1| beta-xylosidase 3 [Arabidopsis thaliana]
          Length = 773

 Score =  725 bits (1872), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/804 (46%), Positives = 496/804 (61%), Gaps = 45/804 (5%)

Query: 11  RERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCK---PPHFDSYPFCNTSLSISTRAKS 67
           R R LF V  L +   ++  S      + P F C     P      FCN  LSI  R   
Sbjct: 4   RNRALFSVSTLFLCF-IVCISEQSNNQSSPVFACDVTGNPSLAGLRFCNAGLSIKARVTD 62

Query: 68  LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQ 127
           L+  LTL+EKI  L+  A  + RLGIP+Y+WWSE+LHG+++ G G  F G V   TSFPQ
Sbjct: 63  LVGRLTLEEKIGFLTSKAIGVSRLGIPSYKWWSEALHGVSNVGGGSRFTGQVPGATSFPQ 122

Query: 128 VLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGED 187
           V+++AASFN SL+  IG  V+ EARAMYN+G AGLTFW+PN+NIFRDPRWGRGQETPGED
Sbjct: 123 VILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQETPGED 182

Query: 188 PMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD--ELMLSACCKH 245
           P + S YAV +VK  Q                          E+D GD   L ++ACCKH
Sbjct: 183 PTLSSKYAVAYVKGLQ--------------------------ETDGGDPNRLKVAACCKH 216

Query: 246 LIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL 305
             AYD++ W N +R +FNA++ +QD  DTFQPPF+SC+  G  + +MCSYNQVNG P C 
Sbjct: 217 YTAYDIDNWRNVNRLTFNAVVNQQDLADTFQPPFKSCVVDGHVASVMCSYNQVNGKPTCA 276

Query: 306 RGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML 364
             DL     R +W   GYI SDCD+V  +F  Q+Y KT E++ A  L AG+D+NC     
Sbjct: 277 DPDLLSGVIRGQWQLNGYIVSDCDSVDVLFRKQHYAKTPEEAVAKSLLAGLDLNCDHFNG 336

Query: 365 RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLA 424
           +H   A+  G V E  ID+A+ N F+  +RLG F+GDP+K  YG LGP DVCT+++++LA
Sbjct: 337 QHAMGAVKAGLVNETAIDKAISNNFATLMRLGFFDGDPKKQLYGGLGPKDVCTADNQELA 396

Query: 425 LDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLE 484
            D ARQGIVLLKN    LPL+ +A+ +LA+IGP  N    M G Y G+PC   + L+GL 
Sbjct: 397 RDGARQGIVLLKNSAGSLPLSPSAIKTLAVIGPNANATETMIGNYHGVPCKYTTPLQGLA 456

Query: 485 AYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
             VS T Y  GC +V C  DA    AV +A  AD V++V G D + E E  DRV L LPG
Sbjct: 457 ETVSST-YQLGC-NVAC-VDADIGSAVDLAASADAVVLVVGADQSIEREGHDRVDLYLPG 513

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q  LVT VA  ++ PV+LV+  GG  D++FA+ D +I+SI+W+GYPGEAG  A+A++IF
Sbjct: 514 KQQELVTRVAMAARGPVVLVIMSGGGFDITFAKNDKKITSIMWVGYPGEAGGLAIADVIF 573

Query: 605 GDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           G  NP G LPMTWYP+S+  KVPM++MNMR D S+ YPGRSYRFYTG  VY F   L+YT
Sbjct: 574 GRHNPSGNLPMTWYPQSYVEKVPMSNMNMRPDKSKGYPGRSYRFYTGETVYAFADALTYT 633

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV--HIDEVTSCTSLRFHVQISVT 721
            + ++ + AP  +++S         +  Q     LD +  H +      S  F V ++V 
Sbjct: 634 KFDHQLIKAPRLVSLSLDENHPCRSSECQS----LDAIGPHCENAVEGGS-DFEVHLNVK 688

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N GD  GSH V LF   P+V  G+P KQL+GF+++         + F V+ C+ LS+ ++
Sbjct: 689 NTGDRAGSHTVFLFTTSPQV-HGSPIKQLLGFEKIRLGKSEEAVVRFNVNVCKDLSVVDE 747

Query: 782 HGRRILPLGNHVLMVGELRHSLTI 805
            G+R + LG+H+L VG L+HSL I
Sbjct: 748 TGKRKIALGHHLLHVGSLKHSLNI 771


>gi|357511337|ref|XP_003625957.1| Beta-xylosidase [Medicago truncatula]
 gi|355500972|gb|AES82175.1| Beta-xylosidase [Medicago truncatula]
          Length = 771

 Score =  725 bits (1871), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/758 (45%), Positives = 489/758 (64%), Gaps = 40/758 (5%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           PFCN  L+I  R K LI  LT+QEK+  L +NA A+PR+G+ +YEWWSE+LHG+++ GPG
Sbjct: 41  PFCNVKLAIPERVKDLIGRLTMQEKVNLLVNNAPAVPRVGMKSYEWWSEALHGVSNVGPG 100

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
             F G   + TSFPQV+ +AASFN SLW  IG  V+ EARAMYN G AGLT+W+PN+NIF
Sbjct: 101 TRFGGVFPAATSFPQVITTAASFNASLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIF 160

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRGQETPGEDP++   YA  +VK  QG +                          
Sbjct: 161 RDPRWGRGQETPGEDPVLAGRYAASYVKGLQGTD-------------------------- 194

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
            G++L ++ACCKH  AYD++ W    R+ FNA++++QD EDTF  PFR C+++GK + +M
Sbjct: 195 -GNKLKVAACCKHFTAYDVDNWNGVDRFHFNALVSKQDIEDTFDVPFRMCVKEGKVASVM 253

Query: 293 CSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
           CSYNQVNGVP C   +L +K  R  WG  GYI SDCD+V  ++  Q+YT T E++AA  +
Sbjct: 254 CSYNQVNGVPTCADPNLLKKTVRGVWGLDGYIVSDCDSVGVLYNSQHYTSTPEEAAADAI 313

Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
           KAG+D++CG  +  HTQ A+ KG + E D++ AL+N   VQ+RLG+F+G+P    YG+LG
Sbjct: 314 KAGLDLDCGPFLGVHTQDAVKKGLLTEADVNNALVNTLKVQMRLGMFDGEPSAQAYGRLG 373

Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
           P DVC   H++LAL+AARQGIVLLKN    LPL+     ++A+IGP  +    M G Y G
Sbjct: 374 PKDVCKPAHQELALEAARQGIVLLKNTGPTLPLSPQRHRTVAVIGPNSDVTVTMIGNYAG 433

Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
           I C   S L+G+  Y +KT +  GC +V C  D  F  A+  A+ AD  I+V GLD + E
Sbjct: 434 IACGYTSPLQGIGRY-AKTIHQQGCSNVACRDDKQFGPALDAARHADATILVIGLDQSIE 492

Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
            E  DR SLLLPG Q  LV+ VA  SK P ILVL  GGP+D++FA+ D +++ ILW GYP
Sbjct: 493 AETVDRTSLLLPGHQQDLVSKVAAASKGPTILVLMSGGPVDITFAKNDPKVAGILWAGYP 552

Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTG 650
           G+AG  A+A+I+FG  +PGG+LP+TWYP+ + K + M +M MR  S   YPGR+YRFY G
Sbjct: 553 GQAGGAAIADILFGTASPGGKLPVTWYPQEYLKNLAMTNMAMRP-SKIGYPGRTYRFYKG 611

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
             VY FGHGL+YT++ ++  SAP+ +++     + G++ NI  +    +   H      C
Sbjct: 612 PVVYPFGHGLTYTHFVHELSSAPTVVSVPVHGHRHGNNTNISNKA---IRVTH----ARC 664

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPK-VSQGTPEKQLIGFDRVHTVAKGSKEISF 768
             L   + + V N G  DG+H +++F+  P   +   P+K L+ F++VH  AK  + +  
Sbjct: 665 GKLSIALHVDVKNVGSRDGTHTLLVFSAPPNGGNHWVPQKSLVAFEKVHVPAKTKQRVRV 724

Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            +  C+ LS+ +K G R +P+G H L +G+++HS++++
Sbjct: 725 NIHVCKLLSVVDKSGIRRIPMGEHSLHIGDVKHSVSLQ 762


>gi|408354266|gb|AFU54452.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
          Length = 775

 Score =  724 bits (1870), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/778 (46%), Positives = 497/778 (63%), Gaps = 41/778 (5%)

Query: 36  VLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGI 93
           V  +P F C P  P      FC  ++ I  R + LI  LTLQEKI+ L +NA A+PRLGI
Sbjct: 26  VHARPPFACDPHNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGI 85

Query: 94  PAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA 153
             YEWWSE+LHG+++ GPG  F G     TSFPQV+ +AASFN SLW  IG  V  EARA
Sbjct: 86  QGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRVVPDEARA 145

Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
           MYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ S YA  +VK  QG      DG 
Sbjct: 146 MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQG------DGA 199

Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
                               G+ L ++ACCKH  AYDL+ W   +R+ FNA +++QD  D
Sbjct: 200 --------------------GNRLKVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLAD 239

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
           T+  PF++C+ +G  + +MCSYNQVNG P C   DL +   R +W   GYI SDCD+V  
Sbjct: 240 TYNVPFKACVVEGHVASVMCSYNQVNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGV 299

Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
           ++E Q+YT+T E++AA  +KAG+D++CG  +  HT++A+ +G V + +I+ AL N  +VQ
Sbjct: 300 LYEEQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQ 359

Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
           +RLG+F+G+P   +YG LGP DVCT  H++LAL+AARQGIVLL+N  + LPL+     ++
Sbjct: 360 MRLGMFDGEPSAHQYGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTV 419

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
           A+IGP  +    M G Y G+ C   + L+G+  Y    H A GC DV CN +  F  A  
Sbjct: 420 AVIGPNSDVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEA 478

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
            A++AD  ++V GLD + E E  DRV LLLPG Q  LV+ VAR S+ P ILVL  GGP+D
Sbjct: 479 AARQADATVLVMGLDQSIEAEFVDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPID 538

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMN 631
           V+FA+ D +IS+I+W+GYPG+AG  A+A+++FG  NPGG+LPMTWYP+++ T +PM DM 
Sbjct: 539 VTFAKNDPRISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMA 598

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNI 690
           MRAD +R YPGR+YRFY G  V+ FG GLSYT +++     P+ +++   SLKA ++  +
Sbjct: 599 MRADPARGYPGRTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTSVSVPLTSLKATANSTM 658

Query: 691 LQQTGSRLDYVHIDEVTSCTSLR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
           L +    +   H D    C +L    V + V N G +DG+H +++F   P   +    KQ
Sbjct: 659 LSKA---VRVSHAD----CNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPD-GKWAASKQ 710

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
           L+GF ++H  A     +   V  C+ LS+ ++ G R +PLG H L +G+L H ++++T
Sbjct: 711 LVGFHKIHIAAGSETRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSLQT 768


>gi|408354264|gb|AFU54451.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
          Length = 775

 Score =  724 bits (1870), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/778 (46%), Positives = 497/778 (63%), Gaps = 41/778 (5%)

Query: 36  VLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGI 93
           V  +P F C P  P      FC  ++ I  R + LI  LTLQEKI+ L +NA A+PRLGI
Sbjct: 26  VHARPPFACDPHNPITRGLKFCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPRLGI 85

Query: 94  PAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA 153
             YEWWSE+LHG+++ GPG  F G     TSFPQV+ +AASFN SLW  IG  V  EARA
Sbjct: 86  QGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRGVPDEARA 145

Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
           MYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++ S YA  +VK  QG      DG 
Sbjct: 146 MYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQG------DGA 199

Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
                               G+ L ++ACCKH  AYDL+ W   +R+ FNA +++QD  D
Sbjct: 200 --------------------GNRLKVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLAD 239

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
           T+  PF++C+ +G  + +MCSYNQVNG P C   DL +   R +W   GYI SDCD+V  
Sbjct: 240 TYNVPFKACVVEGHVASVMCSYNQVNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGV 299

Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
           ++E Q+YT+T E++AA  +KAG+D++CG  +  HT++A+ +G V + +I+ AL N  +VQ
Sbjct: 300 LYEEQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQ 359

Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
           +RLG+F+G+P   +YG LGP DVCT  H++LAL+AARQGIVLL+N  + LPL+     ++
Sbjct: 360 MRLGMFDGEPSAHQYGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTV 419

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
           A+IGP  +    M G Y G+ C   + L+G+  Y    H A GC DV CN +  F  A  
Sbjct: 420 AVIGPNSDVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEA 478

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
            A++AD  ++V GLD + E E  DRV LLLPG Q  LV+ VAR S+ P ILVL  GGP+D
Sbjct: 479 AARQADATVLVMGLDQSIEAEFVDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPID 538

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMN 631
           V+FA+ D +IS+I+W+GYPG+AG  A+A+++FG  NPGG+LPMTWYP+++ T +PM DM 
Sbjct: 539 VTFAKNDPRISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMA 598

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNI 690
           MRAD +R YPGR+YRFY G  V+ FG GLSYT +++     P+ +++   SLKA ++  +
Sbjct: 599 MRADPARGYPGRTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTSVSVPLTSLKATANSTM 658

Query: 691 LQQTGSRLDYVHIDEVTSCTSLR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
           L +    +   H D    C +L    V + V N G +DG+H +++F   P   +    KQ
Sbjct: 659 LSKA---VRVSHAD----CNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPD-GKWAASKQ 710

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
           L+GF ++H  A     +   V  C+ LS+ ++ G R +PLG H L +G+L H ++++T
Sbjct: 711 LVGFHKIHIAAGSETRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSLQT 768


>gi|18378991|ref|NP_563659.1| beta-glucosidase [Arabidopsis thaliana]
 gi|75250279|sp|Q94KD8.1|BXL2_ARATH RecName: Full=Probable beta-D-xylosidase 2; Short=AtBXL2; Flags:
           Precursor
 gi|14194121|gb|AAK56255.1|AF367266_1 At1g02640/T14P4_11 [Arabidopsis thaliana]
 gi|23506063|gb|AAN28891.1| At1g02640/T14P4_11 [Arabidopsis thaliana]
 gi|332189332|gb|AEE27453.1| beta-glucosidase [Arabidopsis thaliana]
          Length = 768

 Score =  724 bits (1869), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/797 (44%), Positives = 499/797 (62%), Gaps = 43/797 (5%)

Query: 14  ILFRVLQLIVVVNVIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISL 71
           IL ++  L V++  +  S+S  V ++  F C  K     +  FC  S+ I  R + LI  
Sbjct: 2   ILHKMAFLAVILFFLISSSSVCVHSRETFACDTKDAATATLRFCQLSVPIPERVRDLIGR 61

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           LTL EK+  L + A+AIPRLGI  YEWWSE+LHG+++ GPG  F G   + TSFPQV+ +
Sbjct: 62  LTLAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITT 121

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
            ASFN SLW +IG  V+ EARAMYN G  GLT+W+PN+NI RDPRWGRGQETPGEDP+V 
Sbjct: 122 VASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVA 181

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
             YA  +V+  QG                          +DR   L ++ACCKH  AYDL
Sbjct: 182 GKYAASYVRGLQG--------------------------NDR-SRLKVAACCKHFTAYDL 214

Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
           + W    R+ FNA +++QD EDTF  PFR C+++G  + IMCSYNQVNGVP C   +L +
Sbjct: 215 DNWNGVDRFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCADPNLLK 274

Query: 312 KA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSA 370
           K  RN+WG  GYI SDCD+V  +++ Q+YT T E++AA  +KAG+D++CG  +  HT  A
Sbjct: 275 KTIRNQWGLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAGLDLDCGPFLGAHTIDA 334

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
           + K  ++E D+D AL+N  +VQ+RLG+F+GD     YG LGP  VCT  HK LAL+AA+Q
Sbjct: 335 VKKNLLRESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQ 394

Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKT 490
           GIVLLKN    LPL+     ++A+IGP  +    M G Y G+ C   S ++G+  Y ++T
Sbjct: 395 GIVLLKNHGSSLPLSSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGITGY-ART 453

Query: 491 HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLV 550
            +  GC DV C  D  F  AV  A+ AD  ++V GLD + E E +DR SLLLPG+Q  LV
Sbjct: 454 IHQKGCVDVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELV 513

Query: 551 TSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPG 610
           + VA+ +K PVILVL  GGP+D+SFAE D +I +I+W GYPG+ G  A+A+I+FG  NPG
Sbjct: 514 SRVAKAAKGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPG 573

Query: 611 GRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF 669
           G+LPMTWYP+ + T +PM +M+MR   S++ PGR+YRFY G  VY FGHGLSYT +++  
Sbjct: 574 GKLPMTWYPQDYLTNLPMTEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHNI 633

Query: 670 LSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGS 729
             AP  + I+   + G+       +G  +   H      C  L   V + VTN G  DG+
Sbjct: 634 ADAPKVIPIAVRGRNGT------VSGKSIRVTH----ARCDRLSLGVHVEVTNVGSRDGT 683

Query: 730 HVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL 789
           H +++F+  P   +  P+KQL+ F+RVH      K +   +  C+ LS+ ++ G R +P+
Sbjct: 684 HTMLVFS-APPGGEWAPKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPI 742

Query: 790 GNHVLMVGELRHSLTIE 806
           G+H + +G+  H+++++
Sbjct: 743 GDHGIHIGDESHTVSLQ 759


>gi|449505346|ref|XP_004162442.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
           [Cucumis sativus]
          Length = 772

 Score =  724 bits (1868), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/796 (44%), Positives = 503/796 (63%), Gaps = 45/796 (5%)

Query: 17  RVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTL 74
           R++ ++++++ I F +      +  F C P       YPFC  +L I  R K LI  LTL
Sbjct: 7   RIIPILIILSAI-FRHGGGA--REPFACDPKDAALSRYPFCRVALPIPERVKDLIGRLTL 63

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           QEK++ L +NA+A+PRLGI  YEWWSE+LHG+++ GPG  F G     TSFPQV+ + AS
Sbjct: 64  QEKVRLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVAS 123

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
           FN SLW  IG  V+ EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP+V   Y
Sbjct: 124 FNVSLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEY 183

Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
           A  ++K  QG +                           GD L ++ACCKH  AYDL+ W
Sbjct: 184 AARYIKGLQGND---------------------------GDRLKVAACCKHFTAYDLDNW 216

Query: 255 GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA- 313
               R+ FNA +T QD  DTF+ PFR C+++GK + +MCSYNQVNGVP C   +L +   
Sbjct: 217 NGTDRFHFNAKVTRQDMVDTFEVPFRKCVKEGKVASVMCSYNQVNGVPTCADPNLLKGTI 276

Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
           RN+WG  GYI SDCD+V   ++ Q+YT T E++AA  +KAG+D++CG  +  HT+ A+ K
Sbjct: 277 RNQWGLNGYIVSDCDSVGVFYDNQHYTSTAEEAAADAIKAGLDLDCGPFLAVHTEDAVKK 336

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
             + +  I+ AL N  +VQ+RLG+F+G P    YGKLGP +VC+  H++LALDAARQGIV
Sbjct: 337 XLLTQTHINNALANTITVQMRLGMFDGAPSSHAYGKLGPKNVCSPSHQQLALDAARQGIV 396

Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYA 493
           LLKN    LPL+     ++A+IGP  +    M G Y G+ C   + L G++ Y +  H  
Sbjct: 397 LLKNRLPGLPLSAXHHRTVAVIGPNSDVNVTMIGNYAGVACGYVTPLEGIKRYTTVVH-R 455

Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
            GC +V C +D  F +A+  A  AD  ++V GLD + E E +DR  LLLPG+Q  LV  V
Sbjct: 456 KGCDNVACATDYSFTDALAAASTADATVLVMGLDQSVEAETKDRDGLLLPGRQQELVLKV 515

Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
           A  S+ P +++L  GGP+DVSFA+ D +IS+ILW+GYPG+AG  A+A+++FG  NPGG+L
Sbjct: 516 AAASRGPTVVILMSGGPIDVSFADNDPRISAILWVGYPGQAGGAAIADVLFGTTNPGGKL 575

Query: 614 PMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
           PMTWYP+S+ + +PM +M MR+ SS  YPGR+YRFY G  VY FGHGLSYTN+ +  + A
Sbjct: 576 PMTWYPQSYLSNLPMTNMAMRSTSS--YPGRTYRFYAGPVVYEFGHGLSYTNFIHTIVKA 633

Query: 673 PSELTISAS-LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
           P+ ++IS S  +     + L     R+ +        C  L   + + V N GD DG H 
Sbjct: 634 PTIVSISLSGHRQTHSASTLSSKAIRVTH------AKCQKLSLVIHVDVENKGDRDGFHT 687

Query: 732 VMLFARVP-KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLG 790
           +++F+  P   +   P KQL+ F+++H  ++  + +   V  C+ LS+ +K G R +PLG
Sbjct: 688 MLVFSTPPANGATWVPRKQLVAFEKLHLASREKRRLQVHVHVCKYLSVVDKLGVRRIPLG 747

Query: 791 NHVLMVGELRHSLTIE 806
           +H + +G ++H+++++
Sbjct: 748 DHYIHIGNVKHTVSLQ 763


>gi|357442285|ref|XP_003591420.1| Beta xylosidase [Medicago truncatula]
 gi|355480468|gb|AES61671.1| Beta xylosidase [Medicago truncatula]
          Length = 765

 Score =  724 bits (1868), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/792 (44%), Positives = 495/792 (62%), Gaps = 46/792 (5%)

Query: 21  LIVVVNVIAFSNSKPVLNKPDFPCKPPHF--DSYPFCNTSLSISTRAKSLISLLTLQEKI 78
           LI +V ++   +S+    +  F C P +   +++PFC  SL I TR   LI  LTLQEK+
Sbjct: 5   LITIVFLLLLMSSEA---RDPFACDPKNTSTNNFPFCKASLPIPTRVNDLIGRLTLQEKV 61

Query: 79  QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
             L +NA+A+PR+GI  YEWWSE+LHG+++ GPG  F G   + TSFPQV+ + ASFN S
Sbjct: 62  SMLVNNAAAVPRVGIKGYEWWSEALHGVSNVGPGTKFAGQFPAATSFPQVITTVASFNAS 121

Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
           LW  IG   + EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++   YA  +
Sbjct: 122 LWEAIGRVASDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASY 181

Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
           V+  Q                        G +S R   L ++A CKH  AYDL+ W    
Sbjct: 182 VRGLQ------------------------GTDSSR---LKVAASCKHFTAYDLDNWNGVD 214

Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
           R+ FNA +++QD EDTF  PFR C+++G  + +MCSYNQVNGVP C   +L ++  R +W
Sbjct: 215 RFHFNAKVSKQDMEDTFNVPFRMCVKEGNVASVMCSYNQVNGVPTCADPNLLKRTIRGQW 274

Query: 318 GFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
              GYI SDCD+V   +  Q+YT T E++AA  +KAG+D++CG  + +HTQ+A+ KG + 
Sbjct: 275 HLDGYIVSDCDSVGVFYTNQHYTSTPEEAAADAIKAGLDLDCGPFLAQHTQNAVKKGLLT 334

Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
           E D++ AL N  +VQ+RLG+F+G+P    YG LGP DVCT  H++LALDAARQGIVLLKN
Sbjct: 335 ETDVNGALANTLTVQMRLGMFDGEPSAQPYGNLGPTDVCTPTHQELALDAARQGIVLLKN 394

Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
               LPL+     ++A+IGP  N    M G Y GI C   S L+G+  Y ++T +  GC 
Sbjct: 395 TGPSLPLSTKNHQTVAVIGPNSNATVTMIGNYAGIACGYTSPLQGIGKY-ARTIHEPGCA 453

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
           +V CN D  F  A+  A++AD  ++V GLD + E E  DR  LLLPG Q  LV+ VA  S
Sbjct: 454 NVACNDDKQFGSALNAARQADATVLVMGLDQSIEAEMVDRTGLLLPGHQQDLVSKVAAAS 513

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           + P ILVL  GGP+D++FA+ D +I  ILW GYPG+AG  A+A+I+FG  NPG +LPMTW
Sbjct: 514 RGPTILVLMSGGPIDITFAKNDPRIMGILWAGYPGQAGGAAIADILFGTTNPGAKLPMTW 573

Query: 618 YPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           YP+ + K + M +M MR  SS  YPGR+YRFY G  VY FG+GLSYTN+ +   SAP   
Sbjct: 574 YPQGYLKNLAMTNMAMRPSSSTGYPGRTYRFYNGPVVYPFGYGLSYTNFVHTLASAPK-- 631

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            +S  +      N   +   R+ +        C  L   + I V N G  DG++ +++F+
Sbjct: 632 VVSVPVDGHRRGNSSNKAAIRVTHAR------CGKLSIRLDIDVKNVGSKDGTNTLLVFS 685

Query: 737 RVPKVSQG--TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVL 794
            VP    G   P+KQL+ F++V+  AK  + +   +  C+ LS+ +K G R +P+G H +
Sbjct: 686 -VPPTGNGHWAPQKQLVAFEKVYVPAKAQQRVRINIHVCKLLSVVDKSGTRRIPMGAHSI 744

Query: 795 MVGELRHSLTIE 806
            +G+++H ++++
Sbjct: 745 HIGDVKHFVSLQ 756


>gi|224099193|ref|XP_002311398.1| predicted protein [Populus trichocarpa]
 gi|222851218|gb|EEE88765.1| predicted protein [Populus trichocarpa]
          Length = 755

 Score =  723 bits (1867), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/759 (47%), Positives = 500/759 (65%), Gaps = 40/759 (5%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           S  FC  ++ +  R + LI  LTLQEKI+ L +NA+A+PRLGI  YEWWSE+LHG+++ G
Sbjct: 31  SLKFCRVNMPLHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQGYEWWSEALHGVSNVG 90

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
           PG  F G     TSFPQV+ +AASFN+SLW  IG  V+ EARAM+N G AGLT+W+PN+N
Sbjct: 91  PGTKFGGAFPGATSFPQVITTAASFNKSLWEEIGRVVSDEARAMFNGGMAGLTYWSPNVN 150

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           +FRDPRWGRGQETPGEDP+V   YA  +V+  QG +          GFR           
Sbjct: 151 VFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNS----------GFR----------- 189

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                 L ++ACCKH  AYDL+ W    RY FNA +++QD EDT+  PF+SC+ +GK + 
Sbjct: 190 ------LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKSCVVEGKVAS 243

Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MCSYNQVNG P C   +L +   R EW   GYI SDCD+V  ++E Q+YT T E++AA 
Sbjct: 244 VMCSYNQVNGKPTCADPNLLKNTIRGEWRLNGYIVSDCDSVGVLYENQHYTATPEEAAAA 303

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            +KAG+D++CG  +  HT++A+  G + E+D++ AL N  +VQ+RLGLF+G+P    +GK
Sbjct: 304 TIKAGLDLDCGPFLAIHTENAVKGGLLNEEDVNMALANTITVQMRLGLFDGEPSAQPFGK 363

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LGP DVCT  H++LAL AA+QGIVLL+N  + LPL++  + ++A+IGP+ +    M G Y
Sbjct: 364 LGPRDVCTPAHQQLALHAAQQGIVLLQNSGRTLPLSRPNL-TVAVIGPIADVTVTMIGNY 422

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
            G+ C   + L+G+  Y +KT + SGC DV CN +  F  A   A +AD  ++V GLD +
Sbjct: 423 AGVACGYTTPLQGISRY-AKTIHQSGCIDVACNGNQQFGMAEAAASQADATVLVMGLDQS 481

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
            E E RDR  LLLPG Q  L++ VAR S+ P ILVL  GGP+DVSFA+ D +I +ILW G
Sbjct: 482 IEAEFRDRKDLLLPGYQQELISRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWAG 541

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFY 648
           YPG+AG  A+A+++FG  NPGG+LPMTWYP+ +  KVPM +M MRAD SR YPGR+YRFY
Sbjct: 542 YPGQAGGAAIADVLFGTTNPGGKLPMTWYPQDYLAKVPMTNMGMRADPSRGYPGRTYRFY 601

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVHIDEVT 707
            G  V+ FGHG+SYT +++  + AP E+ +   SL A      LQ T +  + + +    
Sbjct: 602 KGPVVFPFGHGMSYTTFAHSLVQAPQEVAVPFTSLYA------LQNTTAARNSIRVSH-A 654

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           +C  L   V I V N GD+DG   +++F+  P+  + +  K+LIGF++VH VA   K + 
Sbjct: 655 NCEPLVLGVHIDVKNTGDMDGIQTLLVFSSPPE-GKWSANKKLIGFEKVHIVAGSKKRVK 713

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             +  C+ LS+ ++ G R LP+G H L +G+L+HS++++
Sbjct: 714 IDIPVCKHLSVVDRFGIRRLPIGKHDLHIGDLKHSISLQ 752


>gi|115460876|ref|NP_001054038.1| Os04g0640700 [Oryza sativa Japonica Group]
 gi|38344900|emb|CAE02971.2| OSJNBb0079B02.3 [Oryza sativa Japonica Group]
 gi|113565609|dbj|BAF15952.1| Os04g0640700 [Oryza sativa Japonica Group]
 gi|116310882|emb|CAH67823.1| OSIGBa0138H21-OSIGBa0138E01.14 [Oryza sativa Indica Group]
 gi|218195682|gb|EEC78109.1| hypothetical protein OsI_17615 [Oryza sativa Indica Group]
          Length = 765

 Score =  723 bits (1867), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/776 (47%), Positives = 493/776 (63%), Gaps = 49/776 (6%)

Query: 40  PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P F C   +     Y FC+ + S + RA  L+  LTL EK+  L +  +A+PRLGIPAYE
Sbjct: 29  PVFACDASNATVSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALPRLGIPAYE 88

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG++  GPG  F+  V   TSFPQ +++AASFN SL+  IG  V+ EARAM+N+
Sbjct: 89  WWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 148

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
           G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V   Q       D G G  
Sbjct: 149 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ-------DAGGG-- 199

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                            D L ++ACCKH  AYD++ W    RY+F+A++++QD +DTFQP
Sbjct: 200 ----------------SDALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQP 243

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
           PF+SC+  G  + +MCSYN+VNG P C   DL     R +W   GYI SDCD+V  ++  
Sbjct: 244 PFKSCVIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNN 303

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           Q+YTK  ED+AA  +K+G+D+NCG  + +HT +A+  GK+ E D+DRA+ N F V +RLG
Sbjct: 304 QHYTKNPEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLG 363

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
            F+GDPRK  +G LGP DVCTS +++LA +AARQGIVLLKN    LPL+  ++ S+A+IG
Sbjct: 364 FFDGDPRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKNTGA-LPLSAKSIKSMAVIG 422

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
           P  N    M G Y G PC   + L+GL A V+ T Y  GC +V C+ ++     A + A 
Sbjct: 423 PNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLSAATQAAA 481

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD  ++V G D + E E  DR SLLLPGQQ  LV++VA  S+ PVILV+  GGP D+SF
Sbjct: 482 SADVTVLVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISF 541

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
           A++  +IS+ILW+GYPGEAG  ALA+I+FG  NPGGRLP+TWYP SF  KV M DM MR 
Sbjct: 542 AKSSDKISAILWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRMRP 601

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           DSS  YPGR+YRFYTG  VY FG GLSYT +++  +SAP ++ +  +             
Sbjct: 602 DSSTGYPGRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLA------------E 649

Query: 695 GSRLDYVHIDEVTS----CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
           G      H   V +    C SL F V + V NAG + G H V LF+  P V    P K L
Sbjct: 650 GHACHTEHCFSVEAAGEHCGSLSFDVHLRVRNAGGMAGGHTVFLFSSPPSV-HSAPAKHL 708

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           +GF++V      +  ++F VD C+ LS+ ++ G R + LG+H L VG+L+H+L + 
Sbjct: 709 LGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNLR 764


>gi|302811514|ref|XP_002987446.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
 gi|300144852|gb|EFJ11533.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
          Length = 772

 Score =  723 bits (1866), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/758 (48%), Positives = 498/758 (65%), Gaps = 33/758 (4%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           ++PFCNTSL I+ R +  ++ LTL+EKI QL + A+ IPRLG+P Y+WW E+LHG+AS+ 
Sbjct: 41  AFPFCNTSLPITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQWWQEALHGVASS- 99

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
           PGV F G+V + TSFP  + +AASFN SL+  IG AV+ EARAM+NLGQ+GLTFW+PNIN
Sbjct: 100 PGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLGQSGLTFWSPNIN 159

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           I+RDPRWGRGQETPGEDP++ S +A  +V+  Q                         E 
Sbjct: 160 IYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQ-------------------------ES 194

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
               D+L +SACCKH+ AYD++ W    RY FNA++TEQD EDT+  PF+SC+E G  S 
Sbjct: 195 QAGSDKLKVSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSS 254

Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MCSYN++NGVP C   +L     R  W   GYI SDCD++   F+  NY  T ED+AA 
Sbjct: 255 VMCSYNRLNGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAAD 314

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            L AG+++NCGT + +HT SAI + KV E  I++AL  L +VQ+RLGL++GDP+   YG 
Sbjct: 315 ALLAGLNLNCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGS 374

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DVCTSEH+ LAL+AARQG+VLLKN    LPL+ + + SLA++GP  N    M G Y
Sbjct: 375 LGASDVCTSEHQTLALEAARQGMVLLKNLGA-LPLSTSKIKSLAVVGPHANATRAMIGNY 433

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
            GIPC   S L+  + Y ++  YA GC +V C+SD+    AV  A  AD V+V  GLDLT
Sbjct: 434 AGIPCKYTSPLQAFQKY-AQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLT 492

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
            E E  DR SLLLPG+Q  LV+ V + +K PV++V+   G +D+ FA +DS+I+ ILW G
Sbjct: 493 IEAESLDRTSLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAG 552

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
           YPG+AG  A+AE+IFGD NP G+LP TWYP++FT + M DMNMR ++S  YPGR+YRFYT
Sbjct: 553 YPGQAGGAAIAEVIFGDHNPSGKLPATWYPQNFTSISMLDMNMRPNASTGYPGRTYRFYT 612

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
           G  ++ FG GLSYT+ S KF+ APS L+I ++         L+++ S       DE  SC
Sbjct: 613 GPTIFKFGDGLSYTSLSAKFIKAPSFLSIPSTAPMQPCTG-LKKSSSCFHLDATDE-KSC 670

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKV-SQGTPEKQLIGFDRVHTVAKG-SKEIS 767
            SL+  V ISV N G +  SH +MLF+  P   S G P++QL+GF+++       S  + 
Sbjct: 671 ESLKSQVAISVRNKGAMAISHTLMLFSTPPSAGSDGVPQRQLVGFNKIQIAGDSISNPVI 730

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           F +DPC     A++ G+++L  G HVL  G  +HSL +
Sbjct: 731 FDLDPCRHFVHADRDGKKLLRSGTHVLTAGNEQHSLRL 768


>gi|157041199|dbj|BAF79669.1| beta-D-xylosidase [Pyrus pyrifolia]
          Length = 774

 Score =  723 bits (1866), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/773 (46%), Positives = 488/773 (63%), Gaps = 40/773 (5%)

Query: 39  KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           +P F C P  P   +  FC   + I  R + LI  LTLQEKI  L +NA A+PRLGI  Y
Sbjct: 29  RPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQGY 88

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           EWWSE+LHG+++ GPG  F GT    TSFPQV+ +AASFN SLW  IG  V+ EARAMYN
Sbjct: 89  EWWSEALHGVSNVGPGTKF-GTFLGATSFPQVITTAASFNESLWEEIGRVVSDEARAMYN 147

Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
            G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ + Y   +VK  QG      DG    
Sbjct: 148 GGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQG------DGA--- 198

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                            G+ L ++ACCKH  AYDL+ W    R+ FNA +++QD EDT+ 
Sbjct: 199 -----------------GNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYN 241

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
            PF++C+  G  + +MCSYNQVNG P C   DL +   R +W   GYI SDCD+V   ++
Sbjct: 242 VPFKACVVDGNVASVMCSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYD 301

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q+YTKT E +AA  +KAG+D++CG  +  HT++AI  G+V E DI+ AL N  +VQ+RL
Sbjct: 302 NQHYTKTPEAAAAYAIKAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRL 361

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G+F+G+P   +YG LG  DVC     +LAL+AARQGIVLL+N    LPL+     ++A+I
Sbjct: 362 GMFDGEPSTQRYGNLGLADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVI 421

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           GP  +    M G Y GI C   + L+G+  Y    H A GC DV CN +     A   A+
Sbjct: 422 GPNSDVTETMIGNYAGIACGYTTPLQGIARYTRTIHQA-GCTDVHCNGNQLIGAAEVAAR 480

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           +AD  ++V GLD + E E RDR  LLLPG Q  LV+ VAR S+ P ILV+  GGP+DV+F
Sbjct: 481 QADATVLVIGLDQSIEAEFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTF 540

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
           A+ D +I +I+W+GYPG+AG  A+A+++FG  NP G+LPMTWYP+++   +PM DM MRA
Sbjct: 541 AKNDPRIGAIIWVGYPGQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRA 600

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQ 693
           D +R YPGR+YRFY G  V+ FG GLSYT +S+     P+ +++   SL A  +  +L  
Sbjct: 601 DPARGYPGRTYRFYKGPVVFPFGMGLSYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSN 660

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G R+ +      T+C SL     I + N G +DG+H +++FA  P   +  P KQL+GF
Sbjct: 661 HGVRVSH------TNCDSLSLDFHIDIKNTGTMDGTHTLLVFATQP-AGKWAPNKQLVGF 713

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            +VH VA   + +  GV  C+ LSI +K G R +PLG H L +G+L+H ++IE
Sbjct: 714 HKVHIVAGSERRVRVGVHVCKHLSIVDKLGIRRIPLGQHKLEIGDLKHYVSIE 766


>gi|302811516|ref|XP_002987447.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
 gi|300144853|gb|EFJ11534.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
          Length = 779

 Score =  723 bits (1865), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/774 (47%), Positives = 492/774 (63%), Gaps = 62/774 (8%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           + FCNT L  STR + LIS +TLQEKI QL +NA+ IPRLG+P YEWW E+LHG+A + P
Sbjct: 44  FGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEWWQEALHGVAVS-P 102

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           GV F G     TSFP  +++AASF+         AV+ EARAM+N  +AGLT+W+PN+NI
Sbjct: 103 GVKFGGKFPGATSFPMPILTAASFD---------AVSTEARAMHNYQRAGLTYWSPNVNI 153

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           +RDPRWGRGQETPGEDP++ S YA  +V+  Q  N                         
Sbjct: 154 YRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLG----------------------- 190

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
             GD+L +SACCKH+ AYD++ W   +R+ FNA++T+QD  DT+ PPF+SC+E  K S +
Sbjct: 191 --GDKLKVSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSV 248

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKG----------------YITSDCDAVATIF 334
           MCSYN+VNGVP C   +L     R+ W   G                YI SDCD++ T F
Sbjct: 249 MCSYNRVNGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFF 308

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           +  NY KT ED  A  L AG++++CG  +  HTQSAI  GK+ E ++++AL  L++VQ+R
Sbjct: 309 DNTNYAKTAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMR 368

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LGL++G+PR   YG LGP  VCT E+++LALDAA++GIVLLKN+   LP +K+ + ++A 
Sbjct: 369 LGLYDGNPRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAA 428

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IGP       M G Y GIPC   +   GL AY ++  Y++GC DV C SD+    AV  A
Sbjct: 429 IGPHAKATRAMIGNYQGIPCKYTTPHDGLSAY-ARVVYSAGCSDVACYSDSLIGSAVSTA 487

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
            +AD V++  GLDL QE E +DR SLLLPG+Q  LVT V + +K P +LV+  GG +DVS
Sbjct: 488 SQADAVVLFVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPAVLVIFSGGSVDVS 547

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           FA+ ++++  ILW GYPGEAG  A+A+++FGD NPGGRLP+TWYPESFT + M DMNMR 
Sbjct: 548 FAKYNNKVQGILWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITMLDMNMRP 607

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI--SASLKAGSDKNILQ 692
           D+SR YPGR+YRFYTG  VY FG+G +Y+  S+KF  AP  L    +A++K   D N+  
Sbjct: 608 DASRGYPGRTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFPEAAAVKRSCDGNL-- 665

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ-GTPEKQLI 751
            T   L+  H DE+T C++L   V+I V N GD   +  V+L++  P   + G P +QL 
Sbjct: 666 -TCFHLN-AH-DEIT-CSTLTSKVRILVHNKGDRPSNRAVLLYSSPPNAGRDGAPIRQLA 721

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           GF +V       + +   +DPC+ LS A  +G RIL  G H L VG  RH L I
Sbjct: 722 GFGKVSVAPGAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARHPLPI 775


>gi|297811069|ref|XP_002873418.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
 gi|297319255|gb|EFH49677.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
          Length = 780

 Score =  723 bits (1865), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/809 (46%), Positives = 507/809 (62%), Gaps = 46/809 (5%)

Query: 7   SIQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCK---PPHFDSYPFCNTSLSIST 63
           S+  R R LF V  L++   ++  S      + P F C     P      FCNT L+I +
Sbjct: 5   SLTERNRALFSVSTLLLCF-LLCISEQSNAQSSPVFACDVTGNPSLAGLRFCNTGLNIKS 63

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           R   L+  LTL+EKI  L  NA  + RLGIPAY+WWSE+LHG+++ G G +F+G V   T
Sbjct: 64  RVTDLVGRLTLEEKIGFLGSNAIGVSRLGIPAYKWWSEALHGVSNVGGGSSFSGQVPGAT 123

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQET 183
           SFPQV+++AASFN SL+  IG  V+ EARAMYN+G AGLTFW+PN+NIFRDPRWGRGQET
Sbjct: 124 SFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQET 183

Query: 184 PGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD--ELMLSA 241
           PGEDP + S YAV +V+  Q                          E+D GD   L ++A
Sbjct: 184 PGEDPELSSKYAVAYVRGLQ--------------------------ETDGGDPNRLKVAA 217

Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
           CCKH  AYD++ W +  R++FNA++ +QD  DTFQPPF+SC+  G  + +MCSYNQVNG 
Sbjct: 218 CCKHYTAYDVDNWKDVHRFTFNAVVNQQDMADTFQPPFKSCVVDGNVASVMCSYNQVNGK 277

Query: 302 PACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG 360
           P C   DL     R +W   GYI SDCD+V  ++  Q+YTKT E++ A  + AG+D+NC 
Sbjct: 278 PTCADPDLLSGVIRGQWKLNGYIVSDCDSVDVLYTKQHYTKTPEEAVAKSILAGLDLNCD 337

Query: 361 TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSE 419
               ++   A+  G V E  ID+A+ N F+  +RLG F+GDP+K + YG LGP+DVCT+ 
Sbjct: 338 HFTGQYAMKAVKVGLVNETAIDKAISNNFATLMRLGFFDGDPKKQQLYGGLGPNDVCTAN 397

Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSL 479
           +++LA DAARQGIVLLKN    LPL+ +A+ +LA+IGP  N    M G Y GIPC   + 
Sbjct: 398 NQELARDAARQGIVLLKNSAGSLPLSPSAIKTLAVIGPNANATETMIGNYNGIPCKYTTP 457

Query: 480 LRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVS 539
           L+GL   VS T Y  GC +V C ++     A  +A  AD V++V G D + E E+ DR+ 
Sbjct: 458 LQGLAETVSST-YQLGC-NVAC-AEPDLGSAAALAASADAVVLVMGADQSIEQENLDRLD 514

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L LPG+Q  LVT VA+ +K PV+LV+  GG  D++FA+ + +I+ I+W+GYPGEAG  A+
Sbjct: 515 LYLPGKQQELVTQVAKVAKGPVVLVIMSGGAFDITFAKNEEKITGIMWVGYPGEAGGLAI 574

Query: 600 AEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
           A++IFG  NP G LPMTWYP+S+  KVPM +MNMR D S  YPGR+YRFYTG  VY FG 
Sbjct: 575 ADVIFGRHNPSGNLPMTWYPQSYVEKVPMTNMNMRPDKSNGYPGRTYRFYTGETVYAFGD 634

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV--HIDEVTSCTSLRFHV 716
           GLSYTN++++ L AP  +    SL    +          +D +  H D       L F V
Sbjct: 635 GLSYTNFNHQILKAPKLV----SLDLDENHACRSSECQSVDAIGPHCDNAVG-GGLNFEV 689

Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
           Q+ V N GD +GSH V LF   P+V  G+P K L+GF+++    K    I F VD C+ L
Sbjct: 690 QLKVRNVGDREGSHTVFLFTTPPEV-HGSPRKHLLGFEKIRLGEKEETVIRFNVDVCKDL 748

Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTI 805
           S+ ++ G+R + LG+++L VG  +HSLTI
Sbjct: 749 SVVDEIGKRKIALGHYLLHVGSFKHSLTI 777


>gi|302796583|ref|XP_002980053.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
 gi|300152280|gb|EFJ18923.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
          Length = 772

 Score =  722 bits (1864), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/758 (48%), Positives = 498/758 (65%), Gaps = 33/758 (4%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           ++PFCNTSL+I+ R +  ++ LTL+EKI QL + A+ IPRLG+P Y+WW E+LHG+AS+ 
Sbjct: 41  AFPFCNTSLAITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQWWQEALHGVASS- 99

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
           PGV F G+V + TSFP  + +AASFN SL+  IG AV+ EARAM+NLGQ+GLTFW+PNIN
Sbjct: 100 PGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLGQSGLTFWSPNIN 159

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           I+RDPRWGRGQETPGEDP++ S +A  +V+  Q                         E 
Sbjct: 160 IYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQ-------------------------ES 194

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
               D+L +SACCKH+ AYD++ W    RY FNA++TEQD EDT+  PF+SC+E G  S 
Sbjct: 195 QAGSDKLKVSACCKHMTAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSS 254

Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MCSYN++NGVP C   +L     R  W   GYI SDCD++   F+  NY  T ED+AA 
Sbjct: 255 VMCSYNRLNGVPTCADHELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAEDAAAD 314

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            L AG+++NCGT + +HT SAI + KV E  I++AL  L +VQ+RLGL++GDP+   YG 
Sbjct: 315 ALLAGLNLNCGTFLAKHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGS 374

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DVCTSEH+ LAL+AARQG+VLLKN    LPL+ + + SLA++GP  N    M G Y
Sbjct: 375 LGASDVCTSEHQTLALEAARQGMVLLKNLGA-LPLSTSKIKSLAVVGPHANATRAMIGNY 433

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
            GIPC   S L+  + Y ++  YA GC +V C+SD+    AV  A  AD V+V  GLDLT
Sbjct: 434 AGIPCKYTSPLQAFQKY-AQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLT 492

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
            E E  DR SLLLPG+Q  LV+ V + +K PV++V+   G +D+ FA +DS+I+ ILW G
Sbjct: 493 IEAESLDRTSLLLPGKQQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAG 552

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
           YPG+AG  A+AE+IFGD NP G+LP TWYP++FT + M DMNMR ++S  YPGR+YRFYT
Sbjct: 553 YPGQAGGAAIAEVIFGDHNPSGKLPATWYPQNFTSISMLDMNMRPNASTGYPGRTYRFYT 612

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
           G  ++ FG GLSYT+ S KF+ APS L+I ++         L+++ S       DE  SC
Sbjct: 613 GPTIFKFGDGLSYTSLSAKFIKAPSFLSIPSTAPMQPCTG-LKKSSSCFHLDATDE-KSC 670

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKV-SQGTPEKQLIGFDRVHTVAKG-SKEIS 767
            SL+  V ISV N G +  SH +MLF+  P   S G P++QL+GF+++       S  + 
Sbjct: 671 ESLKSQVAISVRNKGAMAISHTLMLFSTPPNAGSDGVPQRQLVGFNKIQIAGDSISNPVI 730

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           F +DPC     A+  G+++L  G HVL  G  +HSL +
Sbjct: 731 FDLDPCRHFVHADPDGKKLLRSGTHVLTAGNEQHSLRL 768


>gi|356529243|ref|XP_003533205.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
          Length = 774

 Score =  722 bits (1864), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/771 (46%), Positives = 498/771 (64%), Gaps = 39/771 (5%)

Query: 42  FPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           F C P +     + FCNT + I  R + LI+ LTL EKI+ + +NA A+PRLGI  YEWW
Sbjct: 36  FACDPRNGLTRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQGYEWW 95

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE+LHG+++ GPG  F G     T FPQV+ +AASFN+SLW  IG  V+ EARAMYN GQ
Sbjct: 96  SEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAMYNGGQ 155

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           AGLT+W+PN+NIFRDPRWGRGQETPGEDP + + YA  +VK  QG      DG       
Sbjct: 156 AGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQG------DGA------ 203

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
                         G+ L ++ACCKH  AYDL+ W    R+ FNA +++QD EDT+  PF
Sbjct: 204 --------------GNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPF 249

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           ++C+ +G+ + +MCSYNQVNG P C   DL +   R +WG  GYI SDCD+V   F+ Q+
Sbjct: 250 KACVLEGQVASVMCSYNQVNGKPTCADPDLLRNTIRGQWGLNGYIVSDCDSVGVFFDNQH 309

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           YT+T E++AA  +KAG+D++CG  +  HT SAI KG + E D++ AL NL +VQ+RLG+F
Sbjct: 310 YTRTPEEAAAEAIKAGLDLDCGPFLAIHTDSAIRKGLISENDLNLALANLITVQMRLGMF 369

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +G+P    +G LGP DVCT  H++LAL+AAR+ IVLL+N    LPL+ + +  + +IGP 
Sbjct: 370 DGEPSTQPFGNLGPRDVCTPAHQQLALEAARESIVLLQNKGNSLPLSPSRLRIVGVIGPN 429

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
            +    M G Y G+ C   + L+G+  YV KT +  GC  V C  +  F  A  IA++ D
Sbjct: 430 TDATVTMIGNYAGVACGYTTPLQGIARYV-KTAHQVGCRGVACRGNELFGAAEIIARQVD 488

Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
             ++V GLD T E E RDRV LLLPG Q  LVT VAR +K PVILV+  GGP+DVSFA+ 
Sbjct: 489 ATVLVMGLDQTIEAETRDRVGLLLPGLQQELVTRVARAAKGPVILVIMSGGPVDVSFAKN 548

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSS 637
           + +IS+ILW+GYPG+AG  A+A++IFG  NPGGRLPMTWYP+ +  KVPM +M+MR + +
Sbjct: 549 NPKISAILWVGYPGQAGGTAIADVIFGATNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPA 608

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLKAGSDKNILQQTGS 696
             YPGR+YRFY G  V+ FGHGLSY+ +S     AP ++++   SL+A ++ + L     
Sbjct: 609 TGYPGRTYRFYKGPVVFPFGHGLSYSRFSQSLALAPKQVSVQILSLQALTN-STLSSKAV 667

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
           ++ + + D+     SL     + V N G +DG+H +++F++ P   + +  KQL+ F + 
Sbjct: 668 KVSHANCDD-----SLETEFHVDVKNEGSMDGTHTLLIFSK-PPPGKWSQIKQLVTFHKT 721

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
           H  A   + +   V  C+ LS+ ++ G R +P G H L +G+L+HS+ ++T
Sbjct: 722 HVPAGSKQRLKVNVHSCKHLSVVDQFGVRRIPTGEHELHIGDLKHSINVQT 772


>gi|297843058|ref|XP_002889410.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335252|gb|EFH65669.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 763

 Score =  722 bits (1864), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/790 (44%), Positives = 495/790 (62%), Gaps = 43/790 (5%)

Query: 21  LIVVVNVIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
           L  ++  +  S+S  V ++  F C  K     +  FC  S+ I+ R K LI  LTL EK+
Sbjct: 4   LAAILFFLISSSSVCVQSRETFACDIKDAATATLRFCQLSVPITERVKDLIGRLTLVEKV 63

Query: 79  QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
             L + A+AIPRLGI  YEWWSE+LHG+++ GPG  F G   + TSFPQV+ + ASFN S
Sbjct: 64  SLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVASFNAS 123

Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
           LW +IG  V+ EARAMYN G  GLT+W+PN+NI RDPRWGRGQETPGEDP+V   YA  +
Sbjct: 124 LWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAASY 183

Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
           V+  QG                          +DR   L ++ACCKH  AYDL+ W    
Sbjct: 184 VRGLQG--------------------------NDR-SRLKVAACCKHFTAYDLDNWNGVD 216

Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
           R+ FNA +++QD EDTF  PFR C+++G  + IMCSYN+VNGVP C   +L +K  RNEW
Sbjct: 217 RFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNEVNGVPTCADPNLLKKTIRNEW 276

Query: 318 GFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
           G  GYI SDCD+V  +++ Q+YT T E++AA  +KAG+D++CG  +  HT  A+ K  ++
Sbjct: 277 GLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLR 336

Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
           E D+D AL+N  +VQ+RLG+F+GD     YG LGP  VCT  HK LAL+AA+QGIVLLKN
Sbjct: 337 ESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKN 396

Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
               LPL+     ++A+IGP  +    M G Y GI C   S ++G+  Y ++T +  GC 
Sbjct: 397 HGSSLPLSSQRHRTVAVIGPNSDATVAMIGNYAGIACGYTSPVQGITGY-ARTVHQKGCV 455

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
           DV C  D  F  AV  A+ AD  ++V GLD + E E +DR SLLLPG+Q  L++ VA+ +
Sbjct: 456 DVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELISRVAKAA 515

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           K PVILVL  GGP+D+SFAE D +I +I+W GYPG+ G  A+A+I+FG  NPGG+LPMTW
Sbjct: 516 KGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTW 575

Query: 618 YPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           YP+ + T +PM +M+MR   S++ PGR+YRFY G  VY FGHGLSYT +++    AP  +
Sbjct: 576 YPQDYLTNLPMTEMSMRPIHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHSIADAPKVI 635

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            I+   + G+       +G  +   H      C  L   V + VTN G  DG+H +++F+
Sbjct: 636 PIAVRGRNGT------VSGKSIRVTH----ARCNRLSLGVHVDVTNVGSRDGTHTMLVFS 685

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
             P   +  P+KQL+ F+RVH      K +   +  C+ LS+ ++ G R +P+G+H + +
Sbjct: 686 -APPGGEWAPKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPIGDHGIHI 744

Query: 797 GELRHSLTIE 806
           G+  H+++++
Sbjct: 745 GDESHTVSLQ 754


>gi|9972374|gb|AAG10624.1|AC022521_2 Similar to xylosidase [Arabidopsis thaliana]
          Length = 763

 Score =  722 bits (1863), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/790 (44%), Positives = 495/790 (62%), Gaps = 43/790 (5%)

Query: 21  LIVVVNVIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
           L V++  +  S+S  V ++  F C  K     +  FC  S+ I  R + LI  LTL EK+
Sbjct: 4   LAVILFFLISSSSVCVHSRETFACDTKDAATATLRFCQLSVPIPERVRDLIGRLTLAEKV 63

Query: 79  QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
             L + A+AIPRLGI  YEWWSE+LHG+++ GPG  F G   + TSFPQV+ + ASFN S
Sbjct: 64  SLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQVITTVASFNAS 123

Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
           LW +IG  V+ EARAMYN G  GLT+W+PN+NI RDPRWGRGQETPGEDP+V   YA  +
Sbjct: 124 LWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDPVVAGKYAASY 183

Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
           V+  QG                          +DR   L ++ACCKH  AYDL+ W    
Sbjct: 184 VRGLQG--------------------------NDR-SRLKVAACCKHFTAYDLDNWNGVD 216

Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
           R+ FNA +++QD EDTF  PFR C+++G  + IMCSYNQVNGVP C   +L +K  RN+W
Sbjct: 217 RFHFNAKVSKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQW 276

Query: 318 GFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
           G  GYI SDCD+V  +++ Q+YT T E++AA  +KAG+D++CG  +  HT  A+ K  ++
Sbjct: 277 GLNGYIVSDCDSVGVLYDTQHYTGTPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLR 336

Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
           E D+D AL+N  +VQ+RLG+F+GD     YG LGP  VCT  HK LAL+AA+QGIVLLKN
Sbjct: 337 ESDVDNALINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKN 396

Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
               LPL+     ++A+IGP  +    M G Y G+ C   S ++G+  Y ++T +  GC 
Sbjct: 397 HGSSLPLSSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGITGY-ARTIHQKGCV 455

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
           DV C  D  F  AV  A+ AD  ++V GLD + E E +DR SLLLPG+Q  LV+ VA+ +
Sbjct: 456 DVHCMDDRLFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAA 515

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           K PVILVL  GGP+D+SFAE D +I +I+W GYPG+ G  A+A+I+FG  NPGG+LPMTW
Sbjct: 516 KGPVILVLMSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTW 575

Query: 618 YPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           YP+ + T +PM +M+MR   S++ PGR+YRFY G  VY FGHGLSYT +++    AP  +
Sbjct: 576 YPQDYLTNLPMTEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHNIADAPKVI 635

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            I+   + G+       +G  +   H      C  L   V + VTN G  DG+H +++F+
Sbjct: 636 PIAVRGRNGT------VSGKSIRVTH----ARCDRLSLGVHVEVTNVGSRDGTHTMLVFS 685

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
             P   +  P+KQL+ F+RVH      K +   +  C+ LS+ ++ G R +P+G+H + +
Sbjct: 686 -APPGGEWAPKKQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPIGDHGIHI 744

Query: 797 GELRHSLTIE 806
           G+  H+++++
Sbjct: 745 GDESHTVSLQ 754


>gi|302796585|ref|XP_002980054.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
 gi|300152281|gb|EFJ18924.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
          Length = 779

 Score =  721 bits (1862), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/774 (47%), Positives = 491/774 (63%), Gaps = 62/774 (8%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           + FCNT L  STR + LIS +TLQEKI QL +NA+ IPRLG+P YEWW E+LHG+A + P
Sbjct: 44  FGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEWWQEALHGVAVS-P 102

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           GV F G     TSFP  +++AASF+         AV+ EARAM+N  +AGLT+W+PN+NI
Sbjct: 103 GVKFGGKFPGATSFPMPILTAASFD---------AVSTEARAMHNYQRAGLTYWSPNVNI 153

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           +RDPRWGRGQETPGEDP++ S YA  +V+  Q  N                         
Sbjct: 154 YRDPRWGRGQETPGEDPLLSSKYATFYVRGLQDTNLG----------------------- 190

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
             GD+L +SACCKH+ AYD++ W   +R+ FNA++T+QD  DT+ PPF+SC+E  K S +
Sbjct: 191 --GDKLKVSACCKHMTAYDVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSV 248

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKG----------------YITSDCDAVATIF 334
           MCSYN+VNGVP C   +L     R+ W   G                YI SDCD++ T F
Sbjct: 249 MCSYNRVNGVPTCADYNLLSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFF 308

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           +  NY KT ED  A  L AG++++CG  +  HTQSAI  GK+ E ++++AL  L++VQ+R
Sbjct: 309 DNTNYAKTAEDVVADALLAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMR 368

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LGL++G+PR   YG LGP  VCT E+++LALDAA++GIVLLKN+   LP +K+ + ++A 
Sbjct: 369 LGLYDGNPRSQPYGNLGPQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAA 428

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IGP       M G Y GIPC   +   GL AY ++  Y++GC DV C S++    A   A
Sbjct: 429 IGPHAKATRAMIGNYQGIPCKYTTPHDGLSAY-ARVVYSAGCSDVACYSNSLIGSAASTA 487

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
            +AD V++  GLDL QE E +DR SLLLPG+Q  LVT V + +K PV+LV+  GG +DVS
Sbjct: 488 SQADAVVLFVGLDLNQEAEGKDRTSLLLPGKQQELVTEVTKAAKGPVVLVIFSGGSVDVS 547

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           FA+ D ++  +LW GYPGEAG  A+A+++FGD NPGGRLP+TWYPESFT + M DMNMR 
Sbjct: 548 FAKYDKKVQGMLWAGYPGEAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITMLDMNMRP 607

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI--SASLKAGSDKNILQ 692
           D+SR YPGR+YRFYTG  VY FG+G +Y+  S+KF  AP  L    +A++K   D N+  
Sbjct: 608 DASRGYPGRTYRFYTGQSVYNFGYGKTYSKLSHKFKEAPLSLGFPEAAAVKRSCDGNL-- 665

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ-GTPEKQLI 751
            T   L+  H DE+T C++L   V+I V N GD   +  V+L++  P   + G P +QL 
Sbjct: 666 -TCFHLN-AH-DEIT-CSTLTSKVRILVHNEGDRPSNRAVLLYSSPPNAGRDGAPIRQLA 721

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           GF +V       + +   +DPC+ LS A  +G RIL  G H L VG  RH L I
Sbjct: 722 GFGKVSVAPGAVENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARHPLPI 775


>gi|65736613|dbj|BAD98523.1| alpha-L-arabinofuranosidase / beta-D-xylosidase [Pyrus pyrifolia]
          Length = 774

 Score =  721 bits (1862), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/773 (46%), Positives = 487/773 (63%), Gaps = 40/773 (5%)

Query: 39  KPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           +P F C P  P   +  FC   + I  R + LI  LTLQEKI  L +NA A+PRLGI  Y
Sbjct: 29  RPPFACDPRNPITRTLKFCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPRLGIQGY 88

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           EWWSE+LHG+++ GPG  F GT    TSFPQV+ +AASFN SLW  IG  V+ EARAMYN
Sbjct: 89  EWWSEALHGVSNVGPGTKF-GTFLGATSFPQVITTAASFNESLWEEIGRVVSDEARAMYN 147

Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
            G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ + Y   +VK  QG      DG    
Sbjct: 148 GGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQG------DGA--- 198

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                            G+ L ++ACCKH  AYDL+ W    R+ FNA +++QD EDT+ 
Sbjct: 199 -----------------GNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYN 241

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
            PF++C+  G  + +MCSYNQVNG P C   DL +   R +W   GYI SDCD+V   ++
Sbjct: 242 VPFKACVVDGNVASVMCSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYD 301

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q+YTKT E +AA  +KAG+D++CG  +  HT++AI  G+V E DI+ AL N  +VQ+RL
Sbjct: 302 NQHYTKTPEAAAAYAIKAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRL 361

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G+F+G+P   +YG LG  DVC     +LAL+AARQGIVLL+N    LPL+     ++A+I
Sbjct: 362 GMFDGEPSTQRYGNLGLADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVI 421

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           GP  +    M G Y GI C   + L+G+  Y    H A GC DV CN +     A   A+
Sbjct: 422 GPNSDVTETMIGNYAGIACGYTTPLQGIARYTRTIHQA-GCTDVHCNGNQLIGAAEVAAR 480

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           +AD  ++V GLD + E E RDR  LLLPG Q  LV+ VAR S+ P ILV+  GGP+DV+F
Sbjct: 481 QADATVLVIGLDQSIEAEFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTF 540

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
           A+ D  I +I+W+GYPG+AG  A+A+++FG  NP G+LPMTWYP+++   +PM DM MRA
Sbjct: 541 AKNDPCIGAIIWVGYPGQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRA 600

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQ 693
           D +R YPGR+YRFY G  V+ FG GLSYT +S+     P+ +++   SL A  +  +L  
Sbjct: 601 DPARGYPGRTYRFYKGPVVFPFGMGLSYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSN 660

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G R+ +      T+C SL     I + N G +DG+H +++FA  P   +  P KQL+GF
Sbjct: 661 HGVRVSH------TNCDSLSLDFHIDIKNTGTMDGTHTLLVFATQP-AGKWAPNKQLVGF 713

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            +VH VA   + +  GV  C+ LSI +K G R +PLG H L +G+L+H ++IE
Sbjct: 714 HKVHIVAGSERRVRVGVHVCKHLSIVDKLGIRRIPLGQHKLEIGDLKHYVSIE 766


>gi|297834874|ref|XP_002885319.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
 gi|297331159|gb|EFH61578.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
          Length = 865

 Score =  719 bits (1855), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/784 (45%), Positives = 495/784 (63%), Gaps = 60/784 (7%)

Query: 38  NKPDFPC--KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
           ++ +F C    P    Y FCN SLS   RAK L+S L+L+EK+QQL + A+ + RLG+P 
Sbjct: 23  SQKNFACDRNDPATAKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVSRLGVPP 82

Query: 96  YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           YEWWSE+LHG++  GPGV FNGTV   TSFP  +++AASFN SLW  +G  V+ EARAM+
Sbjct: 83  YEWWSEALHGVSDVGPGVRFNGTVPGATSFPATILTAASFNTSLWLKMGEVVSTEARAMH 142

Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           N+G AGLT+W+PN+NIFRDPRWGRGQETPGEDP+VVS YAV +VK  Q            
Sbjct: 143 NVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDV---------- 192

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                        +++ +   L +S+CCKH  AYDL+ W    R+ F+A +T+QD EDT+
Sbjct: 193 -------------QDAGKSRRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDTY 239

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
           QPPF+SC+E+G  S +MCSYN+VNG+P C   +L +   R +W   GYI SDCD++   F
Sbjct: 240 QPPFKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDGYIVSDCDSIQVYF 299

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           +  +YTKT            +++NCG  + ++T++A+   K+   ++D AL+  + V +R
Sbjct: 300 DDIHYTKTR-----------LNMNCGDFLGKYTENAVKLKKLNGSEVDEALIYNYIVLMR 348

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG F+GDP+   +G+LGP DVC+ +H+ LAL+AA+QGIVLL+N +  LPL+K AV  +A+
Sbjct: 349 LGFFDGDPKSLPFGQLGPSDVCSKDHQMLALEAAKQGIVLLEN-RGDLPLSKTAVKKIAV 407

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV-SKTHYASGCHDVPCNSDAGFHEAVRI 513
           IGP  N    M   Y G+PC   S L+GL+ YV  K  Y  GC DV C        AV+ 
Sbjct: 408 IGPNANATKVMISNYAGVPCKYTSPLQGLQKYVPEKVVYEPGCKDVNCGEQTLISAAVKA 467

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
             +AD  ++V GLD T E E  DRV+L LPG Q  LV  VA  +K+ V+LV+   GP+D+
Sbjct: 468 VSEADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKKTVVLVIMSAGPIDI 527

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNM 632
           SFA+  S IS++LW+GYPGEAG  A+A++IFGD+NP GRLP TWY + F  KV M DMNM
Sbjct: 528 SFAKNLSTISAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYSQEFADKVAMTDMNM 587

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           R +S+  +PGRSYRFYTG  +Y FG+GLSY+ +S   LSAPS + I    K     N+ +
Sbjct: 588 RPNSTSGFPGRSYRFYTGKPIYKFGYGLSYSAFSTFVLSAPSIIHI----KTNPILNLNK 643

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ-----GTPE 747
            T   +  V+      C  L+  + I V N G   GSHVV++F + PK S+     G P+
Sbjct: 644 TTSIDISTVN------CHDLKIRIVIGVKNRGQRSGSHVVLVFWKPPKCSKTLVGAGVPQ 697

Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-----ELRHS 802
            QL+GF+RV      +++++   D C+ LS+ + HG+R L  G+H L++G     ++ H 
Sbjct: 698 TQLVGFERVEVGRSMTEKVTVEFDVCKALSLVDTHGKRKLVTGHHTLVIGSNSDQQIYHH 757

Query: 803 LTIE 806
           L + 
Sbjct: 758 LNVR 761


>gi|371917280|dbj|BAL44716.1| SlArf/Xyl1 [Solanum lycopersicum]
          Length = 771

 Score =  719 bits (1855), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/794 (45%), Positives = 507/794 (63%), Gaps = 44/794 (5%)

Query: 21  LIVVVNVIAFSNSKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
            I+++ V+AF+ S    ++  F C P +    +  FC TSL I  R + LI+ LTLQEKI
Sbjct: 6   FILIIFVLAFAYSGE--SRQPFACDPANAGIRNLRFCKTSLPIHVRVQDLIARLTLQEKI 63

Query: 79  QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
           + L +NA+ + RLGI  YEWWSE+LHG+++ G GV F G     TSFPQV+ +AASFN S
Sbjct: 64  RLLVNNAAPVQRLGISGYEWWSEALHGVSNTGYGVKFGGAFPGATSFPQVITTAASFNAS 123

Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
           LW  IG  V+ E RAMYN G AGLTFW+PN+NIFRDPRWGRGQETPGEDP +V+ Y V +
Sbjct: 124 LWEEIGRVVSEEGRAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPHLVAQYGVSY 183

Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
           VK  QG   + +                          L ++ACCKH  AYDL+ W  + 
Sbjct: 184 VKGLQGGGGRGN------------------------TRLKVAACCKHYTAYDLDDWNGYD 219

Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
           RY FNA ++ QD EDT+  PF++C+ +G  + +MCSYNQ+NG P+C    L +   RN+W
Sbjct: 220 RYHFNAKVSMQDLEDTYNAPFKACVVEGNVASVMCSYNQINGKPSCADPTLLRDTIRNQW 279

Query: 318 GFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
              GYI SDCD+V  +FE Q+YT+  ED+AA  +KAG+D++CG  +  HT  A+  GKV 
Sbjct: 280 HLNGYIVSDCDSVGVLFEKQHYTRYPEDAAAITIKAGLDLDCGPFLAIHTDKAVHTGKVS 339

Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
           + +I+ AL N  +VQ+RLG+F+G    G Y  LGP DVC+  H++LAL AAR+GIVLLKN
Sbjct: 340 QVEINNALANTITVQMRLGMFDGP--NGPYANLGPKDVCSPAHQQLALQAAREGIVLLKN 397

Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
             + LPL+     ++A+IGP  +    M G Y G+PC   S L+G+  Y ++T +  GC 
Sbjct: 398 IGQALPLSTKRHRTVAVIGPNSDATLAMIGNYAGVPCGYISPLQGISRY-ARTIHQQGCM 456

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
            V C  +  F  A   A+ AD  ++V GLD + E E +DRV+LLLPG Q  L++ VA  S
Sbjct: 457 GVACPGNQNFGLAEVAARHADATVLVMGLDQSIEAEAKDRVTLLLPGHQQDLISRVAMAS 516

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           K PV+LVL  GGP+DV+FA+ D ++SSI+W+GYPG+AG  A+A+++FG  NPGG+LPMTW
Sbjct: 517 KGPVVLVLMSGGPIDVTFAKNDPRVSSIVWVGYPGQAGGAAIADVLFGATNPGGKLPMTW 576

Query: 618 YPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP--- 673
           YP+ +  KV M +M+MRA+ S+ YPGR+YRFY G  V+ FG G+SYT +S   +SAP   
Sbjct: 577 YPQDYVAKVSMANMDMRANPSKGYPGRTYRFYKGPTVFPFGAGISYTTFSQHLVSAPITV 636

Query: 674 SELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVM 733
           S  T+ +     ++   L +  + +  +H    T+C SL   + I V N GD+DG+H V+
Sbjct: 637 SVPTLHSHDLVSNNTTTLMKAKATVRTIH----TNCESLDIDMHIDVKNTGDMDGTHAVL 692

Query: 734 LFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHV 793
           +F+  P     T  KQL+ F++VH VA   + +   ++ C+ LS+A+++G R + +G H 
Sbjct: 693 IFSTPP---DPTETKQLVAFEKVHVVAGAKQRVKINMNACKHLSVADEYGVRRIYMGEHK 749

Query: 794 LMVGE-LRHSLTIE 806
           + VG+ L+HS+T +
Sbjct: 750 IHVGDHLKHSITFQ 763


>gi|356574315|ref|XP_003555294.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 5-like
           [Glycine max]
          Length = 901

 Score =  716 bits (1847), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/779 (46%), Positives = 504/779 (64%), Gaps = 39/779 (5%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C      ++PFC+TSLS   RAK L+S LTLQEK QQL + ++ I RLG+PAYEWWSE+L
Sbjct: 26  CGSAKTSNFPFCDTSLSYEDRAKDLVSRLTLQEKTQQLVNPSAGISRLGVPAYEWWSEAL 85

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
           HG+++ GPG  F+  V   TSFP V++SAASFN SLW  +G  V+ EARAMYN+  AGLT
Sbjct: 86  HGVSNLGPGTRFDKKVPGATSFPAVILSAASFNASLWQKMGQVVSTEARAMYNVDLAGLT 145

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+PN+N+FRDPRWGRGQETPGEDP+VVS YAV +++  Q    + +D            
Sbjct: 146 FWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVMYLRGLQ----EVED------------ 189

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                E S + D L +S+CCKH  AYDL+ W    R+ F+A +T+QD ED++QPPF+SC+
Sbjct: 190 -----EASAKADRLKVSSCCKHYTAYDLDNWKGIDRFHFDAKVTKQDLEDSYQPPFKSCV 244

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
            +G  S +MCSYN+VNG+P C   DL +   R +WG  GYI SDCD+V   +   +YT T
Sbjct: 245 VEGHVSSVMCSYNRVNGIPTCADPDLLKGIIRGQWGLDGYIVSDCDSVEVYYNAIHYTAT 304

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
            ED+ A  LKAG+++NCG  + ++T +A++  KV    +D+AL+  + V +RLG F+ DP
Sbjct: 305 PEDAVALALKAGLNMNCGDFLKKYTANAVNLKKVDVATVDQALVYNYIVLMRLGFFD-DP 363

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
           +   +  LGP DVCT ++++LALDAA+QGIVLL+N+   LPL++  +  LA+IGP  N  
Sbjct: 364 KSLPFANLGPSDVCTKDNQQLALDAAKQGIVLLENNNGALPLSQTNIKKLAVIGPNANAT 423

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
           + M   Y GIPC   S L+GL+ Y+S  +YA GC +V C++ +    AV+ A  AD V++
Sbjct: 424 TVMISNYAGIPCRYTSPLQGLQKYISSVNYAPGCSNVKCDNQSLIAAAVKAAASADAVVL 483

Query: 523 VAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           V GLD + E E  DR +L LPG Q   V  VA  +K  VILV+   GP+D+S  ++ S I
Sbjct: 484 VVGLDQSIEAEGLDRENLTLPGFQEKFVKDVAGATKGKVILVIMAAGPIDISSTKSVSNI 543

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYP 641
             ILW+GYPG+AG  A+A++IFGD+NPGGR P TWYP+S+  +VPM DMNMRA+ SR +P
Sbjct: 544 GGILWVGYPGQAGGDAIAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANKSRNFP 603

Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL-----TISASLKAGSDKNILQQTGS 696
           GR+YRFY G  +Y FGHGLSY+ +S    SAPS +     +IS      S  N   Q  S
Sbjct: 604 GRTYRFYNGNSLYEFGHGLSYSTFSMYVASAPSSIMIENTSISEPHNMLSSNNSGTQVES 663

Query: 697 RLDYVHIDEVT-SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ---GTPEKQLIG 752
             D   ID  T +C  L F + I V N G ++GSHVV++F   P  S+   G P KQLIG
Sbjct: 664 LSDGQAIDISTINCQDLTFLLVIGVKNNGPLNGSHVVLVFWE-PATSEFVIGAPIKQLIG 722

Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-----ELRHSLTIE 806
           F+RV  V   ++ ++  +D C+ +S  +  G+R L +G H ++VG     ++RH + ++
Sbjct: 723 FERVQVVVGVTEFVTVKIDICQLISNVDSDGKRKLVIGQHTILVGSSSETQVRHHIDVK 781


>gi|449466797|ref|XP_004151112.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
          Length = 770

 Score =  716 bits (1847), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/756 (47%), Positives = 488/756 (64%), Gaps = 42/756 (5%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           FC  SL I  R K LI  LTL EKI+ L +NA A+PRLGI  YEWWSE+LHG+++ GPG 
Sbjct: 46  FCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSEALHGVSNVGPGT 105

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFR 173
            F GT    TSFPQV+ +AASFN+SLW  IG  V+ EARAMYN G AGLT+W+PN+NIFR
Sbjct: 106 KFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTAGLTYWSPNVNIFR 165

Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           DPRWGRGQETPGEDP++ + YA  +V+  QG + K                         
Sbjct: 166 DPRWGRGQETPGEDPILAAKYAANYVQGLQGNDGKK------------------------ 201

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
              L ++ACCKH  AYDL+ W    RY FNA +++QD EDT+  PF++C+ +GK + +MC
Sbjct: 202 --RLKVAACCKHYTAYDLDNWNGVDRYHFNAKVSKQDLEDTYNVPFKACVVEGKVASVMC 259

Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
           SYNQVNG P C   DL +   R  WG  GYI SDCD+V  +++ Q++T T E++AA  +K
Sbjct: 260 SYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDSVGVLYDSQHFTPTPEEAAASTIK 319

Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
           AG+D++CG  +  HT +A+ +G ++E D++ AL NL SVQ+RLG+F+G+P    YG LGP
Sbjct: 320 AGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANLLSVQMRLGMFDGEPAAQPYGNLGP 379

Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
            DVCT  HK LAL+AARQGIVLL+N    LPL+     ++A+IGP  +    M G Y G+
Sbjct: 380 KDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTRHRTVAVIGPNSDATVTMIGNYAGV 439

Query: 473 PCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
            C   + ++G+  YV KT +A GC +V C  D    EA   A+ AD  +VV GLD + E 
Sbjct: 440 ACEYTTPVQGISKYV-KTIHAKGCANVACVGDQLIGEAEAAARVADAAVVVVGLDQSIEA 498

Query: 533 EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
           E RDR  +LLPG+Q  LV  +    K P ++VL  GGP+DVSFA+ D +IS ILW+GYPG
Sbjct: 499 ESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSGGPIDVSFAKNDGKISGILWVGYPG 558

Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGT 651
           +AG  A+A+++FG  NPGG+LPMTWYP+S+  KVPM +M +R D S  YPGR+YRFY G 
Sbjct: 559 QAGGAAIADVLFGATNPGGKLPMTWYPQSYLAKVPMTNMGLRPDPSTGYPGRTYRFYKGP 618

Query: 652 QVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTS 711
            V+ FG GLSY+ +S  F  AP+++++  S  + +    ++ +       H D    C S
Sbjct: 619 VVFPFGFGLSYSKFSQSFAEAPTKISLPLSSLSPNSSATVKVS-------HTD----CAS 667

Query: 712 LR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
           +    + I V N G VDGSH +++F+ VP  +  +PEK LIGF++VH +A   K +  G+
Sbjct: 668 VSDLPIMIDVKNTGTVDGSHTILVFSTVPNQTW-SPEKHLIGFEKVHLIAGSQKRVRIGI 726

Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             C+ LS  ++ G R +P+G H L +G+L HS++++
Sbjct: 727 HVCDHLSRVDEFGTRRIPMGEHKLHIGDLTHSISLQ 762


>gi|296083056|emb|CBI22460.3| unnamed protein product [Vitis vinifera]
          Length = 896

 Score =  714 bits (1843), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/755 (46%), Positives = 488/755 (64%), Gaps = 69/755 (9%)

Query: 47  PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
           P+   +PFCNTSL    RA  L+S LTLQEK +QL ++A+ I RLG+P YEWWSE+LHG+
Sbjct: 58  PNSSQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGV 117

Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWA 166
           +++G GV+F+  + +VT FP V++SAASFN SLW  +G  V+ E RAMYN+GQAGLT+W+
Sbjct: 118 SNSGIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWS 177

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           PN+NIFRDPRWGRGQETPGEDP+VVS YAV +V                      R L+ 
Sbjct: 178 PNVNIFRDPRWGRGQETPGEDPLVVSRYAVNYV----------------------RGLQE 215

Query: 227 FGEESD-RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
            G+E +   D L +S+CCKH  AYD++KW    R+ F+A +T QD EDT+QPPF+SC+E+
Sbjct: 216 VGKEGNFAADRLKVSSCCKHYTAYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEE 275

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           G  S +MCSYN+VNGVP C   +L +   R++WG  GYI SDCD++    E  NYT+T E
Sbjct: 276 GHVSSVMCSYNRVNGVPTCANPELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNYTETPE 335

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           D+ A  LKAG+++NCG+ +  +T++A++ GKV+E  +++AL+  + V +RLG F+GDP  
Sbjct: 336 DAVALALKAGLNLNCGSYLGDYTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTM 395

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             +GK+GP DVCT +H+ LALDAA+QGIVLL N+   LPL+ N   +LA+IGP  +  + 
Sbjct: 396 LPFGKMGPSDVCTVDHQLLALDAAKQGIVLLHNNGA-LPLSPNTTKTLAVIGPNADATNT 454

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
           M   Y G+PC   S L+GL+ YVS   Y  GC +V C+ +     A  IA  AD  +VV 
Sbjct: 455 MLSNYAGVPCRYTSPLQGLQKYVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVV 514

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           GLDL  E ED DRV+L LPG Q  LV   A+ +   VILV+   GP+D+SF +  S+I  
Sbjct: 515 GLDLFIEAEDLDRVNLTLPGFQEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGG 574

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGR 643
           ILW+GYPG+AG  A++++IFGD+NPGGR P TWYP+ +  +VPM DMNMR +++  +PGR
Sbjct: 575 ILWVGYPGQAGGDAISQVIFGDYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATSNFPGR 634

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
           +YRFYTG  +Y FGHGLSY+ + YK LS                              +I
Sbjct: 635 TYRFYTGKSLYQFGHGLSYSTF-YKNLS------------------------------NI 663

Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG 762
           D V           I V NAG++DG+HVV+ F + P+   +G P  +L+GF+RV      
Sbjct: 664 DIV-----------IGVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGK 712

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           ++ +   +D C ++S  ++ G+R L +G H L+VG
Sbjct: 713 TEMVGMRLDVCGKISNVDEEGKRKLVMGMHTLVVG 747


>gi|86553064|gb|AAS17751.2| beta xylosidase [Fragaria x ananassa]
          Length = 772

 Score =  714 bits (1843), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/793 (44%), Positives = 504/793 (63%), Gaps = 43/793 (5%)

Query: 18  VLQLIVVVNVIAFSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQ 75
           ++ L++ V+ + F+    V  +P F C P  P    + FC T + +  R + LI  LTLQ
Sbjct: 11  LIALVLCVSALLFN---LVHARPPFACDPRNPLTRGFKFCRTRVPVHVRVQDLIGRLTLQ 67

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           EKI+ L +NA A+PRLGI  YEWWSE+LHG+++ GPG  F G     TSFPQV+ +AASF
Sbjct: 68  EKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASF 127

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
           N+SLW  IG  V+ EARAMYN GQAGLT+W+PN+NIFRDPRWGRGQETPGEDP++ + YA
Sbjct: 128 NQSLWQEIGQVVSDEARAMYNGGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLSAKYA 187

Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
             +VK  QG      DG                     G+ L ++ACCKH  AYDL+ W 
Sbjct: 188 ASYVKGLQG------DGA--------------------GNRLKVAACCKHYTAYDLDNWN 221

Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
              R+ FNA +++QD  DT+  PFR C+ +GK + +MCSYNQVNG P C   DL +   R
Sbjct: 222 GVDRFHFNARVSKQDLADTYDVPFRGCVLEGKVASVMCSYNQVNGKPTCADPDLLKNTIR 281

Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKG 374
            EW   GYI SDCD+V   ++ Q+YT+T E++AA  +KAG+D++CG  +  HT+ AI  G
Sbjct: 282 GEWKLNGYIVSDCDSVGVFYDQQHYTRTPEEAAAEAIKAGLDLDCGPFLAIHTEGAIKAG 341

Query: 375 KVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVL 434
            + E D+D AL N  +VQ+RLG+F+G+P   +YG LGP DVCT  H++LAL+A+RQGIVL
Sbjct: 342 LLPEIDVDYALANTLTVQMRLGMFDGEPSAQQYGNLGPRDVCTPAHQELALEASRQGIVL 401

Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
           L+N+   LPL+     ++A++GP  +    M G Y G+ C   + L+G+  Y +KT +  
Sbjct: 402 LQNNGHTLPLSTVRHRTVAVVGPNSDVTETMIGNYAGVACGYTTPLQGIGRY-TKTIHQQ 460

Query: 495 GCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVA 554
           GC +V C ++  F  A   A++AD  ++V GLD + E E RDR  L++PG Q  LV+ VA
Sbjct: 461 GCTNVACTTNQLFGAAEAAARQADATVLVMGLDQSIEAEFRDRTDLVMPGHQQELVSRVA 520

Query: 555 RTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLP 614
           R S+ P +LVL  GGP+DVSFA+ D +I +I+W+GYPG+AG  A+A+++FG  NP G+LP
Sbjct: 521 RASRGPTVLVLMSGGPIDVSFAKNDPKIGAIIWVGYPGQAGGTAMADVLFGTTNPSGKLP 580

Query: 615 MTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
           MTWYP+ + +KVPM +M MRA   R YPGR+YRFY G  V+ FG GLSYT +++     P
Sbjct: 581 MTWYPQDYVSKVPMTNMAMRA--GRGYPGRTYRFYKGPVVFPFGLGLSYTTFAHSLAQVP 638

Query: 674 SELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVM 733
           + +++  +  + +  + +  +  R+ +      T+C  L   + + V N G  DG+H ++
Sbjct: 639 TSVSVPLTSLSATTNSTMLSSAVRVSH------TNCNPLSLALHVVVKNTGARDGTHTLL 692

Query: 734 LFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHV 793
           +F+  P   +    KQL+GF +VH VA   K +   V  C+ LS+ ++ G R +P+G H 
Sbjct: 693 VFSSPPS-GKWAANKQLVGFHKVHIVAGSHKRVKVDVHVCKHLSVVDQFGIRRIPIGEHK 751

Query: 794 LMVGELRHSLTIE 806
           L +G+L H +++E
Sbjct: 752 LQIGDLEHHISVE 764


>gi|449469042|ref|XP_004152230.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 769

 Score =  713 bits (1841), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/774 (45%), Positives = 494/774 (63%), Gaps = 41/774 (5%)

Query: 38  NKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
           ++  F C P +     YPFC  SL +  R K LI  LTL+EK++ L  NA  +PRLGI A
Sbjct: 23  SRAPFACDPNNSVTTDYPFCRRSLVVGERVKDLIGRLTLEEKVKLLVSNAGGVPRLGIKA 82

Query: 96  YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           Y+WWSE+LHG+++ GPG  F G   + TSFPQV+ +AASFN SLW  IG  V+ EARAMY
Sbjct: 83  YQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTAASFNASLWEAIGRVVSDEARAMY 142

Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           N G  GLT+W+PN+NIFRDPRWGRGQETPGEDP++   YAV +V+  QG           
Sbjct: 143 NGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNYVRGLQGT---------- 192

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                             G+ L ++ACCKH  AYDL+ W    R+ FNA +++QD EDTF
Sbjct: 193 -----------------EGNRLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTF 235

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
           + PFR C++ GK S +MCSYNQVNGVP C   +L     R++W   GYI SDCD+V   +
Sbjct: 236 EVPFRMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFY 295

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
             Q+YT T E++AA  +KAG+D++CG+ +  HT++A+ +G + E  I+ AL N  SVQ+R
Sbjct: 296 NSQHYTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMR 355

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG+F+GD +   Y  LG   VC+  +++LA+DAARQGIVLL+N +  LPL+ N    +A+
Sbjct: 356 LGMFDGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAV 415

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           +GP  N    M G Y GI C   + L+G+  Y ++T +  GC  V C S+  F  A+  A
Sbjct: 416 VGPNSNATLTMIGNYAGIACEYITPLQGISKY-TRTIHQEGCRGVACRSNKFFGGAIEAA 474

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           + AD V++V GLD + E E RDR  LLLPG Q  LV  VA  +K PVILVL  GGP+DVS
Sbjct: 475 RVADAVVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVS 534

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMR 633
           FA+   +IS I+W GYPG+AG  A+A+++FG  NPGG+LPMTWYP+ + +K+PM  M++R
Sbjct: 535 FAKDHPKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLR 594

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             +S  YPGR+YRFY G  VY FGHGLSYT +++K LSAP+ LT+  +       N  + 
Sbjct: 595 PGTS--YPGRTYRFYKGPVVYPFGHGLSYTAFTHKILSAPTTLTVPVTGHR-HPHNGSEF 651

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G  +   H      C  L   ++++V N G  DG+H +++++ +P +    P+KQL+ F
Sbjct: 652 WGKAVRVTH----AKCDRLSLVIKVAVRNIGARDGAHTLLVYS-IPPMGVWVPQKQLVAF 706

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE-LRHSLTIE 806
           ++VH  A+  KE+   +  C+ LS+ +K+G R +P+G H + +G+ +RH ++++
Sbjct: 707 EKVHIDAQALKEVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGDNVRHIVSLQ 760


>gi|449484229|ref|XP_004156823.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
           [Cucumis sativus]
          Length = 769

 Score =  713 bits (1840), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/774 (45%), Positives = 494/774 (63%), Gaps = 41/774 (5%)

Query: 38  NKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
           ++  F C P +     YPFC  SL +  R K LI  LTL+EK++ L  NA  +PRLGI A
Sbjct: 23  SRAPFACDPNNSVTTDYPFCRRSLVVEERVKDLIGRLTLEEKVKLLVSNAGGVPRLGIKA 82

Query: 96  YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           Y+WWSE+LHG+++ GPG  F G   + TSFPQV+ +AASFN SLW  IG  V+ EARAMY
Sbjct: 83  YQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTAASFNASLWEAIGRVVSDEARAMY 142

Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           N G  GLT+W+PN+NIFRDPRWGRGQETPGEDP++   YAV +V+  QG           
Sbjct: 143 NGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGTYAVNYVRGLQGT---------- 192

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                             G+ L ++ACCKH  AYDL+ W    R+ FNA +++QD EDTF
Sbjct: 193 -----------------EGNRLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTF 235

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
           + PFR C++ GK S +MCSYNQVNGVP C   +L     R++W   GYI SDCD+V   +
Sbjct: 236 EVPFRMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDSVGVFY 295

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
             Q+YT T E++AA  +KAG+D++CG+ +  HT++A+ +G + E  I+ AL N  SVQ+R
Sbjct: 296 NSQHYTSTPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNTLSVQMR 355

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG+F+GD +   Y  LG   VC+  +++LA+DAARQGIVLL+N +  LPL+ N    +A+
Sbjct: 356 LGMFDGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNRHRIVAV 415

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           +GP  N    M G Y GI C   + L+G+  Y ++T +  GC  V C S+  F  A+  A
Sbjct: 416 VGPNSNATLTMIGNYAGIACEYITPLQGISKY-TRTIHQEGCRGVACRSNKFFGGAIEAA 474

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           + AD V++V GLD + E E RDR  LLLPG Q  LV  VA  +K PVILVL  GGP+DVS
Sbjct: 475 RVADAVVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGGPIDVS 534

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMR 633
           FA+   +IS I+W GYPG+AG  A+A+++FG  NPGG+LPMTWYP+ + +K+PM  M++R
Sbjct: 535 FAKDHPKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMTTMSLR 594

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             +S  YPGR+YRFY G  VY FGHGLSYT +++K LSAP+ LT+  +       N  + 
Sbjct: 595 PGTS--YPGRTYRFYKGPVVYPFGHGLSYTAFTHKILSAPTTLTVPVTGHR-HPHNGSEF 651

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G  +   H      C  L   ++++V N G  DG+H +++++ +P +    P+KQL+ F
Sbjct: 652 WGKAVRVTH----AKCDRLSLVIKVAVRNIGARDGAHTLLVYS-IPPMGVWVPQKQLVAF 706

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE-LRHSLTIE 806
           ++VH  A+  KE+   +  C+ LS+ +K+G R +P+G H + +G+ +RH ++++
Sbjct: 707 EKVHIDAQALKEVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGDNVRHIVSLQ 760


>gi|168046596|ref|XP_001775759.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672911|gb|EDQ59442.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 784

 Score =  711 bits (1834), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/791 (46%), Positives = 502/791 (63%), Gaps = 33/791 (4%)

Query: 30  FSNSKPVLNKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASA 87
           F   +  + K  + C P  P    +PFCNTS+S   R + LIS LT+QEKI+QL + A+ 
Sbjct: 8   FEKPRIKVYKLQYACDPDGPADLLFPFCNTSISDDDRVEDLISRLTIQEKIEQLVNTAAN 67

Query: 88  IPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAV 147
           + RLGIP Y+WW E LHG+A + P V F G   + TSFP   +S  S+NR+LW+ IG  V
Sbjct: 68  VSRLGIPPYQWWGEGLHGVAIS-PSVYFGGATPAATSFPLPCLSVCSYNRTLWNKIGQVV 126

Query: 148 AVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENW 207
           + E RAMYN G++GLT+W+PNINI RDPRWGR QETPGEDP + S YAV FVK  Q  ++
Sbjct: 127 STEGRAMYNQGRSGLTYWSPNINIARDPRWGRTQETPGEDPKLSSGYAVHFVKGLQEGDY 186

Query: 208 KSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMIT 267
             +        + + V +G          L +SACCKH  A+DL++W ++ R  F++ +T
Sbjct: 187 DQN--------QPQAVSRG-------PRRLKISACCKHFTAHDLDRWKDYDRDHFDSKVT 231

Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSD 326
           +QD EDT+ P F+SC+++G++S +MCSYN++NG+P C   +L     RN+WGF GYI SD
Sbjct: 232 QQDLEDTYNPSFKSCVKEGQSSSVMCSYNRLNGIPMCTHYELLTLTVRNQWGFDGYIVSD 291

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALL 386
           CDAVA I +Y NY  T ED+ + V+ AGMD+NCG+  L H  +A+DK  + E  ID  L 
Sbjct: 292 CDAVALIHDYINYAPTSEDAVSYVMLAGMDLNCGSTTLVHGLAALDKKLIWEGLIDMHLR 351

Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
           NLF V++RLG+F+G+P    YG LGP+D+CT +++ LAL+AARQ +VLLKN+K  LP  K
Sbjct: 352 NLFRVRMRLGMFDGNPSTLPYGSLGPEDMCTEDNQHLALEAARQSLVLLKNEKNALPWKK 411

Query: 447 NAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS----KTHYASGCHDVPCN 502
                LA+IG   +   +M G Y G PC   S L+G    +S    +  +  GC D  C 
Sbjct: 412 THGLKLAVIGHHADATREMLGNYEGYPCKFVSPLQGFAKVLSDHSPRISHERGCSDAACE 471

Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK-RPV 561
                + A   A +AD V++V G+   QE E RDR SLLLPG+QM LV+SV   S  RPV
Sbjct: 472 DQFYIYAAKEAAAQADAVVLVLGISQAQEKEGRDRDSLLLPGRQMELVSSVVEASAGRPV 531

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           +LVL  G PLDVSFA  D +I SI+W GYPG++G +A+AE IFG  NPGGRL  +WY E+
Sbjct: 532 VLVLLSGSPLDVSFANDDPRIQSIIWAGYPGQSGGEAIAEAIFGLVNPGGRLAQSWYYEN 591

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
           +T + M++MNMR ++S  YPGR+YRF+T T ++ FGHGLSY+++ Y  +SAP  + ++  
Sbjct: 592 YTNIDMSNMNMRPNASTGYPGRTYRFFTDTPLWEFGHGLSYSDFKYTMVSAPQSI-MAPH 650

Query: 682 LK---AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
           L+     SD+ ++    + L Y    E  +C    FHV++ V N G + G H V+LF++ 
Sbjct: 651 LRYQLCSSDRAVMTSDLNCLHY----EKEACKESSFHVRVWVINHGPLSGDHSVLLFSKP 706

Query: 739 P-KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           P +   G P KQL+ F+RVH  A   +EI F V+PCE L      G R + LG H LMVG
Sbjct: 707 PSRGIDGIPLKQLVSFERVHLEAGAGQEILFKVNPCEDLGTVGDDGIRTVELGEHTLMVG 766

Query: 798 ELRHSLTIETY 808
            ++H LT+E +
Sbjct: 767 MVQHVLTVENW 777


>gi|225431898|ref|XP_002276351.1| PREDICTED: beta-D-xylosidase 1-like [Vitis vinifera]
          Length = 770

 Score =  710 bits (1832), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/770 (45%), Positives = 493/770 (64%), Gaps = 40/770 (5%)

Query: 42  FPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           F C P +    + PFC  SL I  RA+ L+  LTLQEKI+ L +NA  +PRLGI  YEWW
Sbjct: 27  FACDPRNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKGYEWW 86

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE+LHG+++ GPG  F G+    TSFPQV+ +AASFN SLW  IG  V+ EARAMYN G 
Sbjct: 87  SEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGGM 146

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           AGLT+W+PN+NIFRDPRWGRGQETPGEDP V + YA  +V+  QG               
Sbjct: 147 AGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQG--------------- 191

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
                       +  D L ++ACCKH  AYDL+ WG   R+ FNA +++QD EDT+  PF
Sbjct: 192 ------------NARDRLKVAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPF 239

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           ++C+ +G  + +MCSYNQVNG P C    L +   R EW   GYI SDCD+V   ++ Q+
Sbjct: 240 KACVVEGNVASVMCSYNQVNGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQH 299

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           YT T E++AA  +KAG+D++CG  +  HT++AI  GK+ E D++ AL+N  SVQ+RLG+F
Sbjct: 300 YTATPEEAAAVAIKAGLDLDCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMF 359

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +G+P    YG LGP DVCT  H++LAL+AARQGIVL++N    LPL+ +   ++A+IGP 
Sbjct: 360 DGEPSAQPYGNLGPRDVCTPAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPN 419

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
            +    M G Y G+ C   + L+G+  Y    H A GC  V C  D  F  AV  A++AD
Sbjct: 420 SDVTETMIGNYAGVACGYTTPLQGIGRYARTIHQA-GCSGVACRDDQQFGAAVAAARQAD 478

Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
             ++V GLD + E E RDRV +LLPG+Q  LV+ VA  S+ P +LVL  GGP+DVSFA+ 
Sbjct: 479 ATVLVMGLDQSIEAEFRDRVDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKN 538

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSS 637
           D +I++I+W+GYPG+AG  A+A+++FG  NPGG+LP+TWYP+S+  K PM +M MRA  S
Sbjct: 539 DPRIAAIIWVGYPGQAGGTAIADVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPS 598

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGS 696
           R YPGR+YRFY G  V+ FGHGLSY+ +++    AP+ +++S ASL+   +  I+     
Sbjct: 599 RGYPGRTYRFYNGPVVFPFGHGLSYSTFAHSLAQAPTTVSVSLASLQTIKNSTIVSSGAI 658

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
           R+ +       +C +      I V N G +DGSH ++LF+  P     +P K+L+ F++V
Sbjct: 659 RISHA------NCNTQPLGFHIDVKNTGTMDGSHTLLLFS-TPPPGTWSPNKRLLAFEKV 711

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           H  A   + + F V  C+ LS+ +  G   +P+G H   +G+L+HS++++
Sbjct: 712 HVGAGSQERVRFDVHVCKHLSVVDHFGIHRIPMGEHHFHIGDLKHSISLQ 761


>gi|15239867|ref|NP_199747.1| beta-xylosidase 1 [Arabidopsis thaliana]
 gi|75262458|sp|Q9FGY1.1|BXL1_ARATH RecName: Full=Beta-D-xylosidase 1; Short=AtBXL1; AltName:
           Full=Alpha-L-arabinofuranosidase; Flags: Precursor
 gi|9759419|dbj|BAB09906.1| xylosidase [Arabidopsis thaliana]
 gi|21539545|gb|AAM53325.1| xylosidase [Arabidopsis thaliana]
 gi|332008419|gb|AED95802.1| beta-xylosidase 1 [Arabidopsis thaliana]
          Length = 774

 Score =  710 bits (1832), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/794 (45%), Positives = 498/794 (62%), Gaps = 45/794 (5%)

Query: 21  LIVVVNVIAFSNSKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
           ++++V ++   +S   L +P F C P +    +  FC  ++ I  R + L+  LTLQEKI
Sbjct: 15  VVILVFLLCLVHSSESL-RPLFACDPANGLTRTLRFCRANVPIHVRVQDLLGRLTLQEKI 73

Query: 79  QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
           + L +NA+A+PRLGI  YEWWSE+LHGI+  GPG  F G     TSFPQV+ +AASFN+S
Sbjct: 74  RNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFPQVITTAASFNQS 133

Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
           LW  IG  V+ EARAMYN G AGLT+W+PN+NI RDPRWGRGQETPGEDP+V + YA  +
Sbjct: 134 LWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYAASY 193

Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
           V+  QG                          +  G+ L ++ACCKH  AYDL+ W    
Sbjct: 194 VRGLQG--------------------------TAAGNRLKVAACCKHYTAYDLDNWNGVD 227

Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
           R+ FNA +T+QD EDT+  PF+SC+ +GK + +MCSYNQVNG P C   +L +   R +W
Sbjct: 228 RFHFNAKVTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGQW 287

Query: 318 GFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
              GYI SDCD+V   F  Q+YT T E++AA  +KAG+D++CG  +   T+ A+ KG + 
Sbjct: 288 RLNGYIVSDCDSVDVFFNQQHYTSTPEEAAARSIKAGLDLDCGPFLAIFTEGAVKKGLLT 347

Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
           E DI+ AL N  +VQ+RLG+F+G+   G Y  LGP DVCT  HK LAL+AA QGIVLLKN
Sbjct: 348 ENDINLALANTLTVQMRLGMFDGN--LGPYANLGPRDVCTPAHKHLALEAAHQGIVLLKN 405

Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
             + LPL+     ++A+IGP  +    M G Y G  C+  S L+G+  Y    H A GC 
Sbjct: 406 SARSLPLSPRRHRTVAVIGPNSDVTETMIGNYAGKACAYTSPLQGISRYARTLHQA-GCA 464

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
            V C  + GF  A   A++AD  ++V GLD + E E RDR  LLLPG Q  LVT VA+ S
Sbjct: 465 GVACKGNQGFGAAEAAAREADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQAS 524

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           + PVILVL  GGP+DV+FA+ D ++++I+W GYPG+AG  A+A IIFG  NPGG+LPMTW
Sbjct: 525 RGPVILVLMSGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTW 584

Query: 618 YPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP-SE 675
           YP+ +  KVPM  M MRA  S  YPGR+YRFY G  V+ FG GLSYT +++    +P ++
Sbjct: 585 YPQDYVAKVPMTVMAMRA--SGNYPGRTYRFYKGPVVFPFGFGLSYTTFTHSLAKSPLAQ 642

Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL-RFHVQISVTNAGDVDGSHVVML 734
           L++S S    S   IL  +   +   H    T+C S  +  + + V+N G+ DG+H V +
Sbjct: 643 LSVSLS-NLNSANTILNSSSHSIKVSH----TNCNSFPKMPLHVEVSNTGEFDGTHTVFV 697

Query: 735 FARVP--KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
           FA  P   +      KQLI F++VH +A   + +   VD C+ L + +++G+R +P+G H
Sbjct: 698 FAEPPINGIKGLGVNKQLIAFEKVHVMAGAKQTVQVDVDACKHLGVVDEYGKRRIPMGEH 757

Query: 793 VLMVGELRHSLTIE 806
            L +G+L+H++ ++
Sbjct: 758 KLHIGDLKHTILVQ 771


>gi|32481073|gb|AAP83934.1| auxin-induced beta-glucosidase [Chenopodium rubrum]
          Length = 767

 Score =  710 bits (1832), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/756 (45%), Positives = 487/756 (64%), Gaps = 36/756 (4%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           FC  +L I  R + LI  L LQEK++ L +NA+ +PRLGI  YEWWSE+LHG+++ GPG 
Sbjct: 40  FCRVNLPIRARVQDLIGRLNLQEKVKLLVNNAAPVPRLGISGYEWWSEALHGVSNVGPGT 99

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFR 173
            F G   + TSFPQV+ +AASFN SLW  IG  V+ EARAMYN G AGLT+W+PN+NIFR
Sbjct: 100 KFRGAFPAATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFR 159

Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           DPRWGRGQETPGEDP + S YA  +V+  QG   K                         
Sbjct: 160 DPRWGRGQETPGEDPTLASQYAASYVRGLQGIYNK------------------------- 194

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
            + L ++ACCKH  AYDL+ W    R+ FNA +++QD EDT+  PF+ C+++G+ + +MC
Sbjct: 195 -NRLKVAACCKHYTAYDLDNWNAVDRFHFNAKVSKQDLEDTYNVPFKGCVQEGRVASVMC 253

Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
           SYNQVNG P C   DL +   R +W   GYI SDCD+V  +++ Q+YT+T E++AA  +K
Sbjct: 254 SYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVLYDDQHYTRTPEEAAADTIK 313

Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
           AG+D++CG  +  HT++A+ +G + E D+++AL N F+VQ+RLG+F+G+     +G LGP
Sbjct: 314 AGLDLDCGPFLAVHTEAAVKRGLLTEADVNQALTNTFTVQMRLGMFDGEAAAQPFGHLGP 373

Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
            DVC+  H+ LAL AARQGIVLL+N  + LPL+     ++A+IGP  +    M G Y G+
Sbjct: 374 KDVCSPAHQDLALQAARQGIVLLQNRGRSLPLSTARHRNIAVIGPNADATVTMIGNYAGV 433

Query: 473 PCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
            C   S L+G+  Y +KT + +GC  V C S+  F  A   A  AD  ++V GLD + E 
Sbjct: 434 ACGYTSPLQGIARY-AKTVHQAGCIGVACTSNQQFGAATAAAAHADATVLVMGLDQSIEA 492

Query: 533 EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
           E RDR S+LLPG Q  LV+ VA  S+ P ILVL  GGP+DV+FA+ D +IS+ILW+GYPG
Sbjct: 493 EFRDRASVLLPGHQQELVSKVALASRGPTILVLMCGGPVDVTFAKNDPKISAILWVGYPG 552

Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGT 651
           +AG  A+A+++FG  NPGG+LP TWYP+S+  KVPM D+ MRA+ S  YPGR+YRFY G 
Sbjct: 553 QAGGTAIADVLFGTTNPGGKLPNTWYPQSYVAKVPMTDLAMRANPSNGYPGRTYRFYKGP 612

Query: 652 QVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTS 711
            V+ FG GLSYT ++     AP+++ +  + +  ++ NI       L  +H    T+C +
Sbjct: 613 VVFPFGFGLSYTRFTQSLAHAPTKVMVPLANQF-TNSNITSFNKDALKVLH----TNCDN 667

Query: 712 LRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
           +   + I V N G VDGSH +++F+  PK ++ + EKQLIGF RVH  A   + +   + 
Sbjct: 668 IPLSLHIDVKNKGKVDGSHTILVFSTPPKGTKSS-EKQLIGFKRVHVFAGSKQRVRMNIH 726

Query: 772 PCEQLSIANKHGRRILPLGNHVLMVG-ELRHSLTIE 806
            C  LS A++ G R +P+G H L +G + +H L++ 
Sbjct: 727 VCNHLSRADEFGVRRIPIGEHTLHIGDDHKHKLSLH 762


>gi|242071935|ref|XP_002451244.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
 gi|241937087|gb|EES10232.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
          Length = 790

 Score =  709 bits (1830), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/776 (44%), Positives = 483/776 (62%), Gaps = 57/776 (7%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           + PFC  SL +  RA+ L+S LT  EK++ L +NA+ + RLG+  YEWWSE+LHG++  G
Sbjct: 44  TLPFCRQSLPLHARARDLVSRLTRAEKVRLLVNNAAGVARLGVGGYEWWSEALHGVSDTG 103

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
           PGV F G     T+FPQV+ +AA+ N +LW  IG AV+ EARAMYN G+AGLTFW+PN+N
Sbjct: 104 PGVKFGGAFPGATAFPQVIGAAAALNATLWELIGRAVSDEARAMYNGGRAGLTFWSPNVN 163

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           IFRDPRWGRGQETPGEDP + S YA  +V+  Q                         + 
Sbjct: 164 IFRDPRWGRGQETPGEDPAISSRYAAAYVRGLQ-------------------------QP 198

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
            D  + L L+ACCKH  AYDL+ WG   R+ FNA+++ QD EDTF  PFR+C+  G+A+ 
Sbjct: 199 YDH-NRLKLAACCKHFTAYDLDSWGGTDRFHFNAVVSPQDLEDTFNVPFRACVAGGRAAS 257

Query: 291 IMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MCSYNQVNGVP C  +G L    R  WG  GYI SDCD+V   F  Q+YT+T ED+ A 
Sbjct: 258 VMCSYNQVNGVPTCADQGFLRGTIRKAWGLDGYIVSDCDSVDVFFRDQHYTRTAEDAVAA 317

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            L+AG+D++CG  +  +T++A+ + KV + D+D ALLN  +VQ+RLG+F+GDP  G +G 
Sbjct: 318 TLRAGLDLDCGPFLALYTENAVARKKVSDADVDAALLNTVTVQMRLGMFDGDPASGPFGH 377

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKN-------DKKFLPLNKNAVSSLAIIGPLVNNI 462
           LG  DVCT  H+ LALDAARQ +VLLKN       D+  LPL   A   +A++GP  +  
Sbjct: 378 LGAADVCTKAHQDLALDAARQSVVLLKNQRGRKHRDRDVLPLRPAAHRVVAVVGPHADAT 437

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN-SDAGFHEAVRIAKKADFVI 521
             M G Y G PC   + L+G+ AY ++  + +GC DV C   +     AV  A++     
Sbjct: 438 VAMIGNYAGKPCRYTTPLQGVAAYAARVVHQAGCADVACQGKNQPIAAAVDAARRLTPPS 497

Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
              GL            SLLLPG+Q  L+++VA+ +K PVILVL  GGP+D++FA+ D +
Sbjct: 498 SSPGLTR----------SLLLPGRQAELISAVAKAAKGPVILVLMSGGPIDIAFAQNDPR 547

Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQY 640
           I  ILW+GYPG+AG +A+A++IFG  NPGG+LP+TWYP+ +  KVPM +M MRA+ +R Y
Sbjct: 548 IDGILWVGYPGQAGGQAIADVIFGQHNPGGKLPVTWYPQDYLEKVPMTNMAMRANPARGY 607

Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ----TGS 696
           PGR+YRFYTG  ++ FGHGLSYT +++    AP++LT+  S  + S             +
Sbjct: 608 PGRTYRFYTGPTIHAFGHGLSYTQFTHTLAHAPAQLTVRLSTSSASASASASAASLLNAT 667

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF-----ARVPKVSQGT--PEKQ 749
           R           C  L   V + V N GD DG+H V+++     +       GT  P +Q
Sbjct: 668 RPSRAVRVAHARCEGLTVPVHVDVRNVGDRDGAHAVLVYHVAPSSSSSSAPAGTDAPARQ 727

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           L+ F++VH  A G   +  G+D C++LS+A++ G R +P+G H LM+GEL HS+T+
Sbjct: 728 LVAFEKVHVPAGGVARVEMGIDVCDRLSVADRDGVRRIPVGEHRLMIGELTHSVTL 783


>gi|222615852|gb|EEE51984.1| hypothetical protein OsJ_33664 [Oryza sativa Japonica Group]
          Length = 753

 Score =  708 bits (1827), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/752 (47%), Positives = 472/752 (62%), Gaps = 49/752 (6%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           FC+  L+   RA  L++ LTL EK+ QL D A+ + RLG+PAYEWWSE LHG++  G G+
Sbjct: 31  FCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHGLSIWGRGI 90

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIF 172
            FNGTV +VTSFPQV+++AA+F+  LW  +G AV  EARA+YNLGQA GLT W+PN+NIF
Sbjct: 91  RFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTIWSPNVNIF 150

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDP   R    PG+                                   R     GE+  
Sbjct: 151 RDPSGTR----PGD----------------------------------ARRGPRHGEQGI 172

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
            G+    SACCKH  AYDL+ W N  RY++++ +T QD EDT+ PPF+SC+ +GKA+CIM
Sbjct: 173 GGEA---SACCKHATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIM 229

Query: 293 CSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
           C YN +NGVPAC   DL  +K R EWG  GY+ SDCDAVATI +  +YT + ED+ A  +
Sbjct: 230 CGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTLSPEDTVAVSI 289

Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKL 410
           K GMD+NCG     H  +A+ KG + EKDIDRAL+NLF+V++RLG F+GDPR    YG L
Sbjct: 290 KVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHL 349

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
           G  DVC+  HK LAL+AA+ GIVLLKND   LPL  +AV+SLA+IGP  +N+  + G Y 
Sbjct: 350 GAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYF 409

Query: 471 GIPCSPKSLLRGLEAYVS-KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
           G PC   + L+G++ Y+  +  + +GC    C  DA  +EA  +A  +D V++  GL   
Sbjct: 410 GPPCETTTPLQGIKGYLGDRARFLAGCDSPACAVDA-TNEAAALASSSDHVVLFMGLSQK 468

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
           QE +  DR SLLLPG+Q  L+T+VA  ++RPVILVL  GGP+DV+FA+ + +I +ILW G
Sbjct: 469 QEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAG 528

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
           YPG+AG  A+A+++FGD NP GRLP+TWYPE FTKVPM DM MRAD +  YPGRSYRFY 
Sbjct: 529 YPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQ 588

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTI-SASLKAGSDKNILQQTGSRLDYVHIDE--V 706
           G  VY FG+GLSY+ +S +  S+ S     + SL AG         G  +    + E  V
Sbjct: 589 GNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMARRAGDDGGGMSSYLVKEIGV 648

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
             C+ L F   + V N G +DG H V+++ R P  S G P +QLIGF   H        +
Sbjct: 649 ERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMV 708

Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           SF V PCE  S   + G R++  G H LMVG+
Sbjct: 709 SFEVSPCEHFSWVGEDGERVIDGGAHFLMVGD 740


>gi|297795695|ref|XP_002865732.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
 gi|297311567|gb|EFH41991.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
          Length = 774

 Score =  708 bits (1827), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/794 (45%), Positives = 498/794 (62%), Gaps = 45/794 (5%)

Query: 21  LIVVVNVIAFSNSKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
           ++++V ++   +S   L +P F C P +    +  FC  ++ I  R + LI  LTLQEKI
Sbjct: 15  VVILVFLLCLVHSSESL-RPLFACDPANGLTRTLRFCRVNVPIHVRVQDLIGRLTLQEKI 73

Query: 79  QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
           + L +NA+A+PRLGI  YEWWSE+LHG++  GPG  F G     TSFPQV+ +AASFN+S
Sbjct: 74  RNLVNNAAAVPRLGIGGYEWWSEALHGVSDVGPGSKFGGAFPGATSFPQVITTAASFNQS 133

Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
           LW  IG  V+ EARAMYN G AGLT+W+PN+NI RDPRWGRGQETPGEDP+V + YA  +
Sbjct: 134 LWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGEDPIVAAKYAASY 193

Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
           V+  QG                          +  G+ L ++ACCKH  AYDL+ W    
Sbjct: 194 VRGLQG--------------------------TAAGNRLKVAACCKHYTAYDLDNWNGVD 227

Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
           R+ FNA +T+QD EDT+  PF+SC+ +GK + +MCSYNQVNG P C   +L +   R +W
Sbjct: 228 RFHFNAKVTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGKW 287

Query: 318 GFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
              GYI SDCD+V   F  Q+YT T E++AA  +KAG+D++CG  +   T+ A+ KG + 
Sbjct: 288 RLNGYIVSDCDSVDVFFNQQHYTSTPEEAAAASIKAGLDLDCGPFLAIFTEGAVKKGLLT 347

Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
           E DI+ AL N  +VQ+RLG+F+G+   G Y  LGP DVC+  HK LAL+AA QGIVLLKN
Sbjct: 348 ENDINLALANTLTVQMRLGMFDGN--LGPYANLGPRDVCSLAHKHLALEAAHQGIVLLKN 405

Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
             + LPL+     ++A+IGP  +    M G Y G  C+  + L+G+  Y    H A GC 
Sbjct: 406 SGRSLPLSPRRHRTVAVIGPNSDVTETMIGNYAGKACAYTTPLQGISRYARTLHQA-GCA 464

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
            V C  + GF  A   A++AD  ++V GLD + E E RDR  LLLPG Q  LVT VA+ S
Sbjct: 465 GVACKGNQGFGAAEAAAREADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQAS 524

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           + PVILVL  GGP+DV+FA+ D ++++I+W GYPG+AG  A+A IIFG  NPGG+LPMTW
Sbjct: 525 RGPVILVLMSGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTW 584

Query: 618 YPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP-SE 675
           YP+ +  KVPM  M MRA  S  YPGR+YRFY G  V+ FG GLSYT ++     +P ++
Sbjct: 585 YPQDYVAKVPMTVMAMRA--SGNYPGRTYRFYKGPVVFPFGFGLSYTTFTNSLAKSPLAQ 642

Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL-RFHVQISVTNAGDVDGSHVVML 734
           L++S S    S   IL  T   +   H    T+C S  +  + + V+N G+ DG+H V +
Sbjct: 643 LSVSLS-NLNSANAILNSTSHSIKVSH----TNCNSFPKMPLHVEVSNTGEFDGTHTVFV 697

Query: 735 FARVPK--VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
           FA  PK  +      KQLI F++VH +A   + +   VD C+ L + +++G+R +P+G H
Sbjct: 698 FAEPPKNGIKGLGVNKQLIAFEKVHVMAGAKQTVRVDVDACKHLGVVDEYGKRRIPMGKH 757

Query: 793 VLMVGELRHSLTIE 806
            L +G+L+H++ ++
Sbjct: 758 KLHIGDLKHTILVQ 771


>gi|357449039|ref|XP_003594795.1| Beta xylosidase [Medicago truncatula]
 gi|355483843|gb|AES65046.1| Beta xylosidase [Medicago truncatula]
          Length = 762

 Score =  707 bits (1824), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/770 (44%), Positives = 482/770 (62%), Gaps = 44/770 (5%)

Query: 42  FPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           F C P +    SY FCNT + I  R + LI  L L EKI+ + +NA A+PRLGI  YEWW
Sbjct: 27  FACDPKNGLTRSYKFCNTRVPIHARVQDLIGRLALPEKIRLVVNNAIAVPRLGIQGYEWW 86

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE+LHG+++ GPG  F G  S+ TSFPQV+ +AASFN+SLW  IG  V+ EARAMYN G 
Sbjct: 87  SEALHGVSNVGPGTKFGGAFSAATSFPQVITTAASFNQSLWLEIGRIVSDEARAMYNGGA 146

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           AGLTFW+PN+NIFRDPRWGRGQETPGEDP V   YA  +V+  QG               
Sbjct: 147 AGLTFWSPNVNIFRDPRWGRGQETPGEDPTVAGKYAASYVQGLQGNG------------- 193

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
                         G+ L ++ACCKH  AYDL+ W    R+ FNA +++QD  DT+  PF
Sbjct: 194 -------------AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLADTYDVPF 240

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           ++C+  GK + +MCSYNQVNG P C   +L +   R EWG  GYI SDCD+V  +++ Q+
Sbjct: 241 KACVRDGKVASVMCSYNQVNGKPTCADPELLRNTIRGEWGLNGYIVSDCDSVGVLYDNQH 300

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           YT+T E +AA  +KAG+D++CG  +  HT  AI +G + E D++ AL NL +VQ+RLG+F
Sbjct: 301 YTRTPEQAAAAAIKAGLDLDCGPFLALHTDGAIKQGLISENDLNLALANLITVQMRLGMF 360

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +GD +   YG LG  DVC   H  +AL+AARQGIVLL+N    LPL+     ++ +IGP 
Sbjct: 361 DGDAQ--PYGNLGTRDVCLPSHNDVALEAARQGIVLLQNKGNALPLSPTRYRTVGVIGPN 418

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
            +    M G Y GI C   + L+G+  YV   H A GC DV C  +  F  + ++A++AD
Sbjct: 419 SDVTVTMIGNYAGIACGYTTPLQGIARYVKTIHQA-GCKDVGCGGNQLFGLSEQVARQAD 477

Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
             ++V GLD + E E RDR  LLLPG Q  LV+ VAR ++ PVILVL  GGP+DV+FA+ 
Sbjct: 478 ATVLVMGLDQSIEAEFRDRTGLLLPGHQQELVSRVARAARGPVILVLMSGGPIDVTFAKN 537

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSS 637
           D +IS+ILW+GYPG++G  A+A++IFG  NP GRLP TWYP+ +  KVPM +M+MRA+ +
Sbjct: 538 DPKISAILWVGYPGQSGGTAIADVIFGRTNPSGRLPNTWYPQDYVRKVPMTNMDMRANPA 597

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
             YPGR+YRFY G  V+ FGHGLSY+ +++    AP ++++  +               +
Sbjct: 598 TGYPGRTYRFYKGPVVFPFGHGLSYSRFTHSLALAPKQVSVQFTTPLTQAFTNSSNKAMK 657

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
           + + + DE+     + FHV   V N G +DG+H ++++++ P        KQL+ F + +
Sbjct: 658 VSHANCDEL----EVGFHV--DVKNEGSMDGAHTLLVYSKAPN-----GVKQLVNFHKTY 706

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
             A     +  GV  C  LS  ++ G R +P+G H L +G+L+HS+ ++T
Sbjct: 707 VPAGSKTRVKVGVHVCNHLSAVDEFGVRRIPMGEHELQIGDLKHSILVQT 756


>gi|326494302|dbj|BAJ90420.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326521150|dbj|BAJ96778.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326527851|dbj|BAK08165.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 775

 Score =  704 bits (1818), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/772 (46%), Positives = 492/772 (63%), Gaps = 37/772 (4%)

Query: 40  PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P F C   +    +Y FCN   + S RA+ L+S LTL EK+  L +   A+ RLGIPAYE
Sbjct: 35  PVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYE 94

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG++  GPG  F+  V   TSFPQ +++AASFN SL+  IG  V+ EARAM+N+
Sbjct: 95  WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 154

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
           G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V   Q     +  GG+  G
Sbjct: 155 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ----DAGAGGVTDG 210

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                              L ++ACCKH  AYD++ W    RY+F+A +++QD +DTFQP
Sbjct: 211 ------------------ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQP 252

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
           PF+SC+  G  + +MCSYN+VNG P C   DL +   R +W   GYI SDCD+V  ++  
Sbjct: 253 PFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQ 312

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           Q+YTKT E++AA  +K+G+D+NCG  + +HT +A+  G++ E+D+DRA+ N F + +RLG
Sbjct: 313 QHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLG 372

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
            F+GDPR+  +G LGP DVCTS +++LA + ARQGIVLLKN    LPL+  ++ S+A+IG
Sbjct: 373 FFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNSGA-LPLSAKSIKSMAVIG 431

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
           P  N    M G Y G PC   + L+GL A V+ T Y  GC +V C+ ++     AV  A 
Sbjct: 432 PNANASFTMIGNYEGTPCKYTTPLQGLGAKVN-TVYQPGCTNVGCSGNSLQLSTAVAAAA 490

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD  ++V G D + E E  DR SLLLPGQQ  LV++VA  S  PVILV+  GGP D+SF
Sbjct: 491 SADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISF 550

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
           A+A  +I++ILW+GYPGEAG  ALA+I+FG  NP GRLP+TWYP S+   V M DM MR 
Sbjct: 551 AKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRP 610

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D+S  YPGR+YRFYTG  V+ FG GLSYT  S+  +SAP       S++   D     + 
Sbjct: 611 DTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYV---SMRLAEDHPCRAEE 667

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
            + ++         C  L F V++ V NAG+V G+H V+LF+  P  +   P K L+GF+
Sbjct: 668 CASVEAAG----DHCDDLAFDVKLQVRNAGEVAGAHSVLLFSS-PPPAHNAPAKHLLGFE 722

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           +V      +  ++F VD C  LS+ ++ G R + LG H L VG+L+H++ + 
Sbjct: 723 KVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGDLKHTVELR 774


>gi|357166259|ref|XP_003580652.1| PREDICTED: beta-D-xylosidase 4-like [Brachypodium distachyon]
          Length = 774

 Score =  703 bits (1815), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/771 (47%), Positives = 488/771 (63%), Gaps = 39/771 (5%)

Query: 40  PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P F C   +     Y FC+ + S S RA  L+S LTL +K+  L +   A+ RLGIPAYE
Sbjct: 36  PVFACDAANSTVAGYAFCDRAKSASARAADLVSRLTLADKVGFLVNKQPALARLGIPAYE 95

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG++  GPG  F+  V   TSFPQ +++AASFN SL+  IG  V+ EARAM+N+
Sbjct: 96  WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSNEARAMHNV 155

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
           G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V   Q       D G    
Sbjct: 156 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASRYAVGYVSGLQ-------DAG---- 204

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                        +D    L ++ACCKH  AYD++ W    RY+F+A +++QD +DTFQP
Sbjct: 205 -------------ADADGPLKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQP 251

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
           PF+SC+  GK + +MCSYN+VNG P C   DL     R +W   GYI SDCD+V  ++  
Sbjct: 252 PFKSCVIDGKVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYSQ 311

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           Q+YTKT E++AA  +K+G+D+NCG  + +HT +A+  G + E D+DRA+ N F + +RLG
Sbjct: 312 QHYTKTPEEAAAITIKSGLDLNCGDFLAKHTVAAVQAGNLSESDVDRAITNNFIMLMRLG 371

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
            F+GDPRK  YG LGP DVCTS +++LA + ARQGIVLLKND   LPL+  ++ S+A+IG
Sbjct: 372 FFDGDPRKLAYGSLGPKDVCTSSNQELARETARQGIVLLKNDGA-LPLSAKSIKSMAVIG 430

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
           P  N    M G Y G PC   + L GL   V+ T Y  GC +V C+ ++     A   A 
Sbjct: 431 PNANASFTMIGNYEGTPCKYTTPLHGLGNNVA-TVYQPGCSNVGCSGNSLQLSAATAAAA 489

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD  ++V G D + E E  DR SLLLPGQQ  L+++VA  SK  VILV+  GGP D+SF
Sbjct: 490 SADVTVLVVGADQSIEREALDRTSLLLPGQQPDLISAVANASKGHVILVVMSGGPFDISF 549

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
           A+A  +IS+ILW+GYPGEAG  A+A+IIFG +NP GRLP+TWYP SF  KVPM DM MR 
Sbjct: 550 AKASDKISAILWVGYPGEAGGAAIADIIFGKYNPSGRLPVTWYPASFADKVPMTDMRMRP 609

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D+S  YPGR+YRFYTG  V+ FG GLSYT  S+  ++AP    +S  L  G   +  +  
Sbjct: 610 DNSTGYPGRTYRFYTGETVFAFGDGLSYTTMSHNLVAAPPS-EVSMQLAEGHACHTKECA 668

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
                  H      C  + F V++ V N G++ G+H V+LF+  P V    P K L+GF+
Sbjct: 669 SVEAAGDH------CEGMAFEVRLRVHNTGEMAGAHTVLLFSSPPAV-HNAPAKHLLGFE 721

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           +++     +   +F VD C+ LS+ ++ G R + LG H L VG+L+H+L +
Sbjct: 722 KLNLEPGQAGVAAFKVDVCKDLSVVDELGNRKVALGGHTLHVGDLKHTLNL 772


>gi|326492918|dbj|BAJ90315.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 775

 Score =  703 bits (1814), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/772 (46%), Positives = 492/772 (63%), Gaps = 37/772 (4%)

Query: 40  PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P F C   +    +Y FCN   + S RA+ L+S LTL EK+  L +   A+ RLGIPAYE
Sbjct: 35  PVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYE 94

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG++  GPG  F+  V   TSFPQ +++AASFN SL+  IG  V+ EARAM+N+
Sbjct: 95  WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 154

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
           G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V   Q     +  GG+  G
Sbjct: 155 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ----DAGAGGVTDG 210

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                              L ++ACCKH  AYD++ W    RY+F+A +++QD +DTFQP
Sbjct: 211 ------------------ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQP 252

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
           PF+SC+  G  + +MCSYN+VNG P C   DL +   R +W   GYI SDCD+V  ++  
Sbjct: 253 PFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQ 312

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           Q+YTKT E++AA  +K+G+D+NCG  + +HT +A+  G++ E+D+DRA+ N F + +RLG
Sbjct: 313 QHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLG 372

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
            F+GDPR+  +G LGP DVCTS +++LA + ARQGIVLLKN    LPL+  ++ S+A+IG
Sbjct: 373 FFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNSGA-LPLSAKSIKSMAVIG 431

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
           P  N    M G Y G PC   + L+GL A V+ T Y  GC +V C+ ++     AV  A 
Sbjct: 432 PNANASFTMIGNYEGTPCKYTTPLQGLGAKVN-TVYQPGCTNVGCSGNSLQLSTAVAAAA 490

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD  ++V G D + E E  DR SLLLPGQQ  LV++VA  S  PVILV+  GGP D+SF
Sbjct: 491 SADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISF 550

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
           A+A  +I++ILW+GYPGEAG  ALA+I+FG  NP G+LP+TWYP S+   V M DM MR 
Sbjct: 551 AKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGKLPVTWYPASYADTVTMTDMRMRP 610

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D+S  YPGR+YRFYTG  V+ FG GLSYT  S+  +SAP       S++   D     + 
Sbjct: 611 DTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYV---SMRLAEDHPCRAEE 667

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
            + ++         C  L F V++ V NAG+V G+H V+LF+  P  +   P K L+GF+
Sbjct: 668 CASVEAAG----DHCDDLAFDVKLQVRNAGEVAGAHSVLLFSS-PPPAHNAPAKHLLGFE 722

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           +V      +  ++F VD C  LS+ ++ G R + LG H L VG+L+H++ + 
Sbjct: 723 KVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGDLKHTVELR 774


>gi|115486595|ref|NP_001068441.1| Os11g0673200 [Oryza sativa Japonica Group]
 gi|113645663|dbj|BAF28804.1| Os11g0673200 [Oryza sativa Japonica Group]
          Length = 822

 Score =  701 bits (1809), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/806 (45%), Positives = 499/806 (61%), Gaps = 81/806 (10%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           + PFC  SL    RA+ L++ LT  EK++ L +NA+ +PRLG+  YEWWSE+LHG++  G
Sbjct: 40  TLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG-------------------------- 144
           PGV F G     T+FPQV+ +AASFN +LW  IG                          
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQVMPILKGGHARCNQRPSCIRISVFM 159

Query: 145 ------SAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
                  AV+ E RAMYN GQAGLTFW+PN+NIFRDPRWGRGQETPGEDP V + YA  +
Sbjct: 160 YVYVCAQAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAY 219

Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
           V+  Q                         ++      L L+ACCKH  AYDL+ W    
Sbjct: 220 VRGLQ-------------------------QQQPSSGRLKLAACCKHFTAYDLDNWSGTD 254

Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-----LRGDLFQKA 313
           R+ FNA++T QD EDTF  PFRSC+  G+A+ +MCSYNQVNGVP C     LRG +    
Sbjct: 255 RFHFNAVVTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTI---- 310

Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
           R  WG  GYI SDCD+V   +  Q+YT+T ED+ A  L+AG+D++CG  + ++T+ A+ +
Sbjct: 311 RRRWGLAGYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQ 370

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
           GKV + DID A+ N  +VQ+RLG+F+GDP    +G LGP  VCT+ H++LA++AARQGIV
Sbjct: 371 GKVGDGDIDAAVTNTVTVQMRLGMFDGDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIV 430

Query: 434 LLKNDKKFLPLN-KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHY 492
           LLKND + LPL+   A  ++A++GP       M G Y G PC   + L+G+  Y ++  +
Sbjct: 431 LLKNDGRALPLSPATARRAVAVVGPHAEATVAMIGNYAGKPCRYTTPLQGVARYAARAAH 490

Query: 493 ASGCHDVPC-NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
             GC DV C  S      AV  A++AD  IVVAGLD   E E  DR SLLLPG+Q  L++
Sbjct: 491 QPGCTDVACAGSGQPIAAAVDAARRADATIVVAGLDQKIEAEGLDRASLLLPGRQAELIS 550

Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
           SVA+ SK PVILVL  GGP+D+ FA+ D +I+ ILW GYPG+AG +A+A++IFG  NPGG
Sbjct: 551 SVAKASKGPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGG 610

Query: 612 RLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL 670
           +LP+TWYP+ +  KVPM +M MRA+ ++ YPGR+YRFYTG  ++ FGHGLSYT++++   
Sbjct: 611 KLPVTWYPQDYLQKVPMTNMAMRANPAKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIA 670

Query: 671 SAPSELTI--SASLKAGSDKNILQQTGSRLD--YVHIDEVTSCTSLRFHVQISVTNAGDV 726
            APS+LT+  SA   A S    L  T +RL            C  LR  V + V N G+ 
Sbjct: 671 HAPSQLTVRLSAHHAAASASASLNAT-ARLSRAAAVRVAHARCEELRMPVHVDVRNVGER 729

Query: 727 DGSHVVMLFARVPKVSQGTPE-------KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           DG+H V+++A  P  S            +QL+ F++VH  A G+  +  G+D C+ LS+A
Sbjct: 730 DGAHTVLVYAAAPASSAAEAAAGHGAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVA 789

Query: 780 NKHGRRILPLGNHVLMVGELRHSLTI 805
           +++G R +P+G H L++GEL H++TI
Sbjct: 790 DRNGVRRIPVGEHRLIIGELTHTVTI 815


>gi|183579871|dbj|BAG28345.1| arabinofuranosidase [Citrus unshiu]
          Length = 769

 Score =  700 bits (1807), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/770 (44%), Positives = 488/770 (63%), Gaps = 41/770 (5%)

Query: 33  SKPVLNKPDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPR 90
           S+ V ++  F C P +    S  FC TS+ I  R + LI  LTLQEKI+ L +NA+A+PR
Sbjct: 19  SRFVESRVPFACDPRNGLTRSLRFCRTSVPIHVRVQDLIGRLTLQEKIRLLVNNAAAVPR 78

Query: 91  LGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVE 150
           LGI  YEWWSE+LHG+++ GPG  F G     TSFPQV+ +AA+FN SLW  IG  V+ E
Sbjct: 79  LGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAAAFNESLWEEIGRVVSDE 138

Query: 151 ARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSD 210
           ARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++   YA  +V+  QG      
Sbjct: 139 ARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRRLQG------ 192

Query: 211 DGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQD 270
                                + G  L ++ACCKH  AYDL+ W    RY FNA +++QD
Sbjct: 193 ---------------------NTGSRLKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQD 231

Query: 271 TEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDA 329
            EDT+  PF++C+ +GK + +MCSYNQVNG P C   D+ +   R +W   GYI SDCD+
Sbjct: 232 LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDILKNTIRGQWRLDGYIVSDCDS 291

Query: 330 VATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
           V  ++  Q+YT+T E++AA  +KAG+D++CG  +  HT+ A+  G ++E+D++ A     
Sbjct: 292 VGVLYNTQHYTRTPEEAAADAIKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLASAYTI 351

Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
           +VQ+RLG+F+G+P    +G LGP DVCT  H++LAL AA QGIVLLKN  + LPL+    
Sbjct: 352 TVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRH 411

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
            ++A+IGP  +    M G Y G+ C   + L+G+  Y +KT + +GC  V CN +     
Sbjct: 412 HTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGISRY-AKTIHQAGCLGVACNGNQLIGA 470

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
           A   A++AD  ++V GLD + E E  DR  LLLPG+Q  LV+ VA+ S+ PV+LVL  GG
Sbjct: 471 AEVAARQADATVLVMGLDQSIEAEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGG 530

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMN 628
           P+DVSFA+ D +I +ILW+GYPG+AG  A+A+++FG  NPGG+LPMTWYP+ +  ++PM 
Sbjct: 531 PVDVSFAKNDPRIGAILWVGYPGQAGGAAIADVLFGRANPGGKLPMTWYPQDYVARLPMT 590

Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
           DM MRA   R YPGR+YRFY G  V+ FGHG+SYT +++    AP++ ++  +    + K
Sbjct: 591 DMRMRA--GRGYPGRTYRFYKGPVVFPFGHGMSYTTFAHTLSKAPNQFSVPIATSLYAFK 648

Query: 689 N-ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
           N  +     R+ + + ++     SL  HV   V N GD+ G+H +++FA+ P     +P 
Sbjct: 649 NTTISSNAIRVAHTNCNDA---MSLGLHVD--VKNTGDMAGTHTLLVFAK-PPAGNWSPN 702

Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           KQLIGF +VH  A   + +   +  C+ LS+ +K G R +P  N V ++G
Sbjct: 703 KQLIGFKKVHVTAGALQSVRLDIHVCKHLSVVDKFGIRRIPWVNIVSILG 752


>gi|255572559|ref|XP_002527213.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
 gi|223533389|gb|EEF35139.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
          Length = 454

 Score =  695 bits (1794), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/454 (71%), Positives = 392/454 (86%)

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
           MDINCG+  +R+ QSA+DKGK++E+DIDRALLNLFSVQLRLGLF+GD   G + KLGP+D
Sbjct: 1   MDINCGSYAIRNAQSAVDKGKLREEDIDRALLNLFSVQLRLGLFDGDRINGHFSKLGPED 60

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           VCT EHKKLAL+AARQGIVLLKN+KKFLPLNK AVSSLAIIGPL NN   +GG YTG  C
Sbjct: 61  VCTEEHKKLALEAARQGIVLLKNEKKFLPLNKKAVSSLAIIGPLANNGGSLGGDYTGYSC 120

Query: 475 SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED 534
           +P+SL  G++AY+ +T YA GC +V C+SD  F EA+ IAK ADFVIVVAG+DL+QETED
Sbjct: 121 NPQSLFDGVQAYIKRTSYAVGCSNVSCDSDDQFPEAIHIAKTADFVIVVAGIDLSQETED 180

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
           RDR+SLLLPG+QM+LV+ VA  SK+PVILVLTGGGP+DVSFA+ DS+I+SILWIGYPGEA
Sbjct: 181 RDRISLLLPGKQMALVSYVAAASKKPVILVLTGGGPVDVSFAKRDSRIASILWIGYPGEA 240

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
           GAKALA+IIFG++NPGGRLPMTWYPESFT VPMNDMNMRA+ +R YPGR+YRFYTG +VY
Sbjct: 241 GAKALADIIFGEYNPGGRLPMTWYPESFTNVPMNDMNMRANPNRGYPGRTYRFYTGERVY 300

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
           GFG GLSYTNY+YKFLSAPS+L++S SL A S K IL Q G RLDY+ IDE++SC SLRF
Sbjct: 301 GFGEGLSYTNYAYKFLSAPSKLSLSGSLTATSRKRILHQRGDRLDYIFIDEISSCNSLRF 360

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
            VQISV N GD+DGSHVVMLF+RVP+VS+GTPEKQL+GF+R++TV+  S E S  +DPC+
Sbjct: 361 TVQISVMNVGDMDGSHVVMLFSRVPQVSEGTPEKQLVGFERINTVSHKSTETSILLDPCK 420

Query: 775 QLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
            LSIAN  G+RI+P+G+HVL++G+L+H +TIE Y
Sbjct: 421 HLSIANGQGKRIMPVGSHVLLLGDLQHFVTIEAY 454


>gi|18025340|gb|AAK38481.1| alpha-L-arabinofuranosidase/beta-D-xylosidase isoenzyme ARA-I
           [Hordeum vulgare]
          Length = 777

 Score =  693 bits (1788), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/772 (46%), Positives = 487/772 (63%), Gaps = 37/772 (4%)

Query: 40  PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P F C   +    +Y FCN   + S RA+ L+S LTL EK+  L +   A+ RLGIPAYE
Sbjct: 37  PVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYE 96

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG++  GPG  F+  V   TSFPQ +++AASFN SL+  IG  V+ EARAM+N+
Sbjct: 97  WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 156

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
           G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V   Q     +  GG+  G
Sbjct: 157 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ----DAGAGGVTDG 212

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                              L ++ACCKH  AYD++ W    RY+F+A +++QD +DTFQP
Sbjct: 213 ------------------ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQP 254

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
           PF+SC+  G  + +MCSYN+VNG P C   DL +   R +W   GYI SDCD+V  ++  
Sbjct: 255 PFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQ 314

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           Q+YTKT E++AA  +K+G+D+NCG  + +HT +A+  G++ E+D+DRA+ N F + +RLG
Sbjct: 315 QHYTKTPEEAAAITIKSGVDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLG 374

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
            F+GDPR+  +G LGP DVCTS +++LA + ARQGIVLLKN    LPL+  ++ S+A+IG
Sbjct: 375 FFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNSGA-LPLSAKSIKSMAVIG 433

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
           P  N    M G Y G PC   + L+GL A V+ T Y  GC +V C+ ++     AV  A 
Sbjct: 434 PNANASFTMIGNYEGTPCKYTTPLQGLGAKVN-TVYQPGCTNVGCSGNSLQLSTAVAAAA 492

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD  ++V G D + E E  DR SLLLPGQQ  LV++VA  S  PVILV+  GGP D+SF
Sbjct: 493 SADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISF 552

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
           A+A  +I++ LW+GYPGEAG  AL + +FG  NP GRLP+TWYP S+   V M DM MR 
Sbjct: 553 AKASDKIAATLWVGYPGEAGGAALDDTLFGSHNPSGRLPVTWYPASYADTVTMTDMRMRP 612

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D+S  YPGR+YRFYTG  V+ FG GLSYT  S+  +SAP       S++   D     + 
Sbjct: 613 DTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYV---SMRLAEDHLCRAEE 669

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
            + ++         C  L   V++ V NAG+V G+H V+LF+  P  +   P K L+GF+
Sbjct: 670 CASVEAAG----DHCDDLALDVKLQVRNAGEVAGAHSVLLFSS-PPPAHNAPAKHLVGFE 724

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           +V      +  ++F VD C  LS+ ++ G R + LG H L  G+L+H++ + 
Sbjct: 725 KVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHDGDLKHTVELR 776


>gi|189380221|gb|ACD93208.1| beta xylosidase [Camellia sinensis]
          Length = 767

 Score =  692 bits (1787), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/773 (44%), Positives = 487/773 (63%), Gaps = 44/773 (5%)

Query: 38  NKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           ++P F C      + PFC  SL I  R + LI  LTLQEKI+ L +NA+A+PRLGI  YE
Sbjct: 27  SRPAFACDGA-TRNLPFCRVSLPIQDRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIKGYE 85

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG+++  PGV F G     TSFPQV+ +AASFN SLW +IG  V+ EARAMYN 
Sbjct: 86  WWSEALHGVSNADPGVKFGGAFPGATSFPQVISTAASFNASLWEHIGRVVSDEARAMYNG 145

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
           G AGLT+W+PN+NIFRDPRWGRGQETPGEDP++   YA  +V+  QG             
Sbjct: 146 GMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQG------------- 192

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                         + G++L ++ACCKH  AYDL+ W +  RY FNA +++QD  DT+  
Sbjct: 193 --------------NSGNQLKVAACCKHYTAYDLDNWNSVDRYRFNARVSKQDLADTYDV 238

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQ 337
           PF++C+ +GK         ++   P  L   L     + W +  ++   C  +   + + 
Sbjct: 239 PFKACVVEGKYQVYCAHTIKLMANPLVL--TLISPQHHPWSWHSWL--HCFRLYRCWGFI 294

Query: 338 NYTKTH---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
            ++  H   ED+AA  +KAG+D+ CG  +  HT+ A+ +GK+ E D++ AL+N  SVQ+R
Sbjct: 295 CHSTLHSTPEDAAAATIKAGLDLECGPFLAIHTEQAVRQGKLGEADVNGALINTLSVQMR 354

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG+F+G+P    YG LGP DVCT  H++LAL+AARQGIVLL+N  + LPL+     ++A+
Sbjct: 355 LGMFDGEPSSQPYGNLGPRDVCTPAHQQLALEAARQGIVLLQNRGRSLPLSTQLHRTVAV 414

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IGP  +    M G Y G+ C   + L+G+E YV   H  SGC  V C+++  F  A   A
Sbjct: 415 IGPNSDVTVTMLGNYAGVACGFTTPLQGIERYVRTIH-QSGCDSVACSNNQLFGVAETAA 473

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           ++AD  ++V GLD + ETE +DRV LLLPG Q  LV+ VA  S+ PV+LVL  GGP+DVS
Sbjct: 474 RQADATVLVMGLDQSIETEFKDRVGLLLPGPQQELVSRVAMASRGPVVLVLMSGGPIDVS 533

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMR 633
           FA+ D +I +ILW+GYPG+AG  A+A+++FG  NPGGRLPMTWYP+ +  K PM +M MR
Sbjct: 534 FAKNDPRIGAILWVGYPGQAGGTAIADVLFGRTNPGGRLPMTWYPQDYLAKAPMTNMAMR 593

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
           A+ S  YPGR+YRFY G  V+ FGHG+SYT ++++   AP+ +++  +   G   +    
Sbjct: 594 ANPSSGYPGRTYRFYKGPVVFPFGHGMSYTTFAHELAHAPTTVSVPLTSLYGLQNSTTFN 653

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G R+ +      T+C +L   + I V N GD+DG+H V++F+  P V +    KQLIGF
Sbjct: 654 NGIRVTH------TNCDTLILGIHIDVKNTGDMDGTHTVLVFS-TPPVGKWGANKQLIGF 706

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            +VH VA+G + +   V  C QLS+ ++ G R +P+G H L +G+++HS++++
Sbjct: 707 KKVHVVARGRQRVKIHVHVCNQLSVVDQFGIRRIPIGEHSLHIGDIKHSISLQ 759


>gi|296083274|emb|CBI22910.3| unnamed protein product [Vitis vinifera]
          Length = 738

 Score =  692 bits (1786), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/769 (45%), Positives = 480/769 (62%), Gaps = 70/769 (9%)

Query: 42  FPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           F C P +    + PFC  SL I  RA+ L+  LTLQEKI+ L +NA  +PRLGI  YEWW
Sbjct: 27  FACDPRNGVTRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIKGYEWW 86

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE+LHG+++ GPG  F G+    TSFPQV+ +AASFN SLW  IG  V+ EARAMYN G 
Sbjct: 87  SEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAMYNGGM 146

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           AGLT+W+PN+NIFRDPRWGRGQETPGEDP V + YA  +V+  QG               
Sbjct: 147 AGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQG--------------- 191

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
                       +  D L ++ACCKH  AYDL+ WG   R+ FNA +++QD EDT+  PF
Sbjct: 192 ------------NARDRLKVAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPF 239

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           ++C+ +G  + +MCSYNQVNG P C    L +   R EW   GYI SDCD+V   ++ Q+
Sbjct: 240 KACVVEGNVASVMCSYNQVNGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQH 299

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           YT T E++AA  +KAG+D++CG  +  HT++AI  GK+ E D++ AL+N  SVQ+RLG+F
Sbjct: 300 YTATPEEAAAVAIKAGLDLDCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMF 359

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +G+P    YG LGP DVCT  H++LAL+AARQGIVL++N    LPL+ +   ++A+IGP 
Sbjct: 360 DGEPSAQPYGNLGPRDVCTPAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPN 419

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
            +    M G Y G+ C   + L+G+  Y    H A GC  V C  D  F  AV  A++AD
Sbjct: 420 SDVTETMIGNYAGVACGYTTPLQGIGRYARTIHQA-GCSGVACRDDQQFGAAVAAARQAD 478

Query: 519 FVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
             ++V GLD + E E RDRV +LLPG+Q  LV+ VA  S+ P +LVL  GGP+DVSFA+ 
Sbjct: 479 ATVLVMGLDQSIEAEFRDRVDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKN 538

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSS 637
           D +I++I+W+GYPG+AG  A+A+++FG  NPGG+LP+TWYP+S+  K PM +M MRA  S
Sbjct: 539 DPRIAAIIWVGYPGQAGGTAIADVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPS 598

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
           R YPGR+YRFY G  V+ FGHGLSY+ +++    AP+                       
Sbjct: 599 RGYPGRTYRFYNGPVVFPFGHGLSYSTFAHSLAQAPT----------------------- 635

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
                       T L FH  I V N G +DGSH ++LF+  P     +P K+L+ F++VH
Sbjct: 636 ------------TPLGFH--IDVKNTGTMDGSHTLLLFS-TPPPGTWSPNKRLLAFEKVH 680

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             A   + + F V  C+ LS+ +  G   +P+G H   +G+L+HS++++
Sbjct: 681 VGAGSQERVRFDVHVCKHLSVVDHFGIHRIPMGEHHFHIGDLKHSISLQ 729


>gi|85813772|emb|CAJ65922.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 757

 Score =  692 bits (1786), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/819 (45%), Positives = 497/819 (60%), Gaps = 107/819 (13%)

Query: 18  VLQLIVVVNVIAFSNSKPVL--NKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLL 72
           V     +V  + FS+S  VL  + P F C     P   S+ FCNTSL +S R   L+  L
Sbjct: 15  VFLFFSLVCFLLFSSSHVVLAQSSPVFACDVVSNPSLASFGFCNTSLGVSDRVVDLVKRL 74

Query: 73  TLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
           TLQEKI  L ++A ++ RLGIP YEWWSE+LHG++  GPG +F+  V   TSFPQV+++A
Sbjct: 75  TLQEKILFLVNSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTA 134

Query: 133 ASFNRSLWSNIG----SAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDP 188
           ASFN SL+  IG      V+ EARAMYN+G AGLTFW+PNINIFRDPRWGRGQETPGEDP
Sbjct: 135 ASFNTSLFVAIGKVISQVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDP 194

Query: 189 MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIA 248
           ++ S Y   +VK  Q    + DDG                      D L ++ACCKH  A
Sbjct: 195 LLSSKYGSGYVKGLQ----QRDDG--------------------NPDGLKVAACCKHYTA 230

Query: 249 YDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRG 307
           YDL+ W    RY FNA ++T+QD +DTFQPPF+SC+  G  + +MCSYN+VNG+P C   
Sbjct: 231 YDLDNWKGVDRYHFNAVVVTKQDMDDTFQPPFKSCVVDGNVASVMCSYNKVNGIPTCADP 290

Query: 308 DLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAGVLKA--GMDINCGTC 362
           DL     R EW   G  YI +DCD++   +  Q+YTKT E++AA  + A  G+D+NCG+ 
Sbjct: 291 DLLSGVIRGEWKLNGYVYIVTDCDSIDVFYNSQHYTKTPEEAAAKAILAGIGLDLNCGSF 350

Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKK 422
           + +HT++A+  G V E  IDRA+ N F+  +RLG F+GDP K  YGKLGP DVCT+E+++
Sbjct: 351 LGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQE 410

Query: 423 LALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG 482
           LA +AARQGIVLLKN                                TG PC   + L+G
Sbjct: 411 LAREAARQGIVLLKN--------------------------------TGTPCKYTTPLQG 438

Query: 483 LEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
           L A V+ T Y  GC +V C S A   +A +IA  AD  ++V G DL+ E E RDRV +LL
Sbjct: 439 LAALVATT-YLPGCSNVAC-STAQVDDAKKIAAAADATVLVMGADLSIEAESRDRVDILL 496

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
           PGQQ  L+T+VA  S  PVILV+  GG +DVSFA+ + +I+SILW+GYPGEAG  A+A+I
Sbjct: 497 PGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADI 556

Query: 603 IFGDFN------PGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
           IFG +N      PGGRLPMTWYP+S+  KVPM +MNMR D S  YPGR+YRFYTG  VY 
Sbjct: 557 IFGSYNPSTHQPPGGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYS 616

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL--- 712
           FG GLSY+ +S++   AP  +++                   L+  H+   + C S+   
Sbjct: 617 FGDGLSYSEFSHELTQAPGLVSVP------------------LEENHVCYSSECKSVAAA 658

Query: 713 -----RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
                 F V + + N G   GSH V LF+  P V   +P+K L+GF++V   A+    + 
Sbjct: 659 EQTCQNFDVHLRIKNTGTTSGSHTVFLFSTPPSV-HNSPQKHLVGFEKVFLHAQTDSHVG 717

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           F VD C+ LS+ ++ G + + LG HVL +G L+HS+T+ 
Sbjct: 718 FKVDVCKDLSVVDELGSKKVALGEHVLHIGSLKHSMTVR 756


>gi|357138088|ref|XP_003570630.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 1026

 Score =  690 bits (1780), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/682 (51%), Positives = 452/682 (66%), Gaps = 40/682 (5%)

Query: 11  RERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLIS 70
           R  I   VL L +V+     ++       P F C  P   SYPFC+  L I  RA  L S
Sbjct: 4   RSIIFHVVLPLCLVLQATMATD-------PPFSCGSP--SSYPFCDRKLPIGQRAADLAS 54

Query: 71  LLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP----GVNFN-GTVSSVTSF 125
            LT++EK+  L D +  +PRLG+PAY+WWSE+LHG+A N P    GV F+ G V + TSF
Sbjct: 55  RLTVEEKVSLLGDVSPGVPRLGVPAYKWWSEALHGVA-NAPADRAGVRFDDGPVRAATSF 113

Query: 126 PQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETP 184
           PQVLV+AASFN  LW  IG  +  EAR +YN GQA GLTFWAPNIN+FRDPRWGRGQETP
Sbjct: 114 PQVLVTAASFNPHLWYRIGQVIGREARGIYNSGQAEGLTFWAPNINVFRDPRWGRGQETP 173

Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
           GEDP +   YA  FV+  QG           +G          G  +  G  L  SACCK
Sbjct: 174 GEDPTMTGKYAAVFVRGVQG-----------YG--------ASGAVNSSG--LEASACCK 212

Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
           H  AYDLE W   +R++FNA ++EQD  DT+ PPFRSC+E G AS IMCSYN+VNGVP C
Sbjct: 213 HFTAYDLENWNGVTRFAFNAKVSEQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTC 272

Query: 305 LRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM 363
              +L  K AR +W F GYITSDCDAVA I + Q Y K  ED+ A VLKAGMD+NCG  +
Sbjct: 273 ADHNLLSKTARGDWRFNGYITSDCDAVAIIHDVQGYAKEPEDAVADVLKAGMDVNCGDYV 332

Query: 364 LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKL 423
            +H  SA  +GK+ E+DIDRAL NLF++++RLGLF+G+P+  +YG +G D VC  EH+ L
Sbjct: 333 QKHGVSAFHQGKITEQDIDRALQNLFAIRMRLGLFDGNPKYNRYGNIGADQVCKKEHQDL 392

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGL 483
           AL+AA+ GIVLLKND   LPL K  +SSLA+IG   N+  ++ G Y G PC   S L+ L
Sbjct: 393 ALEAAQDGIVLLKNDAGTLPLPKQKISSLAVIGHNANDAQRLQGNYFGPPCISVSPLQAL 452

Query: 484 EAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
           + YV +T + +GC+   CN  +    A + A +A++V++  GLD  QE ED DR+ L LP
Sbjct: 453 QGYVRETKFVAGCNAAVCNV-SDIAGAAKAASEAEYVVLFMGLDQDQEREDLDRIELGLP 511

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
           G Q SLV +VA  +K+PV+LVL  GGP+DV+FA+ + +I +I+W GYPG+AG  A+A+++
Sbjct: 512 GMQESLVNAVADAAKKPVVLVLLCGGPVDVTFAKGNPKIGAIIWAGYPGQAGGIAIAQVL 571

Query: 604 FGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           FG+ NPGGRLP+TWYP+ + T V M DM MRAD+S  YPGR+YRFY G  VY FG+GLSY
Sbjct: 572 FGEHNPGGRLPVTWYPKEYATAVAMTDMRMRADASTGYPGRTYRFYKGKTVYNFGYGLSY 631

Query: 663 TNYSYKFLSAPSELTISASLKA 684
           + YS+ F+S P    +   ++A
Sbjct: 632 SKYSHSFVSKPPASMMIGEVRA 653


>gi|357489437|ref|XP_003615006.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516341|gb|AES97964.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 685

 Score =  687 bits (1772), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/699 (48%), Positives = 462/699 (66%), Gaps = 31/699 (4%)

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNIN 170
           G+  NG++ + TSFPQV+++AASF+  LW  I   +  EAR +YN GQA G+ FWAPNIN
Sbjct: 2   GIILNGSIPAATSFPQVILTAASFDPKLWYQISKVIGTEARGVYNAGQAQGMNFWAPNIN 61

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           IFRDPRWGRGQET GEDP+V S Y V +V+  QG+++   +GG   G R           
Sbjct: 62  IFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQGDSF---EGGKLIGGR----------- 107

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                 L  SACCKH  AYDLE W   +RY F+A +T QD  DT+QP F SC+ QG++S 
Sbjct: 108 ------LKASACCKHFTAYDLENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSG 161

Query: 291 IMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           IMC+YN+VNGVP C   +L    AR +W F GYI SDCDAV  I+E Q Y KT ED  A 
Sbjct: 162 IMCAYNRVNGVPNCADYNLLTNTARKKWNFNGYIASDCDAVRFIYEKQGYAKTPEDVVAD 221

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
           VL+AGMD+ CG  M +H +SA+ + K+    IDRAL NLF++++RLGLF+G+P K +YG+
Sbjct: 222 VLRAGMDLECGNYMTKHAKSAVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGR 281

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS-QMGGG 468
           +GP+ VC+ E+  LAL+AAR GIVLLKN    LPL +  V++L +IGP  N  S  + G 
Sbjct: 282 IGPNQVCSKENLDLALEAARSGIVLLKNTASILPLPR--VNTLGVIGPNANKSSIVLLGN 339

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G PC   S+L+G   Y S+THY SGC D    + A    AV +AK +D+VI+V GLD 
Sbjct: 340 YIGPPCKNVSILKGFYTYASQTHYHSGCTDGTKCASAEIDRAVEVAKISDYVILVMGLDQ 399

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           +QETE  DR  L LPG+Q  L+ SVA+ SK+PVILVL  GGP+D++FA+ + +I  I+W 
Sbjct: 400 SQETETLDRDHLELPGKQQKLINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWA 459

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
           GYPGE G +ALA+++FGD+NPGGRLPMTWYP+ F K+PM DM MRAD S  YPGR+YRFY
Sbjct: 460 GYPGELGGRALAQVVFGDYNPGGRLPMTWYPKDFIKIPMTDMRMRADPSSGYPGRTYRFY 519

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV-- 706
           TG +VY FG+GLSY+NYSY F+S  +    +  +   +  +IL+ + + ++Y  + E+  
Sbjct: 520 TGPKVYEFGYGLSYSNYSYNFISVKNN---NLHINQSTTYSILENSET-INYKLVSELGE 575

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
            +C ++   V + +TN G + G H V+LF +  K   G P KQL+GF+ V     G  E+
Sbjct: 576 ETCKTMSISVTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEV 635

Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
            F V  CE LS AN+ G +++  G ++ +VG+  +S+ I
Sbjct: 636 GFEVSVCEHLSRANESGVKVIEEGGYLFLVGQEEYSINI 674


>gi|168065036|ref|XP_001784462.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663987|gb|EDQ50724.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 726

 Score =  680 bits (1754), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/759 (45%), Positives = 473/759 (62%), Gaps = 52/759 (6%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           FC+TSLS   R   L+S LTL+EK+ QL + ASAIPRL IPAYEWW E LHG+A     V
Sbjct: 3   FCDTSLSDEIRVFDLVSRLTLEEKVTQLVNTASAIPRLSIPAYEWWQEGLHGVAH----V 58

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFR 173
           +F G++   TSFP  +++ ASFN+ LW+ IG A + EARA YN G AGLT+W+P INI R
Sbjct: 59  SFGGSLPRATSFPLPILTTASFNKDLWNQIGQAFSTEARAFYNDGIAGLTYWSPVINIAR 118

Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           DPRWGR QET GEDP   SAYA  FV+  Q                         E    
Sbjct: 119 DPRWGRIQETSGEDPYTTSAYATHFVQGMQ-------------------------EGDAN 153

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
              L LSACCKH  AYD++ W    RY F+A   + +  DT+ PPF+SC+++G+++ +MC
Sbjct: 154 SKRLKLSACCKHFTAYDVDNWEGIDRYHFDA---KANLADTYNPPFQSCVQEGRSASLMC 210

Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
           SYN+VNGVP C   D  +   R  WG  GYI SDCD+V  + E  NY  T ED+AA  L 
Sbjct: 211 SYNKVNGVPTCANYDFLENTVRRAWGLNGYIVSDCDSVLVMHESTNYAPTTEDAAADALN 270

Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
           AG+D+NCG  +  +T+ A+  GKV    +D A+ N+F V++RLG+F+G+P   ++G +G 
Sbjct: 271 AGLDLNCGDYLASYTEGAVAMGKVNASRVDNAVYNVFLVRMRLGMFDGNPANQEFGNIGV 330

Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
            DVCT  H++LA++AARQGIVLLKND   LPL+KN   + A+IGP  N    M G Y GI
Sbjct: 331 ADVCTPAHQELAVEAARQGIVLLKNDGNILPLSKNI--NTAVIGPNANATHTMLGNYEGI 388

Query: 473 PCSPKSLLRGLEAYVS----KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           PC   + L+GL  + S    K  ++ GC +  C  D     AV  A  AD V++V GL  
Sbjct: 389 PCQYITPLQGLVKFGSGDYHKVWFSEGCVNTACQQDDQISSAVSTAAVADAVVLVVGLSQ 448

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVA-RTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            QE+E  DR SLLLPG Q +L+  VA   + RPV+LVL   GP+D++FA+ D +I SILW
Sbjct: 449 VQESEALDRTSLLLPGYQQTLIDEVAGAAAGRPVVLVLMCAGPVDINFAKNDKRIQSILW 508

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRF 647
           +GYPG++G +A+AE+IFG  NPGG+LPM+WYPE +TK+ M +MNMR DS   YPGR+YRF
Sbjct: 509 VGYPGQSGGQAIAEVIFGAHNPGGKLPMSWYPEDYTKISMTNMNMRPDSRSNYPGRTYRF 568

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
           YTG ++Y FG+GLSYT Y + F  AP+ + ++ S+ +          GS+          
Sbjct: 569 YTGEKIYDFGYGLSYTEYKHSFALAPTTV-MTPSIHSQLCDPHQTSAGSK---------- 617

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ-GTPEKQLIGFDRVHTVAKGSKEI 766
           +C+S  F V I+V N G + G+H ++LF   P   + GTP KQL  FD V+  +   +++
Sbjct: 618 TCSSSNFDVHINVENIGAMAGNHTLLLFFTAPSAGKNGTPLKQLAAFDSVYIRSGSQEKV 677

Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
              ++PC+ L    + G R+L  GNH+L VG+ +HSL++
Sbjct: 678 VLTLNPCQHLGTVAEDGTRMLEAGNHILSVGDAKHSLSV 716


>gi|326489197|dbj|BAK01582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 709

 Score =  678 bits (1749), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/733 (47%), Positives = 472/733 (64%), Gaps = 35/733 (4%)

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           +K+  L +   A+ RLGIPAYEWWSE+LHG++  GPG  F+  V   TSFPQ +++AASF
Sbjct: 7   QKVGFLVNKQPALGRLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASF 66

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
           N SL+  IG  V+ EARAM+N+G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YA
Sbjct: 67  NASLFRAIGEVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYA 126

Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
           V +V   Q     +  GG+  G                   L ++ACCKH  AYD++ W 
Sbjct: 127 VGYVTGLQ----DAGAGGVTDG------------------ALKVAACCKHYTAYDVDNWK 164

Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
              RY+F+A +++QD +DTFQPPF+SC+  G  + +MCSYN+VNG P C   DL +   R
Sbjct: 165 GVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIR 224

Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKG 374
            +W   GYI SDCD+V  ++  Q+YTKT E++AA  +K+G+D+NCG  + +HT +A+  G
Sbjct: 225 GDWKLNGYIVSDCDSVDVLYTQQHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAG 284

Query: 375 KVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVL 434
           ++ E+D+DRA+ N F + +RLG F+GDPR+  +G LGP DVCTS +++LA + ARQGIVL
Sbjct: 285 ELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVL 344

Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
           LKN    LPL+  ++ S+A+IGP  N    M G Y G PC   + L+GL A V+ T Y  
Sbjct: 345 LKNSGA-LPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGAKVN-TVYQP 402

Query: 495 GCHDVPCNSDA-GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
           GC +V C+ ++     AV  A  AD  ++V G D + E E  DR SLLLPGQQ  LV++V
Sbjct: 403 GCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRTSLLLPGQQTQLVSAV 462

Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
           A  S  PVILV+  GGP D+SFA+A  +I++ILW+GYPGEAG  ALA+I+FG  NP GRL
Sbjct: 463 ANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGAALADILFGSHNPSGRL 522

Query: 614 PMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
           P+TWYP S+   V M DM MR D+S  YPGR+YRFYTG  V+ FG GLSYT  S+  +SA
Sbjct: 523 PVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFGDGLSYTKMSHSLVSA 582

Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
           P       S++   D     +  + ++         C  L F V++ V NAG+V G+H V
Sbjct: 583 PPSYV---SMRLAEDHPCRAEECASVEAAG----DHCDDLAFDVKLQVRNAGEVAGAHSV 635

Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
           +LF+  P  +   P K L+GF++V      +  ++F VD C  LS+ ++ G R + LG H
Sbjct: 636 LLFSS-PPPAHNAPAKHLLGFEKVSLAPGEAGTVAFRVDVCRDLSVVDELGGRKVALGGH 694

Query: 793 VLMVGELRHSLTI 805
            L VG+L+H++ +
Sbjct: 695 TLHVGDLKHTVEL 707


>gi|449433577|ref|XP_004134574.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
 gi|449530107|ref|XP_004172038.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 812

 Score =  666 bits (1718), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/789 (43%), Positives = 472/789 (59%), Gaps = 40/789 (5%)

Query: 32  NSKPVLNKPDFPCKPPHFD-------SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN 84
           +  P +N   F C P  +D       S+ FC++SLS   RAK LI  +TL EK  QL   
Sbjct: 46  DDPPAVNNFTFVCDPSRYDKLGLDFSSFGFCDSSLSFPERAKDLIDRMTLSEKAAQLGHV 105

Query: 85  ASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG 144
           AS + RLG+P Y WWSE+LHG+++ GPG  F+  V   TSFP V+ +A+SFN  LW  IG
Sbjct: 106 ASGVDRLGLPPYNWWSEALHGVSNVGPGTQFDKVVPGATSFPNVITTASSFNEDLWKTIG 165

Query: 145 SAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG 204
            AV+ EARAMYNLG+AGLT+W+P IN+ RDPRWGR  ETPGEDP VV  YA  +V+  Q 
Sbjct: 166 QAVSTEARAMYNLGRAGLTYWSPTINVIRDPRWGRTVETPGEDPFVVGKYAKNYVRGLQD 225

Query: 205 ENWKSDDGGIGFGFREKRVLKGFGEESDRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFN 263
                              ++G    +D     L +S+CCKH  AYD++ W    RYSF+
Sbjct: 226 -------------------VEGSENVTDLNSRPLKVSSCCKHYAAYDVDNWLGVERYSFD 266

Query: 264 AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGY 322
           A +TEQD  +TF  PF  C+++G  S +MCSYN+VNG+P C    L +   R  WG  GY
Sbjct: 267 ARVTEQDMLETFNKPFEMCVKEGDVSSVMCSYNRVNGIPTCADPVLLKDTIRGNWGLHGY 326

Query: 323 ITSDCDAVATIFEYQNYTK-THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDI 381
           I SDCD+V  + E  +Y + T+ED+ A  LKAG+D++CG     +T+S + +GKV  ++I
Sbjct: 327 IVSDCDSVKVMVEDAHYLQDTNEDAVAQTLKAGLDLDCGQIYPNYTESTVRQGKVGMRNI 386

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           D AL NL+ V +RLG F+G+     +  LG  D+C+ EH +LA +AARQG VLLKND   
Sbjct: 387 DNALNNLYVVLMRLGYFDGNT---GFESLGKPDICSDEHIELATEAARQGTVLLKNDNDT 443

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC 501
           LP + +   +LA++GP  N  S M G Y G+PC   S + GL  Y +K  Y  GC  V C
Sbjct: 444 LPFDPSNYKTLAVVGPHANATSAMLGNYAGVPCRMNSPMDGLSEY-AKVKYQMGCDSVAC 502

Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
            +D     A+  A+ +D  ++  G+DL+ E E  DRV LLLPG Q  LV  VA  SK PV
Sbjct: 503 KNDTFIFGAMEAARTSDATVIFVGIDLSIEAESLDRVDLLLPGYQTQLVQQVATVSKGPV 562

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           +LV+   G +DVSFA+ +S I +I+W GYPGE G +A+A++IFG FNPGGRLP+TWY   
Sbjct: 563 VLVILSAGGIDVSFAKNNSNIKAIIWAGYPGEEGGRAIADVIFGKFNPGGRLPLTWYEND 622

Query: 622 FT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
           +  ++PM  M +R   S  YPGR+Y+FY G  VY FGHGLSYT + +   SA   + I  
Sbjct: 623 YVYQLPMTSMPLRPVKSLGYPGRTYKFYDGPVVYPFGHGLSYTFFLHNLTSAKRSIAIDL 682

Query: 681 SLKAGSDKNILQQTGS---RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
           S +    ++I    G+       V +D++T    + F  Q+ V N G+ DGS V+++++ 
Sbjct: 683 SNRTQC-RDIAYTNGTFKPECPAVLVDDLTCTEEIEF--QMEVENTGERDGSQVLLVYSV 739

Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            P     T  KQ++GF RV   A  S+ ++F ++ C+ L + +  G  +LP G H ++VG
Sbjct: 740 PPGGISSTHIKQVVGFQRVFLKAGDSETVTFKLNACKSLGLVDFTGYNLLPAGGHTIVVG 799

Query: 798 ELRHSLTIE 806
           +   S  +E
Sbjct: 800 DGEVSFPVE 808


>gi|255557375|ref|XP_002519718.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223541135|gb|EEF42691.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 802

 Score =  657 bits (1694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/790 (43%), Positives = 479/790 (60%), Gaps = 49/790 (6%)

Query: 26  NVIAFSNSKPVLNKPDFPCKPPHFD-------SYPFCNTSLSISTRAKSLISLLTLQEKI 78
           N +   ++ P  +   + C    +D       ++ FC++SLS   RAK L++ +TL+EK+
Sbjct: 33  NTLNHDDANPRGSSFTYVCDSSRYDNLGLDMTTFGFCDSSLSYEVRAKDLVNQMTLKEKV 92

Query: 79  QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
           QQL D A  +PRLGIP YEWWSE+LHG++  GPG  F+  V   TSFP  +++ ASFN S
Sbjct: 93  QQLGDLAYGVPRLGIPKYEWWSEALHGVSDVGPGTFFDDLVPGATSFPTTILTTASFNES 152

Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
           LW NIG A A +ARAMYNLG+AGLT+W+PN+N+ RDPRWGR  ETPGEDP VV  YAV +
Sbjct: 153 LWKNIGQASA-KARAMYNLGRAGLTYWSPNVNVVRDPRWGRTVETPGEDPYVVGRYAVNY 211

Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
           V+  Q       +G   +     R LK             +S+CCKH  AYD+EKW    
Sbjct: 212 VRGLQDV-----EGTENYTDLNTRPLK-------------VSSCCKHYAAYDVEKWQGVE 253

Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEW 317
           R +F+A +TEQD  +TF  PF  C+++G  S +MCS+N+VNG+P C    L  Q  R +W
Sbjct: 254 RLTFDARVTEQDMVETFLRPFEMCVKEGDVSSVMCSFNRVNGIPTCADPKLLNQTIRGDW 313

Query: 318 GFKGYITSDCDAVATIFEYQNYT-KTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKV 376
              GYI SDCD++  + +   +   T+ED+ A VLKAG+D++CG      T++++ +GK 
Sbjct: 314 DLHGYIVSDCDSIEVMVDNHKFLGDTNEDAVAQVLKAGLDLDCGGYYTNFTETSVKQGKA 373

Query: 377 QEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLK 436
           +E+ IDR+L  L+ V +RLG F+G P   +Y KLG  D+CT E+ +LA  AAR+GIVLLK
Sbjct: 374 REEYIDRSLKYLYVVLMRLGFFDGTP---QYQKLGKKDICTKENVELAKQAAREGIVLLK 430

Query: 437 NDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
           N+   LPL+ + V +LA++GP  N    M G Y G+PC   S + G   Y S   Y  GC
Sbjct: 431 NNDT-LPLSMDKVKNLAVVGPHANATRVMIGNYAGVPCRYVSPIDGFSIY-SNVTYEIGC 488

Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
            DVPC +++    AV  AK AD  I+VAGLDLT E E  DR  LLLPG Q  L+  VA  
Sbjct: 489 -DVPCKNESLVFPAVHAAKNADATIIVAGLDLTIEAEGLDRNDLLLPGYQTQLINQVAGA 547

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
           +  PVILV+   G +D+SFA  + +I +ILW+GYPG+ G  A+A+++FG +NPGGRLP+T
Sbjct: 548 ANGPVILVIMAAGGVDISFARDNEKIKAILWVGYPGQEGGHAIADVVFGKYNPGGRLPIT 607

Query: 617 WYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
           WY   F  +VPM  M +R D    YPG++Y+FY G+ VY FG+GLSYT +SY   SA   
Sbjct: 608 WYEADFVEQVPMTYMQLRPDEELGYPGKTYKFYDGSTVYPFGYGLSYTTFSYNITSAKRS 667

Query: 676 LTISAS-------LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDG 728
             I+ +       L+ G++           D++  ++        F +++ V N G  DG
Sbjct: 668 KHIALNKFQHCRDLRYGNETFKPSCPAVLTDHLPCND-------DFELEVEVENTGSRDG 720

Query: 729 SHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILP 788
           S VVM++++ P+   G+  KQ+IGF RV   A   ++++F  + C+   I + +   ILP
Sbjct: 721 SEVVMVYSKTPEGIVGSYIKQVIGFKRVFVQAGSVEKVNFRFNVCKSFRIIDYNAYSILP 780

Query: 789 LGNHVLMVGE 798
            G H +MVG+
Sbjct: 781 SGGHTIMVGD 790


>gi|195614824|gb|ACG29242.1| auxin-induced beta-glucosidase [Zea mays]
 gi|413920229|gb|AFW60161.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 655

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 313/672 (46%), Positives = 431/672 (64%), Gaps = 44/672 (6%)

Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
           MYN G+AGLTFW+PN+NIFRDPRWGRGQETPGEDP V + YA  +V+  Q + + + +GG
Sbjct: 1   MYNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQ-QPYAAPNGG 59

Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
                                + L L+ACCKH  AYDL+KWG   R+ FNA++  QD ED
Sbjct: 60  -------------------HRNRLKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLED 100

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-----LRGDLFQKARNEWGFKGYITSDCD 328
           TF  PFR+C+E G+A+ +MCSYNQVNGVP C     LRG +    R  WG  GYI SDCD
Sbjct: 101 TFNVPFRACVEDGRAASVMCSYNQVNGVPTCADAAFLRGTI----RGRWGLDGYIVSDCD 156

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           +V   F  Q+YT+T ED+AA  L+AG+D++CG  +  +  SA+  GKV + D+D ALLN 
Sbjct: 157 SVDVFFRDQHYTRTPEDAAAATLRAGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNT 216

Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN------DKKFL 442
            +VQ+RLG+F+GDP  G +G+LGP DVCT EH+ LALDAARQG+VLLKN      ++  L
Sbjct: 217 VTVQMRLGMFDGDPAAGPFGRLGPADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVL 276

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN 502
           PL   A   +A++GP  +    M G Y G PC   + L+G+ AY ++  + +GC DV C 
Sbjct: 277 PLRPAAHRVVAVVGPHADATVAMIGNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACR 336

Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVI 562
            +     AV  A++AD  +VVAGLD   E E  DR +LLLPG+Q  L+++VA+ SK PVI
Sbjct: 337 GNQPIAAAVEAARQADATVVVAGLDQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVI 396

Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 622
           LVL  GGP+D++FA+ D +I  ILW+GYPG+AG +A+A++IFG  NPG +LP+TWY + +
Sbjct: 397 LVLMSGGPIDIAFAQNDPRIDGILWVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDY 456

Query: 623 -TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
             KVPM +M MRA+ +R YPGR+YRFYTG  +Y FGHGLSYT +++    AP++LT+  S
Sbjct: 457 LQKVPMTNMAMRANPARGYPGRTYRFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLS 516

Query: 682 LKAGSDKNILQQTGSRLDY-VHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
               S  +      + L   V    V    C  L   V + V+N GD DG+H V+++   
Sbjct: 517 GSGHSAASAASLLNATLARPVRAVRVAHARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAA 576

Query: 739 PKVSQG-----TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHV 793
           P  S        P +QL+ F++VH  A G   +   +  C++LS+A+++G R +P+G H 
Sbjct: 577 PSPSHAAPGADAPARQLVAFEKVHVPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHR 636

Query: 794 LMVGELRHSLTI 805
           LM+GEL HS+++
Sbjct: 637 LMIGELTHSVSL 648


>gi|85813770|emb|CAJ65921.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 704

 Score =  634 bits (1634), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 339/686 (49%), Positives = 441/686 (64%), Gaps = 55/686 (8%)

Query: 17  RVLQLIVVVNVIAFSNSKPVLNKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLT 73
           +V+ L+  + ++  S      + P F C     P   S  FCNTS+ I+ R   L+  LT
Sbjct: 14  KVVFLLFCMFLVFLSTHVSAQSSPVFACDVVSNPSLASLGFCNTSIGINDRVVDLVKRLT 73

Query: 74  LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAA 133
           LQEKI  L ++A  + RLGIP YEWWSE+LHG++  GPG +F+  V+  TSFPQV+++AA
Sbjct: 74  LQEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVILTAA 133

Query: 134 SFNRSLWSNIG-----SAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDP 188
           SFN SL+  IG       V+ EARAMYN+G AGLTFW+PNINIFRDPRWGRGQETPGEDP
Sbjct: 134 SFNTSLFEAIGKVYYTQVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDP 193

Query: 189 MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIA 248
           ++ S Y   +VK  Q    + DDG                      D+L ++ACCKH  A
Sbjct: 194 LLSSKYGSCYVKGLQ----QRDDG--------------------DPDKLKVAACCKHYTA 229

Query: 249 YDLEKWGNFSRYSFNAMI-TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRG 307
           YDL+ W    RY FNA++ T+QD +DTFQPPF+SC+  G  + +MCSYNQVNG P C   
Sbjct: 230 YDLDNWKGSDRYHFNAVVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADP 289

Query: 308 DLFQKA-RNEWGFKGY-------ITSDCDAVATIFEYQNYTKTHEDS------AAGVLKA 353
           DL     R EW   GY       I +DCD++   ++ QNYTKT E++      A   L  
Sbjct: 290 DLLSGVIRGEWNLNGYQWGCCRYIVTDCDSLDVFYKSQNYTKTPEEAAAAAILAGNSLVT 349

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
           G+D+NCG+ + +HT++A+  G V E  ID A+ N F+  +RLG F+GDP K  YGKLGP 
Sbjct: 350 GVDLNCGSFLGQHTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPK 409

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT-GI 472
           DVCT+E+++LA +AARQGIVLLKN    LPL+  A+ +LA+IGP  N    M G Y  G 
Sbjct: 410 DVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGGT 469

Query: 473 PCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
           PC   + L+GL A V+ T Y  GC +V C S A   +A ++A  AD  ++V G DL+ E 
Sbjct: 470 PCKYTTPLQGLAASVATT-YLPGCSNVAC-STAQVDDAKKLAAAADATVLVMGADLSIEA 527

Query: 533 EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
           E RDRV +LLPGQQ  L+T+VA  S  PVILV+  GG +DVSFA  + +I+SILW+GYPG
Sbjct: 528 ESRDRVDVLLPGQQQLLITAVANVSCGPVILVIMSGGGMDVSFARTNDKITSILWVGYPG 587

Query: 593 EAGAKALAEIIFGDFNPG----GRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
           EAG  A+A+IIFG +NP     GRLPMTWYP+S+  KVPM +MNMR D S  YPGR+YRF
Sbjct: 588 EAGGAAIADIIFGYYNPSTHQPGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRF 647

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAP 673
           YTG  VY FG GLSY+ ++++ + AP
Sbjct: 648 YTGETVYSFGDGLSYSQFTHELIQAP 673


>gi|222629651|gb|EEE61783.1| hypothetical protein OsJ_16354 [Oryza sativa Japonica Group]
          Length = 771

 Score =  632 bits (1631), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 340/769 (44%), Positives = 455/769 (59%), Gaps = 91/769 (11%)

Query: 87  AIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSA 146
           A+PRLGIPAYEWWSE+LHG++  GPG  F+  V   TSFPQ +++AASFN SL+  IG +
Sbjct: 44  ALPRLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGES 103

Query: 147 ------------------------------------------VAVEARAMYNLGQAGLTF 164
                                                     V+ EARAM+N+G AGLTF
Sbjct: 104 ACNNTSQFFFSSKSPFSICIAMENLHCDFRSRLVRFYRGARVVSTEARAMHNVGLAGLTF 163

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQETPGEDP++ S YAV +V   Q       D G G         
Sbjct: 164 WSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ-------DAGGG--------- 207

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                     D L ++ACCKH  AYD++ W    RY+F+A++++QD +DTFQPPF+SC+ 
Sbjct: 208 ---------SDALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVI 258

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
            G  + +MCSYN+VNG P C   DL     R +W   GYI SDCD+V  ++  Q+YTK  
Sbjct: 259 DGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKNP 318

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
           ED+AA  +K+G+D+NCG  + +HT +A+  GK+ E D+DRA+ N F V +RLG F+GDPR
Sbjct: 319 EDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPR 378

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
           K  +G LGP DVCTS +++LA +AARQGIVLLKN    LPL+  ++ S+A+IGP  N   
Sbjct: 379 KLPFGSLGPKDVCTSSNQELAREAARQGIVLLKNTGA-LPLSAKSIKSMAVIGPNANASF 437

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVIV 522
            M G Y G PC   + L+GL A V+ T Y  GC +V C+ ++     A + A  AD  ++
Sbjct: 438 TMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLSAATQAAASADVTVL 496

Query: 523 VAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           V G D + E E  DR SLLLPGQQ  LV++VA  S+ PVILV+  GGP D+SFA++  +I
Sbjct: 497 VVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKI 556

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYP 641
           S+ILW+GYP  +  +               LP+TWYP SF  KV M DM MR DSS  YP
Sbjct: 557 SAILWVGYPRRSRWRRPRRHPLRIPQ--SWLPVTWYPASFADKVSMTDMRMRPDSSTGYP 614

Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV 701
           GR+YRFYTG  VY FG GLSYT +++  +SAP ++ +  +             G      
Sbjct: 615 GRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLA------------EGHACHTE 662

Query: 702 HIDEVTS----CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
           H   V +    C SL F V + V NAG + G H V LF+  P V    P K L+GF++V 
Sbjct: 663 HCFSVEAAGEHCGSLSFDVHLRVRNAGGMAGGHTVFLFSSPPSV-HSAPAKHLLGFEKVS 721

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
                +  ++F VD C+ LS+ ++ G R + LG+H L VG+L+H+L + 
Sbjct: 722 LEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNLR 770


>gi|225432132|ref|XP_002274591.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Vitis vinifera]
          Length = 805

 Score =  630 bits (1626), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 317/760 (41%), Positives = 460/760 (60%), Gaps = 37/760 (4%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           S+ FC+ SLS   RAK L+S +TLQEK+ Q    AS + RLG+P Y WWSE+LHGI++ G
Sbjct: 63  SFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWSEALHGISNLG 122

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
           PGV F+ T+   TS P V++S A+FN++LW  +G  V+ E RAMYNLG AGLTFW+PNIN
Sbjct: 123 PGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAMYNLGHAGLTFWSPNIN 182

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           + RD RWGR QET GEDP +V  +AV +V+  Q                    ++G    
Sbjct: 183 VVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQD-------------------VEGTENV 223

Query: 231 SDRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKAS 289
           +D     L +S+CCKH  AYD++ W N  R++F+A ++EQD ++TF  PF  C+ +G  S
Sbjct: 224 TDLNSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVS 283

Query: 290 CIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE-DSA 347
            +MCS+N++NG+P C    L +   R+EW   GYI SDC  +  I + QNY    + D+ 
Sbjct: 284 SVMCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAV 343

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  L+AG+D+ CG         ++  GKV + ++DRAL N++ + +R+G F+G P    Y
Sbjct: 344 AKTLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIP---AY 400

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             LG  D+C ++H +LA +AARQGIVLLKND + LPL       +A++GP  N    M G
Sbjct: 401 ESLGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIG 458

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G+PC   S L    A +    YA+GC D  C++D  F EA   AK A+  I+  G D
Sbjct: 459 NYAGLPCKYVSPLEAFSA-IGNVTYATGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTD 517

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
           L+ E E  DRV  LLPG Q  L+  VA  S  PVILV+  G  +D++FA+ + +IS+ILW
Sbjct: 518 LSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILW 577

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYR 646
           +G+PGE G  A+A+++FG +NPGGRLP+TWY   +   +PM+ M++R      YPGR+Y+
Sbjct: 578 VGFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYK 637

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASL----KAGSDKNILQQTGSRLDYVH 702
           F+ G+ VY FG+G+SYT +SY    A S+++I   L    K  +      Q       V 
Sbjct: 638 FFDGSTVYPFGYGMSYTKFSYSL--ATSKISIDIDLNKFQKCRTVAYTEDQKVPSCPAVL 695

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           +D+++   ++ F  +++VTN G VDGS V+M+++  P    GT  KQ+IGF +V   A  
Sbjct: 696 LDDMSCDDTIEF--EVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGD 753

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS 802
           ++ + F ++ C+ L I +  G  +LP G+H + VG+  +S
Sbjct: 754 TERVKFSMNACKSLRIVDSTGYSLLPSGSHTIRVGDYSNS 793


>gi|32488698|emb|CAE03635.1| OSJNBb0003B01.27 [Oryza sativa Japonica Group]
          Length = 839

 Score =  624 bits (1608), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 319/668 (47%), Positives = 424/668 (63%), Gaps = 47/668 (7%)

Query: 147 VAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN 206
           V+ EARAM+N+G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V   Q   
Sbjct: 212 VSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQ--- 268

Query: 207 WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMI 266
               D G G                   D L ++ACCKH  AYD++ W    RY+F+A++
Sbjct: 269 ----DAGGG------------------SDALKVAACCKHYTAYDVDNWKGVERYTFDAVV 306

Query: 267 TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITS 325
           ++QD +DTFQPPF+SC+  G  + +MCSYN+VNG P C   DL     R +W   GYI S
Sbjct: 307 SQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVS 366

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRAL 385
           DCD+V  ++  Q+YTK  ED+AA  +K+G+D+NCG  + +HT +A+  GK+ E D+DRA+
Sbjct: 367 DCDSVDVLYNNQHYTKNPEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAI 426

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
            N F V +RLG F+GDPRK  +G LGP DVCTS +++LA +AARQGIVLLKN    LPL+
Sbjct: 427 TNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKNTGA-LPLS 485

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
             ++ S+A+IGP  N    M G Y G PC   + L+GL A V+ T Y  GC +V C+ ++
Sbjct: 486 AKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNS 544

Query: 506 -GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
                A + A  AD  ++V G D + E E  DR SLLLPGQQ  LV++VA  S+ PVILV
Sbjct: 545 LQLSAATQAAASADVTVLVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILV 604

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT- 623
           +  GGP D+SFA++  +IS+ILW+GYPGEAG  ALA+I+FG  NPGGRLP+TWYP SF  
Sbjct: 605 VMSGGPFDISFAKSSDKISAILWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFAD 664

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
           KV M DM MR DSS  YPGR+YRFYTG  VY FG GLSYT +++  +SAP ++ +  +  
Sbjct: 665 KVSMTDMRMRPDSSTGYPGRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLA-- 722

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTS----CTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
                      G      H   V +    C SL F V + V NAG + G H V LF+  P
Sbjct: 723 ----------EGHACHTEHCFSVEAAGEHCGSLSFDVHLRVRNAGGMAGGHTVFLFSSPP 772

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
            V    P K L+GF++V      +  ++F VD C+ LS+ ++ G R + LG+H L VG+L
Sbjct: 773 SV-HSAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDL 831

Query: 800 RHSLTIET 807
           +H+L +  
Sbjct: 832 KHTLNLRV 839


>gi|125576920|gb|EAZ18142.1| hypothetical protein OsJ_33692 [Oryza sativa Japonica Group]
          Length = 618

 Score =  620 bits (1600), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 302/640 (47%), Positives = 402/640 (62%), Gaps = 36/640 (5%)

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W+PN+NIFRDPRWGRGQETPGEDP   S Y   FVK  QG +  +              
Sbjct: 1   MWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQGSSLTN-------------- 46

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                        L  SACCKH+ AYD+E+W   SRY+FNA +T QD  DT+ PPFRSC+
Sbjct: 47  -------------LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCV 93

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
             GKASCIMC+Y  +NGVPAC   DL  K  R EW   GY  SDCDAVA + + +++T+T
Sbjct: 94  VDGKASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHFTRT 153

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
            E++ A  LKAG+DINCG  M ++  SA+ +GK+ EKD+D+AL NLF++++RLG F+GDP
Sbjct: 154 AEEAVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDP 213

Query: 403 RKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           R  K YG+L   DVCT  HK LAL+AAR+G+VLLKND + LPL    V+S A+IG   N+
Sbjct: 214 RGNKLYGRLSAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNAND 273

Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
           I  + G Y G+PC   +   G++ YV    +  GC    C+  A   +A  +AK +D+V 
Sbjct: 274 ILALLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALAKSSDYVF 332

Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
           +V GL   QE E  DR SLLLPG+Q +L+T+VA  SKRPVIL+L  GGP+D++FA+ + +
Sbjct: 333 LVMGLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPK 392

Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYP 641
           I +ILW GYPG+AG +A+A+++FG+FNP G+LP+TWYPE FTK  M DM MR D +  YP
Sbjct: 393 IGAILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTKFTMTDMRMRPDPATGYP 452

Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV 701
           GRSYRFY G  VY FG+GLSY+ ++ + +S       S+S    +   +   T    D V
Sbjct: 453 GRSYRFYKGKTVYKFGYGLSYSKFACRIVSGAGN---SSSYGKAALAGLRAATTPEGDAV 509

Query: 702 H-IDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHT 758
           + +DE+    C  LRF V + V N G +DG H V++F R      G P +QLIGF   H 
Sbjct: 510 YRVDEIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHL 569

Query: 759 VAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
                K++   + PCE LS A   G +++  G+H LMV E
Sbjct: 570 KVGEKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMVEE 609


>gi|225432136|ref|XP_002274651.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 809

 Score =  620 bits (1599), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 316/752 (42%), Positives = 454/752 (60%), Gaps = 33/752 (4%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           + +C++S     RAK L+  +TL EK+ Q  D AS + R+G+P Y WWSE+LHG+++ G 
Sbjct: 68  FHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEALHGVSNFGR 127

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
            V F+  V   TSFP V++SAASFN+SLW  +G AV+ EARAMYN G AGLTFW+PNIN+
Sbjct: 128 CVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLTFWSPNINV 187

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            RDPRWGR  ETPGEDP +V  YAV +V+  Q                    + G    +
Sbjct: 188 VRDPRWGRILETPGEDPHLVGLYAVNYVRGLQD-------------------VVGAENTT 228

Query: 232 DRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
           D     L +S+CCKH  AYDL+ W    R  F+A ++ QD  +TF  PF  C+++G  S 
Sbjct: 229 DLNSRPLKVSSCCKHYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSS 288

Query: 291 IMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNY-TKTHEDSAA 348
           +MCSYN++NG+P+C    L  Q  R EW   GYI SDCD+V  +   Q +   +  DSAA
Sbjct: 289 VMCSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAA 348

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             L AGM+++CGT   R    A+++GK  + D+D +L  L+ + +R+G F+G P    + 
Sbjct: 349 QALNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIP---AFA 405

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            LG DD+C++EH +LA +AARQGIVLLKND   LPL   +V ++A++GP  N    M G 
Sbjct: 406 SLGKDDICSAEHIELAREAARQGIVLLKNDNATLPL--KSVKNIALVGPHANATDAMIGN 463

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y GIPC   S L    + + +  Y  GC DV C ++     A+  AK+AD  I+ AG DL
Sbjct: 464 YAGIPCYYVSPLDAFSS-MGEVRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDL 522

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           + E E  DRV LLLPG Q  L+  VA  S  PV+LV+  GG +D+SFA  + +I++ILW 
Sbjct: 523 SIEAEALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWA 582

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRF 647
           GYPGE G  A+A++I G +NPGGRLP+TWY   +   +PM  M +R   S  YPGR+Y+F
Sbjct: 583 GYPGEQGGNAIADVILGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKF 642

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAP--SELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
           + G+ VY FG+G+SYTN+SY   ++   + + +    +  S   I          V +D+
Sbjct: 643 FNGSTVYPFGYGMSYTNFSYSLSTSQRWTNINLRKLQRCRSMVYINDTFVPDCPAVLVDD 702

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
           ++   S+ F  +++V N G +DGS VV++++  P    GT  K+++GF+RV     G+++
Sbjct: 703 LSCKESIEF--EVAVKNVGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEK 760

Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           + F ++ C+ L I +  G  +LP G+H + VG
Sbjct: 761 VKFSMNVCKSLGIVDSTGYALLPSGSHTIKVG 792


>gi|224093292|ref|XP_002309869.1| predicted protein [Populus trichocarpa]
 gi|222852772|gb|EEE90319.1| predicted protein [Populus trichocarpa]
          Length = 694

 Score =  619 bits (1595), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 314/746 (42%), Positives = 442/746 (59%), Gaps = 65/746 (8%)

Query: 68  LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQ 127
           L++ +TL EK+ QL + A  +PRLG+  Y+WWSE+LHG+++ GPG  F+  +   TSFP 
Sbjct: 3   LVNQMTLNEKVLQLGNKAYGVPRLGLAEYQWWSEALHGVSNVGPGTFFDDLIPGSTSFPT 62

Query: 128 VLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGED 187
           V+ +AA+FN SLW  IG AV+ EARAMYNLG+AGLT+W+PNIN+ RDPRWGR  ETPGED
Sbjct: 63  VITTAAAFNESLWKVIGQAVSTEARAMYNLGRAGLTYWSPNINVVRDPRWGRAIETPGED 122

Query: 188 PMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLI 247
           P +V  YAV +V+  Q       +G   +     R LK             +S+CCKH  
Sbjct: 123 PYLVGRYAVNYVRGLQ-----DVEGSENYTDPNSRPLK-------------VSSCCKHYA 164

Query: 248 AYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRG 307
           AYD++ W    RY+F+A ++EQD  +TF  PF  C++ G  S +MCSYN+VNG+P C   
Sbjct: 165 AYDVDNWKGVERYTFDARVSEQDMVETFLRPFEMCVKDGDVSSVMCSYNRVNGIPTCADP 224

Query: 308 DLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRH 366
            L  Q  R +W   GYI SDCD++  + E   +              G+D++CG     +
Sbjct: 225 KLLNQTIRGDWDLHGYIVSDCDSLQVMVENHKWL-------------GLDLDCGAYYTEN 271

Query: 367 TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALD 426
            ++A+ +GKV+E DID++L  L+ V +RLG F+G P   +Y   G +DVC+ E+ +LA +
Sbjct: 272 VEAAVRQGKVREADIDKSLNFLYVVLMRLGFFDGIP---QYNSFGKNDVCSKENIELATE 328

Query: 427 AARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAY 486
           AAR+G VLLKN+   LPL+   V +LA+IGP  N  S M G Y GIPC   + + GL  Y
Sbjct: 329 AAREGAVLLKNENDSLPLSIEKVKTLAVIGPHSNATSAMIGNYAGIPCQIITPIEGLSKY 388

Query: 487 VSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
            +K  Y  GC D+ C  ++    A+  AKKAD  I++AG+DL+ E E  DR  LLLPG Q
Sbjct: 389 -AKVDYQMGCSDIACKDESFIFPAMESAKKADATIILAGIDLSIEAESLDRDDLLLPGYQ 447

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             L+  VA  S  PV+LVL   G +D+SFA+++  I SILW+GYPGE G  A+A++IFG 
Sbjct: 448 TQLINQVASVSNGPVVLVLMSAGGVDISFAKSNGDIKSILWVGYPGEEGGNAIADVIFGK 507

Query: 607 FNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNY 665
           +NPGGRLP+TW+   +   +PM  M +R   S  YPGR+Y+F+ G+ VY FGHGLSYT +
Sbjct: 508 YNPGGRLPLTWHEADYVDMLPMTSMPLRPIDSLGYPGRTYKFFNGSTVYPFGHGLSYTQF 567

Query: 666 SYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL-----RFHVQISV 720
           +YK  S    L I                        +D+   C  L      F     V
Sbjct: 568 TYKLTSTIRSLDI-----------------------KLDKYQYCHDLGYKNDSFKPSFEV 604

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
            NAG  DGS VV+++A+ P+    T  KQ+IGF RV   A GS+++ F  +  + L + +
Sbjct: 605 LNAGAKDGSEVVIVYAKPPEGIDATYIKQVIGFKRVFVPAGGSEKVKFEFNASKSLQVVD 664

Query: 781 KHGRRILPLGNHVLMVGELRHSLTIE 806
            +   +LP G H +M+G+   S +++
Sbjct: 665 FNAYSVLPSGGHTIMLGDDIISFSVQ 690


>gi|225432134|ref|XP_002274619.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 805

 Score =  614 bits (1584), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 320/766 (41%), Positives = 454/766 (59%), Gaps = 45/766 (5%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           + +C++SL    R K L+  +TL+EK + + D AS +PR+G+P Y+WWSE+LHG+A+ G 
Sbjct: 65  FVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANVGS 124

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
              F+  V   TSFP V++SAASFN+SLW  +G  V+ EARAMYNLG AGLTFW+PNIN+
Sbjct: 125 ATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTFWSPNINV 184

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            RDPRWGR  ETPGEDP+ V  Y V +V+  Q                    ++G    +
Sbjct: 185 ARDPRWGRILETPGEDPLTVGVYGVNYVRGLQD-------------------IEGTENTT 225

Query: 232 DRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
           D     L +++ CKH  AYDL++W N  R  F+A ++EQD  +TF  PF  C+++G  S 
Sbjct: 226 DLNSRPLKIASSCKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSS 285

Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK-THEDSAA 348
           +MCS+N +NG+P C      +   R +W   GYI SDC A+ TI + Q +   T E+  A
Sbjct: 286 VMCSFNNINGIPPCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVA 345

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             +KAG+D+ CG        +A+ +G+V E D+D++L  L+ V +R+G F+G P      
Sbjct: 346 LSMKAGLDLECGHYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIP---SLA 402

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            LG  D+C  EH +LA +AARQGIVLLKND   LPL    V  LA++GP  N    M G 
Sbjct: 403 SLGKKDICNDEHIELAREAARQGIVLLKNDNATLPLKP--VKKLALVGPHANATVAMIGN 460

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y GIPC   S L      +    Y  GC DV C++D   ++A   AK AD  I++ G DL
Sbjct: 461 YAGIPCHYVSPLDAFSE-LGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDL 519

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           + E E+RDR  LLLPG Q  +V  V   S  PVILV+  GGP+D+SFA+ + +I++ILW 
Sbjct: 520 SIEAEERDREDLLLPGYQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWA 579

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
           G+PGE G  A+A+I+FG +NPGGR P+TWY   +   +PM  M +R   S  YPGR+Y+F
Sbjct: 580 GFPGEQGGNAIADIVFGKYNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKF 639

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAP--------SELTISASLKAGSDKNILQQTGSRLD 699
           + G+ VY FG+GLSYTN+SY  L+AP        + L    S+   SD    + +   +D
Sbjct: 640 FNGSTVYPFGYGLSYTNFSYS-LTAPTRSVHISLTRLQQCRSMAYSSDSFQPECSAVLVD 698

Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
            +  DE        F  Q++V N G +DGS VVM+++  P    GT  KQ+IGF+RV   
Sbjct: 699 DLSCDE-------SFEFQVAVKNVGSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVK 751

Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
              ++++ F ++ C+ L + +  G  +LP G+H +M G+   S++ 
Sbjct: 752 VGNTEKVKFSMNVCKSLGLVDSSGYILLPSGSHTIMAGDNSTSVSF 797


>gi|297736787|emb|CBI25988.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  610 bits (1573), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 319/757 (42%), Positives = 448/757 (59%), Gaps = 60/757 (7%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           + +C++SL    R K L+  +TL+EK + + D AS +PR+G+P Y+WWSE+LHG+A+ G 
Sbjct: 65  FVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALHGVANVGS 124

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
              F+  V   TSFP V++SAASFN+SLW  +G  V+ EARAMYNLG AGLTFW+PNIN+
Sbjct: 125 ATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTFWSPNINV 184

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            RDPRWGR  ETPGEDP+ V  Y V +V+  Q                    ++G    +
Sbjct: 185 ARDPRWGRILETPGEDPLTVGVYGVNYVRGLQD-------------------IEGTENTT 225

Query: 232 DRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
           D     L +++ CKH  AYDL++W N  R  F+A ++EQD  +TF  PF  C+++G  S 
Sbjct: 226 DLNSRPLKIASSCKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSS 285

Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK-THEDSAA 348
           +MCS+N +NG+P C      +   R +W   GYI SDC A+ TI + Q +   T E+  A
Sbjct: 286 VMCSFNNINGIPPCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVA 345

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             +KAG+D+ CG        +A+ +G+V E D+D++L  L+ V +R+G F+G P      
Sbjct: 346 LSMKAGLDLECGHYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIP---SLA 402

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            LG  D+C  EH +LA +AARQGIVLLKND   LPL    V  LA++GP  N    M G 
Sbjct: 403 SLGKKDICNDEHIELAREAARQGIVLLKNDNATLPLKP--VKKLALVGPHANATVAMIGN 460

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y GIPC   S L      +    Y  GC DV C++D   ++A   AK AD  I++ G DL
Sbjct: 461 YAGIPCHYVSPLDAFSE-LGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDL 519

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           + E E+RDR  LLLPG Q  +V  V   S  PVILV+  GGP+D+SFA+ + +I++ILW 
Sbjct: 520 SIEAEERDREDLLLPGYQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWA 579

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
           G+PGE G  A+A+I+FG +NPGGR P+TWY   +   +PM  M +R   S  YPGR+Y+F
Sbjct: 580 GFPGEQGGNAIADIVFGKYNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKF 639

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
           + G+ VY FG+GLSYTN+SY  L+AP+                          VHI    
Sbjct: 640 FNGSTVYPFGYGLSYTNFSYS-LTAPTR------------------------SVHI---- 670

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           S TS  F  Q++V N G +DGS VVM+++  P    GT  KQ+IGF+RV      ++++ 
Sbjct: 671 SLTSFEF--QVAVKNVGSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVK 728

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLT 804
           F ++ C+ L + +  G  +LP G+H +M G+   S++
Sbjct: 729 FSMNVCKSLGLVDSSGYILLPSGSHTIMAGDNSTSVS 765


>gi|77552476|gb|ABA95273.1| Beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 883

 Score =  609 bits (1570), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 319/679 (46%), Positives = 434/679 (63%), Gaps = 49/679 (7%)

Query: 146 AVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGE 205
           AV+ E RAMYN GQAGLTFW+PN+NIFRDPRWGRGQETPGEDP V + YA  +V+  Q  
Sbjct: 228 AVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQ-- 285

Query: 206 NWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
                                  ++      L L+ACCKH  AYDL+ W    R+ FNA+
Sbjct: 286 -----------------------QQQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAV 322

Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-----LRGDLFQKARNEWGFK 320
           +T QD EDTF  PFRSC+  G+A+ +MCSYNQVNGVP C     LRG +    R  WG  
Sbjct: 323 VTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTI----RRRWGLA 378

Query: 321 GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKD 380
           GYI SDCD+V   +  Q+YT+T ED+ A  L+AG+D++CG  + ++T+ A+ +GKV + D
Sbjct: 379 GYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGD 438

Query: 381 IDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
           ID A+ N  +VQ+RLG+F+GDP    +G LGP  VCT+ H++LA++AARQGIVLLKND +
Sbjct: 439 IDAAVTNTVTVQMRLGMFDGDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGR 498

Query: 441 FLPLN-KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDV 499
            LPL+   A  ++A++GP       M G Y G PC   + L+G+  Y ++  +  GC DV
Sbjct: 499 ALPLSPATARRAVAVVGPHAEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDV 558

Query: 500 PC-NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK 558
            C  S      AV  A++AD  IVVAGLD   E E  DR SLLLPG+Q  L++SVA+ SK
Sbjct: 559 ACAGSGQPIAAAVDAARRADATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASK 618

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
            PVILVL  GGP+D+ FA+ D +I+ ILW GYPG+AG +A+A++IFG  NPGG+LP+TWY
Sbjct: 619 GPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWY 678

Query: 619 PESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
           P+ +  KVPM +M MRA+ ++ YPGR+YRFYTG  ++ FGHGLSYT++++    APS+LT
Sbjct: 679 PQDYLQKVPMTNMAMRANPAKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLT 738

Query: 678 I--SASLKAGSDKNILQQTGSRLD--YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVM 733
           +  SA   A S    L  T +RL            C  LR  V + V N G+ DG+H V+
Sbjct: 739 VRLSAHHAAASASASLNAT-ARLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHTVL 797

Query: 734 LFARVPKVSQGTPE-------KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRI 786
           ++A  P  S            +QL+ F++VH  A G+  +  G+D C+ LS+A+++G R 
Sbjct: 798 VYAAAPASSAAEAAAGHGAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRR 857

Query: 787 LPLGNHVLMVGELRHSLTI 805
           +P+G H L++GEL H++TI
Sbjct: 858 IPVGEHRLIIGELTHTVTI 876



 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 48/94 (51%), Positives = 65/94 (69%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           + PFC  SL    RA+ L++ LT  EK++ L +NA+ +PRLG+  YEWWSE+LHG++  G
Sbjct: 40  TLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG 144
           PGV F G     T+FPQV+ +AASFN +LW  IG
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIG 133


>gi|125535275|gb|EAY81823.1| hypothetical protein OsI_36995 [Oryza sativa Indica Group]
          Length = 885

 Score =  608 bits (1568), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 316/681 (46%), Positives = 433/681 (63%), Gaps = 51/681 (7%)

Query: 146 AVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGE 205
           AV+ E RAMYN GQAGLTFW+PN+NIFRDPRWGRGQETPGEDP V + YA  +V+  Q  
Sbjct: 228 AVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQ-- 285

Query: 206 NWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
                                  ++      L L+ACCKH  AYDL+ W    R+ FNA+
Sbjct: 286 -----------------------QQQPSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAV 322

Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-----LRGDLFQKARNEWGFK 320
           +T QD EDTF  PFRSC+  G+A+ +MCSYNQVNGVP C     LRG +    R  WG  
Sbjct: 323 VTRQDLEDTFNVPFRSCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTI----RRRWGLA 378

Query: 321 GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKD 380
           GYI SDCD+V   +  Q+YT+T ED+ A  L+AG+D++CG  + ++T+ A+ +GKV + D
Sbjct: 379 GYIVSDCDSVDVFYSDQHYTRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGD 438

Query: 381 IDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
           ID A+ N  +VQ+RLG+F+GDP    +G LGP  VCT+ H++LA++AARQGIVLLKND +
Sbjct: 439 IDAAVTNTVTVQMRLGMFDGDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGR 498

Query: 441 FLPLN-KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDV 499
            LPL+   A  ++A++GP       M G Y G PC   + L+G+  Y ++  +  GC DV
Sbjct: 499 ALPLSPATARRAVAVVGPHAEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDV 558

Query: 500 PC-NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK 558
            C  S      AV  A++AD  IVVAGLD   E E  DR SLLLPG+Q  L++SVA+ SK
Sbjct: 559 ACAGSGQPIAAAVDAARRADATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASK 618

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
            PVILVL  GGP+D+ FA+ D +I+ ILW GYPG+AG +A+A++IFG  NPGG+LP+TWY
Sbjct: 619 GPVILVLMSGGPIDIGFAQNDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWY 678

Query: 619 PESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
           P+ +  KVPM +M MRA+ ++ YPGR+YRFYTG  ++ FGHGLSYT++++    APS+LT
Sbjct: 679 PQDYLQKVPMTNMAMRANPAKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLT 738

Query: 678 I----SASLKAGSDKNILQQTGSRLD--YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
           +      +  + S    L  T +RL            C  LR  V + V N G+ DG+H 
Sbjct: 739 VRLAAHHAAASASASASLNAT-ARLSRAAAVRVAHARCEELRMPVHVDVRNVGERDGAHT 797

Query: 732 VMLFARVPKVSQGTPE-------KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR 784
           V+++A  P  S            +QL+ F++VH  A G+  +  G+D C+ LS+A+++G 
Sbjct: 798 VLVYAAAPASSAAEAAAGHGAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGV 857

Query: 785 RILPLGNHVLMVGELRHSLTI 805
           R +P+G H L++GEL H++TI
Sbjct: 858 RRIPVGEHRLIIGELTHTVTI 878



 Score =  112 bits (281), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 47/94 (50%), Positives = 65/94 (69%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           + PFC  SL    RA+ L++ +T  EK++ L +NA+ +PRLG+  YEWWSE+LHG++  G
Sbjct: 40  TLPFCRRSLPARARARDLVARMTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDTG 99

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG 144
           PGV F G     T+FPQV+ +AASFN +LW  IG
Sbjct: 100 PGVRFGGAFPGATAFPQVIGTAASFNATLWELIG 133


>gi|318136853|gb|ADV41671.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Actinidia deliciosa
           var. deliciosa]
          Length = 634

 Score =  599 bits (1544), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 295/662 (44%), Positives = 415/662 (62%), Gaps = 40/662 (6%)

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EARAMYN G AGLTFW+PN+NIFRDPRWGRGQETPGEDPM+   YA  +V+  QG +   
Sbjct: 2   EARAMYNGGMAGLTFWSPNVNIFRDPRWGRGQETPGEDPMLAGNYAASYVRGLQGND--- 58

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                   G+ L ++ACCKH  AYDL+ W    R+ FNA +++Q
Sbjct: 59  ------------------------GERLKVAACCKHYTAYDLDNWRGVDRFHFNARVSKQ 94

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           D +DTF+ PFR C+  GK + +MCSYNQVNG+P C    L +   R  W   GYI SDCD
Sbjct: 95  DIKDTFEIPFRECVLGGKVASVMCSYNQVNGIPTCANPKLLKGTIRGSWRLNGYIVSDCD 154

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           +V   FE Q+YT   E++ A  +KAG+D++CG  +  HT++A+ +G V + +I+ AL N 
Sbjct: 155 SVGVFFENQHYTSKPEEAVAAAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANT 214

Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
            + Q+RLG+F+G+P   +YG LGP DVCT  H++LAL+AARQGIVLL+N  + LPL+   
Sbjct: 215 MTAQMRLGMFDGEPSAHQYGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRR 274

Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFH 508
             ++A+IGP  +    M G Y G+ C   + L+G+  Y    H A GC DV CN +  F 
Sbjct: 275 HRTVAVIGPNSDVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFG 333

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
            A   A++AD  ++V GLD + E E  DR   LLPG Q  LV+ VAR S+ P ILVL  G
Sbjct: 334 AAEAAARQADATVLVMGLDQSIEAEFVDRAGPLLPGHQQELVSRVARASRGPTILVLMSG 393

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPM 627
           GP+DV+FA+ D +IS+I+W+GYPG+AG  A+A+++FG  NPGG+LPMTWYP+++ T +PM
Sbjct: 394 GPIDVTFAKNDPRISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPM 453

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGS 686
            DM MRAD +R YPGR+YRFY G  V+ FG GLSYT +++     P+ +++   SLKA +
Sbjct: 454 TDMAMRADPARGYPGRTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATA 513

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
           +  +L +    +   H D    C +L    V + V N G +DG+H +++F   P   +  
Sbjct: 514 NSTMLSKA---VRVSHAD----CNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPD-GKWA 565

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
             KQL+GF ++H  A     +   V  C+ LS+ ++ G R +PLG H L +G+L H +++
Sbjct: 566 ASKQLVGFHKIHIAAGSETRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSL 625

Query: 806 ET 807
           +T
Sbjct: 626 QT 627


>gi|359477633|ref|XP_003632006.1| PREDICTED: LOW QUALITY PROTEIN: beta-D-xylosidase 3-like [Vitis
           vinifera]
          Length = 781

 Score =  593 bits (1529), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 331/781 (42%), Positives = 464/781 (59%), Gaps = 61/781 (7%)

Query: 44  CKPPHFDS-------YPFCNTSLSI-STRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
           C P  F +       + +CN+SL I   R K L+  +TL+EK   +   A+ + R+G+P 
Sbjct: 17  CDPARFAALGFDMKDFVYCNSSLPIYDVRVKDLVDRMTLEEKATNVIYKAAGVERIGLPP 76

Query: 96  YEWWSESLHGIAS---NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR 152
           Y+WWSE+LHG++S   NGP   F+ TV   TSFP V++SAASFN+SLW  I   V+ EAR
Sbjct: 77  YQWWSEALHGVSSVSINGPTF-FDETVPGATSFPNVILSAASFNQSLWKTIRQVVSKEAR 135

Query: 153 AMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDG 212
           A YNLG AGLTFW PN+N+ RDPRWGR QET GEDP  VS YAV +V+  Q         
Sbjct: 136 ATYNLGHAGLTFWCPNVNVARDPRWGRTQETXGEDPFTVSVYAVSYVRGLQD-------- 187

Query: 213 GIGFGFREKRVLKGFGEESDRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
                      ++G    +D     L +S+  KH  AYDL+ W N  R  FNA ++EQD 
Sbjct: 188 -----------VEGTENTTDLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHFNARVSEQDM 236

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAV 330
            +TF  PF +C+ +G  S +MCS+N +NG+P C    LF+   R+EW   GYI SDC ++
Sbjct: 237 AETFLRPFEACVREGDVSGVMCSFNNINGIPPCADPRLFKGTIRDEWNLHGYIVSDCWSI 296

Query: 331 ATIFEYQNYTK-THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
            TI E Q +   T E++ A  LKAG+D+ CG        SA+  G+V + D+D++L NL+
Sbjct: 297 ETIVEDQKFLDVTGEEAVALNLKAGLDLECGHYYNDSPASAVMAGRVGQHDLDQSLSNLY 356

Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
            V +RLG F+G P       LG DD+C S EH +LA +AARQGIVLLKND   LPL   +
Sbjct: 357 VVLMRLGFFDGIP---ALASLGKDDICLSAEHIELAREAARQGIVLLKNDNATLPL--KS 411

Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFH 508
           V +LA++GP  +    M G Y G PC   S      A +    Y  GC DV C++D   +
Sbjct: 412 VKNLALVGPNADAYGAMMGNYAGPPCRSVSPRDAFSA-IGNVTYEMGCGDVLCHNDTYVY 470

Query: 509 EAVRIAKKADFVIVVAGL-DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
           +AV  AK AD  I+V G+ D++  TED+DRV LLLPG Q  LV  +A+ +  P+ILV+ G
Sbjct: 471 KAVEAAKHADTTIIVVGITDVSIGTEDKDRVDLLLPGYQTHLVNQIAKATTAPIILVVCG 530

Query: 568 --GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-K 624
             GGP+D+SFA  +  I  ILW G+PGE G  A+A++++G +NPGGRLP+TWY   +   
Sbjct: 531 HCGGPIDISFARDNPGIEPILWAGFPGEEGGNAIADVVYGKYNPGGRLPVTWYENGYVGM 590

Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK- 683
           +PM  M +R+  S  YPGR Y+F++G+ VY FG GLSYTN+SY  L+AP+  +I   LK 
Sbjct: 591 LPMTSMALRSVESLGYPGRKYKFFSGSTVYPFGCGLSYTNFSYS-LTAPTR-SIHTHLKK 648

Query: 684 -------AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                  A S  +++ Q  + L    +D++ SC    F  +++V   G +DGS VV++++
Sbjct: 649 LQPCRSMAYSICSVIPQCPAVL----VDDL-SCNE-TFEFEVAVKTVGSMDGSEVVIVYS 702

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
             P    GT  KQ+IGF+RV       +++ F ++ C+ L I +  G  +LP G+ ++  
Sbjct: 703 SPPSGIVGTHIKQVIGFERVFVKVGXVEKVKFSMNVCKSLGIVHSSGHTLLPSGSDIIKA 762

Query: 797 G 797
           G
Sbjct: 763 G 763


>gi|37359708|dbj|BAC98299.1| LEXYL2 [Solanum lycopersicum]
          Length = 633

 Score =  592 bits (1526), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 310/668 (46%), Positives = 422/668 (63%), Gaps = 41/668 (6%)

Query: 143 IGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSF 202
           IG  V+ E RAMYN+GQAGLT+W+PN+NI+RDPRWGRGQET GEDP + S Y V +VK  
Sbjct: 2   IGKVVSTEGRAMYNVGQAGLTYWSPNVNIYRDPRWGRGQETAGEDPTLSSRYGVAYVKGL 61

Query: 203 QGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSF 262
           Q    + DDG                    + D L +++CCKH  AYD++ W    RY+F
Sbjct: 62  Q----QRDDG--------------------KKDMLKVASCCKHYTAYDVDDWKGIQRYNF 97

Query: 263 NAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG 321
           NA +T+QD +DTF PPF+SC+  G  + +MCSYNQV+G P C   DL     R +W   G
Sbjct: 98  NAKVTQQDLDDTFNPPFKSCVLDGNVASVMCSYNQVDGKPTCGDYDLLAGVIRGQWKLNG 157

Query: 322 YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDI 381
           YI +DCD++  ++  Q+YTKT E++AA  L AG+ +NCG+ + ++TQ A+++G V E  I
Sbjct: 158 YIVTDCDSLNEMYWAQHYTKTPEETAALSLNAGLGLNCGSWLGKYTQGAVNQGLVNESVI 217

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           DRA+ N F+  +RLG F+G+P+   YG LGP D+CT +H++LA +AARQGIVLLKN    
Sbjct: 218 DRAVTNNFATLMRLGFFDGNPKNQLYGNLGPKDICTEDHQELAREAARQGIVLLKNTAGS 277

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC 501
           LPL+  ++ SLA+IGP  N    M G Y G PC   + L GL A VS T Y  GC D+ C
Sbjct: 278 LPLSPKSIKSLAVIGPNANLAYTMVGSYEGSPCKYTTPLDGLGASVS-TVYQQGC-DIAC 335

Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
            + A    A ++A  AD V++V G D T E E +DR ++ LPGQQ  LVT VA  SK PV
Sbjct: 336 -ATAQVDNAKKVAAAADAVVLVMGSDQTIERESKDRFNITLPGQQSLLVTEVASVSKGPV 394

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           ILV+  GG +DV FA  + +++SILW+G+PGEAG  ALA+++FG  NPGGRLPMTWYP+S
Sbjct: 395 ILVIMSGGGMDVKFAVDNPKVTSILWVGFPGEAGGAALADVVFGYHNPGGRLPMTWYPQS 454

Query: 622 FT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
           +  KV M +MNMRAD    +PGRSYRFY G  V+ FG GLSYT Y +  + AP  ++I  
Sbjct: 455 YVDKVDMTNMNMRADPKTGFPGRSYRFYKGPTVFNFGDGLSYTQYKHHLVKAPKFVSI-- 512

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
            L+ G   +  + T  +     ID V    C +L   + + V N G + GSH V+LF   
Sbjct: 513 PLEEG---HACRSTKCK----SIDAVNEQGCNNLGLDIHLKVQNVGKMRGSHTVLLFTSP 565

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           P V    P+K L+ F ++H   +    + F +D C+ LS+ ++ G R + LG HVL +G+
Sbjct: 566 PSV-HNAPQKHLLDFQKIHLTPQSEGVVKFNLDVCKHLSVVDEVGNRKVALGLHVLHIGD 624

Query: 799 LRHSLTIE 806
           L+HSLT+ 
Sbjct: 625 LKHSLTLR 632


>gi|297736786|emb|CBI25987.3| unnamed protein product [Vitis vinifera]
          Length = 745

 Score =  592 bits (1526), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 302/755 (40%), Positives = 438/755 (58%), Gaps = 87/755 (11%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           S+ FC+ SLS   RAK L+S +TLQEK+ Q    AS + RLG+P Y WWSE+LHGI++ G
Sbjct: 63  SFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLPEYSWWSEALHGISNLG 122

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
           PGV F+ T+   TS P V++S A+FN++LW  +G  V+ E RAMYNLG AGLTFW+PNIN
Sbjct: 123 PGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAMYNLGHAGLTFWSPNIN 182

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           + RD RWGR QET GEDP +V  +AV +V+  Q                        G E
Sbjct: 183 VVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVE---------------------GTE 221

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
           +       +S+CCKH  AYD++ W N  R++F+A ++EQD ++TF  PF  C+ +G  S 
Sbjct: 222 N-------VSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSS 274

Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY-TKTHEDSAA 348
           +MCS+N++NG+P C    L +   R+EW   GYI SDC  +  I + QNY   +  D+ A
Sbjct: 275 VMCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVA 334

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             L+AG+D+ CG         ++  GKV + ++DRAL N++ + +R+G F+G P    Y 
Sbjct: 335 KTLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIP---AYE 391

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            LG  D+C ++H +LA +AARQGIVLLKND + LPL       +A++GP  N    M G 
Sbjct: 392 SLGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIGN 449

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G+PC   S L    A +    YA+G                       F I+  G DL
Sbjct: 450 YAGLPCKYVSPLEAFSA-IGNVTYATG-----------------------FTIIFVGTDL 485

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           + E E  DRV  LLPG Q  L+  VA  S  PVILV+  G  +D++FA+ + +IS+ILW+
Sbjct: 486 SIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWV 545

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRF 647
           G+PGE G  A+A+++FG +NPGGRLP+TWY   +   +PM+ M++R      YPGR+Y+F
Sbjct: 546 GFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKF 605

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
           + G+ VY FG+G+SYT +SY    A S+++I                      + +++  
Sbjct: 606 FDGSTVYPFGYGMSYTKFSYSL--ATSKISID---------------------IDLNKFQ 642

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
            C +     +++VTN G VDGS V+M+++  P    GT  KQ+IGF +V   A  ++ + 
Sbjct: 643 KCRTF----EVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVK 698

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS 802
           F ++ C+ L I +  G  +LP G+H + VG+  +S
Sbjct: 699 FSMNACKSLRIVDSTGYSLLPSGSHTIRVGDYSNS 733


>gi|242052713|ref|XP_002455502.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
 gi|241927477|gb|EES00622.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
          Length = 825

 Score =  590 bits (1520), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 313/790 (39%), Positives = 453/790 (57%), Gaps = 65/790 (8%)

Query: 44  CKPPHFDS-------YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           C P  F +       + +C+ SL  + R + L+  L+L+EK++ L D A   PR+G+P Y
Sbjct: 51  CDPVRFAALGLDMSRFRYCDASLPYAERVRDLVGRLSLEEKVRNLGDQAEGAPRVGLPPY 110

Query: 97  EWWSESLHGIASNGPGVN-FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           +WW E+LHG++  GPG   F   V   TSFP V+ SAA+FN SLW  IG  V+ E RAMY
Sbjct: 111 KWWGEALHGVSDVGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMY 170

Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           NLG A LT+W+PNIN+ RDPRWGR  ETPGEDP VV  YAV FV+  Q          + 
Sbjct: 171 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQD---------VV 221

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                      F         + +S+CCKH  AYD++ W    R +F+A + E+D  +TF
Sbjct: 222 IAAGAAATADPFSRP------IKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETF 275

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIF 334
           + PF  CI  G ASC+MCSYN++NG+PAC    L  +  R++W   GYI SDCD+V  + 
Sbjct: 276 ERPFEMCIRDGDASCVMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMV 335

Query: 335 ---EYQNYTKTHEDSAAGVLKAGMDINCGTC-------MLRHTQSAIDKGKVQEKDIDRA 384
              ++ NYT     +AA  +KAG+D++CG            +   A+ +GK++E D+D A
Sbjct: 336 RDAKWLNYTGVEATAAA--MKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEADVDNA 393

Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
           L N+++  +RLG F+G P   ++  LG DDVCT +HK+LA DAARQG+VLLKND + LPL
Sbjct: 394 LGNVYTTLMRLGFFDGMP---EFESLGADDVCTRDHKELAADAARQGMVLLKNDARRLPL 450

Query: 445 NKNAVSSLAIIGPL--VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN 502
           + + ++S++++G L  +N    M G Y G PC   +    +   V+ T Y   C    C+
Sbjct: 451 DPSKINSVSLVGLLEHINATDVMLGDYRGKPCRIVTPYDAIRQVVNAT-YVHACDSGACS 509

Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVI 562
           +  G   A R AK AD  IV+AGL+++ E E  DR  LLLP  Q S + +VA  S  P++
Sbjct: 510 TAEGMGRASRTAKIADATIVIAGLNMSVERESNDREDLLLPWNQSSWINAVAEASTTPIV 569

Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 622
           LV+   G +DVSFA+ +++I +I+W GYPGE G  A+A+++FG +NPGGRLP+TW+   +
Sbjct: 570 LVIMSAGGVDVSFAQNNTKIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEY 629

Query: 623 T-KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISA 680
             ++PM  M +R D++  YPGR+Y+FY G  V Y FGHGLSYT+++Y   S  +  T++ 
Sbjct: 630 VNQIPMTSMALRPDAAHGYPGRTYKFYGGPAVLYPFGHGLSYTSFTYA--SGTTGATVTI 687

Query: 681 SLKAGSDKNILQQTGSR------------LDYVHIDEVTSCTSLRFHVQISVTNAGDVDG 728
            + A     +L     +            +     DEV S  SLR      V N G V G
Sbjct: 688 PIGAWEHCKMLTYKSGKAPSPSPACPALNVASHRCDEVVS-FSLR------VANTGGVGG 740

Query: 729 SHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILP 788
            HVV ++   P      P KQL+ F RV   A  + ++ F ++ C+  +I  +    ++P
Sbjct: 741 DHVVPVYTAPPPEVGDAPRKQLVEFRRVFVPAGAAVDVPFALNVCKTFAIVEETAYTVVP 800

Query: 789 LGNHVLMVGE 798
            G   ++VG+
Sbjct: 801 SGVSTVIVGD 810


>gi|297736788|emb|CBI25989.3| unnamed protein product [Vitis vinifera]
          Length = 746

 Score =  590 bits (1520), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 307/749 (40%), Positives = 431/749 (57%), Gaps = 90/749 (12%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           + +C++S     RAK L+  +TL EK+ Q  D AS + R+G+P Y WWSE+LHG+++ G 
Sbjct: 68  FHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEALHGVSNFGR 127

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
            V F+  V   TSFP V++SAASFN+SLW  +G AV+ EARAMYN G AGLTFW+PNIN+
Sbjct: 128 CVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLTFWSPNINV 187

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            RDPRWGR  ETPGEDP +V  YAV +                                 
Sbjct: 188 VRDPRWGRILETPGEDPHLVGLYAVNY--------------------------------- 214

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                        H  AYDL+ W    R  F+A ++ QD  +TF  PF  C+++G  S +
Sbjct: 215 -------------HYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSV 261

Query: 292 MCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
           MCSYN++NG+P+C    L  Q  R EW   GYI SDCD+V  +   Q +   +  DSAA 
Sbjct: 262 MCSYNKINGIPSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQ 321

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            L AGM+++CGT   R    A+++GK  + D+D +L  L+ + +R+G F+G P    +  
Sbjct: 322 ALNAGMNLDCGTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIP---AFAS 378

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG DD+C++EH +LA +AARQGIVLLKND   LPL   +V ++A++GP  N    M G Y
Sbjct: 379 LGKDDICSAEHIELAREAARQGIVLLKNDNATLPLK--SVKNIALVGPHANATDAMIGNY 436

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
            GIPC   S L    + + +  Y  GC DV C ++     A+  AK+AD  I+ AG DL+
Sbjct: 437 AGIPCYYVSPLDAFSS-MGEVRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLS 495

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
            E E  DRV LLLPG Q  L+  VA  S  PV+LV+  GG +D+SFA  + +I++ILW G
Sbjct: 496 IEAEALDRVDLLLPGYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAG 555

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFY 648
           YPGE G  A+A++I G +NPGGRLP+TWY   +   +PM  M +R   S  YPGR+Y+F+
Sbjct: 556 YPGEQGGNAIADVILGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKFF 615

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
            G+ VY FG+G+SYTN+SY        L+ S S K                         
Sbjct: 616 NGSTVYPFGYGMSYTNFSYS-------LSTSQSCK------------------------- 643

Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
             S+ F  +++V N G +DGS VV++++  P    GT  K+++GF+RV     G++++ F
Sbjct: 644 -ESIEF--EVAVKNVGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKF 700

Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            ++ C+ L I +  G  +LP G+H + VG
Sbjct: 701 SMNVCKSLGIVDSTGYALLPSGSHTIKVG 729


>gi|226506870|ref|NP_001146482.1| uncharacterized protein LOC100280070 precursor [Zea mays]
 gi|219887469|gb|ACL54109.1| unknown [Zea mays]
 gi|413947917|gb|AFW80566.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 835

 Score =  588 bits (1516), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 312/782 (39%), Positives = 449/782 (57%), Gaps = 51/782 (6%)

Query: 44  CKPPHF-------DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           C P  F         + +C+ SL  + R + L+  L L+EK++ L D A   PR+G+P Y
Sbjct: 63  CDPARFVALGLDMSRFRYCDASLPYADRVRDLVGRLALEEKVRNLGDQAEGAPRVGLPPY 122

Query: 97  EWWSESLHGIASNGPGVN-FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           +WW E+LHG++  GPG   F   V   TSFP V+ SAA+FN SLW  IG  V+ E RAMY
Sbjct: 123 KWWGEALHGVSDVGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAMY 182

Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           NLG A LT+W+PNIN+ RDPRWGR  ETPGEDP VV  YAV FV+  Q      DD    
Sbjct: 183 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQ----DVDD---- 234

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                 R      +   R   + +S+CCKH  AYD++ W    R +F+A + E+D  +TF
Sbjct: 235 ------RPYAAAADPFSR--PIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETF 286

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIF 334
           + PF  CI  G ASC+MCSYN++NG+PAC    L  +  R++W   GYI SDCD+V  + 
Sbjct: 287 ERPFEMCIRDGDASCVMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMV 346

Query: 335 ---EYQNYTKTHEDSAAGVLKAGMDINCGTC-------MLRHTQSAIDKGKVQEKDIDRA 384
              ++ NYT     +AA  +KAG+D++CG            +   A+ +GK++E D+D A
Sbjct: 347 RDAKWLNYTGVEATAAA--MKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEGDVDNA 404

Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
           L N+++  +RLG F+G P   ++  LG  +VCT  HK+LA DAARQG+VLLKND + LPL
Sbjct: 405 LSNVYTTLMRLGFFDGMP---EFESLGASNVCTDGHKELAADAARQGMVLLKNDARRLPL 461

Query: 445 NKNAVSSLAIIGPL--VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN 502
           + N ++S++++G L  +N    M G Y G PC   +    +   V+ T Y   C    CN
Sbjct: 462 DPNKINSVSLVGLLEHINATDVMLGDYRGKPCRIVTPYNAIRNMVNAT-YVHACDSGACN 520

Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVI 562
           +  G   A   AK AD  IV+AGL+++ E E  DR  LLLP  Q S + +VA  S  P++
Sbjct: 521 TAEGMGRASSTAKIADATIVIAGLNMSVERESNDREDLLLPWNQSSWINAVAMASPTPIV 580

Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 622
           LV+   G +DVSFA  +++I +I+W GYPGE G  A+A+++FG +NPGGRLP+TW+   +
Sbjct: 581 LVIMSAGGVDVSFAHNNTKIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEY 640

Query: 623 T-KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISA 680
             ++PM  M +R D++  YPGR+Y+FY G  V Y FGHGLSYTN+SY   S  +  T++ 
Sbjct: 641 VNQIPMTSMALRPDAALGYPGRTYKFYGGPAVLYPFGHGLSYTNFSYA--SGTTGATVTI 698

Query: 681 SLKAGSDKNILQ-QTGSRLDYVHIDEVTSCTSLRFHV---QISVTNAGDVDGSHVVMLFA 736
            + A     +L  + G+         +   + +   V    + V N G V G HVV ++ 
Sbjct: 699 HIGAWEHCKMLTYKMGAPSPSPACPALNVASHMCSEVVSFSLRVANTGGVGGDHVVPVYT 758

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
             P      P KQL+ F RV   A  + ++ F ++ C+  +I  +    ++P G   ++V
Sbjct: 759 APPPEVGDAPLKQLVAFRRVFVPAGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVVV 818

Query: 797 GE 798
           G+
Sbjct: 819 GD 820


>gi|224128360|ref|XP_002320310.1| predicted protein [Populus trichocarpa]
 gi|222861083|gb|EEE98625.1| predicted protein [Populus trichocarpa]
          Length = 635

 Score =  588 bits (1515), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 286/663 (43%), Positives = 408/663 (61%), Gaps = 43/663 (6%)

Query: 147 VAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN 206
           V+ EARAM+N G AGLT+W+PN+NIFRDPRWGRGQETPGEDP+VV  YA  +V+  QG +
Sbjct: 4   VSDEARAMFNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVVGKYAASYVRGLQGSD 63

Query: 207 WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMI 266
                                      G+ L ++ACCKH  AYDL+ W    R+ FNA +
Sbjct: 64  ---------------------------GNRLKVAACCKHFTAYDLDNWNGVDRFHFNAEV 96

Query: 267 TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSD 326
           ++QD EDTF  PFR C+++GK + +MCSYNQVNG+P C   +L +K       +G +   
Sbjct: 97  SKQDMEDTFDVPFRMCVKEGKVASVMCSYNQVNGIPTCADPNLLKKT-----VRGTLFQT 151

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLK-AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRAL 385
              +  I       +        +LK A +D++CG  + +HT+ A+ KG + E +I+ AL
Sbjct: 152 VTLLEFIMGSNTILQPRRKQPRMLLKQASLDLDCGPFLGQHTEDAVKKGLLNEAEINNAL 211

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
           LN  +VQ+RLG+F+G+P    YG LGP+DVCT  H++LAL+AARQGIVLLKN    LPL+
Sbjct: 212 LNTLTVQMRLGMFDGEPSSQLYGNLGPNDVCTPAHQELALEAARQGIVLLKNHGPSLPLS 271

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
                S+AI+GP  N  + M G Y G+ C   + L+G++ Y    H   GC DV C SD 
Sbjct: 272 TRRHLSVAIVGPNSNVTATMIGNYAGLACGYTTPLQGIQRYAQTIHR-QGCADVACVSDQ 330

Query: 506 GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
            F  A+  A++AD  ++V GLD + E E RDR  LLLPG+Q  LV+ VA  SK P ILVL
Sbjct: 331 QFSAAIDAARQADATVLVMGLDQSIEAEFRDRTGLLLPGRQQELVSKVAAASKGPTILVL 390

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TK 624
             GGP+DVSFAE D +I SI+W GYPG+AG  A+++++FG  NPGG+LPMTWYP+ + T 
Sbjct: 391 MSGGPIDVSFAENDPKIGSIVWAGYPGQAGGAAISDVLFGITNPGGKLPMTWYPQDYITN 450

Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLK 683
           +PM +M MR+  S+ YPGR+YRFY G  VY FGHG+SYTN+ +   SAP+ +++     +
Sbjct: 451 LPMTNMAMRSSKSKGYPGRTYRFYKGKVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHR 510

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
            GS    +     R+ +        C  L   +Q+ V N G +DG+H +++++R P    
Sbjct: 511 HGSGNATISGKAIRVTHAR------CNRLSLGMQVDVKNTGSMDGTHTLLVYSR-PPARH 563

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
             P KQL+ F++VH  A   + +   +  C+ LS+ +  G R +P+G H L +G+++HS+
Sbjct: 564 WAPHKQLVAFEKVHVAAGTQQRVGINIHVCKSLSVVDGSGIRRIPMGEHSLHIGDVKHSV 623

Query: 804 TIE 806
           +++
Sbjct: 624 SLQ 626


>gi|40363751|dbj|BAD06320.1| putative beta-xylosidase [Triticum aestivum]
          Length = 573

 Score =  579 bits (1492), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 285/575 (49%), Positives = 374/575 (65%), Gaps = 12/575 (2%)

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
           +L  SACCKH  AYDLE W   +R++F+A +TEQD  DT+ PPF+SC+E G AS IMCSY
Sbjct: 4   DLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCSY 63

Query: 296 NQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N+VNGVP C   +L  K AR +W F GYITSDCDAVA I + Q Y K  ED+ A VLKAG
Sbjct: 64  NRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGYAKAPEDAVADVLKAG 123

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
           MD+NCG  +  H  SA  +GK+  +DIDRAL NLF++++RLGLFNG+P+  +YG +G D 
Sbjct: 124 MDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFNGNPKYNRYGNIGADQ 183

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           VC  EH+ LAL AA+ GIVLLKND   LPL+K+ VSS+A+IGP  NN S + G Y G PC
Sbjct: 184 VCKKEHQDLALQAAQDGIVLLKNDAGALPLSKSKVSSVAVIGPNGNNASLLLGNYFGPPC 243

Query: 475 SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED 534
              +  + L+ YV    +  GC+   CN  +   EAV  A  AD+V++  GLD  QE E+
Sbjct: 244 ISVTPFQALQGYVKDATFVQGCNAAVCNV-SNIGEAVHAASSADYVVLFMGLDQNQEREE 302

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR+ L LPG Q SLV  VA  +K+PVILVL  GGP+DV+FA+ + +I +I+W GYPG+A
Sbjct: 303 VDRLELGLPGMQESLVNKVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQA 362

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
           G  A+A+++FG+ NPGGRLP+TWYP+ FT VPM DM MRAD S  YPGR+YRFY G  VY
Sbjct: 363 GGIAIAQVLFGEHNPGGRLPVTWYPKEFTAVPMTDMRMRADPSTGYPGRTYRFYKGKTVY 422

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID----EVTSCT 710
            FG+GLSY+ YS++F S  ++    + ++       L+ T S    V  D       +C 
Sbjct: 423 NFGYGLSYSKYSHRFASEGTKPPSMSGIEG------LKATASAAGTVSYDVEEMGAEACD 476

Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
            LRF   + V N G +DG H V+LF R P  + G P  QLIGF  VH  A  +  + F V
Sbjct: 477 RLRFPAVVRVQNHGPMDGRHPVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVEFEV 536

Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
            PC+  S A + GR+++  G+H + VG+    L+ 
Sbjct: 537 SPCKHFSRAAEDGRKVIDQGSHFVKVGDDEFELSF 571


>gi|326523729|dbj|BAJ93035.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 810

 Score =  578 bits (1491), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 310/779 (39%), Positives = 445/779 (57%), Gaps = 53/779 (6%)

Query: 44  CKPPHFDS-------YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           C P  F +       + +C+ SL  + R + L+  LTL+EK++ L D A    R+G+P Y
Sbjct: 46  CDPARFAALGLEMAGFRYCDASLPYADRVRDLVGRLTLEEKVRNLGDRAEGAARVGLPPY 105

Query: 97  EWWSESLHGIASNGPG-VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
            WW E+LHG++  GPG   F   V   TSFP V+ SAA+FN +LW  IG AV+ E RAMY
Sbjct: 106 LWWGEALHGVSDTGPGGTRFGDVVPGATSFPLVINSAAAFNETLWGAIGGAVSTEIRAMY 165

Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           NLG A LT+W+PNIN+ RDPRWGR  ETPGEDP VV  YAV FV++ Q      D  G G
Sbjct: 166 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVSFVRAMQ----DIDGAGPG 221

Query: 216 FGFRE-KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
            G     R +K             +S+CCKH  AYD++ W    R +F+A + E+D  +T
Sbjct: 222 AGADPFARPIK-------------VSSCCKHYAAYDVDAWLTADRLTFDAQVEERDMIET 268

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATI 333
           F+ PF  C+  G ASC+MCSYN++NGVPAC    L  +  R EW   GYI SDCD+V  +
Sbjct: 269 FERPFEMCVRDGDASCVMCSYNRINGVPACANARLLSETVRGEWQLHGYIVSDCDSVRVM 328

Query: 334 FEYQNYTKTHE-DSAAGVLKAGMDINCGT-------CMLRHTQSAIDKGKVQEKDIDRAL 385
                +   +  ++ A  +KAG+D++CG                A+ +GK++E ++D AL
Sbjct: 329 VRDAKWLGYNGVEATAAAMKAGLDLDCGMFWEGAQDFFTAFGLDAVRQGKLRESEVDNAL 388

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
            NL+   +RLG F+G P   +   LG +DVCT EHK+LA DAARQG+VL+KND   LPL+
Sbjct: 389 RNLYLTLMRLGFFDGIP---ELESLGANDVCTEEHKELAADAARQGMVLIKNDHGRLPLD 445

Query: 446 KNAVSSLAIIGPL--VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNS 503
            + V+SL+++G L  +N    M G Y G PC   +    +   VS T     C    C++
Sbjct: 446 TSKVNSLSLVGLLQHINATDVMLGDYRGKPCRVVTPYDAIRKVVSATSMQV-CDHGACST 504

Query: 504 DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
            A         K  D  IV+AGL+++ E E  DR  LLLP  Q + + +VA  S  P+IL
Sbjct: 505 AAN-------GKTVDATIVIAGLNMSVEKEGNDREDLLLPWNQTNWINAVAEASPYPIIL 557

Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY-PESF 622
           V+   G +DVSFA+ + +I +I+W GYPGE G  A+A+++FG +NPGGRLP+TWY  E  
Sbjct: 558 VIISAGGVDVSFAQNNPKIGAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKSEYI 617

Query: 623 TKVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISAS 681
           +K+PM  M +R  + + YPGR+Y+FY G +V Y FGHGLSY+N+SY   +  + +T+   
Sbjct: 618 SKIPMTSMALRPVADKGYPGRTYKFYGGPEVLYPFGHGLSYSNFSYASDTTGASVTVRVG 677

Query: 682 LKAGSDKNILQQTGSR--LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
               S K + ++ G+   L    ++             ++V N G  DG+HVVM++   P
Sbjct: 678 AWE-SCKQLTRKPGTTAPLACPAVNVAGHGCKEEVSFSLTVANRGSRDGAHVVMVYTVPP 736

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
                 P KQL+ F RV   A  + ++ F ++ C+  +I  +    ++P G   ++VG+
Sbjct: 737 AEVDDAPLKQLVAFRRVFVPAGAAVQVPFTLNVCKAFAIVEETAYTVVPSGVSTVLVGD 795


>gi|14164501|dbj|BAB55751.1| putative alpha-L-arabinofuranosidase/beta-D- xylosidase isoenzyme
           ARA-I [Oryza sativa Japonica Group]
          Length = 818

 Score =  578 bits (1489), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 320/790 (40%), Positives = 446/790 (56%), Gaps = 56/790 (7%)

Query: 44  CKPPHFDS-------YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           C P  F +       +P+C+ SL  + R + L+  +TL+EK+  L D A   PR+G+P Y
Sbjct: 47  CDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY 106

Query: 97  EWWSESLHGIASNGPG-VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
            WW E+LHG++  GPG   F   V   TSFP V+ SAASFN +LW  IG  V+ E RAMY
Sbjct: 107 LWWGEALHGVSDVGPGGTWFGDAVPGATSFPLVINSAASFNETLWRAIGGVVSTEIRAMY 166

Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           NLG A LT+W+PNIN+ RDPRWGR  ETPGEDP VV  YAV FV+  Q      D  G  
Sbjct: 167 NLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQ------DIDGAT 220

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                      F         + +S+CCKH  AYD++ W    R +F+A + E+D  +TF
Sbjct: 221 TAASAAAATDAFSR------PIKVSSCCKHYAAYDVDAWNGTDRLTFDARVQERDMVETF 274

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIF 334
           + PF  CI  G ASC+MCSYN++NGVPAC    L  +  R +W   GYI SDCD+V  + 
Sbjct: 275 ERPFEMCIRDGDASCVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMV 334

Query: 335 ---EYQNYTKTHEDSAAGVLKAGMDINCGT-------CMLRHTQSAIDKGKVQEKDIDRA 384
              ++  YT     +AA  +KAG+D++CG            +   A+ +GK++E  +D A
Sbjct: 335 RDAKWLGYTGVEATAAA--MKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNA 392

Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
           L NL+   +RLG F+G P   +   LG  DVCT EHK+LA DAARQG+VLLKND   LPL
Sbjct: 393 LTNLYLTLMRLGFFDGIP---ELESLGAADVCTEEHKELAADAARQGMVLLKNDAALLPL 449

Query: 445 NKNAVSSLAIIGPL--VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN 502
           +   V+S+A+ G L  +N    M G Y G PC   +   G+   VS T     C    C+
Sbjct: 450 SPEKVNSVALFGQLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTS-VHACDKGSCD 508

Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVI 562
           +      A   AK  D  IVVAGL+++ E E  DR  LLLP  Q S + +VA  S  P++
Sbjct: 509 T------AAAAAKTVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIV 562

Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP-ES 621
           LV+   G +DVSFA+ + +I +++W GYPGE G  A+A+++FG +NPGGRLP+TWY  E 
Sbjct: 563 LVIMSAGGVDVSFAQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEY 622

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISA 680
            +K+PM  M +R D+   YPGR+Y+FY G  V Y FGHGLSYTN++Y   +A + +T+  
Sbjct: 623 VSKIPMTSMALRPDAEHGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTV-- 680

Query: 681 SLKAGSDKNILQQT-----GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
             K G+ +   Q T      S      ++  +          ++V N G  DG+HVV ++
Sbjct: 681 --KVGAWEYCKQLTYKAGVSSPPACPAVNVASHACQEEVSFAVTVANTGGRDGTHVVPMY 738

Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
              P    G P KQL+ F RV   A  + E++F ++ C+  +I  +    ++P G   ++
Sbjct: 739 TAPPAEVDGAPRKQLVAFRRVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVL 798

Query: 796 VGELRHSLTI 805
           VG+   SL+ 
Sbjct: 799 VGDDALSLSF 808


>gi|125576923|gb|EAZ18145.1| hypothetical protein OsJ_33695 [Oryza sativa Japonica Group]
          Length = 591

 Score =  577 bits (1487), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 293/615 (47%), Positives = 383/615 (62%), Gaps = 43/615 (6%)

Query: 190 VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAY 249
           + S YAV FVK  QG +                              L  SACCKH+ AY
Sbjct: 1   MASKYAVAFVKGMQGNSSAI---------------------------LQTSACCKHVTAY 33

Query: 250 DLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDL 309
           DLE W    RY+FNA +T QD EDT+ PPFRSC+   KA+CIMC+Y  +NGVPAC   DL
Sbjct: 34  DLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKATCIMCAYTGINGVPACANADL 93

Query: 310 FQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQ 368
             K  R +WG  GYI SDCDAVA + + Q YT+T ED+ A  LKAG+D+NCGT M +H  
Sbjct: 94  LTKTVRGDWGLDGYIASDCDAVAIMRDAQRYTQTPEDAVAVALKAGLDMNCGTYMQQHAT 153

Query: 369 SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDA 427
           +AI +GK+ E+DID+AL NLF++++RLG F+GDPR    YG LG  D+CT EH+ LAL+A
Sbjct: 154 AAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSVYGGLGAADICTPEHRSLALEA 213

Query: 428 ARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV 487
           A  GIVLLKND   LPL++ AV+S A+IGP  N+   + G Y G PC   + L G+  Y+
Sbjct: 214 AMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALIGNYFGPPCESTTPLNGILGYI 273

Query: 488 SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQM 547
               + +GC+   C+  A   +A  +A  +D+V +  GL   QE+E RDR SLLLPG+Q 
Sbjct: 274 KNVRFLAGCNSAACDVAA-TDQAAAVASSSDYVFLFMGLSQKQESEGRDRTSLLLPGEQQ 332

Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
           SL+T+VA  +KRPVILVL  GGP+DV+FA+ + +I +ILW GYPG+AG  A+A ++FGD 
Sbjct: 333 SLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILWAGYPGQAGGLAIARVLFGDH 392

Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
           NPGGRLP+TWYPE FTKVPM DM MRAD +  YPGRSYRFY G  VY FG+GLSY++YS 
Sbjct: 393 NPGGRLPVTWYPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGYGLSYSSYSR 452

Query: 668 KFLS----APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQISVT 721
           + +S    A S   + ASL+  +       T    +  HI+E+ +  C  L+F   + V 
Sbjct: 453 QLVSGGKPAESYTNLLASLRTTT-------TSEGDESYHIEEIGTDGCEQLKFPAVVEVQ 505

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G +DG H V+++ R P    G P  QLIGF   H        I F + PCE  S   K
Sbjct: 506 NHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGFRSQHLKVGEKANIRFDISPCEHFSRVRK 565

Query: 782 HGRRILPLGNHVLMV 796
            G++++  G+H LMV
Sbjct: 566 DGKKVIDRGSHYLMV 580


>gi|356510699|ref|XP_003524073.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Glycine max]
          Length = 613

 Score =  574 bits (1479), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 289/580 (49%), Positives = 380/580 (65%), Gaps = 35/580 (6%)

Query: 36  VLNKPDFPC---KPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLG 92
           V+    F C   K P    Y FC+ SL +  R K L+  LTLQEKI  L ++A  + RLG
Sbjct: 24  VVRGQTFACDVGKSPAVAGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAGDVSRLG 83

Query: 93  IPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR 152
           IP YEWWSE+LHG+++ G G  F+  V   TSFP  +++AASFN SL+  IG  V+ EA 
Sbjct: 84  IPRYEWWSEALHGVSNVGLGTRFSNVVPGATSFPMPILTAASFNTSLFEVIGRVVSTEAG 143

Query: 153 AMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDG 212
           AMYN+G AGLT+W+PNINIFRDPRWGRG ETPGEDP++ S YA  +VK  Q         
Sbjct: 144 AMYNVGLAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQ--------- 194

Query: 213 GIGFGFREKRVLKGFGEESDRGD--ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQD 270
                            ++D GD  +L ++ACCKH  AYD++KW    RY+FNA++T+QD
Sbjct: 195 -----------------QTDGGDPNKLKVAACCKHYTAYDVDKWKGIQRYTFNAVLTKQD 237

Query: 271 TEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDA 329
            EDTFQPPF+SC+  G  + +MCSYN+VNG P C   DL +   R EW   GY+ SDCD+
Sbjct: 238 LEDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVVRGEWKLNGYMVSDCDS 297

Query: 330 VATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
           V  +++YQ+YTKT E++AA  + AG+D+NCG  + ++T+ A+ +G + E  I+ A+ N F
Sbjct: 298 VEVLYKYQHYTKTPEEAAAISILAGLDLNCGRFLGQYTEGAVKQGLIDES-INNAVSNNF 356

Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
           +  +RLG F+GDPRK  YG LGP DVCT  +++LA +AARQGIV LKN    LPLN  A+
Sbjct: 357 ATLMRLGFFDGDPRKQPYGNLGPKDVCTPANQELAREAARQGIVSLKNSPASLPLNAKAI 416

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
            SLA+IGP  N    M G Y GIPC   S L+GL A+V  T YA+GC DV C +     +
Sbjct: 417 KSLAVIGPNANATRVMIGNYEGIPCKYISPLQGLTAFVP-TSYAAGCLDVRCPNPV-LDD 474

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
           A +I+   D  ++V G  L  E E  DRV++LLPGQQ  LVT VA  SK PVILV+  GG
Sbjct: 475 AKKISASGDATVIVVGASLAIEAESLDRVNILLPGQQQLLVTEVANASKGPVILVIMSGG 534

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
            +DVSFA+ +++I+SILW+GYPGEAG  A+A++IFG  NP
Sbjct: 535 GMDVSFAKDNNKITSILWVGYPGEAGGAAIADVIFGFHNP 574


>gi|340370206|ref|XP_003383637.1| PREDICTED: probable beta-D-xylosidase 5-like [Amphimedon
           queenslandica]
          Length = 728

 Score =  572 bits (1474), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 311/771 (40%), Positives = 449/771 (58%), Gaps = 74/771 (9%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C+   F++Y +C+ + SI  R   L+S +T+ +KI QL  +A AIP L IPAY+WWSE L
Sbjct: 21  CEKAPFNTYKYCDYTQSIPERVNDLLSRMTILDKIPQLITSAPAIPSLDIPAYQWWSEGL 80

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
           HG+A + PGV+F G   + TSFPQV+   A+FN SL   +   ++ EARA  N GQAGLT
Sbjct: 81  HGVAGS-PGVHFGGNFPNATSFPQVIGLGATFNMSLVLAMAQVISTEARAFANGGQAGLT 139

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           ++APNINIFRDPRWGRGQETPGEDP + S YA  FVK  Q                    
Sbjct: 140 YFAPNINIFRDPRWGRGQETPGEDPYLSSQYAANFVKGMQ-------------------- 179

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                E +D    L   A CKH  AYDLE + N SR++FNA++++QD E+T+ P FRSC+
Sbjct: 180 -----EGADDTRYLKTIATCKHYAAYDLENYLNLSRHTFNAIVSDQDFEETYFPAFRSCV 234

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           E+GK   IMCSYN VNGVP+C    +  + AR +WGF+GY+ SDC A++ I     YT  
Sbjct: 235 EEGKVGSIMCSYNAVNGVPSCANDFINNEVARGKWGFEGYVVSDCGAISDIINSHKYTSN 294

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
            +D+ A  L+ G D+NCG     H Q+A D G + + DIDRA+  LF+ ++RLG+F+  P
Sbjct: 295 TDDTVAAGLRGGCDLNCGHFYSDHAQAAYDNGAITDDDIDRAMTRLFTYRMRLGMFD-PP 353

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
               +     D V T +H+ LALDA+R+ IVLL+N+K  LPL+      +A++GP     
Sbjct: 354 SMQPFRDYTNDKVDTKQHEALALDASRESIVLLQNNKDILPLSLTTHRKIALVGPHGQAQ 413

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKK--ADFV 520
             M G Y G      S ++GL+       +A+GC  V C + AGF E  ++ ++   + +
Sbjct: 414 GAMQGNYKGTAPYLISPMQGLQDLGLSVTFAAGCTQVACPTIAGFSEVTKLVEEHSIEAI 473

Query: 521 IVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKR--PVILVLTGGGPLDVSFAEA 578
           I V GLD +QE+E  DR SL LPGQQ+ L+  + + +    P I+V+  GGP+D+S  + 
Sbjct: 474 IAVIGLDESQESEGHDRTSLTLPGQQVQLLEDIKKKAVPGIPFIVVVMSGGPVDLSGVKD 533

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSS 637
            +   +ILW GYPG++G +A+AE+I+G  NP GRLP+T+YP S+  ++P  +M+MR    
Sbjct: 534 IAD--AILWAGYPGQSGGQAIAEVIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRVP-- 589

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
              PGRSY+FYTGT V+ FG GLSYT +  K+ + P+       LK   D ++       
Sbjct: 590 ---PGRSYKFYTGTPVFPFGFGLSYTTFEMKWKNPPN----VTHLKTTHDVDV------- 635

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
                            + ++ VTNAG   GS  V + A +     G P K+L GF +++
Sbjct: 636 -----------------NYEVVVTNAGKRSGS--VSVLAYITSTVPGAPMKELFGFQKIY 676

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG---ELRHSLTI 805
              + S  +SF  +P +  +  +KHG R +  G + + +G   +L+H++ I
Sbjct: 677 LKPEQSMTLSFVAEP-KVFTTVDKHGERKIRPGTYKITIGDTSDLKHTVFI 726


>gi|357128056|ref|XP_003565692.1| PREDICTED: beta-D-xylosidase 3-like [Brachypodium distachyon]
          Length = 821

 Score =  566 bits (1458), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 308/783 (39%), Positives = 441/783 (56%), Gaps = 50/783 (6%)

Query: 44  CKPPHFDS-------YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNAS-AIPRLGIPA 95
           C P  F S       + +C+ SL  + R + L+  LTL+EK+  L D A  A  R+G+P 
Sbjct: 46  CDPARFASLGLDMAGFRYCDASLPYAERVRDLVGRLTLEEKVANLGDQAKGAEQRVGLPR 105

Query: 96  YEWWSESLHGIASNGPG-VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
           Y WW E+LHG++   PG   F   V   TSFP VL SAA+FN +LW  IG A + E RAM
Sbjct: 106 YMWWGEALHGVSDTNPGGTRFGDVVPGATSFPLVLNSAAAFNETLWRAIGGATSTEIRAM 165

Query: 155 YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
           YNLG A LT+W+PNIN+ RDPRWGR  ETPGEDP +V  +AV FV++ Q  +  ++ G  
Sbjct: 166 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFLVGRFAVSFVRAMQDIDDGANAGAG 225

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
                 +R              L +S+CCKH  AYD++KW    R SF+A + E+D  +T
Sbjct: 226 AADPFARR--------------LKVSSCCKHYAAYDVDKWFGADRLSFDANVQERDMVET 271

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATI 333
           F+ PF  C+  G ASC+MCSYN++NGVPAC  G L     R +W   GYI SDCD+V  +
Sbjct: 272 FERPFEMCVRDGDASCVMCSYNRINGVPACANGRLLTGTVRRDWQLHGYIVSDCDSVRVM 331

Query: 334 F---EYQNYTKTHEDSAAGVLKAGMDINCGT-------CMLRHTQSAIDKGKVQEKDIDR 383
               ++  Y      +AA  +KAG+D++CG            +   A+ +GK++E ++D 
Sbjct: 332 VRDAKWLGYDGVQATAAA--MKAGLDLDCGMFWEGAKDFFTAYGLQAVRQGKLKEAEVDE 389

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
           AL +L+   +RLG F+G P   ++  LG  DVCT EHK++A +AARQG+VLLKND   LP
Sbjct: 390 ALGHLYLTLMRLGFFDGSP---EFQSLGASDVCTEEHKEMAAEAARQGMVLLKNDHDRLP 446

Query: 444 LNKNAVSSLAIIGPL--VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC 501
           L+ N V+SLA++G L  +N    M G Y G PC   +    +   VS T   + C    C
Sbjct: 447 LDANKVNSLALVGLLQHINATDVMLGDYRGKPCRVVTPYEAIRKVVSGTSMQA-CDKGAC 505

Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
            + A    A   AK  D  IV+ GL+++ E E  DR  LLLP  Q   + +VA  S+ P+
Sbjct: 506 GTTA--LGAAIAAKTVDATIVITGLNMSVEREGNDREDLLLPWDQTQWINAVAEASRDPI 563

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
            LV+   G +D+SFA+ + +I +ILW GYPGE G   +A+++FG +NPGGRLP+TWY   
Sbjct: 564 TLVIISAGGVDISFAQNNPKIGAILWAGYPGEEGGTGIADVLFGKYNPGGRLPLTWYKNE 623

Query: 622 FT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTIS 679
           +  K+PM  M +R  + + YPGR+Y+FY+G  V Y FGHGLSYTN++Y   +  + +T+ 
Sbjct: 624 YIGKLPMTSMALRPVADKGYPGRTYKFYSGPDVLYPFGHGLSYTNFTYDSYTTGASVTVK 683

Query: 680 -ASLKAGSDKNILQQTGSRLDYV---HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
             +    S KN+  + G+         I+             + V+N G + GSHVV ++
Sbjct: 684 IGTAWEDSCKNLTYKPGTTASTAPCPAINVAGHGCQEEVSFTLKVSNTGGIGGSHVVPVY 743

Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
              P      P KQL+ F R+   A  + E+ F +  C+  +I       ++P G   ++
Sbjct: 744 TAPPAEVDDAPLKQLVAFRRMFVPAGDAVEVPFTLSVCKAFAIVEGTAYTVVPAGVSRVL 803

Query: 796 VGE 798
           VG+
Sbjct: 804 VGD 806


>gi|348667575|gb|EGZ07400.1| xylosidase [Phytophthora sojae]
          Length = 751

 Score =  565 bits (1457), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 303/772 (39%), Positives = 439/772 (56%), Gaps = 72/772 (9%)

Query: 41  DFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           D  C  P   S PFC+ SL I  R   L++ + L++ +  L + ASA P + +P+YEWW+
Sbjct: 21  DNVCDDPKVSSLPFCDGSLPIDARVSDLVNRIPLEQAVGLLVNKASAAPSVNVPSYEWWN 80

Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           E+LHG+A + PGV F G +++ TSFPQVL +AASFNR+L+  I  A++ EARA YN   A
Sbjct: 81  EALHGVALS-PGVTFKGPLTAATSFPQVLSTAASFNRTLFYQIAEAISTEARAFYNEKNA 139

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GLTFW PN+NIFRDPRWGRGQETPGEDP +   YAV FV+  QGE               
Sbjct: 140 GLTFWTPNVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGE--------------- 184

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
              ++G  E  D    L +S+CCKH  AY  E      R+  +A++T+QD  DT+ P F 
Sbjct: 185 --AMEGH-ENKDDNKFLKISSCCKHFSAYSQE----VPRHRNDAIVTKQDQADTYFPAFE 237

Query: 281 SCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNY 339
            C+++G  S IMCSYN VNG+P+C  +G L    RN+W F GYITSDC+AVA +    ++
Sbjct: 238 DCVKRGHVSSIMCSYNAVNGIPSCADKGLLTDLVRNQWKFDGYITSDCEAVADVIYRHHF 297

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           T++ E + A  L AGMD+NCG  + +H  SAI++G V  + +  AL N F V +RLG+F 
Sbjct: 298 TQSPEQTCATTLDAGMDLNCGEFLRQHLSSAIEQGIVSTEMVHNALKNQFRVMMRLGMFE 357

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVS---SLAIIG 456
              +   +  +  D V T+ H++LAL+AARQ +VLLKN+   LPL  +  S   SLA+IG
Sbjct: 358 KGTQ--PFSNITKDAVDTAAHRQLALEAARQSVVLLKNEDNTLPLATDVFSKDGSLALIG 415

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKK 516
           P  N  + + G Y GIP    + L+G+ +YV    Y+ GC  V       F EA+ + KK
Sbjct: 416 PHFNASTALLGNYFGIPSHIVTPLKGVSSYVPNVAYSLGC-KVSGEVLPDFDEAIEVVKK 474

Query: 517 ADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
           AD V+V  GLD +QE E+ DR  L LPG Q++L+  +   +  P++LVL  GG +D+S  
Sbjct: 475 ADRVVVFMGLDQSQEREEIDRYHLKLPGFQIALLNRILAAASHPIVLVLISGGSVDLSLY 534

Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRAD 635
           +   ++ +I++ GY G+AG +ALA+++FG ++P GRL  T+Y   +   +P+ DM+MR  
Sbjct: 535 KNHPKVGAIVFGGYLGQAGGQALADMLFGKYSPAGRLTQTFYDSDYVNTMPIYDMHMRPT 594

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
                PGR+YRF++G  VY FG GLSYT +                              
Sbjct: 595 FVTGNPGRTYRFFSGAPVYEFGFGLSYTTFH----------------------------- 625

Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ-GTPEKQLIGFD 754
                       +C S     +I+VTN GDV+G   ++++A  P   + G P + L+ F+
Sbjct: 626 -----------KACRSCVASFEITVTNLGDVEGEDAILIYAEPPHAGEGGRPLRSLVAFE 674

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           R   V  G    +      +  ++AN  G  ++  GN  + V  L+H + ++
Sbjct: 675 RTALVTTGKTATADFCLEAKAFALANAEGSWVVEQGNWTIHVDTLQHRVNVQ 726


>gi|167525174|ref|XP_001746922.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774702|gb|EDQ88329.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1620

 Score =  563 bits (1452), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 327/770 (42%), Positives = 439/770 (57%), Gaps = 74/770 (9%)

Query: 43   PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSES 102
            PC      ++PFCN SL + TR + +IS L++Q+K+   ++ A A    G+PAY+WWSE+
Sbjct: 915  PCDELPAKNFPFCNASLDLDTRIRDVISRLSIQDKVALTANTAGAAADAGLPAYQWWSEA 974

Query: 103  LHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGL 162
            LHG+  + PGV F G V + TSFPQV+ ++ASFN++LW +IG  ++ EARAM N+ QAGL
Sbjct: 975  LHGVGFS-PGVTFMGKVQAATSFPQVIHTSASFNKTLWHHIGMTISTEARAMNNVNQAGL 1033

Query: 163  TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ-GENWKSDDGGIGFGFREK 221
            TFWAPNINI RDPRWGRGQETPGEDP     YA  FV   Q GE              + 
Sbjct: 1034 TFWAPNINIIRDPRWGRGQETPGEDPYATGLYAANFVPGMQEGE--------------DT 1079

Query: 222  RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
            R +K              S+CCKH   Y+LE W N  R+ FNA+ T+QD  DT+ P F S
Sbjct: 1080 RYIKA-------------SSCCKHFFDYNLEDWHNVDRHHFNAIATDQDIADTYLPAFES 1126

Query: 282  CIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYT 340
            C+  G+AS +MCSYN VNGVP+C   D+    AR  WGF GYITSDC AV  ++    Y 
Sbjct: 1127 CVRFGRASSLMCSYNAVNGVPSCANADIMTTLAREAWGFDGYITSDCGAVEDVYSNHKYY 1186

Query: 341  KTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
             T   +  GVL AGMD++CG+ + +H   AID G V    +D+AL NLF VQ RLG+F  
Sbjct: 1187 NTTGATVNGVLSAGMDVDCGSFLSQHLADAIDSGDVTNATVDQALYNLFRVQFRLGMF-- 1244

Query: 401  DPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            DP + + Y  L  D V T EH++LAL+AARQG+ LL+N    LPL+ +++  LA+IGP  
Sbjct: 1245 DPAEDQPYLNLTTDAVNTPEHQQLALEAARQGMTLLENRDSRLPLDASSIKQLALIGPNA 1304

Query: 460  NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
            N    M G Y G      S  +G++ YVS      G              AV  AK AD 
Sbjct: 1305 NATGVMQGNYNGKAPFLISPQQGVQQYVSNVALELG--------------AVTAAKAADT 1350

Query: 520  VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
            V++V GLD TQE+E  DR  + LPG Q  LV  VA  S  P+++V+  GG +D++  +  
Sbjct: 1351 VVMVIGLDQTQESEGHDREIIALPGMQAELVAQVANASSSPIVVVVMTGGAVDLTPVKDL 1410

Query: 580  SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSR 638
              +         G+AG +ALAE +FGD NPGGRLP T YP     +V M D  MR +++ 
Sbjct: 1411 DNV---------GQAGGQALAETLFGDNNPGGRLPYTLYPADLVNQVSMFDDGMRPNATS 1461

Query: 639  QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRL 698
              PGR+YRFYTGT VY +G GLSYT++SY+        T + SL+  +++ +     +R 
Sbjct: 1462 GNPGRTYRFYTGTPVYAYGTGLSYTSFSYE--------TSTPSLRVSAER-VRAWVAARG 1512

Query: 699  DYVHI-DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV-SQGTPEKQLIGFDRV 756
                I DEV +      ++ ++V N G V G+ VV +F +     + G P K L GF+RV
Sbjct: 1513 QTSFIRDEVDA----EDYITVTVQNNGTVAGADVVQVFIKTTTPGADGNPIKSLCGFERV 1568

Query: 757  HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV-GELRHSLTI 805
                  +  I F V P   LS+ N  G R+   G   + V  E R S+ I
Sbjct: 1569 FLKPGETTSIQFPVTP-HDLSVVNSRGERVAVPGTWTVEVHHEARLSIPI 1617


>gi|357153280|ref|XP_003576399.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
           distachyon]
          Length = 807

 Score =  563 bits (1451), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 302/775 (38%), Positives = 436/775 (56%), Gaps = 75/775 (9%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           Y +C+  L    R + LI  +T++EK+  L D A+  PR+G+P Y+WWSE+LHG++S GP
Sbjct: 60  YRYCDAKLPYGDRVRDLIGWMTVEEKVSNLGDWAAGAPRVGLPPYKWWSEALHGLSSTGP 119

Query: 112 GVNFN-----------GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
              F+             V + T F  V+ SAASFN SLW +IG A++ EARAMYNLG+ 
Sbjct: 120 TTKFDDLKKPRLHSGRAAVFNGTVFANVINSAASFNESLWRSIGQAISTEARAMYNLGKG 179

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GLT+W+PNIN+ RDPRWGR  ETPGEDP VV  YAV FV+  Q      DD   GF    
Sbjct: 180 GLTYWSPNINVVRDPRWGRALETPGEDPFVVGRYAVNFVRGMQ----DVDDAAAGFN--- 232

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
                  G+   R   L  SACCKH  AYD++ W   +R+ F+A +TE+D  +TFQ PF 
Sbjct: 233 -------GDPLSR--PLKTSACCKHYAAYDVDDWYGHTRFKFDARVTERDMVETFQRPFE 283

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
            C+  G AS +MCSYN+VNG+PAC    L     R +WG  GYI SDCDAV  + +   +
Sbjct: 284 MCVRDGDASAVMCSYNRVNGIPACADARLLAGTLRRDWGLHGYIVSDCDAVRVMTDNATW 343

Query: 340 TK-THEDSAAGVLKAGMDINCGTCML------------RHTQSAIDKGKVQEKDIDRALL 386
              T  +++A  LKAG+D++CG   +             +  +A+ +GK++E DID AL+
Sbjct: 344 LGYTPAEASAASLKAGLDLDCGESWIVQKGKPVMDFLSTYGMAAVRQGKMRESDIDNALV 403

Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
           NL++  +RLG F+G PR   Y  L   D+C+  H+ LALD ARQ +VLLKN    LPL+ 
Sbjct: 404 NLYTTLMRLGYFDGMPR---YESLDEKDICSEAHRSLALDGARQSMVLLKNLDGLLPLDA 460

Query: 447 NAVSSLAIIGPLVNNISQ-MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
           + ++S+A+ GP      + M G YTG PC           Y++                 
Sbjct: 461 SKLASVAVRGPHAEAPEKVMDGDYTGPPCR----------YITPRE-------------- 496

Query: 506 GFHEAVRIAKK-ADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           G  + V I+++  D  I + G+++  E E  DR  LLLP  Q   +  VA  S  P++LV
Sbjct: 497 GISKDVNISQQGGDVTIYMGGINMHIEREGNDREDLLLPKNQTEEILRVAAASPSPIVLV 556

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT- 623
           +  GG +DVSFA++  +I +ILW GYPG  G  A+A++IFG +NPGGRLP+TW+   +  
Sbjct: 557 ILSGGGIDVSFAQSHPKIGAILWAGYPGGEGGHAIADVIFGRYNPGGRLPLTWFKNKYIH 616

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISASL 682
           ++PM  M +R      YPGR+Y+FY G  V Y FG+GLSYT + Y+ L+  + +T++   
Sbjct: 617 QLPMTSMALRPRPEHGYPGRTYKFYDGPDVLYPFGYGLSYTKFRYELLNKETAVTLAPGR 676

Query: 683 KAGSDKNILQQTGS-RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
           +    + +  +TGS   D   +D  +   +      +SV NAG  DG++ V+++   P  
Sbjct: 677 R--HCRQLSYKTGSVGPDCPAVDVASHACAETVSFNVSVVNAGKADGANAVLVYTAPPAE 734

Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
             G P KQ+  F RV   A  ++ + F ++ C+   I  K    ++P G   ++V
Sbjct: 735 LAGAPIKQVAAFRRVAVKAGAAETVVFTLNVCKAFGIVEKTAYTVVPSGVSTVIV 789


>gi|413954831|gb|AFW87480.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 814

 Score =  561 bits (1446), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 305/782 (39%), Positives = 436/782 (55%), Gaps = 83/782 (10%)

Query: 48  HFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA 107
           +  ++P+C+ SL  + R + LI  +T++EK+  L D +   PR+G+P Y+WWSE+LHG++
Sbjct: 64  NMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNLGDISHGAPRVGLPPYKWWSEALHGVS 123

Query: 108 SNGPGVNFN------------GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           S GP + F+             TV++ T F  V+ SAASFN +LW++IG AV+ EARAMY
Sbjct: 124 STGPTMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWNSIGQAVSTEARAMY 183

Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           NLG+ GLT+W+PNIN+ RDPRWGR  ETPGEDP V   YAV FV+  Q      D  G  
Sbjct: 184 NLGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVAGRYAVNFVRGMQ------DIPGHY 237

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
            G    R +K              SACCKH  AYD++ W N +R++++A ++E+D  +TF
Sbjct: 238 SGDPSARPIK-------------TSACCKHHAAYDVDNWHNQTRFTYDARVSERDMAETF 284

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIF 334
             PF  C+ +G  S +MCSYN+VNGVPAC    L     R EW   GYI SDCDAV  + 
Sbjct: 285 LRPFEMCVREGDVSSVMCSYNRVNGVPACADARLLSGTVRGEWHLNGYIVSDCDAVRVMT 344

Query: 335 EYQNYTK-THEDSAAGVLKAGMDINCGTCML------------RHTQSAIDKGKVQEKDI 381
           +   +   T  +S+A  L+AGMD++C    +             +  +A+ +GK++E DI
Sbjct: 345 DNATWLNFTAAESSAVSLRAGMDLDCAESWIEEEGRPLRDYLSEYGMAAVAQGKMRESDI 404

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           D AL NL+   +RLG F+  PR   Y  L   DVCT EHK LALD ARQGIVLLKND   
Sbjct: 405 DNALTNLYMTLMRLGYFDNIPR---YASLNETDVCTDEHKSLALDGARQGIVLLKNDHGL 461

Query: 442 LPLNKNAVSSLAIIGPLVNNISQ-MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVP 500
           LPL+     ++A+ GP      + M G YTG PC           YV+            
Sbjct: 462 LPLDPKKTLAVAVHGPHARAPEKIMDGDYTGPPCR----------YVTPRQ--------- 502

Query: 501 CNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP 560
                G    V+I+ KA   I + G++L  E E  DR  LLLP  Q   +   A+ S  P
Sbjct: 503 -----GISRDVKISHKAKMTIYLGGINLYIEREGNDREDLLLPKNQTEEILHFAQASPTP 557

Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
           +ILV+  GG +D+SFA+   +I +ILW GYPG  G  A+A++IFG +NPGGRLP+TW+  
Sbjct: 558 IILVILSGGGIDISFAQKHPKIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKN 617

Query: 621 SFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTI 678
            +  ++PM  M  R    + YPGR+Y+FY G +V Y FG+GLSYT + Y+  ++   +++
Sbjct: 618 KYIEQIPMTSMEFRPVPEKGYPGRTYKFYDGPEVLYPFGYGLSYTKFQYE--TSTDGVSV 675

Query: 679 SASLKAGSDKNILQQ----TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVML 734
           S     G  K +  +    T      V++ +     ++ F+V  SVTNAG   G+HVV++
Sbjct: 676 SLPAPGGHCKGLSYKPSVATVPACQAVNVADHACTETVSFNV--SVTNAGGRGGAHVVLV 733

Query: 735 FARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVL 794
           +   P      P KQ+  F RV   A+ +  + F ++ C+   I  +    ++P G   +
Sbjct: 734 YTAPPPEVAEAPIKQVAAFRRVFVAARSTATVPFALNVCKAFGIVERTAYTVVPSGVSKV 793

Query: 795 MV 796
           +V
Sbjct: 794 LV 795


>gi|125535311|gb|EAY81859.1| hypothetical protein OsI_37025 [Oryza sativa Indica Group]
          Length = 816

 Score =  556 bits (1434), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 309/786 (39%), Positives = 438/786 (55%), Gaps = 76/786 (9%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           + +C+ SL  + R + LI  +T++EK+  L D      R+G+PAY WWSE+LHG++S GP
Sbjct: 67  FRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGP 126

Query: 112 GVNFN-----------GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
              F+             V + T F  V+ SAASFN +LW +IG AV+ EARAMYN+G+ 
Sbjct: 127 TTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKG 186

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GLT+W+PNIN+ RDPRWGR  ETPGEDP VV  YAV FV+  Q      D  G       
Sbjct: 187 GLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQ------DIPG------- 233

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
              +   G+ + R   L  SACCKH  AYDL+ W N +R+ F+A + E+D  +TFQ PF 
Sbjct: 234 HEAVAAGGDPNTR--PLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFE 291

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNY 339
            C+  G  S +MCSYN+VNG+PAC    L  Q  R +WG  GYI SDCDAV  + +   +
Sbjct: 292 MCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATW 351

Query: 340 TK-THEDSAAGVLKAGMDINCGTCMLRHTQ-------------SAIDKGKVQEKDIDRAL 385
              T  +++A  LKAG+D++CG      T+              A++KGK++E DID AL
Sbjct: 352 LGYTGAEASAAALKAGLDLDCGESWKNDTEGHPLMDFLTTYGMEAVNKGKMRESDIDNAL 411

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
            N +   +RLG F+      +Y  LG  D+CT +HK LALD ARQGIVLLKND K LPL+
Sbjct: 412 TNQYMTLMRLGYFD---DITQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLD 468

Query: 446 KNAVSSLAIIGPLVNNISQ-MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSD 504
            N V  + + GP V    + M G YTG PC   +  +G+  Y                  
Sbjct: 469 ANKVGFVNVRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKY------------------ 510

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
                 VR + +A+  I   GL+L  E E  DR  +LLP  Q   +  VA+ S  P+ILV
Sbjct: 511 ------VRFSHRANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILV 564

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT- 623
           +  GG +DVSFA+ + +I +ILW GYPG  G  A+A++IFG  NP GRLP+TW+   +  
Sbjct: 565 ILSGGGIDVSFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIY 624

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISAS- 681
           ++PM  M++R  +   YPGR+Y+FY G  V Y FG+GLSYT + Y+  +  + LT+  + 
Sbjct: 625 QLPMTSMDLRPVAKHGYPGRTYKFYNGPDVLYPFGYGLSYTKFLYEMGTNGTALTVPVAG 684

Query: 682 --LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
              K  S K+ +         ++++      ++ F+V  SVTN GD  GSH V++F++ P
Sbjct: 685 GHCKKLSYKSGVSSAAPACPAINVNGHACTETVSFNV--SVTNGGDTGGSHPVIVFSKPP 742

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
                 P KQ++ F  V   A  +  +SF ++ C+   I  K    ++P G   ++V  +
Sbjct: 743 AEVDDAPIKQVVAFRSVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTVLVENV 802

Query: 800 RHSLTI 805
             S++ 
Sbjct: 803 DSSVSF 808


>gi|115486735|ref|NP_001068511.1| Os11g0696400 [Oryza sativa Japonica Group]
 gi|77552754|gb|ABA95551.1| Glycosyl hydrolase family 3 C terminal domain containing protein
           [Oryza sativa Japonica Group]
 gi|113645733|dbj|BAF28874.1| Os11g0696400 [Oryza sativa Japonica Group]
          Length = 816

 Score =  555 bits (1429), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 305/787 (38%), Positives = 436/787 (55%), Gaps = 79/787 (10%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           + +C+ SL  + R + LI  +T++EK+  L D      R+G+PAY WWSE+LHG++S GP
Sbjct: 68  FRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPAYRWWSEALHGLSSTGP 127

Query: 112 GVNFN-----------GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
              F+             V + T F  V+ SAASFN +LW +IG AV+ EARAMYN+G+ 
Sbjct: 128 TTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIGQAVSTEARAMYNMGKG 187

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GLT+W+PNIN+ RDPRWGR  ETPGEDP VV  YAV FV+  Q                 
Sbjct: 188 GLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQ-------------DIPG 234

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
              +   G+ + R   L  SACCKH  AYDL+ W N +R+ F+A + E+D  +TFQ PF 
Sbjct: 235 HEAVAAGGDPNTR--PLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPFE 292

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNY 339
            C+  G  S +MCSYN+VNG+PAC    L  Q  R +WG  GYI SDCDAV  + +   +
Sbjct: 293 MCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNATW 352

Query: 340 TK-THEDSAAGVLKAGMDINCGTC-------------MLRHTQSAIDKGKVQEKDIDRAL 385
              T  +++A  LKAG+D++CG               +  +   A++KGK++E DID AL
Sbjct: 353 LGYTGAEASAAALKAGLDLDCGESWKNDTDGHPLMDFLTTYGMEAVNKGKMRESDIDNAL 412

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
            N +   +RLG F+      +Y  LG  D+CT +HK LALD ARQGIVLLKND K LPL+
Sbjct: 413 TNQYMTLMRLGYFD---DIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLD 469

Query: 446 KNAVSSLAIIGPLVNNISQ-MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSD 504
            N V  + + GP V    + M G YTG PC   +  +G+  Y                  
Sbjct: 470 ANKVGFVNVRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKY------------------ 511

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
                 VR + +A+  I   GL+L  E E  DR  +LLP  Q   +  VA+ S  P+ILV
Sbjct: 512 ------VRFSHRANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILV 565

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT- 623
           +  GG +DVSFA+ + +I +ILW GYPG  G  A+A++IFG  NP GRLP+TW+   +  
Sbjct: 566 ILSGGGIDVSFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIY 625

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISASL 682
           ++PM  M++R  +   YPGR+Y+FY G  V Y FG+GLSYT + Y+  +  + L +  + 
Sbjct: 626 QLPMTSMDLRPVAKHGYPGRTYKFYDGPDVLYPFGYGLSYTKFLYEMGTNGTALIVPVA- 684

Query: 683 KAGSDKNILQQTG----SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
             G  K +  ++G         ++++      ++ F+V  SVTN GD  GSH V++F++ 
Sbjct: 685 -GGHCKKLSYKSGVSTAPACPAINVNGHVCTETVSFNV--SVTNGGDTGGSHPVIVFSKP 741

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           P      P KQ++ F  V   A  +  +SF ++ C+   I  K    ++P G   ++V  
Sbjct: 742 PAEVDDAPMKQVVAFKSVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTILVEN 801

Query: 799 LRHSLTI 805
           +  S++ 
Sbjct: 802 VDSSVSF 808


>gi|242093144|ref|XP_002437062.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
 gi|241915285|gb|EER88429.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
          Length = 809

 Score =  553 bits (1424), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 300/804 (37%), Positives = 440/804 (54%), Gaps = 74/804 (9%)

Query: 23  VVVNVIAFSNS-KPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQL 81
             VNV +F  +   V +   F     +  ++P+C+ SL  + R + LI  +T++EK+  L
Sbjct: 31  TTVNVTSFGKAYTKVCDADRFAEMGLNMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNL 90

Query: 82  SDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN------------GTVSSVTSFPQVL 129
            D +   PR+G+P Y+WWSE+LHG++S GP + F+             TV++ T F  V+
Sbjct: 91  GDVSHGAPRVGLPPYKWWSEALHGVSSTGPTMLFDDLHSKPGNHSGRATVNNATVFANVI 150

Query: 130 VSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPM 189
            SAASFN +LW +IG AV+ EARAMYNLG+ GLT+W+PNIN+ RDPRWGR  ETPGEDP 
Sbjct: 151 NSAASFNETLWKSIGQAVSTEARAMYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPF 210

Query: 190 VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAY 249
           V   YAV FV+  Q  +    DGG              G++      +  SACCKH  AY
Sbjct: 211 VAGRYAVNFVRGMQ--DIPGHDGG--------------GDDPST-RPIKTSACCKHYAAY 253

Query: 250 DLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDL 309
           D++ W N +R++F+A ++E+D  +TF  PF  C+  G AS +MCSYN+VNG+PAC    L
Sbjct: 254 DVDDWHNHTRFTFDARVSERDMAETFLRPFEMCVRDGDASGVMCSYNRVNGIPACADARL 313

Query: 310 FQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK-THEDSAAGVLKAGMDINCGTCML--- 364
                R +W   GYI SDCDAV  + +   +   T  +S+A  ++AG+D++C    +   
Sbjct: 314 LSGTIRGDWQLHGYIVSDCDAVRVMTDNATWLHFTGAESSAASIRAGLDLDCAESWIEEK 373

Query: 365 ---------RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
                     + ++A+ +GK++E DID AL N +   +RLG F+  PR   Y  L   D+
Sbjct: 374 GRPLRDFLSEYGKAAVAQGKMRESDIDSALRNQYMTLMRLGYFDNIPR---YASLNETDI 430

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ-MGGGYTGIPC 474
           CT EHK LA D ARQG+VLLKND   LPL+   + ++A+ GP      + M G YTG PC
Sbjct: 431 CTDEHKSLAHDGARQGMVLLKNDDGLLPLDPEKILAVAVHGPHARAPEKIMDGDYTGPPC 490

Query: 475 SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED 534
                      YV+                 G  + V+I+ +A+  I + G++L  E E 
Sbjct: 491 R----------YVTPRQ--------------GISKDVKISHRANTTIYLGGINLHIEREG 526

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR  LLLP  Q   +   A+ S  P+ILV+  GG +D+SFA    +I +ILW GYPG  
Sbjct: 527 NDREDLLLPKNQTEEILHFAKASPNPIILVILSGGGIDISFAHKHPKIGAILWAGYPGGE 586

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
           G  A+A++IFG +NPGGRLP+TW+   +  ++PM  M  R    + YPGR+Y+FY G +V
Sbjct: 587 GGNAIADVIFGRYNPGGRLPLTWFKNKYIQQIPMTSMEFRPVPEKGYPGRTYKFYDGPEV 646

Query: 654 -YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL 712
            Y FG+GLSYT + Y+  +  + +T+ A+       +      +      +D      + 
Sbjct: 647 LYPFGYGLSYTKFLYETSTNGTAVTLPATGGHCKGLSYKPSVATTPACQAVDVAGHACTE 706

Query: 713 RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
                ISVTNAG   G+HVV+++   P      P KQ+  F RV   A+ +  + F ++ 
Sbjct: 707 TVSFNISVTNAGGRGGAHVVLVYTAPPPEVAQAPIKQVAAFRRVFVPARSTATVPFTLNV 766

Query: 773 CEQLSIANKHGRRILPLGNHVLMV 796
           C+   I  +    ++P G   ++V
Sbjct: 767 CKAFGIVERTAYTVVPSGVSKVLV 790


>gi|340370204|ref|XP_003383636.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 755

 Score =  550 bits (1417), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 313/774 (40%), Positives = 449/774 (58%), Gaps = 73/774 (9%)

Query: 36  VLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
           V ++ D+ C+ P F++Y +CN S SI+ R K L+S LT+ EK+ Q + NASAI RL IPA
Sbjct: 41  VASRADY-CEKPPFNAYLYCNYSASITERVKDLLSRLTVLEKMSQTATNASAIERLDIPA 99

Query: 96  YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           Y+WWSE LHG+A + PGV F   ++S TSFPQV+   A+FN SL   +G  ++ EARA  
Sbjct: 100 YDWWSECLHGLAQS-PGVFFENDLTSATSFPQVIGLGATFNMSLVLAMGQVISTEARAFA 158

Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           N GQ+GLTF+APNINI+RDPRWGRGQETPGEDP + S YA  FVK            GI 
Sbjct: 159 NNGQSGLTFFAPNINIYRDPRWGRGQETPGEDPYLTSQYAANFVK------------GIQ 206

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
            G  ++R LK               A CKH  AY+LE++ +  R +FNA++++QD E+T+
Sbjct: 207 EGSEDRRYLKAI-------------ATCKHYAAYNLERYLDVRRVNFNAIVSDQDLEETY 253

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIF 334
            P F++C+++G+   IMCSYN +NGVP C    +  K AR+ WGF+GYI SDC A+  I 
Sbjct: 254 LPAFKACVQEGQVGSIMCSYNAINGVPNCANDFINNKIARDTWGFEGYIVSDCGAILDIQ 313

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
              NYT     + A  LK G D+NCG    ++ + A D   + E+DID++L  LF+ ++R
Sbjct: 314 YKHNYTSDTNITVADALKGGCDLNCGHFYEKYMEDAFDNSTITEEDIDKSLTRLFTSRMR 373

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG+F+  P    + +    DV T E + LAL+AAR+GIVLL+N    LPL+    S++A 
Sbjct: 374 LGMFD-PPEIQPFRQYSVKDVNTPEAQDLALNAAREGIVLLQNKGSVLPLDIVKHSNIAA 432

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IGP  +    M G Y GI     S L+G         Y  GC  V CN   GF +AV+  
Sbjct: 433 IGPNADATHIMQGNYHGIAPYLISPLQGFSNLGINATYQIGC-PVACNDTEGFPDAVKAV 491

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKR--PVILVLTGGGPLD 572
           +  D VI V GL+ TQE E  DR S+ LPG Q  L+  + + + +  P+I+V+  GG +D
Sbjct: 492 QGVDAVIAVIGLNNTQEGESHDRTSIALPGHQEDLLLELKKNAAKGTPLIVVVMSGGSVD 551

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMN 631
           ++  +  +   +ILW GYPG++G +A+AE+I+G  NP GRLP+T+YP S+  ++P  +M+
Sbjct: 552 LTGVKDIAD--AILWAGYPGQSGGQAIAEVIYGKVNPSGRLPVTFYPASYINEIPYTNMS 609

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
           MR       PGRSY+FYTGT V+ FG GLSYT +  K+                  K+  
Sbjct: 610 MRVP-----PGRSYKFYTGTPVFPFGFGLSYTTFEIKW------------------KDTS 646

Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF--ARVPKVSQGTPEKQ 749
                 L   H DEV        + + +VTN+G   GS  V+ F  + VP    G P K+
Sbjct: 647 TAKDYYLKTTH-DEVV-------NYEATVTNSGSRPGSVSVLAFITSSVP----GAPMKE 694

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
           L  F +++     S ++SF  +P +  +  + +G R +  G + +++G+  H +
Sbjct: 695 LFAFKKIYLEPTESVDVSFVAEP-KVFTTVDIYGIRKIRPGAYKIIIGDDDHHI 747


>gi|326431595|gb|EGD77165.1| beta-glucosidase [Salpingoeca sp. ATCC 50818]
          Length = 900

 Score =  545 bits (1404), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 313/755 (41%), Positives = 436/755 (57%), Gaps = 53/755 (7%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           S PFCNT+LS   R + LIS +   +    L ++A+ +  L +PAY+WWSE+LHG+  + 
Sbjct: 181 SLPFCNTALSYDDRIRDLISRINDSDLPGLLVNSATGVEHLNLPAYQWWSEALHGVG-HS 239

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
           PGV+F G V + TSFPQV+ + A+FN++L+  IG+ ++ EARAM N+ +AG TFWAPNIN
Sbjct: 240 PGVHFGGDVPAATSFPQVIHTGATFNKTLYRKIGTVISTEARAMNNVQRAGNTFWAPNIN 299

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           I RDPRWGRGQETPGEDP     YA  FV  FQ                        GE+
Sbjct: 300 IIRDPRWGRGQETPGEDPFATGEYAANFVSGFQD-----------------------GED 336

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
            +    +  S+CCKH   Y+LE W    R+ +NA+ T+QD  DT+ P F +C+  G+AS 
Sbjct: 337 MNY---IKASSCCKHFFDYNLENWHGVDRHHYNAIATDQDIADTYLPSFEACVRYGRASG 393

Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MCSYN VNGVP+C  GD+    AR  WGF GYITSDC AVA +     +T+   ++   
Sbjct: 394 LMCSYNAVNGVPSCANGDIMTVMARESWGFDGYITSDCGAVADVLNSHKFTRNTSETIRA 453

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP-RKGKYG 408
           VL+AGMD +CG+ + ++   A+ +G V  + ++ AL  LF VQ RLGLF  DP  K  Y 
Sbjct: 454 VLEAGMDTDCGSFVQQYLAKAMQEGVVPRELVNTALHRLFMVQFRLGLF--DPVSKQPYT 511

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
                 V T  +++LAL+AA+QGIVLLKN    LPL       +A+IGP  +  + M G 
Sbjct: 512 NYSVARVNTPANQQLALEAAQQGIVLLKNTNARLPLKTGL--HVALIGPNADATTVMQGN 569

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G      S +RG + Y +   YA GC DV C   +GF  AV  AK+AD V+VV GLD 
Sbjct: 570 YQGTAPFLISPVRGFKNYSAAVTYAKGC-DVACKDTSGFDAAVAAAKEADAVVVVVGLDQ 628

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
            QE+E  DR S+ LPG Q  LV  VA  +K P+++ +  GG +D+S  +A+  ++ ILW 
Sbjct: 629 GQESEGHDRTSITLPGHQEDLVAQVAAAAKSPIVVFVMTGGAVDLSTIKANKNVAGILWC 688

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPGRSYRF 647
           GYPG++G +A+A+++FG  +PGGRLP T YP S+     M D  MR + +   PGR+YRF
Sbjct: 689 GYPGQSGGQAMADVVFGAVSPGGRLPYTIYPGSYVDACSMLDNGMRPNKTSGNPGRTYRF 748

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID--E 705
           YTG  VY +G GLSYT++SY      +  T  A+++       +Q       ++  D  E
Sbjct: 749 YTGKPVYEYGTGLSYTSFSYHIHYLNTMDTSLATVQT-----YVQDAKQNHKFIRYDAPE 803

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ--GTPEKQLIGFDRVHTVAKGS 763
            T        V+++VTN G V G+ VV +F   PK     G P K LIGF+RV       
Sbjct: 804 FT-------RVEVNVTNVGRVAGADVVQVFVE-PKTPAELGAPIKTLIGFERVFLNPGQW 855

Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
             + F V+    L+  +  G+R+   G  ++ +G 
Sbjct: 856 TIVQFSVN-AHDLTFVDASGKRVARAGEWLVHIGH 889


>gi|326513064|dbj|BAK03439.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 694

 Score =  536 bits (1382), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 281/642 (43%), Positives = 394/642 (61%), Gaps = 35/642 (5%)

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           P +     P +    ETPGEDP++ S YAV +V   Q     +  GG+  G         
Sbjct: 83  PALGRLGIPAYEWWSETPGEDPLLASKYAVGYVTGLQ----DAGAGGVTDG--------- 129

Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
                     L ++ACCKH  AYD++ W    RY+F+A +++QD +DTFQPPF+SC+  G
Sbjct: 130 ---------ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDG 180

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
             + +MCSYN+VNG P C   DL +   R +W   GYI SDCD+V  ++  Q+YTKT E+
Sbjct: 181 NVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHYTKTPEE 240

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           +AA  +K+G+D+NCG  + +HT +A+  G++ E+D+DRA+ N F + +RLG F+GDPR+ 
Sbjct: 241 AAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQL 300

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            +G LGP DVCTS +++LA + ARQGIVLLKN    LPL+  ++ S+A+IGP  N    M
Sbjct: 301 AFGSLGPKDVCTSSNRELARETARQGIVLLKNSGA-LPLSAKSIKSMAVIGPNANASFTM 359

Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVIVVA 524
            G Y G PC   + L+GL A V+ T Y  GC +V C+ ++     AV  A  AD  ++V 
Sbjct: 360 IGNYEGTPCKYTTPLQGLGAKVN-TVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVV 418

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           G D + E E  DR SLLLPGQQ  LV++VA  S  PVILV+  GGP D+SFA+A  +I++
Sbjct: 419 GADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAA 478

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGR 643
           ILW+GYPGEAG  ALA+I+FG  NP GRLP+TWYP S+   V M DM MR D+S  YPGR
Sbjct: 479 ILWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGR 538

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
           +YRFYTG  V+ FG GLSYT  S+  +SAP       S++   D     +  + ++    
Sbjct: 539 TYRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYV---SMRLAEDHPCRAEECASVEAAG- 594

Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
                C  L F V++ V NAG+V G+H V+LF+  P  +   P K L+GF++V      +
Sbjct: 595 ---DHCDDLAFDVKLQVRNAGEVAGAHSVLLFSS-PPPAHNAPAKHLLGFEKVSLAPGEA 650

Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
             ++F VD C  LS+ ++ G R + LG H L VG+L+H++ +
Sbjct: 651 GTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGDLKHTVEL 692



 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/65 (47%), Positives = 40/65 (61%), Gaps = 2/65 (3%)

Query: 40  PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P F C   +    +Y FCN   + S RA+ L+S LTL EK+  L +   A+ RLGIPAYE
Sbjct: 35  PVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYE 94

Query: 98  WWSES 102
           WWSE+
Sbjct: 95  WWSET 99


>gi|147857580|emb|CAN78858.1| hypothetical protein VITISV_030325 [Vitis vinifera]
          Length = 699

 Score =  533 bits (1372), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 284/662 (42%), Positives = 390/662 (58%), Gaps = 102/662 (15%)

Query: 147 VAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN 206
           V+ EARAMYN+G AGLTFW+PN+NIF+DPRWGRGQETPGEDP++ S YA  +V+  Q   
Sbjct: 137 VSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQ--- 193

Query: 207 WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMI 266
            +SDDG                      D L ++ACCKH  AYDL+ W     + FNA++
Sbjct: 194 -QSDDGS--------------------PDRLKVAACCKHYTAYDLDNWKGVDCFHFNAVV 232

Query: 267 TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSD 326
           T QD +DTFQPPF+SC+  G  + ++                             YI SD
Sbjct: 233 TNQDMDDTFQPPFKSCVIDGNVASVI-----------------------------YIVSD 263

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALL 386
           CD+V   +  Q+YTKT E++AA  + AG+D+NCG+ + +HT++A+  G V E  +D+A+ 
Sbjct: 264 CDSVDVFYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVS 323

Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
           N F+  +RLG F+G+P K  YGKLGP DVCTSEH++ A +A RQGIV             
Sbjct: 324 NNFATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQERAREAPRQGIV------------- 370

Query: 447 NAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAG 506
                                 + G PC   + L+GL A V+ T Y  GC +V C + A 
Sbjct: 371 ----------------------FAGTPCKYTTPLQGLTALVATT-YLPGCSNVACGT-AQ 406

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
             EA +IA  AD  +++ G+D + E E RDRV++ LPGQQ  L+T VA+ SK  VILV+ 
Sbjct: 407 IDEAKKIAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKXSKGNVILVVM 466

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KV 625
            GG  D+SFA+ D +I+SI W+GYPGEAG  A+A++IFG +NP G+LPMTWYP+S+  KV
Sbjct: 467 SGGGFDISFAKNDDKITSIQWVGYPGEAGGAAIADVIFGFYNPSGKLPMTWYPQSYVDKV 526

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
           PM +MNMR D +  YPGR+YRFYTG  +Y FG GLSYT +++  + AP  ++I       
Sbjct: 527 PMTNMNMRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEAHS 586

Query: 686 SDKNILQQTGSRLDYVHIDEVT-SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
              +  +          +D V  SC +L F + + V NAG++ GSH V LF+  P V   
Sbjct: 587 CHSSKCKS---------VDAVQESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSV-HN 636

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLT 804
           +P+K L+GF++V   AK    + F VD C+ LSI ++ G R + LG HVL VG L+HSL 
Sbjct: 637 SPQKHLLGFEKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLN 696

Query: 805 IE 806
           + 
Sbjct: 697 VR 698


>gi|301110280|ref|XP_002904220.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
 gi|262096346|gb|EEY54398.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
          Length = 709

 Score =  525 bits (1352), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 287/753 (38%), Positives = 421/753 (55%), Gaps = 74/753 (9%)

Query: 60  SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
           + + R+   ++ + L + +  L + A+  P + IP+YEWW+E+LHG+A + PGV F G++
Sbjct: 3   ATTPRSLHCLTRIPLDQAVGLLVNKAAPAPSVNIPSYEWWNEALHGVALS-PGVTFKGSI 61

Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGR 179
           ++ TSFPQVL +AASFNRSL+  I   ++ EARA +N   AGLTFW PN+NIFRDPRWGR
Sbjct: 62  TAATSFPQVLSTAASFNRSLFYQIADVISTEARAFHNAKDAGLTFWTPNVNIFRDPRWGR 121

Query: 180 GQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELML 239
           GQETPGEDP +   YAV FV+  QGE  +                   G E +    L +
Sbjct: 122 GQETPGEDPYLTGEYAVAFVRGLQGEGME-------------------GREVENSKFLKI 162

Query: 240 SACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVN 299
           S+CCKH  AY  E      R+  NAM+T+QD  DT+ P F  C+++G  S IMCSYN VN
Sbjct: 163 SSCCKHFSAYSQE----VPRHRNNAMVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVN 218

Query: 300 GVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN 358
           G+P+C  +G L    R +W F GYI SDC+AVA + ++ +YT++ E + A  L AGMD+N
Sbjct: 219 GIPSCADKGLLTDLVRGQWKFDGYIASDCEAVADVIDHHHYTQSPEQTCATTLDAGMDLN 278

Query: 359 CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS 418
           CG  + +H   A+++G V  + I  AL N F V +RLG+F    +   +  +  D V T+
Sbjct: 279 CGEFLRQHLPKALEQGIVTTEMIHNALKNQFRVLMRLGMFE---KVEPFANITKDSVDTT 335

Query: 419 EHKKLALDAARQGIVLLKNDKKFLPLNKNAVS---SLAIIGPLVNNISQMGGGYTGIPCS 475
            H++LAL+AARQ IVLLKND   LPL     +   SLA+IGP  N  + + G Y GIP  
Sbjct: 336 MHRQLALEAARQSIVLLKNDGNTLPLATKDFTRDRSLALIGPHFNASAALLGNYFGIPSH 395

Query: 476 PKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR 535
             + L G+  +V    ++ GC  V       F +A+ +AKKAD +IV  GLD +QE E+ 
Sbjct: 396 IVTPLEGISQFVPNVAHSLGC-KVSGEVLPDFDDAIAVAKKADRLIVFVGLDQSQEREEI 454

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+  V   +  P++ V+  GG +D+S  +   ++ +I++ GY G+AG
Sbjct: 455 DRYHIGLPAFQSTLLKRVLEVASHPIVFVVISGGCVDLSAYKNHPKVGAIVFGGYLGQAG 514

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
            +ALA+++FG +NP G+LP T+Y   +   + + DM+MR        GR+YRF+TG  VY
Sbjct: 515 GQALADVLFGKYNPSGKLPQTFYDSEYVNAMSIYDMHMRPTPVTGNSGRTYRFFTGVPVY 574

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG GLSYT +                      KN                   C +   
Sbjct: 575 EFGFGLSYTTFH---------------------KN-------------------CHACVA 594

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQ-GTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
              I+VTNAG + G  V++ +   P   + G P K L+ F+R   +A G +  +      
Sbjct: 595 TFNITVTNAGAISGEDVILTYVEPPLAGEGGRPLKSLVAFERTPLIAAGQRATAKICLEA 654

Query: 774 EQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           +  ++AN+ G  ++  GN  + V  L+H + I+
Sbjct: 655 KAFALANEAGNWVVEPGNWTIHVDTLQHKVDIQ 687


>gi|340377241|ref|XP_003387138.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 733

 Score =  519 bits (1337), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 308/782 (39%), Positives = 434/782 (55%), Gaps = 91/782 (11%)

Query: 44  CKPPHFDSYP-----FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           C  P + + P     +CN  LS   R K L+S LTL+EKI QL ++ASAI RLGIP Y+W
Sbjct: 22  CTEPPYKTRPNFTPAYCNYRLSFKDRVKDLLSRLTLEEKISQLGNSASAIDRLGIPGYQW 81

Query: 99  WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           WSE LHG+A + PG++  G ++  TSFPQ++ +A+SFN+SL+  IG AV+ EAR   + G
Sbjct: 82  WSEGLHGVAVS-PGLHLGGNLTCTTSFPQIITTASSFNKSLFYEIGEAVSTEARGFADNG 140

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           Q GLT++ PNINI RDPRWGRGQET GEDP + S YAV  V+  QG + +          
Sbjct: 141 QGGLTYFTPNINIVRDPRWGRGQETAGEDPYLTSQYAVNLVRGAQGNDSEY--------- 191

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQP 277
             K+++                A CKH  AYDLE + N   R SFNA +T+QD E+T+ P
Sbjct: 192 --KKII----------------ATCKHFAAYDLESYINGDVRDSFNAEVTKQDLEETYFP 233

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEY 336
            FRSC+  G    IMCSYN VNGVP+C+ G    K ARN+W F GY+ SDC A+  +   
Sbjct: 234 AFRSCVTAGGVGSIMCSYNSVNGVPSCVDGVFNNKIARNKWKFDGYLVSDCGAIDDVMNK 293

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
            +YT T  D+ A  LK G D+NCG+    H   A   G + E DIDRA+  LF+ ++RLG
Sbjct: 294 HHYTSTPTDTVAAGLKGGTDLNCGSFYQTHAMDAFLNGSITEVDIDRAVGRLFTARMRLG 353

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
           LF+  P+   Y     D V T +H+ LAL AAR+ IVLL+N+ K LPL+      +A++G
Sbjct: 354 LFDL-PKYQPYSYFNTDVVNTKQHQDLALQAARESIVLLQNNGK-LPLSYEDHHKIAVVG 411

Query: 457 P-LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGCHDVPCNSDAGFHEAVR 512
           P ++ N++  G      P     L+  ++ + SK     Y+ GC DV C    GFH+A +
Sbjct: 412 PNILANVTMQGISQVIAP----YLISPVDGFKSKGLHVTYSLGC-DVKCIVTDGFHDAFK 466

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART-----SKRPVILVLTG 567
           + K A  V+ V GLD   E E  DR  + LPG Q   +  +  T     S  P+I+V+  
Sbjct: 467 LVKDAKAVVAVMGLDQGIERETVDREDIFLPGLQDKFLLGLRDTLTNLQSPVPLIVVIMS 526

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G  +D+S  E+ S   +ILW+GYPG++G +A+AE+I+G+ NP GRLP+T+YP  +  +  
Sbjct: 527 GSSVDLS--ESKSLADAILWVGYPGQSGGQAIAEVIYGEVNPSGRLPLTFYPGEYIDL-- 582

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF---LSAPSELTISASLKA 684
             +  R  S R+ PGR+YRFYT   V+ FGHGLSYT +   +   ++  +E+ IS S+  
Sbjct: 583 --VAYRHMSMREPPGRTYRFYTENPVFPFGHGLSYTTFELSWTNKMNNVTEIVISDSVD- 639

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
                           ++ID             I+V N G + G+  V +   V      
Sbjct: 640 ----------------INID-----------FDITVVNTGYLSGA--VSVLGYVSSNIPD 670

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLT 804
            P ++L  FD+V      SK+IS         ++  K  R ILP G + + +  L H + 
Sbjct: 671 APLRELFDFDKVFIDKYESKKISLFATNDAFTTVDEKGRRNILP-GEYDIAIENLSHKII 729

Query: 805 IE 806
           I+
Sbjct: 730 IK 731


>gi|223945397|gb|ACN26782.1| unknown [Zea mays]
          Length = 516

 Score =  518 bits (1335), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 258/514 (50%), Positives = 345/514 (67%), Gaps = 14/514 (2%)

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
           MCSYN+VNGVP C   +L    AR +WGF GYITSDCDAVA I + Q Y KT ED+ A V
Sbjct: 1   MCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGYAKTAEDAVADV 60

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
           LKAGMD+NCG+ +  H  SA+ +GK+ E+DI+RAL NLF+V++RLGLFNGDPR+  YG +
Sbjct: 61  LKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDI 120

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKND--KKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
           GPD VCT EH+ LAL+AA+ GIVLLKND     LPL+K  V+SLA+IG   N+  ++ G 
Sbjct: 121 GPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGN 180

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G PC   + L+ L+ YV  T + +GC+   CN      EAV+ A  AD V++  GLD 
Sbjct: 181 YFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTT-IPEAVQAASSADSVVLFMGLDQ 239

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
            QE E+ DR+ L LPGQQ +L+ SVA  +K+PVILVL  GGP+DVSFA+ + +I +ILW 
Sbjct: 240 DQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWA 299

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
           GYPGEAG  A+A+++FG+ NPGGRLP+TWYP+ FT+VPM DM MRAD +  YPGR+YRFY
Sbjct: 300 GYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMRMRADPATGYPGRTYRFY 359

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
            G  V+ FG+GLSY+ YS++F + P   +  A LKA      ++ T   +    ++ + S
Sbjct: 360 RGPTVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKA------VEATAGGMASYDVEAIGS 413

Query: 709 --CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQLIGFDRVHTVAKGSK 764
             C  L+F   + V N G +DG H V++F R P  + G+  P  QLIGF  +H  A  + 
Sbjct: 414 ETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTA 473

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
            + F V PC+  S A + GR+++  G+H +MVGE
Sbjct: 474 HVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGE 507


>gi|340370208|ref|XP_003383638.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 732

 Score =  516 bits (1329), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 297/779 (38%), Positives = 431/779 (55%), Gaps = 88/779 (11%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C+   F S+ +CN SL IS R K L+S +TL EKI QL + A +I RL IPAY+WWSE L
Sbjct: 24  CEKTKFQSFSYCNYSLPISDRVKDLLSRMTLAEKITQLGNTAGSIDRLDIPAYQWWSEGL 83

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
           HG+A + PGV+FNG   + TSFPQV+ +A+SFN++L+  I + ++ EARA  N    G+ 
Sbjct: 84  HGVA-DSPGVHFNGMFHNATSFPQVITTASSFNKTLYHEIAAVMSTEARAFAN---QGIV 139

Query: 164 FWAPNINIF--------RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           ++  +  +         RDPRWGR QETPGEDP + S YA++FV   QG++         
Sbjct: 140 YFKQHQQLLSNYLLFYCRDPRWGRAQETPGEDPYLNSQYAIQFVTGAQGDS--------- 190

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGN-FSRYSFNAMITEQDTEDT 274
                                L +   CKH   YDLE + +  +R+SFNA IT QD E+T
Sbjct: 191 -------------------KYLKVVTTCKHFAGYDLEDYVDGETRHSFNAKITPQDFEET 231

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATI 333
           + P F++C+E+   + IMCSYN+VNGVP+C  G +  K AR+ WGF G+I SDC A+  I
Sbjct: 232 YYPAFKACVEEANVASIMCSYNEVNGVPSCADGQINNKLARDTWGFDGFIASDCGAIDDI 291

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
               +YT   +D+ A  LK G D+NCG+    H QSA   G +   +I+ AL  LF+ ++
Sbjct: 292 QNKHHYTNNTDDTVAAALKGGCDLNCGSYYQSHAQSAFLNGTITIGEINLALTRLFTARM 351

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           +LG+F+  P    Y  + PD V + EH+ LAL+AAR+ IVLL+N+   LPLN    S++A
Sbjct: 352 KLGMFD-PPELQPYNAISPDVVNSLEHQALALNAARESIVLLQNNNDVLPLNFEKHSTIA 410

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAY-VSKTHYASGCHDVPCNSDAGFHEAVR 512
           ++GP       M G Y G+     S + G E   +     ASGC DV C    GF +A  
Sbjct: 411 VVGPHAMATDVMQGNYNGVAPYLISPVEGFENLGIDSVLTASGC-DVNCEVTDGFQDAFD 469

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK-----RPVILVLTG 567
           IA KAD VI V GLD + E+E  DR  L LP  Q   V  +  T K      P+I+V+  
Sbjct: 470 IAVKADAVIAVLGLDQSHESEGHDREDLFLPNLQDKFVQDLKNTLKAAGTNAPLIVVVMS 529

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G  +D++  +  +   +ILW GYPG++G +A+AEII+G  NP GRLP+T+YP S+  +  
Sbjct: 530 GSSVDLTVTKKHAD--AILWAGYPGQSGGQAIAEIIYGKVNPSGRLPVTFYPGSYIDL-- 585

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
             +  R  S R+YPGR+Y+FY  T  + FG GLSYT + Y   S P              
Sbjct: 586 --VAFRHMSMREYPGRTYKFYNDTPDFSFGDGLSYTTF-YLEWSKP-------------- 628

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
                        V++  V S +       ++VTN G + G+  + + A +   + G P+
Sbjct: 629 -------------VNMSGVRSVSYPTVVYNVTVTNTGKMPGA--ISVLAYISYNNSGAPK 673

Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-ELRHSLTI 805
           K+L GF++V      S  ++F  D  +  S  +K G+R +  G++ + +G +L H +++
Sbjct: 674 KKLFGFEKVFLNPLQSVSVTFPAD-SKAFSTVDKSGKRSVNPGDYHVTIGDQLIHKISL 731


>gi|300121549|emb|CBK22068.2| unnamed protein product [Blastocystis hominis]
          Length = 690

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 285/740 (38%), Positives = 410/740 (55%), Gaps = 77/740 (10%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA++L++ LTL EK+  +   AS + RL IP Y+WWSE+LHG+A++ PGV F       T
Sbjct: 4   RARALVAELTLAEKMSLMGHTASEVKRLNIPKYQWWSEALHGVAAS-PGVVFQEPTPFAT 62

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQET 183
           +FPQV ++A SF++ L+ +I S ++ EAR M N  +A LT+W+PN+N++RDPRWGRGQET
Sbjct: 63  AFPQVALTAQSFDKPLFHDIASIISTEARVMNNAERANLTYWSPNVNVYRDPRWGRGQET 122

Query: 184 PGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACC 243
           PGEDP +V+ YAVEFV+  Q                          E +    L +SACC
Sbjct: 123 PGEDPFLVATYAVEFVRGLQ--------------------------EGEDPRYLKVSACC 156

Query: 244 KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA 303
           KH  AYDLE W    R+ F+A+++++D  DTFQ PF  C+++G  S +MCSYN +NG+PA
Sbjct: 157 KHYSAYDLENWHGVERFEFDAIVSDRDMTDTFQVPFEQCVKKGHVSSLMCSYNAINGIPA 216

Query: 304 CL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC 362
           C  R  L+  AR  WGF+GYITSDC A+ TI    +YT   + +A   ++A  D++CG  
Sbjct: 217 CADRELLYGTARGGWGFEGYITSDCGAIDTIIYNHHYTNDTDTTAMLGVRATCDLDCGGF 276

Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHK 421
             +H   +++ G+++E ++D AL NLF VQ+RLGLF  DP + + Y   G D + T EH+
Sbjct: 277 YQQHILHSVESGRLKEAEVDDALANLFKVQMRLGLF--DPVEQQVYTHYGLDKLNTKEHQ 334

Query: 422 KLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLR 481
            +AL AAR+GI LLKN   FLPL+      + ++GP   +   M G Y GIP    ++ +
Sbjct: 335 AMALRAAREGIALLKNQNDFLPLSLKD-KHVVVMGPYAEDAGVMLGNYNGIPEFIVTVAQ 393

Query: 482 GLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLL 541
           GL          + C  V         EA+   +  D ++V  GL+   E E  DR  LL
Sbjct: 394 GLR---------NVCDHVDVVKSL---EALSKLEGVDLIVVTVGLNQEIEREGLDREDLL 441

Query: 542 LPGQQMSLVTSVARTSKRPVIL-VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALA 600
           LP  Q +L+  +   +  PV+L +L+GGG +D+S  E +  +  +L +GY G  G +A+A
Sbjct: 442 LPASQRALLDGLLAQTDVPVVLTLLSGGGSVDISAYEQNEHVVGVLAVGYGGMFGGQAIA 501

Query: 601 EIIFGDFNPGGRLPMT-WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           E+I GD NP GRL  T +Y +  T +   DMNMR      +PGR+YRF+ G  ++ FG G
Sbjct: 502 EVIVGDVNPSGRLVNTMYYNDYVTNLDYFDMNMRPKEETGFPGRTYRFFAGPVIHPFGFG 561

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT +++                      I Q    RL           ++L   V + 
Sbjct: 562 LSYTTFAHAV-------------------EIGQMRNHRLR----------SALAIDVYVK 592

Query: 720 VTNAGDVDGSHVVMLFARVPKV-SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
           VTN G   G   V+LF + P    QG P K L  F RV ++A G  +    V   EQL +
Sbjct: 593 VTNTGSRQGDESVLLFVKSPLAGKQGYPLKSLADFSRV-SLAPGETQTVHFVLGEEQLHL 651

Query: 779 ANKHGRRILPLGNHVLMVGE 798
           AN+  + +L  G   + V E
Sbjct: 652 ANEQAKYVLLRGEWKVEVEE 671


>gi|320170454|gb|EFW47353.1| beta-xylosidase [Capsaspora owczarzaki ATCC 30864]
          Length = 779

 Score =  503 bits (1294), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 300/788 (38%), Positives = 426/788 (54%), Gaps = 80/788 (10%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C+     + PFCN +L+   RA  L+  LTLQEKI Q    A  + RLG+ AYEWWSE+L
Sbjct: 27  CEDAALRNLPFCNPNLAWEQRADDLVGRLTLQEKISQFGTTAPGVARLGVNAYEWWSEAL 86

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVS--------AASFNRSLWSNIGSAVAVEARAMY 155
           HG+A + PGVNF G     T FPQ++ +         A+FN    + +   ++ EARA  
Sbjct: 87  HGVAES-PGVNFTGNTPVSTCFPQIIGNNCSSLSRVGATFNLDSVAAMAQVISTEARAFA 145

Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ-GENWKSDDGGI 214
           N G AGLT++ PNINIFRDPRWGRGQETPGEDP + S Y    V++ Q GE         
Sbjct: 146 NAGHAGLTYFTPNINIFRDPRWGRGQETPGEDPYLTSRYVETLVQNLQNGE--------- 196

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
                + R LK             + A CKH  AYD+E WG   R+ FNA++++QD  +T
Sbjct: 197 -----DARYLK-------------VVATCKHYTAYDMEDWGGIDRFHFNAVVSDQDLVET 238

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITSDCDAVA 331
           F PPF +C+  GK + +MCSYN VNG+P+C   D F   + AR +WGF GYI SDC A+ 
Sbjct: 239 FMPPFEACVRVGKGASLMCSYNAVNGIPSC--ADDFINNEIAREQWGFDGYIVSDCGAID 296

Query: 332 TIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
            I    NYT T + + A  ++ G D++CG     H   AI    + E D+D +L  LF  
Sbjct: 297 CIQYTHNYTNTTQATCAAGIQGGCDLDCGDFYQSHLMDAIGNATLHEADLDFSLRRLFGH 356

Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
           ++RLG F+    +  Y ++    + + EH++LAL  AR+ IVLL ND   LP +   V  
Sbjct: 357 RIRLGEFDAASIQ-PYRQIPVSAINSQEHQELALQIARESIVLLGNDNNTLPFSLATVRK 415

Query: 452 LAIIGPLVNNISQMGGGYTG---IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFH 508
           LAIIGP  ++   + G Y G      +P    + L+  +S T +  GC DV     +GF 
Sbjct: 416 LAIIGPNADDAETLLGNYYGDAPYLITPLKGFQQLDPTLSIT-FVKGC-DVNSTDTSGFV 473

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
            A   AK AD  IVV GL+ T E+E+ DR +L+LPG Q  L+ ++   ++ PVILV+  G
Sbjct: 474 AAAAAAKAADATIVVVGLNQTVESENLDRTTLVLPGVQAELILALTAAARGPVILVVMSG 533

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
            P+D+S       + + LWIGYPG+AG +ALAE +FG F+P GRLP T YP  +  ++PM
Sbjct: 534 SPIDLS--NVIHPVRAALWIGYPGQAGGRALAEAVFGVFSPAGRLPFTVYPADYVNQLPM 591

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
            +M+MRA      PGR+YRFYTGT ++ FGHGLSY+ + Y + +     + S+S  + + 
Sbjct: 592 TNMDMRAG-----PGRTYRFYTGTPLFEFGHGLSYSTFQYTWSN-----SSSSSSSSATS 641

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS----- 742
           ++ L        ++         S R  VQ    N G +    VV+ FA     S     
Sbjct: 642 QHSLSTAALAAQHLAARAPVEAVSFRVLVQ----NTGKMASDDVVLAFASFNASSIIDQS 697

Query: 743 ----QGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILP------LGNH 792
                  P + L+GF R+H     S+EI F V   +   + +   + ++P       G+ 
Sbjct: 698 SSQFASPPIRSLVGFRRIHLAPGASQEIFFAVTSSQLAQVDSTGAQTLVPSRLQVAFGSD 757

Query: 793 VLMVGELR 800
             +V E++
Sbjct: 758 ARLVAEIQ 765


>gi|389748500|gb|EIM89677.1| glycoside hydrolase family 3 protein [Stereum hirsutum FP-91666
           SS1]
          Length = 770

 Score =  487 bits (1254), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 291/750 (38%), Positives = 417/750 (55%), Gaps = 57/750 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           CNTS +   RAK+L++ +TL+E +    + +  +PRLG+P YEWWSE+LHG+AS+ PGV 
Sbjct: 36  CNTSANFLDRAKALVNAMTLEEMVNNTVNTSPGVPRLGLPPYEWWSEALHGVASS-PGVT 94

Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F  +G  S  TSFP+ ++ +A+F+  L  ++ S ++ EARA  N   +GL F+ PNIN F
Sbjct: 95  FETSGDFSGATSFPEPILMSAAFDDDLIFSVASTISTEARAFGNTNHSGLDFFTPNINPF 154

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGEDP+  S Y  + +   QG        G+G     K +         
Sbjct: 155 KDPRWGRGQETPGEDPLHTSRYVYQLITGLQG--------GVGPSPYYKII--------- 197

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                   A CKH  AYDLE W   +R +FNA+++ QD  + + P F+SC+   K   +M
Sbjct: 198 --------ADCKHFAAYDLENWEGNNRMAFNAIVSTQDLAEFYTPSFQSCVRDAKVGSVM 249

Query: 293 CSYNQVNGVPACLRGDLFQ---KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           CSYN VNGVPAC    L Q   +   E G   +ITSDCDAV  IF+  NYT T  +++A 
Sbjct: 250 CSYNAVNGVPACGSPYLLQDLVRDYFELGNDTWITSDCDAVGNIFDPHNYTTTLTNASAV 309

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KYG 408
            L AG D++CGT        A+ +G V + D++RAL+ L+   +RLG F  DP     Y 
Sbjct: 310 ALLAGTDVDCGTSYSETLGEAVSEGLVSKSDVERALVRLYGSLVRLGYF--DPEDSVPYR 367

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            LG  DV T   + LA  AA +GIVLLKND   LPL+ N VS +A+IGP  N  +QM G 
Sbjct: 368 ALGASDVNTPAAQTLAYTAAVEGIVLLKND-GLLPLSSN-VSHIALIGPWANATTQMQGN 425

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y GI     S L G  +      + +G   +  NS +GF +A+ +A  AD ++ + G+D 
Sbjct: 426 YEGIAPLLISPLDGFTSAGFNVSFTNGTT-ISGNSTSGFADALSMASAADVIVYIGGIDD 484

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           T E E +DR S+  PG Q+ L+  +    K P +++  GGG +D +  +A+S ++++LW 
Sbjct: 485 TVEAEGQDRTSITWPGNQLELIGELGAFGK-PFVVIQMGGGQVDDTELKANSSVNALLWG 543

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
           GYPG+AG KALA+II G   P GRL  T YP S+  +V M DM++R  +S   PGR+Y++
Sbjct: 544 GYPGQAGGKALADIITGVQAPAGRLTTTQYPASYVDQVAMTDMSVRPSNSTGSPGRTYKW 603

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
           YTGT V+ FG GL YT +  ++    +E + +AS        +     S     H+D   
Sbjct: 604 YTGTPVFEFGFGLHYTTFDVEW----AEGSPAASYSI--QDLVASANSSSSAVAHVDSAI 657

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
             T   F VQ  VTN G+V   +V +LF+         P ++L+ + RV       K I+
Sbjct: 658 LDT---FTVQ--VTNTGNVTSDYVALLFSNTTAGPSPAPLQELVSYARV-------KGIT 705

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            GV     L++      R+   GN ++  G
Sbjct: 706 PGVSATASLNVTLGTIARVDEDGNSIIYPG 735


>gi|326488213|dbj|BAJ89945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 525

 Score =  483 bits (1242), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 248/515 (48%), Positives = 333/515 (64%), Gaps = 28/515 (5%)

Query: 40  PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P F C   +    +Y FCN   + S RA+ L+S LTL EK+  L +   A+ RLGIPAYE
Sbjct: 35  PVFACDASNATLAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLGIPAYE 94

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG++  GPG  F+  V   TSFPQ +++AASFN SL+  IG  V+ EARAM+N+
Sbjct: 95  WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEARAMHNV 154

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
           G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V   Q     +  GG+  G
Sbjct: 155 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQ----DAGAGGVTDG 210

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                              L ++ACCKH  AYD++ W    RY+F+A +++QD +DTFQP
Sbjct: 211 ------------------ALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQP 252

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
           PF+SC+  G  + +MCSYN+VNG P C   DL +   R +W   GYI SDCD+V  ++  
Sbjct: 253 PFKSCVLDGNVASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQ 312

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           Q+YTKT E++AA  +K+G+D+NCG  + +HT +A+  G++ E+D+DRA+ N F + +RLG
Sbjct: 313 QHYTKTPEEAAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLG 372

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
            F+GDPR+  +G LGP DVCTS +++LA + ARQGIVLLKN    LPL+  ++ S+A+IG
Sbjct: 373 FFDGDPRQLAFGSLGPKDVCTSSNRELARETARQGIVLLKNSGA-LPLSAKSIKSMAVIG 431

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAK 515
           P  N    M G Y G PC   + L+GL A V+ T Y  GC +V C+ ++     AV  A 
Sbjct: 432 PNANASFTMIGNYEGTPCKYTTPLQGLGAKVN-TVYQPGCTNVGCSGNSLQLSTAVAAAA 490

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLV 550
            AD  ++V G D + E E  DR SLLLPGQQ  LV
Sbjct: 491 SADVTVLVVGADQSIERESLDRTSLLLPGQQTQLV 525


>gi|336365124|gb|EGN93476.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 732

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 289/764 (37%), Positives = 427/764 (55%), Gaps = 63/764 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TSL   +RA +++ L T+ E I      +  +PRLG+P Y+WWSE LHG+A + PGVN
Sbjct: 22  CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVA-DSPGVN 80

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFPQ ++  A+F+  L  ++G+ V +E R+  N G+AGL FW PNIN F
Sbjct: 81  FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWTPNINPF 140

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGEDP  ++ Y    V+  QG            G   K   +       
Sbjct: 141 KDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG------------GLDPKPYYQ------- 181

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                 + + CKH  AYDLE W    RY F+A++T QD  + + P F+SC    K    M
Sbjct: 182 ------VISTCKHFAAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAM 235

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAG 349
           CSYN VNG+P+C    L Q   R+ WGF    ++TSDCDAV  I++  NYTKT E++ A 
Sbjct: 236 CSYNAVNGIPSCANTYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVAD 295

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YG 408
            LKAG DI+CGT    +   A ++  + E ++ +AL+  ++  +RLG F  DP   + Y 
Sbjct: 296 ALKAGTDIDCGTFYSEYLPGAYNQSLITETELRQALIRQYASLVRLGYF--DPTDIQPYR 353

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
           +   ++V T + ++LA  AA +GIVLLKND   LPL+ + + ++A+IGP  N   +M G 
Sbjct: 354 QYNWNNVDTPQAQQLAYQAAAEGIVLLKNDGT-LPLSSD-IKNIALIGPWGNATGEMQGN 411

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G+     S L G  A      Y  G + +  N  +GF  A+  A+ AD VI   G+D 
Sbjct: 412 YYGVAPYLISPLMGAVATGYNVTYVFGTN-ITSNDTSGFAAAIAAAQGADVVIYAGGIDE 470

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           T E+E  DR  +  PG Q+ LV  +A   K P+++V  GGG +D +  +A+S ++++LW 
Sbjct: 471 TVESEGNDRNYITWPGNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWA 529

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
           GYPG++G  AL +II G   P GRLP+T YP  +  ++PM DM++R +++   PGR+Y++
Sbjct: 530 GYPGQSGGSALFDIISGKVAPAGRLPVTQYPADYVYEIPMTDMDLRPNATS--PGRTYKW 587

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI--LQQTGSRLDYVHIDE 705
           YTGT +Y FG+GL YT +SYK+  APS           S  NI  L Q+G+   Y+ +  
Sbjct: 588 YTGTPIYDFGYGLHYTTFSYKWAKAPS-----------STYNIQTLVQSGNLYSYLDLAP 636

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG-SK 764
             + T       ++VTN G+V      +LF          P K LI + R+H +A G + 
Sbjct: 637 FDTFT-------VNVTNTGNVTSDFASLLFVNGTYGPSPYPNKSLITYARLHDIASGDTA 689

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
            ++ GV     ++ A+ +G   L  G + + +  L +S+   T+
Sbjct: 690 SVALGVT-LGSIARADTYGNMWLYPGTYQVTLDTLGNSVGANTF 732


>gi|255572557|ref|XP_002527212.1| beta-glucosidase, putative [Ricinus communis]
 gi|223533388|gb|EEF35138.1| beta-glucosidase, putative [Ricinus communis]
          Length = 349

 Score =  480 bits (1236), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 239/334 (71%), Positives = 276/334 (82%), Gaps = 4/334 (1%)

Query: 23  VVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLS 82
           +V+  ++ SNS+   +K  +PC+PP  +SY FCN SLS+ TRA SLISLLTL+EKI+QLS
Sbjct: 13  IVLFELSVSNSES--HKLQYPCQPPLHNSYTFCNQSLSVPTRAHSLISLLTLEEKIKQLS 70

Query: 83  DNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWS 141
           DNAS IPR GIP YEWWSESLHGIA NGPGV+F  G VS+ T FPQV++SAA+FNR+LW 
Sbjct: 71  DNASGIPRFGIPPYEWWSESLHGIAINGPGVSFTIGPVSAATGFPQVIISAAAFNRTLWF 130

Query: 142 NIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKS 201
            IGSA+A+EARAM+N+GQ+GLTFWAPN+NIFRDPRWGRGQETPGEDPM+ SAYA+EFVK 
Sbjct: 131 LIGSAIAIEARAMHNVGQSGLTFWAPNVNIFRDPRWGRGQETPGEDPMLTSAYAIEFVKG 190

Query: 202 FQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYS 261
           FQG NWKS   G G G R     K    + D  D LMLSACCKHL AYDLEKWGNFSRYS
Sbjct: 191 FQGGNWKSGVSGSGSG-RYGFGEKRMLRDDDGDDGLMLSACCKHLTAYDLEKWGNFSRYS 249

Query: 262 FNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKG 321
           FNA++TEQD EDT+QPPFRSCIE+GKASC+MCSYN+VNGVPAC R DL QKAR EWGF+G
Sbjct: 250 FNAVVTEQDLEDTYQPPFRSCIEEGKASCLMCSYNEVNGVPACAREDLLQKAREEWGFEG 309

Query: 322 YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
           YI SDCDAVATIFEYQNY+K+ ED+ A  LKAGM
Sbjct: 310 YIVSDCDAVATIFEYQNYSKSAEDAVAIALKAGM 343


>gi|396473219|ref|XP_003839293.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
 gi|312215862|emb|CBX95814.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
          Length = 789

 Score =  480 bits (1235), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 299/794 (37%), Positives = 416/794 (52%), Gaps = 76/794 (9%)

Query: 17  RVLQ-LIVVVNVIAFSNSKPVLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTL 74
           R+LQ LI V  V+A            FP C+     +   CNTS S   RAKSL++L TL
Sbjct: 6   RLLQPLIWVTGVLAV-----------FPDCQNGPLQTETICNTSASPLDRAKSLVTLYTL 54

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGT---VSSVTSFPQVLVS 131
           +EKI   S  +  +PRLGIP Y+WWSE LHGIA  GP  NF+ +    S  TSFPQ ++ 
Sbjct: 55  EEKINATSSGSPGVPRLGIPPYQWWSEGLHGIA--GPYTNFSTSGIEYSYSTSFPQPILM 112

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
            A+F+  L +++   ++ EARA  N  + GL FW PNIN FRDPRWGRGQETPGED   +
Sbjct: 113 GAAFDDHLITDVAKVISTEARAFNNANRTGLDFWTPNINPFRDPRWGRGQETPGEDAFHL 172

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S+Y    +   QGE               KRV+                A CKH   YD+
Sbjct: 173 SSYVKALIAGLQGETTDP----------YKRVV----------------ATCKHFAGYDI 206

Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
           E W    RY F+A I++QD  + +  PF++C+ Q      MCSYN VNGVP C    L Q
Sbjct: 207 EDWNGNLRYQFDAQISQQDLVEYYLQPFQACV-QANVGAFMCSYNAVNGVPTCADPYLLQ 265

Query: 312 KA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHT 367
              R  WG+   + ++TSDCDAV  I+    ++ T E + A  L AG D++CGT M  H 
Sbjct: 266 TILREHWGWTNEEQWVTSDCDAVQNIYLPHQWSATREQAVADALIAGTDLDCGTYMQEHL 325

Query: 368 QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDA 427
             A  +G V E  +D+AL+  +S  +RLG F+ D     Y + G D V T   + LA  A
Sbjct: 326 PGAFAQGLVNENVLDQALVRQYSSLVRLGWFD-DAADQPYRQFGWDSVATDASQALARRA 384

Query: 428 ARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV 487
           A +GIVLLKND   LPL+ ++  SL + G   N  SQ+ G Y G+P    S L  L+   
Sbjct: 385 AVEGIVLLKNDG-VLPLSIDSSVSLGVFGDWANATSQLLGNYAGVPTYLHSPLWALQQEN 443

Query: 488 SKTHYASGC----HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
              +YA G      D   N  +    A+     +D +I + G+D + E E  DR SL   
Sbjct: 444 LTINYAGGNPGGQGDPTTNRWSSLSGAI---ATSDILIYIGGIDNSIEEEGHDRTSLAWT 500

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
           G Q+ ++  +A T K P I+V+ GGG +D +    ++ IS+ILW GYPG+ G  A+ +I+
Sbjct: 501 GAQLDVIFQLAATGK-PTIVVVMGGGQIDSAPLANNANISAILWAGYPGQDGGPAIVDIL 559

Query: 604 FGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
            G   P GRLP T YP S+T  VPM DM +R   S   PGR+Y++Y GT  Y FGHGL Y
Sbjct: 560 TGKSPPAGRLPQTQYPASYTSLVPMTDMGLR--PSENNPGRTYKWYNGTATYEFGHGLHY 617

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           TN+S   +++P + +   +    + KN    T  R  +  +D             ISVTN
Sbjct: 618 TNFS-ATVTSPMQQSYRIADLMSTCKNATSITLERCAFTSVD-------------ISVTN 663

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G V   +V + +          P+K L+G+ R+  +A G+ + +      E L+  ++ 
Sbjct: 664 TGAVASDYVTLCYISGSHGPAPHPKKSLVGYQRLFGIAAGASDTARIDLTLESLARVDEV 723

Query: 783 GRRILPLGNHVLMV 796
           G ++L  G + LMV
Sbjct: 724 GNKVLYPGEYSLMV 737


>gi|449531013|ref|XP_004172482.1| PREDICTED: beta-D-xylosidase 1-like, partial [Cucumis sativus]
          Length = 534

 Score =  479 bits (1234), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 242/538 (44%), Positives = 348/538 (64%), Gaps = 16/538 (2%)

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAV 330
           EDT+  PF++C+ +GK + +MCSYNQVNG P C   DL +   R  WG  GYI SDCD+V
Sbjct: 2   EDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDSV 61

Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
             +++ Q++T T E++AA  +KAG+D++CG  +  HT +A+ +G ++E D++ AL NL S
Sbjct: 62  GVLYDSQHFTPTPEEAAASTIKAGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANLLS 121

Query: 391 VQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVS 450
           VQ+RLG+F+G+P    YG LGP DVCT  HK LAL+AARQGIVLL+N    LPL+     
Sbjct: 122 VQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTRHR 181

Query: 451 SLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEA 510
           ++A+IGP  +    M G Y G+ C   + ++G+  YV KT +A GC +V C  D    EA
Sbjct: 182 TVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYV-KTIHAKGCANVACVGDQLIGEA 240

Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
              A+ AD  +VV GLD + E E RDR  +LLPG+Q  LV  +    K P ++VL  GGP
Sbjct: 241 EAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSGGP 300

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMND 629
           +DVSFA+ D +IS ILW+GYPG+AG  A+A+++FG  NPGG+LPMTWYP+S+  KVPM +
Sbjct: 301 IDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQSYLAKVPMTN 360

Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
           M +R D S  YPGR+YRFY G  V+ FG GLSY+ +S  F  AP+++++  S  + +   
Sbjct: 361 MGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFSQSFAEAPTKISLPLSSLSPNSSA 420

Query: 690 ILQQTGSRLDYVHIDEVTSCTSLR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
            ++ +       H D    C S+    + I V N G VDGSH +++F+ VP  +  +PEK
Sbjct: 421 TVKVS-------HTD----CASVSDLPIMIDVKNTGTVDGSHTILVFSTVPNQTW-SPEK 468

Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            LIGF++VH +A   K +  G+  C+ LS  ++ G R +P+G H L +G+L HS++++
Sbjct: 469 HLIGFEKVHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIPMGEHKLHIGDLTHSISLQ 526


>gi|336377735|gb|EGO18896.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 766

 Score =  478 bits (1231), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 287/755 (38%), Positives = 422/755 (55%), Gaps = 63/755 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TSL   +RA +++ L T+ E I      +  +PRLG+P Y+WWSE LHG+A + PGVN
Sbjct: 37  CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVA-DSPGVN 95

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFPQ ++  A+F+  L  ++G+ V +E R+  N G+AGL FW PNIN F
Sbjct: 96  FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWTPNINPF 155

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGEDP  ++ Y    V+  QG            G   K   +       
Sbjct: 156 KDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG------------GLDPKPYYQ------- 196

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                 + + CKH  AYDLE W    RY F+A++T QD  + + P F+SC    K    M
Sbjct: 197 ------VISTCKHFAAYDLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAM 250

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAG 349
           CSYN VNG+P+C    L Q   R+ WGF    ++TSDCDAV  I++  NYTKT E++ A 
Sbjct: 251 CSYNAVNGIPSCANTYLLQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNYTKTPEEAVAD 310

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YG 408
            LKAG DI+CGT    +   A ++  + E ++ +AL+  ++  +RLG F  DP   + Y 
Sbjct: 311 ALKAGTDIDCGTFYSEYLPGAYNQSLITETELRQALIRQYASLVRLGYF--DPTDIQPYR 368

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
           +   ++V T + ++LA  AA +GIVLLKND   LPL+ + + ++A+IGP  N   +M G 
Sbjct: 369 QYNWNNVDTPQAQQLAYQAAAEGIVLLKNDGT-LPLSSD-IKNIALIGPWGNATGEMQGN 426

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G+     S L G  A      Y  G + +  N  +GF  A+  A+ AD VI   G+D 
Sbjct: 427 YYGVAPYLISPLMGAVATGYNVTYVFGTN-ITSNDTSGFAAAIAAAQGADVVIYAGGIDE 485

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           T E+E  DR  +  PG Q+ LV  +A   K P+++V  GGG +D +  +A+S ++++LW 
Sbjct: 486 TVESEGNDRNYITWPGNQLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWA 544

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
           GYPG++G  AL +II G   P GRLP+T YP  +  ++PM DM++R +++   PGR+Y++
Sbjct: 545 GYPGQSGGSALFDIISGKVAPAGRLPVTQYPADYVYEIPMTDMDLRPNATS--PGRTYKW 602

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI--LQQTGSRLDYVHIDE 705
           YTGT +Y FG+GL YT +SYK+  APS           S  NI  L Q+G+   Y+ +  
Sbjct: 603 YTGTPIYDFGYGLHYTTFSYKWAKAPS-----------STYNIQTLVQSGNLYSYLDLAP 651

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG-SK 764
             + T       ++VTN G+V      +LF          P K LI + R+H +A G + 
Sbjct: 652 FDTFT-------VNVTNTGNVTSDFASLLFVNGTYGPSPYPNKSLITYARLHDIASGDTA 704

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
            ++ GV     ++ A+ +G   L  G + + +  L
Sbjct: 705 SVALGVT-LGSIARADTYGNMWLYPGTYQVTLDTL 738


>gi|393247584|gb|EJD55091.1| beta-xylosidase [Auricularia delicata TFB-10046 SS5]
          Length = 763

 Score =  478 bits (1229), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 272/760 (35%), Positives = 405/760 (53%), Gaps = 58/760 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           CNT+ +   RAK+LI   T +E +    + +  +PRLG+P Y+WWSE+LHG+A   PGV+
Sbjct: 37  CNTTANFMDRAKALIDEFTTEELVNNTVNGSPGVPRLGLPPYQWWSEALHGVAGANPGVH 96

Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F          TSFPQ ++  A+F+  L   + + ++ EARA  N G +G+ F+ PNIN 
Sbjct: 97  FAPAGEDFDHATSFPQPILMGAAFDDELIHEVATVISTEARAFNNFGFSGIDFFTPNINP 156

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           FRDPRWGRGQETPGEDP+ +S Y  + V + QG        G+G     K V        
Sbjct: 157 FRDPRWGRGQETPGEDPLHISRYVFQLVTALQG--------GLGPSPYYKIV-------- 200

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                    A CKH   YDLE W    R+ F+A+IT QD  + + P F+SC+   K   +
Sbjct: 201 ---------ADCKHFAGYDLESWEGIDRFHFDAVITTQDLAEFYTPSFQSCVRDAKVGSV 251

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF-KGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           MCSYN VNGVPAC    L Q   R+ +G   G+ITSDCDAV  +F   N+T T  +++A 
Sbjct: 252 MCSYNSVNGVPACASSYLLQDIVRDFYGLGDGWITSDCDAVQNVFTTHNFTTTQANASAI 311

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            LKAG D++CG    +    A+D+G V+E D+ +AL+ L+   +R G F+  P +  + +
Sbjct: 312 SLKAGTDVDCGNVYAQSLGDALDQGLVEEDDLKQALVRLYGSLVRTGYFD-SPEEQPFRQ 370

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DV T   ++LAL AA +GIVLLKND   LPL+   V ++ ++GP  N  + M G Y
Sbjct: 371 LGWADVDTPASRRLALLAAEEGIVLLKND-GLLPLSSRDVPNVIMVGPWGNATTMMQGNY 429

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
            G      S  +G         + +G         +GF EAV  A   D ++ V G D  
Sbjct: 430 FGNAPYLVSPRQGFVDAGFNVTFFNGTVGTNGTDTSGFDEAVAAAGDTDLIVFVGGPDNV 489

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
            E E RDR+++  PG Q+ L+  +A   K P+I++  G G +D ++ +    I++++W G
Sbjct: 490 VERESRDRINITWPGVQLDLIKELAGVGK-PMIVLQMGAGQVDDTWLKESDAINALIWGG 548

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
           YPG++G  ALA I+ G   P  RLP+T YPE +  +PM DMN+R  +S   PGR+Y+++T
Sbjct: 549 YPGQSGGTALANIVTGKTAPAARLPITQYPEDYISLPMTDMNVRPSNSS--PGRTYKWFT 606

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
           G  ++ FG GL Y+ + + +   P      AS   G   +++    S +D          
Sbjct: 607 GEPIFEFGFGLHYSKFDFAWAEEP-----PASFAIG---DLVANASSPVDLA-------- 650

Query: 710 TSLRFHV-QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
               FH  Q++VTN G V    V MLF          P K+L+G+ R+  +  G+   + 
Sbjct: 651 ---TFHTFQVNVTNLGPVASDFVAMLFGNTTAGPSPAPLKELVGYTRLTNIPVGATVTAS 707

Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMV---GELRHSLTI 805
                  ++ A++ G  +L  G + + +   GE+ H   +
Sbjct: 708 VPVTLGTIARADEDGNSVLFPGQYSVWLDTTGEILHDFEL 747


>gi|395334835|gb|EJF67211.1| beta-xylosidase [Dichomitus squalens LYAD-421 SS1]
          Length = 774

 Score =  477 bits (1228), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 291/761 (38%), Positives = 412/761 (54%), Gaps = 51/761 (6%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS    TRA +LI L T +E      + +  +PRLG+PAY WWSE LHG+A + PGV 
Sbjct: 41  CDTSKDPITRATALIDLWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQS-PGVT 99

Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F  +G  S  TSFPQ ++  A+F+  L   + S V+ E RA  N+G+AGL +W PNIN F
Sbjct: 100 FAPSGNFSYATSFPQPILMGAAFDDPLIQAVASVVSTEGRAFNNVGRAGLDYWTPNINPF 159

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGEDP  +  Y    +   QG            G       K       
Sbjct: 160 KDPRWGRGQETPGEDPFHLQGYVYNLILGLQG------------GLDPTPYFK------- 200

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                 + A CKH  AYD++ W    RY FNA++T+QD  + + P F++C+   K + +M
Sbjct: 201 ------VVADCKHFAAYDMDNWEGNVRYGFNAVVTQQDLSEYYLPSFQTCVRDAKVASVM 254

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAG 349
           CSYN VNG+P+C    L Q   R+ WGF    ++TSDCDAV  I+   NYT     +AA 
Sbjct: 255 CSYNAVNGIPSCANSFLLQDILRDYWGFDDTRWVTSDCDAVQNIYTPHNYTDNPAQAAAD 314

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            L AG DI+CGT    +   A+ +G V   D+ RA +  ++  +RLG F+  P    Y +
Sbjct: 315 ALLAGTDIDCGTFSSTYLPDALSQGLVNATDLKRAAIRQYASLVRLGYFD-PPESQPYRQ 373

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DV T E ++LA  AA +G+VLLKND   LPL+K+ V  LA+IGP  N  + M G Y
Sbjct: 374 LGWSDVNTPEAQQLAHTAAVEGMVLLKNDGT-LPLSKH-VRKLALIGPWANATTLMQGNY 431

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
            GI     S L G +       Y  G +    N  +GF  AV  AK+AD VI   GLD T
Sbjct: 432 AGIAPYLISPLLGAQQAGFDVEYVFGTNVTTTNDTSGFAAAVAAAKRADAVIFAGGLDET 491

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
            E E+ DR+++  PG Q+ LV  +A   K P+I+   GGG LD S  ++   +++I+W G
Sbjct: 492 VEREEVDRLNVTWPGNQLDLVAELASVGK-PLIVAQFGGGQLDDSALKSKRSVNAIIWGG 550

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFY 648
           YPG++G  AL +I+ G   P GRLP+T YP  +  +VPM DM +R  ++   PGR+Y++Y
Sbjct: 551 YPGQSGGTALFDILTGKAAPAGRLPITQYPAEYANQVPMTDMTLRPSATN--PGRTYKWY 608

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
           TGT V+ FG GL YT +S+ + S     T +AS    +      ++ + LD   +D    
Sbjct: 609 TGTPVFEFGFGLHYTTFSFAWASNAHANTPAASYSIDALMASGNKSAAFLDLAPLDTFA- 667

Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
                    + VTN G +   +V +LFA         P KQL+ + RVH VA     I+ 
Sbjct: 668 ---------VRVTNTGKMTSDYVALLFASGTFGPAPHPNKQLVAYTRVHGVAPKQSTIAE 718

Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVG---ELRHSLTIE 806
                  ++ A++ G + +  G + L +    +L+H+ T+E
Sbjct: 719 LTVTLGAIARADESGAKWVYPGTYTLALDTTEQLKHTFTLE 759


>gi|409079878|gb|EKM80239.1| hypothetical protein AGABI1DRAFT_120267 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 786

 Score =  477 bits (1227), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 289/751 (38%), Positives = 407/751 (54%), Gaps = 66/751 (8%)

Query: 22  IVVVNVIAFSNSKPVLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQ 80
           I  + V+ F    P      FP C      S P C+++   +TRA+SLI + T  E IQ 
Sbjct: 8   IFFIAVVIF----PAHGTFSFPDCTNGPLKSTPVCDSAKDPATRAQSLIQMFTDDELIQN 63

Query: 81  LSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRS 138
             + +  +PRLG+P YEWWSE+LHG+  + PGV F  +G  SS TSFPQ +V  A+F+  
Sbjct: 64  GDNASPGVPRLGLPPYEWWSEALHGVG-HSPGVVFAPSGDFSSATSFPQPIVIGAAFDDD 122

Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
           L   + + V+ EARA  N G+AGL ++ PNIN F+DPRWGRGQETPGEDP  +S Y    
Sbjct: 123 LVKAVANVVSTEARAFNNFGRAGLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHL 182

Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
           V   Q        GGI                 D    + ++A CKH  AYDLE W    
Sbjct: 183 VDGLQ--------GGI-----------------DPWPYIKVAADCKHFAAYDLENWEGID 217

Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
           R+ F+A +++QD  + + PPF+SC+   KA+ +MCSYN VNGVPAC    L Q   R+ W
Sbjct: 218 RFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNGVPACASTYLLQDILRDAW 277

Query: 318 GFKG--YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGK 375
           GF    ++TSDC A+  IF+  N+T++  ++AA  LKAG DI+CG+    H  +A+++  
Sbjct: 278 GFDDDRWVTSDCWALDKIFDSHNFTRSFAEAAAISLKAGTDIDCGSTFADHLPAALNQSL 337

Query: 376 VQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVL 434
           +   D+ RA +  ++  +RLG F  DP   + Y +    DV T E + L+  AA +G+VL
Sbjct: 338 ISRDDLTRAFIRQYTSLIRLGYF--DPSDSQTYRQFDWSDVNTPEAQALSRRAAVEGLVL 395

Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
           LKND   LPL  +   ++AIIGP  N  S M G Y G      S  +G +    K   A+
Sbjct: 396 LKND-GLLPLAPDG-KTIAIIGPYTNATSSMQGNYFGNAPIITSPFQGAQDVGFKVVSAA 453

Query: 495 GCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVA 554
           G   V   S AGF EA+  AK AD V+ V G+D T E E  DR S+  PG Q+ LV  +A
Sbjct: 454 GT-TVNGTSSAGFAEAINTAKAADVVVFVGGIDNTLEREGLDRSSISWPGNQLDLVKDLA 512

Query: 555 RTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLP 614
              K P+I+V  GGG +D +   A+ ++ +I+W GYPG++G  A+ +II G   P GRLP
Sbjct: 513 SLGK-PLIVVQFGGGQVDDTEILANKKVQAIIWAGYPGQSGGTAIFDIIVGSTAPAGRLP 571

Query: 615 MTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
           +T YP  +T +V M DM++R  S    PGR+Y++Y  T V  +GHGL +T + + +   P
Sbjct: 572 VTQYPADYTHQVRMTDMSLRPSSHN--PGRTYKWYK-TPVLEYGHGLHFTTFDFSWQRQP 628

Query: 674 -SELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
            +E  I   ++A   K         LD  H D    C          V N G++   +V 
Sbjct: 629 AAEYDIQELIRASHSK--------FLDLAHFDTFEIC----------VRNTGNITSDYVG 670

Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
           +LF          P K L+ + RVH +  G+
Sbjct: 671 LLFLSGNTGPGPHPIKSLVAYSRVHDIQGGT 701


>gi|426198356|gb|EKV48282.1| hypothetical protein AGABI2DRAFT_67675 [Agaricus bisporus var.
           bisporus H97]
          Length = 763

 Score =  474 bits (1221), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 284/731 (38%), Positives = 401/731 (54%), Gaps = 62/731 (8%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C      S P C+++   +TRA+SLI + T  E IQ   + +  +PRLG+P YEWWS
Sbjct: 24  FPDCTNGPLKSTPVCDSTKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWS 83

Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+  + PGV F  +G  SS TSFPQ +V  A+F+  L   + + V+ EARA  N G
Sbjct: 84  EALHGVG-HSPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFG 142

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           +AGL ++ PNIN F+DPRWGRGQETPGEDP  +S Y    V   Q        GGI    
Sbjct: 143 RAGLNYFTPNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQ--------GGI---- 190

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                        D    + ++A CKH  AYDLE W    R+ F+A +++QD  + + PP
Sbjct: 191 -------------DPWPYIKVAADCKHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPP 237

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFE 335
           F+SC+   KA+ +MCSYN VNGVPAC    L Q   R+ WGF    ++TSDC A+  IF+
Sbjct: 238 FQSCVRDAKAASVMCSYNSVNGVPACASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFD 297

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
             N+T++  ++AA  LKAG DI+CG+    H  +A+++  +   D+ RA +  ++  +RL
Sbjct: 298 SHNFTRSFAEAAAISLKAGTDIDCGSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRL 357

Query: 396 GLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           G F  DP   + Y +    DV T E + L+  AA +G+VLLKND   LPL  +   ++AI
Sbjct: 358 GYF--DPSHSQTYRQFDWSDVNTPEAQALSRRAAVEGLVLLKND-GLLPLAPDG-KTIAI 413

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           IGP  N  S M G Y G      S  +G +    K   A+G   V   S AGF EA+  A
Sbjct: 414 IGPYTNATSSMQGNYFGNAPFITSPFQGAQDVGFKVVSAAGTI-VNGTSSAGFAEAINTA 472

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           + AD V+ V G+D T E E  DR S+  PG Q+ LV  +A   K P+I+V  GGG +D +
Sbjct: 473 RAADVVVFVGGIDNTLEREGLDRSSISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDT 531

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMR 633
              A+ ++ +I+W GYPG++G  A+ +II G   P GRLP+T YP  +T +V M DM++R
Sbjct: 532 EILANEKVQAIIWAGYPGQSGGTAIFDIIVGATAPAGRLPVTQYPADYTHQVRMTDMSLR 591

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP-SELTISASLKAGSDKNILQ 692
             S    PGR+Y++Y  T V  +GHGL +T + + +   P +E  I   ++A   K    
Sbjct: 592 PSSHN--PGRTYKWYK-TPVLEYGHGLHFTTFDFSWQRQPAAEYDIQELIRASHSK---- 644

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
                LD  H D    C          V N G++   +V +LF          P K L+ 
Sbjct: 645 ----FLDLAHFDTFEIC----------VRNTGNITSDYVGLLFLSGNSGPGPHPIKSLVA 690

Query: 753 FDRVHTVAKGS 763
           + RVH +  G+
Sbjct: 691 YSRVHDIQGGT 701


>gi|226491558|ref|NP_001146416.1| uncharacterized protein LOC100279996 [Zea mays]
 gi|223975771|gb|ACN32073.1| unknown [Zea mays]
          Length = 507

 Score =  474 bits (1220), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 245/516 (47%), Positives = 332/516 (64%), Gaps = 13/516 (2%)

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
           MCSYNQVNG P C   DL     R +W   GYI+SDCD+V  ++  Q+YTKT ED+AA  
Sbjct: 1   MCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHYTKTPEDAAAIS 60

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
           +KAG+D+NCGT + +HT +A+  GK+ E D+DRA+ N     +RLG F+GDPR+  +G L
Sbjct: 61  IKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNL 120

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
           GP DVCT  +++LA +AARQGIVLLKN  K LPL+  ++ S+A+IGP  N    M G Y 
Sbjct: 121 GPSDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSMAVIGPNANASFTMIGNYE 179

Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVIVVAGLDLT 529
           G PC   + L+GL A V+ T Y  GC +V C+ ++     A + A  AD  ++V G D +
Sbjct: 180 GTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQS 238

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
            E E  DR SLLLPGQQ  LV++VA  S  P ILV+  GGP D+SFA++  +I++ILW+G
Sbjct: 239 IERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVG 298

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
           YPGEAG  A+A+++FG  NP GRLP+TWYPESFTKVPM DM MR D S  YPGR+YRFYT
Sbjct: 299 YPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYPGRTYRFYT 358

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
           G  VY FG GLSYT++++  +SAP +L    +L+       L +    ++     E   C
Sbjct: 359 GDTVYAFGDGLSYTSFAHHLVSAPKQL----ALQLAEGHACLTEQCPSVEA----EGAHC 410

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
             L F V + V NAG+  G H V LF+  P V    P K L+GF++V      +  ++F 
Sbjct: 411 EGLAFDVHLRVRNAGERSGGHTVFLFSSPPAV-HNAPAKHLLGFEKVSLEPGQAGVVAFK 469

Query: 770 VDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           VD C+ LS+ ++ G R + LG+H L VG+L+H+L +
Sbjct: 470 VDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNL 505


>gi|409041356|gb|EKM50841.1| glycoside hydrolase family 3 protein [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 764

 Score =  474 bits (1220), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 278/761 (36%), Positives = 417/761 (54%), Gaps = 58/761 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           CN S   ++RA +L+  LTL+E +    + +  +PRLG+P Y WWSE+LHG+A + PG N
Sbjct: 38  CNPSADPTSRANALVDALTLEELVNNTVNASPGVPRLGLPPYNWWSEALHGVALS-PGTN 96

Query: 115 FN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F+      SS TSFPQ ++  A+F+  L ++I + ++ EARA  N G+AGL F+ PNIN 
Sbjct: 97  FSVPGSPFSSATSFPQPIILGATFDDDLVTSIATVISTEARAFNNAGRAGLDFFTPNINP 156

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           F+DPRWGRGQETPGEDP  ++ Y  + V   QG            G       K      
Sbjct: 157 FKDPRWGRGQETPGEDPFHIAQYVYQLVTGLQG------------GLSPDPYYK------ 198

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                  + A CKH   YDLE W   SR +FNA+I+ QD  + + P F+SC+       +
Sbjct: 199 -------VIADCKHFAGYDLENWEGNSRMAFNAIISTQDLAEYYTPSFQSCVRDAHVGSV 251

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF-KGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           MCSYN VNG+P+C    L Q   R  +G   G+ITSDCDAVA IF    YT T  +++A 
Sbjct: 252 MCSYNAVNGIPSCANSYLLQDIIRGHFGLGDGWITSDCDAVANIFSPHQYTTTLVNASAV 311

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            LKAG D++CGT   +    A+D+  V E DI  +++ L+   +RLG F+  P +  + +
Sbjct: 312 ALKAGTDVDCGTTYSQTLVDAVDQNLVTEDDIKNSMIRLYRSLVRLGYFD-SPAEQPFRQ 370

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DV T   + LAL AA +G+ LLKND   LPL+ +A+  +A++GP  N  +QM G Y
Sbjct: 371 LGWSDVNTPSSQALALTAAEEGVTLLKNDGT-LPLS-SAIKRIALVGPWANATTQMQGNY 428

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
            GI     S L+ L+    +  +A+G   +    D+GF  AV   + AD VI   G+D T
Sbjct: 429 QGIAPFLVSPLQALQDAGFQVTFANGTA-INSTDDSGFAAAVSAVQVADAVIYAGGIDET 487

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
            E+E  DR  +  PG Q+ LV+ +A   K P +++  GGG +D S  +++  +++++W G
Sbjct: 488 IESEGNDREIITWPGNQLDLVSQLAAVGK-PFVVLQMGGGQVDSSSLKSNKAVNALIWGG 546

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFY 648
           YPG++G  A+  I+ G   P GRLP+T YP  +  ++PM DM +R + +   PGR+Y+++
Sbjct: 547 YPGQSGGAAIVNILTGKIAPAGRLPITQYPADYVNEIPMTDMALRPNGTS--PGRTYKWF 604

Query: 649 TGTQVYGFGHGLSYTNYSYKFL-SAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
           TGT ++GFG GL YT +S  +  + PS   IS  +   +   +                T
Sbjct: 605 TGTPIFGFGFGLHYTTFSLDWAPTPPSSFAISTLVSEANTAGV--------------SFT 650

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           +   L F  +++V N G V   +V +LF+      Q  P KQL+ + RV  +A G  E +
Sbjct: 651 NLAPL-FTFRVNVKNTGKVGSDYVALLFSNTTAGPQPAPLKQLVSYTRVKGIAPGQTETA 709

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMV---GELRHSLTI 805
                   ++  +++G   L  G + + V   G++ HS  +
Sbjct: 710 ELKVTLGSIARIDENGDSALYPGRYNIWVDTTGDIVHSFEL 750


>gi|392590128|gb|EIW79457.1| glycoside hydrolase family 3 protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 770

 Score =  473 bits (1216), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 283/755 (37%), Positives = 412/755 (54%), Gaps = 56/755 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TSL+ + RA +L+ L T++E I    + +  +PRLG+PAY+WWSE LHG+A + PGVN
Sbjct: 37  CDTSLNATQRAAALVELFTVEELINNTVNGSPGVPRLGLPAYQWWSEGLHGVA-DSPGVN 95

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFPQ +V +A+F+ +L   +G  V +E R+  N G AGL FW PNIN F
Sbjct: 96  FSTSGPFSYATSFPQPIVMSAAFDDALIKAVGGVVGMEGRSFNNYGHAGLDFWTPNINPF 155

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGEDP  ++ Y    ++  QG                     G   E  
Sbjct: 156 KDPRWGRGQETPGEDPYHIAQYVYNLIQGLQG---------------------GVNPEP- 193

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                 + A CKH   YDLE W N  RY F+A+IT QD  + + P F+SC    +A   M
Sbjct: 194 ---YFQVVATCKHFAGYDLEDWENNFRYGFDALITTQDLSEFYLPSFQSCYRDAQAGASM 250

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAG 349
           CSYN VNG+P C    L Q   R+ W F    ++TSDCDAV  I+   NYT   + +AA 
Sbjct: 251 CSYNAVNGIPTCADTYLLQDILRDYWNFDETRWVTSDCDAVENIYNPHNYTALPQQAAAD 310

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YG 408
            L+AG D++CGT    +   A ++  + E ++  AL   ++  +RLG F  DP   + Y 
Sbjct: 311 ALRAGTDLDCGTFYTEYLPLAYNQSLITETELRAALTRQYASLVRLGYF--DPAAQQPYR 368

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
           + G  +V T   ++LA  AA +GI LLKND   LPL  + + ++A+IGP  N  +QM G 
Sbjct: 369 QYGWSNVDTPYAQQLAYTAATEGITLLKNDGT-LPL-PSTLKNIALIGPWANATNQMQGN 426

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G+     S L+G  A      Y  G  ++  NS AGF  A+  A++AD V+   G+D+
Sbjct: 427 YFGVAPYLVSPLQGALAAGYNVTYVFGT-NITSNSTAGFAAAIAAAREADAVVYAGGIDV 485

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           T E E  DR ++  PG Q+ L+  +A   K P ++   GGG +D +  +A++ ++S++W 
Sbjct: 486 TVEAEAMDRYNVTWPGNQLQLIGELAALGK-PFVVAQFGGGQVDDTEIKANASVNSLIWA 544

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSS-RQYPGRSYR 646
           GYPG++G +AL +II G   P GRL  T YP  +  ++PM DMN+R +++    PGR+Y+
Sbjct: 545 GYPGQSGGQALFDIISGKVAPAGRLVTTQYPADYVYEIPMTDMNLRPNANGTTSPGRTYK 604

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +YTG  VY FG+GL YTN++Y +  AP+           S  NI     +     HID  
Sbjct: 605 WYTGAPVYEFGYGLHYTNFTYTWTKAPA-----------STYNIQTLVSAASGAAHIDLA 653

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
              T     + ++VTNAG V   +  +LF          P K L  + R+H+VA G+ + 
Sbjct: 654 PFDT-----LSVAVTNAGAVTSDYSALLFVNGTYGPAPYPNKALAAYTRLHSVAAGAAQT 708

Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
           +       Q++ A+ +G   L  G + L +   R 
Sbjct: 709 ATFDVVLNQIARADAYGNFWLYPGAYELALDTTRE 743


>gi|452989371|gb|EME89126.1| glycoside hydrolase family 3 protein [Pseudocercospora fijiensis
           CIRAD86]
          Length = 790

 Score =  472 bits (1215), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 282/750 (37%), Positives = 394/750 (52%), Gaps = 55/750 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+T+    TRAK+LI+  TL EKI      +  +PRLG+  YEWW E+LHG+AS+ PGVN
Sbjct: 39  CDTAADPLTRAKALIAEFTLAEKINNTGSTSPGVPRLGLLPYEWWQEALHGVASS-PGVN 97

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G     TSFPQ ++  A+F+  L  ++ S ++ EARA  N  +AGL FW PNIN F
Sbjct: 98  FSVSGEFRYATSFPQPILMGAAFDDQLIHDVASVISTEARAFSNDDRAGLDFWTPNINPF 157

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGEDP  +S+Y    ++  QG+N              K+V+        
Sbjct: 158 KDPRWGRGQETPGEDPYHLSSYVHSLIRGLQGDN-----------PSYKKVV-------- 198

Query: 233 RGDELMLSACCKHLIAYDLEKW-GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                   A CKH +AYD+E W GNF RY  +A I  QD  + + PPFRSC         
Sbjct: 199 --------ATCKHFVAYDVENWNGNF-RYQLDAHINSQDLVEYYMPPFRSCARDSNVGAF 249

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSA 347
           MCSYN +NGVP C    L Q   R  W +     ++TSDCD+V  +F Y NY  + E++A
Sbjct: 250 MCSYNSLNGVPTCADPYLLQTVLREHWNWTAEEQWVTSDCDSVQNVFLYHNYASSREEAA 309

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  LKAG DINCGT    H   A ++G + E D+D +L+  +   +RLG F+GD  +  Y
Sbjct: 310 AISLKAGTDINCGTYYQEHLPRAYEQGLINETDVDTSLIRQYGSLIRLGYFDGD--RVPY 367

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             L  +DV T   + LAL AA  GI LLKND   LPL     + +A+IG   N   QM G
Sbjct: 368 RNLTWNDVSTPYAQDLALKAATSGITLLKNDG-ILPLQITNGTKIALIGDWANATDQMLG 426

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y GIP    S L   +   ++  Y  G       +   +      A K+D +I + G+D
Sbjct: 427 NYHGIPPYFHSPLWAAQQTGAEVTYVQGPGGQSDPTTYTWRPIWSAANKSDVIIYIGGMD 486

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
              E E++DRVS+   G Q+ ++  +A    +P I+V  GGG LD S    +  I ++LW
Sbjct: 487 ERVEAEEKDRVSIAWSGPQLDVIGQLADYYDKPTIVVQMGGGSLDSSPLVKNPNIRALLW 546

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYR 646
            GYPG+ G KA+ +I+ G   P GRLP+T Y   + +KVPM D ++R +++   PGR+Y 
Sbjct: 547 GGYPGQDGGKAIFDILQGISAPAGRLPITQYRADYISKVPMTDTSLRPNATSGSPGRTYI 606

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +     V+ FG+GL YTN++     A S  T  +     SD      T S LD       
Sbjct: 607 WLNEEPVFEFGYGLHYTNFTATIPDAESSDTTYSIDSLASDC-----TESYLD------- 654

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
             C    F   I VTN G V   +V + F       +  P K+L+ + R+H +  GS + 
Sbjct: 655 -RCPFKTF--SIDVTNTGSVTSDYVTLGFLTGAHGPEPCPNKRLVSYQRLHNITAGSTQT 711

Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           +        LS  +  G  +L  G++ L+V
Sbjct: 712 AALNLTLGSLSRVDDKGNTVLFPGSYALLV 741


>gi|62321271|dbj|BAD94481.1| beta-xylosidase [Arabidopsis thaliana]
          Length = 523

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 244/531 (45%), Positives = 336/531 (63%), Gaps = 16/531 (3%)

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
           S +  G  + +MCSYNQVNG P C   DL     R EW   GYI SDCD+V  +++ Q+Y
Sbjct: 1   SYVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHY 60

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           TKT  ++AA  + AG+D+NCG+ + +HT+ A+  G V E  ID+A+ N F   +RLG F+
Sbjct: 61  TKTPAEAAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFD 120

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
           G+P+   YG LGP DVCTS +++LA DAARQGIVLLKN    LPL+  ++ +LA+IGP  
Sbjct: 121 GNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKN-TGCLPLSPKSIKTLAVIGPNA 179

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
           N    M G Y G PC   + L+GL   VS T Y  GC +V C + A    A ++A  AD 
Sbjct: 180 NVTKTMIGNYEGTPCKYTTPLQGLAGTVSTT-YLPGCSNVAC-AVADVAGATKLAATADV 237

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
            ++V G D + E E RDRV L LPGQQ  LV  VA+ +K PV+LV+  GG  D++FA+ D
Sbjct: 238 SVLVIGADQSIEAESRDRVDLRLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKND 297

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSR 638
            +I+ ILW+GYPGEAG  A+A+IIFG +NP G+LPMTWYP+S+  KVPM  MNMR D + 
Sbjct: 298 PKIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDKAS 357

Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS-- 696
            YPGR+YRFYTG  VY FG GLSYT +S+  + APS +++      G ++N + ++    
Sbjct: 358 GYPGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSL------GLEENHVCRSSECQ 411

Query: 697 RLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
            LD +  H +   S     F V I V N GD +G H V LF   P +  G+P K L+GF+
Sbjct: 412 SLDAIGPHCENAVSGGGSAFEVHIKVRNGGDREGIHTVFLFTTPPAI-HGSPRKHLVGFE 470

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           ++    +    + F V+ C+ LS+ ++ G+R + LG H+L VG+L+HSL+I
Sbjct: 471 KIRLGKREEAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSI 521


>gi|452846807|gb|EME48739.1| glycoside hydrolase family 3 protein [Dothistroma septosporum
           NZE10]
          Length = 802

 Score =  464 bits (1193), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 278/757 (36%), Positives = 408/757 (53%), Gaps = 59/757 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+T+    TRA +LI+  TLQEK+      +  +PRLG+PAY WW E+LHG+AS+ PGVN
Sbjct: 39  CDTTADPLTRATALINAFTLQEKLNNTGSTSPGVPRLGLPAYTWWQEALHGVASS-PGVN 97

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G     TSFPQ ++  A+F+  L  ++ + ++ EARA  N  +AGL FW PNIN F
Sbjct: 98  FSDSGPFRYATSFPQPILMGAAFDDDLIRDVATVISTEARAFNNDKRAGLDFWTPNINPF 157

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +D RWGRGQETPGEDP  +S+Y    ++  QG     DD       + KRV+        
Sbjct: 158 KDSRWGRGQETPGEDPYHLSSYVAALIEGLQG---SPDD-------KYKRVV-------- 199

Query: 233 RGDELMLSACCKHLIAYDLEKW-GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                   A CKH +AYD+E W GNF RY F+A ++ QD  + + PPF+ C         
Sbjct: 200 --------ATCKHFVAYDMESWNGNF-RYQFDAQVSSQDLVEYYMPPFQQCARDSNVGAF 250

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSA 347
           MCSYN +NGVP C    L Q   R +W +   + ++TSDCDAV  +F   +Y  T E++A
Sbjct: 251 MCSYNALNGVPTCADPWLLQTVLREKWNWTSEQQWVTSDCDAVQNVFLPHDYASTREEAA 310

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  LKAG DINCGT    H  +A D+G +   D+D +L+  +S  +RLG F+G      Y
Sbjct: 311 ALSLKAGTDINCGTYYQDHLPAAYDQGLINTTDLDISLIRQYSSLVRLGYFDG--LAVPY 368

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             L  +DV T   ++LA  AA +GI LLKND   LPL  +  +S+A+IG   N   QM G
Sbjct: 369 RNLTWNDVSTPHAQQLAYKAAAEGITLLKND-GVLPLTISNGTSIALIGDWANATDQMLG 427

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y GIP    S L   +   +  ++A+G       +   +      A K+D +I   G+D
Sbjct: 428 NYDGIPPFFHSPLYAAQQTGATVNFATGPGGQGDPTTDHWLPVWAAANKSDVIIYAGGID 487

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            + E+E  DRVSL   G Q+ ++  +A   K PVI++  GGG +D S    +  +S+++W
Sbjct: 488 NSVESEGMDRVSLTWTGAQLDMIGQLAMYGK-PVIVLQMGGGQIDSSPLVNNPNVSALIW 546

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYR 646
            GYPG+ G  AL +II G   P GRLP T YP  + ++VPM DM +R +S+   PGR+Y 
Sbjct: 547 GGYPGQDGGVALFDIIRGITAPAGRLPTTQYPAKYISQVPMTDMTLRPNSTTGSPGRTYI 606

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +Y    V+ +G GL YTN++           I  S  +  D +      +  D   I  +
Sbjct: 607 WYNENAVFPYGLGLHYTNFT---------AAIKPSFPSTYDSSSSNSGSASYD---ISTL 654

Query: 707 TS-CTSLRFHV------QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
           TS CT+    +       +S+TN G++   +V + F          P K+L+ + R+H +
Sbjct: 655 TSNCTATYKDLCPFTSFSVSITNTGEIMSDYVTLGFLAGIHGPAPHPNKRLVSYQRLHNI 714

Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
             GS + ++       L+  ++ G ++L  G++ L+V
Sbjct: 715 TAGSSQTAWLNLTLGSLARVDEMGNKVLYPGDYALLV 751


>gi|119473971|ref|XP_001258861.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
 gi|292495290|sp|A1DJS5.1|XYND_NEOFI RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|119407014|gb|EAW16964.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
          Length = 771

 Score =  463 bits (1191), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 290/756 (38%), Positives = 409/756 (54%), Gaps = 72/756 (9%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TSL ++TRA+SL++ +T +EK+     N+  +PRLG+PAY WWSE+LHG+A + PGV 
Sbjct: 43  CDTSLDVTTRARSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWSEALHGVAGS-PGVE 101

Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F  +G  S  TSFPQ ++  A+F+  L   + + V+ E RA  N G+AGL FW PNIN F
Sbjct: 102 FADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAGRAGLDFWTPNINPF 161

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RD RWGRGQETPGEDP+ VS Y    V   Q           G G    +V+        
Sbjct: 162 RDARWGRGQETPGEDPLHVSRYVYHLVDGLQN----------GIGPANPKVV-------- 203

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                   A CKH  AYDLE W    R+SFNA ++ QD  + + PPF+SC    K   +M
Sbjct: 204 --------ATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDAKVDAVM 255

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAA 348
           CSYN +NGVPAC    L Q   R  W +     +IT DC A+  I+   NYTKT  ++AA
Sbjct: 256 CSYNALNGVPACADSYLLQTILREHWKWDEPGHWITGDCGAIDDIYNGHNYTKTPAEAAA 315

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
             L AG D++CGT   ++   A D+G    K +D+AL+ L+S  ++LG F  DP + + Y
Sbjct: 316 TALNAGTDLDCGTVFPKYLGQAADEGLYTNKTLDKALVRLYSSLVKLGYF--DPAEDQPY 373

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             +G  DV +   + LA  AA +GIVLLKNDK  LPL   A  +LA+IGP  N   QM G
Sbjct: 374 RSIGWKDVDSPAAEALAHKAAVEGIVLLKNDKT-LPL--KAKGTLALIGPYANATKQMQG 430

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G P   ++LL           Y +G   +  NS AGF  A+  AK+AD V+   G+D
Sbjct: 431 NYEGPPKYIRTLLWAATQAGYDVKYVAGTA-INANSTAGFDAALSAAKQADVVVYAGGID 489

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T E E  DR +++ PG Q+ L+  +++  K P+++V  GGG +D S   ++  ++++LW
Sbjct: 490 NTIEAEGHDRTTIVWPGNQLDLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPHVNALLW 548

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
            GYP + G  A+ +I+ G   P GRLP+T YP  +  +VP+ DM +R  S+   PGR+YR
Sbjct: 549 TGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPLTDMALRPGSNT--PGRTYR 606

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +Y    V  FG GL YT +   +         +A+L + S KN           V ID  
Sbjct: 607 WYD-KAVLPFGFGLHYTTFKISWPRRALGPYDTAALVSRSPKN-----------VPIDRA 654

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFAR-VPKVSQGTPEKQLIGFDRVHTVAKGSK- 764
              T   FH+Q  VTN G     +V +LF + +    +  P K L+G+ R   +  G K 
Sbjct: 655 AFDT---FHIQ--VTNTGKTTSDYVALLFLKTIDAGPKPYPLKTLVGYTRAKQIKPGEKR 709

Query: 765 ----EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
               ++S G      L+   ++G  +L  G + L V
Sbjct: 710 SVDIKVSLG-----SLARTAENGDLVLYPGRYTLEV 740


>gi|407922988|gb|EKG16078.1| Glycoside hydrolase family 3 [Macrophomina phaseolina MS6]
          Length = 800

 Score =  462 bits (1190), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 272/751 (36%), Positives = 409/751 (54%), Gaps = 57/751 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG-- 112
           C++S +   RA +L+  LTL+EK+    + +  +PRLGIP Y+WW+E+LHG+A   PG  
Sbjct: 41  CDSSATPLARATALVKELTLEEKLNNTGNTSPGVPRLGIPEYQWWNEALHGVAFTYPGQP 100

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           +  +G  SS TSFPQ ++  A+F+  L   + S V+ EARA  N G++GL +W PNIN +
Sbjct: 101 MTESGNFSSATSFPQPILMGAAFDDELIYEVASVVSTEARAYSNGGRSGLDYWTPNINPY 160

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGEDP  +++Y    ++  +G                         ++D
Sbjct: 161 KDPRWGRGQETPGEDPFHLASYVQNLIRGLEGN------------------------QND 196

Query: 233 RGDELMLSACCKHLIAYDLEKW-GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
              +++  A CKH   YD+E W GNF RY F+A I  +D  + + PPF++C  + K    
Sbjct: 197 PYKKIV--ATCKHFTGYDMENWNGNF-RYQFDAQINMRDMVEYYMPPFQACAREAKVGAF 253

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSA 347
           MCSYN VNGVP C    L Q   R  WG+     ++ SDCDA+  ++    + ++ E + 
Sbjct: 254 MCSYNAVNGVPTCADPWLLQTVLREHWGWNQEDQWVVSDCDAIQNVYLPHEWAESREQAV 313

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF-NGDPRKGK 406
           A  L AG D+NCGT   R+   A ++G + +  +DRAL   +S  ++LG F N D +   
Sbjct: 314 ADTLNAGTDLNCGTYYQRYLPGAYEQGLINDTTLDRALTRTYSSLIKLGYFDNADSQ--P 371

Query: 407 YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG 466
           Y ++G  DV +   ++LAL AA++GIVLLKND   LPL+ + VSS+A+IG   N   QM 
Sbjct: 372 YRQIGWQDVNSQHAQELALKAAQEGIVLLKND-GLLPLSLDGVSSIALIGSWANATEQMQ 430

Query: 467 GGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
           G Y G+     S L   E    K +YA G       +D    E    A+ +D +IVV G+
Sbjct: 431 GNYAGVAPYLHSPLYAAEQLGVKVNYAEGASQSNPTTDQWGAE-YTAAENSDVIIVVGGI 489

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
           D   E+E+ DRV++   G Q+ ++T +A T  +PVI+V  G G LD +   +++ IS++L
Sbjct: 490 DNDIESEELDRVAIAWSGPQLDMITKLA-TYGKPVIVVQMGAGQLDSTPLVSNANISALL 548

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSY 645
           W GYPG+ G  AL +II G   P GRLP+T YP  +TK V M DM++R  S+    GR+Y
Sbjct: 549 WGGYPGQDGGTALFDIITGAVAPAGRLPITQYPARYTKEVAMTDMSLRPSSTSA--GRTY 606

Query: 646 RFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
           ++Y GT V+ FG GL YTN+S    S P+     + L A    N      S+LD      
Sbjct: 607 KWYNGTAVFPFGFGLHYTNFSAAIPSPPASSFAISDLVASCSAN----DTSKLD------ 656

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
           +   TSL     + + N G      V + F          P+  L+ + R+H +A G  +
Sbjct: 657 LCPFTSL----AVDIANDGTRASDFVALAFLTGEFGPSPHPKSSLVAYQRLHAIAAGETQ 712

Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
            +        L   +++G ++L  G++ +++
Sbjct: 713 TARLNLTLGSLVRVDENGDKLLYPGDYSVLI 743


>gi|391865040|gb|EIT74331.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 822

 Score =  462 bits (1190), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 288/781 (36%), Positives = 425/781 (54%), Gaps = 78/781 (9%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C+     S+P C+TSLSI+ R  SL+  LTL+EKI  L D ++   RLG+P+YEWWSE+ 
Sbjct: 69  CQTQPLCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEAT 128

Query: 104 HGIASNGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           HG+ S  PGV F       S  TSFP  +++AASF+ +L   I   +  E RA  N G +
Sbjct: 129 HGVGS-APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFS 187

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           G  FWAPNIN FRDPRWGRGQETPGEDP+V   Y   FV   QG++ K+           
Sbjct: 188 GFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKN----------- 236

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
           K+V+                A CKH   YDLE      RY  N   T+QD  D F  PF+
Sbjct: 237 KQVI----------------ATCKHYAVYDLET----GRYGNNYNPTQQDLSDYFLAPFK 276

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEY 336
           +C+       IMCSYN V+G+PAC    L  +  R  W F     Y+ SDC AV  I++Y
Sbjct: 277 TCVRDTDVGSIMCSYNSVSGIPACANEYLLDEVLRKHWNFNSDYYYVVSDCGAVTDIWQY 336

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
            N+T T E +A+  L AG+D+ CG+  L+  +S +   +   K +DR+L  L+S    +G
Sbjct: 337 HNFTDTEEAAASVALNAGVDLECGSSYLKLNES-LAANQTSVKVMDRSLARLYSALFTVG 395

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN-KNAVSSLAII 455
            F+G    GKY KL   DV T + + LA +AA +G+ LLKND   LPL+  +   S+A+I
Sbjct: 396 FFDG----GKYDKLDFSDVSTPDAQALAYEAAVEGMTLLKND-DLLPLDFPHKYKSVAVI 450

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVR 512
           GP  N  +QM G Y+G       L+  LEA+     K +YA G   +   + +GF EA+ 
Sbjct: 451 GPFANATTQMQGDYSG---DAPYLISPLEAFGDSRWKVNYALGTA-INNQNTSGFEEALA 506

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
            A K+D +I + G+D + E+E  DR SL  PG Q+ L+TS+++ SK P+++V  GGG +D
Sbjct: 507 AANKSDLIIYLGGIDNSLESETLDRTSLAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVD 565

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMN 631
            S    +  I +++W GYP ++G  AL +++ G  +P GRLP+T YP S+  +V + D+N
Sbjct: 566 DSAILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDIN 625

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
           +R   S  YPGR+Y++YTG  V  FG+GL YT + + +     E T++       + NI 
Sbjct: 626 LRPTDS--YPGRTYKWYTGKPVLPFGYGLHYTKFMFDW-----EKTLN------REYNIQ 672

Query: 692 QQTGSRLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEK 748
               S  +     I++ T  T+++  V+    N G     +V +LF           P K
Sbjct: 673 DLVASCRNSSGGPINDNTPLTTVKARVK----NVGHKTSDYVSLLFLSSKNAGPAPRPNK 728

Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG---ELRHSLTI 805
            L+ + R+  +A+GS +++        L+ A+++G  ++  G + + +    EL    T+
Sbjct: 729 SLVSYVRLLNIARGSDQVAELPLTLGSLARADENGSLVIFPGRYKIALDNSEELTFEFTL 788

Query: 806 E 806
           +
Sbjct: 789 K 789


>gi|297039776|gb|ADH95739.1| beta-xylosidase [Aspergillus fumigatus]
          Length = 771

 Score =  462 bits (1188), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 294/770 (38%), Positives = 413/770 (53%), Gaps = 73/770 (9%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C          C+TSL ++TRA+SL++ +T +EK+     N+  +PRLG+PAY WWS
Sbjct: 29  FPDCSSGPLSKLAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWS 88

Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+A + PGV F  +G  S  TSFPQ ++  A+F+  L   + + V+ E RA  N G
Sbjct: 89  EALHGVAGS-PGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAG 147

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           ++GL FW PNIN FRD RWGRGQETPGEDP+ VS Y    V   Q           G G 
Sbjct: 148 RSGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQN----------GIGP 197

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
              +V+                A CKH  AY LE W    R+SFNA ++ QD  + + PP
Sbjct: 198 ANPKVV----------------ATCKHFAAYGLEDWNGVVRHSFNAEVSTQDLSEFYLPP 241

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNE---WGFKG-YITSDCDAVATIF 334
           F+SC    +   +MCSYN +NGVPAC    L Q    E   W   G +ITSDC A+  I+
Sbjct: 242 FKSCARDARVDAVMCSYNALNGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIY 301

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
              N+T T  ++AA  L AG D++CGT   ++   A D+G    + +DRAL+ L+S  ++
Sbjct: 302 NGHNFTTTPAEAAATALNAGTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSFVK 361

Query: 395 LGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           LG F  DP + + Y  +G  DV T   + LA  AA +GIVLLKNDK  LPL   A  +LA
Sbjct: 362 LGYF--DPAEDQPYRSIGWTDVDTPAVEALAHKAAGEGIVLLKNDKT-LPL--KAKGTLA 416

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
           +IGP  N   QM G Y G     ++LL           YA+G   +  NS AGF  A+  
Sbjct: 417 LIGPYANATKQMQGNYEGPAKYIRTLLWAATQAGYDVKYAAGTA-INTNSTAGFDAALSA 475

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           AK+AD V+   G+D T E E RDR ++  PG Q++L+  +++  K P+++V  GGG +D 
Sbjct: 476 AKQADVVVYAGGIDNTIEAEGRDRTTIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDD 534

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNM 632
           S   ++ +++++LW GYP + G  A+ +I+ G   P GRLP+T YP  +  +VPM DM +
Sbjct: 535 SSLLSNPRVNALLWAGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMAL 594

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           R  S+   PGR+YR+Y    V  FG GL YT +   +         +A+L + S KN   
Sbjct: 595 RPGSNT--PGRTYRWYD-KAVLPFGFGLHYTTFKISWPRRALGPYNTAALVSRSPKN--- 648

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR-VPKVSQGTPEKQLI 751
                   V ID     T   FH+Q  VTN G     +V +LF + +    +  P K L+
Sbjct: 649 --------VPIDRAAFDT---FHIQ--VTNTGKTTSDYVALLFLKTIDAGPKPYPLKTLV 695

Query: 752 GFDRVHTVAKGSK-----EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           G+ R   +  G K     E+S G      L+   ++G  +L  G + L V
Sbjct: 696 GYTRAKQIKPGEKRSVDIEVSLG-----SLARTAENGDLVLYPGRYTLEV 740


>gi|83774566|dbj|BAE64689.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 822

 Score =  461 bits (1187), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 292/780 (37%), Positives = 425/780 (54%), Gaps = 80/780 (10%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C+     S+P C+TSLSI+ R  SL+  LTL+EKI  L D ++   RLG+P+YEWWSE+ 
Sbjct: 69  CQTQPLCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEAT 128

Query: 104 HGIASNGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           HG+ S  PGV F       S  TSFP  +++AASF+ +L   I   +  E RA  N G +
Sbjct: 129 HGVGS-APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFS 187

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           G  FWAPNIN FRDPRWGRGQETPGEDP+V   Y   FV   QG++ K+           
Sbjct: 188 GFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKN----------- 236

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
           K+V+                A CKH   YDLE      RY  N   T+QD  D F  PF+
Sbjct: 237 KQVI----------------ATCKHYAVYDLET----GRYGNNYNPTQQDLSDYFLAPFK 276

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEY 336
           +C+       IMCSYN V+G+PAC    L  +  R  W F     Y+ SDC AV  I++Y
Sbjct: 277 TCVRDTDVGSIMCSYNSVSGIPACANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQY 336

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
            N+T T E +A+  L AG+D+ CG+  L+  +S +   +   K +D++L  L+S    +G
Sbjct: 337 HNFTDTEEAAASVALNAGVDLECGSSYLKLNES-LAANQTSVKVMDQSLARLYSALFTVG 395

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK-NAVSSLAII 455
            F+G    GKY KL   DV T + + LA +AA +G+ LLKND   LPL+  +   S+A+I
Sbjct: 396 FFDG----GKYDKLDFSDVSTPDAQALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVI 450

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVR 512
           GP  N  +QM G Y+G       L+  LEA+     K +YA G      N+ +GF EA+ 
Sbjct: 451 GPFANATTQMQGDYSG---DAPYLISPLEAFGDSRWKVNYALGTAMNNQNT-SGFEEALA 506

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
            A K+D +I + G+D + E+E  DR SL  PG Q+ L+TS+++ SK P+++V  GGG +D
Sbjct: 507 AANKSDLIIYLGGIDNSLESETLDRTSLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVD 565

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMN 631
            S    +  I +++W GYP ++G  AL +++ G  +P GRLP+T YP S+  +V + D+N
Sbjct: 566 DSDILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDIN 625

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
           +R   S  YPGR+Y++YTG  V  FG+GL YT + + +     E T++       + NI 
Sbjct: 626 LRPTDS--YPGRTYKWYTGKPVLPFGYGLHYTKFMFDW-----EKTLN------REYNIQ 672

Query: 692 QQTGSRLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEK 748
               S  +     I++ T  T+    V++ V N G     +V +LF           P K
Sbjct: 673 DLVASCRNSSGGPINDNTPLTT----VKVRVKNVGHKTSDYVSLLFLSSKNAGPAPRPNK 728

Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS--LTIE 806
            L+ + R+  +A+GS +++        L+ A+++G  ++  G + +    L HS  LT E
Sbjct: 729 SLVSYVRLLNIARGSDQVAELPLTLGSLARADENGSLVIFPGRYKIA---LDHSEELTFE 785


>gi|317156541|ref|XP_001825822.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
          Length = 882

 Score =  461 bits (1185), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 292/780 (37%), Positives = 425/780 (54%), Gaps = 80/780 (10%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C+     S+P C+TSLSI+ R  SL+  LTL+EKI  L D ++   RLG+P+YEWWSE+ 
Sbjct: 129 CQTQPLCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEAT 188

Query: 104 HGIASNGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           HG+ S  PGV F       S  TSFP  +++AASF+ +L   I   +  E RA  N G +
Sbjct: 189 HGVGS-APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFS 247

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           G  FWAPNIN FRDPRWGRGQETPGEDP+V   Y   FV   QG++ K+           
Sbjct: 248 GFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKN----------- 296

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
           K+V+                A CKH   YDLE      RY  N   T+QD  D F  PF+
Sbjct: 297 KQVI----------------ATCKHYAVYDLET----GRYGNNYNPTQQDLSDYFLAPFK 336

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEY 336
           +C+       IMCSYN V+G+PAC    L  +  R  W F     Y+ SDC AV  I++Y
Sbjct: 337 TCVRDTDVGSIMCSYNSVSGIPACANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQY 396

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
            N+T T E +A+  L AG+D+ CG+  L+  +S +   +   K +D++L  L+S    +G
Sbjct: 397 HNFTDTEEAAASVALNAGVDLECGSSYLKLNES-LAANQTSVKVMDQSLARLYSALFTVG 455

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK-NAVSSLAII 455
            F+G    GKY KL   DV T + + LA +AA +G+ LLKND   LPL+  +   S+A+I
Sbjct: 456 FFDG----GKYDKLDFSDVSTPDAQALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVI 510

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVR 512
           GP  N  +QM G Y+G       L+  LEA+     K +YA G      N+ +GF EA+ 
Sbjct: 511 GPFANATTQMQGDYSG---DAPYLISPLEAFGDSRWKVNYALGTAMNNQNT-SGFEEALA 566

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
            A K+D +I + G+D + E+E  DR SL  PG Q+ L+TS+++ SK P+++V  GGG +D
Sbjct: 567 AANKSDLIIYLGGIDNSLESETLDRTSLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVD 625

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMN 631
            S    +  I +++W GYP ++G  AL +++ G  +P GRLP+T YP S+  +V + D+N
Sbjct: 626 DSDILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDIN 685

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
           +R   S  YPGR+Y++YTG  V  FG+GL YT + + +     E T++       + NI 
Sbjct: 686 LRPTDS--YPGRTYKWYTGKPVLPFGYGLHYTKFMFDW-----EKTLN------REYNIQ 732

Query: 692 QQTGSRLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEK 748
               S  +     I++ T  T+    V++ V N G     +V +LF           P K
Sbjct: 733 DLVASCRNSSGGPINDNTPLTT----VKVRVKNVGHKTSDYVSLLFLSSKNAGPAPRPNK 788

Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS--LTIE 806
            L+ + R+  +A+GS +++        L+ A+++G  ++  G + +    L HS  LT E
Sbjct: 789 SLVSYVRLLNIARGSDQVAELPLTLGSLARADENGSLVIFPGRYKIA---LDHSEELTFE 845


>gi|392560759|gb|EIW53941.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
           SS1]
          Length = 783

 Score =  460 bits (1183), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 285/770 (37%), Positives = 414/770 (53%), Gaps = 62/770 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+ +    TRA +LI L T +E      + +  +PRLG+PAY WWSE LHG+A + PGV 
Sbjct: 41  CDITKDPITRATALIGLWTDEELTSNTVNASPGVPRLGLPAYNWWSEGLHGVAQS-PGVT 99

Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F  +G  S  TSFPQ ++  A+F+ +L   I + V+ E RA  N G+AGL +W PNIN F
Sbjct: 100 FAPSGNFSHATSFPQPILMGAAFDDTLIQAIATIVSTEGRAFNNAGRAGLDYWTPNINPF 159

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGEDP  +S Y    +   QG            G   K   K       
Sbjct: 160 KDPRWGRGQETPGEDPFHLSQYVYNLILGLQG------------GLDPKPYFK------- 200

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                 + A CKH  AYDLE W    R  F+A++++QD  + + PPF++C+   K + +M
Sbjct: 201 ------VVADCKHFAAYDLENWEGIVRNGFDAIVSQQDLSEFYLPPFQTCVRDAKVASVM 254

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAG 349
           CSYN VNG+P+C    L Q   R+ WGF    ++TSDCDAV  I     YT     +AA 
Sbjct: 255 CSYNAVNGIPSCANSFLLQDVLRDHWGFTDDRWVTSDCDAVENILTPHKYTTDPAQAAAD 314

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            L AG DI+CGT    +   A+ +G V   D+ RA +  ++  +RLG F+ DP    Y +
Sbjct: 315 ALLAGTDIDCGTFSSTYLPEALQRGLVNSTDLRRAAIRQYASLVRLGYFD-DPAAQPYRQ 373

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DV T + ++LA  AA +GIVLLKND   LP +K+ V  LA+IGP  N  S + G Y
Sbjct: 374 LGWSDVNTPQAQQLAHTAAVEGIVLLKND-GVLPFSKH-VRKLALIGPWANATSLLQGSY 431

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
            G+     S L+G +    +  Y  G +    N  +GF  AV   ++AD V+   GLD T
Sbjct: 432 IGVAPYLVSPLQGAQEAGFEVEYVLGTNVTTQNDMSGFAAAVAAVRRADAVVFAGGLDET 491

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
            E E  DR+++  PG Q+ LV  + R  K P+I+   GGG LD +  +    +++I+W G
Sbjct: 492 VECEGTDRLNVTWPGNQLDLVAELERVGK-PLIVAQFGGGQLDDTALKHSKAVNAIIWGG 550

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFY 648
           YPG++G  AL +I+ G   P GRLP+T YP ++TK VPM DM++R  ++   PGR+Y++Y
Sbjct: 551 YPGQSGGTALFDILTGKAAPAGRLPITQYPAAYTKQVPMTDMSLRPSATN--PGRTYKWY 608

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL---------QQTGSRLD 699
           +GT V+ FG GL YT + + + +  +   + ++   GS              Q++ + LD
Sbjct: 609 SGTPVFEFGFGLHYTTFVFSWAAPSAAAAVDSTASFGSLAKSYSISQLVAHGQESTAFLD 668

Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
              +D             + VTN G V   +V +LF          P+KQL+ + RVH +
Sbjct: 669 LAPLDTFA----------VRVTNTGRVASDYVALLFVSGAFGPAPHPKKQLVAYTRVHGL 718

Query: 760 A-KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV---GELRHSLTI 805
           A +GS      V     ++ A+K+G + +  G + L +     L H+ T+
Sbjct: 719 APRGSTVAQLPVT-LGAIARADKNGEKWVHPGTYTLALDTDAVLTHTFTL 767


>gi|70986056|ref|XP_748529.1| beta-xylosidase [Aspergillus fumigatus Af293]
 gi|74668295|sp|Q4WFI6.1|BXLB_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|296439536|sp|B0Y0I4.1|BXLB_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|66846158|gb|EAL86491.1| beta-xylosidase, putative [Aspergillus fumigatus Af293]
 gi|159128339|gb|EDP53454.1| beta-xylosidase [Aspergillus fumigatus A1163]
          Length = 771

 Score =  459 bits (1182), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 295/770 (38%), Positives = 413/770 (53%), Gaps = 73/770 (9%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C          C+TSL ++TRA+SL++ +T +EK+     N+  +PRLG+PAY WWS
Sbjct: 29  FPDCSSGPLSKLAVCDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAYNWWS 88

Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+A + PGV F  +G  S  TSFPQ ++  A+F+  L   + + V+ E RA  N G
Sbjct: 89  EALHGVAGS-PGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAFGNAG 147

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           ++GL FW PNIN FRD RWGRGQETPGEDP+ VS Y    V   Q           G G 
Sbjct: 148 RSGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQN----------GIGP 197

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
              +V+                A CKH  AYDLE W    R+SFNA ++ QD  + + PP
Sbjct: 198 ANPKVV----------------ATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPP 241

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNE---WGFKG-YITSDCDAVATIF 334
           F+SC    +   +MCSYN +NGVPAC    L Q    E   W   G +ITSDC A+  I+
Sbjct: 242 FKSCARDARVDAVMCSYNALNGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIY 301

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
              N+T T  ++AA  L AG D++CGT   ++   A D+G    + +DRAL+ L+S  ++
Sbjct: 302 NGHNFTTTPAEAAATALNAGTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSLVK 361

Query: 395 LGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           LG F  DP + + Y  +G  DV T   + LA  AA +GIVLLKNDK  LPL   A  +LA
Sbjct: 362 LGYF--DPAEDQPYRSIGWTDVDTPAAEALAHKAAGEGIVLLKNDKT-LPL--KAKGTLA 416

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
           +IGP  N   QM G Y G     ++LL           YA+G   +  NS AGF  A+  
Sbjct: 417 LIGPYANATKQMQGNYEGPAKYIRTLLWAATQAGYDVKYAAGTA-INTNSTAGFDAALSA 475

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           AK+AD V+   G+D T E E RDR ++  PG Q++L+  +++  K P+++V  GGG +D 
Sbjct: 476 AKQADVVVYAGGIDNTIEAEGRDRTTIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDD 534

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNM 632
           S   ++ +++++LW GYP + G  A+ +I+ G   P GRLP+T YP  +  +VPM DM +
Sbjct: 535 SSLLSNPRVNALLWAGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMAL 594

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           R  S+   PGR+YR+Y    V  FG GL YT +   +         +A+L + S KN   
Sbjct: 595 RPGSNT--PGRTYRWYD-KAVLPFGFGLHYTTFKISWPRRALGPYNTAALVSRSPKN--- 648

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLI 751
                   V ID     T   FH+Q  VTN G     +V +LF +      +  P K L+
Sbjct: 649 --------VPIDRAAFDT---FHIQ--VTNTGKTTSDYVALLFLKTTDAGPKPYPLKTLV 695

Query: 752 GFDRVHTVAKGSK-----EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           G+ R   +  G K     E+S G      L+   ++G  +L  G + L V
Sbjct: 696 GYTRAKQIKPGEKRSVDIEVSLG-----SLARTAENGDLVLYPGRYTLEV 740


>gi|344303941|gb|EGW34190.1| hypothetical protein SPAPADRAFT_65353 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 788

 Score =  459 bits (1180), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 272/810 (33%), Positives = 424/810 (52%), Gaps = 83/810 (10%)

Query: 19  LQLIVVVNVIAFSNSKPVLNKPDF-----PCKPP------HFDSYP-----------FCN 56
           ++L+  + +  F+N +   + PD+     P  PP      H+ ++P            CN
Sbjct: 4   IKLLATLALAIFANGETPPSSPDYNTEANPQLPPITFEAVHY-TFPDCQNGPLKHNAVCN 62

Query: 57  TSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF- 115
             L    RAK+++ L T+ E I  + + +  + RLG+P Y+WWSE+LHGIA +    NF 
Sbjct: 63  PHLPTEQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEALHGIARS----NFT 118

Query: 116 -NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
            +G  S  TSFPQ ++   +FN  L+  +G+ +  EARA  N+G+AGL F++PNIN FRD
Sbjct: 119 ASGEYSHATSFPQPILMGGAFNNDLYKQVGNVIGTEARAFNNVGRAGLDFYSPNINPFRD 178

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
            RWGRGQE   E P++V  YA+ +V+  QG                       G +S++ 
Sbjct: 179 ARWGRGQEVASESPVLVGNYALNYVQGLQG-----------------------GLDSNQN 215

Query: 235 DE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
           D+ L ++A CKH + YD+E W   SR  +NA+I++QD  D + P F+SC+   KA+  MC
Sbjct: 216 DDTLQVAATCKHFVGYDMESWNQHSRLGYNAIISDQDLADFYLPTFQSCVRDAKAAGAMC 275

Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFK-GYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
           SYN VNGVPAC          R+ + F+ G I SDCDA+  ++    Y +    +AA  +
Sbjct: 276 SYNAVNGVPACASEFFLNTVLRDGFDFQNGVIHSDCDAIYNVWNPHLYAQDLGGAAADAI 335

Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
           KAG+D+NCG     +   A+    + E  I  ++   +S  +RLG F+  P+  KY K  
Sbjct: 336 KAGVDVNCGDTYQNNLGYALGNKTINENQIRTSVTRQYSNLIRLGYFD-SPQTNKYRKYD 394

Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
            +DV T +  +LA  AA +GI LLKND   LP NK  V  +A+IGP  N  +QM G Y G
Sbjct: 395 WNDVSTPQANQLAYQAAVEGIALLKNDGT-LPFNKQKVRKVAVIGPWANATTQMLGDYAG 453

Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
            P    S L+G ++   +  YA G   +     +G+  A+  AK AD ++   G+D + E
Sbjct: 454 TPPYMISPLQGAQSEGFQVEYALGTQ-INTTDTSGYTAALNAAKGADAIVYFGGIDNSVE 512

Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
            E  DR SL  PG Q+ LV+ ++   K+P++++  GGG +D +  + +  +++I++ GYP
Sbjct: 513 NEALDRESLAWPGNQLDLVSKLSGL-KKPLVVLQFGGGQIDDTEIKNNKNVNAIVYAGYP 571

Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQ-YPGRSYRFYT 649
           G++G  A+ +I+ G + P GRL  T YP S+  +VPM DM +R    RQ YPGR++ +Y 
Sbjct: 572 GQSGGTAIWDILSGKYAPAGRLTTTQYPASYADQVPMTDMTLR---PRQGYPGRTFMWYN 628

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ---TGSRLDYVHIDEV 706
           G  VY FG+GL YT +S    +AP         + G     ++Q      R  YV    +
Sbjct: 629 GEPVYEFGYGLHYTTFSASLANAP---------RGGHQSFNIEQVVAAAKRSQYVDTGLI 679

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
           T+         +++ N G     +  +L+++        P K L+ FD++H +  G  + 
Sbjct: 680 TT-------FDVNIKNTGKTTSDYAALLYSKTTAGPGPHPNKILVSFDKLHQIHAGQTQT 732

Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           +        L   + +G + L  G +   V
Sbjct: 733 AKLPVTIGSLLQTDTNGNKWLYPGTYTFFV 762


>gi|440799679|gb|ELR20723.1| betaxylosidase [Acanthamoeba castellanii str. Neff]
          Length = 748

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 284/781 (36%), Positives = 403/781 (51%), Gaps = 114/781 (14%)

Query: 43  PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSES 102
           P +       PFCNTSL+   R   L+S LTL + I Q+   A A+P LGIPAY WW+E 
Sbjct: 4   PLQDAPLKDLPFCNTSLTAGQRTDDLVSRLTLDQLIGQMGHQAPAVPSLGIPAYNWWTEC 63

Query: 103 LHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGL 162
           LHG+ +   G N        TSFP      A+FN  L   +  A++ EARA+ N G  GL
Sbjct: 64  LHGVLTKC-GTN------CPTSFPAPCALGAAFNMKLIHKMARAISNEARALNNEGIGGL 116

Query: 163 TFWAPNI-----------------------NIFRDPRWGRGQETPGEDPMVVSAYAVEFV 199
            FWAPNI                       +I RDPRWGR  E PGEDP + + Y   F+
Sbjct: 117 DFWAPNIKYSTQPTNKTRQESQLRNAMVCISINRDPRWGRNMEVPGEDPFMTAQYVAHFM 176

Query: 200 KSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSR 259
           +  Q                        GE+S R  +++    CKH  AY LE W ++ R
Sbjct: 177 RGLQE-----------------------GEDS-RYPQVV--GTCKHFAAYSLEAWKDYDR 210

Query: 260 YSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWG 318
           + F+A++++ D  +T+ P F+ CI +G+A  IMCSYN VNGVP+C    L +   R+ W 
Sbjct: 211 FMFDAIVSDYDFVETYLPAFKGCIVEGRARSIMCSYNSVNGVPSCANDFLLRTILRDSWS 270

Query: 319 FKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQE 378
           F GY+ SDCDAV TI+   ++TKT E + A  L AG D+NCG    +H   A  +G+V E
Sbjct: 271 FDGYVVSDCDAVDTIYNNHHFTKTPEGACAVALHAGTDLNCGDFYQKHLGKAHSEGRVTE 330

Query: 379 KDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND 438
            ++  A+  LF  ++ LG+++  P +  Y +  P  V + EH  LAL AAR+ +VLL+N 
Sbjct: 331 DEVRLAVKRLFRQRMELGMWD-PPAEQPYKQYPPSVVGSREHSDLALQAARESMVLLQNR 389

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--------SKT 490
           +  LPL K +V  +A+IGP  N    M G Y G  C   +    +  Y+        +  
Sbjct: 390 RGVLPLRK-SVRRVAVIGPNANATETMLGNYYGSRCHDGTYDCIVSPYLAIKAKLPQALV 448

Query: 491 HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLV 550
            Y  GC DV   +  G  EAV+ A+ AD  IVV GL+ + E+E +DRV++ LPG Q  L+
Sbjct: 449 TYNLGC-DVDSTNTTGIPEAVKAAQAADVAIVVLGLNTSVESEGKDRVAITLPGMQDHLI 507

Query: 551 TSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPG 610
            S+  T+   V++++ GG    V+      Q+  I+   YPGE G +A+A+++FGD+NPG
Sbjct: 508 KSIVATNTPTVVVMMHGGA---VAIEWIKDQVDGIVDAFYPGENGGQAIADVLFGDYNPG 564

Query: 611 ----------GRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTG-TQVYGFGH 658
                     GRLP+T  P ++   VP+ +M+MRA  S   PGR+YR+YTG   ++ FG 
Sbjct: 565 DNKTDGTTLLGRLPVTVLPANYVDMVPLTNMSMRA--SGNNPGRTYRYYTGPAPLWEFGF 622

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT +  ++LS P    + +  +                    DE  S        ++
Sbjct: 623 GLSYTTFKTEWLSTPQPSALKSYAR--------------------DEAVS-------FRV 655

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            VTN G V G  VV+ F       +G P KQL  F+RVH     SKEI F   P + L++
Sbjct: 656 RVTNVGPVAGDEVVLAFVTRDNADRG-PLKQLFAFERVHLNPGESKEIFFNTGP-DTLAV 713

Query: 779 A 779
           A
Sbjct: 714 A 714


>gi|242786966|ref|XP_002480909.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
 gi|218721056|gb|EED20475.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
          Length = 757

 Score =  457 bits (1176), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 284/769 (36%), Positives = 417/769 (54%), Gaps = 72/769 (9%)

Query: 52  YPFCNTSLSI---STRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIAS 108
           YP     LS    + R KSLI  LTL+EKI  L D ++   RLG+P+YEWW+E+ HG+ S
Sbjct: 10  YPPAKQRLSAVINTQRVKSLIDSLTLEEKILNLVDASAGSERLGLPSYEWWNEATHGVGS 69

Query: 109 NGPGVNFNG---TVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFW 165
             PGV F       S  TSFP  +++AASF+ +L   I S +  E RA  N G +G  FW
Sbjct: 70  -APGVQFTEKPVNFSYATSFPAPILTAASFDDALVREIASVIGREGRAFGNNGFSGFDFW 128

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           APNIN FRDPRWGRGQETPGED  VV +Y   F+   QG++ +           +K+V+ 
Sbjct: 129 APNINPFRDPRWGRGQETPGEDSFVVQSYIRNFIPGLQGDDPE-----------DKQVI- 176

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                          A CKH  AYDLE      RY  +   T+QD  D F  PF++C+  
Sbjct: 177 ---------------ATCKHYAAYDLET----GRYGNDYNPTQQDLADYFLAPFKTCVRD 217

Query: 286 GKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKG---YITSDCDAVATIFEYQNYTK 341
                IMC+YN V+G+P C    L  Q  R  W F     Y+ SDC AV  I++Y N+T 
Sbjct: 218 TGVGSIMCAYNAVDGIPTCASEYLLDQVLRKHWNFTADYNYVVSDCGAVTDIWQYHNFTD 277

Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQS-AIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
           T E +A+  L AG+D+ CG+  L+  +S A ++  VQ   +D+AL  L+S    +G F+G
Sbjct: 278 TEEAAASVSLNAGVDLECGSSYLKLNESLAANQTTVQA--LDQALTRLYSALFTVGFFDG 335

Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL-NKNAVSSLAIIGPLV 459
               GKY  LG  DV T E + LA +AA +G+ LLKNDK+ LP+ + +   S+A+IGP  
Sbjct: 336 ----GKYTALGFADVSTPEAQSLAYEAAVEGMTLLKNDKRLLPIRSSHKYKSVALIGPFA 391

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
           N  +QM G Y+GIP    S L   + +  + +YA G   +   +  GF  A+  A+K+D 
Sbjct: 392 NATTQMQGDYSGIPPFLISPLEAFKGHDWEVNYAMGT-GINNQTTTGFASALAAAEKSDL 450

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           VI + G+D + E E  DR SL  PG Q+ LVT +++  K P+I+V  GGG LD S    +
Sbjct: 451 VIYLGGIDNSIEAETLDRTSLTWPGNQLDLVTQLSKLHK-PLIVVQFGGGQLDDSALLQN 509

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSR 638
             + +++W GYP ++G  AL +++ G  +  GRLP+T YP S+  +V + D+N+R + S 
Sbjct: 510 EGVQALVWAGYPSQSGGSALLDVLLGKRSIAGRLPVTQYPASYADQVSIFDINIRPNDS- 568

Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRL 698
            YPGR+Y++YTG  V  FG+GL YT + +++           +     + NI Q   S  
Sbjct: 569 -YPGRTYKWYTGMPVVPFGYGLHYTKFEFEW-----------AQTLNHEYNIQQLVASCQ 616

Query: 699 DYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVH 757
               I + T  T+++ HV+    N G     +V +LF   P       P K L+ + R+H
Sbjct: 617 STGPISDNTPFTTVKAHVK----NIGPEASDYVGLLFLSSPDAGPAPRPNKSLVSYLRLH 672

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            +  GS+           ++ A+++G  ++  G++ + + ++  SLT E
Sbjct: 673 NITSGSQGTLDLPLTLGSMARADENGNLVIFPGHYKIAL-DVSDSLTFE 720


>gi|392570764|gb|EIW63936.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
           SS1]
          Length = 781

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 288/796 (36%), Positives = 426/796 (53%), Gaps = 59/796 (7%)

Query: 27  VIAFSNSKPVLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNA 85
           + AFS     +    FP C      +   C+ +    TRA +LIS+ T +E      + +
Sbjct: 12  LTAFSVGAATVRAYGFPDCVNGPLKNNAVCDVTKDPITRATALISIWTDEELTNNTVNAS 71

Query: 86  SAIPRLGIPAYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNI 143
             +PRLG+PAY WWSE LHG+A + PGV F  +G  S  TSFPQ ++  A+F+  L   I
Sbjct: 72  PGVPRLGLPAYNWWSEGLHGVAQS-PGVTFAPSGNFSYATSFPQPILMGAAFDDPLIQAI 130

Query: 144 GSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ 203
            + V+ E RA  N G+AGL +W PNIN F+DPRWGRGQETPGEDP  +S Y    +   Q
Sbjct: 131 ATIVSTEGRAFNNAGRAGLDYWTPNINPFKDPRWGRGQETPGEDPFHLSQYVYNLILGLQ 190

Query: 204 GENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN 263
           G            G   K   K             + A CKH  AYD++ W    RY FN
Sbjct: 191 G------------GLDPKPYFK-------------VVADCKHFAAYDMDNWEGVVRYGFN 225

Query: 264 AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG- 321
           A++++QD  + + PPF++C+   K + +MCSYN VNG+P+C    L Q   R+ WGF   
Sbjct: 226 AVVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCANSFLLQDVLRDHWGFTDD 285

Query: 322 -YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKD 380
            ++TSDCDAV  IF   NYT     +AA  L AG DI+CGT    +   A+ +G V   D
Sbjct: 286 RWVTSDCDAVQNIFTPHNYTTDPAQAAADALLAGTDIDCGTFSSTYLPEALQRGLVNSTD 345

Query: 381 IDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
           + RA +  ++  +RLG F+ DP    Y +LG  DV T + ++LA  AA +G+VLLKND  
Sbjct: 346 LRRAAIRQYASLVRLGYFD-DPAAQPYRQLGWSDVNTLQAQQLAHTAAVEGMVLLKND-G 403

Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVP 500
            LPL+K  V  LA+IGP  N    + G Y GI     S ++G +    +  Y  G +   
Sbjct: 404 LLPLSKR-VRKLALIGPWANATRLLQGNYFGIAPYLVSPVQGAQQAGFEVEYVFGTNVTT 462

Query: 501 CNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP 560
            N  +GF  AV  AK+AD V+   GLD T E E+ DR+++  PG Q+ LV  + R  K P
Sbjct: 463 RNDTSGFAAAVAAAKRADAVVFAGGLDETVEREEIDRLNVTWPGNQLDLVAELERVGK-P 521

Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
           +I+   GGG LD +  +    +++I+W GYPG++G  AL +I+ G   P GRLP+T YP 
Sbjct: 522 LIVAQFGGGQLDNTALKRSKAVNAIIWGGYPGQSGGTALFDILTGKAAPAGRLPITQYPA 581

Query: 621 SFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
           ++  +VPM DM +R  ++   PGR+Y++Y+GT V+ FG GL YT +++ + +  +    +
Sbjct: 582 AYAEQVPMTDMTLRPSATN--PGRTYKWYSGTPVFEFGFGLHYTTFAFAWAAPGAAADST 639

Query: 680 ASL----KAGSDKNIL---QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
           AS     K+ S   ++   Q++ + LD   +D             + VTN G V   +V 
Sbjct: 640 ASFGGPAKSYSISQLVAHGQESAAFLDLAPLDTFA----------VRVTNTGKVASDYVA 689

Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
           +LF          P+K L+ + R+H +A     +         ++ A+++G + +  G +
Sbjct: 690 LLFVSGSFGPAPHPKKTLVAYTRIHGLAPRGSTVGQLPVTLGAIARADENGEKWVHPGTY 749

Query: 793 VLMV---GELRHSLTI 805
            L +    +L H+ T+
Sbjct: 750 TLALDTDAKLTHTFTL 765


>gi|60729621|pir||JC7966 xylan 1,4-beta-xylosidase (EC 3.2.1.37) - Talaromyces emersonii
 gi|21326570|gb|AAL32053.2|AF439746_1 beta-xylosidase [Rasamsonia emersonii]
          Length = 796

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 281/755 (37%), Positives = 407/755 (53%), Gaps = 62/755 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           CNTS     RA++L+SL TL+E I    + A  +PRLG+P Y+ W+E+LHG+       N
Sbjct: 64  CNTSADPWARAEALVSLFTLEELINNTQNTAPGVPRLGLPQYQVWNEALHGLDR----AN 119

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFP  ++S ASFNR+L + I S +A +ARA  N G+ GL  +APNIN F
Sbjct: 120 FSDSGEYSWATSFPMPILSMASFNRTLINQIASIIATQARAFNNAGRYGLDSYAPNINGF 179

Query: 173 RDPRWGRGQETPGEDPMVVS-AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R P WGRGQETPGED   +S AYA E++   QG                     G   E 
Sbjct: 180 RSPLWGRGQETPGEDAFFLSSAYAYEYITGLQG---------------------GVDPE- 217

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                + + A  KH   YDLE WGN SR   NA+IT+QD  + + P F +     K   +
Sbjct: 218 ----HVKIVATAKHFAGYDLENWGNVSRLGSNAIITQQDLSEYYTPQFLASARYAKTRSL 273

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVP+C      Q   R  + F   GY++SDCDAV  +F    Y      +AA
Sbjct: 274 MCSYNAVNGVPSCSNSFFLQTLLRESFNFVDDGYVSSDCDAVYNVFNPHGYALNQSGAAA 333

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             L AG DI+CG  M  H   +  +  V   DI+++L  L++  +RLG F+G+     Y 
Sbjct: 334 DSLLAGTDIDCGQTMPWHLNESFYERYVSRGDIEKSLTRLYANLVRLGYFDGN--NSVYR 391

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            L  +DV T++   ++ +AA +GI LLKND   LPL+K  V S+A+IGP  N   QM G 
Sbjct: 392 NLNWNDVVTTDAWNISYEAAVEGITLLKNDGT-LPLSKK-VRSIALIGPWANATVQMQGN 449

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G P    S L   +A     +YA G  ++  +S   F EA+  AKK+D +I   G+D 
Sbjct: 450 YYGTPPYLISPLEAAKASGFTVNYAFGT-NISTDSTQWFAEAISAAKKSDVIIYAGGIDN 508

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           T E E +DR  L  PG Q+ L+  +++  K P++++  GGG +D S  +A+  +++++W 
Sbjct: 509 TIEAEGQDRTDLKWPGNQLDLIEQLSKVGK-PLVVLQMGGGQVDSSSLKANKNVNALVWG 567

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
           GYPG++G  AL +I+ G   P GRL  T YP  + T+ P NDMN+R + S   PG++Y +
Sbjct: 568 GYPGQSGGAALFDILTGKRAPAGRLVSTQYPAEYATQFPANDMNLRPNGSN--PGQTYIW 625

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS-RLDYVHIDEV 706
           YTGT VY FGHGL YT +         + + +A     S  +IL    +    Y +I+ V
Sbjct: 626 YTGTPVYEFGHGLFYTEF---------QESAAAGTNKTSTLDILDLVPTPHPGYEYIELV 676

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG-SKE 765
                   +V + V N G     +  +LFA      +  P K L+GFDR+ T+    + +
Sbjct: 677 P-----FLNVTVDVKNVGHTPSPYTGLLFANTTAGPKPYPNKWLVGFDRLATIHPAKTAQ 731

Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
           ++F V P   ++ A+++G +++  G + L +   R
Sbjct: 732 VTFPV-PLGAIARADENGNKVIFPGEYELALNNER 765


>gi|255590044|ref|XP_002535159.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
 gi|223523880|gb|EEF27223.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
          Length = 449

 Score =  456 bits (1173), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 226/453 (49%), Positives = 303/453 (66%), Gaps = 8/453 (1%)

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
           MD+NCG  +  +T+SA++K KV E +IDRAL NLFS+++RLGLFNG+P K  YG +  D 
Sbjct: 1   MDVNCGNYLKNYTKSAVEKKKVSESEIDRALHNLFSIRMRLGLFNGNPTKLPYGDISADQ 60

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           VC+ EH+ +AL+AAR GIVLLKN  + LPL+K+  +SLAIIGP  +N + + G Y G PC
Sbjct: 61  VCSQEHQAVALEAARDGIVLLKNSNQLLPLSKSKTTSLAIIGPNADNSTILVGNYAGPPC 120

Query: 475 SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED 534
              +  +GL+ Y+  T Y  GC  V C+S A   +A++IAK+AD V++V GLD TQE E+
Sbjct: 121 KTVTPFQGLQNYIKTTKYHPGCSTVACSS-AAIDQAIKIAKEADQVVLVMGLDQTQEREE 179

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DRV L+LPG+Q  L+ SVAR +K+PV+LVL  GGP+D+SFA+ D  I  ILW GYPGEA
Sbjct: 180 HDRVDLVLPGKQQELIISVARAAKKPVVLVLLCGGPVDISFAKYDRNIGGILWAGYPGEA 239

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
           G  ALAEIIFG+ NPGGRLP+TWYP+ FTKVPM DM MR   S  YPGR+YRFY G +V+
Sbjct: 240 GGIALAEIIFGNHNPGGRLPVTWYPQDFTKVPMTDMRMRPQPSSGYPGRTYRFYKGKKVF 299

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSL 712
            FG+GLSY+NYSY+ +S         SL++  D+    +  S + Y  I E+    C   
Sbjct: 300 EFGYGLSYSNYSYELVSVTQN---KISLRSSIDQK--AENSSPIGYKTISEIEEELCERS 354

Query: 713 RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
           +F V + V N G++ G H V+LFAR  K   G P K+LI F  V   A  + EI + V+P
Sbjct: 355 KFSVTVRVKNQGEMTGKHPVLLFARQDKPGSGGPIKKLIAFQSVKLNAGENAEIEYKVNP 414

Query: 773 CEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           CE LS AN+ G  ++  G+  L+VG+  + + I
Sbjct: 415 CEHLSRANEDGLMVMEEGSQYLLVGDKEYPINI 447


>gi|238492365|ref|XP_002377419.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|220695913|gb|EED52255.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 775

 Score =  456 bits (1172), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 284/781 (36%), Positives = 424/781 (54%), Gaps = 78/781 (9%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C+     S+P C+TSLSI+ R  SL+  LTL+EKI  L D ++   RLG+P+YEWWSE+ 
Sbjct: 22  CQTQPLCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEAT 81

Query: 104 HGIASNGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           HG+ S  PGV F       S  TSFP  +++AASF+ +L   I   +  E R   N G +
Sbjct: 82  HGVGS-APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRVFGNNGFS 140

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           G  FWAPNIN FRDPRWGRGQETPGEDP+V   Y   FV   QG++ K+           
Sbjct: 141 GFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKN----------- 189

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
           K+V+                A CKH   YDLE      RY  N   T+QD  + F  PF+
Sbjct: 190 KQVI----------------ATCKHYAVYDLET----GRYGNNYNPTQQDLSEYFLAPFK 229

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEY 336
           +C+       IMCSYN V+G+PAC    L  +  R  W F     Y+ SDC AV  I++Y
Sbjct: 230 TCVRDTDVGSIMCSYNSVSGIPACANEYLLDEVLRKHWNFNSDYHYVVSDCGAVTDIWQY 289

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
            N+T T E +A+  L AG+D+ CG+  L+  +S +   +   K +D++L  L+S    +G
Sbjct: 290 HNFTDTEEAAASVALNAGVDLECGSSYLKLNES-LAANQTSVKVMDQSLARLYSALFTVG 348

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK-NAVSSLAII 455
            F+G    GKY KL   DV T + + LA +AA +G+ LLKND   LPL+  +   S+A+I
Sbjct: 349 FFDG----GKYDKLDFSDVSTPDAQALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVI 403

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVR 512
           GP  N  +QM G Y+G       L+  LEA+     K +YA G   +   + +GF EA+ 
Sbjct: 404 GPFANATTQMQGDYSG---DAPYLISPLEAFGDSRWKVNYALGTA-INNQNTSGFEEALA 459

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
            A K+D +I + G+D + E+E  DR SL  PG Q+ L+TS+++ SK P+++V  GGG +D
Sbjct: 460 AANKSDLIIYLGGIDNSLESETLDRTSLAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVD 518

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMN 631
            S    +  I +++W GYP ++G  AL +++ G  +P GRLP+T YP S+  +V + D+N
Sbjct: 519 DSAILKNKDIQALVWAGYPSQSGGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDIN 578

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
           +R   +  YPGR+Y++YTG  V  FG+GL YT + + +     E T++       + NI 
Sbjct: 579 LRP--TDLYPGRTYKWYTGKPVLPFGYGLHYTKFMFDW-----EKTLN------REYNIQ 625

Query: 692 QQTGSRLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEK 748
               S  +     I++ T  T+++  V+    N G     +V +LF           P K
Sbjct: 626 DLVASCRNSSGGPINDNTPLTTVKARVK----NVGHKTSDYVSLLFLSSKNAGPAPRPNK 681

Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG---ELRHSLTI 805
            L+ + R+  +A+GS +++        L+ A+++G  ++  G + + +    EL    T+
Sbjct: 682 SLVSYVRLLNIARGSDQVAELPLTLGSLARADENGSLVIFPGRYKIALDNSEELTFEFTL 741

Query: 806 E 806
           +
Sbjct: 742 K 742


>gi|403412992|emb|CCL99692.1| predicted protein [Fibroporia radiculosa]
          Length = 760

 Score =  454 bits (1169), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 281/758 (37%), Positives = 407/758 (53%), Gaps = 62/758 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS S   RA +LI L TL+EKI    + +  +PRLG+PAY+WW E+LHG+A + PGV 
Sbjct: 34  CDTSASPVARATALIGLFTLEEKINNTGNTSPGVPRLGLPAYQWWQEALHGVAES-PGVI 92

Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F   G  S  TSFPQ ++  A+F+  L + + + V+ EARA  N  ++GL FW PNIN F
Sbjct: 93  FAETGEYSYATSFPQPILMGAAFDDELINQVATIVSTEARAFNNANRSGLDFWTPNINPF 152

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGEDP  + +Y    +   QG          G     KR++        
Sbjct: 153 KDPRWGRGQETPGEDPFHLQSYVYNLITGLQG----------GLDPEYKRIV-------- 194

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                   A CKH   YDLE W    RY F+A+I+ QD  + +   F +C         M
Sbjct: 195 --------ATCKHYAGYDLENWEGNVRYGFDALISIQDLSEFYTRSFETCARDANVGAFM 246

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAA 348
           CSYN VNGVP+C    L Q   R  W +     +ITSDCDA+  I+E   Y  T E + A
Sbjct: 247 CSYNAVNGVPSCANSYLLQDILRGHWNWTSDDQWITSDCDAIQNIYEPHYYAPTRELTVA 306

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
             L AG D++CGT    +  +A D+G   E  +DRAL+  ++  ++LG F  DP + + Y
Sbjct: 307 DALNAGADLDCGTYYPENLGAAYDEGLFAESTLDRALIRQYASLVKLGYF--DPAENQPY 364

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            ++G  +V T E ++LA  AA +GI L+KND   LPL+  ++ SLA+IGP  N  +QM G
Sbjct: 365 RQIGWANVSTPEAEELAYRAAVEGITLIKNDGT-LPLSP-SIKSLALIGPWANATTQMQG 422

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCH-DVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
            Y G P    S L   EA     +Y+ G   D P  S   F  A   A+ AD +I + G+
Sbjct: 423 NYYGQPPYLISPLMAAEALNYTVYYSPGPGVDDPTTSS--FPAAFAAAQAADAIIYIGGI 480

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
           D T E E  DR +L  PG Q   +  +++  K P++++  GGG +D S    ++ +++++
Sbjct: 481 DTTVEAEAMDRYTLDWPGVQPDFIDQLSQFGK-PLVVLQMGGGQVDDSCLLPNTNVNALI 539

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSY 645
           W GYPG++G  AL +II G+  P GRLP T YP  +  +V M DM++R  ++   PGR+Y
Sbjct: 540 WGGYPGQSGGTALMDIIVGNAAPAGRLPTTQYPLDYVYQVAMTDMSLRPSATN--PGRTY 597

Query: 646 RFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
            +YTGT +  FG GL YTN+S + LS PS  +   +   G+ + +     + LD    + 
Sbjct: 598 MWYTGTPIVEFGFGLHYTNFSAE-LSQPSAPSYDIASLVGACEGV-----AHLDLCAFES 651

Query: 706 VTSCTSLRFHVQISVTNAGD-VDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK 764
            T          ++VTN G  V   +V +LF          P K L  +DR+HT+A  S 
Sbjct: 652 YT----------VNVTNIGSKVTSDYVALLFVAGEHGPAPIPNKVLAAYDRLHTIAPLSS 701

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHS 802
           + +        LS  +++G R+L  G + L++  L  +
Sbjct: 702 QQATLNLTLGSLSRVDEYGNRVLYPGEYTLILDVLPQA 739


>gi|451992719|gb|EMD85198.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
           C5]
          Length = 781

 Score =  454 bits (1167), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 277/770 (35%), Positives = 401/770 (52%), Gaps = 59/770 (7%)

Query: 36  VLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
           V  + +FP CK     +   C+ S S   RAKSL++L TL+EKI   S++A  + RLG+P
Sbjct: 17  VFAQGNFPDCKNGPLKNETICDPSASTLARAKSLVALYTLEEKINATSNSAPGVARLGVP 76

Query: 95  AYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR 152
            Y+WW+E LHGIA  GP  +F   G  S  TSFPQ ++  A+F+  L + +   ++ EAR
Sbjct: 77  PYQWWNEGLHGIA--GPFTSFAKQGDYSYSTSFPQPILMGAAFDDDLITEVAKVISTEAR 134

Query: 153 AMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDG 212
           A  N  + GL FW PNIN FRDPRWGRGQETPGED   +S+Y    +   QG    +D  
Sbjct: 135 AFNNANRTGLDFWTPNINPFRDPRWGRGQETPGEDSYHLSSYVKALIHGLQGN--ATDP- 191

Query: 213 GIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTE 272
                   +RV+                A CKH   YD+E W    RY  +  I++QD  
Sbjct: 192 -------YRRVV----------------ATCKHYAGYDIENWNGNLRYQNDVQISQQDLV 228

Query: 273 DTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCD 328
           + +  PF +C+ Q      MCSYN VNG P C    L Q   R  WG+     ++TSDCD
Sbjct: 229 EYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYLLQTVLREHWGWSSDDHWVTSDCD 287

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           A+  ++    ++ T E +AA  L AG D++CGT +  H   A+ +G   E  +D+AL+  
Sbjct: 288 AIQNVYLPHQWSSTREGAAADSLNAGTDLDCGTYLQTHLPGAVKQGLTDETTLDKALIRQ 347

Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
           +S  ++LG F+  P    Y +LG D V TS  + LAL AA +GIVLLKND   LP+N  +
Sbjct: 348 YSSLIKLGYFDA-PENQPYRQLGFDAVATSASQALALKAAEEGIVLLKNDG-VLPINLGS 405

Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFH 508
              + I G   N  SQ+ G Y G+     S L  L+       YA        +   G  
Sbjct: 406 -KQVGIYGDWANATSQLQGNYFGVAKFLTSPLMALQNLGVDVKYAGNLPGGQGDPTTGAW 464

Query: 509 EAVR-IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
            ++  +   +D  I V G+D   E+EDRDR  L L G Q+ ++  +A T K PVI+V+ G
Sbjct: 465 SSLSGVITTSDVHIWVGGIDNGVESEDRDRSWLTLTGGQLDVIGQLADTGK-PVIVVIMG 523

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVP 626
           GG +D S    + +IS++LW GYPG+ G  A+  I+ G   P GRLP T YP  + ++VP
Sbjct: 524 GGQIDTSPLIRNPKISAVLWAGYPGQDGGTAIVNILTGKAAPAGRLPQTQYPSKYVSEVP 583

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
           M DM MR   S + PGR+Y++YTG  ++ FG+GL YTN+S    + P +    + L  G 
Sbjct: 584 MTDMAMRP--SDKNPGRTYKWYTGEPIFEFGYGLHYTNFSASITNQPKQSYAISDLVKGC 641

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
           +      TG  L+      +T          +SV N G +   +V + F       +  P
Sbjct: 642 NS-----TGGFLERCPFTGIT----------VSVQNTGKISSDYVTLGFLTGSFGPKPYP 686

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           +K L+ +DR+  +A GS   +        L+  ++ G ++L  G++ L +
Sbjct: 687 KKSLVAYDRLFNIAAGSSSTATLNLTLASLARVDESGNKVLYPGDYELQI 736


>gi|238508313|ref|XP_002385353.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
 gi|296439537|sp|B8NYD8.1|BXLB_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|220688872|gb|EED45224.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
          Length = 776

 Score =  454 bits (1167), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 275/752 (36%), Positives = 413/752 (54%), Gaps = 65/752 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TSL   +RAKSL++ +TL+EKI     ++S  PRLG+PAY WW+E+LHG+A  G GV+
Sbjct: 41  CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 99

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFP  ++  A+F+  L   + + ++ EARA  N G AGL +W PNIN F
Sbjct: 100 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 159

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRGQETPGEDP+ +S Y    V   Q                      G G E  
Sbjct: 160 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQ---------------------DGIGPERP 198

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
           +     + A CKH  AYDLE W    RY+F+A+++ QD  + + P F++C    K   +M
Sbjct: 199 K-----VVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVM 253

Query: 293 CSYNQVNGVPACLRGDLFQK-ARNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAA 348
           CSYN +NG+P C    L Q   R  WG++    ++T DC A+  I+   +Y      +AA
Sbjct: 254 CSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAA 313

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
             L AG D++CG+    + +SA+ +G    + ++ AL+ L+S  ++LG F  DP   + Y
Sbjct: 314 AALNAGTDLDCGSVFPEYLRSALQQGLYNNQTLNNALIRLYSSLVKLGYF--DPADDQPY 371

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             +G ++V T   ++LA  A  +GIV+LKND   LPL  N   ++AIIGP  N  +Q+ G
Sbjct: 372 RSIGWNEVFTPAAEELAHKATVEGIVMLKNDGT-LPLKSNG--TVAIIGPFANATTQLQG 428

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G P   ++L+        K  ++ G  D+  NS AGF EA+  AK+AD VI   G+D
Sbjct: 429 NYEGPPKYIRTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGID 487

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T E E +DR +++ PG Q+ L+  ++   K P+I+V  GGG +D S   A++ + ++LW
Sbjct: 488 NTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLW 546

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
            GYP +AG  A+ +I+ G   P GRLP+T YP S+  +VPM DM +R  S+   PGR+YR
Sbjct: 547 AGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNN--PGRTYR 604

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +Y    V  FG GL YT ++  +  A      + S+ +G+       T + +D       
Sbjct: 605 WYD-KAVLPFGFGLHYTTFNVSWNHAEYGPYNTDSVASGT-------TNAPVD------- 649

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG-SK 764
              T L     I+VTN G+V   ++ +LF    +V  +  P K L+G+ R   +  G S+
Sbjct: 650 ---TELFDTFSITVTNTGNVASDYIALLFLTADRVGPEPYPIKTLVGYSRAKGIEPGQSQ 706

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           ++   V     ++   ++G  +L  G++ L V
Sbjct: 707 QVKLDVS-VGSVARTAENGDLVLYPGSYKLEV 737


>gi|115436902|ref|XP_001217674.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|121734342|sp|Q0CB82.1|BXLB_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|114188489|gb|EAU30189.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 765

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 271/759 (35%), Positives = 400/759 (52%), Gaps = 78/759 (10%)

Query: 18  VLQLIVVVNVIAFSNSKPVLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQE 76
           +L  IV +  + F+ S        FP C+         C+T+L   TRA++L++ +TL+E
Sbjct: 12  ILACIVSLTQLGFAQSP-------FPDCENGPLSKNAVCDTTLDPVTRAQALLAAMTLEE 64

Query: 77  KIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAAS 134
           KI     N+  +PRLG+PAY WWSE+LHG+A + PGV+F  +G  S  TSFP  +   A+
Sbjct: 65  KINNTQYNSPGVPRLGLPAYNWWSEALHGVAGS-PGVHFADSGNFSYATSFPSPITLGAA 123

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
           F+  L   I + +  E RA  N G AGL +W PNIN +RDPRWGRGQETPGEDP   S Y
Sbjct: 124 FDDDLVKQIATVIGTEGRAFGNAGHAGLDYWTPNINPYRDPRWGRGQETPGEDPFHTSRY 183

Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
               +   Q                      G G E  +     + A CKH   YD+E W
Sbjct: 184 VYHLIDGLQ---------------------DGIGPEKPK-----IVATCKHFAGYDIEDW 217

Query: 255 GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA- 313
               RY+F+A+I++QD  + + PPF++C    K   +MCSYN VNG+P C    L Q   
Sbjct: 218 EGNERYAFDAVISDQDMAEYYFPPFKTCTRDAKVDAVMCSYNSVNGIPTCADPWLLQTVL 277

Query: 314 RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSA 370
           R  W ++G   ++TSDC A+  I++   Y      +AA  + AG D++CG+   +   SA
Sbjct: 278 REHWEWEGVGHWVTSDCGAIDNIYKDHKYVADGAHAAAVAVNAGTDLDCGSVYPQFLGSA 337

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAAR 429
           I +G +  + +DRAL  L+S  ++LG F  DP   + Y  +G  DV T + ++LA  AA 
Sbjct: 338 ISQGLLGNRTLDRALTRLYSSLVKLGYF--DPAADQPYRSIGWSDVATPDAEQLAHTAAV 395

Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK 489
           +G VLLKND   LPL KN   ++AI+GP  N  +Q+ G Y G      ++L        K
Sbjct: 396 EGTVLLKNDGT-LPLKKNG--TVAIVGPYANATTQLQGNYEGTAKYIHTMLSAAAQQGYK 452

Query: 490 THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSL 549
             YA G   +  NS +GF +A+  AK +D VI   G+D   E E  DR S+  PG Q+ L
Sbjct: 453 VKYAPGT-GINSNSTSGFEQALNAAKGSDLVIYFGGIDHEVEAEALDRTSIAWPGNQLDL 511

Query: 550 VTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
           +  ++   K+P+++V  GGG +D S   +++ ++ +LW GYP +AG  A+ +I+ G   P
Sbjct: 512 IQQLSDL-KKPLVVVQFGGGQVDDSSLLSNAGVNGLLWAGYPSQAGGAAVFDILTGKTAP 570

Query: 610 GGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYK 668
            GRLP+T YPE +  +VPM DMN+R   S   PGR+YR+Y    V  FG+G+ YT +   
Sbjct: 571 AGRLPVTQYPEEYVDQVPMTDMNLRPGPSN--PGRTYRWYD-KAVIPFGYGMHYTTFDVS 627

Query: 669 FLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDG 728
           +         +A++KA   +N + +T S                     + V N G V  
Sbjct: 628 WKRKNYGPYNTAAVKA---ENAVLETFS---------------------LQVKNTGKVTS 663

Query: 729 SHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKGSKEI 766
            +V ++F        +  P K L+G+ RV  +  G +++
Sbjct: 664 DYVALVFLTTTDAGPKPYPIKTLVGYQRVKAIRPGERKV 702


>gi|121797681|sp|Q2TYT2.1|BXLB_ASPOR RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|83775471|dbj|BAE65591.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 797

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 275/752 (36%), Positives = 411/752 (54%), Gaps = 65/752 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TSL   +RAKSL++ +TL+EKI     ++S  PRLG+PAY WW+E+LHG+A  G GV+
Sbjct: 62  CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 120

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFP  ++  A+F+  L   + + ++ EARA  N G AGL +W PNIN F
Sbjct: 121 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 180

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRGQETPGEDP+ +S Y    V   Q                      G G E  
Sbjct: 181 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQ---------------------DGIGPERP 219

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
           +     + A CKH  AYDLE W    RY+F+A+++ QD  + + P F++C    K   +M
Sbjct: 220 K-----VVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVM 274

Query: 293 CSYNQVNGVPACLRGDLFQK-ARNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAA 348
           CSYN +NG+P C    L Q   R  WG++    ++T DC A+  I+   +Y      +AA
Sbjct: 275 CSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAA 334

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
             L AG D++CG+    +  SA+ +G    + ++ AL+ L+S  ++LG F  DP   + Y
Sbjct: 335 AALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYF--DPADDQPY 392

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             +G ++V T   ++LA  A  +GIV+LKND   LPL  N   ++AIIGP  N  +Q+ G
Sbjct: 393 RSIGWNEVFTPAAEELAHKATVEGIVMLKNDGT-LPLKSNG--TVAIIGPFANATTQLQG 449

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G P   ++L+        K  ++ G  D+  NS AGF EA+  AK+AD VI   G+D
Sbjct: 450 NYEGPPKYIRTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGID 508

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T E E +DR +++ PG Q+ L+  ++   K P+I+V  GGG +D S   A++ + ++LW
Sbjct: 509 NTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLW 567

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
            GYP +AG  A+ +I+ G   P GRLP+T YP S+  +VPM DM +R  S+   PGR+YR
Sbjct: 568 AGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNN--PGRTYR 625

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +Y    V  FG GL YT ++  +  A      + S+ +G+       T + +D       
Sbjct: 626 WYD-KAVLPFGFGLHYTTFNVSWNHAEYGPYNTDSVASGT-------TNAPVD------- 670

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG-SK 764
              T L     I+VTN G+V   ++ +LF     V  +  P K L+G+ R   +  G S+
Sbjct: 671 ---TELFDTFSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQ 727

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           ++   V     ++   ++G  +L  G++ L V
Sbjct: 728 QVKLDVS-VGSVARTAENGDLVLYPGSYKLEV 758


>gi|317158006|ref|XP_001826724.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
          Length = 776

 Score =  451 bits (1160), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 275/752 (36%), Positives = 411/752 (54%), Gaps = 65/752 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TSL   +RAKSL++ +TL+EKI     ++S  PRLG+PAY WW+E+LHG+A  G GV+
Sbjct: 41  CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 99

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFP  ++  A+F+  L   + + ++ EARA  N G AGL +W PNIN F
Sbjct: 100 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 159

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRGQETPGEDP+ +S Y    V   Q                      G G E  
Sbjct: 160 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQ---------------------DGIGPERP 198

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
           +     + A CKH  AYDLE W    RY+F+A+++ QD  + + P F++C    K   +M
Sbjct: 199 K-----VVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVM 253

Query: 293 CSYNQVNGVPACLRGDLFQK-ARNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAA 348
           CSYN +NG+P C    L Q   R  WG++    ++T DC A+  I+   +Y      +AA
Sbjct: 254 CSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAA 313

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
             L AG D++CG+    +  SA+ +G    + ++ AL+ L+S  ++LG F  DP   + Y
Sbjct: 314 AALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLNNALIRLYSSLVKLGYF--DPADDQPY 371

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             +G ++V T   ++LA  A  +GIV+LKND   LPL  N   ++AIIGP  N  +Q+ G
Sbjct: 372 RSIGWNEVFTPAAEELAHKATVEGIVMLKNDGT-LPLKSNG--TVAIIGPFANATTQLQG 428

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G P   ++L+        K  ++ G  D+  NS AGF EA+  AK+AD VI   G+D
Sbjct: 429 NYEGPPKYIRTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGID 487

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T E E +DR +++ PG Q+ L+  ++   K P+I+V  GGG +D S   A++ + ++LW
Sbjct: 488 NTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLW 546

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
            GYP +AG  A+ +I+ G   P GRLP+T YP S+  +VPM DM +R  S+   PGR+YR
Sbjct: 547 AGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNN--PGRTYR 604

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +Y    V  FG GL YT ++  +  A      + S+ +G+       T + +D       
Sbjct: 605 WYD-KAVLPFGFGLHYTTFNVSWNHAEYGPYNTDSVASGT-------TNAPVD------- 649

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG-SK 764
              T L     I+VTN G+V   ++ +LF     V  +  P K L+G+ R   +  G S+
Sbjct: 650 ---TELFDTFSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQ 706

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           ++   V     ++   ++G  +L  G++ L V
Sbjct: 707 QVKLDVS-VGSVARTAENGDLVLYPGSYKLEV 737


>gi|242813865|ref|XP_002486253.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
 gi|218714592|gb|EED14015.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
          Length = 893

 Score =  450 bits (1157), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 280/766 (36%), Positives = 409/766 (53%), Gaps = 62/766 (8%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           S P C+TSL   TRAK L+  +T +EK+Q   + +    RLG+PAY+WW+E+LHG+A + 
Sbjct: 161 SNPICDTSLDPLTRAKGLVDAMTFEEKVQNTQNGSPGAARLGLPAYQWWNEALHGVAGS- 219

Query: 111 PGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPN 168
           PGV F  +G  S  TSFPQ ++ +A+F+ +L   +G+ V++E RA  N G AGL FW PN
Sbjct: 220 PGVTFQPSGNFSYATSFPQPILMSAAFDDALIKEVGTVVSIEGRAFNNYGNAGLDFWTPN 279

Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
           IN FRDPRWGRGQETPGEDP  ++ Y    V   Q           G      RV+    
Sbjct: 280 INPFRDPRWGRGQETPGEDPYHIARYVYNLVDGLQN----------GIAPANPRVV---- 325

Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
                       A CKH   YD+E W   SRY FNA+I+ QD  + + PPF+SC    + 
Sbjct: 326 ------------ATCKHFAGYDIEDWEGNSRYGFNAIISTQDLSEYYLPPFKSCARDAQV 373

Query: 289 SCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHE 344
             IMCSYN VNG+P C    L     R+ W +     ++TSDCDAV  I+    YT +  
Sbjct: 374 DAIMCSYNAVNGIPTCADSYLLDTILRDHWNWNQTGHWVTSDCDAVDNIYSDHRYTSSLA 433

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  L AG +++CGT M  +  +A  +   +   ++ AL+ L+S  +RLG F  D   
Sbjct: 434 AAAADALNAGTNLDCGTTMSNNLAAAAAQDLFKNATLNSALVYLYSSLVRLGWF--DSED 491

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND-KKFLPLNKNAVSSLAIIGPLVNNIS 463
            +Y  LG  DV T+  ++LA  AA +GIVLLKND KK LPL+++   ++A+IGP  N  +
Sbjct: 492 SQYSSLGWSDVGTTASQQLANRAAVEGIVLLKNDHKKVLPLSQHG-QTIALIGPYANATT 550

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
           Q+ G Y G P   ++L+ G E       Y +G   +     +GF  AV  AK AD VI  
Sbjct: 551 QLQGNYYGTPAYIRTLVWGAEQMGYTVQYEAGT-GINSTDTSGFAAAVAAAKTADIVIYA 609

Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
            G+D + E E  DR ++   G Q+ L+  +++  K P++++  GGG LD S    +  ++
Sbjct: 610 GGIDNSIEAEAMDRNTIAWTGNQLQLIDQLSQVGK-PLVVLQFGGGQLDDSALLQNENVN 668

Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPG 642
           ++LW GYP + G +A+ +I+ G   P GRLP+T YP ++T  +PM DM++R + S   PG
Sbjct: 669 ALLWCGYPSQTGGQAVFDILTGQSAPAGRLPVTQYPANYTNAIPMTDMSLRPNGST--PG 726

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
           R+YR+Y    V  FG GL YT +   +         +ASL A + K+  Q T +  D  H
Sbjct: 727 RTYRWYDDA-VIPFGFGLHYTTFDASWADKKFGPYNTASLVAKASKSKYQDT-APFDSFH 784

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAK 761
           ++               V N G V    V +LFA       +  P K LI + R  ++  
Sbjct: 785 VN---------------VKNTGKVTSDFVALLFASTDNAGPKPYPIKTLISYARASSIKP 829

Query: 762 G-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           G ++ +S  V     ++    +G  +L  G++ L +   +H  T E
Sbjct: 830 GETRTVSIDVT-IGSIARTATNGDLVLYPGSYTLQLDVGQHYPTAE 874


>gi|85813774|emb|CAJ65923.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 704

 Score =  449 bits (1156), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 232/486 (47%), Positives = 317/486 (65%), Gaps = 18/486 (3%)

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRAL 385
           DCDAV  +   Q Y KT ED+ A  LK+G+     + +  +T+SA++K KV   +IDRAL
Sbjct: 229 DCDAVNVLHVEQKYAKTPEDAVADALKSGI-----SYLRNYTKSAVEKKKVTVSEIDRAL 283

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
            NLFS ++RLGLFNGDP K  Y  +GPD VC+ EH+ LAL+AA  GIVLLKN  + LPL+
Sbjct: 284 HNLFSTRMRLGLFNGDPTKQLYSDIGPDQVCSQEHQALALEAALDGIVLLKNADRLLPLS 343

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
           K+ +SSLA+IGP  +N + + G Y G  C   ++L GL  YVS   Y  GC++V C S A
Sbjct: 344 KSGISSLAVIGPNAHNSTNLLGNYFGPACKNVTILEGLRNYVSSASYEKGCNNVSCTS-A 402

Query: 506 GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
              + V +A+  D VI+V GLD +QE E  DR+ L+LPG+Q +L+T+VA+ +KRP++LVL
Sbjct: 403 AKKKPVEMAQTEDQVILVMGLDQSQEKERLDRMDLVLPGKQPTLITAVAKAAKRPIVLVL 462

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP---GGRLPMTWYPESF 622
            GG P+DV+FA+ + +I SILW GYPG+AGA ALA+IIFG+ NP   GGRLPMTWYP+ F
Sbjct: 463 LGGSPMDVTFAKNNRKIGSILWAGYPGQAGATALAQIIFGEHNPGNAGGRLPMTWYPQDF 522

Query: 623 TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISAS 681
           TKVPM DM MR   S   PGR+YRFY G +V+ FG+GLSY++YSY F S A ++L +  S
Sbjct: 523 TKVPMTDMRMRPQPSTGNPGRTYRFYEGEKVFEFGYGLSYSDYSYTFASVAQNQLNVKDS 582

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
                  N   +      Y  + ++    C +++F V +SV N G + G H V+LFAR  
Sbjct: 583 ------SNQQPENSETPGYKLVSDIGEEQCENIKFKVTVSVKNEGQMAGKHPVLLFARHA 636

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
           K  +G P K+L+GF  V   A    EI + + PCE LS AN+ G  ++  G+ +L+VG+ 
Sbjct: 637 KPGKGRPIKKLVGFQTVKLGAGEKTEIEYELSPCEHLSSANEDGVMVMEEGSQILLVGDK 696

Query: 800 RHSLTI 805
            H +TI
Sbjct: 697 EHPVTI 702



 Score =  223 bits (569), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 115/224 (51%), Positives = 140/224 (62%), Gaps = 28/224 (12%)

Query: 38  NKPDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPA 95
           ++P + C    P   +Y FC T+L IS RA+ L+S LT +EK  QL D + AIPRLGIPA
Sbjct: 26  SQPPYSCDSSDPSTKTYDFCKTTLPISRRAEDLVSRLTFEEKATQLVDTSPAIPRLGIPA 85

Query: 96  YEWWSESLHGIA---SNGPGVNF-NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEA 151
           YEWWSE LHGI        G++F N T+   TSFPQV+++AASF+  +W  IG  V  EA
Sbjct: 86  YEWWSEGLHGIGFLTRVQQGISFFNRTIQHATSFPQVILTAASFDAHIWYRIGQ-VGKEA 144

Query: 152 RAMYNLGQ-AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSD 210
           RA+YN GQ  GL FWAPN+NIFRDPRWGRGQETPGEDP+VV  Y   FV+  QG++    
Sbjct: 145 RALYNAGQVTGLGFWAPNVNIFRDPRWGRGQETPGEDPLVVGKYGASFVRGVQGDS---- 200

Query: 211 DGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
                           F  ES  GD L  SACCKH  A+DL+ W
Sbjct: 201 ----------------FEGESTLGDHLQASACCKHYTAHDLDNW 228


>gi|391864313|gb|EIT73609.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 797

 Score =  449 bits (1156), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 275/752 (36%), Positives = 410/752 (54%), Gaps = 65/752 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TSL   +RAKSL++ +TL+EKI     ++S  PRLG+PAY WW+E+LHG+A  G GV+
Sbjct: 62  CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 120

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFP  ++  A+F+  L   + + ++ EARA  N G AGL +W PNIN F
Sbjct: 121 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 180

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRGQETPGEDP+ +S Y    V   Q                      G G E  
Sbjct: 181 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQ---------------------DGIGPERP 219

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
           +     + A CKH  AYDLE W    RY+F+A+++ QD  + + P F++C    K   +M
Sbjct: 220 K-----VVATCKHFAAYDLENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVM 274

Query: 293 CSYNQVNGVPACLRGDLFQK-ARNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAA 348
           CSYN +NG+P C    L Q   R  WG++    ++T DC A+  I+   +Y      +AA
Sbjct: 275 CSYNSLNGIPTCADRWLLQTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVADGAHAAA 334

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
             L AG D++CG+    +  SA+ +G    + +  AL+ L+S  ++LG F  DP   + Y
Sbjct: 335 AALNAGTDLDCGSVFPEYLGSALQQGLYNNQTLYNALIRLYSSLVKLGYF--DPADDQPY 392

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             +G ++V T   ++LA  A  +GIV+LKND   LPL  N   ++AIIGP  N  +Q+ G
Sbjct: 393 RSIGWNEVFTPAAEELAHKATVEGIVMLKNDGT-LPLKSNG--TVAIIGPFANATTQLQG 449

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G P   ++L+        K  ++ G  D+  NS AGF EA+  AK+AD VI   G+D
Sbjct: 450 NYEGPPKYIRTLIWAAVHNGYKVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGID 508

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T E E +DR +++ PG Q+ L+  ++   K P+I+V  GGG +D S   A++ + ++LW
Sbjct: 509 NTIEKESQDRTTIVWPGNQLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLW 567

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
            GYP +AG  A+ +I+ G   P GRLP+T YP S+  +VPM DM +R  S+   PGR+YR
Sbjct: 568 AGYPSQAGGAAVFDILTGKSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSNN--PGRTYR 625

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +Y    V  FG GL YT ++  +  A      + S+ +G+       T + +D       
Sbjct: 626 WYD-KAVLPFGFGLHYTTFNVSWNHAEYGPYNTDSVASGT-------TNAPVD------- 670

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG-SK 764
              T L     I+VTN G+V   ++ +LF     V  +  P K L+G+ R   +  G S+
Sbjct: 671 ---TELFDTFSITVTNTGNVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQ 727

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           ++   V     ++   ++G  +L  G++ L V
Sbjct: 728 QVKLDVS-VGSVARTAENGDLVLYPGSYKLEV 758


>gi|332982588|ref|YP_004464029.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
 gi|332700266|gb|AEE97207.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
           50-1 BON]
          Length = 714

 Score =  449 bits (1154), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 282/774 (36%), Positives = 413/774 (53%), Gaps = 113/774 (14%)

Query: 48  HFDSYP-FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
           H +  P + + SLS   RAK L+S +TL EKI Q+  +A AIPRL IPAY WW+E LHG+
Sbjct: 6   HEEKLPAYKDVSLSFEDRAKDLVSRMTLPEKISQMIYDAPAIPRLDIPAYNWWNECLHGV 65

Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG-------- 158
           A  G            T FPQ +  AA+FN  L   +  A++ EARA ++          
Sbjct: 66  ARAG----------IATVFPQAIAMAATFNPELIHKVAEAISDEARAKHHEAVRNGDRGI 115

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
             GLTFW+PNINIFRDPRWGRG ET GEDP + S   V FVK  QG++ K          
Sbjct: 116 YKGLTFWSPNINIFRDPRWGRGHETYGEDPYLTSRMGVAFVKGLQGDDPKY--------- 166

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                             L + A  KH   Y +       R+SF+A ++++D  +T+ P 
Sbjct: 167 ------------------LKVVATPKH---YAVHSGPESQRHSFDARVSQKDLRETYLPA 205

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
           F  C+++GKA  IM +YN+ NG P C    L +   R+EWGF GY+ SDC A+  I  + 
Sbjct: 206 FEECVKEGKAVSIMGAYNRTNGEPCCASKTLLKDILRDEWGFDGYVVSDCGAIDDIHMHH 265

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
             TKT  +SAA  +  G ++NCG     +   A+++G + E+ ID+A++ LF+ ++RLG+
Sbjct: 266 KVTKTAAESAALAVNNGCELNCGKTY-EYLCQAVEQGLISEETIDQAVIKLFTARMRLGM 324

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           F+  P   +Y  +  D   + EH++LAL+ ARQ IVLLKND+  LPL+K  + ++A+IGP
Sbjct: 325 FD-PPEMVRYAHIPYDVNDSPEHRELALETARQSIVLLKNDENILPLSKK-LKTIAVIGP 382

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIA 514
             +++  +   Y G P    + L G++  VS   K  YA GC +V  NS  GF EAV IA
Sbjct: 383 NADDLDVLLANYFGTPSKYVTPLEGIKNKVSPDTKVLYAKGC-EVTGNSVDGFDEAVNIA 441

Query: 515 KKADFVIVVAGLDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVL 565
           + AD VI+  GL    E E+ D         R+ + LPG Q  L+ ++  T K P++LVL
Sbjct: 442 EMADIVIMCLGLSPRIEGEEGDVADSDGGGDRLHIDLPGMQEQLLETIYGTGK-PIVLVL 500

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
             G  + +++A     + +I+   YPGE G  A+A+++FGD+NP GRLP+T+        
Sbjct: 501 LNGSAIAINWAH--EHVPAIIEAWYPGEEGGTAIADVLFGDYNPAGRLPITFVRSLDDLP 558

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLKA 684
           P  D NM+        GR+YR++    +Y FG+GLSYT++ Y      S L +SA  L A
Sbjct: 559 PFTDYNMK--------GRTYRYFEKEPLYPFGYGLSYTSFKY------SNLRLSAMRLPA 604

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
           G++ +I                           + V N G + G  VV L+    + S  
Sbjct: 605 GNNLDI--------------------------NVDVENTGKLAGREVVQLYISDVEASVE 638

Query: 745 TPEKQLIGFDRVHTVAKGSKE-ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            P +QL G   + T+  G K+ +SF V+P + +S+ +  G+RIL  G  ++ VG
Sbjct: 639 VPMRQLCGIQCI-TLEPGQKQTVSFTVEP-QHMSLFDYDGKRILEPGQFIIAVG 690


>gi|212531051|ref|XP_002145682.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
 gi|210071046|gb|EEA25135.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
          Length = 799

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 272/759 (35%), Positives = 409/759 (53%), Gaps = 74/759 (9%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS +   RA+ LI+L TL+E I    ++   +PRLG+P YE WSE LHG+      V 
Sbjct: 64  CDTSANYVDRAEGLIALFTLEELINNTQNSGPGVPRLGLPPYEVWSEGLHGL-DRAHFVK 122

Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
                +  TSFP  ++S A+ NR+L + I S +A +ARA  N+G+ GL  +APNIN FR 
Sbjct: 123 SGDEWTWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNINGFRS 182

Query: 175 PRWGRGQETPGEDP-MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           P WGRGQETPGED   + S+YA E++   Q        GGI                   
Sbjct: 183 PLWGRGQETPGEDANFLTSSYAYEYITGLQ--------GGI------------------D 216

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
            D L ++A  KH   YDLE WG  SR  F+A IT+QD  + + P F +     KA   MC
Sbjct: 217 PDNLKIAATAKHFAGYDLENWGGNSRLGFDARITQQDLAEYYTPQFLAASRYAKARSFMC 276

Query: 294 SYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
           SYN VN +P+C    L Q   R +W F   GY++SDCDAV  +F    Y      +AA  
Sbjct: 277 SYNSVNAIPSCSSSFLLQTLLREQWDFPEYGYVSSDCDAVYNVFNPHGYASNQSSAAAES 336

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
           L+AG DI+CG     H   +  +G V   +I+R++L L+S  ++LG F+GD  K +Y +L
Sbjct: 337 LRAGTDIDCGQTYSWHLNQSFIEGSVTRGEIERSILRLYSNLVKLGYFDGD--KNEYRQL 394

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
           G +DV T++   ++ +AA +GIVLLKND   LPL+KN V S+A++GP  N   Q+ G Y 
Sbjct: 395 GWNDVVTTDAWNISYEAAVEGIVLLKND-GVLPLSKN-VKSVALVGPWANATKQLQGNYF 452

Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
           G      + L+G      K +YA G + +  N+  GF  A+  AKK+D ++ + G+D T 
Sbjct: 453 GTAPYLITPLQGASDAGYKVNYALGTN-ISGNTTDGFANALSAAKKSDVIVYLGGIDNTI 511

Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
           E E  DR+++  P  Q+ L+  +++T K P++++  GGG +D S  +++S++++++W GY
Sbjct: 512 EAEGTDRMNVTWPRNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKSNSKVNALIWGGY 570

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYT 649
           PG++G KA+ +I+ G   P GRL  T YP  + T+ P  DM++R D  +  PG++Y +Y 
Sbjct: 571 PGQSGGKAIFDILKGKRAPAGRLVSTQYPAEYATQFPATDMSLRPD-GKSNPGQTYMWYI 629

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS- 708
           G  VY FG+GL YT +                      K   ++ GS      I E+ S 
Sbjct: 630 GKPVYEFGYGLFYTTF----------------------KETAKKLGSSSSSFDISEIVSS 667

Query: 709 --CTSLRF-------HVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHT 758
               S  +       +V  ++ N G     +  MLFA          P K L+G+DR+ +
Sbjct: 668 PRSPSYEYSELVPFLNVTATIKNTGKTASPYTAMLFANTTNAGPAPYPNKWLVGYDRLPS 727

Query: 759 VAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           +  G S ++   V P   ++  +K+G RI+  G++ L +
Sbjct: 728 IEPGKSADLVIPV-PIGAIARVDKNGNRIVYPGDYQLTL 765


>gi|348604625|dbj|BAK96214.1| beta-xylosidase [Acremonium cellulolyticus]
          Length = 797

 Score =  447 bits (1150), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 273/764 (35%), Positives = 411/764 (53%), Gaps = 78/764 (10%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS +   RA+ LI+L TL+E I    + A  +PRLG+P Y+ WSE+LHG+       N
Sbjct: 64  CDTSANYVDRAEGLIALFTLEELINNTQNTAPGVPRLGLPPYQVWSEALHGLDR----AN 119

Query: 115 FNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F  +    +  TSFP  ++S A+ NR+L + I   +  +ARA  N G+ GL  +APNIN 
Sbjct: 120 FATSGDEWTWATSFPMPILSMAALNRTLINQIAGIIGTQARAFNNAGRYGLDAYAPNING 179

Query: 172 FRDPRWGRGQETPGEDP-MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           FR P WGRGQETPGED   + S+YA E++   Q        GG+                
Sbjct: 180 FRSPLWGRGQETPGEDANFLSSSYAYEYITGLQ--------GGV---------------- 215

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
               D L + A  KH   YDLE WG  SR  F+A IT+QD  + + P F +     KA  
Sbjct: 216 --DPDHLKVVATAKHFAGYDLENWGGNSRLGFDASITQQDLAEYYTPQFLAASRYAKARS 273

Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHEDSA 347
            MCSYN VNGVP+C    L Q   R+ W F   GY++SDCDAV  +F    Y      +A
Sbjct: 274 FMCSYNSVNGVPSCSSSFLLQTLLRDNWDFPEYGYVSSDCDAVYNVFNPHGYASNQSAAA 333

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  L+AG DI+CG     +   +  +G V   +I+R+++ L+S  ++LG F+GD  K +Y
Sbjct: 334 ADSLRAGTDIDCGQTYPWNLNQSFIEGSVTRGEIERSIVRLYSNLVKLGYFDGD--KSEY 391

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            +LG +DV T++   ++ +AA +GIVLLKND   LPL+K+ V S+A+IGP  N   Q+ G
Sbjct: 392 RQLGWNDVVTTDAWNISYEAAVEGIVLLKND-GILPLSKH-VKSIALIGPWANATEQLQG 449

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G      + L+G      K +YA G + +  N+  GF +A+  AKK+D ++ + G+D
Sbjct: 450 NYYGTAPYLITPLQGASDAGYKVNYALGTN-ILGNTTEGFADALSAAKKSDVIVYLGGID 508

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T E E  DR+++  PG Q+ L+  +++T K P++++  GGG +D S  +A+S++++++W
Sbjct: 509 NTIEAEGTDRMNVTWPGNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKANSKVNALVW 567

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYR 646
            GYPG++G  A+ +I+ G   P GRL  T YP  + T+ P  DMN+R D +   PG++Y 
Sbjct: 568 GGYPGQSGGTAIFDILSGKRVPAGRLVTTQYPAEYATQFPATDMNLRPDGASN-PGQTYM 626

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG-SRLDYVHIDE 705
           +YTGT VY FG+GL YT +                      K   Q+ G S  D   I  
Sbjct: 627 WYTGTPVYDFGYGLFYTTF----------------------KETAQKLGSSSFDISEIVA 664

Query: 706 VTSCTSLRF-------HVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVH 757
                S  +       ++  ++ N G     +  MLFA          P K L+G+DR+ 
Sbjct: 665 APRSPSYEYSELVPFVNITATIKNTGKTASPYTAMLFANTTNAGPAPYPNKWLVGYDRLA 724

Query: 758 TVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
           ++  G S ++   V P   ++  +++G RI+  G++ L +   R
Sbjct: 725 SIEPGKSADLVIPV-PIGAIARVDENGNRIVYPGDYQLALNVER 767


>gi|443893988|dbj|GAC71176.1| hypothetical protein PANT_1d00031 [Pseudozyma antarctica T-34]
          Length = 759

 Score =  447 bits (1150), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 269/769 (34%), Positives = 414/769 (53%), Gaps = 78/769 (10%)

Query: 42  FP-CKP-PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           FP C P     +   C+TSL   TRA SL++  T QE I    + A  +PRLGIP Y+WW
Sbjct: 21  FPDCSPGTPLSANAVCDTSLDYWTRATSLVAEFTTQELINNTINTAPGVPRLGIPPYQWW 80

Query: 100 SESLHGIASNGPGVNFNGTVS----SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           +E+LHG+A + PGVNF   V     S T+FPQ++   A+F+ +L+  + + +A E RA  
Sbjct: 81  TEALHGVAGS-PGVNFADDVEAPYGSATNFPQIINLGATFDDALYEQVATHIANETRAFN 139

Query: 156 NLGQAGLTFWAP-NINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
           N G+AGL  ++P NIN FRDPRWGRGQET GEDP+ +S YAV+ V+  QG N        
Sbjct: 140 NAGKAGLNMYSPLNINCFRDPRWGRGQETTGEDPLHMSRYAVKMVQGLQGPN-------- 191

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
                               DEL L+A CKH +AYDLEKW    RY F+A ++ Q+  + 
Sbjct: 192 -------------------QDELRLAATCKHYLAYDLEKWDGVERYQFDAQVSRQELAEF 232

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGV-PACLRGDLFQKARNEWGF---KGYITSDCDAV 330
           + P FR+C+  GKA  +M SYN VN V P+  R  L   AR EWG      Y+TSDCDAV
Sbjct: 233 YLPQFRACVRDGKAVTLMTSYNAVNNVPPSASRYYLETLARKEWGLDKKHNYVTSDCDAV 292

Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
           A +F+  +Y  ++  +AA  + AG D+NCG     +   A+++     + I  A+  +++
Sbjct: 293 ANVFDGHHYADSYVQAAADSINAGTDLNCGATYSDNLGQALEQNLTDVETIRTAVARMYA 352

Query: 391 VQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
            Q+RLGLF  DP++G+   +LG + V T   + LA  +A   + LLKN+   LP+  +  
Sbjct: 353 SQVRLGLF--DPKQGQPLRELGWEHVNTKAAQDLAYSSAAASVTLLKNNGT-LPV--DGA 407

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKT--HYASGCHDVPCNSDAGF 507
           + +A+IGP  N    + G Y G      ++    +   S+     A+G       +    
Sbjct: 408 TKVAVIGPYSNATFALRGNYAGPGPFAITMTEAAQRVFSQATISSANGTTISGTYNHTDA 467

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
             A+++AK+AD VI   G+D T E+E+ DR ++  P  Q+ L+ ++   +K+ + +V  G
Sbjct: 468 EAAMQLAKEADLVIFAGGIDPTIESEELDRATIAWPPNQLQLIHALGGMAKK-MAVVQFG 526

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VP 626
           GG +D +  +AD  I ++LW GYPG++GA A+ ++I G+  P GRLP+T YP  +   + 
Sbjct: 527 GGQIDGASIKADGNIGALLWAGYPGQSGALAVMDVIAGNTAPAGRLPITQYPAEYIDGLA 586

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
              M +R +++  YPGR+Y++Y+GT  Y + HGL YT +  + L+ P+  TI+ +     
Sbjct: 587 ETTMALRPNAT--YPGRTYKWYSGTPTYPYAHGLHYTEFKAE-LAQPAPYTIATA----- 638

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                        Y   + V +       VQ ++TNAG     +  ++FAR        P
Sbjct: 639 ------------GYAEFERVAT-------VQATITNAGQRTSDYAALVFARHTNGPAPHP 679

Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVL 794
            K L+G+ +V  +A G S+ +   +     L+  ++ G  +L  G + L
Sbjct: 680 NKTLVGYKKVKAIAPGESRSVEVEITQAA-LARGDEEGNLVLYPGKYEL 727


>gi|426198365|gb|EKV48291.1| hypothetical protein AGABI2DRAFT_219902 [Agaricus bisporus var.
           bisporus H97]
          Length = 767

 Score =  447 bits (1149), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 276/777 (35%), Positives = 418/777 (53%), Gaps = 64/777 (8%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C      S   C+ + + + RAK+LI + T +E +Q   + +  +PRLG+P+Y+WWS
Sbjct: 24  FPDCVNGPLSSTAVCDPTKAPAARAKTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWS 83

Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+A + PGV+F  +G  SS TSFPQ +V  ++F+  L   + + ++ EARA  N  
Sbjct: 84  EALHGVAGS-PGVSFAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFH 142

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           +AGL ++ PNIN F+DPRWGRGQETPGEDP  VS Y    +   QG            G 
Sbjct: 143 RAGLDYFTPNINPFKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQG------------GI 190

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
             +   K             ++A CKH  AYDL+ W    R+ F+A ++ QD  + + P 
Sbjct: 191 DPRPYFK-------------VAADCKHYAAYDLDSWEGIDRFHFDAKVSLQDLSEYYLPS 237

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFE 335
           F+SC+   K + +MCSYN VNG+PAC    L Q   R+ WGF    ++TSDCDA+  IF 
Sbjct: 238 FQSCVRDAKVASVMCSYNSVNGIPACANPYLLQDILRDFWGFDDDRWVTSDCDAIGNIFT 297

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
             N+T T  ++ A  LKAG D++CGT    H   A+++  +   D++RAL   ++  +RL
Sbjct: 298 THNFTDTFAEAVADALKAGTDVDCGTSYSTHLPDALNQSLITRDDLERALTRQYTSLMRL 357

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G F+  P      +L   DV   + + LA  AA +G+VLLKND  FLP++ +   ++AII
Sbjct: 358 GYFD-PPESQPLRQLAWSDVNKPDAQALAHTAAVEGLVLLKND-GFLPVSASG-KTIAII 414

Query: 456 GPLVNNISQMGGGYTGI-PCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           GP  N    M G Y G  P         ++A  ++   A+G   +   S+A F  A+ +A
Sbjct: 415 GPYANATKDMQGNYFGTAPFIVTPFQGAVDAGFNEVVSAAGT-SINGTSEADFAAAIAVA 473

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
             +D +I   G++ + E+E +DR+++   G Q+SLV  +A   K PV++V  GGG LD S
Sbjct: 474 NSSDIIIFAGGINNSIESEAKDRLTIAWTGNQLSLVKQLASLGK-PVVVVQFGGGQLDDS 532

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMR 633
               +  + +++W GYPG++G  A+ ++I G   P GRL +T YPE F  +V M DM +R
Sbjct: 533 DLLDNDAVRAVIWAGYPGQSGGTAIFDVITGAVAPAGRLSVTQYPEDFVNQVGMTDMALR 592

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S+   PGR+Y++YTG  V  FGHGL +T + + +   P        L   +DK     
Sbjct: 593 PGSAN--PGRTYKWYTGRPVLEFGHGLHFTTFDFSWRGRPGRKYNIQHLLHTADKKF--- 647

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                D + +D         FHV I   N G++   +V +LF R        P+K L+ F
Sbjct: 648 ----PDLIPLDT--------FHVNIR--NTGNITSDYVALLFLRSNAGFAPHPKKSLVSF 693

Query: 754 DRVHTVAKGSK-EISFGVDPCEQLSIANKHGRRILPLGNHVLMV----GELRHSLTI 805
            R H +  GS   +  GV+    ++  ++HG   L  G++ L++    G L HS ++
Sbjct: 694 ARAHRIDAGSSATVDLGVN-LGSIARVDEHGDSWLFAGDYQLVLDIGDGVLSHSFSL 749


>gi|413919687|gb|AFW59619.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 451

 Score =  447 bits (1149), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 219/439 (49%), Positives = 290/439 (66%), Gaps = 29/439 (6%)

Query: 40  PDFPCKPPH--FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P F C   +    SY FCN S + + RA  L+S LTL EK+  L D  +A+PRLG+P YE
Sbjct: 38  PAFACDASNATLASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALPRLGVPLYE 97

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG++  GPG  F+  V   TSFPQ +++AASFN +L+  IG  V+ EARAM+N+
Sbjct: 98  WWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSNEARAMHNV 157

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
           G AGLTFW+PNINIFRDPRWGRGQETPGEDP++ S YAV +V   QG             
Sbjct: 158 GLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQG------------- 204

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                 + G G        L ++ACCKH  AYD++ W    RY+F+A++++QD +DTFQP
Sbjct: 205 -----AVSGAG-------ALKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQP 252

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
           PF+SC+  G  + +MCSYNQVNG P C   DL     R +W   GYI+SDCD+V  ++  
Sbjct: 253 PFKSCVVDGNVASVMCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNN 312

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           Q+YTKT ED+AA  +KAG+D+NCGT + +HT +A+  GK+ E D+DRA+ N     +RLG
Sbjct: 313 QHYTKTPEDAAAISIKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLG 372

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
            F+GDPR+  +G LGP DVCT  +++LA +AARQGIVLLKN  K LPL+  ++ S+A+IG
Sbjct: 373 FFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSMAVIG 431

Query: 457 PLVNNISQMGGGYTGIPCS 475
           P  N    M G Y G  C+
Sbjct: 432 PNANASFTMIGNYEGTSCN 450


>gi|6573772|gb|AAF17692.1|AC009243_19 F28K19.27 [Arabidopsis thaliana]
          Length = 696

 Score =  447 bits (1149), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 241/486 (49%), Positives = 322/486 (66%), Gaps = 18/486 (3%)

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRAL 385
           DCDAV+ I++ Q Y K+ ED+ A VLKAGMD+NCG+ + +HT+SA+ + KV E DIDRAL
Sbjct: 221 DCDAVSIIYDAQGYAKSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRAL 280

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
           LNLFSV++RLGLFNGDP K  YG + P++VC+  H+ LALDAAR GIVLLKN+ K LP +
Sbjct: 281 LNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFS 340

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
           K +VSSLA+IGP  + +  + G Y G PC   + L  L +YV    Y  GC  V C S+A
Sbjct: 341 KRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVAC-SNA 399

Query: 506 GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
              +AV IAK AD V+++ GLD TQE ED DRV L LPG+Q  L+TSVA  +K+PV+LVL
Sbjct: 400 AIDQAVAIAKNADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVL 459

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
             GGP+D+SFA  +++I SI+W GYPGEAG  A++EIIFGD NPGGRLP+TWYP+SF  +
Sbjct: 460 ICGGPVDISFAANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVNI 519

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKA 684
            M DM MR  S+  YPGR+Y+FY G +VY FGHGLSY+ YSY+F + A + L ++ S KA
Sbjct: 520 QMTDMRMR--SATGYPGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQS-KA 576

Query: 685 GSDKNILQQTGSRLDYVHIDEV--TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
                  Q     + Y  + E+    C   +  V + V N G++ G H V++FAR  +  
Sbjct: 577 -------QTNSDSVRYTLVSEMGKEGCDVAKTKVTVEVENQGEMAGKHPVLMFARHERGG 629

Query: 743 QG--TPEKQLIGFDRVHTVAKGSK-EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
           +     EKQL+GF  +  ++ G K E+ F +  CE LS AN+ G  +L  G + L VG+ 
Sbjct: 630 EDGKRAEKQLVGFKSI-VLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDS 688

Query: 800 RHSLTI 805
              L +
Sbjct: 689 ELPLIV 694



 Score =  241 bits (615), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 115/218 (52%), Positives = 141/218 (64%), Gaps = 23/218 (10%)

Query: 40  PDFPCKP--PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYE 97
           P   C P  P    Y FC T L I  RA+ L+S LT+ EKI QL + A  IPRLG+PAYE
Sbjct: 22  PPHSCDPSNPTTKLYQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGIPRLGVPAYE 81

Query: 98  WWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           WWSE+LHG+A  GPG+ FNGTV + TSFPQV+++AASF+   W  I   +  EAR +YN 
Sbjct: 82  WWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIGKEARGVYNA 141

Query: 158 GQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
           GQA G+TFWAPNINIFRDPRWGRGQETPGEDPM+   YAV +V+  QG++          
Sbjct: 142 GQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDS---------- 191

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
            F  ++ L          + L  SACCKH  AYDL++W
Sbjct: 192 -FDGRKTLS---------NHLQASACCKHFTAYDLDRW 219


>gi|392596548|gb|EIW85871.1| hypothetical protein CONPUDRAFT_80240 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 770

 Score =  447 bits (1149), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 274/761 (36%), Positives = 403/761 (52%), Gaps = 70/761 (9%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TSL+ + RA +LI L T+ E I    + A  +PRLG+PAYEWWSE LHG+A N  GV 
Sbjct: 37  CDTSLNATQRAAALIDLFTVDELIVNTVNWAPGVPRLGLPAYEWWSEGLHGVA-NSAGVT 95

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           ++  G  S  TSFPQ ++ +A+F+ +L   +G  + +E RA  N G AGL FW PNIN F
Sbjct: 96  WSITGPFSYATSFPQPILMSAAFDDALIKAVGGVIGMEGRAFNNYGHAGLDFWTPNINPF 155

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGEDP  ++ Y    ++  QG                     G   E  
Sbjct: 156 KDPRWGRGQETPGEDPYHIAQYVYNLIQGLQG---------------------GLDPEP- 193

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                 + A CKH   YDLE W    RY +NA+I+ QD  + + P F+SC     A   M
Sbjct: 194 ---YFQVVATCKHFAGYDLEDWDFNYRYGYNAIISTQDLSEYYLPSFQSCYRDAFAGASM 250

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAAG 349
           CSYN +NG+P C    L Q   R  WGF    ++T DCD+V  I+++ +YT   + +AA 
Sbjct: 251 CSYNAINGIPTCADTYLLQDILRGFWGFDQTRWVTGDCDSVEDIYDFHHYTALPQQAAAD 310

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YG 408
            LKAG DI+CG         A  +  + E+D+  AL   ++  +RLG F  DP   + Y 
Sbjct: 311 ALKAGSDIDCGIFYTTWLPLAYTESLITEQDLRAALTRQYASLVRLGYF--DPASEQPYR 368

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
           +    +V TS  ++LA  AA +GI LLKND   LP + +A+ ++A+IGP     +QM G 
Sbjct: 369 QYNWSNVDTSYAQELAYTAAVEGITLLKNDGT-LPFS-SAIKNIALIGPWTFATTQMQGN 426

Query: 469 YTGIPCSPKSLLRGLE------AYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
           Y G      S  +G +      +YV +T       +V  N+  G+  A   A+ AD ++ 
Sbjct: 427 YYGNAPYLISPYQGAQLAGYNISYVLET-------NVTSNTTDGYAAAFTAAQGADAIVF 479

Query: 523 VAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           V G+D T E E  DR  +  P  Q+ L+  + +  K P+++V  GGG +D +   A+  +
Sbjct: 480 VGGIDNTVEAEAMDRNDITWPAFQLWLIGELGKLGK-PLVVVQFGGGQVDDTEINANPDV 538

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSS-RQY 640
           +++LW GYPG++G +AL +II G   P GRL  T YP  +  ++PM +MN+R D++    
Sbjct: 539 NALLWGGYPGQSGGQALFDIISGKVAPAGRLVSTQYPADYVNEIPMTNMNLRPDANGTTS 598

Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE-LTISASLKAGSDKNILQQTGSRLD 699
           PGR+Y++YTGT VY FG+GL YTN++Y +  AP+   +I A + AG       Q  + +D
Sbjct: 599 PGRTYKWYTGTPVYEFGYGLHYTNFTYAWTKAPAATYSIEALVAAG-------QGSAHID 651

Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
               D ++          + VTNAG V   +  +LF          P K L  + R+H V
Sbjct: 652 LAPFDTLS----------VEVTNAGAVTSDYSALLFVNGTYGPAPYPNKSLAAYTRLHNV 701

Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
             G+ + +       Q++ A+  G   L  G + + +   R
Sbjct: 702 TAGASQTATFEVVLNQIARADVQGNFWLYPGAYEVALDTTR 742


>gi|398403795|ref|XP_003853364.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
 gi|339473246|gb|EGP88340.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
          Length = 785

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 279/778 (35%), Positives = 407/778 (52%), Gaps = 75/778 (9%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C+     +   C+ +    TRA +LI+  T++EKI      A  +PRLG+PAY WW 
Sbjct: 25  FPDCQNGPLKNNTVCDFTADPLTRATALIAAFTIEEKINNTGSTAPGVPRLGLPAYTWWQ 84

Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+A + PGVNF+  G     TSFPQ ++  A+F+  L  ++ + ++ EARA  N  
Sbjct: 85  EALHGVAQS-PGVNFSDSGDFRYATSFPQPILMGAAFDDDLIKDVATVISTEARAFNNDA 143

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           ++GL +W PNIN F+D RWGRGQETPGEDP  +S+Y    +   QG+             
Sbjct: 144 RSGLDYWTPNINPFKDSRWGRGQETPGEDPYHLSSYVKSLIAGLQGDG------------ 191

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW-GNFSRYSFNAMITEQDTEDTFQP 277
           + K+V+                A CKH +AYDLE W GNF RY F+  +  Q+  + + P
Sbjct: 192 KYKKVV----------------ATCKHFVAYDLETWNGNF-RYQFDPHVGSQELVEYYMP 234

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATI 333
           PF++C         MCSYN +NG+P C    L Q   R  W +   + ++TSDCD++  +
Sbjct: 235 PFQACARDANVGAFMCSYNSLNGIPTCADPYLLQTILREHWNWTSEEQWVTSDCDSIQNV 294

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
           +    YT T E++ A  LKAG D+NCGT        A+  G V EKDID AL+  +S  +
Sbjct: 295 YLPHEYTSTREEAVAVSLKAGTDVNCGTYYQEFLPGALSLGLVTEKDIDMALIRQYSSLV 354

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           RLG F+G     +Y  L   DV T   ++LAL AA +GI LLKND   LPL     + +A
Sbjct: 355 RLGYFDG--TAVEYRSLSWKDVSTPYAQQLALKAAVEGITLLKND-GILPLAITKDTKIA 411

Query: 454 IIGPLVNNISQMGGGYTGIPC---SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEA 510
           +IG   N   QM G Y GIP    SP    +   A V+ +    G  D   N+      A
Sbjct: 412 VIGDWANATEQMLGNYDGIPPYLHSPLWAAQQTGANVTYSGNPGGQGDPTTNNWLHIWTA 471

Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           V    +AD ++   G+D   E E  DRVS+   G Q+ ++  +A   K PVI+   G   
Sbjct: 472 V---DEADVILFAGGIDNGVEAEGMDRVSIAWTGAQLDVIGQLASRGK-PVIVAQMGTNG 527

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMND 629
           +D +    +  IS++LW GYPG+ G  AL +II G   P GRLP T YP S+ +KVPM D
Sbjct: 528 VDSTPLLNNQNISALLWGGYPGQDGGVALLDIIQGKSAPAGRLPTTQYPASYISKVPMTD 587

Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
           M++R +S+  +PGR+Y +Y    V+ FG+GL YTN+S     +P++ T S S+ A   K+
Sbjct: 588 MHLRPNSTTGFPGRTYMWYNEKPVFEFGYGLHYTNFSATI--SPTD-TTSFSI-ADLTKD 643

Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
             +    R  +  +             +I+VTN G+V   +V + F          P K+
Sbjct: 644 CTEHYMDRCPFADM-------------KIAVTNTGNVTSDYVTLGFLAGEHGPAPCPNKR 690

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
           L+ + R+H +  G+ + +        L++      R+  +GN VL  G   ++L I+T
Sbjct: 691 LVNYQRLHNITAGASQTT-------SLNLTLASLARVDDMGNTVLYPGS--YALLIDT 739


>gi|302683060|ref|XP_003031211.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
 gi|300104903|gb|EFI96308.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
          Length = 761

 Score =  445 bits (1144), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 275/753 (36%), Positives = 404/753 (53%), Gaps = 67/753 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TSL    RA++L+  LT+ E I      A  +PRLG+P Y WW+E+LHG+A++ PGV 
Sbjct: 35  CDTSLGHVERARALVEELTVAEMINNTVHTAPGVPRLGLPPYNWWNEALHGVAAS-PGVV 93

Query: 115 FNG---TVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F       SS TSFP  +   ++F+ +L   +G+  + EARA  N G AGL +W PNIN 
Sbjct: 94  FTSPGEEFSSATSFPMPINMGSAFDDALMLAVGNVTSTEARAFNNAGLAGLDYWTPNINP 153

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           F+DPRWGRG ETPGEDP+  + Y    V+  QG                     G    S
Sbjct: 154 FKDPRWGRGAETPGEDPLHAARYVRTLVEGLQG---------------------GIDPPS 192

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                L ++A CKH  AYDLE WG  +RY+F+A++T QD  + + PPF+SC+   +A+ +
Sbjct: 193 -----LKVAADCKHWAAYDLEDWGGVARYAFDAVVTPQDLAEYYSPPFKSCVRDARAASV 247

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVPAC    L +   R+ WG     ++TSDCDAV  +++   YT+   + +A
Sbjct: 248 MCSYNAVNGVPACASPYLLKTVLRDAWGLAEDRWVTSDCDAVGNVYDPHGYTEDFVNGSA 307

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             LKAG D++CGT   ++   A D+G + E D+  AL  L++  + LG F+  P    Y 
Sbjct: 308 VSLKAGSDLDCGTTYSQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYFDA-PEDQPYR 366

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN-NISQMGG 467
           ++   DV T   + LA  AA +  VLLKND   LPL  +++ S+A+IGP+ N +  Q+ G
Sbjct: 367 QISWADVNTPAAQALAYTAAIESFVLLKNDGT-LPLTDSSL-SIALIGPMANASAVQLQG 424

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y GIP    + L+G         Y  G + V  N       AV  A+ AD VI V G+D
Sbjct: 425 NYNGIPPFAIAPLQGFLDAGFNVTYVLGTN-VTGNDADDIDGAVAAAEAADVVIYVGGID 483

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T E E +DR  +  P  Q++L++++    K P+++V  GGG LD +  +    +++ILW
Sbjct: 484 STVEEEAKDRTEISWPDNQLALLSALEEAGK-PLVVVQMGGGQLDDTPLKESDAVNAILW 542

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYR 646
            GYPG++G  A+A+ + G   P GRL +T YP S+   V M DM +R D+S   PGR+Y+
Sbjct: 543 AGYPGQSGGTAIADTVMGKVAPAGRLSITQYPASYVDAVAMTDMTLRPDNSTGNPGRTYK 602

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +YTGT VY +G+GL YTN+S  + S   E   S        +++       +D   +D  
Sbjct: 603 WYTGTPVYPYGYGLHYTNFSVAWASDAPEACYSI-------QDLTSSADGFVDLAPLDT- 654

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK-- 764
                     +++VTN GDV    V +LF          P K+L+ + R   V  G    
Sbjct: 655 ---------FRVTVTNDGDVASDFVALLFVSTQAGPAPAPMKELVAYARASDVQPGDSTD 705

Query: 765 ---EISFGVDPCEQLSIANKHGRRILPLGNHVL 794
              E++ G      L+ +++ G   L  G++ L
Sbjct: 706 VDLEVTLGA-----LARSDESGDASLYPGDYEL 733


>gi|409079872|gb|EKM80233.1| hypothetical protein AGABI1DRAFT_57801 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 767

 Score =  443 bits (1140), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 274/777 (35%), Positives = 417/777 (53%), Gaps = 64/777 (8%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C      S   C+ + + + RA +LI + T +E +Q   + +  +PRLG+P+Y+WWS
Sbjct: 24  FPDCVNGPLSSTAVCDPTKAPAARATTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWS 83

Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+A + PGV+F  +G  SS TSFPQ +V  ++F+  L   + + ++ EARA  N  
Sbjct: 84  EALHGVAGS-PGVSFAPSGEFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFH 142

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           +AGL ++ PNIN F+DPRWGRGQETPGEDP  VS Y    +   QG            G 
Sbjct: 143 RAGLDYFTPNINPFKDPRWGRGQETPGEDPFHVSQYVYSLIDGLQG------------GI 190

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
             +   K             ++A CKH  AYDL+ W    R+ F+A ++ QD  + + P 
Sbjct: 191 DPRPYFK-------------VAADCKHYAAYDLDSWEGIDRFHFDAKVSLQDLSEYYLPS 237

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFE 335
           F+SC+   K + +MCSYN VNG+PAC    L Q   R+ WGF    ++TSDCDA+  IF 
Sbjct: 238 FQSCVRDAKVASVMCSYNSVNGIPACANPYLLQDILRDFWGFDDDRWVTSDCDAIGNIFT 297

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
             N+T T  ++ A  LKAG D++CGT    H   A+++  +   D++RAL   ++  +RL
Sbjct: 298 THNFTDTFAEAVADALKAGTDVDCGTSYSTHLPDALNQSLITRDDLERALTRQYTSLMRL 357

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G F+  P      +L   DV   + + LA  AA +G+VLLKND  FLP++ +   ++AII
Sbjct: 358 GYFD-PPESQPLRQLAWSDVNKPDAQALAHTAAVEGLVLLKND-GFLPVSASG-KTIAII 414

Query: 456 GPLVNNISQMGGGYTGI-PCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           GP  N    M G Y G  P         ++A  ++   A+G   +   S+A F  A+ +A
Sbjct: 415 GPYANATKDMQGNYFGTAPFIVTPFQGAVDAGFNEVVSAAGT-SINGTSEADFAAAIAVA 473

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
             +D +I   G++ + E+E +DR+++   G Q+SLV  +A   K PV++V  GGG LD S
Sbjct: 474 NSSDIIIFAGGINNSIESEAKDRLTIAWTGNQLSLVKQLASLGK-PVVVVQFGGGQLDDS 532

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMR 633
               +  + +++W GYPG++G  A+ ++I G   P GRL +T YPE F  +V M DM +R
Sbjct: 533 DLLDNDAVRAVIWAGYPGQSGGTAIFDVITGAVAPAGRLSVTQYPEDFVNQVGMTDMALR 592

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S+   PGR+Y++YTG  V  FGHGL +T + + +   P        L   +DK     
Sbjct: 593 PGSAN--PGRTYKWYTGRPVLEFGHGLHFTTFDFSWRGRPGRKYNIQHLLHTADKKF--- 647

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                D + +D         FHV I   N G++   +V +LF +        P+K L+ F
Sbjct: 648 ----PDLIPLDT--------FHVNIR--NTGNITSDYVALLFLKSNAGFAPHPKKSLVSF 693

Query: 754 DRVHTVAKGSKE-ISFGVDPCEQLSIANKHGRRILPLGNHVLMV----GELRHSLTI 805
            R H +  GS   +  GV+    ++  ++HG   L  G++ L++    G L HS ++
Sbjct: 694 ARAHRIDAGSSATVDLGVN-LGSIARVDEHGDSWLFAGDYQLVLDIGDGVLSHSFSL 749


>gi|344302281|gb|EGW32586.1| hypothetical protein SPAPADRAFT_51129 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 788

 Score =  442 bits (1138), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 253/714 (35%), Positives = 384/714 (53%), Gaps = 52/714 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           CN  L  + RAK+++ L T+ E I  + + +  + RLG+P Y+WWSE LHGIA +    N
Sbjct: 61  CNPYLPNNQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEGLHGIARS----N 116

Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F  +G  S  TSFPQ ++   +FN  L+  +G+ +  EARA  N+G+AGL +++PNIN F
Sbjct: 117 FTASGEYSHATSFPQPILMGGAFNSDLYKQVGNVIGTEARAFNNVGRAGLDYYSPNINPF 176

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQE   E P++V  YA+ +V+  Q        GGI              + + 
Sbjct: 177 KDPRWGRGQEVASESPVLVGNYALNYVQGLQ--------GGI--------------DSNP 214

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
             D L ++A CKH   YD+E W   SR  +NA+I++QD  D + P F+SC+   KA+  M
Sbjct: 215 NDDTLQVAATCKHFAGYDMESWKQHSRLGYNAIISDQDLADYYFPTFQSCVRDAKAAGAM 274

Query: 293 CSYNQVNGVPACLRGDLFQKARNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           CSYN +NG+P C   + F       GF    G I SDCD++ +I+    Y +    +AA 
Sbjct: 275 CSYNAINGIPVCAS-EFFLGTVIREGFDFQNGVIHSDCDSLYSIWNPHLYVQDLGAAAAD 333

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            +KAG+D+NCG     +   A+    + E  I  ++   +S  +RLG F+  P+  KY  
Sbjct: 334 GIKAGVDVNCGDTYQNNLGYALGNKTINEDQIRASVTRQYSNLIRLGYFD-SPQTNKYRT 392

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
               DV TS+  +LA  AA +GI LLKND   LP NK+ V ++A+IGP  N  + M G Y
Sbjct: 393 YNWSDVSTSQANQLAYQAAVEGITLLKNDGT-LPFNKDKVKNVAVIGPWANATTDMLGDY 451

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
            G P    S L+G +    K  YA G   +       +  A+  AK AD ++   G+D +
Sbjct: 452 AGTPPYLISPLQGAQDSGFKVQYAYGTQ-INTTLTTNYTAALNAAKGADAIVYFGGIDNS 510

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
            E E  DR SL  PG Q+ LV+ ++  +K P+++V  G G +D +  + ++ ++SI++ G
Sbjct: 511 IENEALDRESLAWPGNQLDLVSKLSGLNK-PLVVVQFGAGQVDDTEIKNNNNVNSIVYAG 569

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFY 648
           YPG++G  A+ +++ G + P GRL  T YP S+  +VPM DM +R      YPGR++ +Y
Sbjct: 570 YPGQSGGTAIWDVLNGIYAPAGRLSTTQYPASYADQVPMTDMTLRPRDG--YPGRTFMWY 627

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
            G  VY FG+GL YT +S    +AP +    A      D+ I  ++   +D         
Sbjct: 628 NGEPVYEFGYGLHYTTFSVSLANAPPK---GAPQSFNIDQFIAAKSSQYVD--------- 675

Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
            TSL     +++ N G V   +  +L++         P K L+ FD++H +  G
Sbjct: 676 -TSLITTFDVNIKNTGKVTSDYAALLYSNTTSGPGPHPNKILVSFDKLHQIHPG 728


>gi|189203341|ref|XP_001938006.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187985105|gb|EDU50593.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 761

 Score =  442 bits (1138), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 271/744 (36%), Positives = 395/744 (53%), Gaps = 63/744 (8%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN--GTVSS 121
           RA+SL++L TL+EKI   S  A  +PRLG+P Y+WWSE LHGIA  GP  NF+  G  S 
Sbjct: 10  RAQSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWSEGLHGIA--GPYTNFSDSGEWSY 67

Query: 122 VTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQ 181
            TSFPQ ++  A+F+  L +++   ++ EARA  N  + GL FW PNIN FRDPRWGRGQ
Sbjct: 68  STSFPQPILMGAAFDDDLITDVAKVISTEARAFNNANRTGLDFWTPNINPFRDPRWGRGQ 127

Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
           ETPGED   +S+Y    +   QGE   S D         KRV+                A
Sbjct: 128 ETPGEDAYHLSSYVQALIHGLQGE---STD-------PYKRVV----------------A 161

Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
            CKH   YD+E W    RY  +  IT+Q+  + +  PF++C+ Q      MCSYN VNG 
Sbjct: 162 TCKHFAGYDVEDWNGNLRYQNDVQITQQELVEYYLAPFQACV-QANVGAFMCSYNAVNGA 220

Query: 302 PACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
           P C    L Q   R  WG+   + ++T DCDAV  ++    ++ T   +AA  L AG D+
Sbjct: 221 PPCADPYLLQTILREHWGWTNEEQWVTGDCDAVQNVYLPHQWSPTRAGAAADSLVAGTDV 280

Query: 358 NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
            CGT M  H  +A  +  + E  +D+AL+  +S  +RLG F+    +  Y +LG D V T
Sbjct: 281 TCGTYMQEHLPAAFQQKLLNESSLDQALIRQYSSLVRLGYFDASENQ-PYRQLGFDAVAT 339

Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK 477
           +  + LA  AA +GIVLLKND   LPL+ ++  ++ + G   N  SQ+ G Y G+     
Sbjct: 340 NASQALARRAAAEGIVLLKNDGT-LPLSLDSSVTVGLFGDWANATSQLLGNYAGVATYLH 398

Query: 478 SLLRGLEAYVSKTHYASGC----HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
           S L  LE    K +YA G      D   N  +  + A      +D +I V G+D + E E
Sbjct: 399 SPLYALEQTGVKINYAGGNPGGQGDPTTNRWSNLYGAY---STSDVLIYVGGIDNSVEEE 455

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
            RDR  L   G Q+ ++  +A T K PVI+V+TGGG +D S    +  IS+I+W GYPG+
Sbjct: 456 GRDRGYLTWTGAQLDVIGQLADTGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQ 514

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQ 652
            G  A+ +II G   P GRLP T YP ++T  V M +MN+R   +   PGR+Y++Y G+ 
Sbjct: 515 DGGSAIIDIIGGKTAPAGRLPQTQYPANYTAAVSMMNMNLRPGENS--PGRTYKWYNGSA 572

Query: 653 VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL 712
            + FG+G+ YTN+S +  +   +    +SL +G +      TG  L+         C   
Sbjct: 573 TFEFGYGMHYTNFSAEITTQMQQSYAISSLASGCNS-----TGGFLE--------RCPFA 619

Query: 713 RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
             +VQ  V N G+V   ++ + +          P K L+ + R+H++A G+   +     
Sbjct: 620 SVNVQ--VHNTGNVTSDYITLGYMAGTFGPAPHPRKTLVSYKRLHSIAGGATSTATLNLT 677

Query: 773 CEQLSIANKHGRRILPLGNHVLMV 796
              L+  ++HG ++L  G++ L +
Sbjct: 678 LASLARVDEHGNKVLYPGDYSLQI 701


>gi|125534110|gb|EAY80658.1| hypothetical protein OsI_35835 [Oryza sativa Indica Group]
          Length = 511

 Score =  442 bits (1137), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 236/493 (47%), Positives = 313/493 (63%), Gaps = 8/493 (1%)

Query: 320 KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEK 379
           + Y+ SDCDAVATI +  +YT + ED+ A  +KAGMD+NCG     H  +A+ KG + EK
Sbjct: 14  RKYVASDCDAVATIRDAHHYTLSPEDTVAVSIKAGMDVNCGNYTQVHAMAAVQKGNLTEK 73

Query: 380 DIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKND 438
           DIDRAL+NLF+V++RLG F+GDPR    YG LG  DVC+  HK LAL+AA+ GIVLLKND
Sbjct: 74  DIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKND 133

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS-KTHYASGCH 497
              LPL  +AV+SLA+IGP  +N+  + G Y G PC   + L+G++ Y+  +  + +GC 
Sbjct: 134 AGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGC- 192

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
           D P  + A  +EA  +A  +D V++  GL   QE E  DR SLLLPG+Q  L+T+VA  +
Sbjct: 193 DSPACAVAATNEAAALASSSDHVVLFMGLSQKQEQEGLDRTSLLLPGEQQGLITAVANAA 252

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           +RPVILVL  GGP+DV+FA+ + +I +IL  GYPG+AG  A+A+++FGD NP GRLP+TW
Sbjct: 253 RRPVILVLLTGGPVDVTFAKDNPKIGAILLAGYPGQAGGLAIAKVLFGDHNPSGRLPVTW 312

Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
           YPE FTKVPM DM MRAD +  YPGRSYRFY G  VY FG+GLSY+ +S +  S+ S   
Sbjct: 313 YPEEFTKVPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSN 372

Query: 678 I-SASLKAGSDKNILQQTGSRLDYVHIDE--VTSCTSLRFHVQISVTNAGDVDGSHVVML 734
             + SL AG         G  +    + E  V  C+ L F   + V N G +DG H V++
Sbjct: 373 AGNLSLLAGVMARRAGDDGGGMSSYLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSVLM 432

Query: 735 FARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVL 794
           + R P  S G P +QLIGF   H        +SF V PCE  S   + G R++  G H L
Sbjct: 433 YLRWPTKSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAHFL 492

Query: 795 MVG--ELRHSLTI 805
           MVG  EL  SL +
Sbjct: 493 MVGDEELETSLAL 505


>gi|302683012|ref|XP_003031187.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
 gi|300104879|gb|EFI96284.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
          Length = 752

 Score =  440 bits (1131), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 269/735 (36%), Positives = 389/735 (52%), Gaps = 61/735 (8%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C      S P C+ SL    RA++L+   T+ E I    + A  +PRLG+P YEWW+
Sbjct: 22  FPDCVNGPLASNPVCDASLGHVERARALVEEFTVPEMINNTVNAAFGVPRLGLPPYEWWN 81

Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           E+LHG+  +   V F    +  TSFP  +   ++F+ +L   +G  ++ EARA  N G+A
Sbjct: 82  EALHGVGLSPGVVFFEPEPAVATSFPMPINMGSAFDDALMLAMGDVISTEARAFSNAGRA 141

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GL +W PNIN F+DPRWGRG ETPGEDP+  + Y    V+  QG                
Sbjct: 142 GLDYWTPNINPFKDPRWGRGAETPGEDPLHAARYVRSLVEGLQG---------------- 185

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
                G    S     L ++A CKH  AYDLE WG  +RY+F+A++T QD  + + PPFR
Sbjct: 186 -----GIDPPS-----LKVAAACKHWAAYDLENWGGVTRYAFDAVVTPQDLAEYYAPPFR 235

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG--YITSDCDAVATIFEYQ 337
           SC+   +A+  MCSYN VNGVPAC    L +   R+ WG     ++TSDC AV  +++  
Sbjct: 236 SCVRDARAASAMCSYNAVNGVPACASPYLLKTVLRDAWGLAEDRWVTSDCGAVGNVYDPH 295

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
            YT+   +++   LKAG D+NCGT   ++   A D+G + E D+  AL  L++  + LG 
Sbjct: 296 GYTEDLVNASTVSLKAGTDLNCGTNYTQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGY 355

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           F+  P    Y ++   DV T E + LA  AA +  VLLKND   LPL  + + SLA+IGP
Sbjct: 356 FDA-PEDQPYRQITWADVNTPEAQALAYTAAIKSFVLLKNDGT-LPLTDSTL-SLALIGP 412

Query: 458 LVNNIS-QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKK 516
           + N  + QM G Y GIP    + L+G         Y  G + V  N    F  AV  A+ 
Sbjct: 413 MANASALQMLGNYFGIPPFVIAPLQGFLDAGFNVTYVLGTN-VTGNDAGSFDAAVAAAEA 471

Query: 517 ADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
           AD VI V G+D T E E++DR  +  P  Q++L++++    K P+++V  GGG LD +  
Sbjct: 472 ADVVIYVGGIDNTLEMEEKDRTEISWPDNQLALLSALEGVGK-PLVVVQMGGGQLDDTPL 530

Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
           +    +++ILW GYPG++G  A+A+ + G   P GRL          +V M DM +R D+
Sbjct: 531 KESDAVNAILWAGYPGQSGGTAIADTVTGKVAPAGRL-------YVDEVAMTDMTLRPDN 583

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
           +   PGR+Y++YTGT VY +G+GL YTN S  + S   E   S          I   TG 
Sbjct: 584 ATGNPGRTYKWYTGTPVYPYGYGLHYTNISVAWASDAPEACYS----------IQDLTGE 633

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
              +V +  + +        +++VTN GD+    V +LF          P K+++ + R 
Sbjct: 634 ASGFVDLAPLDT-------FRVTVTNEGDIASDFVALLFVSTQAGPAPAPIKEMVAYARA 686

Query: 757 HTVAKG-SKEISFGV 770
             V  G S E+   V
Sbjct: 687 SDVQPGNSTEVELEV 701


>gi|67902828|ref|XP_681670.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
 gi|74592887|sp|Q5ATH9.1|BXLB_EMENI RecName: Full=Exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|40747867|gb|EAA67023.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
 gi|259484335|tpe|CBF80465.1| TPA: beta-1,4-xylosidase (Eurofung) [Aspergillus nidulans FGSC A4]
          Length = 763

 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 282/775 (36%), Positives = 412/775 (53%), Gaps = 73/775 (9%)

Query: 35  PVLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGI 93
           P L + ++P C        P C+TSLS   RAKSL+S LTL+EKI      A+   RLG+
Sbjct: 18  PALCQSNYPDCTTGPLSELPICDTSLSPLERAKSLVSALTLEEKINNTGHEAAGSSRLGL 77

Query: 94  PAYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEA 151
           PAY WW+E+LHG+A    GV+F  +G  S  TSFP  +V  A+FN +L   +   ++ EA
Sbjct: 78  PAYNWWNEALHGVAEKH-GVSFEESGDFSYATSFPAPIVLGAAFNDALIRRVAEIISTEA 136

Query: 152 RAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
           RA  N   AG+ +W PN+N F+DPRWGRGQETPGEDP+  S Y  EFV   QG       
Sbjct: 137 RAFSNSDHAGIDYWTPNVNPFKDPRWGRGQETPGEDPLHCSRYVKEFVGGLQG------- 189

Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
                               D  ++  + A CKHL AYDLE+WG  SR+ F+A ++  D 
Sbjct: 190 --------------------DDPEKPKVVATCKHLAAYDLEEWGGVSRFEFDAKVSAVDL 229

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDC 327
            + + PPF++C         MCSYN +NGVPAC    L Q   R  WG++G   ++T DC
Sbjct: 230 LEYYLPPFKTCAVDASVGAFMCSYNALNGVPACADRYLLQTVLREHWGWEGPGHWVTGDC 289

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLN 387
            AV  I  Y +Y ++  ++AA  L AG+D++CGT +  +   A  +G +  + +D AL  
Sbjct: 290 GAVERIQTYHHYVESGPEAAAAALNAGVDLDCGTWLPSYLGEAERQGLISNETLDAALTR 349

Query: 388 LFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
           L++  ++LG F  DP +G+    LG DDV TSE ++LA   A QG VLLKN    LPL  
Sbjct: 350 LYTSLVQLGYF--DPAEGQPLRSLGWDDVATSEAEELAKTVAIQGTVLLKNIDWTLPLKA 407

Query: 447 NAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAG 506
           N   +LA+IGP +N  +++   Y G      +++   E        A G  +V   S  G
Sbjct: 408 NG--TLALIGPFINFTTELQSNYAGPAKHIPTMIEAAERLGYNVLTAPGT-EVNSTSTDG 464

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           F +A+ IA +AD +I   G+D T E E  DR  +  PG Q  L+  +A    RP+ +V  
Sbjct: 465 FDDALAIAAEADALIFFGGIDNTVEEESLDRTRIDWPGNQEELILELAELG-RPLTVVQF 523

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KV 625
           GGG +D S   A + + +I+W GYP +AG   + +++ G   P GRLP+T YP+S+  +V
Sbjct: 524 GGGQVDDSALLASAGVGAIVWAGYPSQAGGAGVFDVLTGKAAPAGRLPITQYPKSYVDEV 583

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
           PM DMN++  +    PGR+YR+Y    V  FG GL YT ++  +         +A+L  G
Sbjct: 584 PMTDMNLQPGTDN--PGRTYRWYE-DAVLPFGFGLHYTTFNVSWAKKAFGPYDAATLARG 640

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV-SQG 744
            + +                    +++     ++VTN GDV   +V ++FA  P++ +Q 
Sbjct: 641 KNPS--------------------SNIVDTFSLAVTNTGDVASDYVALVFASAPELGAQP 680

Query: 745 TPEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
            P K L+G+ R   +  G     ++   V P   L+ A + GR +L  G + L+V
Sbjct: 681 APIKTLVGYSRASLIKPGETRKVDVEVTVAP---LTRATEDGRVVLYPGEYTLLV 732


>gi|389748262|gb|EIM89440.1| hypothetical protein STEHIDRAFT_182874, partial [Stereum hirsutum
           FP-91666 SS1]
          Length = 772

 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 266/755 (35%), Positives = 405/755 (53%), Gaps = 61/755 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           CNT+     RA SLI    L + +    + +  + RLG+P Y+WW+E+LHG+ S+ PGVN
Sbjct: 39  CNTTAHFVDRATSLIEEFNLTDLVNNTVNGSPGVDRLGLPPYQWWNEALHGVGSS-PGVN 97

Query: 115 F----NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
           +    +   +S TSFP  ++  A+FN SL ++I   ++ EARA  N   AGLTF+ PNIN
Sbjct: 98  WGSGPDANFTSATSFPAPILLGATFNDSLIASIADVISTEARAFNNFNYAGLTFFTPNIN 157

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
            FRDPRWGRGQETPGEDP  +S Y  ++V   QG            G       K     
Sbjct: 158 PFRDPRWGRGQETPGEDPYHLSRYVYQYVVGLQG------------GLSPDPYYK----- 200

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                   + A CKH++AYD+E W    R  FNA++T QD  + + P F+ C+   + + 
Sbjct: 201 --------VLANCKHVLAYDVENWEGNDRTGFNAVVTTQDLSEFYTPSFQGCLRDAQGAS 252

Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDS 346
            MCSYN VNGVP+C    + +   R+ WG    +G+IT DC AV  I++   YT T  ++
Sbjct: 253 AMCSYNAVNGVPSCASSYILKDLVRDFWGLGEREGWITGDCGAVQNIYQPHGYTDTLVNA 312

Query: 347 AAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK 406
            A  + AG D++CG     +  +A+ +G +    I  AL+ L+   +RLG F  DP + +
Sbjct: 313 TAVAMDAGTDLDCGDVYSPNLWTAVVEGLITAGQIQTALIRLYGSLIRLGYF--DPAEQQ 370

Query: 407 -YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            Y      +V T   + LA +AA QGIVLL+ND   LPL+ N V ++A+IGP+ N    +
Sbjct: 371 PYRSFDWSNVNTPSSQDLAYNAAVQGIVLLEND-GLLPLSTN-VKNIALIGPMANATLSL 428

Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
            G Y GI     S  +  E       +A G   +  + ++G+ EA+  A+ AD V+ V G
Sbjct: 429 QGNYAGIAPFVISPQQAFETAGYNVTFAFGT-GISNSDNSGYSEALEAAQGADVVVFVGG 487

Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
           +D + E E +DR S+  PG Q+ L+  +    K P+++V  GGG  D S  +A++ ++++
Sbjct: 488 IDNSIEAEGQDRTSIEWPGSQLDLIGQLGELGK-PLVVVRMGGGQCDDSTLKANATVNAL 546

Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRS 644
           LW GYPG++G  AL +II G  +P GRLP+T YP S+ +++ M DM +R +SS   PGR+
Sbjct: 547 LWAGYPGQSGGTALVDIISGKQSPSGRLPVTQYPSSYVSEIDMTDMAIRPNSSGS-PGRT 605

Query: 645 YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID 704
           Y++YTG  +Y FG+G+ YT +   +  + S       + + ++K     +G   D   +D
Sbjct: 606 YKWYTGAPIYPFGYGIHYTTFRLAWSDSSSTTYNIQDIVSSANK-----SGGFADTEILD 660

Query: 705 EVTSCTSLRFHVQISVTNAG-DVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV-HTVAKG 762
             +          + VTN G +    +V +LFA         P ++L+G+ RV H    G
Sbjct: 661 TFS----------LLVTNTGSNYTSDYVALLFANSTSGPSPAPLQELVGYTRVPHITPGG 710

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           +      V     +S  +++G  IL  G + L VG
Sbjct: 711 TATAELNVT-LGSISRVDENGNWILYPGTYNLWVG 744


>gi|222618262|gb|EEE54394.1| hypothetical protein OsJ_01415 [Oryza sativa Japonica Group]
          Length = 776

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 281/796 (35%), Positives = 398/796 (50%), Gaps = 110/796 (13%)

Query: 44  CKPPHFDS-------YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           C P  F +       +P+C+ SL  + R + L+  +TL+EK+  L D A   PR+G+P Y
Sbjct: 47  CDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPRY 106

Query: 97  EWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR---- 152
                         P       V    +    L +     R +  +   AVA   R    
Sbjct: 107 CGGGRRCTAC----PTSARRDVVWRRRARRHQLPARHQQRRVVQRD---AVARHRRRGVD 159

Query: 153 ---AMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
               MYNLG A LT+W+PNIN+ RDPRWGR  ETPGEDP VV  YAV FV+  Q      
Sbjct: 160 GDQGMYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQ------ 213

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           D  G             F         + +S+CCKH                        
Sbjct: 214 DIDGATTAASAAAATDAFSRP------IKVSSCCKHY----------------------- 244

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
                             A+C+MCSYN++NGVPAC    L  +  R +W   GYI SDCD
Sbjct: 245 ------------------AACVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCD 286

Query: 329 AVATIF---EYQNYTKTHEDSAAGVLKAGMDINCGTC-------MLRHTQSAIDKGKVQE 378
           +V  +    ++  YT     +AA  +KAG+D++CG            +   A+ +GK++E
Sbjct: 287 SVRVMVRDAKWLGYTGVEATAAA--MKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKE 344

Query: 379 KDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND 438
             +D AL NL+   +RLG F+G P   +   LG  DVCT EHK+LA DAARQG+VLLKND
Sbjct: 345 SAVDNALTNLYLTLMRLGFFDGIP---ELESLGAADVCTEEHKELAADAARQGMVLLKND 401

Query: 439 KKFLPLNKNAVSSLAIIGPL--VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
              LPL+   V+S+A+ G L  +N    M G Y G PC   +   G+   VS T     C
Sbjct: 402 AALLPLSPEKVNSVALFGQLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTS-VHAC 460

Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
               C++      A   AK  D  IVVAGL+++ E E  DR  LLLP  Q S + +VA  
Sbjct: 461 DKGSCDT------AAAAAKTVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEA 514

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
           S  P++LV+   G +DVSFA+ + +I +++W GYPGE G  A+A+++FG +NPGGRLP+T
Sbjct: 515 SPSPIVLVIMSAGGVDVSFAQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLT 574

Query: 617 WYP-ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPS 674
           WY  E  +K+PM  M +R D+   YPGR+Y+FY G  V Y FGHGLSYTN++Y   +A +
Sbjct: 575 WYKNEYVSKIPMTSMALRPDAEHGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAA 634

Query: 675 ELTISASLKAGSDKNILQQT-----GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGS 729
            +T+    K G+ +   Q T      S      ++  +          ++V N G  DG+
Sbjct: 635 PVTV----KVGAWEYCKQLTYKAGVSSPPACPAVNVASHACQEEVSFAVTVANTGGRDGT 690

Query: 730 HVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL 789
           HVV ++   P    G P KQL+ F RV   A  + E++F ++ C+  +I  +    ++P 
Sbjct: 691 HVVPMYTAPPAEVDGAPRKQLVAFRRVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPS 750

Query: 790 GNHVLMVGELRHSLTI 805
           G   ++VG+   SL+ 
Sbjct: 751 GVSRVLVGDDALSLSF 766


>gi|242216161|ref|XP_002473890.1| beta-xylosidase [Postia placenta Mad-698-R]
 gi|220726990|gb|EED80923.1| beta-xylosidase [Postia placenta Mad-698-R]
          Length = 741

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 276/751 (36%), Positives = 403/751 (53%), Gaps = 62/751 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS +   RA +LISL TL+EKI    + A  +PRLG+PAY+WW E+LHG+A + PGV 
Sbjct: 34  CDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEALHGVAES-PGVI 92

Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F  +G  S  TSFPQ ++  A+F+ +L +++ + V+ EARA  N  ++G+ FW PNIN F
Sbjct: 93  FAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRSGIDFWTPNINPF 152

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGEDP  + +Y    +   QG          G     KR++        
Sbjct: 153 KDPRWGRGQETPGEDPFHLQSYVYNLITGLQG----------GLDPEYKRIV-------- 194

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                   A CKH  AYDLE W    RY F+A+++ QD  + +   FR+C         M
Sbjct: 195 --------ATCKHFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFM 246

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAA 348
           CSYN VNGVP+C    L Q   R+ WG+     YITSDCDA+  I+E   YT T  ++ A
Sbjct: 247 CSYNAVNGVPSCANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVA 306

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
             L AG D++CG     +  +A D+G   E  ++RAL+  ++  ++LG F  DP   + Y
Sbjct: 307 DALNAGTDLDCGEYYPENLGAAYDQGLFTESTLNRALIRQYAALVKLGYF--DPADIQPY 364

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            ++G  +V T E ++LA  AA +GI LLKND   LPL+  ++ ++A+IGP  N  +QM G
Sbjct: 365 RQIGWANVSTPEAEELAYTAAVEGITLLKNDGT-LPLSP-SIKTIALIGPWANATTQMQG 422

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCH-DVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
            Y G+     S L   E      +Y++G   D P  S   F  A   A+ AD +I   G+
Sbjct: 423 NYYGVAPYLISPLMAAEELGFTVYYSAGPGVDDPTTSS--FPAAFAAAEAADAIIYAGGI 480

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
           D+T E E  DR +L  PG Q   +  ++   K P+I++  GGG +D S    +  +++++
Sbjct: 481 DITVEAEAMDRYTLDWPGVQPDFIDQLSLLGK-PLIVLQFGGGQIDDSALLPNPGVNALV 539

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSY 645
           W GYPG++G KA+ +II G+  P GRLP+T YP  +  +V M DM++R   +   PGR+Y
Sbjct: 540 WGGYPGQSGGKAIMDIIVGNAAPAGRLPITQYPLDYVYQVAMTDMSLRPSPTN--PGRTY 597

Query: 646 RFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
            +YTGT +  FG GL YT ++   LS PS  +         D   L    S + +  +  
Sbjct: 598 MWYTGTPIVEFGFGLHYTTFTAS-LSQPSAPSY--------DIATLVSLCSGVAHPDLCP 648

Query: 706 VTSCTSLRFHVQISVTNAG-DVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK 764
             S T+       +VTN G  V    V +LF          P K L+ +DR+H +A  + 
Sbjct: 649 FASYTA-------NVTNTGSSVTSDFVSLLFLAGEHGPAPYPNKVLVAYDRLHAIAPLAS 701

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
           + +        LS  + +G  IL  G + L+
Sbjct: 702 QTTTLNLTLGSLSRVDDYGNTILYPGEYTLI 732


>gi|194700280|gb|ACF84224.1| unknown [Zea mays]
          Length = 452

 Score =  437 bits (1124), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 217/450 (48%), Positives = 299/450 (66%), Gaps = 13/450 (2%)

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
           MD+NCG+ +  H  SA+ +GK+ E+DI+RAL NLF+V++RLGLFNGDPR+  YG +GPD 
Sbjct: 1   MDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQ 60

Query: 415 VCTSEHKKLALDAARQGIVLLKND--KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
           VCT EH+ LAL+AA+ GIVLLKND     LPL+K  V+SLA+IG   N+  ++ G Y G 
Sbjct: 61  VCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGP 120

Query: 473 PCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
           PC   + L+ L+ YV  T + +GC+   CN      EAV+ A  AD V++  GLD  QE 
Sbjct: 121 PCVTVTPLQVLQGYVKDTSFVAGCNSAACNVTT-IPEAVQAASSADSVVLFMGLDQDQER 179

Query: 533 EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
           E+ DR+ L LPGQQ +L+ SVA  +K+PVILVL  GGP+DVSFA+ + +I +ILW GYPG
Sbjct: 180 EEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPG 239

Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ 652
           EAG  A+A+++FG+ NPGGRLP+TWYP+ FT+VPM DM MRAD +  YPGR+YRFY G  
Sbjct: 240 EAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTRVPMTDMRMRADPATGYPGRTYRFYRGPT 299

Query: 653 VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CT 710
           V+ FG+GLSY+ YS++F + P   +  A LKA      ++ T   +    ++ + S  C 
Sbjct: 300 VFNFGYGLSYSKYSHRFATKPPPTSNVAGLKA------VEATAGGMASYDVEAIGSETCD 353

Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQLIGFDRVHTVAKGSKEISF 768
            L+F   + V N G +DG H V++F R P  + G+  P  QLIGF  +H  A  +  + F
Sbjct: 354 RLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHVEF 413

Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
            V PC+  S A + GR+++  G+H +MVGE
Sbjct: 414 EVSPCKHFSRATEDGRKVIDQGSHFVMVGE 443


>gi|402225863|gb|EJU05924.1| hypothetical protein DACRYDRAFT_113532 [Dacryopinax sp. DJM-731
           SS1]
          Length = 778

 Score =  437 bits (1124), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 260/726 (35%), Positives = 388/726 (53%), Gaps = 61/726 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+++L   TRA++L+ +LT+ EK     + +  +PRLG+P Y WWSE LHG+AS+ PGV 
Sbjct: 42  CDSALDPLTRARALVGMLTMAEKFNNTVNASPGVPRLGLPPYNWWSEGLHGVASS-PGVT 100

Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F       S  TSFP+ ++  A+F+ +L  +I + ++ EARA  N   +GL FW PNIN 
Sbjct: 101 FAPAGQNFSYATSFPEPILMGAAFDDNLIYDIATIISTEARAFNNFNHSGLDFWTPNINP 160

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            RDPRWGR  ETPGEDP  +++Y  + V             G+ FG  + +  K      
Sbjct: 161 VRDPRWGRSLETPGEDPFHLASYVAKLVT------------GLQFGGDDPKYQK------ 202

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                  L A CKH   YDLE WG ++RY F+A+I+ QD  + F PPF++C      + +
Sbjct: 203 -------LVATCKHYAGYDLENWGGYARYGFDAVISNQDLVEYFLPPFQTCARDVNVTSV 255

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGFK--------GYITSDCDAVATIFEYQNYTKT 342
           MCSYN VNG+P+C    L Q   R  WG++         Y+TSDCDAV+ I+   NYT T
Sbjct: 256 MCSYNAVNGIPSCANDYLLQSLLRTYWGWEPDSESLNAHYVTSDCDAVSNIYYPHNYTIT 315

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
            E + A  LKAG D++CGT       S+ ++G   + DIDRAL+  ++    LG F  DP
Sbjct: 316 PEQAVAVSLKAGTDLDCGTFYAEWLPSSYEQGLFHQTDIDRALIRSYAALFLLGYF--DP 373

Query: 403 RKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
            +G+ Y +    ++ T   ++LA  AA +GI LLKN    LPL  + ++++A+IGP  N 
Sbjct: 374 AEGQIYRQYNWANINTDYAQQLAYTAAWEGITLLKNIDDMLPL-PSTMTNIALIGPWANA 432

Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
            +QM G Y GI     S L  L+       Y  G  ++  NS AGF  A+  A+ AD  +
Sbjct: 433 TTQMQGNYQGIAPFLHSPLYALQQRGINVTYVLGT-NITSNSTAGFAAALAAAQTADLTL 491

Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
            + G+D+T E E  DRV++  PG Q+ L+  +A  S   +I+   GGG +D +    + +
Sbjct: 492 YIGGIDITVEAEAMDRVNITWPGNQLDLIAQLANVSTH-LIVYQMGGGQIDDTVLLENPK 550

Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQY 640
           +  +LW GYPG+ G  A+ +I++G   P GRLP++ YP +F  +VPM D  MR   +   
Sbjct: 551 VHGLLWGGYPGQDGGTAMIDILYGSRAPAGRLPLSQYPANFINEVPMTD--MRLHPALGT 608

Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDY 700
           PGR+Y++Y+G  V  FG+GL YT ++   L   S      S    +  N  +Q+ + LD 
Sbjct: 609 PGRTYKWYSGDLVLPFGYGLHYTTFAKAALKDHSP----RSSDIATLVNEAKQSSAWLDK 664

Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
              D   +           VTN G +   +V + +          P+  L+ + R+  V 
Sbjct: 665 AFFDVFAA----------EVTNTGSLTSDYVALGYLTGEFGPAPYPKSSLVSYTRLSQVT 714

Query: 761 KGSKEI 766
            G  ++
Sbjct: 715 PGETQV 720


>gi|296439595|sp|A1CCL9.2|BXLB_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
          Length = 771

 Score =  437 bits (1124), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 284/770 (36%), Positives = 407/770 (52%), Gaps = 73/770 (9%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C          C+TS  ++TRA+SL+  ++  EK+      A  +PRLG+PAY WWS
Sbjct: 29  FPDCTSGPLSKLAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWS 88

Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+A   PGV+F  +G  S  TSF Q ++  ASF+  L   + + V  E RA  N G
Sbjct: 89  EALHGVAG-APGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAG 147

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           +AGL +W PNIN FRDPRWGRGQETPGEDP+ VS Y    V   Q        GGIG   
Sbjct: 148 RAGLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQ--------GGIGPAR 199

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
            +                  ++A CKH  AYD+E W   SR+ F+A ++ QD  + + P 
Sbjct: 200 PQ------------------IAATCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPS 241

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFK---GYITSDCDAVATIF 334
           F+SC+   +   +MCSYN +NGVP C    L Q   R  W +     ++ SDC A+  I+
Sbjct: 242 FKSCVRDAQVDAVMCSYNALNGVPTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIY 301

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
              NYTKT  ++AA  L AG D++CGT   +H   A ++G    + +DRAL+ L+S  ++
Sbjct: 302 IGHNYTKTGAEAAAVALNAGTDLDCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVK 361

Query: 395 LGLFNGDP-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           LG F  DP  K  YG +G  DV T   ++LA  AA +GIVLLKND+  LPL   A  +LA
Sbjct: 362 LGYF--DPAEKQPYGSIGWKDVDTPAAEQLAHKAAVEGIVLLKNDQT-LPL--KAKGTLA 416

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
           +IGP  N   QM G Y G P   ++L      +  +  Y+ G   +  +S AGF  A+  
Sbjct: 417 LIGPYANATKQMQGNYQGPPKYIRTLEWAATQHGYQVQYSPGTA-INNSSTAGFAAALAA 475

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           AK AD V+   G+D T E+E  DR ++  PG Q+SL++ ++   K P+I++  GGG +D 
Sbjct: 476 AKDADVVLYAGGIDNTIESETLDRTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDD 534

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNM 632
           +    +  ++++LW GYP + G  A+ +I+ G   P GRLP+T YP ++T +VPM +M +
Sbjct: 535 TPLLTNPHVNALLWAGYPSQEGGAAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGL 594

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           RA      PGR+YR+Y    V  FG GL YT++   +         +A+L   +      
Sbjct: 595 RAGGDN--PGRTYRWYD-KAVVPFGFGLHYTSFEVSWDRGRLGPYNTAALVNRAPG---- 647

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLI 751
             GS +D    D         F VQ  V N G V   +V +LF +      +  P K L+
Sbjct: 648 --GSHVDRALFDT--------FRVQ--VQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLV 695

Query: 752 GFDRVHTVAKGSK-----EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           G+ RV  V  G +     E++ G      ++    +G  +L  G + L V
Sbjct: 696 GYTRVQQVKPGERRSVEIEVTLGA-----MARTAANGDLVLYPGKYTLQV 740


>gi|347832625|emb|CCD48322.1| glycoside hydrolase family 3 protein [Botryotinia fuckeliana]
          Length = 772

 Score =  437 bits (1124), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 275/753 (36%), Positives = 397/753 (52%), Gaps = 65/753 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS    TRA +LISL TL EK+    + +  +PR+G+P+YEWW+E+LHGIA + PG  
Sbjct: 34  CDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIARS-PGTT 92

Query: 115 FNGTVSSV---TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F  T S+    TSFPQ ++  A+F+  L   + + V+ EARA  N+ + GL FW PNIN 
Sbjct: 93  FAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNRFGLNFWTPNINP 152

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           ++DPRWGRGQETPGEDP   S+Y    +   QG            G  +    KG     
Sbjct: 153 YKDPRWGRGQETPGEDPFHTSSYVNALITGLQG------------GLDDLPYKKGV---- 196

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                    A CKH   YDLE      RY F+A+I  QD  D + PPF+ C        +
Sbjct: 197 ---------ATCKHFAGYDLESSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSV 247

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSA 347
           MCSYN +NGVP C    L Q   R  WG+     ++TSDCDAV  I++Y NYT T E SA
Sbjct: 248 MCSYNAMNGVPTCADDWLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSA 307

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  L AG D++CGT    +  SA D+G      +DR+L   ++  +RLG F+  P    Y
Sbjct: 308 ADALNAGTDLDCGTFWPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFD-PPSVQPY 366

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            +L  D+V T   ++LAL AA  GIVLLKND   LPL+ N ++++A+IGPL N   QM G
Sbjct: 367 RQLNWDNVSTPAAQQLALQAAEDGIVLLKND-GILPLSSN-ITNVALIGPLANATKQMQG 424

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G     +S L   +    K  Y  G  D+   +   F  A+  A+ AD VI V G+D
Sbjct: 425 NYYGTAPYLRSPLIAAQNAGFKVTYVQGA-DIDSQNTTDFSAAISAAQSADLVIYVGGID 483

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            + E E+ DR S+  P  Q+SL+  +A  S   +I  +  G  +D S   +++ ++++LW
Sbjct: 484 NSIEAEEIDRTSISWPSSQLSLINQLANLSTPLIISQM--GCMIDSSSLLSNTGVNALLW 541

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYR 646
            GYPG+ G  A+  I+ G   P GRLP+T YP ++  +V M DMN++   SR  PGR+Y+
Sbjct: 542 AGYPGQDGGTAIFNILTGKTAPAGRLPITQYPSNYVNQVTMTDMNLQP--SRFNPGRTYK 599

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +Y G  V+ +G+GL YT             T  A +   S  N  + +    +  +  ++
Sbjct: 600 WYNGEPVFEYGYGLQYT-------------TFDAKITPSSPNNTFEISELLANASNYKDL 646

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK-- 764
           T        + I+V+N G     +V + F          P+K L+ + R+H +  G+   
Sbjct: 647 TPFV----KIPITVSNTGTTTSDYVALFFLSGTFGPAPHPKKSLVAYTRLHDITGGANAT 702

Query: 765 -EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
            E+S  +     L+  N +G  IL  G++ ++V
Sbjct: 703 AEVSLNL---ASLARGNWNGDLILYPGDYKVVV 732


>gi|291167620|dbj|BAI82526.1| 1,4-beta-D-xylosidase [Aureobasidium pullulans var. melanogenum]
          Length = 805

 Score =  437 bits (1123), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 254/752 (33%), Positives = 389/752 (51%), Gaps = 56/752 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+ S     RAK+L++  T+ EK+    +N+  +PRLG+P Y+WW E+LHG+AS+ PGV 
Sbjct: 44  CDKSADPVARAKALVAAFTVAEKLNLTGNNSPGVPRLGLPVYQWWQEALHGVASS-PGVT 102

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           FN  G   S TSFPQ ++  A+F+ +L  ++   V+ EARA  N G+AGL FW PNIN +
Sbjct: 103 FNATGQFDSATSFPQPILMGAAFDDALIQSVAEVVSTEARAFNNYGRAGLDFWTPNINPY 162

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRGQETPGEDP  +S+Y    +   QG                       GE+ +
Sbjct: 163 RDPRWGRGQETPGEDPYHLSSYVHSLIMGLQG-----------------------GEDPE 199

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                 ++A CKH   YD+E W    RY  +  I ++D  + + P FRSC         M
Sbjct: 200 I---RKITATCKHFAGYDIESWNGNLRYQNDVQIPQRDLVEYYLPSFRSCARDSNVGAFM 256

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAA 348
           C+Y+ +NGVP C    L     R  WG+     ++TSDCD++  IF   N++ T + +AA
Sbjct: 257 CTYSALNGVPTCADPWLLNDVLREHWGWTNEEQWVTSDCDSIQNIFLPHNFSDTRQGAAA 316

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             L AG D++CGT    H   A  +G + +  +D+AL+ L++  +R G F+G      Y 
Sbjct: 317 AALNAGTDLDCGTYYQHHLPLAYSQGLINQTTVDQALVRLYTSLVRTGYFDG--PNAMYR 374

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            L   DV T+  ++LAL AA +G+VLLKND   LPL+ +  + +A+IG   N  +QM G 
Sbjct: 375 NLTWSDVGTTHAQQLALQAAEEGMVLLKND-GLLPLSISNGTKIALIGSWANATTQMQGN 433

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G+P    S L   +   ++  YA G       +   +      A+KAD +I + G+D+
Sbjct: 434 YYGVPTYLHSPLYAAQQTGAQVFYAQGPGGQGDPTTDHWLPVWTAAEKADIIIYIGGVDI 493

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           + E E  DR  +   G Q+ ++  +A   K P++L    G  LD +    ++ IS+++W 
Sbjct: 494 SVEAEGMDREDINWTGAQLDIIGELAMYGK-PMVLAQM-GDQLDNTPIVNNANISALIWG 551

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
           GYPG+ G  AL  II G   P GRLP+T YP  +   +PM DM +R +++   PGR+Y++
Sbjct: 552 GYPGQDGGVALFNIITGKTAPAGRLPVTQYPAHYIADIPMTDMTLRPNATTGSPGRTYKW 611

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
           Y GT V+ FG+G+ YT +S             +SL +G ++    +              
Sbjct: 612 YNGTAVFEFGYGMHYTKFSADISPMSKSSYDISSLLSGCNETYKDRCAFE---------- 661

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
                   + ++V N G+V   +  + F          P+K L+ + R+H +A GS + +
Sbjct: 662 -------SISVNVHNTGNVTSDYAALGFIAGQFGPSPYPKKSLVNYQRLHNIAGGSSQTA 714

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
                   LS  + HG   L  G++ LM+  +
Sbjct: 715 TLNLTLGSLSRVDDHGNTYLYPGDYALMIDTM 746


>gi|121712174|ref|XP_001273702.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
 gi|119401854|gb|EAW12276.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
          Length = 803

 Score =  437 bits (1123), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 284/770 (36%), Positives = 407/770 (52%), Gaps = 73/770 (9%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C          C+TS  ++TRA+SL+  ++  EK+      A  +PRLG+PAY WWS
Sbjct: 61  FPDCTSGPLSKLAVCDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAYNWWS 120

Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+A   PGV+F  +G  S  TSF Q ++  ASF+  L   + + V  E RA  N G
Sbjct: 121 EALHGVAG-APGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAFGNAG 179

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           +AGL +W PNIN FRDPRWGRGQETPGEDP+ VS Y    V   Q        GGIG   
Sbjct: 180 RAGLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQ--------GGIGPAR 231

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
            +                  ++A CKH  AYD+E W   SR+ F+A ++ QD  + + P 
Sbjct: 232 PQ------------------IAATCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPS 273

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFK---GYITSDCDAVATIF 334
           F+SC+   +   +MCSYN +NGVP C    L Q   R  W +     ++ SDC A+  I+
Sbjct: 274 FKSCVRDAQVDAVMCSYNALNGVPTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIY 333

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
              NYTKT  ++AA  L AG D++CGT   +H   A ++G    + +DRAL+ L+S  ++
Sbjct: 334 IGHNYTKTGAEAAAVALNAGTDLDCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVK 393

Query: 395 LGLFNGDP-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           LG F  DP  K  YG +G  DV T   ++LA  AA +GIVLLKND+  LPL   A  +LA
Sbjct: 394 LGYF--DPAEKQPYGSIGWKDVDTPAAEQLAHKAAVEGIVLLKNDQT-LPL--KAKGTLA 448

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
           +IGP  N   QM G Y G P   ++L      +  +  Y+ G   +  +S AGF  A+  
Sbjct: 449 LIGPYANATKQMQGNYQGPPKYIRTLEWAATQHGYQVQYSPGTA-INNSSTAGFAAALAA 507

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           AK AD V+   G+D T E+E  DR ++  PG Q+SL++ ++   K P+I++  GGG +D 
Sbjct: 508 AKDADVVLYAGGIDNTIESETLDRTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDD 566

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNM 632
           +    +  ++++LW GYP + G  A+ +I+ G   P GRLP+T YP ++T +VPM +M +
Sbjct: 567 TPLLTNPHVNALLWAGYPSQEGGAAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGL 626

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           RA      PGR+YR+Y    V  FG GL YT++   +         +A+L   +      
Sbjct: 627 RAGGDN--PGRTYRWYD-KAVVPFGFGLHYTSFEVSWDRGRLGPYNTAALVNRAPG---- 679

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLI 751
             GS +D    D         F VQ  V N G V   +V +LF +      +  P K L+
Sbjct: 680 --GSHVDRALFDT--------FRVQ--VQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLV 727

Query: 752 GFDRVHTVAKGSK-----EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           G+ RV  V  G +     E++ G      ++    +G  +L  G + L V
Sbjct: 728 GYTRVQQVKPGERRSVEIEVTLGA-----MARTAANGDLVLYPGKYTLQV 772


>gi|334187562|ref|NP_196532.2| Glycosyl hydrolase family protein [Arabidopsis thaliana]
 gi|332004052|gb|AED91435.1| Glycosyl hydrolase family protein [Arabidopsis thaliana]
          Length = 526

 Score =  436 bits (1122), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 220/487 (45%), Positives = 313/487 (64%), Gaps = 10/487 (2%)

Query: 322 YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDI 381
           YI SDCD++  ++  Q+YTKT E++AA  + AG+D+NCG+ +  HT++A+ KG + E  I
Sbjct: 45  YIVSDCDSLGILYGSQHYTKTPEEAAAKSILAGLDLNCGSFLGNHTENAVKKGLIDEAAI 104

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           ++A+ N F+  +RLG F+G+P+   YG LGP DVCT E+++LA++ ARQGIVLLKN    
Sbjct: 105 NKAISNNFATLMRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGS 164

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC 501
           LPL+ +A+ +LA+IGP  N    M G Y G+ C   + L+GLE  V  T Y  GC +V C
Sbjct: 165 LPLSPSAIKTLAVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTC 224

Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
            ++A    A  +A  AD  ++V G D T E E  DR+ L LPG+Q  LVT VA+ ++ PV
Sbjct: 225 -TEADLDSAKTLAASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPV 283

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           +LV+  GG  D++FA+ D +I+SI+W+GYPGEAG  A+A++IFG  NP G+LPMTWYP+S
Sbjct: 284 VLVIMSGGGFDITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQS 343

Query: 622 FT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
           +  KVPM +MNMR D S  Y GR+YRFY G  VY FG GLSYTN+S++ + AP  +    
Sbjct: 344 YVEKVPMTNMNMRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFV---- 399

Query: 681 SLKAGSDKNILQQTGSRLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
           SL     ++        LD +  H ++     S  F VQ+ V N GD +G+  V LF   
Sbjct: 400 SLNLDESQSCRSPECQSLDAIGPHCEKAVGERS-DFEVQLKVRNVGDREGTETVFLFTTP 458

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           P+V  G+P KQL+GF+++    K    + F VD C+ L + ++ G+R L LG+H+L VG 
Sbjct: 459 PEV-HGSPRKQLLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGS 517

Query: 799 LRHSLTI 805
           L+HS  I
Sbjct: 518 LKHSFNI 524


>gi|115387056|ref|XP_001210069.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114191067|gb|EAU32767.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 908

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 275/738 (37%), Positives = 396/738 (53%), Gaps = 70/738 (9%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C+     S+  C+TSLSI+ R  SL+  LTL+EKI  L D A+   RLG+P YEWW+E+ 
Sbjct: 152 CQTQPLCSHRVCDTSLSIAERVNSLVKSLTLEEKILNLVDAAAGSTRLGLPFYEWWNEAT 211

Query: 104 HGIASNGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           HG+ S  PGV F       S  TSFP  ++ AASF+ +L   I   +  E RA  N G +
Sbjct: 212 HGVGS-APGVQFTSKPANFSYATSFPAPILIAASFDNALIRKIAEVIGKEGRAFANNGFS 270

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           G  FWAPNIN FRDPRWGRGQETPGED  V   Y   F+   QG++ K+           
Sbjct: 271 GFDFWAPNINGFRDPRWGRGQETPGEDTFVAQNYIRNFIPGLQGDDPKN----------- 319

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
           K+V+                A CKH   YDLE      RY  N   T+QD  D F  PF+
Sbjct: 320 KQVI----------------ATCKHYAVYDLET----GRYGNNYNPTQQDLSDYFLAPFK 359

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEY 336
           +C+       IMCSYN V+G+PAC    L  +  R  WGF     Y+ SDC+AV  I++Y
Sbjct: 360 TCVRDTDVGSIMCSYNSVSGIPACANEYLLDEVLRKHWGFNADYHYVVSDCNAVTDIWQY 419

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
            N+T T E +AA  L AG+D+ CG+  L+  +S +   +   K +D++L  L+S    +G
Sbjct: 420 HNFTDTEEAAAAVALNAGVDLECGSSYLKLNES-LAANQTSVKAMDQSLARLYSALFTIG 478

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN-KNAVSSLAII 455
            F+G    GKY  L   DV     + LA +AA +G+ LLKND   LPL+ ++   S+A+I
Sbjct: 479 FFDG----GKYDHLDFSDVSIPAAQALAYEAAVEGMTLLKND-GLLPLHSQHKYKSVAVI 533

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS----KTHYASGCHDVPCNSDAGFHEAV 511
           GP  N  +QM GGY+G   +   L+  L A+ S    K +YA G      N+  GF  ++
Sbjct: 534 GPFANATTQMQGGYSG---NAPYLISPLVAFESDHRWKVNYAVGTAINDQNT-TGFEASL 589

Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
             AKK+D ++ + G+D + E+E  DR SL  PG Q+ L+ S++  SK P+++V  GGG +
Sbjct: 590 AAAKKSDLIVYLGGIDNSIESETIDRTSLAWPGNQLDLIKSLSNLSK-PMVVVQFGGGQV 648

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDM 630
           D S    +  I +++W GYP ++G  AL +I+ G  +P GRLP+T YP S+  ++ + D+
Sbjct: 649 DDSALLENKDIQALIWAGYPSQSGGTALLDILVGKRSPAGRLPVTQYPASYADQINIFDI 708

Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
           N+R +S   +PGR+Y++YTG  V  FGHGL YT + + +           +L        
Sbjct: 709 NLRPNSKDSHPGRTYKWYTGKPVIPFGHGLHYTKFKFGW---------EETLNREYSIQE 759

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQ 749
           L  +  R     I + T  T+++  V+    N G     +V +LF           P K 
Sbjct: 760 LVASCQRSSGGPIKDNTPFTTVKARVR----NVGHETSDYVSLLFLSSKNAGPAPRPNKS 815

Query: 750 LIGFDRVHTVAKGSKEIS 767
           L+ + R+H +A GS  ++
Sbjct: 816 LVSYKRLHNIAPGSDRVA 833


>gi|451849522|gb|EMD62825.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
          Length = 849

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 264/741 (35%), Positives = 387/741 (52%), Gaps = 58/741 (7%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF--NGTVSS 121
           RAKSL++L TL+EKI   S++A  + RLGIP Y+WW+E LHGIA  GP  +F   G  S 
Sbjct: 114 RAKSLVALYTLEEKINATSNSAPGVARLGIPPYQWWNEGLHGIA--GPFTSFAKQGDYSY 171

Query: 122 VTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQ 181
            TSFPQ ++  A+F+ +L + + + ++ EARA  N+ + GL FW PNIN FRDPRWGRGQ
Sbjct: 172 STSFPQPILMGAAFDDNLITEVANVISTEARAFNNVNRTGLDFWTPNINPFRDPRWGRGQ 231

Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
           ETPGED   +S+Y    +   QG   ++D          +RV+                A
Sbjct: 232 ETPGEDSYHLSSYVKALIHGLQGN--ETDP--------YRRVV----------------A 265

Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
            CKH   YD+E W    RY  +  I++QD  + +  PF +C+ Q      MCSYN VNG 
Sbjct: 266 TCKHYAGYDIENWNGNLRYQNDVQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGA 324

Query: 302 PACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
           P C    + Q   R  WG+     ++TSDCD++  ++    ++ T E +AA  L AG D+
Sbjct: 325 PPCADPYMLQTVLREHWGWSSDEHWVTSDCDSIQNVYLPHQWSSTREGAAADSLNAGTDL 384

Query: 358 NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
           +CGT +  H   A+ +G   E  +D AL+  +S  ++LG F+  P    Y +LG D V T
Sbjct: 385 DCGTYLQSHLPGAVKQGLTNETTLDNALIRQYSSLIKLGYFD-IPENQPYRQLGFDAVAT 443

Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK 477
           S  + LAL AA +GIVLLKND   LP+N  +  ++ I G   N  SQ+ G Y G+     
Sbjct: 444 SASQALALKAAEEGIVLLKND-GVLPINFGS-KNVGIYGDWANATSQLQGNYFGVAKFLT 501

Query: 478 SLLRGLEAYVSKTHYASGCHDVPCNSDAG-FHEAVRIAKKADFVIVVAGLDLTQETEDRD 536
           S    LE       YA        +   G +     +   +D  I V G+D   E+EDRD
Sbjct: 502 SPYMALEKLGVNVRYAGNLPGGQGDPTTGSWPRLSGVITTSDVHIWVGGMDNGIESEDRD 561

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
           R  L L G Q+ ++  +A T K PVI+++ GGG +D S    + +IS++LW GYPG+ G 
Sbjct: 562 RSWLTLTGSQLDVIGQLADTGK-PVIVIIMGGGQIDTSPLIKNPKISAVLWAGYPGQDGG 620

Query: 597 KALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
            A+  I+ G   P GRLP T Y   + ++VPM DM MR   S + PGR+Y++YTG  ++ 
Sbjct: 621 TAIVNILTGKAAPAGRLPQTQYLYKYVSEVPMTDMAMRP--SNKNPGRTYKWYTGKPIFE 678

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GL YTN+S    + P +    + L  G +      TG  L+                
Sbjct: 679 FGYGLHYTNFSASITNQPKQSYAISDLVKGCNS-----TGGFLERCPFT----------G 723

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           + +SV N G     +V + F       +  P+K L+ +DR+  +A  S   +        
Sbjct: 724 INVSVQNTGKTSSDYVTLGFLTGSFGPKPYPKKSLVAYDRLFNIAASSSSTATLNLTLAS 783

Query: 776 LSIANKHGRRILPLGNHVLMV 796
           L+  ++ G ++L  G++ L +
Sbjct: 784 LARVDESGNKVLYPGDYELQI 804


>gi|330934749|ref|XP_003304687.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
 gi|311318569|gb|EFQ87188.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
          Length = 798

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 273/768 (35%), Positives = 394/768 (51%), Gaps = 65/768 (8%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C      +   C+ S S   RAKSL++L TL+EKI   S  A  +PRLG+P Y+WW+
Sbjct: 23  FPDCTNGPLKNVTICDPSASPLARAKSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWN 82

Query: 101 ESLHGIASNGPGVNFNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           E LHGIA  GP  NF+ +    S  TSFPQ ++  A+F+  L + +   ++ EARA  N 
Sbjct: 83  EGLHGIA--GPYTNFSHSGVEWSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNA 140

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
            + GL FW PNIN FRDPRWGRGQETPGED   +S+Y    +   QGE   +D       
Sbjct: 141 NRTGLDFWTPNINPFRDPRWGRGQETPGEDAYHLSSYVQALIHGLQGE--ATDP------ 192

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
              KRV+                A CKH   YD+E W    RY  +  IT+QD  + +  
Sbjct: 193 --YKRVV----------------ATCKHFAGYDVEDWNGNLRYQNDVQITQQDLVEYYLA 234

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATI 333
           PF++C+ Q      MCSYN VNG P C    L Q   R  WG+   + ++T DCDAV  +
Sbjct: 235 PFQACV-QANVGAFMCSYNAVNGAPPCADPYLLQTILREHWGWNKEEQWVTGDCDAVQNV 293

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
           +    ++ T   +AA  L AG DI CGT M  H  +A  +  + E  +D AL+  +S  +
Sbjct: 294 YFPHQWSSTRAGAAADSLVAGTDITCGTYMQEHLPAAFRQKLLNESSLDLALIRQYSSLV 353

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           RLG F+  P    Y +LG D V T+  + LA  AA +GIVLLKND   LPL+ ++  ++ 
Sbjct: 354 RLGYFDA-PENQPYRQLGFDAVATNASQALARRAAAEGIVLLKNDGT-LPLSLDSSMTVG 411

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASG----CHDVPCNSDAGFHE 509
           + G   N  +Q+ G Y G+     S L  L+    K +YA G      D   N  +  + 
Sbjct: 412 LFGDWANATTQLLGNYAGVATYLHSPLYALKQTGVKINYAGGKPGGQGDPTTNRWSNLYG 471

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
           A      +D +I V G+D   E E  DR  L   G Q+ ++  +A T K PVI+V+TGGG
Sbjct: 472 AY---STSDVLIYVGGIDNGVEEEGHDRGYLTWTGPQLDVIGQLAETGK-PVIVVVTGGG 527

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMN 628
            +D S    +  IS+I+W GYPG+ G  A+ +II G   P GRLP T YP S+   V M 
Sbjct: 528 QIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDIISGKTAPAGRLPQTQYPASYAAAVSMM 587

Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
           +MN+R   +   PGR+Y++Y G+ V+ FG+G+ YTN+S    +   +    +SL +G + 
Sbjct: 588 NMNLRPGENN--PGRTYKWYNGSAVFEFGYGMHYTNFSAAISTQMQQSYAISSLASGCNS 645

Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
                TG  L+                V + V N G V   +V + +          P K
Sbjct: 646 -----TGGFLERCPFAS----------VDVQVHNTGKVTSDYVTLGYMAGTFGPAPHPRK 690

Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
            L+ + R+H +A G+   +        ++  +++G ++L  G++ L +
Sbjct: 691 TLVSYKRLHNIAGGATSTAKLNLTLASVARVDEYGNKVLYPGHYSLQI 738


>gi|449303062|gb|EMC99070.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 786

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 274/772 (35%), Positives = 400/772 (51%), Gaps = 59/772 (7%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C      + P CN SLS   RA +L+ L TL+E      + A  +PRLG+PAYE W+
Sbjct: 46  FPSCVNSTLSTTPVCNRSLSAWDRAHALVQLFTLEELANNTGNTAPGVPRLGLPAYEVWN 105

Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           E+LHGI+      N  GT S  TSFP  ++S AS NR+L + IG  ++ + RA  N G+ 
Sbjct: 106 EALHGISHGHFATN--GTWSWATSFPSPILSMASMNRTLINQIGDIISTQGRAFSNAGRY 163

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSA-YAVEFVKSFQGENWKSDDGGIGFGFR 219
           GL  +APNIN FR P WGRGQETPGED   +S+ YA E++   QG              +
Sbjct: 164 GLDSYAPNINGFRSPVWGRGQETPGEDAFFLSSLYAYEYITGMQGG-------------K 210

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
              V K             L A  KH   YD+E W N SR   +  IT+QD    + P F
Sbjct: 211 APAVPK-------------LVAVPKHFAGYDIENWNNNSRLGLDVNITQQDLAGYYTPQF 257

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF-KGYITSDCDAVATIFEYQ 337
           RS I+  KA  +MCSYN VNGVP+C      Q  AR+ WGF  G+++SDCDAV  ++   
Sbjct: 258 RSAIQNAKALGLMCSYNAVNGVPSCSNSFFLQTLARDTWGFGNGFVSSDCDAVYNVYNPH 317

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
            Y      + A  L+AG DI+CGT    +   A + G V   DI+ AL   +S  +  G 
Sbjct: 318 GYAANTTGAVADSLRAGTDIDCGTSYPFYLVPAFNAGLVSRNDIELALTRYYSGLVMQGY 377

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           F+G+     Y  LG +DV T++   ++ +AA +GI LLKND   LPL+K +  S+A+IGP
Sbjct: 378 FDGN--SSLYRNLGWNDVLTTDAWNISYEAAVEGITLLKNDGT-LPLSK-STRSVALIGP 433

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
             N   Q+ G Y        S L+   A     ++ +G   +   + +GF EA+ +A+++
Sbjct: 434 WANATLQLQGNYYAAAPYLISPLQAFRASGMTVNFVNGT-TISSTNTSGFAEAITLAQQS 492

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D +I   G+D + E E  DR ++  PG Q+ L+  +++  K P++++  GGG +D S  +
Sbjct: 493 DVIIYAGGIDNSIEAEGLDRQNITWPGNQLDLIYQLSQVGK-PLVVLQMGGGQVDSSALK 551

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
            +S++++++W GYPG++G +AL +II G+  P GRL  T YP S+     N +NM     
Sbjct: 552 NNSKVNALVWGGYPGQSGGQALFDIIMGNRAPAGRLVTTQYPASY-ATSFNQLNMNMAPV 610

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
               G++Y +YTGT VY FGHGL YTN++      P       S+ A             
Sbjct: 611 NGSLGQTYMWYTGTPVYPFGHGLFYTNFTTTSTMGPVTTYNLTSIFAAPHPG-------- 662

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
             Y  ++EV       F V  +   A D  G    MLFA         P K L+G DR  
Sbjct: 663 --YEFVEEV-PIMDFNFIVNNTGRTASDWSG----MLFASTTSGPTPRPIKWLVGIDREA 715

Query: 758 TVAKGS-KEISFGVDPCEQLSIANKHGRRILPLGNHVLMV---GELRHSLTI 805
            +  G    ++  V P   L+ A+ +G  ++  G++ LM+     +R++ T+
Sbjct: 716 IIVPGGLASVTIKV-PVGALARADANGNLVVYPGSYSLMLNNEASIRYNFTL 766


>gi|292495285|sp|B6EY09.1|XYND_ASPJA RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|211970990|dbj|BAG82824.1| 1,4-beta-D-xylosidase [Aspergillus japonicus]
          Length = 804

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 273/771 (35%), Positives = 401/771 (52%), Gaps = 81/771 (10%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+++ S   RA +L+SL TL+E I    + +  +PRLG+P Y+ WSE+LHG+A      N
Sbjct: 60  CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLAR----AN 115

Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F  NG  S  TSFP  ++SAA+FNR+L + I S ++ + RA  N G+ GL  ++PNIN F
Sbjct: 116 FTDNGAYSWATSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPNINTF 175

Query: 173 RDPRWGRGQETPGEDPMVVSA-YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R P WGRGQETPGED   ++A YA E++   QG                     G   E 
Sbjct: 176 RHPVWGRGQETPGEDAYTLTAAYAYEYITGIQG---------------------GVNPE- 213

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                L L+A  KH   YD+E W N SR   +  IT+QD  + + P F            
Sbjct: 214 ----HLKLAATAKHFAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSF 269

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVP+C      Q   R+ + F   GY++ DC AV  +F    Y      +AA
Sbjct: 270 MCSYNAVNGVPSCSNTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAANEPSAAA 329

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-Y 407
             + AG DI+CGT    H   +I  G V   DI+R  + L++  + LG F+G+      Y
Sbjct: 330 DAILAGTDIDCGTSYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPY 389

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL---NKNAVSSLAIIGPLVNNISQ 464
             LG  DV  ++   ++ +AA +GIVLLKND   LPL   ++    S+A+IGP  N  +Q
Sbjct: 390 RSLGWPDVQKTDAWNISYEAAVEGIVLLKNDGT-LPLASPSEGKNKSIALIGPWANATTQ 448

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
           + G Y G      S +    A     HYA G  ++  NS A F  A+  A+ AD ++ + 
Sbjct: 449 LQGNYYGDAPYLISPVDAFTAAGYTVHYAPGT-EISTNSTANFSAALSAARAADTIVFLG 507

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVA--RTSKRPVILVLTGGGPLDVSFAEADSQI 582
           G+D T E E +DR S+  PG Q+ L++ +A  ++  +P+++   GGG +D S  ++++++
Sbjct: 508 GIDNTIEAEAQDRSSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSALKSNAKV 567

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP----ESFTKVPMNDMNMRADSSR 638
           +++LW GYPG++G  AL +I+ G   P GRL  T YP    ESF+ +   DMN+R + + 
Sbjct: 568 NALLWGGYPGQSGGLALRDILTGARAPAGRLTTTQYPAAYAESFSAL---DMNLRPNETT 624

Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK---NILQQTG 695
           Q PG++Y +YTG  VY FGHGL YT ++            ++S +A   K   NI   T 
Sbjct: 625 QNPGQTYMWYTGEPVYAFGHGLFYTTFN------------ASSAQAAKTKYTFNITDLTS 672

Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFD 754
           +     H D  T      F+   S+TN+G  D  +  +++A          P K L+GFD
Sbjct: 673 A----AHPDTTTVGQRTLFNFTASITNSGQRDSDYTALVYANTSTAGPSPYPNKWLVGFD 728

Query: 755 RVHTVAK--GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
           R+  VAK  G+ E++        + +A     R+   GN VL  G    +L
Sbjct: 729 RLAAVAKEGGTAELN--------VPVAVDRLARVDEAGNTVLFPGRYEVAL 771


>gi|156062754|ref|XP_001597299.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980]
 gi|154696829|gb|EDN96567.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 758

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 267/748 (35%), Positives = 386/748 (51%), Gaps = 58/748 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+T+    TRA +L+SL TL EKI    + +  +PR+G+PAY+WW+E+LHGIA       
Sbjct: 34  CDTTADPYTRATALVSLFTLAEKINNTGNTSPGVPRIGLPAYQWWNEALHGIAYGTHFAA 93

Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
                S  TSFPQ ++  A+F+ +L  ++ S ++ EARA  N  + GL FW PNIN ++D
Sbjct: 94  AGSNYSYATSFPQPILMGAAFDDALIHDVASQISTEARAFSNANRYGLNFWTPNINPYKD 153

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRGQETPGEDP  VS+Y    V   QG            G  +    KG        
Sbjct: 154 PRWGRGQETPGEDPFHVSSYVNALVTGLQG------------GLDDLPYKKGV------- 194

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
                 A CKH   YDLE  G   RY+F+A+I  QD  D + P F+ C        IMCS
Sbjct: 195 ------ATCKHYAGYDLENGGGIQRYAFDAIINSQDLRDYYLPSFQQCARDSNVQSIMCS 248

Query: 295 YNQVNGVPACLRGDLFQK-ARNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
           YN VNGVP C    L Q   R  WG+     ++TSDCDAV  I++  NYT T E +AA  
Sbjct: 249 YNAVNGVPTCADDWLLQSLLREHWGWVEEDQWVTSDCDAVQNIWDSHNYTSTPEQAAADA 308

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGK 409
           L AG D++CG     +  SA ++       +DR+L   ++  +RLG F  DP   + Y +
Sbjct: 309 LNAGTDLDCGGFWPTYLGSAYNQSLYNISTLDRSLTRRYASLVRLGYF--DPASIQPYRQ 366

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DV T   ++LAL AA  GIVLLKND   LPL  N ++++A+IGP  N  +QM G Y
Sbjct: 367 LGWSDVSTPSAEQLALQAAEDGIVLLKND-GILPLPSN-ITNVALIGPWANATTQMQGNY 424

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
            G      S L   +       Y  G  D+   +   F  A+  AKKAD +I + G+D +
Sbjct: 425 YGQAPYLHSPLIAAQNAGFHVTYVQGA-DIDSTNTTEFTAAIAAAKKADVIIYIGGIDNS 483

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
            E E +DR ++  P  Q+SLV  +A  S   +I  +  G  +D S    +  ++ I+W G
Sbjct: 484 IEAEAKDRKTIAWPSSQISLVNQLANLSIPLIISQM--GTMIDSSSLLTNRGVNGIIWAG 541

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFY 648
           YPG+ G  A+  I+ G   P GRLP+T YP  +  +V MN+MN+   ++   PGR+Y+++
Sbjct: 542 YPGQDGGTAIFNILTGKTAPAGRLPITQYPSDYVNEVSMNNMNLHPGANN--PGRTYKWF 599

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
            GT ++ FG GL YT ++ K ++ PS  T   S       ++   T +  D         
Sbjct: 600 NGTSIFDFGFGLHYTTFNAK-ITPPSSNTFEIS-------HLTSNTSTHKDLTPF----- 646

Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
                  + IS++N G     +V +LF          P+K L+ + R+H +  G+   + 
Sbjct: 647 -----LTLPISISNTGTTTSDYVALLFLTGSFGPTPYPKKSLVAYTRLHDIKGGASSTAQ 701

Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMV 796
                  L+  N+ G  +L  G++ ++V
Sbjct: 702 LKLNLASLARGNEKGDLVLYPGDYKVVV 729


>gi|2791278|emb|CAA93248.1| beta-xylosidase [Trichoderma reesei]
 gi|340519464|gb|EGR49702.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
          Length = 797

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 268/752 (35%), Positives = 395/752 (52%), Gaps = 63/752 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C++S     RA++LISL TL+E I    ++   +PRLG+P Y+ W+E+LHG+       N
Sbjct: 63  CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDR----AN 118

Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F    G     TSFP  +++ A+ NR+L   I   ++ +ARA  N G+ GL  +APN+N 
Sbjct: 119 FATKGGQFEWATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNVNG 178

Query: 172 FRDPRWGRGQETPGEDPMVVS-AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           FR P WGRGQETPGED   +S AY  E++   QG                     G   E
Sbjct: 179 FRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG---------------------GVDPE 217

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                 L ++A  KH   YDLE W N SR  F+A+IT+QD  + + P F +     K+  
Sbjct: 218 -----HLKVAATVKHFAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRS 272

Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHEDSA 347
           +MC+YN VNGVP+C      Q   R  WGF   GY++SDCDAV  +F   +Y      +A
Sbjct: 273 LMCAYNSVNGVPSCANSFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSSAA 332

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  L+AG DI+CG     H   +   G+V   +I+R++  L++  +RLG F+   +K +Y
Sbjct: 333 ASSLRAGTDIDCGQTYPWHLNESFVAGEVSRGEIERSVTRLYANLVRLGYFD---KKNQY 389

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             LG  DV  ++   ++ +AA +GIVLLKND   LPL+K  V S+A+IGP  N  +QM G
Sbjct: 390 RSLGWKDVVKTDAWNISYEAAVEGIVLLKNDGT-LPLSKK-VRSIALIGPWANATTQMQG 447

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G      S L   +      ++  G  ++  NS  GF +A+  AKK+D +I + G+D
Sbjct: 448 NYYGPAPYLISPLEAAKKAGYHVNFELGT-EIAGNSTTGFAKAIAAAKKSDAIIYLGGID 506

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T E E  DR  +  PG Q+ L+  ++   K P++++  GGG +D S  +++ +++S++W
Sbjct: 507 NTIEQEGADRTDIAWPGNQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVW 565

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
            GYPG++G  AL +I+ G   P GRL  T YP  +  + P NDMN+R D  +  PG++Y 
Sbjct: 566 GGYPGQSGGVALFDILSGKRAPAGRLVTTQYPAEYVHQFPQNDMNLRPD-GKSNPGQTYI 624

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +YTG  VY FG GL YT +     S P  L  +              T S L   H    
Sbjct: 625 WYTGKPVYEFGSGLFYTTFKETLASHPKSLKFN--------------TSSILSAPHPGYT 670

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SK 764
            S     F  + ++ N+G  +  +  MLF R         P K L+GFDR+  +  G S 
Sbjct: 671 YSEQIPVFTFEANIKNSGKTESPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSS 730

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           ++S  + P   L+  + HG RI+  G + L +
Sbjct: 731 KLSIPI-PVSALARVDSHGNRIVYPGKYELAL 761


>gi|340519849|gb|EGR50086.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
          Length = 796

 Score =  426 bits (1094), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 275/801 (34%), Positives = 425/801 (53%), Gaps = 81/801 (10%)

Query: 25  VNVIAFSNSKPVLN-KPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLS 82
           +N+ + SN  P     P FP C      S   C+T+ SI+ RA +++  +TL EK+  + 
Sbjct: 43  INIYSISNPSPNPPLTPSFPDCTRDPLCSNDVCDTTKSIAERAAAIVKPMTLNEKVANVG 102

Query: 83  DNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN----GTVSSVTSFPQVLVSAASFNRS 138
            +AS   RLG+PAY+W +E+LHG+A +  GV F        S+ TSFP  ++ +A+F+ +
Sbjct: 103 SSASGSARLGLPAYQWQNEALHGVAGS-TGVQFQSPLGANFSAATSFPMPILLSAAFDDA 161

Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
           L  ++ +A++ EARA  N G AGL FW PNIN FRDPRWGRG ETPGED   +  Y +  
Sbjct: 162 LVKSVATAISTEARAFANYGFAGLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLAL 221

Query: 199 VKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFS 258
           V   QG        GI   F   R L                + CKH  AYD+E      
Sbjct: 222 VDGLQG--------GIDPDFY--RTL----------------STCKHFAAYDIEN----G 251

Query: 259 RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEW 317
           R + N   T+QD  D + P F +C+   K + IMC+YN V+GVPAC    L Q   R+ +
Sbjct: 252 RTANNLSPTQQDMADYYLPMFETCVRDAKVASIMCAYNAVDGVPACADSYLLQDVLRDTY 311

Query: 318 GFK---GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKG 374
           GF     Y+ SDCDAV  +F+  +Y      +AA  + AG D++CG+       +++  G
Sbjct: 312 GFTEDFNYVVSDCDAVENVFDPHHYAANLTQAAAMSINAGTDLDCGSSY-NVLNASVQAG 370

Query: 375 KVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVL 434
              E  +D++L+ L+S  +++G F+   +  +Y  LG  +V T++ + LA DAA +G+ L
Sbjct: 371 LTTEATLDKSLIRLYSALVKVGYFD---QPAEYNSLGWGNVNTTQSQALAHDAATEGMTL 427

Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
           LKND   LPL++  +S++A+IGP  N  +QM G Y G      + L   +       YA 
Sbjct: 428 LKNDGT-LPLSR-TLSNVAVIGPWANVTTQMQGNYAGTAPLLVNPLSVFQQKWRNVKYAQ 485

Query: 495 GCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVA 554
           G   +     +GF+ A+  A  +D ++ + G+D++ E E  DR S+  PG Q++L++ +A
Sbjct: 486 GTA-INSQDTSGFNAALSAASSSDVIVYLGGIDISVENEGFDRSSITWPGNQLNLISQLA 544

Query: 555 RTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLP 614
              K P+++V  GGG +D S   ++S+++SILW GYPG+ G  A+ +++ G   P GRLP
Sbjct: 545 NLGK-PLVIVQFGGGQIDDSALLSNSKVNSILWAGYPGQDGGNAIFDVLTGANPPAGRLP 603

Query: 615 MTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
           +T YP ++     + DMN+R   S   PGR+Y +YTGT V  FG+GL YTN+S  F S  
Sbjct: 604 VTQYPANYVNNNNIQDMNLR--PSNGIPGRTYAWYTGTPVLPFGYGLHYTNFSLSFQSTK 661

Query: 674 SELTISASLKAGSD-KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG---DVDGS 729
           +         AGSD   ++   GS  D      +           ++V N G   ++   
Sbjct: 662 T---------AGSDIATLVNNAGSNKDLATFATIV----------VNVKNTGGKANLASD 702

Query: 730 HVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRIL 787
           +V +LF +         P KQL  + RV  V  G +++++  V+    L+ A+ +G R +
Sbjct: 703 YVGLLFLKSTNAGPAPHPNKQLAAYGRVRNVGVGATQQLTLTVN-LGSLARADTNGDRWI 761

Query: 788 PLGNHVLMV---GELRHSLTI 805
             G + L++   G L  + T+
Sbjct: 762 YPGAYTLILDVNGPLTFNFTL 782


>gi|242771939|ref|XP_002477942.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
 gi|218721561|gb|EED20979.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
          Length = 797

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 272/758 (35%), Positives = 413/758 (54%), Gaps = 74/758 (9%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI-----ASN 109
           CNTS++   RA+ LISL TL+E I    ++A  +PRLG+P Y+ WSE LHG+     A +
Sbjct: 64  CNTSVNYVERAEGLISLFTLEELINNTQNSAPGVPRLGLPPYQVWSEGLHGLDRANWAKS 123

Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNI 169
           G    +       TSFP  ++S A+ NR+L + I S +A +ARA  N+G+ GL  +APNI
Sbjct: 124 GEEWKW------ATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNI 177

Query: 170 NIFRDPRWGRGQETPGEDP-MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
           N FR P WGRGQETPGED   + S+YA E++   QG                     G  
Sbjct: 178 NGFRSPLWGRGQETPGEDAGFLSSSYAYEYITGLQG---------------------GVD 216

Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
            E      L + A  KH   YDLE W N SR  F+A IT+QD  + + P F +     KA
Sbjct: 217 PE-----HLKIVATAKHFAGYDLENWNNNSRLGFDASITQQDLAEYYTPQFLAASRYAKA 271

Query: 289 SCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHED 345
              MCSYN VNGVP+C    L Q   R  W F   GY++SDCDA   +F    Y      
Sbjct: 272 RSFMCSYNSVNGVPSCSSSFLLQTLLRENWDFPDYGYVSSDCDAAYNVFNPHGYAINISA 331

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           +AA  L+AG DI+CG     +   +  +G V   +I+R+L+ L+S  ++LG F+G+  + 
Sbjct: 332 AAADSLRAGTDIDCGQTYPWYLNQSFIEGSVTRGEIERSLIRLYSNLVKLGYFDGN--QS 389

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
           +Y +LG +DV  ++   ++ +AA +GIVLLKND   LPL++  + S+A+IGP  N   Q+
Sbjct: 390 EYRQLGWNDVVATDAWNISYEAAVEGIVLLKND-GVLPLSEK-LKSVAVIGPWANATQQL 447

Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
            G Y G      + L+       K +YA G  ++  N+  GF  A+  AKK+D +I + G
Sbjct: 448 QGNYFGPAPYLITPLQAARDAGYKVNYAFGT-NILGNTTDGFAAALSAAKKSDVIIYLGG 506

Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
           +D T E E  DR+++  PG Q+ L+  +++T K P++++  GGG +D S  ++++ ++++
Sbjct: 507 IDNTIEAEGTDRMNVTWPGNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSLKSNNNVNAL 565

Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRS 644
           +W GYPG++G KA+ +I+ G   P GRL  T YP  + T+ P  DMN+R D  +  PG++
Sbjct: 566 VWGGYPGQSGGKAIFDILSGKRAPAGRLVTTQYPAEYATQFPATDMNLRPD-GKSNPGQT 624

Query: 645 YRFYTGTQVYGFGHGLSYTNY---SYKFLSAPSELT-ISASLKAGSDKNILQQTGSRLDY 700
           Y +YTG  VY FG+ L YT +   + K  S+  +++ I AS ++ S             Y
Sbjct: 625 YIWYTGKPVYEFGYALFYTTFKETAEKLASSSFDISDIIASPRSSS-------------Y 671

Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTV 759
            + + V        +V  ++ N G     +  MLFA          P K L+G+DR+ ++
Sbjct: 672 AYSELVPFV-----NVTATIKNTGKTASPYTAMLFANTTNAGPTPYPNKWLVGYDRLPSI 726

Query: 760 AKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
             G S E+   V P   +S  +++G RI+  G++ L +
Sbjct: 727 EPGKSTELVIPV-PIGAISRVDENGNRIVYPGDYQLAL 763


>gi|388857998|emb|CCF48443.1| related to Beta-xylosidase [Ustilago hordei]
          Length = 782

 Score =  423 bits (1087), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 252/772 (32%), Positives = 401/772 (51%), Gaps = 62/772 (8%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
            C+ ++   TRA SL++  T +E +    + A  +PRLGIP Y+WW+E+LHG+A + PGV
Sbjct: 36  ICDPTIPFYTRATSLVNQFTTEELLNNTINYAPGVPRLGIPNYQWWTEALHGVAKS-PGV 94

Query: 114 NFN-----GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAP- 167
           NF+        +S T FPQ +   A+F+  L+  I S +A E RA  N G+AGL  ++P 
Sbjct: 95  NFDLSDPHAEFTSATQFPQTINLGATFDDDLYQQIASVIASEVRAYNNAGKAGLNLYSPL 154

Query: 168 NINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGF 227
           NIN FRDPRWGRGQET GEDP+ +S +AV  V   QG +                     
Sbjct: 155 NINCFRDPRWGRGQETVGEDPLHMSRFAVSIVHGLQGPH--------------------- 193

Query: 228 GEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGK 287
            +    G++L ++A CKH +AYDLE++    RY F+A++++QD  D   P FR+C+  G 
Sbjct: 194 AQNEAEGNKLTVAATCKHFLAYDLEQYDRGERYQFDAIVSKQDLSDFHLPQFRACVRDGG 253

Query: 288 ASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF---KGYITSDCDAVATIFEYQNYTKTH 343
           A+ +M SYN VN VP        Q  AR  WG      Y+TSDCDAVA +++   Y + +
Sbjct: 254 ATTLMTSYNAVNNVPPSASKYYLQTLARQAWGLDKTHNYVTSDCDAVANVYDGHRYAQNY 313

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
            ++AA  + AG D++CG     +  +A+ +       I RA++ +++  +RLG F+ DP 
Sbjct: 314 VEAAAKSINAGTDLDCGATYSENLGAALKQKLTDIATIRRAVIRMYASLVRLGYFD-DPA 372

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
                +L   DV +   ++LA  +A   I LLKN    LP+ K   + +AIIGP  N  +
Sbjct: 373 SQPLRQLTWKDVNSPSSQRLAYTSALSSITLLKNLDSTLPI-KQKPTKIAIIGPYTNVST 431

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
              G Y G      +++         +K  + +G         +   +AV++   AD V+
Sbjct: 432 SFSGNYAGPAAFNMTMVHAASQVFPDAKIVWVNGTDISGPYIPSDAQDAVKLTSDADSVV 491

Query: 522 VVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKR----PVILVLTGGGPLDVSFAE 577
              G+D + E E  DR  +  P  Q+ L+  ++++ K+     +++V  GGG LD +  +
Sbjct: 492 FAGGIDASIERESHDRKDIAWPPNQLRLIHELSQSRKKDKKSKLVVVQFGGGQLDGASLK 551

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADS 636
           +D  + +++W GYPG++ + A+ +I+ G   P GRLP+T YP S+   +P + M++R  +
Sbjct: 552 SDDAVGALVWAGYPGQSASLAVWDILAGKAVPAGRLPVTQYPASYIDGLPESAMSLRPKA 611

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
              YPGR+Y++Y G   Y FGHGL YT +S   L+ P    I  +  A   +        
Sbjct: 612 G--YPGRTYKWYKGVPTYPFGHGLHYTTFSAS-LAKPQPYAIPTTPAAKGPEG------- 661

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
               VH + ++        VQ ++ N G V   +  +LFAR        P K L+G+ +V
Sbjct: 662 ----VHAEHISVA-----DVQANIKNTGKVASDYTALLFARHSNGPAPYPRKTLVGYTKV 712

Query: 757 HTVAKGSKEISFGVDPCE-QLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
             ++ G +E S  +   +  L+ A++ G + L  G++ L +    H L   T
Sbjct: 713 KNLSAG-EESSVTIKITQAALARADEEGNQFLYPGSYQLELDTEEHRLASTT 763


>gi|367046937|ref|XP_003653848.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
 gi|347001111|gb|AEO67512.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
          Length = 923

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 289/798 (36%), Positives = 410/798 (51%), Gaps = 81/798 (10%)

Query: 25  VNVIAFSNSKPV---LNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQL 81
           +N+   SN  P    LNK    C+     S P CNTSL I+ R + L+  LTLQEKI  L
Sbjct: 132 MNLYNISNPAPTITPLNKFTPVCQTSPLCSSPACNTSLPIADRVRWLVGQLTLQEKITNL 191

Query: 82  SDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF---NGTV-SSVTSFPQVLVSAASFNR 137
            D AS   R+G+P YEWWSE+LHG+A++ PGV F   NGT  S  TSFP  +  +A+F+ 
Sbjct: 192 VDGASGSARVGLPPYEWWSEALHGVAAS-PGVTFAGPNGTAFSYATSFPMPITISAAFDD 250

Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
            L S I + V  E RA  N G +G  FW PNIN FRDPRWGRG ETPGED   +  Y   
Sbjct: 251 DLVSQIAAVVGREGRAFANHGLSGFDFWTPNINPFRDPRWGRGPETPGEDAFRIQQYIRH 310

Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNF 257
            +   QG                          SD  D+ ++ A CKH   YD+E     
Sbjct: 311 LIPGLQG--------------------------SDPLDKQII-ATCKHYAVYDVET---- 339

Query: 258 SRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNE 316
            RY ++      D  + +  PF++C+       +MCSYN V+G+PAC    L Q   R+ 
Sbjct: 340 GRYEYDYDPQPHDLAEYYLAPFKTCVRDVGIGSVMCSYNAVDGIPACASEYLLQSVLRDH 399

Query: 317 WGFK---GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
           WGF     Y+ SDCDAV  I+   N+T +   +AA  L AG D+ CG+  L   QS +  
Sbjct: 400 WGFTEPYQYVVSDCDAVRFIYSPHNFTDSPAAAAAVALNAGTDLECGSTYLNLNQS-LAS 458

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
               E  +DRAL  L++    +G F+G  R   YG LG D V T + + LA  AA  G V
Sbjct: 459 NMTTEAALDRALTRLYTALHTIGFFDGSAR---YGGLGWDAVGTGDAQVLAYQAAVDGAV 515

Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS----- 488
           LLKN+K  LPL+   +  LA+IGP  N  +QM G Y G       L+  L A+ S     
Sbjct: 516 LLKNEKSLLPLDSKRLRKLAVIGPWANATTQMQGNYFG---QAAYLVSPLAAFQSAWGAD 572

Query: 489 KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMS 548
              +A+G   +  NS AGF  A+  AK AD V+ + G+D + E+E  DR ++  PG Q+ 
Sbjct: 573 NVLFANGT-GIAGNSTAGFAAALAAAKAADAVVFLGGVDNSVESESLDRTAISWPGNQLD 631

Query: 549 LVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFN 608
           L+  +A   K P+++V  GGG LD S   A+ ++ ++LW GYPG+AG  A+A+++ G   
Sbjct: 632 LIAQLAAVGK-PLVVVQCGGGQLDDSALLANPRVGALLWAGYPGQAGGAAIADLLTGKQA 690

Query: 609 PGGRLPMTWYPESFT-KVPMNDMNMR------ADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
           P GRLP+T Y  S+T +V + D ++R      + S   +PGR+Y++YTG  V  FG+GL 
Sbjct: 691 PAGRLPVTQYAASYTSEVSLFDPSLRPRRSGGSKSHSTFPGRTYKWYTGKPVLPFGYGLH 750

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT +   +   P       +    ++        S  D   +           +V ++VT
Sbjct: 751 YTTFRTAWADEPRGRAYDIAGLFPANTTTTSSAFSAADTYPV----------LNVSVTVT 800

Query: 722 NAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLS 777
           N G     +V +LF R         P K L+G+ R   +A GS    E++  +     L+
Sbjct: 801 NTGRGASDYVGLLFLRTRNAGPAPYPNKWLVGYARARGLAPGSSARLELAVALG---SLA 857

Query: 778 IANKHGRRILPLGNHVLM 795
            A++ GRR++  G++ L+
Sbjct: 858 RADEDGRRVVYPGDYELL 875


>gi|429850127|gb|ELA25427.1| glycoside hydrolase family 3 protein [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 918

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 270/791 (34%), Positives = 415/791 (52%), Gaps = 61/791 (7%)

Query: 18  VLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEK 77
           ++++  ++N I   + K   N PD  C      S   C+ SLS   RA +L++ LT+ EK
Sbjct: 137 MIEVYEIINPII--DEKAPGNFPD--CTRDPLCSNDVCDESLSDKQRAAALVAELTIWEK 192

Query: 78  IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASF 135
           +  L + A  IPRL +P YEWWSE LHG+A + PG  F   G  S  TSFPQ ++  ++F
Sbjct: 193 LDNLVNEAPGIPRLRVPPYEWWSEGLHGVARS-PGTKFTSKGNFSYATSFPQPILLGSAF 251

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
           +  L   +G  V+ EARA  N G++GL  ++PNIN F+DPRWGRGQETPGED   +  Y 
Sbjct: 252 DDELVRAVGEVVSREARAFSNAGRSGLDLYSPNINAFKDPRWGRGQETPGEDTFHLQKYV 311

Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
              +   +G++             +K+++                A CKH  A D E + 
Sbjct: 312 SAMLSGLEGDD------------PDKKLI----------------ATCKHYAANDFENYK 343

Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
              R  FNA+I+ QD  + + PPF++C  +      MCSYN +NG P C    L +   R
Sbjct: 344 GVDRSGFNAVISTQDLSEYYLPPFKTCAVEKNVGSFMCSYNGINGTPLCANSYLIEDILR 403

Query: 315 NEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHT-QSA 370
             WG+ G   Y+++DCD VA +  Y +Y      +AA  ++AG D+ C         QSA
Sbjct: 404 KHWGWNGDGQYVSTDCDCVALMVSYHHYAPDLGHAAAWSMQAGTDLECNAFPGSEALQSA 463

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
            ++  + EKD+D+AL  +++  + +GLF+ D RK     LG D+V T E + LA  AA +
Sbjct: 464 WNQSLISEKDVDKALTRMYTSLVSVGLFDLD-RKDPLRSLGWDEVNTKEAQDLAYRAAVE 522

Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP---CSPKSLLRGLEAYV 487
           G VL+KND   LPL+ ++    A+IGP V+  +QM G Y G      SP+   + L   +
Sbjct: 523 GAVLMKND-GILPLSPDSSKKYALIGPWVSATTQMQGNYFGPAPYLISPRKAAKDLG--L 579

Query: 488 SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQM 547
             T++     +    SD+ F +A++ A+ AD VI + G+D T E E  DR +L  P  Q+
Sbjct: 580 DFTYFLGSRTN---KSDSSFAQAIKAAQAADVVIFMGGVDNTLEQETLDRNTLAWPEPQL 636

Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
            L+ +++   K P++++  GGG +D +   A+  +++ILW GYPG++G KA+ +I+FG  
Sbjct: 637 QLLRALSEVGK-PLVVLQFGGGQVDDTELLANDSVNAILWGGYPGQSGGKAILDIVFGRA 695

Query: 608 NPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYS 666
            P GRL +T YP S+   VP  DMN+R        GR+YR+YTG     +G GL YT +S
Sbjct: 696 APAGRLSVTQYPASYNDAVPATDMNLRPGPGNSGLGRTYRWYTGETPVPYGFGLHYTKFS 755

Query: 667 YKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGD 725
                A +   I  A + A ++ +   +  S    +    VT        V +S  N G+
Sbjct: 756 VDMKPASNVHNIDIAQMAAEANDDAASEIPSWQRGLERRMVT--------VTVSAKNEGN 807

Query: 726 VDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR 785
           V   +V ++F R     +  P+K L+G+ R+  +  G +     +   EQL   ++ G R
Sbjct: 808 VISDYVALVFLRSEAGPKPWPQKTLVGYTRLRNIKPGEERKEEIIIKMEQLVRVDEVGNR 867

Query: 786 ILPLGNHVLMV 796
           +L  G + L +
Sbjct: 868 VLYEGLYSLFL 878


>gi|358382857|gb|EHK20527.1| hypothetical protein TRIVIDRAFT_192759 [Trichoderma virens Gv29-8]
          Length = 860

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 275/803 (34%), Positives = 424/803 (52%), Gaps = 83/803 (10%)

Query: 24  VVNVIAFSNSKPVLN-KPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQL 81
           ++N+ + +N  P     P FP C      S   C+T+ SI+ RA +++  +TL EK+  +
Sbjct: 106 MINIYSITNPSPNPPLTPSFPDCTRDPLCSNDVCDTTKSIAARAAAIVKPMTLNEKVANV 165

Query: 82  SDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV----SSVTSFPQVLVSAASFNR 137
             +AS   RLG+PAY+W +E+LHG+A +  GV F   +    S+ TSFP  ++ +A+F+ 
Sbjct: 166 GSSASGSGRLGLPAYQWQNEALHGVAGS-TGVQFQSPLGANFSAATSFPMPILLSAAFDD 224

Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
           +L  ++ +A++ EARA  N G AGL FW PNIN FRDPRWGRG ETPGED   +  Y + 
Sbjct: 225 ALVQSVATAISTEARAFANYGFAGLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVLS 284

Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNF 257
            +   QG        GI   F                      + CKH  AYD+E     
Sbjct: 285 LINGLQG--------GIDPDF------------------FRTISTCKHFAAYDIEN---- 314

Query: 258 SRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNE 316
            R + N   T+QD  D + P F +C+   K   IMC+YN VNGVPAC    L Q   R+ 
Sbjct: 315 GRTANNLSPTQQDMADYYLPMFETCVRDAKVGSIMCAYNSVNGVPACADSYLLQSVLRDG 374

Query: 317 WGFK---GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
           +GF     Y+ SDCDAV  +++  +Y      +AA  L AG D++CG+       +++  
Sbjct: 375 YGFTEDFNYVVSDCDAVENVYDPHHYAANLTQAAAMSLNAGTDLDCGSSY-NVLNASVQA 433

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
           G   E  +D++L+ L+S  +++G F+   +  KY  LG  +V T++ + LA DAA  G+ 
Sbjct: 434 GMTTEATLDKSLIRLYSALIKVGWFD---QPAKYSSLGWGNVNTTQTRALAHDAATGGMT 490

Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYA 493
           LLKND   LPL+   + ++A+IGP VN  +Q+ G Y G      + L   +       YA
Sbjct: 491 LLKNDGT-LPLSP-TLQNVAVIGPWVNATTQLQGNYAGTAPVLVNPLTVFQQKWRNVKYA 548

Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
            G   +     +GF+ A+  A  +D ++ + G+D++ E E  DR ++  PG Q+SL++ +
Sbjct: 549 QGTA-INSQDTSGFNAAISAASSSDVIVYLGGIDISVENEGFDRTAITWPGNQLSLISQL 607

Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
           A   K P+++V  GGG +D S   ++S+++SILW GYPG+ G  AL +++ G   P GRL
Sbjct: 608 ANLGK-PLVIVQFGGGQIDDSSLLSNSKVNSILWAGYPGQEGGNALFDVLTGANPPAGRL 666

Query: 614 PMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
           P+T YP ++     + DMN+R   S   PGR+Y +YTGT V  FG+GL YTN+S  F S 
Sbjct: 667 PITQYPANYVNNNNIQDMNLRPSGS--IPGRTYAWYTGTPVLPFGYGLHYTNFSVSFQST 724

Query: 673 PSELTISASL--KAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG---DVD 727
            +  T  A++   AGS+K               D  T  T     + ++V N G   ++ 
Sbjct: 725 KTSGTDVATIVNNAGSNK---------------DRATFAT-----LVVNVKNTGGKANLA 764

Query: 728 GSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRR 785
             +V +LF +         P KQL  + RV  V  G +++++  V+    L+ A+ +G R
Sbjct: 765 SDYVGLLFLKSTNAGPAPHPNKQLAAYGRVKKVGVGATQQLTLTVN-LGSLARADTNGDR 823

Query: 786 ILPLGNHVLMV---GELRHSLTI 805
            +  G + L +   G L  + T+
Sbjct: 824 WVYPGAYTLTLDVNGPLTFNFTL 846


>gi|292495281|sp|C0STH4.1|XYND_ASPAC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|225878711|dbj|BAH30675.1| beta-xylosidase [Aspergillus aculeatus]
          Length = 805

 Score =  421 bits (1082), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 270/772 (34%), Positives = 399/772 (51%), Gaps = 82/772 (10%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+++ S   RA +L+SL TL+E I    + +  +PRLG+P Y+ WSE+LHG+       N
Sbjct: 60  CDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLGR----AN 115

Query: 115 F--NGTV-SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F  NG + +   SFP  ++SAA+FNR+L + I S ++ + RA  N G+ GL  ++PNIN 
Sbjct: 116 FTDNGALHAGRPSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYSPNINT 175

Query: 172 FRDPRWGRGQETPGEDPMVVSA-YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           FR P WGRGQETPGED   ++A YA E++   QG                     G   E
Sbjct: 176 FRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQG---------------------GVNPE 214

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                 L L+A  KH   YD+E W N SR   +  IT+QD  + + P F           
Sbjct: 215 -----HLKLAATAKHFAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHS 269

Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSA 347
            MCSYN VNGVP+C      Q   R+ + F   GY++ DC AV  +F    Y      +A
Sbjct: 270 FMCSYNAVNGVPSCSNTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAANEPSAA 329

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK- 406
           A  + AG DI+CGT    H   +I  G V   DI+R  + L++  + LG F+G+      
Sbjct: 330 ADAILAGTDIDCGTSYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNP 389

Query: 407 YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL---NKNAVSSLAIIGPLVNNIS 463
           Y  LG  DV  ++   ++ +AA +GIVLLKND   LPL   ++    S+A+IGP  N  +
Sbjct: 390 YRSLGWPDVQKTDAWNISYEAAVEGIVLLKNDGT-LPLASPSEGKNKSIALIGPWANATT 448

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
           Q+ G Y G      S +    A     HYA G  ++  NS A F  A+  A+ AD ++ +
Sbjct: 449 QLQGNYYGDAPYLISPVDAFTAAGYTVHYAPGT-EISTNSTANFSAALSAARAADTIVFL 507

Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVA--RTSKRPVILVLTGGGPLDVSFAEADSQ 581
            G+D T E E +DR S+  PG Q+ L++ +A  ++  +P+++   GGG +D S  + +++
Sbjct: 508 GGIDNTIEAEAQDRSSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSSLKFNAK 567

Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP----ESFTKVPMNDMNMRADSS 637
           ++++LW GYPG++G  AL +I+ G   P GRL  T YP    ESF+ +   DMN+R + +
Sbjct: 568 VNALLWGGYPGQSGGLALRDILTGARAPAGRLTTTQYPAAYAESFSAL---DMNLRPNET 624

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK---NILQQT 694
            Q PG++Y +YTG  VY FGHGL YT ++            ++S +A   K   NI   T
Sbjct: 625 TQNPGQTYMWYTGEPVYAFGHGLFYTTFN------------ASSAQAAKTKYTFNITDLT 672

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGF 753
            +     H D  T      F+   S+TN+G  D  +  +++A          P K L+GF
Sbjct: 673 SA----AHPDTTTVGQRTLFNFTASITNSGQRDSDYTALVYANTSTAGPSPYPNKWLVGF 728

Query: 754 DRVHTVAK--GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
           DR+  VAK  G+ E++        + +A     R+   GN VL  G    +L
Sbjct: 729 DRLAAVAKEGGTAELN--------VPVAVDRLARVDEAGNTVLFPGRYEVAL 772


>gi|70996610|ref|XP_753060.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
 gi|74672055|sp|Q4WRB0.1|XYND_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|66850695|gb|EAL91022.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
          Length = 792

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 260/758 (34%), Positives = 397/758 (52%), Gaps = 68/758 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS     RA +L+S+ T +E +    + +  +PRLG+P Y+ WSE+LHG+       N
Sbjct: 63  CDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDR----AN 118

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F   G  S  TSFP  +++ ++ NR+L + I + +A + RA  N+G+ GL  +APNIN F
Sbjct: 119 FTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNINAF 178

Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R   WGRGQETPGED   + SAYA E++   QG                     G   E 
Sbjct: 179 RSAMWGRGQETPGEDAYCLASAYAYEYITGIQG---------------------GVDPE- 216

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                L L A  KH   YDLE W   SR   +  IT+Q+  + + P F       K   +
Sbjct: 217 ----HLKLVATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSV 272

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVP+C      Q   R+ +GF   GY++SDCD+   ++    +      +AA
Sbjct: 273 MCSYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAA 332

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             ++AG DI+CGT    +   A D+ +V   +I+R ++ L+S  +RLG F+G+     Y 
Sbjct: 333 DSIRAGTDIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGN--GSVYR 390

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            L  +DV T++   ++ +AA +GIVLLKND   LPL K +V S+A+IGP +N  +Q+ G 
Sbjct: 391 DLTWNDVVTTDAWNISYEAAVEGIVLLKNDGT-LPLAK-SVRSVALIGPWMNVTTQLQGN 448

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G      S L   +      +YA G  ++  +S  GF EA+  AKK+D +I   G+D 
Sbjct: 449 YFGPAPYLISPLNAFQNSDFDVNYAFGT-NISSHSTDGFSEALSAAKKSDVIIFAGGIDN 507

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           T E E  DR+++  PG Q+ L+  +++  K P+I++  GGG +D S  +++  ++S++W 
Sbjct: 508 TLEAEAMDRMNITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWG 566

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
           GYPG++G +AL +II G   P GRL +T YP  + T+ P  DM++R   +   PG++Y +
Sbjct: 567 GYPGQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNN--PGQTYMW 624

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA-GSDKNILQ----QTGSRLDYVH 702
           YTGT VY FGHGL YT             T  ASL   G DK         T     + +
Sbjct: 625 YTGTPVYEFGHGLFYT-------------TFHASLPGTGKDKTSFNIQDLLTQPHPGFAN 671

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           ++++        +  +++TN G V   +  MLFA         P K L+GFDR+ ++   
Sbjct: 672 VEQMP-----LLNFTVTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPH 726

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
             +        + ++  ++ G R+L  G + L +   R
Sbjct: 727 RSQTMTIPVTIDSVARTDEAGNRVLYPGKYELALNNER 764


>gi|76160898|gb|ABA40420.1| Xld [Aspergillus fumigatus]
          Length = 792

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 260/758 (34%), Positives = 397/758 (52%), Gaps = 68/758 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS     RA +L+S+ T +E +    + +  +PRLG+P Y+ WSE+LHG+       N
Sbjct: 63  CDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDR----AN 118

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F   G  S  TSFP  +++ ++ NR+L + I + +A + RA  N+G+ GL  +APNIN F
Sbjct: 119 FTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNINAF 178

Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R   WGRGQETPGED   + SAYA E++   QG                     G   E 
Sbjct: 179 RSAMWGRGQETPGEDAYCLASAYAYEYITGIQG---------------------GVDPE- 216

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                L L A  KH   YDLE W   SR   +  IT+Q+  + + P F       K   +
Sbjct: 217 ----HLKLVATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSV 272

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVP+C      Q   R+ +GF   GY++SDCD+   ++    +      +AA
Sbjct: 273 MCSYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAA 332

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             ++AG DI+CGT    +   A D+ +V   +I+R ++ L+S  +RLG F+G+     Y 
Sbjct: 333 DSIRAGTDIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGN--GSVYR 390

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            L  +DV T++   ++ +AA +GIVLLKND   LPL K +V S+A+IGP +N  +Q+ G 
Sbjct: 391 DLTWNDVVTTDAWNISYEAAVEGIVLLKNDGT-LPLAK-SVRSVALIGPWMNVTTQLQGN 448

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G      S L   +      +YA G  ++  +S  GF EA+  AKK+D +I   G+D 
Sbjct: 449 YFGPAPYLISPLNAFQNSDFDVNYAFGT-NISSHSTDGFSEALSAAKKSDVIIFAGGIDN 507

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           T E E  DR+++  PG Q+ L+  +++  K P+I++  GGG +D S  +++  ++S++W 
Sbjct: 508 TLEAEAMDRMNITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWG 566

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
           GYPG++G +AL +II G   P GRL +T YP  + T+ P  DM++R   +   PG++Y +
Sbjct: 567 GYPGQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNN--PGQTYMW 624

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA-GSDKNILQ----QTGSRLDYVH 702
           YTGT VY FGHGL YT             T  ASL   G DK         T     + +
Sbjct: 625 YTGTPVYEFGHGLFYT-------------TFHASLPGTGKDKTSFNIQDLLTQPHPGFAN 671

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           ++++        +  +++TN G V   +  MLFA         P K L+GFDR+ ++   
Sbjct: 672 VEQMP-----LLNFTVTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPH 726

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
             +        + ++  ++ G R+L  G + L +   R
Sbjct: 727 RSQTMTIPVTIDSVARTDEAGNRVLYPGKYELALNNER 764


>gi|292495282|sp|B0XP71.1|XYND_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|159131796|gb|EDP56909.1| beta-xylosidase XylA [Aspergillus fumigatus A1163]
          Length = 792

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 260/758 (34%), Positives = 397/758 (52%), Gaps = 68/758 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS     RA +L+S+ T +E +    + +  +PRLG+P Y+ WSE+LHG+       N
Sbjct: 63  CDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDR----AN 118

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F   G  S  TSFP  +++ ++ NR+L + I + +A + RA  N+G+ GL  +APNIN F
Sbjct: 119 FTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNINAF 178

Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R   WGRGQETPGED   + SAYA E++   QG                     G   E 
Sbjct: 179 RSAMWGRGQETPGEDAYCLASAYAYEYITGIQG---------------------GVDPE- 216

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                L L A  KH   YDLE W   SR   +  IT+Q+  + + P F       K   +
Sbjct: 217 ----HLKLVATAKHYAGYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSV 272

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVP+C      Q   R+ +GF   GY++SDCD+   ++    +      +AA
Sbjct: 273 MCSYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANITGAAA 332

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             ++AG DI+CGT    +   A D+ +V   +I+R ++ L+S  +RLG F+G+     Y 
Sbjct: 333 DSIRAGTDIDCGTTYQYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGN--GSVYR 390

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            L  +DV T++   ++ +AA +GIVLLKND   LPL K +V S+A+IGP +N  +Q+ G 
Sbjct: 391 DLTWNDVVTTDAWNISYEAAVEGIVLLKNDGT-LPLAK-SVRSVALIGPWMNVTTQLQGN 448

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G      S L   +      +YA G  ++  +S  GF EA+  AKK+D +I   G+D 
Sbjct: 449 YFGPAPYLISPLNAFQNSDFDVNYAFGT-NISSHSTDGFSEALSAAKKSDVIIFAGGIDN 507

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           T E E  DR+++  PG Q+ L+  +++  K P+I++  GGG +D S  +++  ++S++W 
Sbjct: 508 TLEAEAMDRMNITWPGNQLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWG 566

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
           GYPG++G +AL +II G   P GRL +T YP  + T+ P  DM++R   +   PG++Y +
Sbjct: 567 GYPGQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNN--PGQTYMW 624

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA-GSDKNILQ----QTGSRLDYVH 702
           YTGT VY FGHGL YT             T  ASL   G DK         T     + +
Sbjct: 625 YTGTPVYEFGHGLFYT-------------TFHASLPGTGKDKTSFNIQDLLTQPHPGFAN 671

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           ++++        +  +++TN G V   +  MLFA         P K L+GFDR+ ++   
Sbjct: 672 VEQMP-----LLNFTVTITNTGKVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPH 726

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
             +        + ++  ++ G R+L  G + L +   R
Sbjct: 727 RSQTMTIPVTIDSVARTDEAGNRVLYPGKYELALNNER 764


>gi|442803736|ref|YP_007371885.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
           DSM 8532]
 gi|442739586|gb|AGC67275.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
           DSM 8532]
          Length = 715

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 268/774 (34%), Positives = 397/774 (51%), Gaps = 119/774 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + S S   RAK L+S +T++EK+ Q+  N+ AI RLGIPAY WW+E+LHG+A  G   
Sbjct: 7   YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAG--- 63

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                  + T FPQ +  AA+F+  L   +   ++ E RA Y+            GLTFW
Sbjct: 64  -------TATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFW 116

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP + +   V FVK  QG + K                 
Sbjct: 117 SPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNHPKY---------------- 160

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQPPFRSCIE 284
                      L  +AC KH   +     G  S R+ FNA+++++D  +T+ P F++ ++
Sbjct: 161 -----------LKAAACAKHFAVHS----GPESLRHEFNAVVSKKDLYETYLPAFKALVQ 205

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
           + K   +M +YN+ NG P C    L     R EWGFKG++ SDC A+     + + T T 
Sbjct: 206 EAKVESVMGAYNRTNGEPCCGSKTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATA 265

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
            +SAA  ++ G D+NCG  M  +   A+ +G + E++IDRA+  L   +++LG+F  DP 
Sbjct: 266 PESAALAVRNGCDLNCGN-MFGNLLIALKEGLITEEEIDRAVTRLMITRMKLGMF--DPE 322

Query: 404 -KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
            +  Y  +  D V   EH++LALD A++ IVLLKND   LPL++  + S+A+IGP  ++ 
Sbjct: 323 DQVPYASISYDFVDCKEHRELALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSR 381

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH----DVPCNSDAG--FHEAVRI 513
             + G Y G      ++L G+        + +Y+ GCH     V    + G    EAV  
Sbjct: 382 QALIGNYEGTASEYVTVLDGIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTC 441

Query: 514 AKKADFVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           A+ AD VI+  GLD T E E+          D+  L LPGQQ  L+ +V  T K P++LV
Sbjct: 442 AEHADVVIMCLGLDSTIEGEEMHESNIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLV 500

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
           L  G  L V++  AD  I +IL   YPG  G +A+A ++FG+ NP G+LP+T+Y  +   
Sbjct: 501 LLTGSALAVTW--ADEHIPAILNAWYPGALGGRAIASVLFGETNPSGKLPVTFYRTTEEL 558

Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLK 683
               D +M          R+YRF     +Y FG GLSYT + Y      S+L +S  +++
Sbjct: 559 PDFTDYSME--------NRTYRFMKNEALYPFGFGLSYTTFDY------SDLKLSKDTIR 604

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
           AG                            F+V + VTN G + G  VV ++ +  + S 
Sbjct: 605 AGEG--------------------------FNVSVKVTNTGKMAGEEVVQVYIKDLEASW 638

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             P  QL G  RV   +  + EI+F + P EQL++    G+ ++  G   + VG
Sbjct: 639 RVPNWQLSGMKRVRLESGETAEITFEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691


>gi|292495634|sp|A1CND4.2|XYND_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
          Length = 792

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 257/751 (34%), Positives = 399/751 (53%), Gaps = 56/751 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+T  S   RA +LISL TL+E +    + +  +PRLG+P Y+ W+E+LHG+  +     
Sbjct: 63  CDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGL--DRAYFT 120

Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
             G  S  TSFP  +++ ++ NR+L + + S ++ + RA  N G+ GL  ++PNIN FR 
Sbjct: 121 DEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRYGLDVYSPNINSFRH 180

Query: 175 PRWGRGQETPGEDPMVVS-AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           P WGRGQETPGED   +S AYA E++   QG            G   K            
Sbjct: 181 PVWGRGQETPGEDAYCLSSAYAYEYITGIQG------------GVDPK------------ 216

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
              L L A  KH   YD+E W   SR   +  IT+QD  + + P F       K   +MC
Sbjct: 217 --SLKLVATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMC 274

Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
           SYN VNGVP+C      Q   R+ +GF   GYI+SDCD+   +F    Y      +AA  
Sbjct: 275 SYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSSAAADS 334

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
           ++AG DI+CGT    +   A+D+  +   DI+R ++ L+S  +RLG F+G+     Y  L
Sbjct: 335 IRAGTDIDCGTTYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDGN--SSAYRNL 392

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
             +DV T+    ++ +   +G VLLKND   LPL++ ++ S+A++GP +N  +Q+ G Y 
Sbjct: 393 TWNDVVTTNSWNISYEV--EGTVLLKNDGT-LPLSE-SIRSIALVGPWMNVSTQLQGNYF 448

Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
           G      S L          +YA G  ++  NS  GF +A+  AKK+D +I   G+D + 
Sbjct: 449 GPAPYLISPLDAFRDSHLDVNYAFGT-NISSNSTDGFSKALSAAKKSDAIIFAGGIDNSL 507

Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
           E E  DR+++  PG+Q+ L+  +++  K P+I++  GGG +D S  +++  ++S++W GY
Sbjct: 508 EAETLDRMNITWPGKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGY 566

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYT 649
           PG++G +AL +II G   P GRL +T YP  + T+ P  DM++R   +   PG++Y +YT
Sbjct: 567 PGQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNN--PGQTYMWYT 624

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
           GT VY FGHGL YT +    +S    + I  +      +++L Q      Y+H++++   
Sbjct: 625 GTPVYEFGHGLFYTTFR---VSHARAVKIKPTYNI---QDLLAQ--PHPGYIHVEQM--- 673

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
             L F V I  TN G     +  MLFA         P+K L+GFDR+ T+   + ++   
Sbjct: 674 PFLNFTVDI--TNTGKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTI 731

Query: 770 VDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
                 ++  ++ G R+L  G + L +   R
Sbjct: 732 PVTINSMARTDELGNRVLYPGKYELALNNER 762


>gi|121809149|sp|Q4AEG8.1|XYND_ASPAW RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|73486695|dbj|BAE19756.1| beta-xylosidase [Aspergillus awamori]
          Length = 804

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 273/781 (34%), Positives = 403/781 (51%), Gaps = 70/781 (8%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C+     S+  C+ + +   RA SLISL TL E I    +    + RLG+PAY+ WS
Sbjct: 55  FPDCQNGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 114

Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+       NF+  G  +  TSFPQ +++ A+ NR+L   I S ++ + RA  N G
Sbjct: 115 EALHGLDR----ANFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 170

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           + GL  +APNIN FR P WGRGQETPGED  + + YA E++   QG + +S+        
Sbjct: 171 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESN-------- 222

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                             L L+A  KH   YD+E W N SR   +  IT+QD  + + P 
Sbjct: 223 ------------------LKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 264

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
           F       K   +MC+YN VNGVPAC      Q   R+ +GF   GY++SDCDA   I+ 
Sbjct: 265 FHVAARDAKVQSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 324

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
              Y  +   +AA  + AG DI+CGT    H   +I  G +   DI++ ++ L++  ++ 
Sbjct: 325 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEQGVIRLYTTLVQA 384

Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
           G F+ +  K    Y  L   DV  ++   ++  AA QGIVLLKN    LPL + A     
Sbjct: 385 GYFDSNTTKANNPYRDLSWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 444

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
           +++A+IGP  N  +Q+ G Y G      S     E    K ++A G   +   S +GF  
Sbjct: 445 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAA 503

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
           A+  A+ AD +I   G+D T E E  DR S+  PG Q+ L+  +A  + K+P+I++  GG
Sbjct: 504 ALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGG 563

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
           G +D S  + ++++S++LW GYPG++G  AL +II G  NP GRL  T YP S+  + P 
Sbjct: 564 GQVDSSSLKNNTKVSALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 623

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
            DMN+R +     PG++Y++YTG  VY FGHGL YT +     +  S  T +  +K  + 
Sbjct: 624 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NI 675

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
           ++IL +T   L       +T    L F   I   N G ++  +  M+FA          P
Sbjct: 676 QDILSRTHEEL-----ASITQLPVLNFTANIR--NTGKLESDYTAMVFANTSDAGPAPYP 728

Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           +K L+G+DR+  V  G ++E+     P E  S A     R+   G+ VL  G    +L +
Sbjct: 729 KKWLVGWDRLGEVKVGETRELRV---PVEVGSFA-----RVNEDGDWVLFPGTFELALNL 780

Query: 806 E 806
           E
Sbjct: 781 E 781


>gi|67523807|ref|XP_659963.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
 gi|74597492|sp|Q5BAS1.1|XYND_EMENI RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|40745314|gb|EAA64470.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
 gi|259487761|tpe|CBF86686.1| TPA: Beta-xylosidase (EC 3.2.1.37)
           [Source:UniProtKB/TrEMBL;Acc:O42810] [Aspergillus
           nidulans FGSC A4]
          Length = 803

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 255/766 (33%), Positives = 390/766 (50%), Gaps = 58/766 (7%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C        P C+ SLS   RA +L+SL T  E +    +    + RLG+P Y+ W 
Sbjct: 47  FPDCASGPLSLTPVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWG 106

Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+       NF  +G  S  TSFP  +   A+ N++L   IG+ V+ + RA  N G
Sbjct: 107 EALHGVGR----ANFVESGNFSWATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAG 162

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
             G+  ++PNIN FR P WGRGQETPGED  + S Y  E++ + QG              
Sbjct: 163 LGGVDVYSPNINTFRHPVWGRGQETPGEDAFLTSVYGYEYITALQG-------------- 208

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                  G   E+     L + A  KH   YD+E W N SR   +  IT+Q+  + + PP
Sbjct: 209 -------GVDPET-----LKIIATAKHYAGYDIESWNNHSRLGNDMQITQQELSEYYTPP 256

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFE 335
           F       K   +MCSYN VNGVP+C      Q   R+ + F   GY++ DC AV  ++ 
Sbjct: 257 FIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQTLLRDTFEFSEDGYVSGDCGAVYNVWN 316

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
              Y      ++A  + AG DI+CGT    H++ A +   V   DI+R ++ L+S  ++ 
Sbjct: 317 PHGYASNEAAASADSILAGTDIDCGTSYQWHSEDAFEDSLVSRSDIERGVIRLYSNLVQA 376

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G F+G+     Y  +  DDV +++   +A +AA +GIVLLKND+  LPL+K+ + S+A+I
Sbjct: 377 GYFDGE--DAPYRDITWDDVLSTDAWNIAYEAAVEGIVLLKNDET-LPLSKD-IKSVAVI 432

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           GP  N   ++ G Y G      S L G        HYA G  ++  +S +GF EA+  AK
Sbjct: 433 GPWANVTEELQGNYFGPAPYLISPLTGFRDSGLDVHYALGT-NLTSHSTSGFEEALTAAK 491

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           +AD +I   G+D T E E  DR ++  PG Q+ L++ ++   K P++++  GGG +D S 
Sbjct: 492 QADAIIFAGGIDNTIEAEAMDRENITWPGNQLDLISKLSELGK-PLVVLQMGGGQVDSSS 550

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRA 634
            + +  +++++W GYPG++G  ALA+II G   P GRL  T YP  + +V P  DMN+R 
Sbjct: 551 LKDNDNVNALIWGGYPGQSGGHALADIITGKRAPAGRLVTTQYPAEYAEVFPAIDMNLRP 610

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           + +   PG++Y +YTGT VY FGHGL YT          +    + +  AGS       T
Sbjct: 611 NETSGNPGQTYMWYTGTPVYEFGHGLFYT----------TFEESTETTDAGSFNIQTVLT 660

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
                Y H  + T       +   +V N G+ +  +  +++          P+K ++GFD
Sbjct: 661 TPHSGYEHAQQKT-----LLNFTATVKNTGERESDYTALVYVNTTAGPAPYPKKWVVGFD 715

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
           R+  +  G  +        E ++  ++ G R+L  G++ L +   R
Sbjct: 716 RLGGLEPGDSQTLTVPVTVESVARTDEQGNRVLYPGSYELALNNER 761


>gi|164429277|ref|XP_958209.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
 gi|16945419|emb|CAB91343.2| related to xylan 1, 4-beta-xylosidase [Neurospora crassa]
 gi|157073010|gb|EAA28973.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
          Length = 774

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 261/774 (33%), Positives = 393/774 (50%), Gaps = 62/774 (8%)

Query: 35  PVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
           P    PD  C      S   C+ +LS   RA +L++ +T +EK+Q L   +   PR+G+P
Sbjct: 24  PFQTYPD--CVNGPLASLKVCDATLSPPQRAAALVAAMTTEEKLQNLVSKSKGAPRIGLP 81

Query: 95  AYEWWSESLHGIASNGPGVNF---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEA 151
           AY WWSE+LHG+A   PG  F   +G  +S TSFP  L+ AA+F+  L   +G  +  E 
Sbjct: 82  AYNWWSEALHGVAY-APGTQFRSGDGPFNSSTSFPMPLLMAATFDDELIEKVGEVIGTEG 140

Query: 152 RAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
           RA  N G +G  +W PN+N F+DPRWGRG ETPGED + +  YA   ++  QG       
Sbjct: 141 RAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLQGP------ 194

Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
                   E+RV+                A CKH  A D E W   +R+ F+A +T QD 
Sbjct: 195 ------LPERRVV----------------ATCKHYAANDFEDWNGSTRHDFDAKVTLQDL 232

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDC 327
            + +  PF+ C    K   IMCSYN VNGVPAC    L Q   R  W +     YITSDC
Sbjct: 233 AEYYLSPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILREHWNWTAPGNYITSDC 292

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLN 387
           +AV  IF   +Y KT+ +  A   +AG D +C          A  +G +++  +DRAL  
Sbjct: 293 EAVLDIFANHHYAKTNAEGTALAFEAGTDSSCEYESSSDIPGAWTQGLLEQSTVDRALTR 352

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF-LPLNK 446
           L+   +R+G F+G+    +Y  LG  DV + + +++AL  A +GIVLLKND+   L L  
Sbjct: 353 LYEGLVRVGYFDGN--HSEYASLGWKDVNSPKSQEVALQTAVEGIVLLKNDQTLPLGLKT 410

Query: 447 NAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASG-CHDVPCNSDA 505
           +  S LA+IG   N+   + GGY+G P    S +   EA       A G       ++D 
Sbjct: 411 DPKSKLAMIGFWANDPKTLSGGYSGKPAFEHSPVYAAEAMGFNVTTAGGPVLQNSTSNDT 470

Query: 506 GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
               A+  A+ A++++   GLD +   E +DR ++  P  Q+ L+ ++ +  K  V++ +
Sbjct: 471 WTQAALEAAQDANYILYFGGLDTSAAGETKDRTTINWPEAQLQLIKTLTKLGKPLVVVQM 530

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-K 624
             G  LD +   A   ++SILW  +PG+ G  A+ +I+ G  +P GRLP+T YP ++T  
Sbjct: 531 --GDQLDNTPLLATKTVNSILWANWPGQDGGTAVMQILTGLKSPAGRLPVTQYPANYTAA 588

Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
           VPM DMN+R   S + PGR+YR+Y  T V  FG GL YT +  K         I+A L  
Sbjct: 589 VPMTDMNLR--PSDRLPGRTYRWYP-TAVQPFGFGLHYTTFQAK---------IAAPLPR 636

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
            + +++L + G      + D     T     +++ VTN+G+    +VV+ F       + 
Sbjct: 637 LAIQDLLSRCGGDNANAYPD-----TCALPPLKVEVTNSGNRSSDYVVLAFLAGDAGPRP 691

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
            P K L+ + R+  V+ G K  +        ++  ++ G  +L  G + + V E
Sbjct: 692 YPIKTLVSYTRLRDVSPGHKTTAHLEWTLGDIARYDEQGNTVLYPGTYTVTVDE 745


>gi|238483831|ref|XP_002373154.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
 gi|292495283|sp|B8MYV0.1|XYND_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|220701204|gb|EED57542.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
          Length = 797

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 263/760 (34%), Positives = 385/760 (50%), Gaps = 65/760 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS     RA +L+SLLT +E +   ++     PR+G+PAY+ W+E+LHG+A      +
Sbjct: 63  CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAH----AD 118

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFPQ + + A+ NR+L   I + ++ + RA  N G+ GL  ++PNIN F
Sbjct: 119 FSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTF 178

Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R P WGRGQETPGED   + S YA E++   QG        G+                 
Sbjct: 179 RHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--------GV----------------- 213

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
              + L L A  KH   YD+E W N SR   +  IT+QD  + + P F       K   +
Sbjct: 214 -DANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSV 272

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVP+C      Q   R+ + F   GY++ DC AV  +F    Y      +AA
Sbjct: 273 MCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAA 332

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             ++AG DI+CG    RH Q +    +V  +D++R +  L++  +R G F+G  +   Y 
Sbjct: 333 DSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDG--KTSPYR 390

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL-NKNAVSSLAIIGPLVNNISQMGG 467
            +   DV ++  + L+ +AA Q IVLLKND   LPL   ++  ++A+IGP  N  +QM G
Sbjct: 391 NITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTTSSSTKTIALIGPWANATTQMLG 449

Query: 468 GYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
            Y G      SP    +  E  ++ T   +   D    S +    A+  AK+AD +I   
Sbjct: 450 NYYGPAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQS---TALTTAKEADLIIFAG 506

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           G+D T ETE +DR ++  P  Q+SL+T +A   K P+I++  GGG +D S  + +  +++
Sbjct: 507 GIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVNA 565

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPGR 643
           ++W GYPG++G +ALA+II G   P  RL  T YP  + +V P  DMN+R + S   PG+
Sbjct: 566 LIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSN--PGQ 623

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH- 702
           +Y +YTGT VY FGHGL YTN+           T SAS  +G+           L   H 
Sbjct: 624 TYMWYTGTPVYEFGHGLFYTNF-----------TASASAGSGTKNRTSFNIDEVLGRPHP 672

Query: 703 -IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
               V     L F V   V N GD    +  M F          P K L+GFDR+  V  
Sbjct: 673 GYKLVEQMPLLNFTVD--VKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEP 730

Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
           GS +        + L+  ++ G R+L  G + + +   R 
Sbjct: 731 GSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNNERE 770


>gi|145230215|ref|XP_001389416.1| exo-1,4-beta-xylosidase xlnD [Aspergillus niger CBS 513.88]
 gi|74626559|sp|O00089.2|XYND_ASPNG RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|292495287|sp|A2QA27.1|XYND_ASPNC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|2181180|emb|CAB06417.1| xylosidase [Aspergillus niger]
 gi|134055533|emb|CAK37179.1| xylosidase xlnD-Aspergillus niger
 gi|350638468|gb|EHA26824.1| hypothetical protein ASPNIDRAFT_205670 [Aspergillus niger ATCC
           1015]
          Length = 804

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 273/781 (34%), Positives = 402/781 (51%), Gaps = 70/781 (8%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C+     S+  C+ + +   RA SLISL TL E I    +    + RLG+PAY+ WS
Sbjct: 55  FPDCQNGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 114

Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+       NF+  G  +  TSFPQ +++ A+ NR+L   I S ++ + RA  N G
Sbjct: 115 EALHGLDR----ANFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 170

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           + GL  +APNIN FR P WGRGQETPGED  + + YA E++   QG + +S+        
Sbjct: 171 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESN-------- 222

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                             L L+A  KH   YD+E W N SR   +  IT+QD  + + P 
Sbjct: 223 ------------------LKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 264

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
           F       K   +MC+YN VNGVPAC      Q   R+ +GF   GY++SDCDA   I+ 
Sbjct: 265 FHVAARDAKVQSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 324

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
              Y  +   +AA  + AG DI+CGT    H   +I  G +   DI++ ++ L++  ++ 
Sbjct: 325 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQA 384

Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
           G F+ +  K    Y  L   DV  ++   ++  AA QGIVLLKN    LPL + A     
Sbjct: 385 GYFDSNTTKANNPYRDLSWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 444

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
           +++A+IGP  N  +Q+ G Y G      S     E    K ++A G   +   S +GF  
Sbjct: 445 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAA 503

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
           A+  A+ AD +I   G+D T E E  DR S+  PG Q+ L+  +A  + K+P+I++  GG
Sbjct: 504 ALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGG 563

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
           G +D S  + ++ +S++LW GYPG++G  AL +II G  NP GRL  T YP S+  + P 
Sbjct: 564 GQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 623

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
            DMN+R +     PG++Y++YTG  VY FGHGL YT +     +  S  T +  +K  + 
Sbjct: 624 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NI 675

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
           ++IL QT   L       +T    L F   I   N G ++  +  M+FA          P
Sbjct: 676 QDILSQTHEDL-----ASITQLPVLNFTANIR--NTGKLESDYTAMVFANTSDAGPAPYP 728

Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           +K L+G+DR+  V  G ++E+     P E  S A     R+   G+ V+  G    +L +
Sbjct: 729 KKWLVGWDRLGEVKVGETRELRV---PVEVGSFA-----RVNEDGDWVVFPGTFELALNL 780

Query: 806 E 806
           E
Sbjct: 781 E 781


>gi|290889355|gb|ADD69953.1| xylosidase HistTag [synthetic construct]
          Length = 810

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 273/781 (34%), Positives = 402/781 (51%), Gaps = 70/781 (8%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C+     S+  C+ + +   RA SLISL TL E I    +    + RLG+PAY+ WS
Sbjct: 55  FPDCQNGPLRSHLICDETATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 114

Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+       NF+  G  +  TSFPQ +++ A+ NR+L   I S ++ + RA  N G
Sbjct: 115 EALHGLDR----ANFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 170

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           + GL  +APNIN FR P WGRGQETPGED  + + YA E++   QG + +S+        
Sbjct: 171 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESN-------- 222

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                             L L+A  KH   YD+E W N SR   +  IT+QD  + + P 
Sbjct: 223 ------------------LKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 264

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
           F       K   +MC+YN VNGVPAC      Q   R+ +GF   GY++SDCDA   I+ 
Sbjct: 265 FHVAARDAKVQSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 324

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
              Y  +   +AA  + AG DI+CGT    H   +I  G +   DI++ ++ L++  ++ 
Sbjct: 325 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQA 384

Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
           G F+ +  K    Y  L   DV  ++   ++  AA QGIVLLKN    LPL + A     
Sbjct: 385 GYFDSNTTKANNPYRDLSWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 444

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
           +++A+IGP  N  +Q+ G Y G      S     E    K ++A G   +   S +GF  
Sbjct: 445 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAA 503

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
           A+  A+ AD +I   G+D T E E  DR S+  PG Q+ L+  +A  + K+P+I++  GG
Sbjct: 504 ALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGG 563

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
           G +D S  + ++ +S++LW GYPG++G  AL +II G  NP GRL  T YP S+  + P 
Sbjct: 564 GQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 623

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
            DMN+R +     PG++Y++YTG  VY FGHGL YT +     +  S  T +  +K  + 
Sbjct: 624 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NI 675

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
           ++IL QT   L       +T    L F   I   N G ++  +  M+FA          P
Sbjct: 676 QDILSQTHEDL-----ASITQLPVLNFTANIR--NTGKLESDYTAMVFANTSDAGPAPYP 728

Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           +K L+G+DR+  V  G ++E+     P E  S A     R+   G+ V+  G    +L +
Sbjct: 729 KKWLVGWDRLGEVKVGETRELRV---PVEVGSFA-----RVNEDGDWVVFPGTFELALNL 780

Query: 806 E 806
           E
Sbjct: 781 E 781


>gi|194400335|gb|ACF61038.1| beta-xylosidase [Aspergillus awamori]
          Length = 804

 Score =  417 bits (1072), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 273/781 (34%), Positives = 399/781 (51%), Gaps = 70/781 (8%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C+     S+  C+ S +   RA SLISL TL E I    +    + RLG+PAY+ WS
Sbjct: 55  FPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 114

Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+       NF+  G+ +  TSFPQ +++ A+ NR+L   I S ++ + RA  N G
Sbjct: 115 EALHGLDR----ANFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 170

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           + GL  +APNIN FR P WGRGQETPGED  + + YA E++   QG              
Sbjct: 171 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------------- 216

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                        D    L L+A  KH   YD+E W N SR   +  IT+QD  + + P 
Sbjct: 217 ------------PDPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 264

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
           F       K   +MC+YN VNGVPAC      Q   R+ +GF   GY++SDCDA   I+ 
Sbjct: 265 FHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 324

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
              Y  +   +AA  + AG DI+CGT    H   +I  G +   DI++ ++ L++  ++ 
Sbjct: 325 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQA 384

Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
           G F+ +  K    Y  L   DV  ++   ++  AA QGIVLLKN    LPL + A     
Sbjct: 385 GYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 444

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
           +++A+IGP  N  +Q+ G Y G      S     E    K ++A G   +   S +GF  
Sbjct: 445 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAA 503

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
           A+  A+ AD +I   G+D T E E  DR S+  PG Q+ L+  +A ++  +P+I++  GG
Sbjct: 504 ALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGG 563

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
           G +D S  + ++ ++++LW GYPG++G  AL +II G  NP GRL  T YP S+  + P 
Sbjct: 564 GQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 623

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
            DMN+R +     PG++Y++YTG  VY FGHGL YT +     +  S  T +  +K  + 
Sbjct: 624 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NI 675

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
           ++IL QT   L       +T    L F   I   N G ++  +  M+FA          P
Sbjct: 676 QDILSQTHEEL-----ASITQLPVLNFTANIK--NTGKLESDYTAMVFANTSDAGPAPYP 728

Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
            K L+G+DR+  V  G ++E+     P E  S A     R+   G+ VL  G    +L +
Sbjct: 729 VKWLVGWDRLGDVKVGETRELRV---PVEVGSFA-----RVNEDGDWVLFPGTFELALNL 780

Query: 806 E 806
           E
Sbjct: 781 E 781


>gi|169767016|ref|XP_001817979.1| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
 gi|121805502|sp|Q2UR38.1|XYND_ASPOR RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|83765834|dbj|BAE55977.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 798

 Score =  417 bits (1072), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 264/768 (34%), Positives = 391/768 (50%), Gaps = 80/768 (10%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS     RA +L+SLLT +E +   ++     PR+G+PAY+ W+E+LHG+A      +
Sbjct: 63  CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAH----AD 118

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFPQ + + A+ NR+L   I + ++ + RA  N G+ GL  ++PNIN F
Sbjct: 119 FSDAGGFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTF 178

Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R P WGRGQETPGED   + S YA E++   QG        G+                 
Sbjct: 179 RHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--------GV----------------- 213

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
              + L L A  KH   YD+E W N SR   +  IT+QD  + + P F       K   +
Sbjct: 214 -DANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSV 272

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVP+C      Q   R+ + F   GY++ DC AV  +F    Y      +AA
Sbjct: 273 MCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAA 332

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             ++AG DI+CG    RH Q +    +V  +D++R +  L++  +R G F+G  +   Y 
Sbjct: 333 DSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDG--KTSPYR 390

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL--NKNAVSSLAIIGPLVNNISQMG 466
            +   DV ++  + L+ +AA Q IVLLKND   LPL    ++  ++A+IGP  N  +QM 
Sbjct: 391 NITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQML 449

Query: 467 GGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
           G Y G      SP    +  E  ++ T   +   D    S +    A+  AK+AD +I  
Sbjct: 450 GNYYGPAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQS---TALTTAKEADLIIFA 506

Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
            G+D T ETE +DR ++  P  Q+SL+T +A   K P+I++  GGG +D S  + +  ++
Sbjct: 507 GGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVN 565

Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPG 642
           +++W GYPG++G +ALA+II G   P  RL  T YP  + +V P  DMN+R + S   PG
Sbjct: 566 ALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSN--PG 623

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
           ++Y +YTGT VY FGHGL YTN+           T SAS  +G        T +R  + +
Sbjct: 624 QTYMWYTGTPVYEFGHGLFYTNF-----------TASASASSG--------TKNRTSF-N 663

Query: 703 IDEVTSCTSLRFHV---------QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
           IDEV     L + +          + V N GD    +  M F          P K L+GF
Sbjct: 664 IDEVLGRPHLGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGF 723

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
           DR+  V  GS +        + L+  ++ G R+L  G + + +   R 
Sbjct: 724 DRLSAVEPGSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNNERE 771


>gi|378730020|gb|EHY56479.1| beta-glucosidase, variant [Exophiala dermatitidis NIH/UT8656]
 gi|378730021|gb|EHY56480.1| beta-glucosidase [Exophiala dermatitidis NIH/UT8656]
          Length = 783

 Score =  417 bits (1072), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 267/763 (34%), Positives = 399/763 (52%), Gaps = 64/763 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           CNT+ S++ RAK+L++ LT +EK     + +  +PRLG+ +Y+WW E+LHG+AS+ PGVN
Sbjct: 35  CNTNASVADRAKALVAALTNEEKFNLTGNTSPGVPRLGLYSYQWWQEALHGVASS-PGVN 93

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFPQ ++ +A+F+ +L + + + V+ EARA  N+ ++GL FW PNIN +
Sbjct: 94  FSTSGDFSHATSFPQPILMSAAFDDALINAVATVVSTEARAFNNVNRSGLDFWTPNINPY 153

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGED   + +Y    +   QG          G     K+V+        
Sbjct: 154 KDPRWGRGQETPGEDTFHLKSYVAALIDGLQG----------GLNPPIKKVI-------- 195

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                   A CKH +AYDLE W    RY+F+A+++ QD  + +  PF++C    +   IM
Sbjct: 196 --------ATCKHFVAYDLEDWITTDRYNFDAIVSTQDLAEYYMQPFQTCARDARVGSIM 247

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAA 348
           CSYN +NGVP C    + Q   R  W +     Y+TSDCDA+  I+    Y  T E + A
Sbjct: 248 CSYNAMNGVPTCADPYILQTVLREHWNWTDDGQYVTSDCDAIQNIYAPHYYEPTREQAVA 307

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             L AG D+NCGT    H  +A  +G   +  ID+ +  L+S  ++LG F+  P    Y 
Sbjct: 308 DALTAGTDLNCGTYYQTHLPAAFSEGLFNQTVIDQTITRLYSALIKLGYFD-PPSATPYR 366

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF---LPLNKNAVSSLAIIGPLVNNISQM 465
            L   DV T   + LAL AA +GIVLLKND       P +KN  +++AIIG   N  + M
Sbjct: 367 SLNWSDVSTPAAEALALKAAEEGIVLLKNDGLLPLSFPTDKN--TTVAIIGGWANATTTM 424

Query: 466 GGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
            G Y GI     S L  L+   +      G   VP     G+ E +  A +AD +I+  G
Sbjct: 425 QGNYFGIAPYLHSPLYALQQLPNINAVYGGGFGVPTTD--GWDELLGAAGEADLIIIADG 482

Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
           L  + E+E  DR ++      + ++  ++   K  V L +  G  LD +    +  IS++
Sbjct: 483 LTTSDESESNDRYTIGWQPAAIDIINQLSGMGKPTVFLQM--GDQLDNTPLLNNPNISAL 540

Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRS 644
           +W GYPG AG  AL  I+ G   P GRLP+T YP  +  +V M DM +R +++   PGR+
Sbjct: 541 IWGGYPGMAGGDALINILTGKAAPAGRLPVTQYPADYVNQVNMTDMELRPNATSGNPGRT 600

Query: 645 YRFYTGTQVYGFGHGLSYTNYSY--------KFLSAPSELTISASLKAGSDKNI--LQQT 694
           Y++Y    V  FG+GL YTN+S         +  S PS    ++S   G+  NI  L  +
Sbjct: 601 YKWYNNA-VLPFGYGLHYTNFSVAASAQGQAQTQSGPSS---NSSQGQGTSYNISSLVSS 656

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVML-FARVPKVSQGTPEKQLIGF 753
             R  Y ++D    C    F+V  +VTN G    S  V L F       Q  P KQL+ +
Sbjct: 657 CDRSQYAYLDL---CPFESFNV--NVTNTGSKLASDFVALGFISGSYGPQPYPIKQLVAY 711

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
            R+  ++ G+   +        L+  +++G  +L  G++ L++
Sbjct: 712 QRLFNISAGASATATLNLTLGSLARHDENGNAVLYPGDYGLLI 754


>gi|354508473|gb|AER26905.1| beta-xylosidase 3 [synthetic construct]
          Length = 778

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 272/781 (34%), Positives = 399/781 (51%), Gaps = 70/781 (8%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C+     S+  C+ S +   RA SLISL TL E I    +    + RLG+PAY+ WS
Sbjct: 29  FPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 88

Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+       NF+  G+ +  TSFPQ +++ A+ NR+L   I S ++ + RA  N G
Sbjct: 89  EALHGLDR----ANFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 144

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           + GL  +APNIN FR P WGRGQETPGED  + + YA E++   QG              
Sbjct: 145 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------------- 190

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                        D    L L+A  KH   YD+E W N SR   +  IT+QD  + + P 
Sbjct: 191 ------------PDPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 238

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
           F       K   +MC+YN V+GVPAC      Q   R+ +GF   GY++SDCDA   I+ 
Sbjct: 239 FHVAARDAKVHSVMCAYNAVDGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 298

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
              Y  +   +AA  + AG DI+CGT    H   +I  G +   DI++ ++ L++  ++ 
Sbjct: 299 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQA 358

Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
           G F+ +  K    Y  L   DV  ++   ++  AA QGIVLLKN    LPL + A     
Sbjct: 359 GYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 418

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
           +++A+IGP  N  +Q+ G Y G      S     E    K ++A G   +   S +GF  
Sbjct: 419 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAA 477

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
           A+  A+ AD +I   G+D T E E  DR S+  PG Q+ L+  +A ++  +P+I++  GG
Sbjct: 478 ALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGG 537

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
           G +D S  + ++ ++++LW GYPG++G  AL +II G  NP GRL  T YP S+  + P 
Sbjct: 538 GQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 597

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
            DMN+R +     PG++Y++YTG  VY FGHGL YT +     +  S  T +  +K  + 
Sbjct: 598 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NI 649

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
           ++IL QT   L       +T    L F   I   N G ++  +  M+FA          P
Sbjct: 650 QDILSQTHEEL-----ASITQLPVLNFTANIK--NTGKLESDYTAMVFANTSDAGPAPYP 702

Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
            K L+G+DR+  V  G ++E+     P E  S A     R+   G+ VL  G    +L +
Sbjct: 703 VKWLVGWDRLGDVKVGETRELRV---PVEVGSFA-----RVNEDGDWVLFPGTFELALNL 754

Query: 806 E 806
           E
Sbjct: 755 E 755


>gi|436410475|gb|AGB57183.1| beta-xylosidase [Aspergillus sp. BCC125]
          Length = 804

 Score =  416 bits (1069), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 274/781 (35%), Positives = 397/781 (50%), Gaps = 70/781 (8%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C+     S+  C+ S +   RA SLISL TL E I    +    + RLG+PAY+ WS
Sbjct: 55  FPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 114

Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+       NF+  G+ +  TSFPQ +++ A+ NR+L   I S ++ + RA  N G
Sbjct: 115 EALHGLDR----ANFSDLGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 170

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           + GL  +APNIN FR P WGRGQETPGED  + + YA E++   QG              
Sbjct: 171 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAIYAYEYITGIQG-------------- 216

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                        D    L L+A  KH   YD+E W N SR   +  IT+QD  + + P 
Sbjct: 217 ------------PDPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 264

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
           F       K   +MC+YN VNGVPAC      Q   R+ +GF   GY++SDCDA   I+ 
Sbjct: 265 FHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 324

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
              Y  +   +AA  + AG DI+CGT    H   +I  G +   DI++ ++ L++  ++ 
Sbjct: 325 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQA 384

Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
           G F+ +  K    Y  L   DV  ++   ++  AA QGIVLLKN    LPL + A     
Sbjct: 385 GYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 444

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
           +++A+IGP  N  +Q+ G Y G      S     E      ++A G   +   S +GF  
Sbjct: 445 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYNVNFAEGT-GISSTSTSGFAA 503

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
           A+  A+ AD +I   G+D T E E  DR S+  PG Q+ L+  +A ++  +P+I++  GG
Sbjct: 504 ALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGG 563

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
           G +D S  + ++ +S++LW GYPG++G  AL +II G  NP GRL  T YP S+  + P 
Sbjct: 564 GQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 623

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
            DMN+R +     PG++Y++YTG  VY FGHGL YT +     +  S  T +  +K  + 
Sbjct: 624 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTREIKL-NI 675

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
           ++IL QT   L       +T    L F   I   N G V+  +  M+FA          P
Sbjct: 676 QDILSQTHEDL-----ASITQLPVLNFTANIK--NTGKVESDYTAMVFANTSDAGPAPYP 728

Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
            K L+G+DR+  V  G ++E+     P E  S A     R+   G+ VL  G     L +
Sbjct: 729 VKWLVGWDRLGDVKVGETRELRV---PIEVGSFA-----RVNEDGDWVLFPGTFELGLNL 780

Query: 806 E 806
           E
Sbjct: 781 E 781


>gi|310792973|gb|EFQ28434.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 728

 Score =  416 bits (1069), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 250/710 (35%), Positives = 384/710 (54%), Gaps = 77/710 (10%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF---NGTVSSVTSFPQV 128
           ++++EK++ L D ++ +  LG+P + WW+E LHG+  + PGV F   +      TSFP  
Sbjct: 1   MSVEEKVRNLVDASAGVKSLGLPPHGWWNEGLHGVGFS-PGVLFAQDSEPFGYATSFPLP 59

Query: 129 LVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDP 188
           +++AASF+  L++ IG  +  E RA  N G AG  FW PN+N FRDPRWGRGQETPGED 
Sbjct: 60  ILTAASFDDDLFNAIGQVIGREGRAFSNYGYAGFNFWTPNMNAFRDPRWGRGQETPGEDV 119

Query: 189 MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIA 248
           +VVS Y   +V   QG                          SD  D+++++A CKH  A
Sbjct: 120 LVVSNYVQSYVTGLQG--------------------------SDPTDKVIIAA-CKHFAA 152

Query: 249 YDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD 308
           YD+E     +R + N   T+QD +D + P FR C+       +MCSYN V+G+PAC    
Sbjct: 153 YDIET----ARRANNYNPTQQDLQDYYLPAFRRCVRDSHVGTVMCSYNSVDGIPACSSEY 208

Query: 309 LFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML 364
           L ++  R+ WGF     ++ SDC AV  ++   N+T T +D+A+  + AG D+ CG+  L
Sbjct: 209 LLKEVLRDTWGFTNDYQFVVSDCGAVTDVWLLHNFTNTEQDAASVSMAAGTDLECGSSYL 268

Query: 365 RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLA 424
               S  DK   QE+ +D AL  L+     +G F+G      +  LG  DV T + +++A
Sbjct: 269 HLNGSLADKQVTQER-VDEALTRLYKALFTVGYFDG----SSHSSLGWSDVSTIDAQQIA 323

Query: 425 LDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLE 484
            +AAR G+ LLKND   LPL      S+A+IGP  N  +QM G Y G     +S L    
Sbjct: 324 CEAARAGMTLLKND-GVLPLADGKYKSVALIGPFANATTQMQGNYFGRAPFVRSPLWAFT 382

Query: 485 AYVS-KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
              S + +YA+G  D+   SD+GF +A+  AK +D VI   G+D T E E  DRVS+  P
Sbjct: 383 QQSSLQVNYAAGT-DINSTSDSGFADALAAAKNSDIVIFCGGIDTTIEAETLDRVSITWP 441

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
           G Q+ L++ ++   K P+++   GGG +D +    ++ ++++ W G PG+AG  A+ +++
Sbjct: 442 GNQLDLISQLSMLGK-PLVVAQFGGGQVDDTALVDNANVNALFWAGLPGQAGGLAMYDLV 500

Query: 604 FGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
            G  +  GRLP T YP S+   V + ++N+R + +  +PGR+Y++Y G  V+ FG GL Y
Sbjct: 501 VGKASFAGRLPTTQYPASYADLVSIFNINLRPNGT--FPGRTYKWYIGEPVFPFGFGLHY 558

Query: 663 TNYSYKFLSAPSELTISASLKAGSD-KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           T +++         T   +L+   D  NI+    S+ +  H+ + T  TS    V ++V 
Sbjct: 559 TKFNF---------TWKDTLEPTYDISNIISWARSQ-NNGHVTDTTPFTS----VNVTVK 604

Query: 722 NAGDVDGSHVVMLF------ARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
           N G+V   +V +LF        VP+     P K L  + R H +  G+ +
Sbjct: 605 NVGNVRSDYVGLLFLSSKNAGPVPR-----PNKSLASYSRAHDIETGASD 649


>gi|2723496|dbj|BAA24107.1| beta-1,4-xylosidase [Aspergillus oryzae]
          Length = 798

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 259/759 (34%), Positives = 384/759 (50%), Gaps = 62/759 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS     RA +L+SLLT +E +   ++     PR+G+PAY+ W+E+LHG+A      +
Sbjct: 63  CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAH----AD 118

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFPQ + + A+ NR+L   I + ++ + RA  N G+ GL  ++PNIN F
Sbjct: 119 FSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTF 178

Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R P WGRGQETPGED   + S YA E++   QG        G+                 
Sbjct: 179 RHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--------GV----------------- 213

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
              + L L A  KH   YD+E W N SR   +  IT+QD  + + P F       K   +
Sbjct: 214 -DANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSV 272

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVP+C      Q   R+ + F   GY++ DC AV  +F    Y      +AA
Sbjct: 273 MCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAA 332

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             ++AG DI+CG    RH Q +    +V  +D++R ++ L++  +R G F+G  +   Y 
Sbjct: 333 DSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDG--KTSPYR 390

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL--NKNAVSSLAIIGPLVNNISQMG 466
            +   DV ++  + L+ +AA Q IVLLKND   LPL    ++  ++A+IGP  N  +QM 
Sbjct: 391 NITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQML 449

Query: 467 GGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
           G Y G      SP    +  E  ++ T   +   D    S +    A+  AK+AD +I  
Sbjct: 450 GNYYGPAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQS---TALTTAKEADLIIFA 506

Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
            G+D T ETE +DR ++  P  Q+SL+T +A   K P+I++  GGG +D S  + +  ++
Sbjct: 507 GGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVN 565

Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPG 642
           +++W GYPG++G +ALA+II G   P  RL  T YP  + +V P  DMN+R + S   PG
Sbjct: 566 ALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSN--PG 623

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
           ++Y +YTGT VY FGHGL YTN+           T SAS  +G+           L   H
Sbjct: 624 QTYMWYTGTPVYEFGHGLFYTNF-----------TASASAGSGTKNRTSFNIDEVLGRPH 672

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
                       +  + V N GD    +  M F          P K L+GFDR+  V  G
Sbjct: 673 PGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPG 732

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
           S +        + L+  ++ G R+L  G + + +   R 
Sbjct: 733 SAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNNERE 771


>gi|224068498|ref|XP_002302758.1| predicted protein [Populus trichocarpa]
 gi|222844484|gb|EEE82031.1| predicted protein [Populus trichocarpa]
          Length = 462

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 203/456 (44%), Positives = 292/456 (64%), Gaps = 8/456 (1%)

Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
           +A +D++CG  + +HT+ A+ KG + E +I+ ALLN  +VQ+RLG+F+G+P    YG LG
Sbjct: 5   QASLDLDCGPFLGQHTEDAVRKGLLTEAEINNALLNTLTVQMRLGMFDGEPSSKPYGNLG 64

Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
           P DVCT  H++LAL+AARQGIVLLKN    LPL+     S+AIIGP  N    M G Y G
Sbjct: 65  PTDVCTPAHQELALEAARQGIVLLKNHGPPLPLSTRHHQSVAIIGPNSNVTVTMIGNYAG 124

Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
           + C   + L+G+  Y +KT Y  GC DV C SD  F  A+  A++AD  ++V GLD + E
Sbjct: 125 VACGYTTPLQGIGRY-AKTIYQQGCADVACVSDQQFVAAMDAARQADATVLVMGLDQSIE 183

Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
            E RDR  LLLPG+Q  L++ VA  SK P ILVL  GGP+DVSFAE D +I  I+W GYP
Sbjct: 184 AESRDRTELLLPGRQQELISKVAAASKGPTILVLMSGGPIDVSFAENDPKIGGIVWAGYP 243

Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTG 650
           G+AG  A+++++FG  NPGG+LPMTWYP+ + T +PM +M MR   S  YPGR+YRFY G
Sbjct: 244 GQAGGAAISDVLFGTTNPGGKLPMTWYPQDYVTNLPMTNMAMRPSKSNGYPGRTYRFYKG 303

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
             VY FGHG+SYTN+ +   SAP+ +++       + +N    +G  +   H      C 
Sbjct: 304 KVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRQASRNA-TISGKAIRVTH----ARCN 358

Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
            L F VQ+ V N G +DG+H ++++++ P      P KQL+ F++VH  A   + +   V
Sbjct: 359 RLSFGVQVDVKNTGSMDGTHTLLVYSK-PPAGHWAPLKQLVAFEKVHVAAGTQQRVGINV 417

Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             C+ LS+ ++ G R +P+G H L +G+++HS++++
Sbjct: 418 HVCKFLSVVDRSGIRRIPMGAHSLHIGDVKHSVSLQ 453


>gi|358397360|gb|EHK46735.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
           206040]
          Length = 865

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 271/794 (34%), Positives = 425/794 (53%), Gaps = 86/794 (10%)

Query: 24  VVNVIAFSN--SKPVLNKPDFP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQ 80
           ++N+ + +N  + P L  P FP C      S   C+T+LS++ RA +++  +TL EK+  
Sbjct: 111 MINIYSVTNPSANPPL-TPSFPDCTRDPLCSNAICDTTLSMAERAAAIVKPMTLDEKVAN 169

Query: 81  LSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN----GTVSSVTSFPQVLVSAASFN 136
           +  +AS   RLG+PAY+W +E+LHG+A +  GV F        S+ TSFP  ++ +A+F+
Sbjct: 170 VGSSASGSARLGLPAYQWQNEALHGVAGS-TGVQFQSPLGANFSAATSFPMPILLSAAFD 228

Query: 137 RSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAV 196
            +L  N+ +A++ EARA  N G AGL FW PNIN FRDPRWGRG ETPGED   +  Y +
Sbjct: 229 DALVQNVATAISTEARAFANYGFAGLDFWTPNINPFRDPRWGRGMETPGEDAFRIQGYVL 288

Query: 197 EFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGN 256
             +   Q        GGI                    D   + A CKH  AYD+E    
Sbjct: 289 ALISGLQ--------GGI------------------NPDFFRIIATCKHFAAYDIEN--- 319

Query: 257 FSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RN 315
             R   N   T+QD  D + P F +C+   K   +MC+YN V+G+PAC    L Q   R+
Sbjct: 320 -GRTGNNLNPTQQDMADYYLPMFETCVRDAKVGSVMCAYNAVDGIPACASEYLLQDVLRD 378

Query: 316 EWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAID 372
            +GF     Y+ SDCDAV  +F+  +Y     ++AA  L AG D++CG+       ++++
Sbjct: 379 GFGFTEDFNYVVSDCDAVDNVFDPHHYASNLTEAAALSLNAGTDLDCGSSY-NVLNASVE 437

Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
                E  ++++L+ L+S  +++G F+   +  +Y  L   +V T++++ LA DAA  G+
Sbjct: 438 AALTSEAALNQSLVRLYSALIKVGYFD---QPSEYKSLSWANVNTTQNQALAHDAATGGM 494

Query: 433 VLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK--- 489
            LLKND   LPL++  +S++AIIGP VN  +QM G Y G   +   L+  L+ +  K   
Sbjct: 495 TLLKNDGT-LPLSR-TLSNVAIIGPWVNATTQMQGNYAG---TAPFLVNPLDVFQQKWGN 549

Query: 490 THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSL 549
             YA G   +     +GF  A+  A  +D ++ + G+D+T E E  DR S++ PG Q+ L
Sbjct: 550 VKYAQGTA-INSQDTSGFSAALSAASSSDVIVYLGGIDITVENEGFDRGSIVWPGNQLDL 608

Query: 550 VTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
           ++ +A   K P+++V  GGG +D S   ++  + SILW GYPG+ G  A+ +++ G   P
Sbjct: 609 ISQLANLGK-PLVIVQFGGGQIDDSSLLSNPNVRSILWAGYPGQDGGNAVFDVLTGANPP 667

Query: 610 GGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYK 668
            GRLP+T YP S+     + DMN+R   S   PGR+Y +YTGT V  FG+GL YTN+S  
Sbjct: 668 AGRLPITQYPASYINNNNIQDMNLR--PSNGIPGRTYAWYTGTPVLPFGYGLHYTNFSVS 725

Query: 669 FLSAPSELTISASLKAGSD-KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG--- 724
           F S  +         AG+D   I+   G+ +D          TS+   + +SV N G   
Sbjct: 726 FQSINT---------AGTDVATIVNNAGAVID----------TSVFATLVVSVHNTGGKA 766

Query: 725 DVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKH 782
           ++   +V ++F           P KQL  + R  +V  G +++++  ++    L+ A+ +
Sbjct: 767 NLASDYVGLVFLSSTNAGPSPYPNKQLAAYGRAKSVGVGATQQLTLKIN-LGSLARADTN 825

Query: 783 GRRILPLGNHVLMV 796
           G R +  G++ L +
Sbjct: 826 GDRWIYPGDYKLTL 839


>gi|4235093|gb|AAD13106.1| beta-xylosidase [Aspergillus niger]
          Length = 804

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 272/781 (34%), Positives = 399/781 (51%), Gaps = 70/781 (8%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C+     S+  C+ S +   RA SLISL TL E I    +    + RLG+PAY+ WS
Sbjct: 55  FPDCQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWS 114

Query: 101 ESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+       NF+  G+ +  TSFPQ +++ A+ NR+L   I S ++ + RA  N G
Sbjct: 115 EALHGLDR----ANFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAG 170

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
           + GL  +APNIN FR P WGRGQETPGED  + + YA E++   QG              
Sbjct: 171 RYGLDVYAPNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQG-------------- 216

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                        D    L L+A  KH   YD+E W N SR   +  IT+QD  + + P 
Sbjct: 217 ------------PDPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQ 264

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFE 335
           F       K   +MC+YN V+GVPAC      Q   R+ +GF   GY++SDCDA   I+ 
Sbjct: 265 FHVAARDAKVHSVMCAYNAVDGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYN 324

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
              Y  +   +AA  + AG DI+CGT    H   +I  G +   DI++ ++ L++  ++ 
Sbjct: 325 PHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQA 384

Query: 396 GLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV---- 449
           G F+ +  K    Y  L   DV  ++   ++  AA QGIVLLKN    LPL + A     
Sbjct: 385 GYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSN 444

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
           +++A+IGP  N  +Q+ G Y G      S     E    K ++A G   +   S +GF  
Sbjct: 445 TTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAA 503

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGG 568
           A+  A+ AD +I   G+D T E E  DR S+  PG Q+ L+  +A ++  +P+I++  GG
Sbjct: 504 ALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGG 563

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
           G +D S  + ++ ++++LW GYPG++G  AL +II G  NP GRL  T YP S+  + P 
Sbjct: 564 GQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPA 623

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
            DMN+R +     PG++Y++YTG  VY FGHGL YT +     +  S  T +  +K  + 
Sbjct: 624 TDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NI 675

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
           ++IL QT   L       +T    L F   I   N G ++  +  M+FA          P
Sbjct: 676 QDILSQTHEEL-----ASITQLPVLNFTANIK--NTGKLESDYTAMVFANTSDAGPAPYP 728

Query: 747 EKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
            K L+G+DR+  V  G ++E+     P E  S A     R+   G+ VL  G    +L +
Sbjct: 729 VKWLVGWDRLGDVKVGETRELRV---PVEVGSFA-----RVNEDGDWVLFPGTFELALNL 780

Query: 806 E 806
           E
Sbjct: 781 E 781


>gi|3135209|dbj|BAA28267.1| beta-xylosidase A [Aspergillus oryzae]
          Length = 798

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 259/759 (34%), Positives = 384/759 (50%), Gaps = 62/759 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS     RA +L+SLLT +E +   ++     PR+G+PAY+ W+E+LHG+A      +
Sbjct: 63  CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAH----AD 118

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFPQ + + A+ NR+L   I + ++ + RA  N G+ GL  ++PNIN F
Sbjct: 119 FSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTF 178

Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R P WGRGQETPGED   + S YA E++   QG        G+                 
Sbjct: 179 RHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--------GV----------------- 213

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
              + L L A  KH   YD+E W N SR   +  IT+QD  + + P F       K   +
Sbjct: 214 -DANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSV 272

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVP+C      Q   R+ + F   GY++ DC AV  +F    Y      +AA
Sbjct: 273 MCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAA 332

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             ++AG DI+CG    RH Q +    +V  +D++R ++ L++  +R G F+G  +   Y 
Sbjct: 333 DSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRAGYFDG--KTSPYR 390

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL--NKNAVSSLAIIGPLVNNISQMG 466
            +   DV ++  + L+ +AA Q IVLLKND   LPL    ++  ++A+IGP  N  +QM 
Sbjct: 391 NITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQML 449

Query: 467 GGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
           G Y G      SP    +  E  ++ T   +   D    S +    A+  AK+AD +I  
Sbjct: 450 GNYYGPAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQS---TALTTAKEADLIIFA 506

Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
            G+D T ETE +DR ++  P  Q+SL+T +A   K P+I++  GGG +D S  + +  ++
Sbjct: 507 GGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVN 565

Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPG 642
           +++W GYPG++G +ALA+II G   P  RL  T YP  + +V P  DMN+R + S   PG
Sbjct: 566 ALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSN--PG 623

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
           ++Y +YTGT VY FGHGL YTN+           T SAS  +G+           L   H
Sbjct: 624 QTYMWYTGTPVYEFGHGLFYTNF-----------TASASAGSGTKNRTSFNIDEVLGRPH 672

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
                       +  + V N GD    +  M F          P K L+GFDR+  V  G
Sbjct: 673 PGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPG 732

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
           S +        + L+  ++ G R+L  G + + +   R 
Sbjct: 733 SAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNNERE 771


>gi|391872736|gb|EIT81831.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 798

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 259/759 (34%), Positives = 383/759 (50%), Gaps = 62/759 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS     RA +L+SLLT +E +   ++     PR+G+PAY+ W+E+LHG+A      +
Sbjct: 63  CDTSAKPHDRAAALVSLLTFEELVNNTANTGHGAPRIGLPAYQVWNEALHGVAH----AD 118

Query: 115 FN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+  G  S  TSFPQ + + A+ NR+L   I + ++ + RA  N G+ GL  ++PNIN F
Sbjct: 119 FSDAGDFSWSTSFPQPISTMAALNRTLIHQIATIISTQGRAFMNAGRYGLDVYSPNINTF 178

Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R P WGRGQETPGED   + S YA E++   QG        G+                 
Sbjct: 179 RHPVWGRGQETPGEDAYCLASTYAYEYITGIQG--------GV----------------- 213

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
              + L L A  KH   YD+E W N SR   +  IT+QD  + + P F       K   +
Sbjct: 214 -DANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQFLVASRDAKVHSV 272

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVP+C      Q   R+ + F   GY++ DC AV  +F    Y      +AA
Sbjct: 273 MCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNPHGYATNESSAAA 332

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             ++AG DI+CG    RH Q +    +V  +D++R +  L++  +R G F+G  +   Y 
Sbjct: 333 DSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRAGYFDG--KTSPYR 390

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL--NKNAVSSLAIIGPLVNNISQMG 466
            +   DV ++  + L+ +AA Q IVLLKND   LPL    ++  ++A+IGP  N  +QM 
Sbjct: 391 NITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALIGPWANATTQML 449

Query: 467 GGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
           G Y G      SP    +  E  ++ T   +   D    S +    A+  AK+AD +I  
Sbjct: 450 GNYYGPAPYLISPLQAFQDSEYKITYTIGTNTTTDPDSTSQS---TALTTAKEADLIIFA 506

Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
            G+D T ETE +DR ++  P  Q+SL+T +A   K P+I++  GGG +D S  + +  ++
Sbjct: 507 GGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSALKNNKNVN 565

Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPG 642
           +++W GYPG++G +ALA+II G   P  RL  T YP  + +V P  DMN+R + S   PG
Sbjct: 566 ALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRPNGSN--PG 623

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
           ++Y +YTGT VY FGHGL YTN+           T SAS  +G+           L   H
Sbjct: 624 QTYMWYTGTPVYEFGHGLFYTNF-----------TASASAGSGTKNRTSFNIDEVLGRPH 672

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
                       +  + V N GD    +  M F          P K L+GFDR+  V  G
Sbjct: 673 PGYKLVEQMPLLNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFDRLSAVEPG 732

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
           S +        + L+  ++ G R+L  G + + +   R 
Sbjct: 733 SAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNNERE 771


>gi|380293100|gb|AFD50200.1| beta-xylosidase [Hypocrea orientalis]
          Length = 797

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 268/752 (35%), Positives = 397/752 (52%), Gaps = 63/752 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C++S     RA++LISL TL+E I    ++   +PRLG+P Y+ W+E+LHG+       N
Sbjct: 63  CDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDR----AN 118

Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F    G     TSFP  +++ A+ NR+L   I   ++ +ARA  N G+ GL  +APN+N 
Sbjct: 119 FATKGGQFEWATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNVNG 178

Query: 172 FRDPRWGRGQETPGEDPMVVS-AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           FR P WGRGQETPGED   +S AY  E++   QG                     G   E
Sbjct: 179 FRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG---------------------GVDPE 217

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                +L ++A  KH   YDLE W N SR  F+A+IT+QD  + + P F +     K+  
Sbjct: 218 -----QLKVAATVKHFAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRS 272

Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHEDSA 347
           +MCSYN VNGVP+C      Q   R  WGF   GY++SDCDAV  +F   +Y      +A
Sbjct: 273 LMCSYNSVNGVPSCANSFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSSAA 332

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  L+AG DI+CG     H   +   G+V   +I+R++  L++  +RLG F+   +K +Y
Sbjct: 333 ASSLRAGTDIDCGQTYPWHLNESFVAGEVTRGEIERSVTRLYANLVRLGYFD---KKNQY 389

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             LG  DV  ++   ++ +AA +GIVLLKND   LPL+K  V S+A+IGP  N  +QM G
Sbjct: 390 RSLGWKDVVKTDAWNISYEAAVEGIVLLKNDGT-LPLSKK-VRSIALIGPWANATTQMQG 447

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G      S L   +      ++  G  ++  NS AGF +A+  AKK+D ++ + G+D
Sbjct: 448 NYFGPAPYLISPLEAAKKAGYHVNFELGT-EIAGNSTAGFAKAIAAAKKSDAIVYLGGID 506

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T E E  DR  +  PG Q+ L+  ++   K P++++  GGG +D S  +++ +++S++W
Sbjct: 507 NTIEQEGADRTDIAWPGNQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVW 565

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
            GYPG++G  AL +I+ G   P GRL  T YP  +  + P NDMN+R D  +  PG++Y 
Sbjct: 566 GGYPGQSGGVALFDILSGKRAPAGRLITTQYPAEYVHQFPQNDMNLRPD-GKSNPGQTYI 624

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +YTG  VY FG GL YT +     S P  L  +              T S L   H    
Sbjct: 625 WYTGKPVYEFGSGLFYTTFKETLASHPKCLKFN--------------TSSILSAPHPGYT 670

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SK 764
            S     F  + ++ N+G  +  +  MLF R         P K L+GFDR+  +  G S 
Sbjct: 671 YSEQIPVFTFEANIKNSGKTESPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSS 730

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           ++S  + P   L+  + +G RI+  G + L +
Sbjct: 731 KLSIPI-PVSALARVDSYGNRIVYPGKYELAL 761


>gi|2920706|emb|CAA73902.1| beta-xylosidase [Emericella nidulans]
          Length = 802

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 252/766 (32%), Positives = 389/766 (50%), Gaps = 58/766 (7%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C        P C+ SLS   RA +L+SL T  E +    +    + RLG+P Y+ W 
Sbjct: 46  FPDCATGPLSLTPVCDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWG 105

Query: 101 ESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
           E+LHG+       NF  +G  S  TSFP  +   A+ N++L   IG+ V+ + RA  N G
Sbjct: 106 EALHGVGR----ANFVESGNFSWATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAG 161

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
             G+  ++PNIN FR P WGRGQETPGED  + S Y  E++ + QG              
Sbjct: 162 LGGVDVYSPNINTFRHPVWGRGQETPGEDAFLTSVYGYEYITALQG-------------- 207

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
                      E+ +     + A  KH   YD+E W N SR   +  IT+Q+  + + PP
Sbjct: 208 -------AVDPETSK-----IIATAKHYAGYDIESWNNHSRLGNDMQITQQELSEYYTPP 255

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFE 335
           F       K   +MCSYN VNGVP+C      Q   R+ + F   GY++ DC AV  ++ 
Sbjct: 256 FIVASRDAKVRSVMCSYNAVNGVPSCANKFFLQTLLRDTFEFSEDGYVSGDCGAVYNVWN 315

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
              Y      ++A  + AG DI+CGT    H++ A +   V   DI+R ++ L+S  ++ 
Sbjct: 316 PHGYASNEAAASADSILAGTDIDCGTSYQWHSEDAFEDSLVSRSDIERGVIRLYSNLVQA 375

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G F+G+     Y  +  DDV +++   +A +AA +GIVLLKND+  LPL+K+ + S+A+I
Sbjct: 376 GYFDGE--DAPYRDITWDDVLSTDAWNIAYEAAVEGIVLLKNDET-LPLSKD-IKSVAVI 431

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
           GP  N   ++ G Y G      S L G        HYA G  ++  +S +GF EA+  AK
Sbjct: 432 GPWANVTEELQGNYFGPAPYLISPLTGFRDSGLDVHYALGT-NLTSHSTSGFEEALTAAK 490

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           +AD +I   G+D T E E  DR ++  PG Q+ L++ ++   K P++++  GGG +D S 
Sbjct: 491 QADAIIFAGGIDNTIEAEAMDRENITWPGNQLDLISKLSELGK-PLVVLQMGGGQVDSSS 549

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRA 634
            + +  +++++W GYPG++G  ALA+II G   P GRL  T YP  + +V P  DMN+R 
Sbjct: 550 LKDNDNVNALIWGGYPGQSGGHALADIITGKRAPAGRLVTTQYPAEYAEVFPAIDMNLRP 609

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           + +   PG++Y +YTGT VY FGHGL YT          +    + +  AGS       T
Sbjct: 610 NETSGNPGQTYMWYTGTPVYEFGHGLFYT----------TFEESTETTDAGSFNIQTVLT 659

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
                Y H  + T       +   +V N G+ +  +  +++          P+K ++GFD
Sbjct: 660 TPHSGYEHAQQKT-----LLNFTATVKNTGERESDYTALVYVNTTAGPAPYPKKWVVGFD 714

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
           R+  +  G  +        E ++  ++ G R+L  G++ + +   R
Sbjct: 715 RLGGLEPGDSQTLTVPVTVESVARTDEQGNRVLYPGSYDVALNNER 760


>gi|292495632|sp|Q0CMH8.2|XYND_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
          Length = 793

 Score =  413 bits (1061), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 272/768 (35%), Positives = 399/768 (51%), Gaps = 71/768 (9%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+ S     RA +L+S+ TL+E +    +  + +PRLG+P Y+ WSESLHG+       N
Sbjct: 63  CDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGVYR----AN 118

Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           +   G  S  TSFPQ +++ A+ NR+L   IG  ++ +ARA  N+G+ GL  +APNIN F
Sbjct: 119 WASEGDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGLDTYAPNINSF 178

Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R P WGRGQETPGED   + S YA E++   QG                     G   E+
Sbjct: 179 RHPVWGRGQETPGEDAYYLASTYAYEYITGIQG---------------------GVDPET 217

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                L L A  KH   YD+E W   SR   +  IT+QD  + + P F       K   +
Sbjct: 218 -----LKLVATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSV 272

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVP+C      Q   R  +GF   GY++ DC AV   F    Y      ++A
Sbjct: 273 MCSYNAVNGVPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAANESSASA 332

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             ++AG DI+CGT    H  +A D+G++  +DI+R ++ L++  +RLG F+G+    +Y 
Sbjct: 333 DSIRAGTDIDCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGN--SSQYR 390

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            L   DV T++   ++ +AA +G VLLKND   LPL  +++ S+A+IGP  N  +QM G 
Sbjct: 391 DLTWSDVQTTDAWNISHEAAVEGTVLLKNDGT-LPL-ADSIRSVALIGPWANATTQMQGN 448

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G      S L  LEA     HYA G  ++   + AGF +A+  A+KAD +I   G+D 
Sbjct: 449 YYGPAPYLTSPLAALEASDLDVHYAFGT-NISSTTTAGFADALAAARKADAIIFAGGIDN 507

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           T E E  DR+++  PG Q+ L+  ++   K P++++  GGG +D S  + ++ +S++LW 
Sbjct: 508 TIEGEALDRMNITWPGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWG 566

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
           GYPG++G  AL +II G   P GRL  T YP  + T+ P  DM +R + +   PG++Y +
Sbjct: 567 GYPGQSGGTALLDIIRGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGTN--PGQTYMW 624

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAP---SELTISASLKAGSDKNILQQTGSRLDYVHID 704
           YTGT VY FGHGL YT +  K  S     S   I   L A        Q    L++    
Sbjct: 625 YTGTPVYEFGHGLFYTTFEAKRASTATNHSSFNIEDLLTAPHPGYAYPQLRPFLNFT--- 681

Query: 705 EVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG-S 763
                          +TN G     +  MLFA         P K L+GFDR+  +  G S
Sbjct: 682 -------------AHITNTGRTTSDYTAMLFANTTAGPAPHPNKWLVGFDRLGALEPGAS 728

Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE-----LRHSLTIE 806
           + ++F +   + ++  ++ G R+L  G + L +       LR +LT E
Sbjct: 729 QTMTFPIT-IDNVARTDELGNRVLYPGRYELALNNERSVVLRFTLTGE 775


>gi|336471692|gb|EGO59853.1| hypothetical protein NEUTE1DRAFT_99999 [Neurospora tetrasperma FGSC
           2508]
 gi|350292807|gb|EGZ74002.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
          Length = 770

 Score =  412 bits (1060), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 260/788 (32%), Positives = 398/788 (50%), Gaps = 62/788 (7%)

Query: 21  LIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQ 80
           L+   +V+  +   P    PD  C      S   C+ +LS   RA +L++ +T +EK+Q 
Sbjct: 10  LLSCSSVLVSAIDLPFQTYPD--CVNGPLASLKVCDVTLSPPQRAAALVAAMTTEEKLQN 67

Query: 81  LSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF---NGTVSSVTSFPQVLVSAASFNR 137
           L   +   PR+G+PAY WWSE+LHG+A   PG  F   +G  ++ TSFP  L+ AA+F+ 
Sbjct: 68  LVSKSKGAPRIGLPAYNWWSEALHGVAY-APGTQFWSGDGPFNASTSFPMPLLMAATFDD 126

Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
            L   +G  +  E RA  N G +G  +W PN+N F+DPRWGRG ETPGED + +  YA  
Sbjct: 127 ELIEKVGEVIGTEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAAS 186

Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNF 257
            ++  QG              RE+RV+                A CKH  A D E W   
Sbjct: 187 MIRGLQGPA------------RERRVV----------------ATCKHYAANDFEDWNGS 218

Query: 258 SRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNE 316
           +R+ FNA +T QD  + +  PF+ C    K   IMCSYN VNGVPAC    L Q   R  
Sbjct: 219 TRHDFNAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILREH 278

Query: 317 WGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDK 373
           W +     YITSDC+AV  I    +Y +T+ +  A   +AG+D +C          A  +
Sbjct: 279 WNWTAPGNYITSDCEAVLDISANHHYAETNAEGTALAFEAGIDSSCEYESSSDIPGAWTQ 338

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIV 433
           G +++  +DRAL  ++   +R+G F+G+    +Y  LG  DV + + +++AL AA +GIV
Sbjct: 339 GLLEQSTVDRALKRIYEGLVRVGYFDGN--HSEYASLGWKDVNSPKSQEVALQAAVEGIV 396

Query: 434 LLKNDKKF-LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHY 492
           LLKNDK   L L  +  S LA+IG   N+   + GGY+G P    S +   +A       
Sbjct: 397 LLKNDKTLPLDLRTDPKSKLAMIGFWANDPKTLSGGYSGKPAFEHSPVYAAQAMGFSVTT 456

Query: 493 ASG-CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
           A G       ++D     A+  AK A++++   G D +   E +DR ++  P  Q+ L+T
Sbjct: 457 AGGPVLQNSTSNDTWTQAALEAAKDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLIT 516

Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
           ++++  K  V++ +  G  LD +   A   +++ILW  + G+ G  A+ +I+ G  NP G
Sbjct: 517 TLSKLGKPLVVVQM--GDQLDNTPLLAAKAVNAILWANWLGQDGGTAVMQILTGLKNPAG 574

Query: 612 RLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL 670
           RLP+T YP ++T  VPM DMN+R   S + PGR+YR+Y  T V  FG GL YT +  K  
Sbjct: 575 RLPVTQYPANYTAAVPMTDMNLR--PSDKLPGRTYRWYP-TAVQPFGFGLHYTTFQTKIA 631

Query: 671 SAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSH 730
                L I         +++L + G      + D     T     +++ VTN+G+    +
Sbjct: 632 VPLPRLAI---------QDLLSRCGGDNANAYPD-----TCALPPLKVEVTNSGNRSSDY 677

Query: 731 VVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLG 790
           VV+ F       +  P K L+ + R+  ++ G K  +        ++  ++ G  +L  G
Sbjct: 678 VVLAFLAGDVGPKPYPIKTLVSYTRLRDLSPGHKTTAHLKWTLGDIARYDEQGNTVLYPG 737

Query: 791 NHVLMVGE 798
            + + V E
Sbjct: 738 TYTVTVDE 745


>gi|154313073|ref|XP_001555863.1| hypothetical protein BC1G_05538 [Botryotinia fuckeliana B05.10]
          Length = 755

 Score =  412 bits (1060), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 268/753 (35%), Positives = 386/753 (51%), Gaps = 82/753 (10%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS    TRA +LISL TL EK+    + +  +PR+G+P+YEWW+E+LHGIA + PG  
Sbjct: 34  CDTSSDPYTRAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIARS-PGTT 92

Query: 115 FNGTVSSV---TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F  T S+    TSFPQ ++  A+F+  L   + + V+ EARA  N+ + GL FW PNIN 
Sbjct: 93  FAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNRFGLNFWTPNINP 152

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           ++DPRWGRGQETPGEDP   S+Y    +   QG            G  +    KG     
Sbjct: 153 YKDPRWGRGQETPGEDPFHTSSYVNALITGLQG------------GLDDLPYKKGV---- 196

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                    A CKH   YDLE      RY F+A+I  QD  D + PPF+ C        +
Sbjct: 197 ---------ATCKHFAGYDLENSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSV 247

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSA 347
           MCSYN +NGVP C    L Q   R  WG+     ++TSDCDAV  I++Y NYT T E SA
Sbjct: 248 MCSYNAMNGVPTCADDWLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTLTPEQSA 307

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  L AG D++CGT    +  SA D+G      +DR+L   ++  +RLG F+  P    Y
Sbjct: 308 ADALNAGTDLDCGTFWPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFD-PPSVQPY 366

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            +L  D+V T   ++LAL AA  GIVLLKND   LPL+ N ++++A+IGPL N   QM G
Sbjct: 367 RQLNWDNVSTPAAQQLALQAAEDGIVLLKND-GILPLSSN-ITNVALIGPLANATKQMQG 424

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G     +S L   +    K  Y  G  D+   +   F  A+  A+ AD VI V G+D
Sbjct: 425 NYYGTAPYLRSPLIAAQNAGFKVTYVQGA-DIDSQNTTDFSAAISAAQSADLVIYVGGID 483

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            + E E+      +L      L+ S               G  +D S   +++ ++++LW
Sbjct: 484 NSIEAEE------ILANLSTPLIISQM-------------GCMIDSSSLLSNTGVNALLW 524

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYR 646
            GYPG+ G  A+  I+ G   P GRLP+T YP ++  +V M DMN++   SR  PGR+Y+
Sbjct: 525 AGYPGQDGGTAIFNILTGKTAPAGRLPITQYPSNYVNQVTMTDMNLQP--SRFNPGRTYK 582

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +Y G  V+ +G+GL YT             T  A +   S  N  + +    +  +  ++
Sbjct: 583 WYNGEPVFEYGYGLQYT-------------TFDAKITPSSPNNTFEISELLANASNYKDL 629

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK-- 764
           T        + I+V+N G     +V + F          P+K L+ + R+H +  G+   
Sbjct: 630 TPFV----KIPITVSNTGTTTSDYVALFFLSGTFGPAPHPKKSLVAYTRLHDITGGANAT 685

Query: 765 -EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
            E+S  +     L+  N +G  IL  G++ ++V
Sbjct: 686 AEVSLNL---ASLARGNWNGDLILYPGDYKVVV 715


>gi|343172466|gb|AEL98937.1| beta-xylosidase, partial [Silene latifolia]
 gi|343172468|gb|AEL98938.1| beta-xylosidase, partial [Silene latifolia]
          Length = 374

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 191/399 (47%), Positives = 258/399 (64%), Gaps = 28/399 (7%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLG+  YEWWSE+LHG+++ GPG  F G   + TSFPQV+ +AASFN SLW  IG AV+ 
Sbjct: 1   RLGLQGYEWWSEALHGVSNVGPGTKFQGAFPAATSFPQVITTAASFNASLWQAIGQAVSD 60

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EARAMYN G AGLT+W+PN+NIFRDPRWGRGQETPGEDP + + YA  +V   QG     
Sbjct: 61  EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLSAQYAASYVTGLQG----- 115

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                 + G+ L ++ACCKH  AYDL+ W    R+ FNA +++Q
Sbjct: 116 ----------------------NYGNRLKVAACCKHYTAYDLDNWNGMDRFHFNAKVSKQ 153

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           D EDT+  PF++C+ +GK + +MCSYNQVNG P C   D+ +   R +W   GYI SDCD
Sbjct: 154 DLEDTYNVPFKACVLEGKVASVMCSYNQVNGKPTCADPDILRNTIRGQWHLNGYIVSDCD 213

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           +V  +++ Q+YT+T E++AA  + AG+D++CG  +  HT+ AI +G V E  +++AL N 
Sbjct: 214 SVGVLYDDQHYTRTPEEAAADTINAGLDLDCGPFLAVHTEGAIRQGLVTEAAVNQALANT 273

Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
            +VQ+RLG+F+G+P    +G LGP DVCT  H+ LAL AAR+GIVLLKN    LPL+   
Sbjct: 274 ITVQMRLGMFDGEPSAQPFGNLGPRDVCTPAHQDLALQAAREGIVLLKNQVGSLPLSTVR 333

Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV 487
             ++A+IGP     + M G Y GI C   S L+G+  Y 
Sbjct: 334 HRNIAVIGPNAQATTTMIGNYAGIACGYTSPLQGISRYA 372


>gi|421077748|ref|ZP_15538711.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           JBW45]
 gi|392524151|gb|EIW47314.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           JBW45]
          Length = 750

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 258/773 (33%), Positives = 393/773 (50%), Gaps = 112/773 (14%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           + + + +LS   RAK L+S +TL+EK+ Q+   + AIPRLG+PAY WWSE+LHG+A  G 
Sbjct: 30  FDYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVARAG- 88

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLT 163
                      T FPQ +  AA+F+  L  ++   +++E RA ++  Q         GLT
Sbjct: 89  ---------VATVFPQAIGLAATFDEKLIHDVAEVISIEGRAKFHEFQRKGDHGIYKGLT 139

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+PN+NIFRDPRWGRGQET GEDP +     V F+K  QG++              K+ 
Sbjct: 140 FWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQD--------------KKY 185

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
           L+              +AC KH   + +       R+SF+A+++ +D  +T+ P F+ C+
Sbjct: 186 LRA-------------AACAKH---FAVHSGPESERHSFDAVVSPKDLRETYLPAFKECV 229

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +M +YN+VNG P C    L ++  R EWGF G++ SDC A+    E    T +
Sbjct: 230 KEANVEAVMGAYNRVNGEPCCGSNMLLKETLRQEWGFTGHVVSDCWAIKDFHENHRVTSS 289

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
             +S A  L  G D+NCG  M  +   A  +G V E+ I+ A+  L   +++LGLF+   
Sbjct: 290 APESVALALNNGCDLNCGN-MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFD-TA 347

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
               Y  +G       EH++ AL+ +++ +VLLKN+   LPL++N +SS+A+IGP  N+ 
Sbjct: 348 ENVPYTNIGFHQNDCQEHREFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSR 407

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGCH---DVPCNSDAG---FHEAVRI 513
             + G Y G   +  ++L G+   V K     YA GCH   D   N       F EAV  
Sbjct: 408 EALTGNYCGTASNYITVLEGIREAVGKDTIVSYAQGCHLYRDKAENLGEARDRFAEAVST 467

Query: 514 AKKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           A++AD V++  GLD + E E+          D++ L LPG Q  L+  + +T K P+ILV
Sbjct: 468 AERADIVVMCMGLDASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILV 526

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
           L  G  L V++A    ++ +I+   YPG  G KALA  IFG+++P G+LP+T+Y  +   
Sbjct: 527 LLAGSALAVTWAA--EKVPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTTEEL 584

Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
               D +M+         R+YR+ T   +Y FG+GL YT ++Y+ L   +   ISA    
Sbjct: 585 PEFTDYSMK--------NRTYRYMTKEALYPFGYGLGYTTFAYRQLQL-NRTQISAG--- 632

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
                               E   C+ L       V N G+      V L+ +  K S  
Sbjct: 633 --------------------ENVQCSVL-------VKNTGNFASDETVQLYIKDVKASVE 665

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            P  +L G  +VH +    +E+ F + P  QL++ N+ G  IL  G   + VG
Sbjct: 666 VPILELQGIQKVHLLPGTEQEVFFTLTP-RQLALINEEGNCILEPGAFEIYVG 717


>gi|421060771|ref|ZP_15523202.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           B3]
 gi|421065248|ref|ZP_15527033.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A12]
 gi|421073214|ref|ZP_15534285.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A11]
 gi|392444242|gb|EIW21677.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A11]
 gi|392454445|gb|EIW31278.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           B3]
 gi|392459366|gb|EIW35779.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A12]
          Length = 724

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 258/777 (33%), Positives = 397/777 (51%), Gaps = 114/777 (14%)

Query: 49  FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIAS 108
            + + + + +LS   RAK L+S +TL+EK+ Q+   + AIPRLG+PAY WWSE+LHG+A 
Sbjct: 1   MEIFAYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVAR 60

Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------A 160
            G            T FPQ +  AA+F+  L  N+   +++E RA ++  Q         
Sbjct: 61  AG----------VATVFPQAIGLAATFDEKLIFNVAEVISIEGRAKFHEFQRKGDHGIYK 110

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GLTFW+PN+NIFRDPRWGRGQET GEDP +     V F+K  QG++              
Sbjct: 111 GLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQD-------------- 156

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
           K+ L+              +AC KH   + +       R+SF+A+++ +D  +T+ P F+
Sbjct: 157 KKYLRA-------------AACAKH---FAVHSGPESERHSFDAVVSPKDLRETYLPAFK 200

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
            C+++     +M +YN+VNG P C    L ++  R EWGF G++ SDC A+    E    
Sbjct: 201 ECVKEANVEAVMGAYNRVNGEPCCGSNMLLKETLRREWGFTGHVVSDCWAIKDFHENHRV 260

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           T +  +S A  L  G D+NCG  M  +   A  +G V E+ I+ A+  L   +++LGLF+
Sbjct: 261 TSSAPESVAMALNNGCDLNCGN-MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFD 319

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
                  Y K+G       EH++ AL+ +++ +VLLKN+   LPL++N +SS+A+IGP  
Sbjct: 320 -TAENVPYTKIGFHQNDCQEHREFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNA 378

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGCH---DVPCNSDAG---FHEA 510
           N+   + G Y G   +  ++L G+   V K     YA GCH   D   N       F EA
Sbjct: 379 NSREALTGNYCGTASNYITVLEGIREAVGKDTMVSYAQGCHLYRDKAENLGEARDRFAEA 438

Query: 511 VRIAKKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPV 561
           V  A++AD V++  GLD + E E+          D++ L LPG Q  L+  + +T K P+
Sbjct: 439 VSTAERADIVVMCMGLDASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PI 497

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           ILVL  G  L V++A    +I +I+   YPG  G KALA  IFG+++P G+LP+T+Y  +
Sbjct: 498 ILVLLAGSALAVTWAA--EKIPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTT 555

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISA 680
                  D +M+         R+YR+ T   +Y FG+GL YT ++Y+ L    +++++  
Sbjct: 556 EELPEFTDYSMK--------NRTYRYMTKEALYPFGYGLGYTTFAYRQLQLNRTQISVGE 607

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
           +++           GS L                     V N G+      V L+ +  K
Sbjct: 608 NVQ-----------GSVL---------------------VKNTGNFASDETVQLYIKDVK 635

Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            S   P   L G  +VH +    +E+ F + P  QL++ N+ G  IL  G   + VG
Sbjct: 636 ASVEVPIWALQGIQKVHLLPGTEQEVFFTLTP-RQLALINEEGNCILEPGVFEIYVG 691


>gi|23304843|emb|CAD48309.1| beta-xylosidase B [Clostridium stercorarium]
          Length = 715

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 264/773 (34%), Positives = 397/773 (51%), Gaps = 117/773 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + S S   RAK L+S +T++EK+ Q+  N+ AI RLGIPAY WW+E+LHG+A  G   
Sbjct: 7   YLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAG--- 63

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                  + T FPQ +  AA+F+  L   +   ++ E RA Y+            GLTFW
Sbjct: 64  -------TATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTFW 116

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP + +   V FVK  QG +               + LK
Sbjct: 117 SPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGNH--------------PKYLK 162

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
             G              CK+++ + +       R+ FNA+++++D  +T+ P F++ +++
Sbjct: 163 AGG-------------MCKNILPFTVVP--ESLRHEFNAVVSKKDLYETYLPAFKALVQE 207

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
            K   +M +YN+ NG P C    L     R EWGFKG++ SDC A+     + + T T  
Sbjct: 208 AKVESVMGAYNRTNGEPCCGSKTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAP 267

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR- 403
           +SAA  ++ G D+NCG  M  +   A+ +G + E++IDRA+  L   +++LG+F  DP  
Sbjct: 268 ESAALAVRNGCDLNCGN-MFGNLLIALKEGLITEEEIDRAVTRLMITRMKLGMF--DPED 324

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
           +  Y  +     C  EH++LALD A++ IVLLKND   LPL++  + S+A+IGP  ++  
Sbjct: 325 QVPYASISSFVDC-KEHRELALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQ 382

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH----DVPCNSDAG--FHEAVRIA 514
            + G Y G      ++L G+        + +Y+ GCH     V    + G    EAV  A
Sbjct: 383 ALIGNYEGTASEYVTVLDGIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCA 442

Query: 515 KKADFVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
           + AD VI+  GLD T E E+          D+  L LPGQQ  L+ +V  T K P++LVL
Sbjct: 443 EHADVVIMCLGLDSTIEGEEMHESNIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLVL 501

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
             G  L V++  AD  I +IL   YPG  G +A+A ++FG+ NP G+LP+T+Y  +    
Sbjct: 502 LTGSALAVTW--ADEHIPAILNAWYPGALGGRAIASVLFGETNPSGKLPVTFYRTTEELP 559

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA-SLKA 684
              D +M          R+YRF     +Y FG GLSYT + Y      S+L +S  +++A
Sbjct: 560 DFTDYSME--------NRTYRFMKNEALYPFGFGLSYTTFDY------SDLKLSKDTIRA 605

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
           G                            F+V + VTN G + G  VV ++ +  + S  
Sbjct: 606 GEG--------------------------FNVSVKVTNTGKMAGEEVVQVYIKDLEASWR 639

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            P  QL G  RV   +  + EI+F + P EQL++    G+ ++  G   + VG
Sbjct: 640 VPNWQLSGMKRVRLESGETAEITFEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691


>gi|115397385|ref|XP_001214284.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
 gi|114192475|gb|EAU34175.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
          Length = 776

 Score =  410 bits (1055), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 267/751 (35%), Positives = 391/751 (52%), Gaps = 66/751 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+ S     RA +L+S+ TL+E +    +  + +PRLG+P Y+ WSESLHG+       N
Sbjct: 81  CDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGVYR----AN 136

Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           +   G  S  TSFPQ +++ A+ NR+L   IG  ++ +ARA  N+G+ GL  +APNIN F
Sbjct: 137 WASEGDYSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGLDTYAPNINSF 196

Query: 173 RDPRWGRGQETPGEDPM-VVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R P WGRGQETPGED   + S YA E++   QG                     G   E+
Sbjct: 197 RHPVWGRGQETPGEDAYYLASTYAYEYITGIQG---------------------GVDPET 235

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                L L A  KH   YD+E W   SR   +  IT+QD  + + P F       K   +
Sbjct: 236 -----LKLVATAKHYAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSV 290

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           MCSYN VNGVP+C      Q   R  +GF   GY++ DC AV   F    Y      ++A
Sbjct: 291 MCSYNAVNGVPSCSNSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAANESSASA 350

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             ++AG DI+CGT    H  +A D+G++  +DI+R ++ L++  +RLG F+G+    +Y 
Sbjct: 351 DSIRAGTDIDCGTSYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGN--SSQYR 408

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            L   DV T++   ++ +AA +G VLLKND   LPL  +++ S+A+IGP  N  +QM G 
Sbjct: 409 DLTWSDVQTTDAWNISHEAAVEGTVLLKNDGT-LPL-ADSIRSVALIGPWANATTQMQGN 466

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G      S L  LEA     HYA G  ++   + AGF +A+  A+KAD +I   G+D 
Sbjct: 467 YYGPAPYLTSPLAALEASDLDVHYAFGT-NISSTTTAGFADALAAARKADAIIFAGGIDN 525

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           T E E  DR+++  PG Q+ L+  ++   K P++++  GGG +D S  + ++ +S++LW 
Sbjct: 526 TIEGEALDRMNITWPGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWG 584

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRF 647
           GYPG++G  AL +II G   P GRL  T YP  + T+ P  DM +R + +   PG++Y +
Sbjct: 585 GYPGQSGGTALLDIIRGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGTN--PGQTYMW 642

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAP---SELTISASLKAGSDKNILQQTGSRLDYVHID 704
           YTGT VY FGHGL YT +  K  S     S   I   L A        Q    L++    
Sbjct: 643 YTGTPVYEFGHGLFYTTFEAKRASTATNHSSFNIEDLLTAPHPGYAYPQLRPFLNFT--- 699

Query: 705 EVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG-S 763
                          +TN G     +  MLFA         P K L+GFDR+  +  G S
Sbjct: 700 -------------AHITNTGRTTSDYTAMLFANTTAGPAPHPNKWLVGFDRLGALEPGAS 746

Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVL 794
           + ++F +   + ++  ++ G R+L  G   L
Sbjct: 747 QTMTFPIT-IDNVARTDELGNRVLYPGRRCL 776


>gi|358365439|dbj|GAA82061.1| beta-xylosidase [Aspergillus kawachii IFO 4308]
          Length = 788

 Score =  410 bits (1054), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 266/759 (35%), Positives = 388/759 (51%), Gaps = 69/759 (9%)

Query: 63  TRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN--GTVS 120
           T   SLISL TL E I    +    + RLG+PAY+ WSE+LHG+       NF+  G+ +
Sbjct: 61  TEQHSLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLDR----ANFSDSGSYN 116

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
             TSFPQ +++ A+ NR+L   I S ++ + RA  N G+ GL  +APNIN FR P WGRG
Sbjct: 117 WATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPNINTFRHPVWGRG 176

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
           QETPGED  + + YA E++   QG                           D    L L+
Sbjct: 177 QETPGEDVSLAAVYAYEYITGIQG--------------------------PDPDSNLKLA 210

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           A  KH   YD+E W N SR   +  IT+QD  + + P F       K   +MC+YN VNG
Sbjct: 211 ATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNG 270

Query: 301 VPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
           VPAC      Q   R+ +GF   GY++SDCDA   I+    Y  +   +AA  + AG DI
Sbjct: 271 VPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAAAAAEAILAGTDI 330

Query: 358 NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK--YGKLGPDDV 415
           +CGT    H   +I  G +   DI++ ++ L++  ++ G F+ +  K    Y  L   DV
Sbjct: 331 DCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDV 390

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV----SSLAIIGPLVNNISQMGGGYTG 471
             ++   ++  AA QGIVLLKN    LPL + A     +++A+IGP  N  +Q+ G Y G
Sbjct: 391 LETDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYG 450

Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
                 S     E    K ++A G   +   S +GF  A+  A+ AD +I   G+D T E
Sbjct: 451 NAPYMISPRAAFEEAGYKVNFAEGT-GISSTSTSGFAAALSAARSADVIIYAGGIDNTLE 509

Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
            E  DR S+  PG Q+ L+  +A ++  +P+I++  GGG +D S  + ++ ++++LW GY
Sbjct: 510 AEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGY 569

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYT 649
           PG++G  AL +II G  NP GRL  T YP S+  + P  DMN+R +     PG++Y++YT
Sbjct: 570 PGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGDN--PGQTYKWYT 627

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
           G  VY FGHGL YT +     +  S  T +  +K  + ++IL QT   L       +T  
Sbjct: 628 GEAVYEFGHGLFYTTF-----AESSSNTTTKEVKL-NIQDILSQTHEEL-----ASITQL 676

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SKEIS 767
             L F   I   N G ++  +  M+FA          P K L+G+DR+  V  G ++E+ 
Sbjct: 677 PVLNFTANIK--NTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELR 734

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
               P E  S A     R+   G+ VL  G    +L +E
Sbjct: 735 V---PVEVGSFA-----RVNEDGDWVLFPGTFELALNLE 765


>gi|358393086|gb|EHK42487.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
           206040]
          Length = 794

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 267/752 (35%), Positives = 398/752 (52%), Gaps = 64/752 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+++     RA++LISL TL+E I    ++   +PRLG+P Y+ W+E+LHG+       N
Sbjct: 64  CDSTAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDR----AN 119

Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F    G     TSFP  ++S A+ NR+L   I   ++ +ARA  N G+ GL  +APNIN 
Sbjct: 120 FATKGGEFEWGTSFPMPILSMAALNRTLIHQIADIISTQARAFSNNGRYGLDVYAPNING 179

Query: 172 FRDPRWGRGQETPGEDPMVV-SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           FR P WGRGQETPGED  V+ SAY  E++   QG                     G   E
Sbjct: 180 FRSPLWGRGQETPGEDANVLTSAYTYEYITGMQG---------------------GVDPE 218

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
           +     L ++A  KH   YDLE + N SR  F+A+IT+QD  + + P F +     K+  
Sbjct: 219 N-----LKIAATAKHFAGYDLENYNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHS 273

Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHEDSA 347
            MC+YN VNGVP+C      Q   R  WGF   GY++SDCDA+  ++   NY  +   +A
Sbjct: 274 FMCAYNSVNGVPSCSNSFFLQTLLRESWGFPEYGYVSSDCDAIYNVWNPHNYANSQSSAA 333

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  LKAG DI+CG     H   +   G V   +I+R++  L++  +RLG F+   +K +Y
Sbjct: 334 ADSLKAGTDIDCGQTYPWHLNESFVAGTVSRGEIERSVTRLYANLVRLGYFD---KKNEY 390

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             LG  DV  ++   ++ +AA +GIVLLKND   LPL+K  V S+A+IGP VN   Q+ G
Sbjct: 391 RSLGWKDVVKTDAWNISYEAAVEGIVLLKNDGT-LPLSKK-VRSIALIGPWVNATEQLQG 448

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G      S L+  +    + +Y  G   +   + AGF +A+  AKK+D +I + G+D
Sbjct: 449 NYFGTAPYLISPLQAAKKAGYEVNYELGT-GINNQTTAGFAKAIAAAKKSDAIIFIGGID 507

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T E E  DR  +  PG Q+ L+  ++   K P++++  GGG +D S  +++ +++S++W
Sbjct: 508 NTIEQEGADRTDIAWPGNQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSIKSNKKVNSLVW 566

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
            GYPG++G  AL +I+ G   P GRL  T YP  +  +   NDMN+R D  +  PG++Y 
Sbjct: 567 GGYPGQSGGYALFDILSGKRAPAGRLVSTQYPAEYVHQFAQNDMNLRPDGKKN-PGQTYI 625

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +YTG  VY FG GL YT +  + L   S L  +AS   G+               H    
Sbjct: 626 WYTGKPVYQFGDGLFYTTFK-ETLGKQSTLKFNASQILGAG--------------HPGYT 670

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG-SK 764
            S  +  F    ++ N+G     +  M F R      +  P K L+GFDR+ T+  G S 
Sbjct: 671 YSEQTPVFTFTANIQNSGKTASPYSAMAFVRTSNAGPKPYPNKWLVGFDRLATIKPGHSS 730

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
            +S  + P   LS  + +G +I+  G + L++
Sbjct: 731 TLSIPI-PLNALSRVDSNGNKIVYPGKYELVL 761


>gi|323447708|gb|EGB03620.1| hypothetical protein AURANDRAFT_72703 [Aureococcus anophagefferens]
          Length = 744

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 253/756 (33%), Positives = 376/756 (49%), Gaps = 119/756 (15%)

Query: 43  PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSES 102
           PC    F++ PFC+ +L+I  RA   +S +T+ EKI  L      I  LG+PAY WWSE+
Sbjct: 28  PCLNATFEALPFCDATLAIDLRAADAVSRMTIPEKIDALDTKTGPIASLGLPAYNWWSEA 87

Query: 103 LHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGL 162
             G+  + P           T F   + +A SFNR+LW   G+A+  EARA+ N G A  
Sbjct: 88  SSGVMGSRP----------TTKFAYPVTTAMSFNRTLWRATGAAIGREARALMNAGAAYS 137

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           T+WAP +N+ R+PRWGR  E PGEDP +   YA EFV  FQ                   
Sbjct: 138 TYWAPVVNLAREPRWGRNIEVPGEDPYLTGEYATEFVGGFQAA----------------- 180

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKW-----GNFSRYSFNAMITEQDTEDTFQP 277
                    +    L  SACCKH +A +LE         + R   ++ +T++D  D++  
Sbjct: 181 --------PEDPYHLQASACCKHYVANELENTRQPDGEQWDRQHVDSNVTQRDLVDSYMV 232

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEY 336
           PF++C+E+GK S +MCSYN VNGVP+C    L +  AR+ W F GYITSDCDA + +++ 
Sbjct: 233 PFQACVEKGKVSSLMCSYNAVNGVPSCANDWLLRTVARDAWHFDGYITSDCDADSNVYDA 292

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
            +Y  T E++ A VLKAG D++C + + +H +SA+DKG + E D+D  L+NLF V+LRLG
Sbjct: 293 HHYAATPEEAVADVLKAGTDVDCQSFVGQHARSALDKGLITEADMDARLVNLFKVRLRLG 352

Query: 397 LFN-----GDPRKGKYGKLGPDD-VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVS 450
            F+       PR G   ++  D  VC+  H   +++   Q   LLKND   LPL  +  +
Sbjct: 353 HFDLSFDAAKPR-GPLDEIDADAVVCSDAHLDASMEGLAQSATLLKNDGA-LPLKPSGTA 410

Query: 451 SLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEA 510
             A++GP                           A +SK             +DAG++  
Sbjct: 411 --AVVGP--------------------------NALLSK-------------ADAGYYGP 429

Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
                 AD V++  G DLT   E +D  S++    Q+ L+ +VA  S  PV++V+    P
Sbjct: 430 T---DAADAVVLAVGTDLTWAAEGKDATSIVFTAAQLELIDAVATASATPVVVVVFSATP 486

Query: 571 LDVS--FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
           LD++   A +D ++ +++ +G P     K L ++++G  +  GR   T YP ++  ++ +
Sbjct: 487 LDLTPLLARSDGKVGAVVHVGQP-SVTVKGLGDLLYGRRSFAGRAVQTVYPAAYADQISI 545

Query: 628 NDMNMRADSSR----------------QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS 671
            D NMR   S                   PGR+YRFY    V  FG GLSYT ++Y   S
Sbjct: 546 FDFNMRPGPSAFARPDCATNESACPRGTNPGRTYRFYVDEPVVPFGFGLSYTTFAYAVRS 605

Query: 672 APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
           AP+ + ++    A +     +  G        D+  + T       + VTN GD+D   V
Sbjct: 606 APTTVDLAPLRAAYAGVAAARGDGGPAFLSLHDDAAAAT-----YAVDVTNTGDIDADDV 660

Query: 732 VMLFARVPKVS-QGTPEKQLIGFDRVHTVAKGSKEI 766
           V+ F   P     G P K+L GF+RVH  A  +K +
Sbjct: 661 VLGFVTPPGAGVDGVPLKELFGFERVHVKAGETKTV 696


>gi|343428088|emb|CBQ71612.1| related to Beta-xylosidase [Sporisorium reilianum SRZ2]
          Length = 698

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 244/659 (37%), Positives = 353/659 (53%), Gaps = 51/659 (7%)

Query: 42  FP-CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           FP C      + P C+TSL   TRA SL++  T  E I    ++A  +PRLGIP Y+WW+
Sbjct: 21  FPNCSTLPLSTLPVCDTSLDFYTRATSLVAQFTTAELINNTVNHAPGVPRLGIPQYQWWT 80

Query: 101 ESLHGIASNGPGVNFN----GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN 156
           E+LHG+A + PGVNFN    G     TSFPQV+   A+F+ +L+  + + +A E RA  N
Sbjct: 81  EALHGVARS-PGVNFNPDAAGEFGCATSFPQVINLGATFDDALYEAVAAHIANETRAFSN 139

Query: 157 LGQAGLTFWAP-NINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
            G+AGL  ++P NIN FRDPRWGRGQET GEDP+ +S YAV  V+  QG           
Sbjct: 140 AGRAGLNMYSPLNINAFRDPRWGRGQETVGEDPLHLSRYAVRVVRGLQGP---------- 189

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                         + +    L L+A CKH +AYDLE      RY F+A+++ QD  D  
Sbjct: 190 ------------AAQDEANPRLTLAATCKHYLAYDLEASAGVERYQFDALVSNQDLADLH 237

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF---KGYITSDCDAVA 331
            P FR+C+  G A+ +M SYN VNGVP        +  AR+ WG      Y+TSDCDAVA
Sbjct: 238 LPQFRACVRDGGATTLMTSYNAVNGVPPSASKYYLETLARDTWGLDKHHNYVTSDCDAVA 297

Query: 332 TIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
            +++  +Y   +  +AA  L AG D++CG        +A+ +       I RA+  ++  
Sbjct: 298 NVYDAHHYAADYVHAAAASLNAGTDLDCGATYRDSLAAALAQNLTDVATIRRAVTRMYGS 357

Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
            +RLG F+    +    +LG  DV     +KLA +AA   I LLKN +  LPL + A  +
Sbjct: 358 LVRLGYFDAAEAQ-PLRQLGWKDVNAPAAQKLAYEAAAASITLLKNRQSTLPLRETAGKT 416

Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN--SDAGFHE 509
           +A+IGP  N    + G Y G    P  L+  +  + +     S  H V  N  S AG ++
Sbjct: 417 IALIGPYTNATFALRGNYAG----PSPLV--ITPFDAARRTFSDAHIVSANGTSIAGPYD 470

Query: 510 ------AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
                 A+  AK AD ++   G+D T E E  DR  +  P  Q+ L+  +A   K  +++
Sbjct: 471 TATASAALATAKSADIIVYAGGIDPTVEGESLDRRDIAWPANQLRLIQELAALGKV-LVV 529

Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
           V  GGG +D +  + D  + +++W GYPG++GA AL +I+ G   P GRLP+T YP ++T
Sbjct: 530 VQFGGGQVDGALLKGDDGVGALVWAGYPGQSGALALMDILAGKRAPAGRLPITQYPANYT 589

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASL 682
              + +  M    +  YPGR+Y++YTGT  + FG GL YT +    ++ P+  TI   L
Sbjct: 590 HA-LRETTMALRPTATYPGRTYKWYTGTPTFPFGFGLHYTTFRAS-IAPPATYTIPPPL 646


>gi|358385386|gb|EHK22983.1| glycoside hydrolase family 3 protein [Trichoderma virens Gv29-8]
          Length = 795

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 268/752 (35%), Positives = 395/752 (52%), Gaps = 63/752 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C++S   + RA++LISL TL+E I    ++   +PRLG+P Y+ W+E+LHG+       N
Sbjct: 63  CDSSAGYAERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDR----AN 118

Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F    G     TSFP  ++S A+ NR+L   I   ++ +ARA  N G+ GL  +APNIN 
Sbjct: 119 FATKGGQFQWATSFPMPILSMAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNING 178

Query: 172 FRDPRWGRGQETPGEDPMVV-SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           FR P WGRGQETPGED  V+ SAY  E++   QG                     G   E
Sbjct: 179 FRSPLWGRGQETPGEDANVLTSAYTYEYITGMQG---------------------GVDPE 217

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
           +     L ++A  KH   YDLE W N SR  F+A+IT+QD  + + P F +     K+  
Sbjct: 218 N-----LKIAATAKHFAGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHS 272

Query: 291 IMCSYNQVNGVPACLRGDLFQK-ARNEWGFK--GYITSDCDAVATIFEYQNYTKTHEDSA 347
            MC+YN VNGVP+C      Q   R  WGF   GY++SDCDAV  ++   +Y      +A
Sbjct: 273 FMCAYNSVNGVPSCANSFFLQTLLRESWGFPEWGYVSSDCDAVYNVWNPHDYASNQSSAA 332

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  L+AG DI+CG     H   +   G+V   +I+R++  L++  +RLG F+   +K +Y
Sbjct: 333 ASSLRAGTDIDCGQTYPWHLNESFVAGEVSRGEIERSVTRLYANLVRLGYFD---KKNEY 389

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             LG  DV  ++   ++ +AA +GIVLLKND   LPL+K  V S+A+IGP  N  +QM G
Sbjct: 390 RSLGWKDVVKTDAWNISYEAAVEGIVLLKNDGT-LPLSKK-VRSIALIGPWANATTQMQG 447

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G      S L   +    + ++  G  +    S AGF +A+  AKK+D +I   G+D
Sbjct: 448 NYFGAAPYLISPLEAAKKAGYQVNFELGT-ETASTSTAGFAKAIAAAKKSDAIIFAGGID 506

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T E E  DR  +  PG Q+ L+  ++   K P++++  GGG +D S  +++ +++S++W
Sbjct: 507 NTVEQEGADRTDIAWPGNQLDLIKQLSELGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVW 565

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYR 646
            GYPG++G  AL +I+ G   P GRL  T YP  +  + P NDMN+R D  +  PG++Y 
Sbjct: 566 GGYPGQSGGVALFDILSGKRAPAGRLVSTQYPADYVHQFPQNDMNLRPD-GKSNPGQTYI 624

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           +YTG  VY FG G+ YT +                  +GS K +     S L   H    
Sbjct: 625 WYTGKPVYQFGDGIFYTTFKETL--------------SGSSKGLKFNVSSVLAAPHPGYT 670

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SK 764
            S  +       ++ N+G  D  +  MLF R         P K L+GFDR+ T+  G S 
Sbjct: 671 YSEQTPVLTFTANIENSGKTDSPYSAMLFVRTANAGPAPYPNKWLVGFDRLATIKPGHSS 730

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           ++S  + P   L+  +  G RI+  G + L +
Sbjct: 731 KLSIPI-PVSALARVDSLGNRIVYPGKYELAL 761


>gi|413919686|gb|AFW59618.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 475

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 213/457 (46%), Positives = 292/457 (63%), Gaps = 12/457 (2%)

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
           V  AG+D+NCGT + +HT +A+  GK+ E D+DRA+ N     +RLG F+GDPR+  +G 
Sbjct: 28  VAAAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGN 87

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LGP DVCT  +++LA +AARQGIVLLKN  K LPL+  ++ S+A+IGP  N    M G Y
Sbjct: 88  LGPSDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSMAVIGPNANASFTMIGNY 146

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVIVVAGLDL 528
            G PC   + L+GL A V+ T Y  GC +V C+ ++     A + A  AD  ++V G D 
Sbjct: 147 EGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQ 205

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           + E E  DR SLLLPGQQ  LV++VA  S  P ILV+  GGP D+SFA++  +I++ILW+
Sbjct: 206 SIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWV 265

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
           GYPGEAG  A+A+++FG  NP GRLP+TWYPESFTKVPM DM MR D S  YPGR+YRFY
Sbjct: 266 GYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMRMRPDPSTGYPGRTYRFY 325

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
           TG  VY FG GLSYT++++  +SAP +L    +L+       L +    ++     E   
Sbjct: 326 TGDTVYAFGDGLSYTSFAHHLVSAPKQL----ALQLAEGHACLTEQCPSVEA----EGAH 377

Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
           C  L F V + V NAG+  G H V LF+  P V    P K L+GF++V      +  ++F
Sbjct: 378 CEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAV-HNAPAKHLLGFEKVSLEPGQAGVVAF 436

Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
            VD C+ LS+ ++ G R + LG+H L VG+L+H+L +
Sbjct: 437 KVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNL 473


>gi|329745495|gb|AEB98984.1| xylosidase precursor [synthetic construct]
          Length = 804

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 272/785 (34%), Positives = 398/785 (50%), Gaps = 71/785 (9%)

Query: 37  LNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY 96
           L+ PD  C+     S+  C+ S +   RA SLISL TL E I    +    + RLG+P Y
Sbjct: 53  LSSPD--CQNGPLRSHLICDESATPYDRAASLISLFTLDELIANTGNTGLGVSRLGLPVY 110

Query: 97  EWWSESLHGIASNGPGVNFN--GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
           + WSE+LHG+       NF+  G+ +  TSFPQ +++ A+ NR+L   I S ++ + RA 
Sbjct: 111 QVWSEALHGLDR----ANFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAF 166

Query: 155 YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
            N G+ GL  +APNIN FR P  GRGQETPGED  + + YA E++   QG          
Sbjct: 167 NNAGRYGLDVYAPNINTFRHPVRGRGQETPGEDVSLAAVYAYEYITGIQG---------- 216

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDT 274
                            D    L L+A  KH   YD+E W N SR   +  IT+QD  + 
Sbjct: 217 ----------------PDPDSNLKLAATAKHYAGYDIENWHNHSRLGNDMNITQQDLSEY 260

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVA 331
           + P F       K   +MC+YN VNGVPAC      Q   R+ +GF   GY++SDCDA  
Sbjct: 261 YTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDHGYVSSDCDAAY 320

Query: 332 TIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
            I+    Y  +   +AA  + AG DI+CGT    H   +I  G +   DI++ ++ L++ 
Sbjct: 321 NIYNPHGYASSQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRDDIEKGVIRLYTT 380

Query: 392 QLRLGLFNGDPRKGK--YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
            ++ G F+ +  K    Y  L   DV  ++   ++  AA QGIVLLKN  K LPL + A 
Sbjct: 381 LVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKNSNKVLPLTEKAY 440

Query: 450 ----SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA 505
               +++A+IGP  N  +Q+ G Y G      S     E      ++A     +   + +
Sbjct: 441 PPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRVAFEEAGYNVNFAERT-GISSTNTS 499

Query: 506 GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-KRPVILV 564
           GF  A+  A+ AD +I   G+D T E E  DR S+  PG Q+ L+  +A ++  +P+I++
Sbjct: 500 GFAAALSAAQSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQKLASSAGSKPLIVL 559

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT- 623
             GGG +D S  + ++ +S++LW GYPG++G  AL +II G  NP GRL  T YP S+  
Sbjct: 560 QMGGGQVDSSSLKNNTNVSALLWGGYPGQSGGFALRDIITGKKNPAGRLVTTQYPASYAE 619

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
           + P  DMN+R +     PG++Y++YTG  VY FGHGL YT +     +  S  T +  +K
Sbjct: 620 EFPATDMNLRPEGDN--PGQTYKWYTGEAVYEFGHGLFYTTF-----AESSSNTTTREIK 672

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
             + ++IL QT   L       +T    L F   I   N G V+  +  M+FA       
Sbjct: 673 L-NIQDILSQTHEDL-----ASITQLPVLNFTANIK--NTGKVESDYTAMVFANTSDAGP 724

Query: 744 GT-PEKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
              P K L+G+DR+  V  G ++E+     P E  S A     R+   G+ VL  G    
Sbjct: 725 APYPVKWLVGWDRLGEVKVGETRELRV---PVEVGSFA-----RVNEDGDWVLFPGTFEL 776

Query: 802 SLTIE 806
           +L +E
Sbjct: 777 ALNLE 781


>gi|392962219|ref|ZP_10327666.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           DSM 17108]
 gi|392452977|gb|EIW29882.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           DSM 17108]
          Length = 724

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 253/776 (32%), Positives = 389/776 (50%), Gaps = 112/776 (14%)

Query: 49  FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIAS 108
            + + + + +LS   RAK L+S +T++EK+ Q+  ++ AI RLGIPAY WWSE+LHG+A 
Sbjct: 1   MEIFDYQDETLSFEQRAKDLVSRMTIEEKVTQMVYSSPAISRLGIPAYNWWSEALHGVAR 60

Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------A 160
            G            T FPQ +  AA+F+  L  ++   +++EARA ++  Q         
Sbjct: 61  AG----------VATVFPQAIGLAATFDEKLIYDVAEIISIEARAKFHEFQRKGDHGIYK 110

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GLTFW+PN+NIFRDPRWGRGQET GEDP +     V F+K  QG++              
Sbjct: 111 GLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQD-------------- 156

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
           K+ L+              +AC KH   + +       R+ F+A+++ +D  +T+ P F+
Sbjct: 157 KKYLRA-------------AACAKH---FAVHSGPESERHRFDAVVSPKDLRETYLPAFK 200

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
            C+++     +M +YN+VNG P C    L ++  R EWGF G++ SDC A+    E    
Sbjct: 201 ECVKEANVEAVMGAYNRVNGEPCCGSNILLKETLRQEWGFTGHVVSDCWAIKDFHENHRV 260

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           T +  +S A  L  G D+NCG  M  +   A  +G V E+ I+ A+  L   +++LGLF+
Sbjct: 261 TSSAPESVALALNNGCDLNCGN-MYLNLLIAYQEGLVTEEAINTAVTRLMLTRMKLGLFD 319

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
                  Y  +G       EH++ AL+ +++ +VLLKN+   LPL++N +SS+A+IGP  
Sbjct: 320 A-AENVPYTNIGFHQNDCQEHREFALEVSKKTLVLLKNENHLLPLDRNTISSIAVIGPNA 378

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGCH---DVPCN---SDAGFHEA 510
           N+   + G Y G   +  ++L G+   V K     YA GCH   D   N       F EA
Sbjct: 379 NSREALTGNYFGTASNYITVLEGIREAVGKDTMVSYAQGCHLYRDKAENLGEERDRFAEA 438

Query: 511 VRIAKKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPV 561
           V  A++AD V++  GLD + E E+          D++ L LPG Q  L+  + +T K P+
Sbjct: 439 VSTAERADLVVMCMGLDASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYKTGK-PI 497

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           ILVL  G  L V++A    ++ +I+   YPG  G KALA  IFG+++P G+LP+T+Y  +
Sbjct: 498 ILVLLAGSALAVTWAA--EKVPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYRTT 555

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
                  D +M+         R+YR+ T   +Y FG+GL YT ++Y+ L           
Sbjct: 556 EELPEFTDYSMK--------NRTYRYMTKEALYPFGYGLGYTTFAYRQL----------- 596

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
                                ++    C        I V N G+      V L+ +  K 
Sbjct: 597 --------------------QLNRTKICAGENVQCSILVKNTGNFASDETVQLYIKDVKA 636

Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           S   P   L G  ++H +    +EISF +    QL++ N+ G  IL  G   + VG
Sbjct: 637 SVEVPIWALQGIQKIHLLPGAEQEISFTLTS-RQLALINEKGNCILEPGIFEIYVG 691


>gi|255957137|ref|XP_002569321.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591032|emb|CAP97251.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 791

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 251/754 (33%), Positives = 388/754 (51%), Gaps = 53/754 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+T+     RA +LI++ T +E +    +   AIPRLG+P Y+ W+E+LHG+  +   + 
Sbjct: 61  CDTTAKPHDRAAALIAMFTFEELVNSTGNVMPAIPRLGLPPYQVWNEALHGL--DRANLT 118

Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
             G  S  TSFP  +++ A+ NR+L + IG  V+ + RA  N G+ GL  ++PNIN FR 
Sbjct: 119 EFGDYSWATSFPSPILTMAALNRTLINQIGGIVSTQGRAFNNGGRYGLDVYSPNINSFRH 178

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           P WGRGQETPGED  + S Y +E++   QG            G   K             
Sbjct: 179 PVWGRGQETPGEDIQLCSVYGLEYITGLQG------------GLDPK------------- 213

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
            EL L+A  KH   YD+E WGN SR   +  I+  D    + P F + +   +   +M S
Sbjct: 214 -ELKLAATAKHFAGYDIENWGNHSRLGNDMSISAFDFASYYAPQFVTAVRDARVHSVMAS 272

Query: 295 YNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
           YN VNGVPA     L Q   R+ W F   GY++SDCD+V  +F    Y  +   +AA  +
Sbjct: 273 YNAVNGVPASANSFLLQTLLRDTWNFVEDGYVSSDCDSVYNVFNPHGYASSASLAAAKSI 332

Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
           +AG DI+CG     +   +  +G++   +I+RA    +S  + LG F+GD    KY  L 
Sbjct: 333 QAGTDIDCGATYQLYLNQSFTQGEISRSEIERAATRFYSNLVSLGYFDGD--NSKYRDLD 390

Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
             DV  ++   ++ +AA +GIVLLKND   LPL+K+   S+A+IGP  N  + M G Y G
Sbjct: 391 WSDVVATDAWNISYEAAVEGIVLLKNDGT-LPLSKD-THSVALIGPWANVTTTMQGNYYG 448

Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
                   L  L+A     +YA G  ++   + +GF  A+  A+K+D VI   G+D + E
Sbjct: 449 AAPYLTGPLAALQASDLDVNYAFGT-NISSETTSGFEAALSAARKSDVVIFAGGIDNSVE 507

Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
            E  DR ++  PG Q+ L+  ++   K P++++  GGG +D S  +A+  ++S++W GYP
Sbjct: 508 AEGVDRETITWPGNQLQLIEQLSELGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYP 566

Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTG 650
           G++G  A+ +I+ G   P GRL +T YP  +  + P  DM++R   S   PG++Y +YTG
Sbjct: 567 GQSGGPAILDILTGKRAPAGRLTVTQYPAEYALQFPATDMSLRPKGSN--PGQTYMWYTG 624

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ-QTGSRLDYVHIDEVTSC 709
             VY FGHGL YT +        + L  S     G+  +I++  + S   Y  I++V   
Sbjct: 625 KPVYEFGHGLFYTTFE-------TSLANSHGANNGASFDIVKLLSRSNAGYNVIEQVP-- 675

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
                +  I V N G V   +  M F          P K L+GFDR+  +   + +    
Sbjct: 676 ---FMNYTIEVENTGTVTSDYTAMAFVNTKAGPSPHPNKWLVGFDRLGGIEPHATQTMTI 732

Query: 770 VDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
               + ++  ++ G RI+  G + L +   R ++
Sbjct: 733 PVSLDNVARTDEDGNRIVYPGKYELALNNERSAV 766


>gi|376259588|ref|YP_005146308.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
 gi|373943582|gb|AEY64503.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
          Length = 712

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 250/773 (32%), Positives = 382/773 (49%), Gaps = 119/773 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + SLS   RA  L+S +TL+EK  QL  +A  + RLGIP Y WW+E+LHG+A  G   
Sbjct: 6   YLDKSLSFKERAADLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAG--- 62

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AA F+      I   +A E RA YN            G+TFW
Sbjct: 63  -------VATVFPQAIGMAAIFDDEFLEKIADVIATEGRAKYNENAKKGDRDIYKGITFW 115

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PN+NIFRDPRWGRG ET GEDP + S   V FVK  QG+                    
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD-------------------- 155

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                   G  L  +AC KH   + +       R+ F+A+++++D  +T+ P F + +++
Sbjct: 156 --------GKYLKTAACAKH---FAVHSGPEDDRHHFDAVVSQKDLYETYLPAFEALVKE 204

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
            K   +M +YN+ NG P      L +   R+ WGF G++ SDC A+    E    TKT  
Sbjct: 205 AKVESVMGAYNRTNGEPCNGSKTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHGVTKTPT 264

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           +S A  LK+G D+NCG   L     A+ +G++ E+DIDRA + L + ++RLG+F+ D   
Sbjct: 265 ESVALALKSGCDLNCGNMYLL-ILLALKEGRITEEDIDRAAIRLMTTRMRLGMFDDD--- 320

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
            ++ K+  +   + EH KL+L+AA++ +VLLKND   LPL+   + ++A+IGP  ++   
Sbjct: 321 CEFDKIPYELNDSVEHNKLSLEAAKKSMVLLKNDG-LLPLDSKKIKNIAVIGPNADSSLA 379

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAK 515
           +   Y+G P    ++L G+   VS   +  Y+ G H      +     D    EAV +A+
Sbjct: 380 LRANYSGTPSQNITILDGIRKRVSEDTRVWYSVGSHLFMNREEDLAQPDDRLKEAVSVAE 439

Query: 516 KADFVIVVAGLDLTQETEDRD-----------RVSLLLPGQQMSLVTSVARTSKRPVILV 564
           ++D V++  GLD + E E  D           +  L LP  Q +L+ +V  T K  ++ +
Sbjct: 440 RSDVVVLCLGLDASVEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGKPTIVAL 499

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
           L+G     +S  +A  + ++I+   YPG  G  A AE+IFGD++P GRLP+T+Y  +   
Sbjct: 500 LSGSA---LSIGDAADKAAAIVQCWYPGSRGGLAFAEMIFGDYSPAGRLPVTFYKSTEEL 556

Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
            P  D +M          R+Y+F  G  +Y FG GLSYTN+ Y  +  P  +    +L  
Sbjct: 557 PPFADYSME--------NRTYKFMKGEALYPFGFGLSYTNFEYSNIVCPQNVNNGENLS- 607

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
                                          V + V NAG VD   VV ++ +    S  
Sbjct: 608 -------------------------------VSVDVQNAGSVDSDEVVQVYIKDMDASVR 636

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            P+  L GF R+H  +   K ++F +D    ++I ++ G+R +  G   L VG
Sbjct: 637 VPKYSLCGFKRIHLKSGEKKTVTFEID-SNAMTIVDEAGKRYIENGEFTLYVG 688


>gi|310795958|gb|EFQ31419.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 824

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 275/802 (34%), Positives = 399/802 (49%), Gaps = 85/802 (10%)

Query: 31  SNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPR 90
           +N+  V + PD  C      S   C+ +LS   RA +L++ LT+ EK+  L + A  +PR
Sbjct: 43  ANNNAVNSFPD--CSRDPLCSNDVCDETLSPKERAAALVAELTIWEKLDNLVNEAPGVPR 100

Query: 91  LGIPAYEWWSESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVA 148
           L IP YEWWSE LHG+AS+ PG  F  +G  S  TSFPQ +V  ++F+  L   IG  V+
Sbjct: 101 LAIPPYEWWSEGLHGVASS-PGTKFAKSGNFSYATSFPQPIVLGSAFDDDLVKAIGEVVS 159

Query: 149 VEARAMYNLGQAGLTFW---------------------APNINIFRDPRWGRGQETPGED 187
            EARA  N G++GL  +                     +PNIN F+DPRWGRGQETPGED
Sbjct: 160 KEARAFSNRGRSGLDLYVSSISRHIEPEVRDDMLTEPESPNINAFKDPRWGRGQETPGED 219

Query: 188 PMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLI 247
           P  +  Y    +   +G                       G+ S +     L A CKH  
Sbjct: 220 PFHLQNYVAAMLTGLEG-----------------------GDPSKK-----LIATCKHYA 251

Query: 248 AYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRG 307
           A D E +    R  F+A IT QD  + + PPF++C    K    MCSYN +NG P C   
Sbjct: 252 ANDFENYKGVDRAGFDANITTQDLSEYYLPPFKTCAVDKKVGSFMCSYNAINGEPLCANP 311

Query: 308 DLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM 363
            L +   R  WG+ G   Y+++DCD VA +  + +Y      +AA  +KAG D+ C    
Sbjct: 312 YLLEDILRQHWGWNGDGQYVSTDCDCVALMVSHHHYAPDLGHAAAWAMKAGTDLECNAFP 371

Query: 364 LRHT-QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKK 422
                Q A ++  + EK++D++L  +++  + +G F+   R      L  DDV T E +K
Sbjct: 372 GSEALQLAWNQSLISEKEVDKSLTRMYTALVSVGQFDS-ARGQPLRSLSWDDVNTKEAQK 430

Query: 423 LALDAARQGIVLLKNDKKFLPLNKN-AVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLR 481
           LA  A  +G VLLKND   LPL+        A+IGP +N  +QM G Y G      SL +
Sbjct: 431 LAYQAVIEGAVLLKND-GILPLSAAWREKKYALIGPWINATTQMQGNYFGPAPYLISLYQ 489

Query: 482 GLEAYVSKTHYASGCHDVPCNS-DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSL 540
             + +     Y+ G      NS D  F +A+  A  A  ++   G+D T E E RDR +L
Sbjct: 490 AAKEFGLDFTYSLGSR---INSTDDSFKQALDSAHAAALIVFAGGVDNTLEAETRDRKTL 546

Query: 541 LLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALA 600
             P  Q+ L+ +V+   K PVI++  GGG +D +   A+  I+++LW GYPG++G KA+ 
Sbjct: 547 AWPESQLDLLRAVSALGK-PVIVLQFGGGQVDDTELLANHSINALLWGGYPGQSGGKAVI 605

Query: 601 EIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           +++FG   P GRL +T YP S+ + VP  DMN+R        GR+Y +Y G  V  +G G
Sbjct: 606 DLLFGRAAPAGRLSVTQYPASYNEDVPSTDMNLRPGPGNSGLGRTYMWYNGDAVVPYGFG 665

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS--CTSLRFHVQ 717
           L YT +  K  +  +    SA +K     ++L       DYV    V     T     V 
Sbjct: 666 LHYTTFDAKLKARQA----SALIKTEEVSSLLSN-----DYVSGTLVWQQILTKPVVSVL 716

Query: 718 ISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG---SKEISFGVDPCE 774
           I+V+N G+V   +V +LF R        P K L G+ R   +  G    +E+S  +   E
Sbjct: 717 ITVSNTGNVASDYVALLFLRSNAGPTPQPTKTLAGYHRFRNIQPGDRSEREVSITI---E 773

Query: 775 QLSIANKHGRRILPLGNHVLMV 796
           +L   ++ G R+L  G++ L V
Sbjct: 774 RLVRVDELGNRVLHPGSYELFV 795


>gi|425780840|gb|EKV18836.1| Beta-xylosidase XylA [Penicillium digitatum PHI26]
 gi|425783077|gb|EKV20946.1| Beta-xylosidase XylA [Penicillium digitatum Pd1]
          Length = 792

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 248/754 (32%), Positives = 387/754 (51%), Gaps = 53/754 (7%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+T+     RA +L S+ TL+E +    +   A+PRLG+P Y+ WSE+LHG+  +   + 
Sbjct: 62  CDTTAKPHDRAAALTSMFTLEELVNSTGNVIPAVPRLGLPPYQVWSEALHGL--DRANLT 119

Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
            +G  S  TSFP  ++  A+ NR+L + IG  ++ + RA  N G+ GL  +APNIN FR 
Sbjct: 120 ESGDYSWATSFPSPILIMAALNRTLINQIGEIISTQGRAFNNGGRYGLDVYAPNINSFRH 179

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           P WGRGQETPGED  + S Y VE++   QG          G   R               
Sbjct: 180 PVWGRGQETPGEDVQLCSIYGVEYITGIQG----------GLNPR--------------- 214

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
            +L L+A  KH   YDLE WGN SR   N  I+  D    + P F + +   +   +M S
Sbjct: 215 -DLKLAATAKHFAGYDLENWGNHSRLGNNVAISSFDLASYYTPQFITAVRDARVHSVMSS 273

Query: 295 YNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
           YN VNGVP+     L Q   R  W F   GY++SDCDAV  +F    Y  +   +AA  +
Sbjct: 274 YNAVNGVPSSANSFLLQTLLRETWNFVEDGYVSSDCDAVFNVFNPHGYASSASLAAAKSI 333

Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
           +AG DI+CG     +   ++   ++   +I+RA+   +S  + LG F+GD    KY  L 
Sbjct: 334 QAGTDIDCGATYQLYLNESLSHDEISRSEIERAVTRFYSTLVSLGYFDGD--NSKYRHLH 391

Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
             DV  ++   ++ +AA +GIVLLKND   LPL+ N   S+A+IGP  N  + + G Y G
Sbjct: 392 WPDVVATDAWNISYEAAVEGIVLLKNDGT-LPLSNN-TRSVALIGPWANVTTTLQGNYYG 449

Query: 472 IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
                   L  L+A     +YA G  ++  +S +GF  A+  A K++ +I   G+D T E
Sbjct: 450 AAPYLTGPLAALQASNLDVNYAFGT-NISSDSTSGFEAALSAAGKSEVIIFAGGIDNTVE 508

Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
            E  DR S+  PG Q+ L+  +++  K P++++  GGG +D S  +A+  ++S++W GYP
Sbjct: 509 AEGVDRESITWPGNQLQLIEQLSKLGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYP 567

Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTG 650
           G++G  A+ +I+ G   P GRL +T YP  +  + P  DM++R   +   PG++Y +YTG
Sbjct: 568 GQSGGPAILDILTGKRAPAGRLTVTQYPAEYALQFPATDMSLRPKGNN--PGQTYMWYTG 625

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ-QTGSRLDYVHIDEVTSC 709
             VY FGHGL YT +          L      + G+  +I+Q  +     Y  ++++   
Sbjct: 626 KPVYEFGHGLFYTTFKVS-------LAHFHGAENGTSFDIVQLLSRPNAGYSVVEQIP-- 676

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
                +  + V N G+V   +  M F          P K L+GFDR+  ++  + +    
Sbjct: 677 ---FINYTVEVMNTGNVTSDYTAMAFVNTKAGPSPHPNKWLVGFDRLGGISPRTTQTMTI 733

Query: 770 VDPCEQLSIANKHGRRILPLGNHVLMVGELRHSL 803
               + ++  ++ G RI+  G + L +   R ++
Sbjct: 734 PITLDNVARTDERGNRIVYPGKYELTLNNERSAV 767


>gi|171678585|ref|XP_001904242.1| hypothetical protein [Podospora anserina S mat+]
 gi|170937362|emb|CAP62020.1| unnamed protein product [Podospora anserina S mat+]
          Length = 800

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 259/737 (35%), Positives = 376/737 (51%), Gaps = 89/737 (12%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           CNT+LS   RA +L++ LT +EK+Q +   +   PR+G+PAY WWSE+LHG+A   PG  
Sbjct: 40  CNTTLSPPERAAALVAALTPEEKLQNIVSKSLGAPRIGLPAYNWWSEALHGVAY-APGTQ 98

Query: 115 F---NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F   +G  +S TSFP  L+ AA+F+  L   I   + +E RA  N G +GL +W PN+N 
Sbjct: 99  FWQGDGPFNSSTSFPMPLLMAATFDDELLEKIAEVIGIEGRAFGNAGFSGLDYWTPNVNP 158

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           F+DPRWGRG ETPGED ++V  YA   +K  +G              +E+RV+       
Sbjct: 159 FKDPRWGRGSETPGEDVLLVKRYAAAMIKGLEGP----------VPEKERRVV------- 201

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                    A CKH  A D E W   +R++FNA I+ QD  + +  PF+ C+   +   I
Sbjct: 202 ---------ATCKHYAANDFEDWNGATRHNFNAKISLQDMAEYYFMPFQQCVRDSRVGSI 252

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSA 347
           MC+YN VNGVP+C    L Q   R  W +     YITSDC+AV  +     Y  T+ +  
Sbjct: 253 MCAYNAVNGVPSCASPYLLQTILREHWNWTEHNNYITSDCEAVLDVSLNHKYAATNAEGT 312

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A   +AGMD +C          A  +G ++E  +DRALL L+   +R G F+G  ++  Y
Sbjct: 313 AISFEAGMDTSCEYEGSSDIPGAWSQGLLKESTVDRALLRLYEGIVRAGYFDG--KQSLY 370

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP----LNKNAVSSLAIIGPLVNNIS 463
             LG  DV     +KL+L AA  G VLLKND   LP    L+K+    +A+IG   +   
Sbjct: 371 SSLGWADVNKPSAQKLSLQAAVDGTVLLKNDGT-LPLSDLLDKSRPKKVAMIGFWSDAKD 429

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAG--FH-----------EA 510
           ++ GGY+G             AY+    YA+    +P ++ +G   H            A
Sbjct: 430 KLRGGYSGT-----------AAYLHTPAYAASQLGIPFSTASGPILHSDLASNQSWTDNA 478

Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           +  AK AD+++   G+D +   E +DR  L  PG Q+SL+  +   SK   ++VL  G  
Sbjct: 479 MAAAKDADYILYFGGIDTSAAGETKDRYDLDWPGAQLSLINLLTTLSKP--LIVLQMGDQ 536

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMND 629
           LD +   ++ +I++ILW  +PG+ G  A+ E++ G  +P GRLP+T YP +FT+ VPM D
Sbjct: 537 LDNTPLLSNPKINAILWANWPGQDGGTAVMELVTGLKSPAGRLPVTQYPSNFTELVPMTD 596

Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDK 688
           M +R  +     GR+YR+Y  T V  FG GL YT +S KF    P+ + +   L+   DK
Sbjct: 597 MALRPSAGNSQLGRTYRWYK-TPVQAFGFGLHYTTFSPKFGKKFPAVIDVDEVLEGCDDK 655

Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PE 747
                    LD   + ++           + V N G+    +V + F   P V  G  P 
Sbjct: 656 --------YLDTCPLPDLP----------VVVENRGNRTSDYVALAFVSAPGVGPGPWPI 697

Query: 748 KQLIGFDRVHTVAKGSK 764
           K L  F R+  V  G K
Sbjct: 698 KTLGAFTRLRGVKGGEK 714


>gi|220927661|ref|YP_002504570.1| glycoside hydrolase [Clostridium cellulolyticum H10]
 gi|219997989|gb|ACL74590.1| glycoside hydrolase family 3 domain protein [Clostridium
           cellulolyticum H10]
          Length = 712

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 254/774 (32%), Positives = 381/774 (49%), Gaps = 121/774 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + SLS   RA  L+S +TL+EK  QL  +A  + RLGIP Y WW+E+LHG+A  G   
Sbjct: 6   YLDKSLSFKERAVDLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAG--- 62

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AA F+      I   +A E RA YN            G+TFW
Sbjct: 63  -------VATVFPQAIGLAAIFDDEFLEKIADVIATEGRAKYNESSKKGDRDIYKGITFW 115

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PN+NIFRDPRWGRG ET GEDP + S   V FVK  QG+                    
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD-------------------- 155

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                   G  L  +AC KH   + +       R+ FNA+ +++D  +T+ P F + +++
Sbjct: 156 --------GKYLKSAACAKH---FAVHSGPEDDRHHFNAVASQKDMYETYLPAFEALVKE 204

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
            K   +M +YN+ NG P      L +   R++WGF G++ SDC A+    E    TKT  
Sbjct: 205 AKVESVMGAYNRTNGEPCNGSKTLLKDILRDDWGFDGHVVSDCWAIKDFHEGHGVTKTPT 264

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           +S A  LK G D+NCG   L     A+ +GK+ E+DIDRA + L + +++LG+F+ D   
Sbjct: 265 ESVALALKNGCDLNCGNMYLL-ILLALKEGKITEEDIDRAAIRLMTTRMKLGMFDDDCEF 323

Query: 405 GKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
            K     P +V  S EH KL+L+AAR+ +VLLKN+   LPL+   + ++A+IGP  ++  
Sbjct: 324 DKI----PYEVNDSIEHNKLSLEAARKSMVLLKNNG-LLPLDSKKIKNIAVIGPNADSSL 378

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIA 514
            +   Y+G P    ++L G+ + VS   +  Y+ G H      +     D    EAV +A
Sbjct: 379 ALRANYSGTPSHNITILDGVRSRVSEDTRVWYSLGSHLFMNREEDLAQPDDRLKEAVSMA 438

Query: 515 KKADFVIVVAGLDLTQETEDRD-----------RVSLLLPGQQMSLVTSVARTSKRPVIL 563
           +++D V++  GLD + E E  D           +  L LP  Q +L+ +V  T K  ++ 
Sbjct: 439 ERSDVVVLCLGLDASVEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGKPTIVA 498

Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
           +L+G     +S  +A  + ++I+   YPG  G  A AE+IFGD++P GRLP+T+Y  +  
Sbjct: 499 LLSGSA---LSIGDAADKAAAIVQCWYPGSKGGLAFAEMIFGDYSPAGRLPVTFYKSTEE 555

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
             P  D +M          R+Y+F  G  +Y FG GLSYTN+ Y  +  P  +    SL 
Sbjct: 556 LPPFEDYSME--------NRTYKFMKGEALYPFGFGLSYTNFEYSNIVCPQAVNNGESLS 607

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
                                           V + V NAG VD   VV ++ +  + S 
Sbjct: 608 --------------------------------VSVDVQNAGSVDSDEVVQVYIKDMEASV 635

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             P   L GF R+   +   K ++F +D    ++I ++ G+R +  G+  L VG
Sbjct: 636 RVPNHSLCGFKRIFLKSGEKKTVTFEID-SRAMTIVDEEGKRYIENGDFTLYVG 688


>gi|333379783|ref|ZP_08471502.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
           22836]
 gi|332884929|gb|EGK05184.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
           22836]
          Length = 737

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 262/780 (33%), Positives = 379/780 (48%), Gaps = 128/780 (16%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           +YPF NT+LSI  R   L+S LTL+EK+ Q+ +N  AI RL IPAY WW+E LHGI    
Sbjct: 24  NYPFQNTNLSIDERVNDLVSKLTLEEKVAQMLNNTPAIERLNIPAYNWWNECLHGIGR-- 81

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GL 162
                  T   VT FPQ +  AA++N+ L   + SA++ E RA+YN   +        GL
Sbjct: 82  -------TDYKVTVFPQAIGMAAAWNKELMKEVASAISDEGRAIYNDATSKGNREIYYGL 134

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           T+W PNINIFRDPRWGRGQET GEDP +       FV   QG++ K              
Sbjct: 135 TYWTPNINIFRDPRWGRGQETYGEDPFLTGVLGKSFVAGLQGDDTKY------------- 181

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
                         L  +AC KH   Y +      +R++FN  +T+ D  DT+ P FR+ 
Sbjct: 182 --------------LKAAACAKH---YAVHSGPENTRHTFNTFVTDYDLWDTYLPAFRNL 224

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           + + K + +MC+YN  NG P C    L Q+  R +W F GY+TSDC A+   +++    K
Sbjct: 225 VVEAKVAGVMCAYNAYNGEPCCGNNFLMQEILREKWNFTGYVTSDCGAIDDFYQHH---K 281

Query: 342 THEDS---AAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH D+   AA  +  G DI+CG    +    A+  G + EK ID +L  LF+++ RLG+F
Sbjct: 282 THPDAKYAAADAVYNGTDIDCGNEAYKALVDAVKTGIITEKQIDISLKRLFTIRFRLGMF 341

Query: 399 NGDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
             DP +  KY ++    + + +HK LAL   R+ IVLLKN+   LPL+K  +  +A++GP
Sbjct: 342 --DPAENVKYSQISTSVLESQKHKDLALKITRESIVLLKNENNTLPLSKK-LKKVAVVGP 398

Query: 458 LVNNISQMGGGYTGIPC-------SPKSLLRGLEAYVSKTHYASGCHDVP--CNSDAGFH 508
             NN   + G Y G P        + K  L+G E       Y  G   V    NS     
Sbjct: 399 NANNEVSVLGNYNGFPTEIVTPYEAVKQKLKGAEVI-----YEKGIDFVTPSTNSKEEVS 453

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
             V+  K  D VI V G+    E E+           DR S+ LP  Q   + ++    K
Sbjct: 454 ALVKRLKDVDVVIFVGGISPELEGEEMPVKIEGFTGGDRTSIKLPKIQTDFMKALV-AEK 512

Query: 559 RPVILVLTGGGPLDVSFAEADSQ-ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
            P + V+  G  +     E +SQ I +I+   Y G+    A+A+++FGD+NP G+LP+T+
Sbjct: 513 IPTVFVMMTGSAIA---TEWESQNIPAIVNAWYGGQDAGTAIADVLFGDYNPSGKLPVTF 569

Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
           Y +        D ++ A +S +   R+YR++ G  +Y FG+GLSYT + Y  +  PS + 
Sbjct: 570 YAK--------DSDLPAFNSYEMKNRTYRYFNGEVLYPFGYGLSYTKFEYSPIQVPSTID 621

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
              + K                                V +S+ N G V+G  VV L+  
Sbjct: 622 TGNNAK--------------------------------VSVSIKNTGKVEGEEVVQLYIS 649

Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            P      P   L GF+RV   A  SK + F + P E L + +  G   +  G   + +G
Sbjct: 650 YPDTKGQKPLYALKGFNRVSLKAGESKTVEFNLSPRE-LGLVDDAGILKVSAGKRKIFIG 708


>gi|78482949|emb|CAJ41429.1| beta (1,4)-xylosidase [Populus tremula x Populus alba]
          Length = 732

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 225/539 (41%), Positives = 310/539 (57%), Gaps = 53/539 (9%)

Query: 42  FPCKPPHFDS--YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           F C P    +   PFC  +L I TR   LI  +TLQEK+  L +NA+A+PRLGI  YEWW
Sbjct: 27  FACDPKDGTNRDLPFCQVNLPIHTRVNDLIGRMTLQEKVGLLVNNAAAVPRLGIKGYEWW 86

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE+LHG+++ GPG  F G     TSFPQV+ +AASFN +LW  IG  V+ EARAM+N G 
Sbjct: 87  SEALHGVSNVGPGTKFGGAFPVATSFPQVITTAASFNATLWEAIGRVVSDEARAMFNGGV 146

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           AGLT+W+PN+     PRWGRGQETPGEDP+VV  YA  +V+  QG +             
Sbjct: 147 AGLTYWSPNVTYSVYPRWGRGQETPGEDPVVVGKYAASYVRGLQGSD------------- 193

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
                         G  L ++ACCKH  AYDL+ W    R+ FNA +++QD  DTF  PF
Sbjct: 194 --------------GIRLKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMVDTFDVPF 239

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQN 338
           R C+++GK + +MCSYNQVNG+P C   +L +K  R +W   GYI SDCD+    +  Q+
Sbjct: 240 RMCVKEGKVASVMCSYNQVNGIPTCADPNLLKKTVRGQWRLNGYIVSDCDSFGVYYGQQH 299

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           +T     S+ G  KAG+D++CG  ++ H  +   K   +E +I+ A L   + Q+ LG+F
Sbjct: 300 FTSPRR-SSLGCYKAGLDLDCGPFLVTHRDAV--KKAAEEAEINNAWLKTLTFQISLGIF 356

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP- 457
           +G P +   G + P  +    ++ LA++A ++ + + KN + FL       S   I GP 
Sbjct: 357 DGSPLQA-VGDVVP-TMGPPTNQDLAVNAPKR-LFIFKN-RAFL-----LYSPRHIFGPV 407

Query: 458 -LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN-SDAGFHEAVRIAK 515
            L  ++  M G Y G+PC     L+GL  +VS   Y  GC +V C  +D G   AV +A 
Sbjct: 408 ALFKSLPFMLGNYEGLPCKYLFPLQGLAGFVSLL-YLPGCSNVICAVADVG--SAVDLAA 464

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV-----LTGGG 569
            AD V++V G D + E E  DRV   LPG+Q  LVT VA  +K PV+LV     ++GGG
Sbjct: 465 SADAVVLVVGADQSIEREGHDRVDFYLPGKQQELVTRVAMAAKGPVLLVIMDLAISGGG 523



 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 56/91 (61%), Gaps = 1/91 (1%)

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           +Q+ V N G +DG+H ++++ R P      P KQL+ F++VH  A   + +   +  C+ 
Sbjct: 634 MQVDVKNTGSMDGTHTLLVYFR-PPARHWAPHKQLVAFEKVHVAAGTQQRVGINIHVCKS 692

Query: 776 LSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           LS+ +  G R +P+G H L +G+++HS++++
Sbjct: 693 LSVVDGSGIRRIPMGEHSLHIGDVKHSVSLQ 723


>gi|410628680|ref|ZP_11339398.1| beta-glucosidase [Glaciecola mesophila KMM 241]
 gi|410151684|dbj|GAC26167.1| beta-glucosidase [Glaciecola mesophila KMM 241]
          Length = 732

 Score =  400 bits (1029), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 269/783 (34%), Positives = 391/783 (49%), Gaps = 128/783 (16%)

Query: 48  HFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA 107
           H     + N  LS  TRA++L++ +T+ EKI QLS +  AIPRL +P Y WW+E+LHGIA
Sbjct: 24  HVKQQIWFNPELSFETRAQALVNAMTIDEKITQLSHSTPAIPRLEVPQYNWWNEALHGIA 83

Query: 108 SNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ-------- 159
            NG            T FPQ +   A+F+  L   + +A++ EARA Y + Q        
Sbjct: 84  RNG----------KATIFPQAIGLGATFDPELAQEVANAISDEARAKYAIAQSIGNQGQY 133

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           AGLTFW PN+NIFRDPRWGRGQET GEDP++ S     FVK  QG++ K          +
Sbjct: 134 AGLTFWTPNVNIFRDPRWGRGQETYGEDPLLTSQMGTAFVKGLQGDDPKY--------LK 185

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
              V K F   S  G E +                    R+ F+   +++D  +T+ P F
Sbjct: 186 SAGVAKHFAVHS--GPESL--------------------RHQFDVEPSKKDLYETYLPAF 223

Query: 280 RSCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFE 335
            + + Q K + +MC+YN V G P+C    L G++ +K   +W F GY+ SDC A+     
Sbjct: 224 EALVTQAKVAGVMCAYNGVYGQPSCASEFLLGEMLKK---KWQFNGYVVSDCGALHDFHS 280

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
               T    +SAA  L+AG+D+NCG    +  ++A ++G + +  ID+ L NL  ++ RL
Sbjct: 281 GHKVTHNRVESAALALRAGVDLNCGFTYEKSLKAAFEEGLITQSLIDQRLKNLLMIRFRL 340

Query: 396 GLFNGDPRK-GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           GLF  DP +   +  +G + + + EH +LA   A + IVLLKN+K+ LPL+K+ +    +
Sbjct: 341 GLF--DPSELNPHNAIGQEVIHSLEHIELARKVAAKSIVLLKNEKQVLPLSKD-IKVPYV 397

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAG-FHEA 510
            GP   +   + G Y GI  S  ++L G+   V   S  +Y +G   +P +S+    + A
Sbjct: 398 TGPFAASSDMLMGNYYGISDSLVTVLEGIAGKVSLGSSLNYRAGA--LPFHSNINPLNWA 455

Query: 511 VRIAKKADFVIVVAGLDLTQETE--------DR-DRVSLLLPGQQMSLVTSVARTSKRPV 561
             +AK AD VI V G+    E E        DR DRV++ LP  Q+  V  +A   K P+
Sbjct: 456 PEVAKTADAVIAVVGISADMEGEEVDAIASADRGDRVAITLPQNQVDYVKQLAENKKGPL 515

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           ILV+  G P+D+S  E D    +ILWI YPGE G  A+A++IFGD NP G LP+T+    
Sbjct: 516 ILVVAAGSPVDIS--ELDPLADAILWIWYPGEQGGNAVADVIFGDTNPSGHLPLTFVKTI 573

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS----APSELT 677
               P +D  M         GR+Y+F     +Y FG GLSYT + +  LS    AP E  
Sbjct: 574 DDLPPFDDYTMT--------GRTYKFLKKLPLYPFGFGLSYTQFKFGKLSLSKRAPQE-- 623

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF-- 735
                  G + NI                           + V N+  +DG  VV ++  
Sbjct: 624 -------GENINI--------------------------SVEVENSTALDGETVVQVYLS 650

Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
            +VP  ++      L  F RVH  A   + I F ++  + L   N  G  + P G + L 
Sbjct: 651 PQVPLKNEAI--TNLKAFKRVHIGAYEKRLIEFTIE-GKNLYRVNDAGENVWPSGAYTLA 707

Query: 796 VGE 798
           VG+
Sbjct: 708 VGD 710


>gi|398406144|ref|XP_003854538.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
 gi|339474421|gb|EGP89514.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
          Length = 884

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 271/790 (34%), Positives = 399/790 (50%), Gaps = 72/790 (9%)

Query: 24  VVNVIAFSNSKP------VLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEK 77
           V+N+ + SN  P      V   P   C    F +   C+TSLS   R  +LIS +T++EK
Sbjct: 111 VMNLYSISNPNPHPPPKGVKRSP--VCLTDPFCANKACDTSLSQDDRIAALISQMTVEEK 168

Query: 78  IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF---NGT-VSSVTSFPQVLVSAA 133
              L D A  +PR+G+P YEWW+E+LHG+A +  GV+F   NG+  S  TSFP  ++  A
Sbjct: 169 ATNLVDGALGLPRIGLPPYEWWNEALHGVAGSR-GVSFDSPNGSDFSYATSFPLPILMGA 227

Query: 134 SFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           +F+  L  ++ S +  EARA  N   +G  FW PN+N F DPRWGRG E P ED      
Sbjct: 228 AFDDPLIYDVASIIGKEARAFANYAHSGYDFWTPNMNTFLDPRWGRGLEVPTEDSFHAQR 287

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
           Y    V   QG   K+D          K+++                A CKH   YD+E 
Sbjct: 288 YVASLVPGLQGGKEKTD---------HKQII----------------ATCKHFAVYDVET 322

Query: 254 WGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA 313
               +R++ N   T QD  + + P F++C+       IMCSYN V GVPAC      Q  
Sbjct: 323 ----NRHAQNYEPTPQDLGEYYLPAFKTCVRDVNVGSIMCSYNAVYGVPACASEYFLQDV 378

Query: 314 -RNEWGFK---GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
            R++W F     Y+TSDC+AV  I+   N+T T   +AA  L AG D NCGT  L+   +
Sbjct: 379 LRDQWNFNEPYHYVTSDCEAVKDIWTPHNFTDTEPAAAAVALNAGTDTNCGTSYLQ-LNT 437

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
           ++      E  +D +L  L++    +G F+G P   +Y  L   DV T   +  A  AA 
Sbjct: 438 SVANNWTTEAQMDISLTRLYNALFTVGYFDGQP---EYDGLSFADVSTPFAQATAYRAAS 494

Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK 489
           +GI LLKND   LPL K + +S+A+IGP  N  +QM G Y GI     S L   +A    
Sbjct: 495 EGITLLKND-GLLPL-KKSYNSVALIGPWANATTQMQGIYQGIAPYLVSPLAAAQAQWGH 552

Query: 490 THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSL 549
             + +G      N+  GF  A+  A+ AD +I   G+D + E E RDR S+  PG Q+ L
Sbjct: 553 ISFTNGTAINSTNT-TGFASALSAARDADVIIYAGGIDSSIEKESRDRTSISWPGNQLDL 611

Query: 550 VTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
           V  ++   K P+++V  GGG +D S    +  ++S++W GYPG+ G  AL +++ G  +P
Sbjct: 612 VQQLSELGK-PLVVVQFGGGQVDDSALLRNKNVNSLVWAGYPGQDGGSALIDVLVGKQSP 670

Query: 610 GGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYK 668
            GRL +T YP  +  ++ + D N+R   S   PGR+Y++Y    V  FG+GL YT + + 
Sbjct: 671 AGRLTITQYPADYINQISLFDPNLRPSDSS--PGRTYKWYNKEPVLPFGYGLHYTTFEFD 728

Query: 669 FLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDG 728
           +  AP      ASL        +  T S       ++ +  T L     I V N+G +  
Sbjct: 729 WAKAPQASYDIASL--------VDSTASYTTSPKKNDASPWTEL----SIKVHNSGSLGS 776

Query: 729 SHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKG-SKEISFGVDPCEQLSIANKHGRRI 786
            +V ++F R P       P K L  + R+H ++ G S E+SF +     L+  ++HG  I
Sbjct: 777 DYVGLVFLRTPNAGPAPYPNKWLASYARLHGLSAGASAELSFSLS-LGALARGDEHGDLI 835

Query: 787 LPLGNHVLMV 796
           +  G++ + +
Sbjct: 836 IYPGDYEVQI 845


>gi|359473427|ref|XP_002265788.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Vitis vinifera]
          Length = 464

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 199/473 (42%), Positives = 279/473 (58%), Gaps = 28/473 (5%)

Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
           MYNLG AGLTFW+PNIN+ RD RWGR QET  EDP +V  +AV +V+  Q          
Sbjct: 1   MYNLGHAGLTFWSPNINVVRDTRWGRTQETSREDPFMVGEFAVNYVRGLQD--------- 51

Query: 214 IGFGFREKRVLKGFGEESDRGDE-LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTE 272
                     ++G    +D     L +S+CCKH  AYD++ W N  R++F+A ++EQD +
Sbjct: 52  ----------VEGTENVTDLNSRPLKVSSCCKHYAAYDIDSWLNIDRHTFDARVSEQDMK 101

Query: 273 DTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVA 331
           +TF  PF  C+ +G  S +MCS+N++NG+P C    L +   R+EW   GYI SDC  + 
Sbjct: 102 ETFVSPFERCVREGDVSSVMCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLE 161

Query: 332 TIFEYQNYTKTHE-DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
            I + QNY    + D+ A  L+AG+D+ CG          +  GKV + ++DRAL N++ 
Sbjct: 162 VIVDNQNYLNDSKVDAVAKTLQAGLDLECGHYYTDALNELVLTGKVSQYELDRALKNIYV 221

Query: 391 VQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVS 450
           + +R+G F+G P    Y  LG  D+C ++H +LA +AARQGIVLLKND +  PL      
Sbjct: 222 LLMRVGYFDGIP---AYESLGLKDICAADHIELAREAARQGIVLLKNDYEVFPLKPG--K 276

Query: 451 SLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEA 510
            LA++GP  N    M G Y G+P    S L    A +    Y +GC D  C++D  F EA
Sbjct: 277 KLALVGPHANATEVMIGNYAGLPRKYVSPLEAFSA-IGNVTYTTGCLDASCSNDTYFSEA 335

Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
              AK A+  I+  G DL+ E E  DRV  LLPG Q  L+  VA  S  PVILV+  G  
Sbjct: 336 KEAAKSAEVTIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSN 395

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
           +D++FA+ + +IS+ILW+G+PGE G  A+A+++FG +NPGGRLP+TWY   + 
Sbjct: 396 IDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYV 448


>gi|225878709|dbj|BAH30674.1| beta-xylosidase [Aspergillus aculeatus]
          Length = 785

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 258/756 (34%), Positives = 382/756 (50%), Gaps = 65/756 (8%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+ + S   RA +L S++TL+E +    +   AIPRLG+P Y+ W+E+LHG+       N
Sbjct: 62  CDRTASAHDRAAALTSMMTLEELMNSTGNRIPAIPRLGLPPYQIWNEALHGLYL----AN 117

Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F  +G  S  TSFP  +++ A+ NR+L   I   +A + RA  N G+ GL  ++PNIN F
Sbjct: 118 FTESGPFSWSTSFPSPILTMATLNRTLIHQIAQIIATQGRAFNNAGRYGLNAFSPNINAF 177

Query: 173 RDPRWGRGQETPGEDP-MVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           R P WGRGQETPGED   + SAYA E++   QG                           
Sbjct: 178 RHPVWGRGQETPGEDANCLCSAYAYEYITGLQGN-------------------------- 211

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                  + A  KH   YD+E W   SR+  +  IT+QD  + F P F   +   +   +
Sbjct: 212 --ATNPKIIATAKHYAGYDIENWRQRSRFGNDLNITQQDLAEYFTPQFVVAVRDAQVRSV 269

Query: 292 MCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           M SYN VNGVP+     L Q   R+ WGF   GY+ SDCDAV  +F    Y      ++A
Sbjct: 270 MPSYNAVNGVPSSANTFLLQTLVRDSWGFIQDGYMASDCDAVYNVFNPHGYAANLSSASA 329

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             L+AG DI+CG   L     ++ +G++   +I+RA+   +S  +  G F+G      Y 
Sbjct: 330 MSLRAGTDIDCGISYLTTLNESLTQGQISRSEIERAVTRFYSNLVSAGYFDGP--DAPYR 387

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            L   DV  +    +A +AA  G+VLLKND   LPL+K +V  +A+IGP  N   QM G 
Sbjct: 388 DLSWSDVVRTNRWNVAYEAAVAGVVLLKNDG-VLPLSK-SVQRVALIGPWANATEQMQGN 445

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G+     S L  ++A   + +YA G  ++  N    F  A+  A+K+D +I   G+D 
Sbjct: 446 YHGVAPYLTSPLAAVQASGLEVNYAFGT-NITSNVTNCFAAALAAAEKSDIIIFAGGIDN 504

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
           T E E+ DR ++  PG Q+ L+  +    K P++++  GGG +D S  +A  ++ ++LW 
Sbjct: 505 TLEAEELDRANITWPGNQLELIHRLGELGK-PLVVLQMGGGQVDSSALKASEKVGALLWG 563

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRF 647
           GYPG+AG +AL +I+ G   P GRL  T YP  +  + P  DM++R       PG++Y +
Sbjct: 564 GYPGQAGGQALWDILTGQRAPAGRLTTTQYPAEYALQFPATDMSLRPRGDN--PGQTYMW 621

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH--IDE 705
           YTG  VY FGHGL YT ++   L+ P           G +       G+ L   H   + 
Sbjct: 622 YTGEPVYAFGHGLFYTTFATA-LAGP-----------GQEPERSFDIGALLARPHAGYNL 669

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV-AKGSK 764
           V     L F V+  VTN G+V   +  M FA      +  P K L+GFDR+  +  + S 
Sbjct: 670 VEQLPFLNFTVK--VTNTGEVISDYTAMAFANTTAGPRPHPNKWLVGFDRIGPLDPRVSA 727

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
            +S  V   + L+  +  G R++  G + L +   R
Sbjct: 728 RMSVPVS-LDSLARTDAQGNRVIYPGPYELALNNER 762


>gi|367028614|ref|XP_003663591.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347010860|gb|AEO58346.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 760

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 265/770 (34%), Positives = 380/770 (49%), Gaps = 69/770 (8%)

Query: 40  PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           PD  C      S   C+TS S   RA +L+S++   EK+  L +N+  + RLG+ AY+WW
Sbjct: 26  PD--CANGPLKSNTVCDTSASPGARAAALVSVMNNNEKLANLVNNSPGVSRLGLSAYQWW 83

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           +E+LHG+A N  G+ + G  S+ T FPQ + ++A+F+ +L   IG+ ++ EARA  N G+
Sbjct: 84  NEALHGVAHNR-GITWGGEFSAATQFPQAITTSATFDDALIEQIGTIISTEARAFANNGR 142

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           A L FW PN+N FRDPRWGRG ETPGED      +A  FVK  QG               
Sbjct: 143 AHLDFWTPNVNPFRDPRWGRGHETPGEDAFKNKKWAEAFVKGMQGP-------------- 188

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
                         G    + A CKH  AYDLE  G+ +R++F+A ++ QD  + + PPF
Sbjct: 189 --------------GPTHRVIATCKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPF 234

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFE 335
           + C    K   IMCSYN VN +PAC    L     R  W +     YI SDCDAV  +  
Sbjct: 235 QQCARDSKVGSIMCSYNAVNEIPACANPYLMDTILRKHWNWTDEHQYIVSDCDAVYYLGN 294

Query: 336 Y---QNYTKTHEDSAAGVLKAGMDINCGTC--MLRHTQSAIDKGKVQEKDIDRALLNLFS 390
                 Y  ++  +    L+AG D  C           SA + G+  +  +D A+L    
Sbjct: 295 ANGGHRYKPSYAAAIGASLEAGCDNMCWATGGTAPDPASAFNSGQFSQTTLDTAILRQMQ 354

Query: 391 VQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVS 450
             +  G F+G    G Y  L   DV T   +  AL AA  GIVLLKND   LPL+ N  +
Sbjct: 355 GLVLAGYFDGP--GGMYRNLSVADVNTQTAQDTALKAAEGGIVLLKND-GILPLSVNGSN 411

Query: 451 -SLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
             +A+IG   N   +M GGY+G P      +    +     +Y +G    P N D     
Sbjct: 412 FQVAMIGFWANAADKMLGGYSGSPPFNHDPVTAARSMGITVNYVNGPLTQP-NGDT--SA 468

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
           A+  A+K++ V+   G+D T E E +DR S+  P  Q++L+  +A T K PVI+V  G  
Sbjct: 469 ALNAAQKSNAVVFFGGIDNTVEKESQDRTSIEWPSGQLALIRRLAETGK-PVIVVRLGTH 527

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMN 628
             D         + +ILW GYPG+ G  A+ +II G  +P GRLP T YP S+T + P  
Sbjct: 528 VDDTPLLSI-PNVRAILWAGYPGQDGGTAVVKIITGLASPAGRLPATVYPSSYTSQAPFT 586

Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
           +M +R  SS  YPGR+YR+Y+   V+ FGHGL YTN+S      P+   I+         
Sbjct: 587 NMALRPSSS--YPGRTYRWYS-NAVFPFGHGLHYTNFSVSVRDFPASFAIA--------- 634

Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
           ++L   G  + Y+ +    S       V ++VTN G     +V + F          P K
Sbjct: 635 DLLASCGDSVAYLDLCPFPS-------VSLNVTNTGTRVSDYVALGFLSGDFGPSPHPIK 687

Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
            L  + RV  +  G  +++      E L   ++ G R+L  G + L+V +
Sbjct: 688 TLATYKRVFNIEPGETQVAELDWKLESLVRVDEKGNRVLYPGTYTLLVDQ 737


>gi|375150455|ref|YP_005012896.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361064501|gb|AEW03493.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 711

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 242/738 (32%), Positives = 363/738 (49%), Gaps = 109/738 (14%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           F N    +  R   L+  LTL EKI  L   +  + RLGIPAY WW+E+LHG+A  G   
Sbjct: 17  FRNPQQPMEARVNDLLHQLTLPEKISLLGYRSKEVERLGIPAYNWWNEALHGVARAG--- 73

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AA+FN  L     + ++ EARA YNL  A        GLTFW
Sbjct: 74  -------VATVFPQAIGMAATFNDDLLKEAATVISTEARAKYNLSLAQGRHLQYMGLTFW 126

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP + +     FVK  QG               + R LK
Sbjct: 127 SPNINIFRDPRWGRGQETYGEDPFLTAHMGTAFVKGLQG--------------NDPRYLK 172

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                         SAC KH   +   + G   R++FNA++ E+D  +T+   F + ++ 
Sbjct: 173 A-------------SACAKHFAVHSGPENG---RHTFNAIVDEKDLRETYLYAFHALVDA 216

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           G  S +MC+YN+VN  P C    L     RNEW FKG++ +DC A+  IF       +  
Sbjct: 217 GVES-VMCAYNRVNDQPCCSGNFLLNSILRNEWKFKGHVVTDCGALDDIFMRHKVMPSGV 275

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           + AA  +KAG++++C   + +  + A+++  + EKDID +L +L   Q++LG ++ DP  
Sbjct: 276 EVAAAAIKAGVNLDCSNVLQKDVEKAVEQKLLNEKDIDSSLAHLLRTQIKLGFYD-DPTA 334

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             + K G D V  + H  LA   A+Q +VLLKN  + LPL+K    ++ ++G    ++  
Sbjct: 335 NPFYKYGADSVANTAHATLARAMAQQSMVLLKNSNQLLPLDKKKYPAIMVVGTNSASMDA 394

Query: 465 MGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
           + G Y G+     S + G+   V   ++  Y  G       +D      +  A  AD  +
Sbjct: 395 LLGNYHGVSNRAVSFVEGITNAVDAGTRVEYDQGSD----YNDTTHFGGIWAAGNADITV 450

Query: 522 VVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
            V GL    E E+          D+  + LP   ++ + ++ + +K+P+I V+T G  +D
Sbjct: 451 AVIGLTPVYEGEEGDAFLAAKGGDKPDMSLPAAHIAFMKALRKANKKPIIAVITAGSAVD 510

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
           +S  E  +    + W  YPGE G  ALA+I+FG  +P GRLP+T+Y +SF  VP  D   
Sbjct: 511 ISAIEPYADAILLAW--YPGEQGGNALADILFGKVSPAGRLPVTFY-QSFADVPAYD--- 564

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
               +    GR+YR++ G   Y FG+GLSYT+++Y++   P+ +                
Sbjct: 565 ----NYAMKGRTYRYFNGKVQYPFGYGLSYTSFAYEWQQMPANIR--------------- 605

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
                         T+  S+ F   I V N G +DG  VV ++   P V +  P K+L  
Sbjct: 606 --------------TAKDSVSF--SIKVKNTGSMDGDEVVQVYVEYPAVER-MPLKELKA 648

Query: 753 FDRVHTVAKGSKEISFGV 770
           F RVH  A G + +   +
Sbjct: 649 FKRVHVKAGGEETVQLTI 666


>gi|326202986|ref|ZP_08192853.1| glycoside hydrolase family 3 domain protein [Clostridium
           papyrosolvens DSM 2782]
 gi|325987063|gb|EGD47892.1| glycoside hydrolase family 3 domain protein [Clostridium
           papyrosolvens DSM 2782]
          Length = 712

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 248/778 (31%), Positives = 378/778 (48%), Gaps = 119/778 (15%)

Query: 49  FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIAS 108
            D   + + SLS   RA  L+S +TL+EK  QL  +A  + RLGIP Y WW+E+LHG+A 
Sbjct: 1   MDKPKYLDKSLSFKERAADLVSKMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVAR 60

Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-------- 160
            G            T FPQ +  AA F+      I   +A E RA YN            
Sbjct: 61  AG----------VATVFPQAIGMAAMFDDEFLEKIADVIATEGRAKYNESAKKGDRDIYK 110

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           G+TFW+PN+NIFRDPRWGRG ET GEDP + S   V FVK  QG+               
Sbjct: 111 GITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD--------------- 155

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
                        G  L  +AC KH   Y +       R+ F+A+++++D  +T+ P F 
Sbjct: 156 -------------GKYLKTAACAKH---YAVHSGPEDDRHFFDAIVSQKDLYETYLPAFE 199

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
           + +++ K   IM +YN+ NG P      L +   R+ WGF G++ SDC A+    E    
Sbjct: 200 ALVKEAKVESIMGAYNRTNGEPCNGSKTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHGV 259

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           TKT  +S A  LK+G D+NCG   L     A+ +G + E+DIDRA + L + +++LG+F+
Sbjct: 260 TKTPTESVALALKSGCDLNCGNMYLL-ILLALKEGLITEEDIDRAAIRLMTTRMKLGMFD 318

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            D    ++  +  +   ++EH K++L+AA++ +VLLKND   LPL+   + ++A+IGP  
Sbjct: 319 DD---CEFDNIPYELNDSAEHNKISLEAAKKSMVLLKNDG-LLPLDSKKIKNVAVIGPNA 374

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEA 510
           ++   +   Y+G P    +++ G+   VS   +  YA G H      +     D    EA
Sbjct: 375 DSSLALRANYSGTPSQNVTIIEGIRKRVSENTRVWYAMGSHLFLNRDEDLAQPDDRLKEA 434

Query: 511 VRIAKKADFVIVVAGLDLTQETEDRD-----------RVSLLLPGQQMSLVTSVARTSKR 559
           V  A+++D V++  GLD + E E  D           +  L LP  Q +L+ +V  T K 
Sbjct: 435 VSAAERSDVVVLCLGLDASVEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGKP 494

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            ++ +L+G     +S  +A  + ++I+   YPG  G  A AE+IFGD++P GRLP+T+Y 
Sbjct: 495 TIVALLSGSA---LSIGDAADKAAAIVQCWYPGAIGGLAFAEMIFGDYSPAGRLPVTFYK 551

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
            +    P  D +M          R+Y+F  G  +Y FG GLSYT++ Y  +  P  +   
Sbjct: 552 STEELPPFADYSME--------NRTYKFMKGDALYPFGFGLSYTSFEYSNMVCPQTVNNG 603

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            +L                                 V + V N G VD   VV ++ +  
Sbjct: 604 ENLS--------------------------------VSVDVQNTGSVDSDEVVQVYIKDM 631

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             S   P+  L GF R+H  +   K ++F V     +SI ++ G+R +  G   L  G
Sbjct: 632 DASVRVPKYSLCGFKRIHLKSGEKKTVTFEV-ASNAMSIVDEAGKRHIENGEFTLYAG 688


>gi|240146254|ref|ZP_04744855.1| beta-glucosidase [Roseburia intestinalis L1-82]
 gi|257201613|gb|EEU99897.1| beta-glucosidase [Roseburia intestinalis L1-82]
 gi|291539969|emb|CBL13080.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
           XB6B4]
          Length = 710

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 255/769 (33%), Positives = 386/769 (50%), Gaps = 124/769 (16%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA  L+  +TL+EK+ Q    A A+ RL I AY WW+E+LHG+A  G          + T
Sbjct: 16  RAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAG----------TAT 65

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG--------LTFWAPNINIFRDP 175
            FPQ +  AA+F+  L   +G AV+ EARA +N+ Q G        LTFWAPN+NIFRDP
Sbjct: 66  VFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRDP 125

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQG--ENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           RWGRG ET GEDP + S   V +++  QG  EN+                          
Sbjct: 126 RWGRGHETFGEDPYLTSRLGVRYIEGLQGHDENY-------------------------- 159

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
              L  +AC KH   +   +     R+ F+A +TEQD  +T+ P F +C+++GK   +M 
Sbjct: 160 ---LKAAACAKHFAVHSGPE---AVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMG 213

Query: 294 SYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
           +YN+ NGVP C    L     R EWGF G++TSDC A+    E  + T T  +S A  + 
Sbjct: 214 AYNRTNGVPCCGNKRLLIDILRKEWGFSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMN 273

Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
            G D+NCGT +      A+ +G V+E+ +D A+ NLF  +++LG+F+    +  Y K+  
Sbjct: 274 NGCDLNCGT-LFGFLVQAVRQGLVKEERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPY 331

Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
               + E KKL    AR+ +VLLKN +  LPL+KN + ++ +IGP  ++   + G Y G 
Sbjct: 332 LAADSREMKKLNEAVARRTVVLLKNKEHILPLDKNKIKTVGVIGPNADSRRALVGNYEGT 391

Query: 473 PCSPKSLLRGLEAYVS---KTHYASGCH---DVPCN---SDAGFHEAVRIAKKADFVIVV 523
                ++L G+E YV    +  Y+ GCH   D   N    +    E + + K++D V+ V
Sbjct: 392 ASRYITVLEGIEDYVGDDVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAV 451

Query: 524 AGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
            GLD   E E+          D+  L LPG Q  ++ + A +  +PVILVL  G  L V+
Sbjct: 452 LGLDAGIEGEEGDAGNEYGSGDKPDLNLPGLQEEILEA-AVSCGKPVILVLLSGSALAVN 510

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           +  AD  + +I+   YPG  G  A+A+I+FG+ NP G+LP+T+Y  +       D +M+ 
Sbjct: 511 W--ADEHVDAIVQGWYPGARGGAAIADILFGEANPEGKLPVTFYRTTEELPDFEDYSMQ- 567

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY---KFLSAPSELTISASLKAGSDKNIL 691
                  GR+YR+     +Y FG+GLSYT Y+Y   +FL    E  +S  +  G      
Sbjct: 568 -------GRTYRYMEQEALYPFGYGLSYTEYAYQNVRFLE--QEPVVSEGVTIG------ 612

Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
                                     +SV N G +DG+  V ++ +        P  QL 
Sbjct: 613 --------------------------LSVKNTGKMDGTETVQVYVKAEHSKM--PHGQLK 644

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
              ++   A   KEI+  ++  E   + +++G +ILP G+  + VG ++
Sbjct: 645 KIVKLPLCAGEEKEINIRLE-SEAFMLYDENGEKILPSGHFEIFVGGMQ 692


>gi|297745533|emb|CBI40698.3| unnamed protein product [Vitis vinifera]
          Length = 461

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 202/399 (50%), Positives = 263/399 (65%), Gaps = 27/399 (6%)

Query: 154 MYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
           MYN+G AGLTFW+PN+NIFRDPRWGRGQETPGEDP++ S YA  +V+  Q    +SDDG 
Sbjct: 1   MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQ----QSDDGS 56

Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
                                D L ++ACCKH  AYDL+ W    R+ FNA++T+QD +D
Sbjct: 57  --------------------PDRLKIAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDD 96

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVAT 332
           TFQPPF+SC+  G  + +MCSYNQVNG PAC   DL     R EW   GYI SDCD+V  
Sbjct: 97  TFQPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDV 156

Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
            +  Q+YTKT E++AA  + AG+D+NCG+ + +HT++A+  G V E  +D+A+ N F+  
Sbjct: 157 FYNSQHYTKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATL 216

Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
           +RLG F+G+P K  YGKLGP DVCT EH++LA +AARQGI+LLKN K  LPL+  A+ +L
Sbjct: 217 MRLGFFDGNPSKAIYGKLGPKDVCTLEHQELAREAARQGIMLLKNSKGSLPLSPTAIKTL 276

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
           AIIGP  N    M G Y G PC   + L+GL A V+ T Y SGC +V C S A   EA +
Sbjct: 277 AIIGPNANVTKTMIGNYEGTPCKYTTPLQGLMALVATT-YLSGCSNVAC-STAQIDEAKK 334

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
           IA  AD  +++ G+D + E E RDRV++ LPGQQ  L+T
Sbjct: 335 IAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLIT 373


>gi|291537442|emb|CBL10554.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
           M50/1]
          Length = 710

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 255/769 (33%), Positives = 386/769 (50%), Gaps = 124/769 (16%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA  L+  +TL+EK+ Q    A A+ RL I AY WW+E+LHG+A  G          + T
Sbjct: 16  RAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAG----------TAT 65

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG--------LTFWAPNINIFRDP 175
            FPQ +  AA+F+  L   +G AV+ EARA +N+ Q G        LTFWAPN+NIFRDP
Sbjct: 66  VFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIFRDP 125

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQG--ENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           RWGRG ET GEDP + S   V +++  QG  EN+                          
Sbjct: 126 RWGRGHETFGEDPYLTSRLGVRYIEGLQGHDENY-------------------------- 159

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
              L  +AC KH   +   +     R+ F+A +TEQD  +T+ P F +C+++GK   +M 
Sbjct: 160 ---LKAAACAKHFAVHSGPE---AVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMG 213

Query: 294 SYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
           +YN+ NGVP C    L     R EWGF G++TSDC A+    E  + T T  +S A  + 
Sbjct: 214 AYNRTNGVPCCGNKRLLIDILRKEWGFSGHVTSDCWAIRDFHEGHHVTGTAIESVAMAMN 273

Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
            G D+NCGT +      A+ +G V+E+ +D A+ NLF  +++LG+F+    +  Y K+  
Sbjct: 274 NGCDLNCGT-LFGFLVQAVRQGLVKEERLDEAVTNLFMARMKLGVFD-KKEENPYDKIPY 331

Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
               + E KKL    AR+ +VLLKN +  LPL+KN + ++ +IGP  ++   + G Y G 
Sbjct: 332 LAADSREMKKLNEAVARRTVVLLKNKEHILPLDKNKIKTIGVIGPNADSRRALVGNYEGT 391

Query: 473 PCSPKSLLRGLEAYVS---KTHYASGCH---DVPCN---SDAGFHEAVRIAKKADFVIVV 523
                ++L G+E YV    +  Y+ GCH   D   N    +    E + + K++D V+ V
Sbjct: 392 ASRYITVLEGIEDYVGDDVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAV 451

Query: 524 AGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
            GLD   E E+          D+  L LPG Q  ++ + A +  +PVILVL  G  L V+
Sbjct: 452 LGLDAGIEGEEGDAGNEYGSGDKPDLNLPGLQEEILEA-AVSCGKPVILVLLSGSALAVN 510

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           +  AD  + +I+   YPG  G  A+A+I+FG+ NP G+LP+T+Y  +       D +M+ 
Sbjct: 511 W--ADEHVDAIVQGWYPGARGGAAIADILFGEANPEGKLPVTFYRTTEELPDFEDYSMQ- 567

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY---KFLSAPSELTISASLKAGSDKNIL 691
                  GR+YR+     +Y FG+GLSYT Y+Y   +FL    E  +S  +  G      
Sbjct: 568 -------GRTYRYMEQEALYPFGYGLSYTEYAYQNVRFLE--QEPVVSEGVTIG------ 612

Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
                                     +SV N G +DG+  V ++ +        P  QL 
Sbjct: 613 --------------------------LSVKNTGKMDGTETVQVYVKAEHSKM--PHGQLK 644

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
              ++   A   KEI+  ++  E   + +++G +ILP G+  + VG ++
Sbjct: 645 KIVKLPLCAGEEKEINIRLE-SEAFMLYDENGEKILPSGHFEIFVGGMQ 692


>gi|255690205|ref|ZP_05413880.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
 gi|260624224|gb|EEX47095.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            finegoldii DSM 17565]
          Length = 1425

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 260/761 (34%), Positives = 377/761 (49%), Gaps = 109/761 (14%)

Query: 52   YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
            YPF N  LSI  R   L+S LTL+EK++Q+ +NA AI RLGIPAY WW+E LHG+     
Sbjct: 712  YPFRNPQLSIEQRVDDLVSRLTLEEKVRQMLNNAPAIKRLGIPAYNWWNECLHGVGR--- 768

Query: 112  GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
                  T   VT FPQ +  AAS+N  L   + S++A E RA+YN  Q          LT
Sbjct: 769  ------TKYHVTVFPQAIGMAASWNDVLMKEVASSIADEGRAIYNDAQKRGDYSQYHALT 822

Query: 164  FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            +W PNINIFRDPRWGRGQET GEDP + S     FV   QG+              + R 
Sbjct: 823  YWTPNINIFRDPRWGRGQETYGEDPYLTSKIGKAFVLGLQGD--------------DPRY 868

Query: 224  LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
            LK              SAC KH   Y +      +R+SFN+ ++  D  DT+ P FR+ +
Sbjct: 869  LKA-------------SACAKH---YAVHSGPEKNRHSFNSDVSTYDLWDTYLPAFRTLV 912

Query: 284  EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
                 S +MC+YN   G P C    L Q   R++W FKGY+TSDC A+  IF +      
Sbjct: 913  VDANVSGVMCAYNAFKGQPCCGNDLLMQSILRDKWNFKGYVTSDCGAIDDIFNHHKAHPD 972

Query: 343  HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
               +AA  +  G D++CG         A+  G + EK +D ++  LF+++ RLGLF  DP
Sbjct: 973  AATAAADAVFHGTDLDCGQSAYLALVKAVKNGIITEKQLDVSVKRLFTIRFRLGLF--DP 1030

Query: 403  -RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
              +  Y  +    +   +H+ LA   AR+ +VLLKND + LPL KN +  + ++GP  + 
Sbjct: 1031 AEQVDYAHIPISVLECKKHQDLAKQLARESMVLLKND-RLLPLQKNKLKKVVVMGPNADC 1089

Query: 462  ISQMGGGYTGIPC---SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKAD 518
               + G Y G P    +P   +R     V++  Y SG   +   S+      V  AK AD
Sbjct: 1090 KDALLGNYNGHPSRMLTPLQAIRERLKGVAEVVYVSGIDYINTVSEDELKRYVNQAKGAD 1149

Query: 519  FVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
             VI + G+    E E+           DR S+ LP  Q  L+ ++    + P + V+  G
Sbjct: 1150 AVIFIGGISPRLEGEEMSVNKDGFDGGDRTSIALPTVQTQLMKALV-AGRIPTVFVMMTG 1208

Query: 569  GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMN 628
              L + + EA   + +IL   Y G+ G +A+A+++FGD+NP G+LP+T+Y +        
Sbjct: 1209 SALAIPW-EA-KHVPAILNAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAK-------- 1258

Query: 629  DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
            D ++    S    GR+YR++ G  +Y FG+GLSYT++ Y  L  P+           +DK
Sbjct: 1259 DSDLPDFESYDMQGRTYRYFKGKALYPFGYGLSYTDFRYSSLKMPTACNT-------TDK 1311

Query: 689  NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
             I                         V ++V N G +DG  VV L+   P      P  
Sbjct: 1312 EI------------------------PVTVTVKNTGKMDGEEVVQLYVSHPDKKILVPVT 1347

Query: 749  QLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG-RRILP 788
             L GF R++  A  +K+I+F +   E LS  +++G R++LP
Sbjct: 1348 ALKGFKRIYLKAGEAKQITFSLS-SEDLSCVDENGIRKVLP 1387


>gi|30316196|sp|P83344.1|XYNB_PRUPE RecName: Full=Putative beta-D-xylosidase; AltName: Full=PpAz152
 gi|19879972|gb|AAM00218.1|AF362990_1 beta-D-xylosidase, partial [Prunus persica]
          Length = 461

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 202/463 (43%), Positives = 296/463 (63%), Gaps = 12/463 (2%)

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  +KAG+D++CG  +  HT++A+ +G V + +I+ AL N  +VQ+RLG+F+G+P   +Y
Sbjct: 1   ADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQY 60

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
           G LGP DVCT  H++LAL+AARQGIVLL+N  + LPL+     ++A+IGP  +    M G
Sbjct: 61  GNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSTRRHRTVAVIGPNSDVTVTMIG 120

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
            Y G+ C   + L+G+  Y    H A GC DV CN +  F  A   A++AD  ++V GLD
Sbjct: 121 NYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADATVLVMGLD 179

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            + E E  DR  LLLPG Q  LV+ VAR S+ P ILVL  GGP+DV+FA+ D +IS+I+W
Sbjct: 180 QSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIW 239

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYR 646
           +GYPG+AG  A+A ++FG  NPGG+LPMTWYP+++ T +PM DM MRAD +R YPGR+YR
Sbjct: 240 VGYPGQAGGTAIANVLFGTANPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYR 299

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVHIDE 705
           FY G  V+ FG GLSYT +++     P+ +++   SLKA ++  +L +T   +   H D 
Sbjct: 300 FYIGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKT---VRVSHPD- 355

Query: 706 VTSCTSLR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK 764
              C +L    V + V N G +DG+H +++F   P   +    KQL+GF ++H      K
Sbjct: 356 ---CNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPD-GKWASSKQLMGFHKIHIATGSEK 411

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
            +   V  C+ LS+ ++ G R +PLG H L +G+L H ++++T
Sbjct: 412 RVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSLQT 454


>gi|333381510|ref|ZP_08473192.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332830480|gb|EGK03108.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 738

 Score =  390 bits (1003), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 260/779 (33%), Positives = 385/779 (49%), Gaps = 126/779 (16%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF +T LS   R   L+S LTL+EK+ Q+ +N  AI RL IPAY WW+E LHGI     
Sbjct: 24  YPFRDTKLSTDKRVSDLVSRLTLEEKVLQMLNNTPAIERLNIPAYNWWNECLHGIGR--- 80

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
                 T   VT FPQ +  AA+++  L  ++ +A++ E RA+YN   A        GLT
Sbjct: 81  ------TEYKVTVFPQAIGMAAAWDARLLKDVANAISDEGRAIYNDASAKGNYSIYHGLT 134

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           +W PN+NIFRDPRWGRGQET GEDP +  A    FV   QG                   
Sbjct: 135 YWTPNVNIFRDPRWGRGQETYGEDPYLTGALGKSFVAGLQG------------------- 175

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D    L  +AC KH   Y +      +R++FN  +T  D  DT+ P FR  +
Sbjct: 176 --------DDSQYLKAAACAKH---YAVHSGPENTRHTFNTFVTTFDLWDTYLPAFRDLV 224

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
              K + +MC+YN  +G P C    L Q+  R++WGF GY+TSDC A+   + +    KT
Sbjct: 225 VDAKVAGVMCAYNAFSGEPCCGNNLLMQEILRDKWGFTGYVTSDCGAIDDFYRHH---KT 281

Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H D   +AA  + +G DI+CG    +    A+  G + E+ ID +L  LF ++ RLG+F 
Sbjct: 282 HPDAKYAAADAVYSGTDIDCGNEAYKALVDAVKTGLITEEQIDISLKRLFEIRFRLGMF- 340

Query: 400 GDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
            DP +  K+ K+    + +  HK LAL   R+ IVLLKN+  FLPL+K  +  +A+IGP 
Sbjct: 341 -DPAEDVKFSKIPLSVLESQPHKDLALKITRESIVLLKNENNFLPLSKK-LKKVAVIGPN 398

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKK 516
            +N   + G Y G P    +  + ++  +  T   Y  G  D    S+    E   +AK+
Sbjct: 399 ADNEVSVLGNYNGFPTQIITPYKAIKNKLKNTEVIYEKGI-DFVKPSENSKEEIAALAKR 457

Query: 517 ---ADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVIL 563
               D VI   G+    E E+           DR S+ LP  Q  L+ ++ +  + P + 
Sbjct: 458 LKGMDVVIFAGGISPELEGEEMPVKIEGFTGGDRTSIKLPKIQTELMQAL-KAERIPTVF 516

Query: 564 VLTGGGPLDVSFAEADSQ-ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 622
           V+  G  +    AE +SQ + +IL   Y G+    A+A+++FGD+NP G+LP+T+Y +  
Sbjct: 517 VMMTGSAIA---AEWESQNVPAILNAWYGGQDAGTAIADVLFGDYNPSGKLPVTFYTK-- 571

Query: 623 TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASL 682
                 D ++ A +S +   R+YR++ G  +Y FG+GLSYT + Y  +  P      AS+
Sbjct: 572 ------DSDLPAFNSYEMKNRTYRYFDGQVLYPFGYGLSYTKFEYSPIQMP------ASI 619

Query: 683 KAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
           KAG +                            V I+V N G  DG  VV L+  +   +
Sbjct: 620 KAGEN--------------------------MEVSITVKNTGKTDGEEVVQLY--ISHDN 651

Query: 743 QGT----PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            GT    P   L  F+R+   A  SK ++F + P E +++A++ G   +  G   L +G
Sbjct: 652 NGTNRQLPLYALKSFERISLKAGESKSVTFKLSPRE-MALADEDGVLKMTKGKSKLYIG 709


>gi|347531439|ref|YP_004838202.1| beta-glucosidase [Roseburia hominis A2-183]
 gi|345501587|gb|AEN96270.1| beta-glucosidase [Roseburia hominis A2-183]
          Length = 716

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 249/767 (32%), Positives = 380/767 (49%), Gaps = 111/767 (14%)

Query: 58  SLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNG 117
           SL     AK L+  +TL+EKI Q+   + AI RL IPAY WW+E+LHG+A +G       
Sbjct: 2   SLETKEYAKRLVEQMTLEEKISQMRYESPAIERLHIPAYNWWNEALHGVARSG------- 54

Query: 118 TVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNI 169
                T FPQ +  AA+F+  L   IG  V+ E RA +             GLTFWAPNI
Sbjct: 55  ---VATMFPQAIALAATFDEELIEKIGDVVSTEGRAKFEAYSGRGDRGIYKGLTFWAPNI 111

Query: 170 NIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGE 229
           NIFRDPRWGRG ET GEDP + +     +++  QG++                       
Sbjct: 112 NIFRDPRWGRGHETYGEDPCLTAKLGCAYIRGIQGKD----------------------- 148

Query: 230 ESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKAS 289
                D L  +AC KH   + +       R+ F+A ++  D  DT+   F+ C++     
Sbjct: 149 ----PDHLKAAACAKH---FAVHSGPEALRHEFDAKVSLHDLYDTYLYAFKRCVKDAGVE 201

Query: 290 CIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
            +M +YN+VNG PAC    L Q   R ++GF+G++ SDC A+    E+ + TKT E+SAA
Sbjct: 202 AVMGAYNRVNGEPACGSKTLLQDILREQFGFEGHVVSDCWAILDFHEHHHVTKTVEESAA 261

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             +  G D+NCG   L +   A ++G V+EK I  A+  L  V++RLG+    P    Y 
Sbjct: 262 MAVNHGCDLNCGKAFL-YLSRACEQGLVEEKTITEAVERLMDVRIRLGMMEDYP--SPYA 318

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            +  D V   EH  L+L+A+++ +VLLKND  FLPL +  V ++A+IGP  N+ + + G 
Sbjct: 319 NIPYDVVECPEHIALSLEASKRSMVLLKNDNHFLPLKQEQVHTIAVIGPNANSRAALVGN 378

Query: 469 YTGIPCSPKSLLRGLEAYV---SKTHYASGCH------DVPCNSDAGFHEAVRIAKKADF 519
           Y G      + L G++ Y    ++  YA GCH      +        F EA+  A++AD 
Sbjct: 379 YEGTSSRYITPLEGIQEYTGEKTRVLYAQGCHLYKDQVEFLGEPKDRFKEALIAAERADV 438

Query: 520 VIVVAGLDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           +++  GLD   E E+ D         ++ L LPG Q  L+ +VA   K P++L +  G  
Sbjct: 439 IVMCLGLDAGIEGEEGDAGNEYASGDKLGLKLPGLQQELLEAVAAVGK-PIVLTVLAGSA 497

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDM 630
           LD+S+A+  +QI +IL   YPG  G KA+AE +FG+F+P G+LP+T+Y  +       D 
Sbjct: 498 LDLSWAQEHAQIRAILDCWYPGARGGKAIAEALFGEFSPCGKLPVTFYEGTEFLPDFTDY 557

Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
           +M         GR+YR+     +Y FG+GL+Y+   Y    A +++T             
Sbjct: 558 SM--------AGRTYRYTDRHVLYPFGYGLTYSQIRYS--DAHADVT------------- 594

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
                   D+  ++ VT        V ++V N G       V ++ R  +     P  QL
Sbjct: 595 --------DFGILEPVT--------VHVTVENTGTYPVQEAVQVYVRFSEREAYDPGYQL 638

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            G   V       KE+   + P    ++ ++ G+ ++  G++ + VG
Sbjct: 639 KGIRSVALECGEKKEVCITLSP-RDFALISEEGKCLVHPGSYEIAVG 684


>gi|373955483|ref|ZP_09615443.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373892083|gb|EHQ27980.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 738

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 259/779 (33%), Positives = 388/779 (49%), Gaps = 118/779 (15%)

Query: 48  HFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA 107
           H   YPF N +LS+  R   L+  +TL+EK+ Q+ ++A AI RLG+PAY WW+E LHG+A
Sbjct: 27  HTYKYPFNNPALSMDERVADLVGRMTLEEKVSQMLNSAPAIERLGVPAYNWWNECLHGVA 86

Query: 108 SNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------Q 159
                     T   VT +PQ +  AA+++++    +G   A E RA+YN           
Sbjct: 87  R---------TPFKVTVYPQAIAMAATWDKTSMHVMGDYTAEEGRAVYNESIKNDKHDIY 137

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
            GLT+W PNINIFRDPRWGRGQET GEDP +       FVK  QG+              
Sbjct: 138 LGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGEMGSAFVKGLQGD-------------- 183

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
           + R LK  G             C KH   Y +       R+ FN  I++ D  DT+ P F
Sbjct: 184 DPRYLKAAG-------------CAKH---YAVHSGPEDLRHKFNTDISDYDLWDTYLPAF 227

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA--RNEWGFKGYITSDCDAVATIFEYQ 337
           R  +   K + +MC+YN   G P C   DL   +   ++W F GY+TSDC  +   +  +
Sbjct: 228 RKLVVDAKVTGVMCAYNAFKGQPCC-GSDLLMNSILHDKWKFTGYVTSDCGGIDDFYR-E 285

Query: 338 NYTKTHED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           N  +T  D   +AA  +  G D+ CG    +    A+  GK+ EK ID++L  LFSV+ +
Sbjct: 286 NTHQTQPDAESAAADAVLHGTDVECGNVTYKSLVKAVKDGKLSEKQIDQSLKRLFSVRFK 345

Query: 395 LGLFNGDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           LG+F  DP    KY ++G D +    H   AL  A Q IVLLKN+   LPL+KN +  +A
Sbjct: 346 LGMF--DPADAVKYNQIGKDALEAPAHGAQALKMAHQSIVLLKNEGNLLPLSKN-LKKIA 402

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEA 510
           ++GP  +N   + G Y G P    + L+G++  +   ++  Y     D   +S A ++ A
Sbjct: 403 VLGPNADNAVSVLGNYNGTPSRIVTALQGIKNKLPAGTEVIYDKAV-DYVADSAARYNYA 461

Query: 511 VRIA--KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
              A  K AD +I + G+    E E+           DR ++LLPG Q  L+ ++  T K
Sbjct: 462 AMAAKVKDADAIIYIGGISPELEGEEMPVSKPGFHGGDRSTILLPGVQTELLKALKATGK 521

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
            PV+ V+  G  +   + EA++ + +I+   Y G+A   A+A+++FGD+NP GRLP+T+Y
Sbjct: 522 -PVVFVMMTGSAIATPW-EAEN-LPAIVNAWYGGQAAGTAIADVLFGDYNPAGRLPVTFY 578

Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
                    +D ++ + +      R+YR++ G  +Y FG+GLSY+ + Y  L AP     
Sbjct: 579 G--------SDKDLPSFTDYSMDNRTYRYFKGKPLYAFGYGLSYSKFEYAPLDAP----- 625

Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
             +LKAG                         +L  HV+  VTN   +DG  V  L+   
Sbjct: 626 -LTLKAGE------------------------ALTVHVK--VTNKSKMDGEEVTELYLSH 658

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             + Q T  + L GF+R    A  +K+I+F +   + LSI + +G  +   G   + VG
Sbjct: 659 IGIKQKTAIRALKGFERTLIKAGETKDITFKLSSAD-LSITDLNGNLVKASGKIAISVG 716


>gi|313202830|ref|YP_004041487.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312442146|gb|ADQ78502.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 742

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 257/780 (32%), Positives = 387/780 (49%), Gaps = 114/780 (14%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF +TS +I  R K L+S LTL EK  Q+  NA AI RLGI  Y WW+E+LHG+A  G 
Sbjct: 38  YPFQDTSKTIDERVKDLVSRLTLDEKAGQMLHNAPAIKRLGILPYSWWNEALHGVARTG- 96

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLT 163
                      T FP+ +  AA+F+  L   IG A++ EA A YN+ Q        +G+T
Sbjct: 97  ---------RATVFPENVGLAATFDEDLVYRIGQAISDEAWAKYNIAQRLENYGQYSGIT 147

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           F+APN+NIFRDPRWGRGQET GEDP + S   V +VK  QG + K               
Sbjct: 148 FYAPNVNIFRDPRWGRGQETYGEDPFLTSRMGVAYVKGMQGNDPKY-------------- 193

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                        L  +AC KH   Y +       R+S++A    +D  +T+ P F + +
Sbjct: 194 -------------LKTAACAKH---YVVHSGPEALRHSYDAEPPMKDFMETYVPAFETLV 237

Query: 284 EQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNY 339
           ++GK   +MC+YN+  G P C    L  DL    R +WGF GY+T+DC A+   + +   
Sbjct: 238 KEGKVESVMCAYNRTFGKPCCGSSFLLHDLL---REKWGFTGYVTTDCWAIQNFYLHHGA 294

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
            K   ++ A  +K+G+++NCG     +  +A+ KG V EK++D AL  L   + RLGLF+
Sbjct: 295 AKDSLEACALAIKSGVNLNCGN-EFNYLPAAVRKGLVTEKEVDEALSQLLRTRFRLGLFD 353

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
             P +  Y K+  + + + ++  LA +AA + +VLL+N    LPL K+ + SL ++GP  
Sbjct: 354 -SPNENPYAKIKEEVIGSQQNIDLAYEAAAKSLVLLQNKNNTLPLKKD-MKSLYVVGPYA 411

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHE--AVRIA 514
            N   + G Y G+     ++++ +   VS     +Y  G      N ++  +       A
Sbjct: 412 ANQDILLGNYNGVNSRLTTIMQAIVGKVSAGTSVNYRIGVEPSAPNKNSMNYSIGEAADA 471

Query: 515 KKADFVIVVAGL------DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
                V  ++G+      + T  T   DR+ L LP  Q+  +  + +  K+P+ILVLTGG
Sbjct: 472 DAVVAVFGISGVFEGEEGESTASTSRGDRLDLNLPQNQLDYLRELKKKCKKPIILVLTGG 531

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-M 627
            P  +   E    + +IL++ YPG+ G  A+A++IFGD NP GRL +T +P+S +++P  
Sbjct: 532 SP--ICTPELADMVDAILFVWYPGQEGGHAVADVIFGDVNPSGRLCIT-FPKSVSQLPAF 588

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
            D +M+        GR+YR+ T   +Y FG GLSYTNYSY  +    +            
Sbjct: 589 EDYSMK--------GRTYRYMTEEPLYPFGFGLSYTNYSYSNIKTDKDKIKKGQ------ 634

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
                                      HV  +V+N G   G  V  L+    K S  TP 
Sbjct: 635 -------------------------SVHVTATVSNTGKTAGEEVAQLYITDVKASAPTPL 669

Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGN-HVLMVGELRHSLTIE 806
             L G  RV   A  SKE+SF V P + + +    G +++  G+  V + G    +L+I+
Sbjct: 670 YALKGTKRVKLAAGESKEVSFEVTP-QMMELVTVTGEKVIEPGDFKVYIAGSTPSALSIK 728


>gi|310797011|gb|EFQ32472.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 767

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 268/765 (35%), Positives = 388/765 (50%), Gaps = 79/765 (10%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+ +LS   RA +L+  LT++EK+Q L   A   PR+G+PAY WWSE+LHG+A   PG  
Sbjct: 43  CDRTLSPPERAAALVKALTVEEKLQNLVSKAQGAPRIGLPAYNWWSEALHGVAY-APGTY 101

Query: 115 F-NGTV--SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F  G V  +S TS+P  L+ AA+F+  L   IG+A+ +EARA  N G AGL +W PN+N 
Sbjct: 102 FPEGDVEFNSSTSYPMPLLMAAAFDDELIEQIGAAIGIEARAWGNAGWAGLDYWTPNVNP 161

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           F+DPRWGRG ETPGED + V  YA    +   G               ++RV+       
Sbjct: 162 FKDPRWGRGSETPGEDVLRVKRYAEYITRGLDGP----------VPGEQRRVI------- 204

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                    + CKH    D E W   SR+ F+A IT QD  + +  PF+ C    K   I
Sbjct: 205 ---------STCKHYAGNDFEDWNGTSRHDFDAKITAQDLAEYYLMPFQQCARDSKVGSI 255

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSA 347
           MC+YN VNGVP+C    L Q   R  W +     Y+TSDC+AV  +     Y  T+    
Sbjct: 256 MCAYNAVNGVPSCANEYLLQNILREHWNWTEHNNYVTSDCEAVLDVSANHKYAPTNAAGT 315

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A   +AGMD +C          A  +G ++E+ +DRALL L+   +R G F+G   +  Y
Sbjct: 316 AICFEAGMDTSCEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDG--HEAIY 373

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            KLG  DV ++E + LAL AA +GIVLLKN+   LPL+      +A+IG   +   ++ G
Sbjct: 374 AKLGWKDVNSAEAQSLALQAAVEGIVLLKNNGT-LPLDLKPSHKVAMIGFWADAPDKLQG 432

Query: 468 GYTGIPC---SPKSLLRGLEAYVSKTHYASGCHDVPCN-SDAGFHEAVRIAKKADFVIVV 523
           GY+G      +P    R L   ++    ASG      N SD     A+  A+ AD+++  
Sbjct: 433 GYSGRAAHLHTPAYAARQLGLDIT---LASGPVLQRNNASDNWTAAALEAAEGADYILYF 489

Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
            GLD +   E  DR  L  P  Q+ L+  ++   K P+++ L G    D    + D ++S
Sbjct: 490 GGLDTSAAGETLDRTDLEWPEAQLMLIKKLSALGK-PLVVNLLGDQLDDTPLLQLD-EVS 547

Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPG 642
           SILW  +PG+ G  A+ ++I G+ +P GRLP+T YP ++T  +PM  M++R  S  QYPG
Sbjct: 548 SILWANWPGQDGGVAIMKLITGEKSPAGRLPVTQYPSNYTDLIPMTSMDLRPTS--QYPG 605

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDKNILQQTGSRLDYV 701
           R+YR+Y    +  FG GL YT +  +   A P  L I+  +  G++              
Sbjct: 606 RTYRWYD-KPIKRFGFGLHYTTFKAEVGGAFPKTLRIADLVGCGNE-------------- 650

Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
           H D   +C +    V  S+TN G+    +V + +       +  P K L  + R+  VA 
Sbjct: 651 HPD---TCPAPPLPV--SITNTGNRTSDYVALAYLSGEYGPRPYPIKTLSAYKRLRDVAP 705

Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           G       VD    L    +H  +    GN VL  GE  +++TI+
Sbjct: 706 GETAT---VDLAWTLGDIARHDEQ----GNTVLYPGE--YTITID 741


>gi|366163035|ref|ZP_09462790.1| glycoside hydrolase family 3 [Acetivibrio cellulolyticus CD2]
          Length = 705

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 249/765 (32%), Positives = 376/765 (49%), Gaps = 126/765 (16%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +A+ L++ +TL+EK  QL+ N+ AI RLGIPAY WW+E+LHG+A  G          + T
Sbjct: 10  KAEELVAQMTLEEKASQLTYNSPAIERLGIPAYNWWNEALHGVARAG----------TAT 59

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +  AA F+      I +A+A+EARA YN            GLT W+PNINIFRDP
Sbjct: 60  VFPQAIGLAAMFDDEFLMKIANAIAIEARAKYNESSKHGDRDIYKGLTIWSPNINIFRDP 119

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP +     V F+K  QG+                             D
Sbjct: 120 RWGRGHETYGEDPFLSGKLGVAFIKGLQGDK----------------------------D 151

Query: 236 ELMLSACCKHLIAY----DLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
            +M +AC KH  AY    DL       R+ FNA +T++D  +T+ P F +C++  K   +
Sbjct: 152 VMMTAACVKHFAAYSGPEDL-------RHGFNAEVTKKDLWETYLPAFETCVKDAKVEAV 204

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
           M  YN+ NG P C    L +   R +WGF+G++ SDC A+         TKT E+S A  
Sbjct: 205 MGGYNRTNGEPCCGSYTLLRDILREKWGFEGHVVSDCWAIKDFHTDHMVTKTPEESVALA 264

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
           + AG D+NCG  M      A+ +G + E+ I RA + +F+ + +LGLF G     ++  +
Sbjct: 265 IDAGCDLNCGN-MYLMLLIALQEGLITEEHITRAAVRIFTTRFKLGLFEG----SEFDNI 319

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
             + V  SEHK++A++AAR+  VLLKND   LP+NK A+ ++ +IGP  N+   + G Y 
Sbjct: 320 PYEVVECSEHKEMAIEAARKSAVLLKND-GILPINKGAIKTIGVIGPNANSRIALKGNYH 378

Query: 471 GIPCSPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVI 521
           G      +LL G++  V    +  Y++GC       +V   ++    EAV +A+ +D V+
Sbjct: 379 GTSSRYITLLEGIQDEVGDEVRVLYSNGCELVKDRTEVLAYANDRLAEAVTVAEHSDLVV 438

Query: 522 VVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
           +  GLD T E E           D+  L LP  Q SL+  +  T K P +L L  G  ++
Sbjct: 439 LCLGLDETIEGEQSDEGNNGGSGDKKDLDLPEVQKSLLEKIVATGK-PTVLCLMAGSAIN 497

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
           +S+A      + IL   YPG  G KA+A+I+FG+ +P G+LP+T+Y       P+ D +M
Sbjct: 498 LSYAH--EHCNGILLTWYPGARGGKAVADILFGNASPSGKLPVTFYRSLDNLPPITDYSM 555

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           +         R+YR+     +Y FG+GL+Y +   K +     + I        +K+I  
Sbjct: 556 K--------NRTYRYIEEAPLYPFGYGLTYGDVELKHVEIKGTVEI--------EKDI-- 597

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
                                 ++ +++ N G V    VV  + +  +         L  
Sbjct: 598 ----------------------YITVTLQNRGSVAVEEVVQAYIKDEQSMYAVTNTSLCA 635

Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           F RV   A   K++S  + P + L + N  G ++L      L  G
Sbjct: 636 FMRVGLGANEEKQVSMRI-PFDSLKVVNLDGEKVLDSKKFTLFAG 679


>gi|410617070|ref|ZP_11328046.1| beta-glucosidase [Glaciecola polaris LMG 21857]
 gi|410163339|dbj|GAC32184.1| beta-glucosidase [Glaciecola polaris LMG 21857]
          Length = 731

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 255/768 (33%), Positives = 377/768 (49%), Gaps = 110/768 (14%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +  +S + RA  L++ +T+ EKI QLS    AI RL +P Y WW+E+LHGIA NG   
Sbjct: 29  WFDPDISFAQRANLLVNAMTVDEKIAQLSHATPAIARLNVPQYNWWNEALHGIARNG--- 85

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFW 165
                    T FPQ +  AA+F+  L   + SA++ EARA Y + Q        AGLTFW
Sbjct: 86  -------KATIFPQAIGLAATFDPDLAHQVASAISDEARAKYAIAQSIGNQGQYAGLTFW 138

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
            PN+NIFRDPRWGRGQET GEDP + +     FVK  QG++ K          +   V K
Sbjct: 139 TPNVNIFRDPRWGRGQETYGEDPFLTAQMGTAFVKGLQGDDPKY--------LKSAGVAK 190

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
            F   S  G E +                    R+ F+   +++D  +T+ P F + + Q
Sbjct: 191 HFAVHS--GPESL--------------------RHHFDVEPSQKDLYETYLPAFEALVTQ 228

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
            K + +MC+YN VNG PAC    L     + +WGF GYI SDC A+         TK+  
Sbjct: 229 AKVAGVMCAYNAVNGEPACASAQLLDGILKKQWGFHGYIVSDCGALNDFQAGHKVTKSGP 288

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           +SAA  L++G+++NCG+      ++A+++  V  + ID+ L  L  ++ +LG F  DP  
Sbjct: 289 ESAALALQSGVNLNCGSTYEHFLKAALEQNLVPLELIDQRLTQLLMIRFQLGFF--DPAG 346

Query: 405 -GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
              Y ++ PD + + EH  L+ D AR+ IVLLKND   LPL+K+ +    + GP   +  
Sbjct: 347 LNPYNEVTPDVIHSPEHINLSRDVARKSIVLLKNDNHVLPLSKD-IKVPYVTGPFAASSD 405

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAG-FHEAVRIAKKADF 519
            + G Y GI  S  S+L G+   V   S  +Y SG   +P +++    + A ++AK AD 
Sbjct: 406 MLIGNYYGISDSLVSVLEGIAGKVSLGSSLNYRSGS--LPFHNNINPLNWAPQVAKTADA 463

Query: 520 VIVVAGLDLTQETE--------DR-DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           VI V G+    E E        DR DRV++ LP  Q+  V  +A   K P+ILV+  G P
Sbjct: 464 VIAVVGVSADMEGEEVDAIASADRGDRVAITLPQNQVDYVKQLAAHKKGPLILVVAAGSP 523

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDM 630
           +D+S  + +    +ILWI YPGE G  A+A+++FGD NP G LP+T+        P +D 
Sbjct: 524 VDIS--DLEPLADAILWIWYPGEQGGNAVADVLFGDTNPSGHLPLTFVKSIDDLPPFDDY 581

Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
            M         GR+Y+F     +Y FG G SYT +S+      ++LT+S           
Sbjct: 582 AMT--------GRTYKFLEKAPLYPFGFGRSYTEFSF------NDLTVS----------- 616

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
                         +  +       + + V N GD+ G  VV  +              L
Sbjct: 617 --------------QGKAIEGEALTLSVEVENRGDIAGETVVQAYLSPIARMNNEAISSL 662

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
             F R+H   K ++ +   +   + L   N  G  + P G + L VG+
Sbjct: 663 KSFKRIHLAPKETRWVELTIQ-GKDLYQVNNAGETVWPQGRYSLAVGD 709


>gi|389632743|ref|XP_003714024.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|351646357|gb|EHA54217.1| beta-xylosidase [Magnaporthe oryzae 70-15]
          Length = 847

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 260/782 (33%), Positives = 389/782 (49%), Gaps = 98/782 (12%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+ + + + RA  L+ ++ L EK++ L + +   PR+G+PAYEWWSE+LHG+A + PGV 
Sbjct: 96  CDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAKS-PGVT 154

Query: 115 FN----GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
           FN       SS TSF   +V +A+F+  L   + + ++ EARA  N G AGL +W PNIN
Sbjct: 155 FNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDWWTPNIN 214

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
            ++DPRWGRG ETPGED + +S Y    ++  +G                          
Sbjct: 215 PYKDPRWGRGMETPGEDALRISKYVKALLRGLEG-------------------------- 248

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
           SD     M+ A CKH  A DLE+W   +RY+F+A +T QD  + + P F+ C        
Sbjct: 249 SDPTTRKMV-ANCKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGS 307

Query: 291 IMCSYNQV---------NGVPACLRGDLFQKA-RNEWGFK---GYITSDCDAVATIFEYQ 337
            MC+YN +         NG P C    L     R  WG+K    +ITSDC+AV  ++   
Sbjct: 308 FMCAYNAMSIKGKDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQH 367

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHT--QSAIDKGKVQEKDIDRALLNLFSVQLRL 395
           +++ T E++A     AG D  C       T  + A D+G + E  +DRAL  L+   +R 
Sbjct: 368 HWSDTREEAAGSAYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRA 427

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDK----KFLPLNKNAVSS 451
           G F+G      Y  +   DV T E +KLA  +A +G+VL KN+     K   L K    +
Sbjct: 428 GYFDGP--DAPYRNITWADVNTPEARKLAHRSAVEGMVLTKNNGVLPIKLEELQKKG-KT 484

Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE-- 509
           +A+IG  V+N  QM G Y+GI     + L   +A   K   A G    P N   G  +  
Sbjct: 485 VALIGNWVDNGEQMLGTYSGIAPFRNTPLAAAKALNLKMVTAGG----PVNQSTGSRDSW 540

Query: 510 ---AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
              A+  A +AD V+   G+DL+ E EDRDR SL  P  Q  L++ ++   K  V++ L 
Sbjct: 541 TRPALNAAIQADVVLYFGGIDLSVEAEDRDRYSLAWPSAQAKLLSDISALGKPTVVVQL- 599

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KV 625
            G  LD +    +  IS+I+W GYPG+ G  A  +II G   P GRLP+T YP  +  +V
Sbjct: 600 -GTMLDDTALLDNKNISAIIWAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQV 658

Query: 626 PMNDMNMRADSSRQ-----YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
           PM DM +R     +      PGR+YR+Y    V+ FG GL +TN++     + S    ++
Sbjct: 659 PMTDMEVRPSKDTKGGAASNPGRTYRWYD-EAVHPFGFGLHFTNFTTSVAVSSSSAISTS 717

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG-DVDGSHVVMLFARVP 739
            L++G                HID+ +  +SL    ++SVTN G     S+  + F R  
Sbjct: 718 DLESGCKSE-----------KHIDKCSFPSSL----EVSVTNDGKSTTSSYAALAFVRGE 762

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSK-----EISFGVDPCEQLSIANKHGRRILPLGNHVL 794
              +  P K L+ + ++H +A G       E++ G      L+   ++G  +L  G + +
Sbjct: 763 YGPKPYPLKTLVAYGKLHDIAPGQTKKVKLELTLG-----DLARTAENGDLVLYPGKYEV 817

Query: 795 MV 796
           +V
Sbjct: 818 LV 819


>gi|440472411|gb|ELQ41274.1| beta-xylosidase [Magnaporthe oryzae Y34]
 gi|440484691|gb|ELQ64724.1| beta-xylosidase [Magnaporthe oryzae P131]
          Length = 792

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 262/782 (33%), Positives = 392/782 (50%), Gaps = 98/782 (12%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+ + + + RA  L+ ++ L EK++ L + +   PR+G+PAYEWWSE+LHG+A + PGV 
Sbjct: 41  CDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAKS-PGVT 99

Query: 115 FN----GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
           FN       SS TSF   +V +A+F+  L   + + ++ EARA  N G AGL +W PNIN
Sbjct: 100 FNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDWWTPNIN 159

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
            ++DPRWGRG ETPGED + +S Y                          K +L+G  E 
Sbjct: 160 PYKDPRWGRGMETPGEDALRISKYV-------------------------KALLRGL-EG 193

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
           SD     M+ A CKH  A DLE+W   +RY+F+A +T QD  + + P F+ C        
Sbjct: 194 SDPTTRKMV-ANCKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGS 252

Query: 291 IMCSYNQV---------NGVPACLRGDLFQKA-RNEWGFK---GYITSDCDAVATIFEYQ 337
            MC+YN +         NG P C    L     R  WG+K    +ITSDC+AV  ++   
Sbjct: 253 FMCAYNAMSIKGKDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQH 312

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHT--QSAIDKGKVQEKDIDRALLNLFSVQLRL 395
           +++ T E++A     AG D  C       T  + A D+G + E  +DRAL  L+   +R 
Sbjct: 313 HWSDTREEAAGSAYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRA 372

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV----SS 451
           G F+G      Y  +   DV T E +KLA  +A +G+VL KN+   LP+    +     +
Sbjct: 373 GYFDGP--DAPYRNITWADVNTPEARKLAHRSAVEGMVLTKNN-GVLPIKLEELQKKGKT 429

Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE-- 509
           +A+IG  V+N  QM G Y+GI     + L   +A   K   A G    P N   G  +  
Sbjct: 430 VALIGNWVDNGEQMLGTYSGIAPFRNTPLAAAKALNLKMVTAGG----PVNQSTGSRDSW 485

Query: 510 ---AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
              A+  A +AD V+   G+DL+ E EDRDR SL  P  Q  L++ ++   K  V++ L 
Sbjct: 486 TRPALNAAIQADVVLYFGGIDLSVEAEDRDRYSLAWPSAQAKLLSDISALGKPTVVVQL- 544

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KV 625
            G  LD +    +  IS+I+W GYPG+ G  A  +II G   P GRLP+T YP  +  +V
Sbjct: 545 -GTMLDDTALLDNKNISAIIWAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQV 603

Query: 626 PMNDMNMRADSSRQ-----YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
           PM DM +R     +      PGR+YR+Y    V+ FG GL +TN++     + S    ++
Sbjct: 604 PMTDMEVRPSKDTKGGAASNPGRTYRWYD-EAVHPFGFGLHFTNFTTSVAVSSSSAISTS 662

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG-DVDGSHVVMLFARVP 739
            L++G                HID+ +  +SL    ++SVTN G     S+  + F R  
Sbjct: 663 DLESGCKSE-----------KHIDKCSFPSSL----EVSVTNDGKSTTSSYAALAFVRGE 707

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSK-----EISFGVDPCEQLSIANKHGRRILPLGNHVL 794
              +  P K L+ + ++H +A G       E++ G      L+   ++G  +L  G + +
Sbjct: 708 YGPKPYPLKTLVAYGKLHDIAPGQTKKVKLELTLG-----DLARTAENGDLVLYPGKYEV 762

Query: 795 MV 796
           +V
Sbjct: 763 LV 764


>gi|238578959|ref|XP_002388893.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
 gi|215450599|gb|EEB89823.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
          Length = 658

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 249/725 (34%), Positives = 373/725 (51%), Gaps = 95/725 (13%)

Query: 96  YEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY 155
           Y WWSE+L          NF    SS TSFP  +   A+F+  L   I + ++ EARA  
Sbjct: 1   YNWWSEAL----------NF----SSATSFPAPITMGATFDDGLIHAIATVISTEARAFN 46

Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
           N+ + GL F+ PNIN F+DPRWGRGQETPGEDP  +S Y  + V   Q        GG+G
Sbjct: 47  NVNRGGLDFFTPNINPFKDPRWGRGQETPGEDPFHISQYVYQLVTGLQ--------GGVG 98

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                                L ++A CKH  AYDLE  G  SR+ F+A +T QD  + +
Sbjct: 99  ------------------PTNLKIAADCKHWAAYDLENLG-VSRFEFDAKVTMQDLAEFY 139

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGF--KGYITSDCDAVAT 332
            P F+SCI   K + IMCSYN VNG+P+C    L Q  AR+ WG   + +IT DC AV  
Sbjct: 140 SPSFQSCIRDAKVASIMCSYNAVNGIPSCANRYLLQTLARDFWGLGEEQWITGDCGAVGN 199

Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS---AIDKGKVQEKDIDRALLNLF 389
           IF   +YT    +  A  L AG DI+C +    ++Q+   A+++  V E  +  A+   +
Sbjct: 200 IFARHHYTDDPANGTAVALNAGTDIDCDSGAAAYSQNLGQALNRSLVSEDQLRTAVTRQY 259

Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
           +  +RL                 DDV T   ++LA  AA +GIVLLKND   LPL  ++V
Sbjct: 260 NSLVRLSW---------------DDVNTEPAQQLAYQAAVEGIVLLKND-GILPL-ASSV 302

Query: 450 SSLAIIGPLVNNISQMGGGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAG 506
             +A++GP+ N  +QM   Y GI     SP+   R     V+   +A+G   +  +  +G
Sbjct: 303 KKVAVVGPMANATTQMQSNYNGIAPFLVSPQQAFRNAGFNVT---FANGT-GLNSSDTSG 358

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           F  A+  A  AD V  V G+D T E EDRDR  +   G Q++LV  +A   K P+I++  
Sbjct: 359 FSAAIAAADDADVVFYVGGIDTTIEREDRDRPEISWTGNQLALVQQLASLGK-PLIVLQM 417

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-V 625
           GGG +D S    ++ +++++W GYPG++G  AL ++I G   P GRLP+T YP S+    
Sbjct: 418 GGGQVDSSSLRDNTSVNALIWGGYPGQSGGTALVDLITGKQAPAGRLPITQYPASYVDGF 477

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
           PM DM +R  SS   PGR+Y++YTG  ++ FG GL YT +  ++ S     ++   + + 
Sbjct: 478 PMTDMTLRPSSSN--PGRTYKWYTGAPIFEFGFGLHYTTFDAEWASGGDSFSVQDLVSSA 535

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
            +  +           H+D     T       ++VTN+G V   +V +LF+R        
Sbjct: 536 KNSGV----------AHVDLGVLDT-----FNVTVTNSGTVASDYVALLFSRTTAGPSPA 580

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV-----GELR 800
           P K+L+ + RV  +  G+   +        ++  ++ G R+L  G +VL++     G+++
Sbjct: 581 PNKELVSYTRVKGIEPGASSAASLKVTLGAVARTDEQGNRVLYPGEYVLLLDTGAEGKIQ 640

Query: 801 HSLTI 805
             +T+
Sbjct: 641 KKITL 645


>gi|374372635|ref|ZP_09630297.1| Beta-glucosidase [Niabella soli DSM 19437]
 gi|373235166|gb|EHP54957.1| Beta-glucosidase [Niabella soli DSM 19437]
          Length = 734

 Score =  387 bits (993), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 252/780 (32%), Positives = 376/780 (48%), Gaps = 129/780 (16%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           PF N  LS   R   L+S LTL+EK++Q+ ++A AIPRLGIPAY+WWSE LHG+A     
Sbjct: 27  PFWNYKLSFEARVNDLVSRLTLEEKVKQMLNHAPAIPRLGIPAYDWWSEVLHGVAR---- 82

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---------AGLT 163
                T    T +PQ +  AA+++      +    A E RA++N             GLT
Sbjct: 83  -----TPYHTTVYPQAIAMAATWDTVALYTMADQSAREGRAIHNKATEEGKNGDRYVGLT 137

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           +W PNINIFRDPRWGRGQET GEDP + +     FV+  QGE+ K               
Sbjct: 138 YWTPNINIFRDPRWGRGQETYGEDPFLTAMLGRAFVRGLQGEDPKY-------------- 183

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                        L  +AC KH   Y +       R+SF+  +++ D  +T+ P F+  +
Sbjct: 184 -------------LKAAACAKH---YAIHSGPEAVRHSFDVDVSDYDLWNTYLPAFKELV 227

Query: 284 EQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNY 339
              K + +MC+YN     P C    L  D+    R +WGF GY+TSDC A+   F   NY
Sbjct: 228 THAKVAGVMCAYNAFRKKPCCGSDLLMTDIL---RRQWGFTGYVTSDCGAIDDFF---NY 281

Query: 340 TKTHEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
            KTH ++ A  + A   G D+ CG         A+  G++ EK+IDR++  LF +++RLG
Sbjct: 282 HKTHPNAEAAAIDAVTNGTDVECGNRAYLTLTDAVKTGRIAEKEIDRSVKRLFMIRMRLG 341

Query: 397 LFNGDP-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           +F  DP     Y +  P  + ++ HK  AL  A++ IVLLKN+   LPL+K ++  +A++
Sbjct: 342 MF--DPVSMVSYAQTSPAVLESAPHKAQALKMAQESIVLLKNENHLLPLSK-SIKKIAVV 398

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN--------SDAGF 507
           GP  +N   + G Y G P    + L G++A +      S  ++   N            F
Sbjct: 399 GPNADNSIAVLGNYNGTPSKIVTALDGIKAKLGTN--GSVVYEKAVNFTNAMLPEGKTDF 456

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
                  K AD +I V G+    E E+           DR ++LLP  Q   + ++  T 
Sbjct: 457 AALTSRVKDADAIIFVGGISPQLEGEEMKVNEPGFNSGDRTTILLPTVQTEAMKALKATG 516

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           K PV+ V+  G  L + + + +  I +I+   Y G+A   A+A+++FGD+NP GRLP+T+
Sbjct: 517 K-PVVFVMMTGSALAIPWEQEN--IPAIVNAWYGGQAAGTAIADVLFGDYNPSGRLPVTF 573

Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
           Y         +D ++ A    +   R+YR+++G  +Y FG+GLSYT + Y+ L  P+   
Sbjct: 574 Y--------KSDADLPAFDDYRMENRTYRYFSGQALYPFGYGLSYTTFRYEGLKVPT--- 622

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
                                        T    +R  V I +TN G   G  VV L+  
Sbjct: 623 -----------------------------TVKNKVRIPVSIQLTNTGAKGGEEVVQLYIS 653

Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
                   P K L GF RV      +K I F + P + L+IA ++G+ + P G   + VG
Sbjct: 654 YQGQPIKKPLKALKGFQRVWLNRGQTKTIKFLLTP-DALAIAGENGKLLNPKGKLRISVG 712


>gi|367053033|ref|XP_003656895.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
 gi|347004160|gb|AEO70559.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
          Length = 758

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 266/772 (34%), Positives = 384/772 (49%), Gaps = 92/772 (11%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSD------NASAIPRLGIPAYEWWSESLHGIAS 108
           C+T+ S   RA +L+  + + EK+  L +      ++   PRLG+P YEWWSE+LHG+A+
Sbjct: 11  CDTTASPPKRAAALVEAMNITEKLANLVEYVMARSSSKGAPRLGLPPYEWWSEALHGVAA 70

Query: 109 NGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFW 165
           + PGV+FN   G  S  TSF   +  +A+F+  L   +   ++ EARA  N G AGL FW
Sbjct: 71  S-PGVSFNWSGGPFSYATSFANPITLSAAFDDELVQKVADVISTEARAFANAGSAGLDFW 129

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
            PNIN +RDPRWGRG ETPGEDP+ +  Y    ++  +GE               K+V+ 
Sbjct: 130 TPNINPWRDPRWGRGSETPGEDPVRIKGYVRSLLRGLEGEE------------SIKKVI- 176

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                          A CKH  AYDLE+W N +RY F+A+++ QD  + + PPF+ C   
Sbjct: 177 ---------------ATCKHYAAYDLERWHNITRYEFDAIVSLQDLSEYYLPPFQQCARD 221

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIF-EYQNYT 340
            K   IMCSYN +NG PAC    L     R  W +     YITSDC+A+     +  N+T
Sbjct: 222 SKVGSIMCSYNSLNGTPACANTYLMDDILRKHWRWTEDNNYITSDCNAIKDFLPDEHNFT 281

Query: 341 KTHEDSAAGVLKAGMDINC---GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           +T  ++AA    AG D  C   G+        A D+  + E+ IDRAL  L+   +R G 
Sbjct: 282 QTAAEAAAAAYTAGTDTVCEVAGSPPYTDVVGAYDQKLLSEEVIDRALRRLYEGLVRAGY 341

Query: 398 FNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
           F  DP     Y  +G  DV T+E + LAL +A  G+VLLKND   LP+ K    ++A+IG
Sbjct: 342 F--DPASASPYRDIGWSDVNTAEAQALALQSASDGLVLLKNDGT-LPI-KLEGKTVALIG 397

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASG-CHDVPCNSDAGFHEAVRIAK 515
              +    M GGY+GIP    S +           YASG         D    +A+  A 
Sbjct: 398 HWASGTRSMLGGYSGIPPYYHSPVYAAGQLNLTYKYASGPVAPASAARDTWTADALSAAN 457

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           K+D ++   GLD +  +ED+DR S+  P  Q++L+ ++A   K  V++ L  G  +D + 
Sbjct: 458 KSDVILYFGGLDQSVASEDKDRDSIAWPPAQLTLIQTLAGLGKPLVVIQL--GDQVDDTP 515

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRA 634
              +  +S+ILW GYPG++G  A+   I G   P GRLP+T YP S+T ++P+ DM++R 
Sbjct: 516 LLTNPNVSAILWAGYPGQSGGTAVLNAITGVSPPAGRLPVTQYPSSYTSQLPLTDMSLRP 575

Query: 635 DSSRQYPGRSYRFYT-GTQVYGFGHGLSYTNYSYK------FLSAPSELTISASLKAGSD 687
           D +   PGR+YR+      V  FG+GL YTN++ +      F   PS L     L     
Sbjct: 576 DPASGRPGRTYRWLPRNATVLPFGYGLHYTNFTARPNPAQNFTLTPSALLAPCKL----- 630

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-P 746
                         H D       L + V + VTN G     +V ++FA          P
Sbjct: 631 -------------AHRD----LCPLPYPVTVEVTNTGARTSDYVGLVFATTRDAGPPPHP 673

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
            K L+ + R+  +A G    +       Q+ +A     R+   GN VL  G 
Sbjct: 674 LKTLVAYARLRGIAPGRTARA-------QVQVALGDLARVDAAGNRVLYPGR 718


>gi|371776901|ref|ZP_09483223.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 720

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 259/767 (33%), Positives = 382/767 (49%), Gaps = 111/767 (14%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           F + +L I TRAK+L+S LTL+EKI  L  N   + RL IPAY WW+E+LHG+A  G   
Sbjct: 29  FRDEALDIETRAKALLSELTLKEKISLLGYNNPPVERLQIPAYNWWNEALHGVARAG--- 85

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AA+F+ +L   I  A++ EAR+ YN+ ++        G+TFW
Sbjct: 86  -------EATVFPQAIALAATFDTTLVYRIADAISTEARSKYNINRSKGFQNQYLGITFW 138

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
            PNINIFRDPRWGRGQET GEDP + ++    FVK  Q                      
Sbjct: 139 TPNINIFRDPRWGRGQETYGEDPFLTASMGKAFVKGLQ---------------------- 176

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
             G E +R   L  +A  KH   +   +     R+ FNA++ E+D  +T+ P F++ +E 
Sbjct: 177 --GSEPER--RLKTAAGAKHFAVHSGPE---ADRHHFNAVVDEKDLRETYLPAFKALVEN 229

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           G  + IMC+YN+VNG P C    L Q   R+EWGFKG + +DC A+  I+       T  
Sbjct: 230 G-VTTIMCAYNRVNGEPCCTGKTLLQDILRDEWGFKGQVVTDCWALDDIWLRHKTIPTRV 288

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           + AA  +KAG++++C   +    Q AI+K  +  + +D ALL     QL+LG ++ DP  
Sbjct: 289 EVAAAAVKAGVNLDCANILQEDVQDAIEKRLLTLEQVDSALLPTLQTQLKLGFYD-DPSH 347

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             Y   G D V  S H  LA +AA + +VLLKND   LPL K+ +SS+ ++G    +IS 
Sbjct: 348 SPYRHYGIDSVNNSYHISLAKEAAEKSMVLLKND-GILPLKKDTISSIMVVGENAASISA 406

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
           + G Y G+  +  + + GL           Y  GC      +D      +  A   D  I
Sbjct: 407 LTGNYHGLSGNMVTFVEGLVKAGGPGMSVQYDYGC----SFADTSHFGGIWAAGFTDVTI 462

Query: 522 VVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
            V GL    E E           D+  L +P      +  +  +   PVI V+TGG  LD
Sbjct: 463 AVIGLSPLLEGEHGDAFLSNWGGDKKDLRMPRSHEIYLKKLRESHNHPVIAVVTGGSALD 522

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
           +S  E  +   +I++  YPGE G  ALA++IFG+ +P GRLP+T+Y +     P +D NM
Sbjct: 523 ISAIEPYA--DAIIYAWYPGEQGGTALADLIFGEVSPSGRLPITFYKDIKDLPPYHDYNM 580

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
                     R+YR++ G  +Y FG+GLSYT++ Y++LS PS                  
Sbjct: 581 T--------NRTYRYFQGDVLYPFGYGLSYTSFHYEWLSKPST----------------- 615

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
                   V  D++ S       V I+VTN G +D   V+ ++   P + +  P ++L G
Sbjct: 616 -------KVSEDDIIS-------VNIAVTNTGTMDADEVIQVYIVYPDIER-MPLRELKG 660

Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVGE 798
           F R+H  A  ++     + P + L   + K+ R  L  G + + V +
Sbjct: 661 FSRIHIKAGQTQNTDIQI-PVKNLKKWDSKNNRWKLYKGKYKIQVSQ 706


>gi|182415033|ref|YP_001820099.1| Beta-glucosidase [Opitutus terrae PB90-1]
 gi|177842247|gb|ACB76499.1| Beta-glucosidase [Opitutus terrae PB90-1]
          Length = 905

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 260/792 (32%), Positives = 391/792 (49%), Gaps = 135/792 (17%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           ++S  +  RA  LI  ++L EK+ QL + A  IPRLG+PAY++W+E+ HGIA+NG     
Sbjct: 207 DSSKPLRVRADDLIRRMSLAEKVSQLKNAAPGIPRLGLPAYDYWNEAAHGIANNG----- 261

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----------LGQAGLTFW 165
                  T FPQ + +AA++N +L    G+ + +E RA +N              GLT+W
Sbjct: 262 -----IATVFPQAIGAAAAWNPALLHQEGTVIGIEGRAKFNDYANRHNGDSKWWTGLTYW 316

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           APNIN+FRDPRWGRGQET GEDP + +   +EFVK  QG++ +                 
Sbjct: 317 APNINLFRDPRWGRGQETYGEDPFLTAEIGIEFVKGVQGDDPRY---------------- 360

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                      ++  AC KH   Y +      +R+SFNA I E+D  DT+ P F   + +
Sbjct: 361 -----------MLAMACAKH---YAVHSGPERTRHSFNAEIPERDLFDTYLPHFERVVRE 406

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF--EYQNYTKT 342
           GK + +M +YN VNGVPA     L  +  R  WGF+GY+ SDCDA+  I+  +  +Y KT
Sbjct: 407 GKVAGVMSAYNAVNGVPASANSFLLTELLRKRWGFEGYVPSDCDAIRDIYGEKQHHYVKT 466

Query: 343 HEDSAAGVLKAGMDINCG---TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
            E++AA  +KAG ++ CG     ++R    A+ +G V EKD+D AL +    + RLGLF 
Sbjct: 467 AEEAAALAVKAGCNLCCGGDYNALVR----AVQQGLVTEKDLDGALYHTLWTRFRLGLF- 521

Query: 400 GDP-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
            DP  +  +      D     H ++AL+ ARQ IVLLKND   LPL++  +  +A+IGP 
Sbjct: 522 -DPAEQVPFSGYTLKDNDLPAHSQVALELARQAIVLLKNDGT-LPLDRTKLKQIAVIGPN 579

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAG------------ 506
             + S + G Y G      S+L  +   V      +     P  +  G            
Sbjct: 580 AASKSMLEGNYHGSASRSISILDDIRNLVGSEIKITHAMGSPVTTKPGTAPWSGQDNTTD 639

Query: 507 ------FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP 560
                   EA+++A +AD +I V G+   QE E  DR S+ LP +Q  L+ ++  T K P
Sbjct: 640 RPVAELKAEALKLAAEADAIIYVGGITPAQEGESFDRESIELPSEQEDLIRALHATGK-P 698

Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
           V++V   G  + +++   D  + +I+   YPG+ G +A+AE++FG+ NP G LP+T+Y  
Sbjct: 699 VVMVNCSGSAMALTWQ--DENLPAIVQAWYPGQEGGRAVAEVLFGETNPSGHLPITFYRS 756

Query: 621 SFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
           +      +D +M+         R+YR++TG  +Y FGHGLSY+ + Y  L          
Sbjct: 757 TADLPDFSDYSMK--------NRTYRYFTGRPLYAFGHGLSYSTFEYANL---------- 798

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
                                    V    +    V + +TN+G  DG  VV L+A  P 
Sbjct: 799 ------------------------RVAPAANGALTVTLDLTNSGKRDGDDVVQLYATPPA 834

Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC--EQLSIANKHGRRILPLGNHVLMVG- 797
            SQ    + L GF R H  A  ++ ++  V      +  IA K     +P G+  +  G 
Sbjct: 835 SSQPQELRALCGFRRTHVKAGETRTVTVTVPAVALRRWDIAKKD--YAIPSGDWTIAAGA 892

Query: 798 ---ELRHSLTIE 806
              +LR   TI+
Sbjct: 893 SSADLRQKATIK 904


>gi|238923424|ref|YP_002936940.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
 gi|238875099|gb|ACR74806.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
          Length = 714

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 244/735 (33%), Positives = 365/735 (49%), Gaps = 112/735 (15%)

Query: 65  AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
           AK L+S +T+ EKI Q+   + AI RLGIP Y WW+E+LHG+A  G            T 
Sbjct: 9   AKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAG----------VATV 58

Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPR 176
           FPQ +  AA+F+  L   IG  V+ E R  +N            GLTFWAPN+NIFRDPR
Sbjct: 59  FPQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPR 118

Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
           WGRG ET GEDP +       +++  QG                           D  D 
Sbjct: 119 WGRGHETYGEDPYLTGKLGCAYIRGLQG---------------------------DDPDH 151

Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
           L  +AC KH   + +       R+ F+A  ++ D  DT+   F+ C++  K   +M +YN
Sbjct: 152 LKSAACAKH---FAVHSGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYN 208

Query: 297 QVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
           +VNG PAC    L +   R+E+GF+G++ SDC A+    E+ + T T E+SAA  +  G 
Sbjct: 209 RVNGEPACGSRTLLKDILRDEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGC 268

Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
           D+NCG+  L H + A DKG V ++ I  A+  L  V++RLG+    P    Y  +  + V
Sbjct: 269 DLNCGSAFL-HLKDAYDKGMVSDEAITAAVERLMEVRIRLGMMKDYP--SPYEDISYEVV 325

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
              EH +L+++AAR+ +VLLKN   FLPL++  V ++A+IGP  N+   + G Y G    
Sbjct: 326 ECKEHVELSVEAARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSR 385

Query: 476 PKSLLRGLEAYV---SKTHYASGCHDVP------CNSDAGFHEAVRIAKKADFVIVVAGL 526
             + L GL+ Y+   ++  YA GCH               F EA+ +A+++D V++  GL
Sbjct: 386 YITPLEGLQQYLGEDTRVLYAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGL 445

Query: 527 DLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D T E E+          D++ L+LPG Q  L+ +VA   K PVILVL+ G  +D+S+AE
Sbjct: 446 DATIEGEEGDAGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAE 504

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
               + +I+   YPG  G KA+AE IFG+++P G+LP+T+Y  +       D +M     
Sbjct: 505 --EHVDAIIDSWYPGARGGKAVAEAIFGEYSPNGKLPVTFYQGTENLPEFTDYSM----- 557

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
                R+YR+     +Y FG+GL Y   +Y  LS                          
Sbjct: 558 ---AHRTYRYTNENVLYPFGYGLHYGETNYDGLS-------------------------- 588

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
                +D+  S  +    V ++VTN      + +V L+ R    ++  P  QL G + V 
Sbjct: 589 -----VDKAESDVNEPVEVFVNVTNDSRYTVNEIVQLYIRHVDAAEYEPGYQLKGIEVVK 643

Query: 758 TVAKGSKEISFGVDP 772
                +K++   + P
Sbjct: 644 LEPHETKKVKLTLSP 658


>gi|291528382|emb|CBK93968.1| Beta-glucosidase-related glycosidases [Eubacterium rectale M104/1]
          Length = 714

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 244/735 (33%), Positives = 365/735 (49%), Gaps = 112/735 (15%)

Query: 65  AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
           AK L+S +T+ EKI Q+   + AI RLGIP Y WW+E+LHG+A  G            T 
Sbjct: 9   AKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAG----------VATV 58

Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPR 176
           FPQ +  AA+F+  L   IG  V+ E R  +N            GLTFWAPN+NIFRDPR
Sbjct: 59  FPQAIGLAAAFDADLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPR 118

Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
           WGRG ET GEDP +       +++  QG                           D  D 
Sbjct: 119 WGRGHETYGEDPYLTGKLGCAYIRGLQG---------------------------DDPDH 151

Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
           L  +AC KH   + +       R+ F+A  ++ D  DT+   F+ C++  K   +M +YN
Sbjct: 152 LKSAACAKH---FAVHSGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYN 208

Query: 297 QVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
           +VNG PAC    L +   R+E+GF+G++ SDC A+    E+ + T T E+SAA  +  G 
Sbjct: 209 RVNGEPACGSRTLLKDILRDEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGC 268

Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
           D+NCG+  L H + A DKG V ++ I  A+  L  V++RLG+    P    Y  +  + V
Sbjct: 269 DLNCGSAFL-HLKDAYDKGLVSDEAITAAVERLMEVRIRLGMMKDYP--SPYEDISYEVV 325

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
              EH +L+++AAR+ +VLLKN   FLPL++  V ++A+IGP  N+   + G Y G    
Sbjct: 326 ECKEHVELSVEAARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSR 385

Query: 476 PKSLLRGLEAYV---SKTHYASGCHDVP------CNSDAGFHEAVRIAKKADFVIVVAGL 526
             + L GL+ Y+   ++  YA GCH               F EA+ +A+++D V++  GL
Sbjct: 386 YITPLEGLQQYLGDDTRVLYAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGL 445

Query: 527 DLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D T E E+          D++ L+LPG Q  L+ +VA   K PVILVL+ G  +D+S+AE
Sbjct: 446 DATIEGEEGDAGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAE 504

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
               + +I+   YPG  G KA+AE IFG+++P G+LP+T+Y  +       D +M     
Sbjct: 505 --EHVDAIIDSWYPGARGGKAVAEAIFGEYSPSGKLPVTFYQGTENLPEFTDYSM----- 557

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
                R+YR+     +Y FG+GL Y   +Y  LS                          
Sbjct: 558 ---AHRTYRYTNENVLYPFGYGLHYGETNYDGLS-------------------------- 588

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
                +D+  S  +    V ++VTN      + +V L+ R    ++  P  QL G + V 
Sbjct: 589 -----VDKAESDVNEPVEVFVNVTNDSRYTVNEIVQLYIRHVDAAEYEPGYQLKGIEVVK 643

Query: 758 TVAKGSKEISFGVDP 772
                +K++   + P
Sbjct: 644 LEPHETKKVKLTLSP 658


>gi|266619450|ref|ZP_06112385.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
 gi|288869013|gb|EFD01312.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
          Length = 714

 Score =  384 bits (985), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 249/771 (32%), Positives = 377/771 (48%), Gaps = 116/771 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + S +   R + L+S +TL+EK+ QL  +A A+ RLGIP+Y WW+E+LHG+A  G   
Sbjct: 5   YLDESRTDEERVRDLVSQMTLEEKVSQLRYDAPAVERLGIPSYNWWNEALHGVARAG--- 61

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFW 165
                  + T FPQ +  AA F+ +L   IG   A+E RA Y+            G+TFW
Sbjct: 62  -------AATVFPQAIGLAAMFDEALLEKIGDVTALEGRAKYHEAVRNGDRGLYKGITFW 114

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRG ET GEDP +       ++K  QG                     
Sbjct: 115 SPNINIFRDPRWGRGHETYGEDPCLTGRMGTAYIKGMQG--------------------- 153

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                   G  L  +AC KH  A+   + G   R+SFN++++++D  +T+ P F  C+++
Sbjct: 154 -------NGKRLKAAACVKHFAAHSGPEKG---RHSFNSVVSKKDLTETYFPAFERCVKE 203

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
                +M  YN++NG  AC    L  +  R +WGF GY  SDC A+     +   T T +
Sbjct: 204 AGVEGVMGGYNRLNGEAACGSHHLITEILREKWGFDGYYVSDCGAIKDFHMHHGLTDTPQ 263

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           +SAA  LK+G D+NCG   L H  SA ++G V  +DIDRA+ +L   ++RLG+F+   + 
Sbjct: 264 ESAALALKSGCDLNCGAVYL-HVMSAYNQGLVSAEDIDRAVTHLMMTRMRLGMFD---QH 319

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
            ++ ++  +    +EH  LAL AA + +VLLKND   LPL+K A+ ++A+IGP  ++   
Sbjct: 320 TEFDEIPYEINDCAEHHGLALKAAEESMVLLKND-GILPLDKTALKTVAVIGPNGDSEEI 378

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHY---ASGCHDVPCN------SDAGFHEAVRIAK 515
           + G Y G      ++L G+ A + K      + G H    N      +D    EAV +A 
Sbjct: 379 LKGNYNGTATEKYTILEGIRAVLGKETRIFCSEGSHLYRDNVENLAEADDRLKEAVSMAV 438

Query: 516 KADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           ++D V +  GL+ T E E+          D+  L LP  QM L+ +V  T   PVIL+L 
Sbjct: 439 RSDVVFLCLGLNGTLEGEEGDANNSYAGADKADLNLPESQMRLLKAVCGTGT-PVILLLA 497

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  + +++A      S+IL I YPG+ G  A A ++ G+  P GRLP+T+Y  +     
Sbjct: 498 AGSAMAINYAA--EHCSAILHIWYPGQMGGLAAARLLTGEAVPSGRLPVTFYQTTEELPE 555

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
             D +M+        GR+YR+     +Y FG+GLSY ++ Y    A          +AG 
Sbjct: 556 FTDYSMK--------GRTYRYMEREALYPFGYGLSYGDFEYSNFKAE-------QTEAGP 600

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
           D  +                      RF V+I  TN    +   +  ++ R+       P
Sbjct: 601 DGQV----------------------RFSVKI--TNRSKAECDEIAEVYVRIADSELAAP 636

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
              L  F R+H  A  S  + F + P +   + N+ G  IL     V+  G
Sbjct: 637 GGSLADFRRIHMKAGESVTVPFTL-PVKAFMVVNEEGEYILDGSTAVVTCG 686


>gi|291525508|emb|CBK91095.1| Beta-glucosidase-related glycosidases [Eubacterium rectale DSM
           17629]
          Length = 714

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 243/735 (33%), Positives = 365/735 (49%), Gaps = 112/735 (15%)

Query: 65  AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
           AK L+S +T+ EKI Q+   + AI RLGIP Y WW+E+LHG+A  G            T 
Sbjct: 9   AKKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAG----------VATV 58

Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPR 176
           FPQ +  AA+F+  L   IG  V+ E R  +N            GLTFWAPN+NIFRDPR
Sbjct: 59  FPQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPR 118

Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
           WGRG ET GEDP +       +++  QG                           D  D 
Sbjct: 119 WGRGHETYGEDPYLTGKLGCAYIRGLQG---------------------------DDPDH 151

Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
           L  +AC KH   + +       R+ F+A  ++ D  DT+   F+ C++  K   +M +YN
Sbjct: 152 LKSAACAKH---FAVHSGPEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYN 208

Query: 297 QVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
           +VNG PAC    L +   R+E+GF+G++ SDC A+    E+ + T T E+SAA  +  G 
Sbjct: 209 RVNGEPACGSRTLLKDILRDEFGFEGHVVSDCWAILDFHEHHHVTDTVEESAAMAVNNGC 268

Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
           D+NCG+  L H + A DKG V ++ I  A+  L  V++RLG+    P    Y  +  + V
Sbjct: 269 DLNCGSAFL-HLKDAYDKGLVSDEAITAAVERLMEVRIRLGMMKDYP--SPYEDISYEVV 325

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
              EH +L+++AAR+ +VLLKN   FLPL++  V ++A+IGP  N+   + G Y G    
Sbjct: 326 ECKEHVELSVEAARRSLVLLKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSR 385

Query: 476 PKSLLRGLEAYV---SKTHYASGCHDVP------CNSDAGFHEAVRIAKKADFVIVVAGL 526
             + L GL+ Y+   ++  YA GCH               F EA+ +A+++D V++  GL
Sbjct: 386 YITPLEGLQQYLGEDTRVLYAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGL 445

Query: 527 DLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D T E E+          D++ L+LPG Q  L+ +VA   K PVILVL+ G  +D+S+AE
Sbjct: 446 DATIEGEEGDAGNEYASGDKLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAE 504

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
               + +I+   YPG  G KA+AE IFG+++P G+LP+T+Y  +       D +M     
Sbjct: 505 --EHVDAIIDSWYPGARGGKAVAEAIFGEYSPSGKLPVTFYQGTENLPEFTDYSM----- 557

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
                R+YR+     +Y FG+GL Y   +Y  +S                          
Sbjct: 558 ---AHRTYRYTNENVLYPFGYGLHYGETNYDGMS-------------------------- 588

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
                +D+  S  +    V ++VTN      + +V L+ R    ++  P  QL G + V 
Sbjct: 589 -----VDKAESDVNEPVEVFVNVTNDSRYTVNEIVQLYIRHVDAAEYEPGYQLKGIEVVK 643

Query: 758 TVAKGSKEISFGVDP 772
                +K++   + P
Sbjct: 644 LEPYETKKVKLTLSP 658


>gi|336435507|ref|ZP_08615222.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
           1_4_56FAA]
 gi|336000960|gb|EGN31106.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
           1_4_56FAA]
          Length = 717

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 257/764 (33%), Positives = 375/764 (49%), Gaps = 114/764 (14%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +A+ L+  +TL EK  QL  +A AIPRL IPAY WW+ESLHG+A  G          + T
Sbjct: 13  QAEELVDQMTLMEKASQLRYDAPAIPRLHIPAYNWWNESLHGVARGG----------TAT 62

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFWAPNINIFRDP 175
            FPQ +  AASF+R +   IG A+A+E RA YN            GLTFWAPN+NIFRDP
Sbjct: 63  VFPQAIGLAASFDREMLEEIGEAIALEGRAKYNAAVKLDDRDIYKGLTFWAPNVNIFRDP 122

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP + S   V +++  QG+                            G+
Sbjct: 123 RWGRGHETYGEDPYLSSRLGVSYIRGLQGD----------------------------GE 154

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            +  +AC KH   + +       R+ F+A ++E+D  +T+ P F++C+++G    +M +Y
Sbjct: 155 TMKAAACAKH---FAVHSGPEALRHEFDAEVSEKDLRETYLPAFQACVQEGHVEAVMGAY 211

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N VNG P C    L +K  R EWGF G++ SDC A+    E    T T   SAA  ++AG
Sbjct: 212 NCVNGEPCCGSETLLKKILREEWGFDGHVVSDCWAIKDFHENHLVTGTPVQSAALAMEAG 271

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            D+NCG   L H   A  +G V E  I  A + LF+ +  LG+F+G     +Y  +    
Sbjct: 272 CDLNCGVTYL-HLVHACQEGLVTEAQITEAAIRLFTTRFLLGMFDG----SEYDSVPYTV 326

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           V   EH+ L+  AAR+ IVLLKN+   LPL++  + ++ IIGP  ++   + G Y G   
Sbjct: 327 VECKEHRDLSERAARESIVLLKNN-GILPLDREKLKTIGIIGPNADSRKALIGNYHGTSS 385

Query: 475 SPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIVVAG 525
              ++L G+   V    +  Y+ GCH      +          EA  +A+++D VI+  G
Sbjct: 386 EYITVLEGVRRLVGDEVRILYSDGCHLYENKTENLAREQDRLSEARIVARESDVVILCLG 445

Query: 526 LDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
           LD T E E+          D+V L LP  Q  L+ +VA   K+P +L L  G  +D+SFA
Sbjct: 446 LDETLEGEEGDTGNSYASGDKVDLRLPKSQRMLMEAVA-MEKKPTVLCLMAGSDIDLSFA 504

Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRAD 635
           E        LW  YPG  G  A A+I+FG  +P G+LP+T+Y ES   +P   D +MR  
Sbjct: 505 EKHFDAIVDLW--YPGAYGGAAAADILFGKCSPSGKLPITFY-ESLEVLPSFEDYSMR-- 559

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
                 GR+YR+      Y FG+GL+YT                        +N+  +  
Sbjct: 560 ------GRTYRYLEQKAQYPFGYGLTYTKMKI--------------------RNVWLENA 593

Query: 696 SRLDYVHIDEVTSCTSLRFHVQI--SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            +     + EVT   +    V +   V N G +D   V+ ++ R  +    TP   L GF
Sbjct: 594 EK----DMKEVTDGENAEAAVIVCAEVENCGGMDSQEVLQIYIRDTESEHETPHPHLAGF 649

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           +R+  V KG K++          ++ ++ GRR    G + +  G
Sbjct: 650 ERIF-VEKGVKKLVKIPVNRSAFTVVDESGRRFTDSGKYEIFAG 692


>gi|372209074|ref|ZP_09496876.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
           S85]
          Length = 727

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 255/770 (33%), Positives = 377/770 (48%), Gaps = 113/770 (14%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           F +T  SI  RA+ L+S +TL+EKI QL + A AI RL +P Y+WW+E+LHG+A NG   
Sbjct: 21  FLDTDKSIEERAEILVSQMTLKEKIAQLKNTAPAISRLKVPDYDWWNEALHGVARNG--- 77

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFW 165
                    T FPQ +   A+F+  L   + SA++ EARA Y + Q        AGLTFW
Sbjct: 78  -------KATIFPQGIGIGATFDPDLALRVASAISTEARAKYTISQQMGNHSRYAGLTFW 130

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
            PN+NIFRDPRWGRGQET GEDP +++   V FVK  QG                     
Sbjct: 131 TPNVNIFRDPRWGRGQETFGEDPYLMTQMGVAFVKGLQG--------------------- 169

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                 D  + L  +AC KH   Y +       R  FNA+ T+QD  +T+ P F + ++ 
Sbjct: 170 ------DDPNYLKSAACAKH---YAVHSGPESLRLEFNAVPTQQDLYETYLPAFEALVKD 220

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
                +M ++N V G P      L     R+ WGF GY+ +DC A+  I     Y  +  
Sbjct: 221 ANVEGVMPAHNAVFGAPMAANKFLLTDVLRDRWGFDGYVVTDCGAIKQIKVGHKYVDSEV 280

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  LKAG ++NCG    +  + AID+G V E+ +      LF  + RLG+F+ D  K
Sbjct: 281 AAAAVALKAGTNLNCGATY-KELKKAIDQGLVTEELVHERTKQLFKTRFRLGMFDKDLSK 339

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             Y K+GP+ + + EH +LA +AA++ IV+LKN    LPL  + +    + GP  N+   
Sbjct: 340 NPYSKIGPELIHSKEHIELAREAAQKSIVMLKNKNNLLPLPTD-IKVPYVTGPFANSSDM 398

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
           + G Y G+     ++L G+   VS     +Y SG      N +   + A  +A  +D  I
Sbjct: 399 LMGSYYGVSPGVVTILAGITDAVSLGTSLNYRSGALPFQKNINPK-NWAPNVAGMSDVTI 457

Query: 522 VVAGLDLTQETE---------DRDRVSLLLPGQQMSLVTSVARTSK-RPVILVLTGGGPL 571
            V GL   +E E           DR+ L LP  Q++ V  +A   K +P++LV+  G P 
Sbjct: 458 CVVGLTADREGEGVDAIASNHKGDRLDLKLPENQINYVKQLAAKKKDKPLVLVIASGSP- 516

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDM 630
            VS    +    +IL I YPGE G  A+A+++FG  +P G LPMT +P+S  ++P   D 
Sbjct: 517 -VSLEGIEEHCDAILQIWYPGEQGGNAVADVLFGKVSPTGHLPMT-FPKSVAQLPDYKDY 574

Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP-SELTISASLKAGSDKN 689
           +M+        GR+Y++ T   ++ FG GL+Y+   +K L    ++L    SLK      
Sbjct: 575 SMK--------GRTYKYMTEEPMFPFGFGLTYSKTEFKNLVVEDAKLRKKESLK------ 620

Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF--ARVPKVSQGTPE 747
                                     V + VTN GD D   +V L+   +  K  +G P 
Sbjct: 621 --------------------------VSVEVTNVGDFDIDEIVQLYISPKSQKEGEGLPF 654

Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             L  F RV      ++++ F + P E L + N  G+++   G + + VG
Sbjct: 655 TTLKAFKRVALKKGETQKVEFTIHP-ESLKVINVKGQKVWRKGAYKVTVG 703


>gi|336425135|ref|ZP_08605165.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336013044|gb|EGN42933.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 705

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 255/763 (33%), Positives = 378/763 (49%), Gaps = 120/763 (15%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +A  L+S +TL+EK  QL  +A AIPRLG+P Y WW+E+LHG+A  G            T
Sbjct: 10  KAHELVSQMTLEEKASQLRYDAPAIPRLGVPTYNWWNEALHGVARAG----------VAT 59

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIFRDP 175
           SFPQ +  AA+F+  L   +G AVA E RA YN            GLTFW+PN+NIFRDP
Sbjct: 60  SFPQAIAMAAAFDDELLKTVGDAVAAEGRAKYNEYSRHDDRDIYKGLTFWSPNVNIFRDP 119

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP + S   V +V+  QG                          S   D
Sbjct: 120 RWGRGHETYGEDPYLTSRLGVAYVEGLQG--------------------------SQDDD 153

Query: 236 ELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
            +  +AC KH   +     G  S R+ F+A  +++D  +T+ P F +C+++     +M +
Sbjct: 154 FMKTAACAKHFAVHS----GPESVRHEFDAQASKKDMYETYLPAFEACVKEAGVEAVMGA 209

Query: 295 YNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN+ NG P C    L Q   R EW F+G+  SDC A+A    +   TKT E+SAA  LK+
Sbjct: 210 YNRTNGEPCCGSPTLIQNILREEWDFQGHYVSDCWAIADFHMHHMVTKTPEESAALALKS 269

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
           G D+NCG   L H   A  +G V E++I +A   LF+ +  LG F+    K +Y  +  +
Sbjct: 270 GCDVNCGVTYL-HLLKAYQQGLVTEEEITQAAERLFTTRFLLGCFD----KNEYDDIPYE 324

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            V   EH +LA   A++ +VLLKND   LPLNK+ + ++ +IGP  ++ + + G Y G  
Sbjct: 325 VVECKEHLELAQKMAKESMVLLKND-GILPLNKDGLKTIGVIGPNADSRTPLVGNYHGTS 383

Query: 474 CSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDA------GFHEAVRIAKKADFVIVVA 524
               +LL G++ +V    + +Y+ GCH      +          EA+ +A+ +D V++  
Sbjct: 384 SRYITLLEGIQDFVGEDVRVYYSEGCHIYKDRVEGLGWKQDRISEALTVAEHSDVVVLCL 443

Query: 525 GLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           GLD   E E+          D+  L LP  Q  L+ +VA   K PV+L +  G  +D+ F
Sbjct: 444 GLDENLEGEEGDTGNSYASGDKKDLELPESQRELLEAVAGCGK-PVVLCMMSGSAIDMQF 502

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
           A     +++IL + YPG  G KA AEI+FG  +P G+LP+T+Y +        D +M+  
Sbjct: 503 AA--EHVNAILQVWYPGARGGKAAAEILFGACSPSGKLPVTFYKDLEGFPAFEDYSMK-- 558

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
                 GR+YR+     +Y FG+GL+Y     K     +ELT   +++ G +  I     
Sbjct: 559 ------GRTYRYLEKEPLYPFGYGLTYGQVCVK----AAELT--GAVEEGKELTI----- 601

Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
                                +  V N+G  D   V+ ++ +        P   L  F R
Sbjct: 602 ---------------------KAMVENSGKYDTDDVIQVYIKDLDSKNAVPNHSLCAFKR 640

Query: 756 VHTVAKGSK-EISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           V ++ KG K EI   V P E L   ++ G++ +   + VL VG
Sbjct: 641 V-SLKKGEKAEILLKV-PYEALMAVDEEGKKYVDSSHFVLSVG 681


>gi|410098444|ref|ZP_11293422.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409222318|gb|EKN15263.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 738

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 249/777 (32%), Positives = 370/777 (47%), Gaps = 119/777 (15%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N  L +  R + +IS LTL+EK+Q +   A A+PRLGIPAY WW+E+LHG+A    
Sbjct: 26  YPFRNPDLPLDVRVQDIISRLTLEEKVQLMKHAAPAVPRLGIPAYNWWNEALHGVAR--- 82

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQAG-----L 162
                 T   VT FPQ +  AA+F+      +G   + E RA++N     G+ G     L
Sbjct: 83  ------TKEKVTVFPQAIGMAATFDTEALQKMGDMTSSEGRALFNEDLKAGKTGEIYRGL 136

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           T+W PNINIFRDPRWGRGQET GEDP + +      V   +G N                
Sbjct: 137 TYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGSAIVHGLEGNN---------------- 180

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
                       + L   AC KH   Y +      +R+S++A ++  D  DT+ P FR  
Sbjct: 181 -----------PEYLKSVACAKH---YAVHSGPEHNRHSYDARVSMYDLWDTYLPAFREL 226

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           + + K   +MC+YN+  G P C   +L Q   RN+W F GY+TSDC AV+   ++  Y K
Sbjct: 227 VTKAKVHGVMCAYNRFEGTPCCGHNELLQDILRNQWKFDGYVTSDCWAVS---DFAKYHK 283

Query: 342 THEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH +    V  A   G D+ CG  + +  Q  ++KG + EKDI+ +L  LF +Q +LG++
Sbjct: 284 THSNDTEAVADAVLNGTDLECGN-LYQKLQQGVEKGLISEKDINVSLARLFEIQFKLGMY 342

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +   R   Y  +G + +    HKK A + A++ +VLLKN+K  LPLN + +  +A+IGP 
Sbjct: 343 DPADRV-PYASIGREVIECDAHKKHAYEMAQKSMVLLKNNKNILPLNASKIKRIALIGPN 401

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDV----PCNSDAGFHEAVRIA 514
           ++N S +   Y G P    +  + L+     +        V           F +    A
Sbjct: 402 MDNGSTLLANYFGTPSEIITPYKSLQKRFGNSIQIDTLTGVGIVQKLEGAPSFAQVAAQA 461

Query: 515 KKADFVIVVAGLDLTQE-------------TEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
           KKAD +I V G+    E                 DR ++ LP  Q  L+  + +T  RP+
Sbjct: 462 KKADIIIFVGGISADYEGEAGDAGAAGYGGFASGDRTTMKLPPVQTELMKELKKTG-RPL 520

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           ILV   G  +  SF        +IL   Y G+A   A+ +++FGD+NP GR+P+T Y   
Sbjct: 521 ILVNMSGSVM--SFDWESRNADAILQAWYGGQAAGDAITDVLFGDYNPAGRMPLTTY--- 575

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
                MND ++          R+YR++ G   Y FG+GLSYT + Y  L   S +    S
Sbjct: 576 -----MNDEDLPDFEDYSMANRTYRYFKGDVRYPFGYGLSYTTFGYAPLQNASTVKTGES 630

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
           ++                                V  +VTN G   G  VV L+   P+ 
Sbjct: 631 IQ--------------------------------VTTTVTNTGKRAGDEVVQLYISHPQN 658

Query: 742 SQG-TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
                P + L GF R+H     S++++F + P E+LS+ ++ G ++   G   L +G
Sbjct: 659 GNTRVPLRALKGFKRIHLDTGESRQVTFTLSP-EELSLVDEKGNQVEKEGTVELYIG 714


>gi|171695518|ref|XP_001912683.1| hypothetical protein [Podospora anserina S mat+]
 gi|170948001|emb|CAP60165.1| unnamed protein product [Podospora anserina S mat+]
          Length = 805

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 265/806 (32%), Positives = 397/806 (49%), Gaps = 117/806 (14%)

Query: 55  CNTSLSISTRAKSLISLLTLQEK------------------IQQLSDNASAI--PRLGIP 94
           C+T+ S   RA +L+  L + EK                  IQ ++ ++ ++   R+G+P
Sbjct: 36  CDTTASPPARAAALVQALNITEKLVNLVEYVKSREAPLGISIQLITPHSMSLGAERIGLP 95

Query: 95  AYEWWSESLHGIASNGPGVNFNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEA 151
           AY WW+E+LHG+A++ PGV+FN      S  TSF   +  AA+F+  L   +   ++ EA
Sbjct: 96  AYAWWNEALHGVAAS-PGVSFNQAGQEFSHATSFANTITLAAAFDNDLVYEVADTISTEA 154

Query: 152 RAMYNLGQAGLTFWAPNINIFRDPRWGRG------------------QETPGEDPMVVSA 193
           RA  N   AGL +W PNIN ++DPRWGRG                  Q+TPGEDP+ +  
Sbjct: 155 RAFSNAELAGLDYWTPNINPYKDPRWGRGHEVCYLSLLFRAVQLLRTQKTPGEDPVHIKG 214

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
           Y    ++  +G +            + ++V+                A CKH  AYDLE+
Sbjct: 215 YVQALLEGLEGRD------------KIRKVI----------------ATCKHFAAYDLER 246

Query: 254 WGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDL 309
           W    RY FNA++T QD  + +  PF+ C    K    MCSYN +NG PAC    L  D+
Sbjct: 247 WQGALRYRFNAVVTSQDLSEYYLQPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDI 306

Query: 310 FQKARNEWGFKGYITSDCDAVATIF-EYQNYTKTHEDSAAGVLKAGMDINC---GTCMLR 365
            +K  N      YITSDC+A+      + N+++T   +AA    AG D  C   G   L 
Sbjct: 307 LRKHWNWTEHNNYITSDCNAIQDFLPNFHNFSQTPAQAAADAYNAGTDTVCEVPGYPPLT 366

Query: 366 HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLAL 425
               A ++  + E+ IDRAL  L+   +R G  +       Y K+    V T + + LAL
Sbjct: 367 DVIGAYNQSLLSEEIIDRALRRLYEGLIRAGYLDSA-SPHPYTKISWSQVNTPKAQALAL 425

Query: 426 DAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEA 485
            +A  GIVLLKN+   LPL+     ++A+IG   N   QM GGY+GIP    + +     
Sbjct: 426 QSATDGIVLLKNN-GLLPLDLTN-KTIALIGHWANATRQMLGGYSGIPPYYANPIYAATQ 483

Query: 486 YVSKTHYASGC--HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
                H+A G      P  +D     A+  A K+D ++ + G DL+   EDRDR S+  P
Sbjct: 484 LNVTFHHAPGPVNQSSPSTNDTWTSPALSAASKSDIILYLGGTDLSIAAEDRDRDSIAWP 543

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
             Q+SL+TS+A+  K  ++  L  G  +D +   ++  ISSILW+GYPG++G  AL  II
Sbjct: 544 SAQLSLLTSLAQMGKPTIVARL--GDQVDDTPLLSNPNISSILWVGYPGQSGGTALLNII 601

Query: 604 FGDFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
            G  +P  RLP+T YPE++T  +P+  M++R  S+R  PGR+YR+Y  + V  FGHGL Y
Sbjct: 602 TGVSSPAARLPVTVYPETYTSLIPLTAMSLRPTSAR--PGRTYRWYP-SPVLPFGHGLHY 658

Query: 663 TNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF-HVQISV 720
           T ++ KF      LTI+ A L +  ++  L                     RF  V + V
Sbjct: 659 TTFTAKF-GVFESLTINIAELVSNCNERYLDLC------------------RFPQVSVWV 699

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
           +N G++   +V ++F R     +  P K L+G+ R+  +  G+     G  P   + +  
Sbjct: 700 SNTGELKSDYVALVFVRGEYGPEPYPIKTLVGYKRIRDIEPGTT----GAAP---VGVVV 752

Query: 781 KHGRRILPLGNHVLMVGELRHSLTIE 806
               R+   GN VL  G+    L +E
Sbjct: 753 GDLARVDLGGNRVLFPGKYEFLLDVE 778


>gi|336261464|ref|XP_003345521.1| hypothetical protein SMAC_07509 [Sordaria macrospora k-hell]
 gi|380088197|emb|CCC13872.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 762

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 257/783 (32%), Positives = 389/783 (49%), Gaps = 82/783 (10%)

Query: 27  VIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNAS 86
            + ++   P    PD  C      S   C+ +LS   RA +L++ +T +EK+Q L   + 
Sbjct: 23  ALVYAIDLPFQTYPD--CVNGPLASLKVCDATLSPPQRAAALVAAMTTEEKLQNLVSKSK 80

Query: 87  AIPRLGIPAYEWWSESLHGIASNGPGVNF---NGTVSSVTSFPQVLVSAASFNRSLWSNI 143
             PR+G+PAY WWSE+LHG+A   PG  F   NGT +S TSFP  L+ AA+F+  L   +
Sbjct: 81  GAPRIGLPAYNWWSEALHGVAY-APGTQFRSGNGTFNSSTSFPMPLLMAATFDDELIERV 139

Query: 144 GSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ 203
           G  + +E RA  N G +G  +W PN+N F+DPRWGRG ETPGED + +  YA   ++  +
Sbjct: 140 GEVIGIEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLE 199

Query: 204 GENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN 263
           G              RE+R++                A CKH  A D E W   +R+ FN
Sbjct: 200 GP----------VRERERRIV----------------ATCKHYAANDFEDWNGSTRHDFN 233

Query: 264 AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG- 321
           A +T QD  + +  PF+ C    K   IMCSYN VNGVPAC    L Q   R+ W +   
Sbjct: 234 AKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILRDHWNWTAP 293

Query: 322 --YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEK 379
             YITSDC+AV  I    +Y KT+ +  A   +AG+D +C          A  +G +++ 
Sbjct: 294 GNYITSDCEAVLDISANHHYAKTNAEGTALAFEAGIDSSCEYEGSSDILGAWTQGLLKQS 353

Query: 380 DIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDK 439
            +DRAL  L+   +++G F+G+  + +Y  LG + V   + +++AL AA +GIVLLKNDK
Sbjct: 354 TVDRALRRLYEGLVQVGYFDGN--RSEYASLGWNHVNRPKSQEVALQAAVEGIVLLKNDK 411

Query: 440 KF-LPLNKNAVS-SLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASG-C 496
              L + KN     LA+IG   N+   + GGY+G P    S +   +A   K   A G  
Sbjct: 412 TLPLGVKKNGPKLKLAMIGFWANDPKTLSGGYSGTPAFEHSPVYATQAMGFKVTTAGGPV 471

Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
                + D     A+  AK A++++   G D +   E +DR ++  P  Q+ L+T +++ 
Sbjct: 472 LQNSTSKDTWTQAALAAAKDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLITDLSKL 531

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K P+++V  G   LD +   A   I+SILW  +P                 P GRLP+T
Sbjct: 532 GK-PLVVVQMGDQ-LDNTPLLASKAINSILWANWP----------------VPAGRLPVT 573

Query: 617 WYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
            Y  ++T  VPM DM +R   S + PGR+YR+Y  T V  FG GL YT +  K +  P  
Sbjct: 574 QYHANYTAAVPMTDMTLR--PSDKLPGRTYRWYP-TPVQPFGFGLHYTTFKTKIVRLP-R 629

Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
             I         K++L + G+     + D     T     +++ VTN G     +VV+ F
Sbjct: 630 FAI---------KDLLSRCGN----AYPD-----TCGLPPLKVEVTNTGKRSSDYVVLAF 671

Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
            +     +  P K L+ + R+  ++ G K  +        ++  ++ G  +L  G + ++
Sbjct: 672 LKGDVGPKPYPIKTLVSYTRLRDLSPGRKTTAHLDWTLGDIARYDEQGNTVLYPGTYTVI 731

Query: 796 VGE 798
           V E
Sbjct: 732 VDE 734


>gi|280977785|gb|ACZ98610.1| glucosidase [Cellulosilyticum ruminicola]
          Length = 711

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 248/771 (32%), Positives = 383/771 (49%), Gaps = 118/771 (15%)

Query: 60  SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
           +    AK L+  + L EK  QL  +A AI RLGIP Y WW+E+LHG+A  G         
Sbjct: 3   NFKNEAKELVRQMDLLEKASQLRYDAPAIKRLGIPTYNWWNEALHGVARAGVA------- 55

Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN-LGQA-------GLTFWAPNINI 171
              T FPQ +  AA F+      I   +A+E RA YN   Q        G+TFWAPNINI
Sbjct: 56  ---TVFPQAIGLAAMFDEEKLGEIADIIAIEGRAKYNQFSQKEDRDIYKGMTFWAPNINI 112

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           FRDPRWGRG ET GEDP + +   V F+K  QG+                          
Sbjct: 113 FRDPRWGRGHETYGEDPYLTARLGVAFIKGLQGD-------------------------- 146

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
           +  D L  +AC KH   +   +     R+ F+A+++++D  +T+ P F + +++     +
Sbjct: 147 ENEDYLKAAACAKHFAVHSGPEE---DRHHFDAIVSKKDLYETYLPAFEAAVKEANVIGV 203

Query: 292 MCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSA 347
           M +YN+VNG PAC    L  D+ +K   +WGF GYI SDC A+         T T  +SA
Sbjct: 204 MGAYNRVNGEPACGSKTLLVDILKK---DWGFDGYIVSDCWAIRDFHTEHMVTHTAAESA 260

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  +  G ++NCG   L H   A  +G V+E+ I  A   L  ++++LGLF+   +  KY
Sbjct: 261 ALAINNGCELNCGNTYL-HMLEAHQEGLVKEEIITEAAEKLMRIRMQLGLFD---KNCKY 316

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            ++         H+++AL+A+R+ +V+LKND   LPLNK+ + S+ IIGP  NN + + G
Sbjct: 317 NEIPYAVNDCKVHREVALEASRRSMVMLKND-GILPLNKDKLKSIGIIGPTANNRTVLEG 375

Query: 468 GYTGIPCSPKSLLRGLEAYVS---KTHYASGCH-------DVPCNSDAGFHEAVRIAKKA 517
            Y G      + + G++ YV    + +Y+ GCH       ++   +D    EA+ +A+++
Sbjct: 376 NYNGTASRYTTFVEGIQDYVGDDVRVYYSEGCHLFANGMSNLAWENDRE-AEALIVAEQS 434

Query: 518 DFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
           D V++  GLD T E E           D++SL L G+Q  L+  V    K PVILVL+ G
Sbjct: 435 DVVVLCLGLDSTIEGEQGDTGNAFAGGDKLSLNLIGRQQQLLEKVVAVGK-PVILVLSTG 493

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMN 628
             + +++  AD   ++I    YPG  G KALA+++FG+++P G+LP+T+Y  +       
Sbjct: 494 SAMAINY--ADEHCNAIFQTWYPGAQGGKALAQLLFGEYSPSGKLPVTFYKTTEELPAFE 551

Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
           D +M+         R+YR+     +Y FG+GLSY                          
Sbjct: 552 DYSMK--------DRTYRYMPNEALYPFGYGLSYA------------------------- 578

Query: 689 NILQQTGSRLDYVHIDEVT--SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
           +I  Q+   LD    +E+T  S    ++ V++ + N  +VD   VV ++ +  +     P
Sbjct: 579 DIKVQSVKVLDGAKGEEITNFSAGQTKYKVKVELENKSNVDSYDVVQIYIKDMESQYAVP 638

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
              L  F  V   A  SKE++  V   +  ++ N+ G+RI+      L +G
Sbjct: 639 NFSLCSFKSVFLKAGESKEVTLNVGE-KAFTVINEEGKRIVDSKKFKLFIG 688


>gi|373954937|ref|ZP_09614897.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373891537|gb|EHQ27434.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 723

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 247/773 (31%), Positives = 391/773 (50%), Gaps = 128/773 (16%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           PF  T +    R + LIS LTL+EK+ Q+ D + ++PRL +P Y WW+E+LHG+A +G  
Sbjct: 27  PFNPTDV----RVRDLISKLTLEEKVHQMMDVSPSVPRLNLPKYNWWNEALHGVARSG-- 80

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTF 164
                     T FPQ +   A+F++ L     +A++ EARAMYN            GLTF
Sbjct: 81  --------VATIFPQAIALGATFDQDLAKRESTAISDEARAMYNAAMVNGYNEKYGGLTF 132

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PNINIFRDPRWGRGQET GEDP + S   V F++  QG                    
Sbjct: 133 WTPNINIFRDPRWGRGQETYGEDPFLTSQIGVAFIQGLQG-------------------- 172

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                  D  + L ++AC KH   + +       R+SFNA+ + +D  +T+ P F++ + 
Sbjct: 173 -------DDPEHLKVAACAKH---FAVHSGPERLRHSFNAIASPKDLRETYLPAFKALV- 221

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
             +   +MC+YN+ N    C    L  Q  R+EW F G++ SDC A+   +         
Sbjct: 222 NARVEAVMCAYNRTNSEVCCGSNLLLDQILRDEWHFTGHVVSDCGAIVDFYMGHKVVPGQ 281

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
            ++ A  +K G+D+NCG       + A+ +G + EK+ID+AL  L   + +LGLF  DP+
Sbjct: 282 PEAVALAVKHGVDLNCGDEYPALIE-AVKRGLITEKEIDKALATLLKTRFKLGLF--DPK 338

Query: 404 KGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
           +   Y  +    + +++H+ LA + A + IVLLKN+K  LPL KN +S   I GP   ++
Sbjct: 339 QNSPYNNIPVSVINSTDHRALAKEVALKSIVLLKNEK-CLPL-KNNLSKYYITGPNAASV 396

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC-----HDVPCNSDAGFHEAVRIA 514
             + G Y G+     ++L G+   +   S+  Y  G      ++ P +   G       A
Sbjct: 397 DALMGNYYGVNPHMSTILEGIAGAIQPGSQMQYKPGILLDRDNNNPIDWTTG------DA 450

Query: 515 KKADFVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
           K +D   VV G+    E E+          DR+   LP  Q+  +  + + +K  V+ ++
Sbjct: 451 KASDVTFVVMGITGLLEGEEGEAIASPNYGDRLDYNLPKNQIDFLRKIRKGNKNKVVAII 510

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
           TGG P+++S  E      ++L   YPGE G  A+A+I+FG  +P GRLP+T +P+SF ++
Sbjct: 511 TGGSPMNLS--EVHELADAVLLAWYPGEEGGNAVADILFGKVSPSGRLPVT-FPKSFAQL 567

Query: 626 P-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
           P   D +M+        GR+YR+ T   +Y FG+GLSY+ Y+Y      S LT+S     
Sbjct: 568 PPYEDYSMK--------GRTYRYMTAEPMYTFGYGLSYSTYTY------SSLTLS----- 608

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
             +K I +                  ++    +  VTN G ++G  VV L+  VP+ ++ 
Sbjct: 609 --EKQIKK------------------NMTIIAETMVTNTGKMEGEEVVQLYITVPQ-TEK 647

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            P+  L GF RV+  A  S+++ F + P + +   + +G  +L  G++V+ +G
Sbjct: 648 NPQYSLKGFKRVNLKAGESRKVQFQITP-DLMKSVDANGSEVLLSGSYVVRIG 699


>gi|330836687|ref|YP_004411328.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
 gi|329748590|gb|AEC01946.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
          Length = 709

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 219/626 (34%), Positives = 335/626 (53%), Gaps = 82/626 (13%)

Query: 65  AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
           A+ ++S +TL EKI Q+   ASAIPRL IP Y WW+E+LHG+A  G            T 
Sbjct: 14  ARRIVSRMTLDEKISQIDYRASAIPRLDIPEYNWWNEALHGVARAGIA----------TV 63

Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFWAPNINIFRDPR 176
           FPQ +  AA F+  +   IG+ ++ E RA YN            GLTFW+PN+NIFRDPR
Sbjct: 64  FPQAIGLAAMFDSDMMERIGAVISTEGRAKYNEAVRHGDRDIYKGLTFWSPNVNIFRDPR 123

Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
           WGRGQET GEDP + +  AV F++  QG+                            G  
Sbjct: 124 WGRGQETYGEDPYLTARLAVAFIRGIQGD----------------------------GKY 155

Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
           L  +AC KH   + +       R+ F+A ++++D  +T+   F++ +++ +   +M +YN
Sbjct: 156 LKAAACAKH---FAVHSGPEALRHEFDARVSQKDLHETYLSAFKAAVKEAQVEIVMGAYN 212

Query: 297 QVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
           +VNGVPAC   +L     R+EWGF+G++ SD +A+  IF++ +Y      + A  LKAG 
Sbjct: 213 RVNGVPACASHELLSDILRSEWGFEGHVVSDYEALEDIFKHHHYVADEAHTMAVALKAGC 272

Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
           ++ C   + RH +S++D+G + E +I  A+  LF+ ++ +G+   D     Y  +G ++ 
Sbjct: 273 NL-CAGKIARHLRSSVDEGLISEDEITEAVERLFTTRIMMGMMADDC---PYDSIGYEEN 328

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
            T EH +LA++AA +  VLLKND   LPL    +SS+A+IGP  N+   + G Y G    
Sbjct: 329 DTPEHHQLAVEAASRSFVLLKNDG-LLPLEMEKISSIAVIGPNANSRKMLEGNYNGTASR 387

Query: 476 PKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
             ++L G++  V    +  Y+ GCH            +    EAV  A+ AD V++  GL
Sbjct: 388 YVTVLEGIQDLVGDSVRVWYSEGCHLYKNFHSSLSGRNDRLAEAVSAAQHADVVVLCLGL 447

Query: 527 DLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D T E E+ D         + +L LPG+Q  L+ ++  T  +PVIL+L  G  L +   E
Sbjct: 448 DATLEGEEGDVEVGFGSGDKPNLSLPGRQQLLLDTML-TVGKPVILLLASGSALTLGGRE 506

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
            D  + +IL I YPG  G KA+A+++FG   P G+LP+T+Y  +       D +M     
Sbjct: 507 NDENLKAILQIWYPGAMGGKAVADVLFGRRAPAGKLPVTFYASADELPAFEDYSM----- 561

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYT 663
               GR+YR+  G  +Y FG+GL+Y+
Sbjct: 562 ---AGRTYRYMKGNALYPFGYGLTYS 584


>gi|372208556|ref|ZP_09496358.1| beta-glucosidase [Flavobacteriaceae bacterium S85]
          Length = 729

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 254/768 (33%), Positives = 372/768 (48%), Gaps = 112/768 (14%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +TSL+   R   L+  +TL+EKI QL   +  + RL IP Y WW+E+LHG+A NG   
Sbjct: 26  WLDTSLTFEERIHHLVKAMTLKEKIAQLDSGSPEVKRLDIPEYNWWNEALHGVARNGKS- 84

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFW 165
                    T FPQ +  AA+F+  L   + SA++ EARA +N+ Q        AGLTFW
Sbjct: 85  ---------TVFPQAIGLAATFDPVLAKQVASAISDEARAKFNISQSIGNRGQYAGLTFW 135

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
            PN+NIFRDPRWGRGQET GEDP + S   V FVK  QG + K                 
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTSQMGVAFVKGLQGNHPKY---------------- 179

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                      L  +AC KH   +   +     R+ FNA  +++D  +T+ P F + ++Q
Sbjct: 180 -----------LKSAACAKHFAVHSGPEE---LRHHFNANPSKKDLYETYLPAFEALVKQ 225

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
                +M +YN V GVPA     L ++  R  WGF GYI SDC A+  IF+     KT  
Sbjct: 226 ANVEGVMSAYNAVYGVPAGSSEFLLKETLRKSWGFDGYIVSDCGALGDIFKGHKQVKTMP 285

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           ++AA  LKAG+++NCG       + A+ +G V E+ ID  L  L   + +LG F  DP++
Sbjct: 286 EAAAVALKAGVNLNCGYVYNGALEKAVQQGLVSEELIDTRLKQLLKTRFKLGFF--DPKE 343

Query: 405 GK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
              Y  +    + + +H  LA   A++ IVLLKN    LPL+KN +    + GP  ++  
Sbjct: 344 ANPYNAIPTSVIHSDDHIALARKTAQKSIVLLKNKNHTLPLDKN-IKVPYVTGPFASSSD 402

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHE-AVRIAKKADF 519
            +   Y G+  +  S+L G+   VS     +Y  G   +P N +      A  +AK AD 
Sbjct: 403 VLLANYYGMTTNLVSVLEGIADKVSLGTSLNYRMGA--LPFNKNLNPKNWAPNVAKTADA 460

Query: 520 VIVVAGLDLTQETEDRDRVS---------LLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           VI V GL    E E+ D ++         L LP  Q+  V  +A   K P+ILV+  G  
Sbjct: 461 VIAVVGLSADFEGEEVDAIASPNKGDKKDLKLPQNQIDYVKEMAAKKKGPLILVVASGSA 520

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMND 629
             V+  E      +I+ + YPGE G  A+A+++FGD +P G LP+T +P+S  ++ P  D
Sbjct: 521 --VALGELYDLADAIVLMWYPGEQGGNAVADVLFGDVSPSGHLPVT-FPKSVAQLPPFED 577

Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
            +M+        GR+Y++     ++ FG GLSYT++                        
Sbjct: 578 YSMQ--------GRTYKYMEEEPLFPFGFGLSYTDF------------------------ 605

Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
                  +   V I E        F V  SV N G VDG  VV L+      ++  P+ Q
Sbjct: 606 -------KFSNVQISEEKIKKKDSFTVSCSVANNGKVDGEEVVQLYLVPLNSNKDLPKYQ 658

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           L+ F R+      SK +SF ++  + L   NK G++    G + L+V 
Sbjct: 659 LLKFKRIEIQKNTSKTVSFNLE-AKDLFQVNKEGKKTWIKGKYKLVVA 705


>gi|261368518|ref|ZP_05981401.1| beta-glucosidase [Subdoligranulum variabile DSM 15176]
 gi|282569400|gb|EFB74935.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Subdoligranulum variabile DSM 15176]
          Length = 717

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 240/762 (31%), Positives = 382/762 (50%), Gaps = 117/762 (15%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA++L++ +TL+EKI Q+   A AIPRLGIPAY WW+E +HG+          G   + T
Sbjct: 14  RARALVAQMTLKEKISQMLSWAPAIPRLGIPAYNWWNEGIHGV----------GRAGTAT 63

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +  AASF+  L   +G AV VEAR  YN+ ++        GLT WAPN+NIFRDP
Sbjct: 64  VFPQAIGLAASFDEDLLGQVGEAVGVEARGKYNMYRSYQDRDIYKGLTIWAPNVNIFRDP 123

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP + S   V FV+  QG                           D  D
Sbjct: 124 RWGRGHETYGEDPYLTSRLGVRFVEGMQG---------------------------DDPD 156

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L  +AC KH   +   +     R+ F+A +++QD  +T+ P FR+ +++     +M +Y
Sbjct: 157 YLRAAACAKHFAVHSGPE---DQRHYFDAKVSQQDLWETYLPAFRALVKEAGVEAVMGAY 213

Query: 296 NQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N+ NG P C  +  L    R +W F+G++TSDC A+    E    T    DS A  +  G
Sbjct: 214 NRTNGEPCCGSKTLLVDILRGKWNFQGHVTSDCWAIKDFHEGHMVTSGPVDSVALAVNNG 273

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            D+NCG  +  + + A+ +GKV+E+ IDR+L+ LF+ +++LG+F+ +  K  Y K+G D 
Sbjct: 274 CDLNCGD-LYAYLEEAVAEGKVKEETIDRSLVRLFTTRMKLGMFDAE-EKVPYNKIGYDA 331

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           V + E + L L+ A + +VLLKN+   LPL+K+ +  +A++GP  +N   + G Y G   
Sbjct: 332 VDSREMQALNLEVAEKILVLLKNENHTLPLDKSKLHRVAVVGPNADNRKALVGNYEGTAS 391

Query: 475 SPKSLLRGLEAYVS---KTHYASGCHDVP------CNSDAGFHEAVRIAKKADFVIVVAG 525
              ++L G++ Y+    +  Y+ GCH           S+    E   +  + D VI   G
Sbjct: 392 RYVTVLDGIQEYLGEDVQVRYSEGCHLYADKIQGLAKSNELISEVRGVCAECDVVICCLG 451

Query: 526 LDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
           LD   E E+          D+ SL LPG Q S++ +   + K  V++VL+G     ++  
Sbjct: 452 LDAGLEGEEGDQGNQFASGDKQSLSLPGNQESVLKACIESGKPVVVVVLSGSA---LALG 508

Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
            A    +++L   YPG  G +A+A  +FG+ NP G+LP+T+Y         +D ++ A +
Sbjct: 509 TAQEGAAAVLQAWYPGAQGGRAVARALFGECNPQGKLPVTFY--------HSDEDLPAFT 560

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
                GR+YR+     +Y FG+GLSY++++++   A +        + G D         
Sbjct: 561 DYAMKGRTYRYMEKEPLYPFGYGLSYSHFTFRDAKADAA-------QIGPDG-------- 605

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
                              V+++V N G   G   V ++ +  +   GTP  QL    +V
Sbjct: 606 -----------------VDVRVTVVNDGQYRGRETVEVYVKAER--PGTPNAQLKALAKV 646

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHG-RRILPLGNHVLMVG 797
             +    K ++  +  C   ++ N+ G   +LP G + + +G
Sbjct: 647 DLMPGEEKCVTLHLPQC-AFALCNEEGISEVLP-GEYTVWLG 686


>gi|322512556|gb|ADX05682.1| putative carbohydrate-active enzyme [uncultured organism]
          Length = 717

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 244/774 (31%), Positives = 389/774 (50%), Gaps = 123/774 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + + +   RA++L+  +TL+EK+ Q   NA AI RLG+PAY +W+E+LHG+A  G   
Sbjct: 6   WLDETKTFEERAQALVCEMTLEEKVFQTLFNAPAIERLGVPAYNYWNEALHGVARAG--- 62

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AASF+  L   +   ++ EARA +N+ Q         GLTFW
Sbjct: 63  -------VATVFPQAIGLAASFDEELLGQVADTISTEARAKFNMQQKFGDRDIYKGLTFW 115

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PN+NIFRDPRWGRG ET GEDP +     V F++  QG++              +R +K
Sbjct: 116 SPNVNIFRDPRWGRGHETFGEDPFLSGRLGVSFIRGMQGDD--------------ERYMK 161

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                        ++AC KH   + +       R+SFNA+++EQD  +T+ P F +C+ +
Sbjct: 162 -------------VAACAKH---FAVHSGPEDQRHSFNAVVSEQDLRETYLPAFHACVTE 205

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
                +M +YN+ NG   C    L     R EWGF+G++TSDC A+    E+   TK  E
Sbjct: 206 AGVEAVMGAYNRTNGEACCGSKKLLVDILRGEWGFRGHVTSDCWALKDFHEFHMVTKNQE 265

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           ++ A  + +G D+NCG   + H   A+  G V+E  IDRA+  LF+ +++LGLF+    +
Sbjct: 266 ETVALAMNSGCDLNCGNLYV-HLLQAVRDGLVEESVIDRAVTRLFTTRMKLGLFD-RSEE 323

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             Y  +G D V T  ++KL  +A+R+ + LLKN    LPL+ + + ++ ++GP  +N   
Sbjct: 324 VPYNGIGYDRVDTEANRKLNREASRRTVCLLKNADGLLPLDISKLRTIGVVGPNADNRKA 383

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH---------DVPCNSDAGFHEAVR 512
           + G Y G      ++L G+        +  Y+ GCH           P +  A   EA  
Sbjct: 384 LVGNYEGTASEYVTVLDGIRELAGDDVRVVYSEGCHLFRDRVQGLGQPNDRIA---EARA 440

Query: 513 IAKKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
           +A+ +D VI V GLD   E E+          D+ +L LPG Q  ++ ++  + K PV+L
Sbjct: 441 VAELSDVVIAVMGLDPGLEGEEGDQGNEFASGDKPNLELPGLQGEVLKALVESGK-PVVL 499

Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
           VL GG  L + +AE    + +IL   YPG  G +A+A+++FG   P G+LP+T+Y  S  
Sbjct: 500 VLLGGSALAIPWAE--EHVPAILDAWYPGAQGGRAVADVLFGRACPEGKLPVTFYRTSEE 557

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
                D +M+         R+YR+     +Y FG+GLSYT++         ELT + +  
Sbjct: 558 LPAFTDYSMK--------NRTYRYMKQPALYPFGYGLSYTSW---------ELTNTTA-- 598

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
                      GS      +D+   C ++       + N G + G+  V ++ + P  + 
Sbjct: 599 ----------EGS------VDDGVVCRAV-------LRNTGAMAGAQTVQVYVKAPLAT- 634

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             P  QL G  ++      S E++  +D  E   + N+ G R+L  G + + +G
Sbjct: 635 -GPNAQLKGLRKIRLQPGESAEVAISLDK-EAFGVYNEKGLRVLLPGEYKIYIG 686


>gi|346225847|ref|ZP_08846989.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
 gi|346227016|ref|ZP_08848158.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
          Length = 718

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 247/768 (32%), Positives = 380/768 (49%), Gaps = 114/768 (14%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           F N  +S+  RA+ ++  LT++EKI QL + A A+ RL IP Y+WW+E LHG+A  G   
Sbjct: 17  FRNPDISLDERAECIVKQLTVEEKINQLMNAAPAVDRLEIPEYDWWNECLHGVARAG--- 73

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-------GQ-AGLTFW 165
                    T FPQ +  AA+++ +L   +G A++ EARA YN+       GQ  GLTFW
Sbjct: 74  -------RATVFPQAIGMAATWDTTLVYRVGDAISTEARAKYNVFSKHGYRGQYKGLTFW 126

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
            PN+NIFRDPRWGRGQET GEDP + S   V FVK  QG + K                 
Sbjct: 127 TPNVNIFRDPRWGRGQETYGEDPFLTSRIGVSFVKGLQGNHPKY---------------- 170

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                      L ++A  KH   Y +       R+ F+A ++ +D  +T+ P F + +++
Sbjct: 171 -----------LKVAALAKH---YAVHNGPEALRHEFDAKVSMKDLWETYLPAFEALVKE 216

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
                +M +YN+ NG P C    L Q+  R +WGF GY  SDC A+   +       T E
Sbjct: 217 AGVEGVMGAYNRTNGDPCCAHPYLMQEVLREKWGFDGYYVSDCGAIMDFYTGHKIVDTPE 276

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           ++AA  L AG ++NCG       +S ++KG   E++IDR++  LF  +LRLGLF   P  
Sbjct: 277 EAAAMALNAGCNLNCGDTYASLLKS-LEKGLTTEEEIDRSVKQLFKTRLRLGLFA--PEG 333

Query: 405 G-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
              Y  +  D + + EH+KLAL+AAR+ +VLLKN+   LP+ ++ V  + + GP   ++ 
Sbjct: 334 AVPYDTISTDVIRSKEHQKLALEAARKSVVLLKNEANTLPVARD-VKKVYVTGPTATHVQ 392

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGC--HDVPCNSDAGFHEAVRIAKKAD 518
            +   Y G+     ++L G+   VS      Y  G   ++   N+   F  A   A  AD
Sbjct: 393 ALLANYYGVSEDMTTILEGIVGKVSPQTSVQYRQGALLYEANRNTMDWFSGA---AASAD 449

Query: 519 FVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
             +   G+    E E+          DR    LP  Q+  +  +  ++K+ V+++ +G  
Sbjct: 450 VTVACLGISQLIEGEEGEAIASEHRGDRERTRLPQNQIDFLKRIRASAKKLVVVITSGSA 509

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
              +S  E      ++L++ YPGE G KA+A+++FGD  P GRLP+T         P  +
Sbjct: 510 ---ISLPEIYDMADALLYVWYPGEQGGKAVADVLFGDAVPSGRLPVTVVKSVDDLPPYEN 566

Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
            +M+        GR+YR+   +  + FG GLSYT+++Y      S LT+ +         
Sbjct: 567 YDMK--------GRTYRYMEVSPQFPFGFGLSYTDFTY------SNLTLES--------- 603

Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
                         ++V S  S+R  +   +TN G+ D   VV  +    + S   P++ 
Sbjct: 604 --------------NKVKSGESVR--LSFDLTNEGEYDADEVVQFYITDVEASVNVPKQS 647

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           LIGF RV   A  S +I F V P + + I + +G +IL  G   + +G
Sbjct: 648 LIGFKRVGLAAGESTKIEFTVTP-DMMKIVDNNGEKILESGEFKIYIG 694


>gi|330947691|ref|XP_003306937.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
 gi|311315273|gb|EFQ84970.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
          Length = 756

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 242/764 (31%), Positives = 377/764 (49%), Gaps = 61/764 (7%)

Query: 40  PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           PD  CK     S   C+ + S + RA +L++ +  QEK++ L   +  + RLG+PAY WW
Sbjct: 22  PD--CKNGPLKSNAICDVTASPAKRAAALVAAMQTQEKLENLVSKSKGVARLGLPAYNWW 79

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
            E+LHG+A   PG+NF G+  + TSFP  L+ +A+F+  L   I   +  EARA  N G 
Sbjct: 80  GEALHGVAG-APGINFTGSYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGI 138

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           A + FW P+IN FRDPRWGRG ETPGED + +  Y    +   +G+              
Sbjct: 139 APVDFWTPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGDK------------A 186

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
           +++++                A CKH + YD+E W    R+ F+A IT QD  + F PPF
Sbjct: 187 QRKII----------------ATCKHYVGYDVENWNGTDRHHFDAKITTQDLAEYFMPPF 230

Query: 280 RSCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFE 335
           + C    K    MCSYN VNGVP C    +  D+ +K  N      YITSDC+AV  I  
Sbjct: 231 QQCARDSKVGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDSNNYITSDCEAVKDISL 290

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
              Y  T +++ A     GMD++C          A  +G +    IDRAL   +   +  
Sbjct: 291 RHKYVATLQEATAIAFNNGMDLSCEYSGTSDIPGAFSQGLLNVSVIDRALTRQYEGLVHA 350

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G F+G      Y  LG  D+ T E +KL L  A +G+ LLKND   LPL+  + S +A++
Sbjct: 351 GYFDG--AAATYAHLGVQDINTPEAQKLVLQVAAEGLTLLKNDDT-LPLSLKSGSKVAMV 407

Query: 456 GPLVNNISQMGGGYTG-IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           G   N  S++ G Y+G  P     +  G +  +               +D     A+  A
Sbjct: 408 GFWANTTSKLSGIYSGPAPYLHTPVYAGNKLGLDMAVATGPILQTSGAADNWTTTALNAA 467

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           KK+DF++   GLD +   E  DR  +  P  Q+ L+T +A   K  V++ L  G  +D +
Sbjct: 468 KKSDFILYFGGLDPSAAAEGSDRTDISWPSAQIDLITKLAALGKPLVVIAL--GDMVDHT 525

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
                  ++S++W  +PG+ G  A+ ++I G+    GRLP+T YP  +T++ M DMNMR 
Sbjct: 526 PILKMKGVNSLIWANWPGQDGGTAVMQVITGEHAIAGRLPITQYPAEYTQLSMLDMNMRP 585

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
             +   PGR+YR+Y    V  FG GL YT ++ KF S+ S LT++        ++I++  
Sbjct: 586 GGNN--PGRTYRWYN-ESVQPFGFGLHYTKFAAKFGSS-SGLTVNI-------QDIMKSC 634

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
               D+  + +V         ++++VTN G+     + + F +     +  P K L+ + 
Sbjct: 635 TK--DHPDLCDVPP-------IEVAVTNEGNRTSDFIALAFIKGEVGPKPYPLKTLVSYA 685

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           R+  ++    +++        LS  ++ G  +   G + L++ E
Sbjct: 686 RLRDISGSQTKMASLALTLGALSRVDQSGNLVAYPGEYTLLLDE 729


>gi|169611757|ref|XP_001799296.1| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
 gi|160702362|gb|EAT83185.2| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
          Length = 755

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 253/753 (33%), Positives = 374/753 (49%), Gaps = 73/753 (9%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
            C+ + + + RA +L+  +   EK+  L      + RLG+P Y WW E+LHG+A   PG+
Sbjct: 33  ICDVTAAPAERAAALVEAMQTNEKLDNL---MRGVTRLGLPKYNWWGEALHGVAG-APGI 88

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFR 173
           NF G   + TSFP  L+ +A+F+  L   I + +  EARA  N G A + FW P+IN FR
Sbjct: 89  NFTGAYKTATSFPMPLLMSAAFDDDLIFKIANIIGNEARAFGNGGVAPVDFWTPDINPFR 148

Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           DPRWGRG ETPGED + +  Y    +   +G+              +++++         
Sbjct: 149 DPRWGRGSETPGEDIVRIKGYTKHLLAGLEGDK------------PQRKII--------- 187

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
                  A CKH + YD+E WG   R+SFNA I  QD  + + PPF+ C    K    MC
Sbjct: 188 -------ATCKHYVGYDMEAWGGIDRHSFNAKINMQDLAEYYMPPFQQCARDSKVGSFMC 240

Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           SYN VNGVP C    + Q   R+ W +     YITSDC+AV  I     Y KT+ +    
Sbjct: 241 SYNAVNGVPTCADTYVLQTILRDHWNWTESNNYITSDCEAVKDISLKHKYAKTNAEGTGL 300

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
              AGMD +C          A ++  +    IDRAL   +   +R G F+G      Y  
Sbjct: 301 AFTAGMDNSCEYTGSSDIPGAFNQSYLSIPTIDRALKRQYEGLVRAGYFDG--AAATYAN 358

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  D+ T E ++L+L  A +G+VLLKND   LPL+    S +A++G   N+ S++ G Y
Sbjct: 359 LGVKDINTPEAQQLSLQVASEGLVLLKNDDT-LPLSLTNGSKVAMLGFWANDTSKLSGIY 417

Query: 470 TGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNS---DAGFHEAVRIAKKADFVIVVAGL 526
           +G     +S +   +        ASG      NS   D     A+  A+K+D+++   GL
Sbjct: 418 SGPAPYLRSPVWAGQKLGLDMAIASGPILQQSNSSTRDNWTTNALAAAEKSDYILYFGGL 477

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
           D +   E  DR S+  P  Q+ L+  +A   K  V+LVL  G  +D S       ++S++
Sbjct: 478 DPSAAAEGFDRNSIAWPTAQVDLIKKLAAIGKPLVVLVL--GDLMDNSPLLELDGVNSVI 535

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
           W  +PG+ G  A+ +++ G     GRLP+T YP ++T++ M DMNMR  SS   PGR+YR
Sbjct: 536 WANWPGQDGGSAVMQVVTGAVAVAGRLPITQYPANYTELSMLDMNMRPSSSS--PGRTYR 593

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPS-ELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
           ++ G  V  FG GL YT +  KF +  + E  IS         NI ++  ++  Y     
Sbjct: 594 WFNGA-VQPFGTGLHYTTFDAKFAANSTIEYDIS---------NITKECTNQ--YPDTCS 641

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
           V S       + ++VTN+G+     + + F +        P K LI + RV  V KG + 
Sbjct: 642 VPS-------IPVAVTNSGNRTSDFIALAFIKGENGPAPYPLKTLISYTRVRDV-KGGQT 693

Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
            S  +    QL++ N    R+  +GN VL  GE
Sbjct: 694 KSAEM----QLTLGNL--ARVDQMGNTVLYPGE 720


>gi|380696433|ref|ZP_09861292.1| glycoside hydrolase [Bacteroides faecis MAJ27]
          Length = 739

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 249/774 (32%), Positives = 376/774 (48%), Gaps = 114/774 (14%)

Query: 50  DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
           + +PF +  L +  R + L+S LTL+EK++Q+ ++   + RLGIPAY WW+E LHGI   
Sbjct: 23  EKFPFRDPQLPVEQRVEDLVSRLTLEEKVKQMLNSTPPVERLGIPAYNWWNECLHGIGR- 81

Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------G 161
                   T   VT FPQ +  AA++N +L   + S++A E RA+YN  Q          
Sbjct: 82  --------TKYHVTVFPQAIGMAAAWNDALIKEVASSIADEGRAIYNDTQRKEDYSQYHA 133

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LT+W PNINIFRDPRWGRGQET GEDP + +     FV+  QG+N               
Sbjct: 134 LTYWTPNINIFRDPRWGRGQETYGEDPYLTARIGEAFVQGLQGDN--------------P 179

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
           R LK              SAC KH   Y +      +R+SFN+ ++  D  DT+ P FR+
Sbjct: 180 RYLKA-------------SACAKH---YAVHSGPEKNRHSFNSDVSTYDLWDTYLPAFRT 223

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT 340
            +   K S +MC+YN   G P C    L Q   R++W F GY+TSDC A+  IF   N+ 
Sbjct: 224 LVVDAKVSGVMCAYNAFQGQPCCGNDLLMQSILRDKWNFTGYVTSDCGAIDDIF---NHH 280

Query: 341 KTHEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           KTH D+A     A   G D++CG         A+  G + EK +D ++  LF+++ RLGL
Sbjct: 281 KTHPDAATAAADAVFHGTDLDCGHSAYLALVKAVKDGIITEKQLDVSVKRLFTIRFRLGL 340

Query: 398 FNGDPRK-GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
           F  DP +   Y ++    +   +H+ LA   AR+ +VLLKND + LPL KN +  + ++G
Sbjct: 341 F--DPVELVDYARIPISILECRKHQDLAKQLARESMVLLKND-QLLPLQKNKLKKVVVMG 397

Query: 457 PLVNNISQMGGGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRI 513
           P  ++   + G Y G P    +P   +R      ++  Y  G   V   S     + V  
Sbjct: 398 PNADSRESLLGNYNGNPSRMLTPLQAIRERLGGWTEVEYIEGVDHVNTISADDLKQYVNR 457

Query: 514 AKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVIL 563
           AK AD VI + G+    E E+           DR ++ LP  Q  ++ +       P + 
Sbjct: 458 AKGADAVIFIGGISPRLEGEEMPVSKDGFDGGDRTTIALPAVQTQMMKAWV-AEHIPTVF 516

Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
           V+  G  L + +   +  + +IL   Y G+ G +A+A+++FGD+NP G+LP+T+Y +   
Sbjct: 517 VMMTGSALAIPWEAQN--VPAILNAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAK--- 571

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
                D ++    S    GR+YR++ G  +Y FG+GLSYT+++Y  L  P          
Sbjct: 572 -----DSDLPDFESYDMQGRTYRYFNGKALYPFGYGLSYTSFAYSSLKLP---------- 616

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
                                +V   T     V ++V N G  +G  VV L+   P    
Sbjct: 617 ---------------------KVCRTTDKEIEVTVTVKNTGHTEGEEVVQLYVSHPDKKI 655

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             P   L GF R+   A  ++ ++F +   E LS  +++G R +  G   + VG
Sbjct: 656 LVPLTALKGFKRIQLKAGEAQRVTFSLS-SEDLSCVDENGIRKVWAGTVKIQVG 708


>gi|449299051|gb|EMC95065.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 849

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 261/804 (32%), Positives = 385/804 (47%), Gaps = 80/804 (9%)

Query: 40  PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           PD    P    S   C+T+ +   RA ++I+ + + EK+  L D +    RLG+P YEWW
Sbjct: 28  PDCVNGPALLTSNLVCDTNATPYQRASAIINAMNITEKLANLLDVSYGSARLGLPPYEWW 87

Query: 100 SESLHGIASNGPGVNF--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL 157
           SE+LHG+A + PGVNF  +G  S  TSFP  +  +++F+     NI S ++ EARA  N 
Sbjct: 88  SEALHGVAGS-PGVNFTSSGNYSYATSFPMPITFSSAFDDPSVQNIASVISTEARAYSNA 146

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
            + GL ++ PNIN F+DPRWGRG ETPGEDP+ +  Y    +   +G    +DD      
Sbjct: 147 ARGGLDYFTPNINPFKDPRWGRGSETPGEDPLRIQGYVKNLLIGLEG----TDD------ 196

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                   G+   S  G + M+ A CKH   YDLE W  + RY ++A IT QD  + + P
Sbjct: 197 --------GYFNTSHSGYKKMI-ATCKHFAGYDLEDWDGYIRYGYDAEITTQDLAEYYLP 247

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATI 333
           PF++C      + IMCSYN VN VPAC    L +   R  WG+     YITSDC+A++ I
Sbjct: 248 PFQTCARDQNVASIMCSYNSVNSVPACANSYLQETILREHWGWTIDNNYITSDCNAISDI 307

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINC---GTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
           +   NY+  +  +A   L  GMD  C    T ++     +   G V E  I  AL+  + 
Sbjct: 308 YYNHNYSVNNAAAAGLSLSNGMDTACIVANTGVMTDVNGSYYGGYVTEATITTALIRQYE 367

Query: 391 VQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
             +  G F  DP     Y  +G   V T   + LA  AA +G  LLKN    LP    + 
Sbjct: 368 ALVIAGYF--DPASSNPYRSIGWSSVNTPAAQTLARQAATEGTTLLKN-TGLLPYKFTSQ 424

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHE 509
           + +A+IG   N  SQM GGY+G      S L          +YA+G  +    +      
Sbjct: 425 TKVAMIGMWANGTSQMQGGYSGPAPYLHSPLYAASQLGLSYNYANGPINQTTLTSNYSQN 484

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
           A   A+ AD ++   G+D + E E  DR  +  PG Q +L+  +A   K   ++VL  G 
Sbjct: 485 ATAAAQNADVILFFGGIDWSVEAEAMDRYQIAWPGAQQALIAQLAALGKP--MIVLQMGS 542

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMN 628
            LD +   +++ IS+++W+GYPG+ G  A  +I+ G   P GRLP+T YP  +  +VPM 
Sbjct: 543 MLDATPILSNNNISALVWVGYPGQDGGVAAFDILTGAVAPAGRLPVTMYPADYVNQVPMT 602

Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF--------------LSAPS 674
           +M++R       PGR+Y++Y    V  F +GL YT +   F               +AP 
Sbjct: 603 NMSLRPGPGN--PGRTYKWYNNA-VLPFAYGLHYTTFKATFNGGPPGPGSPWSPPWNAPW 659

Query: 675 ELTISASLKAGS----------DKNILQQTGSRLDYVHIDE-VTSCTSLR------FHVQ 717
              +      G+             +    G      +I   ++SCT+          V 
Sbjct: 660 SAKVRRGWGWGNWGPPNWGWTQPSQVAPGNGGLSSSYNIQSLLSSCTAAHPDLCAFPSVA 719

Query: 718 ISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG-----SKEISFGVDP 772
           ISV NAG      V ++F+         P K L  + R+H+VA G     S  ++ GV  
Sbjct: 720 ISVQNAGQTTSDFVALVFSNTTAGPAPYPYKSLASYTRLHSVAAGQTVTASLNMTLGV-- 777

Query: 773 CEQLSIANKHGRRILPLGNHVLMV 796
              L+  +  G +IL  G + L++
Sbjct: 778 ---LARRDDQGNQILYPGTYNLLL 798


>gi|167519969|ref|XP_001744324.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777410|gb|EDQ91027.1| predicted protein [Monosiga brevicollis MX1]
          Length = 721

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 244/727 (33%), Positives = 361/727 (49%), Gaps = 83/727 (11%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLHG 105
           PFC+ SL    RA  L   LTL E  QQL+         A  +PRLG+  Y + +E LHG
Sbjct: 43  PFCDLSLDFRDRAWDLAQRLTLDELAQQLNTYSFTPQAYAPGVPRLGLRNYSYHAEGLHG 102

Query: 106 IASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ------ 159
           I  +   VN+  T+     +PQV   AA+ N SL   + + +  E RA+ N  Q      
Sbjct: 103 I-RDANVVNYPATL-----YPQVTAMAATANASLIHEMSTIMGTELRAVNNRAQELGEIF 156

Query: 160 ---AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
                L+ + P +NI RD RWGR QE+  EDP +   YAV FV                 
Sbjct: 157 GRGGALSIYGPTMNIIRDGRWGRSQESVSEDPWLNGLYAVNFVLGL-------------- 202

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGN-FSRYSFNAMITEQDTEDTF 275
                       E+ +    L  +  CKHL AY  E + N  +R+SFNA+I E D  DT+
Sbjct: 203 ------------EQRNSSKYLQAATSCKHLFAYSFEGYNNTLTRHSFNAVIDELDIHDTY 250

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIF 334
            P FR+C+E G    IMCSYN VNG+PAC RGD+   + R  WGF+G I SDCDAVA I+
Sbjct: 251 LPAFRACVELGHVQQIMCSYNSVNGIPACARGDVQNDRVRKAWGFEGLIVSDCDAVADIY 310

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
              NYT+T ED+    L+ G D++CG    +H  SA+ +       + +++  +  ++  
Sbjct: 311 NTHNYTRTPEDAVTVALQGGCDLDCGDFYSQHLASAVQQNLTTLAALQQSMTRVLEMRFL 370

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG F+ D     Y +LG + + T   +  +L A+R+ +VLL+N  K LP+  +A   +A+
Sbjct: 371 LGEFDPD-TSVPYRQLGREAIDTPFARDSSLRASRESVVLLENRIKLLPVTLSADIKVAL 429

Query: 455 IGPLVNNIS-QMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS--GCHDVPCNSDAGFHEAV 511
           IGP VN  +  MGG     P    +  +G +A +  TH  S  GC ++         +AV
Sbjct: 430 IGPYVNLTTIMMGGKLDYTPSFITTYFQGFQA-IGITHLTSSPGC-NITAPLPGALDKAV 487

Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV-ARTSKRPVILVLTGGGP 570
           +IA +AD V++  GL    E E  DR +L LP  Q  L  ++ A      +++VL  GGP
Sbjct: 488 QIATQADLVVLTLGLSSDIEHEGGDRETLGLPTPQQDLYDAISAAIPSSKLVVVLVNGGP 547

Query: 571 LDVS-FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMN 628
           + V       ++  +I+   Y G++   ALAE IFG  NP G LP T +  + T  VP  
Sbjct: 548 VSVDRIKYGIARTPTIIEAFYGGQSAGTALAETIFGQNNPSGTLPYTVFFSNITAHVPFT 607

Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
           DM++R D++  +PGR++RF+    ++ FGHGLSY+ +S                 A  D+
Sbjct: 608 DMHLRPDAATGFPGRTHRFFDAPVMWPFGHGLSYSTFSL----------------AWQDE 651

Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
            +   T          + T  T +   + ++VTN G + G   + L+  VP  +   P +
Sbjct: 652 TVPSIT--------TGDFTQPTLMHQLLSVNVTNHGPLPGRRALHLYVTVPVTNVSVPLR 703

Query: 749 QLIGFDR 755
            L+G  +
Sbjct: 704 NLVGLQK 710


>gi|150019484|ref|YP_001311738.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
           8052]
 gi|149905949|gb|ABR36782.1| glycoside hydrolase, family 3 domain protein [Clostridium
           beijerinckii NCIMB 8052]
          Length = 709

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 244/761 (32%), Positives = 369/761 (48%), Gaps = 117/761 (15%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +AK L+  +TL+EK +QL+  +SA+ RL +P Y WW+E LHG+A  G          + T
Sbjct: 15  KAKELVGKMTLEEKAEQLTYKSSAVKRLNVPRYNWWNEGLHGVARAG----------TAT 64

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +  AA F+  L + I   ++ E RA YN            G+TFW+PN+NIFRDP
Sbjct: 65  VFPQAIGLAAMFDDELLNYIAKVISTEGRAKYNENSKKDDRDIYKGITFWSPNVNIFRDP 124

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP + S   V FVK  QGE                            G 
Sbjct: 125 RWGRGHETYGEDPYLTSRLGVAFVKGLQGE----------------------------GK 156

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L  +AC KH   +   +     R+ F+A+++++D  +T+ P F +C+++G    +M +Y
Sbjct: 157 YLKAAACAKHFAVHSGPEG---LRHEFDAVVSKKDLYETYLPAFEACVKEGDVEAVMGAY 213

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N+ NG P C    L +   R +W FKG++ SDC A+A    +   T T  +SAA  +K G
Sbjct: 214 NRTNGEPCCGSKTLLRDILRGKWNFKGHVVSDCWAIADFHLHHRVTSTATESAALAMKNG 273

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            D+NCG   L+    A  +G V E+DI  A   L + ++RLG+F+    + +Y K+  + 
Sbjct: 274 CDLNCGNVYLQ-LLLAYKEGLVTEEDITTAAERLMATRIRLGMFD---EECEYNKIPYEL 329

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
               EH +L+L AAR  +VLLKN+   LPLNKN + S+A+IGP  ++   + G Y+G   
Sbjct: 330 NDCKEHNELSLKAARNSMVLLKNN-GILPLNKNNLKSIAVIGPNADSQIMLKGNYSGTAS 388

Query: 475 SPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIVVAG 525
              ++L G+   V    + +Y+ GCH      +     +    EA+ IA+++D  I+  G
Sbjct: 389 RYITVLEGIHEAVGEDVRVYYSEGCHLFRDRVEELAEPNDRLKEAISIAERSDVAILCLG 448

Query: 526 LDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
           LD T E E           D+ SL LPG+Q  L+  +  T   PVILV+  G  L  +F 
Sbjct: 449 LDSTIEGEQGDAGNSEGAGDKASLNLPGRQQELLEKIIETGT-PVILVIGAGSAL--TFN 505

Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
            A+ + S+IL   YPG  G +A+A++IFG  +P G+LP+T+Y  +       D +M+   
Sbjct: 506 NAEDKCSAILDAWYPGSRGGRAVADLIFGKCSPSGKLPITFYRNTKDLPEFIDYSMK--- 562

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
                 R+YR+ +   +Y FG+GL+Y+                                 
Sbjct: 563 -----DRTYRYMSCESLYPFGYGLTYSTV------------------------------- 586

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
           +L  +H+ +V S       V + +TN G+ D   V+  + +  +         L GF RV
Sbjct: 587 KLSELHVPDVKSDFE-DVEVSVKITNTGNFDIEEVIQCYIKDLESKYAVRNHSLAGFKRV 645

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
                 SK     +       + N  G RIL      L VG
Sbjct: 646 RLKIGESKIAKMKIKK-SSFEVVNDDGERILDSKRFKLFVG 685


>gi|295134875|ref|YP_003585551.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294982890|gb|ADF53355.1| beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 735

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 256/807 (31%), Positives = 397/807 (49%), Gaps = 122/807 (15%)

Query: 13  RILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLL 72
           R LF ++ ++ +  ++    +K  ++K +F           F +T LS+  R   LIS L
Sbjct: 5   RQLFAIMLMLFLTAIMHAQQTK--IDKSEFD----------FYDTDLSMDERIDDLISRL 52

Query: 73  TLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
           TL+EK QQ+ + + AI RLGIPAY+WW+E+LHG+  +G            T FPQ +   
Sbjct: 53  TLEEKAQQMLNASPAIERLGIPAYDWWNEALHGLGRSG----------VATVFPQAIGMG 102

Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFWAPNINIFRDPRWGRGQETP 184
           A+F+  L   + +A++ EARA +N            GLTFW PN+NIFRDPRWGRGQET 
Sbjct: 103 ATFDDDLILKVSTAISDEARANFNNAVKHGYHRKYGGLTFWTPNVNIFRDPRWGRGQETY 162

Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
           GEDP + S     FVK  QG+N K                            L  +A  K
Sbjct: 163 GEDPYLTSKLGEAFVKGLQGDNDKY---------------------------LKTAAAAK 195

Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
           H   Y +       R+ FNA ++E+D  +T+ P F++ ++      IMC+YN  NG P C
Sbjct: 196 H---YAVHSGPEKLRHEFNADVSEKDLWETYLPAFKTLVD-ANVETIMCAYNSTNGEPCC 251

Query: 305 LRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM 363
               L     R++WGF G++ SDC A+       +  ++ E +AA  ++ G+++NCG   
Sbjct: 252 ANNRLINDILRDKWGFNGHVVSDCWALQDFVSGHDIVESPEAAAALAVEVGIELNCGDTY 311

Query: 364 LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKK 422
                 A++ G V E+ +D+ L  L   + +LGLF  DP +   Y K+G + + + EH+ 
Sbjct: 312 -NFLAKAVEDGLVSEELVDKRLHKLLETRFKLGLF--DPEESNPYNKIGVEVMNSDEHRA 368

Query: 423 LALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG 482
           LA + AR+ IVLLKND   LPL KN +S   I GP   NI  + G Y G+     ++L G
Sbjct: 369 LARETARKSIVLLKNDG-VLPL-KNNLSKYFITGPNATNIEVLLGNYHGVNPDMVTVLEG 426

Query: 483 LEAYV---SKTHYASGCH-DVPCNSDAGFHEAVRIAKKADFVIV-VAGLDLTQETED--- 534
           +   +   S+  Y  G   ++P  +   +         A FV++ ++GL   +E E    
Sbjct: 427 IAKAIKPESQLQYRMGTRLNLPNENPQDWASPNAGNSDATFVVMGISGLLEGEEGESIAS 486

Query: 535 ---RDRVSLLLPGQQMSLVTSVARTSK-RPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
               DR+   LP  Q+  +  V+  ++ RPV+ ++TGG P++++  E      ++L + Y
Sbjct: 487 PTFGDRMDYNLPQNQIDYLQKVSEAAEDRPVVAIVTGGSPMNLT--EVHKLADAVLLVWY 544

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
           PGE G  A+A+IIFG  +P GRLP+T+        PM   ++ A       GR+Y++   
Sbjct: 545 PGEEGGNAVADIIFGKNSPSGRLPITF--------PMTIEDLPAYEDYTMEGRTYKYMDV 596

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
             +Y FG+GLSYT++ Y      SE+ +S       +                       
Sbjct: 597 VPMYPFGYGLSYTDFEY------SEIKLSKDKIKKKES---------------------- 628

Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
                 +ISVTN GD +   VV ++ +  K S   P  +L+ F  +H     SKE++F +
Sbjct: 629 ---VEARISVTNTGDFEADEVVQVYLKDVKASSRVPNFELVAFKNIHLKRGESKELTFEI 685

Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVG 797
            P E LS  + +G+  L  G   + +G
Sbjct: 686 TP-EMLSFIDDNGKEKLEKGAFEIYIG 711


>gi|373952439|ref|ZP_09612399.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373889039|gb|EHQ24936.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 721

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 254/808 (31%), Positives = 384/808 (47%), Gaps = 133/808 (16%)

Query: 14  ILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLT 73
           I+F VL  + ++   AF    P+   PD                   +STR + LIS LT
Sbjct: 8   IIFAVLTSLGLIKT-AFCQQIPIYRNPD-----------------KKLSTRVQDLISRLT 49

Query: 74  LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAA 133
           L EK+  L   + A+PRL IPAY WW+E LHG+A  G            T FPQ +  AA
Sbjct: 50  LAEKVSLLGYRSQAVPRLNIPAYNWWNEGLHGVARAG----------EATIFPQAIAMAA 99

Query: 134 SFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPRWGRGQETPG 185
           +F+ +L   + + V+ EARA YNL  A        GLTFW+PNINIFRDPRWGRGQET G
Sbjct: 100 TFDDNLVKQVANVVSTEARAKYNLSTAMGRHLQYMGLTFWSPNINIFRDPRWGRGQETYG 159

Query: 186 EDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELML--SACC 243
           EDP + S     +V   QG                              D L L  SA  
Sbjct: 160 EDPFLTSKMGNAYVHGLQGT-----------------------------DPLHLKTSATA 190

Query: 244 KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA 303
           KH +A+   +     R  F+A++ E+D  DT+   F+S ++ G  S IM +YN+VNGVP 
Sbjct: 191 KHFVAHSGPEG---ERDYFDALVDEKDLRDTYLYAFKSLVDGGVES-IMTAYNRVNGVPN 246

Query: 304 CLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC 362
            +   L       EWGFKG++ +DC A+  +++         + AA  +KAG+D++C + 
Sbjct: 247 SINKTLVNDIVIKEWGFKGHVVTDCGALDDVYKTHKVLPNRMEVAAAAIKAGVDLDCSSI 306

Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKK 422
                 +AI+   + EK +D AL  + S Q +LG F+  P    +   G D +    H  
Sbjct: 307 FQTDIINAINNKLLTEKQVDAALAAVLSTQFKLGFFDA-PSSSPFYSFGADSIHNDSHVM 365

Query: 423 LALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG 482
           LA   A++ +VLLKNDK+ LPL     SS+ ++GP   ++  +   Y G+     + + G
Sbjct: 366 LARQMAQKSMVLLKNDKQILPLKMQNYSSIMVVGPNAASLDALVASYHGVSSKAVNFVEG 425

Query: 483 LEAYVSKTHYASGCHDVPCN-SDAGFHEAVRIAKKADFVIVVAGLDLTQETE-------- 533
           + A V K       +D+  +  D      +  A  AD  + V GL    E E        
Sbjct: 426 ITAAVDKGTRVE--YDLGADYRDTTHFGGIWGAGNADVTVAVIGLTPVLEGEAGDAFLSQ 483

Query: 534 -DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
              D+  L LP   ++ + ++ ++ K+P+I V+T G  +D++     +    + W  YPG
Sbjct: 484 TGGDKKDLSLPAGDIAFMKALRKSVKKPIIAVVTSGSDVDIAAIAPYADAVILAW--YPG 541

Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGT 651
           E G  ALA+I+FG  +P G LP+T+Y  S   +P  N+ +M+        GR+YR++ G 
Sbjct: 542 EQGGNALADILFGKISPSGHLPLTFY-NSVNDLPAYNNYSMK--------GRTYRYFAGA 592

Query: 652 QVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTS 711
             Y FG GLSYT ++Y++   P          + S K+ +Q                   
Sbjct: 593 VQYPFGFGLSYTTFNYQWQQQPKT--------SYSAKDTIQ------------------- 625

Query: 712 LRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
               + + V N G++    VV  +   P +++  P K+L GF R+ T+ KGS  ++    
Sbjct: 626 ----LSVVVKNTGNISADEVVQAYIGYPTLNR-MPLKELKGFKRI-TLNKGSTSLASISI 679

Query: 772 PCEQLSIAN--KHGRRILPLGNHVLMVG 797
           P  +L   N  KH   + P GN+ + +G
Sbjct: 680 PVTELQKWNSSKHQFELYP-GNYTVYLG 706


>gi|116197206|ref|XP_001224415.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
 gi|88181114|gb|EAQ88582.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
          Length = 735

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 249/735 (33%), Positives = 358/735 (48%), Gaps = 82/735 (11%)

Query: 82  SDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWS 141
           + N+  + RLG+ AY+WW+E+LHG+A N  G+ + G  S+ T FPQ + S+A+F+  L  
Sbjct: 42  ASNSPGVSRLGLSAYQWWNEALHGVAHNR-GITWGGQFSAATQFPQAITSSAAFDDHLIE 100

Query: 142 NIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKS 201
            IG  ++ EARA  N G+A L FW PN+N FRDPRWGRG ETPGED      +A  FV+ 
Sbjct: 101 RIGVIISTEARAFANNGRAHLDFWTPNVNPFRDPRWGRGHETPGEDAFRNKKWAEAFVQG 160

Query: 202 FQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYS 261
            QG                        E + R     + A CKH  AYDLE  G+ +R++
Sbjct: 161 MQGT-----------------------ESTHR-----VIATCKHYAAYDLENSGSTTRFN 192

Query: 262 FNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFK 320
           F+A ++ QD  + + PPF+ C    K   IMCSYN VNGVPAC    L     R  W + 
Sbjct: 193 FDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNGVPACASPYLMDTILRKHWNWT 252

Query: 321 G---YITSDCDAVATIFEY---QNYTKTHEDSAAGVLKAGMDINCGTC--MLRHTQSAID 372
               Y+ SDCDAV  +        Y  ++  +    L+AG D  C           SA +
Sbjct: 253 DQNQYVVSDCDAVYYLGNANGGHRYKSSYAAAIGASLEAGCDNMCWATGGTTPDPASAFN 312

Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
             +  +  +D+A+L      ++ G F+G      Y  L   DV T   +  AL AA +GI
Sbjct: 313 SRQFTQATLDKAMLRQMQGLVKAGYFDGP--NSLYRNLTAADVNTQVARDTALKAAEEGI 370

Query: 433 VLLKNDKKFLPLNKNAVSS-LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH 491
           VLLKND   LPL     ++ +A+IG   N   +M GGY+G P      +    +     +
Sbjct: 371 VLLKND-NILPLTLGGSNTQVAMIGFWANAADKMLGGYSGSPPFSHDPVTAARSMGITVN 429

Query: 492 YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
           Y +G       ++A    AV  A+K+  VI   G+D T E E +DR S+  P  Q++++ 
Sbjct: 430 YVNGPL---TQTNADTSAAVNAAQKSSVVIFFGGIDNTVEKESQDRTSIAWPSGQLTMIQ 486

Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
            +A+T K PVI+V  G    D         + +ILW GYPG+ G  A+  +I G  +P G
Sbjct: 487 RLAQTGK-PVIVVRMGTHVDDTPLLSI-PNVKAILWAGYPGQDGGTAVMNLITGLASPAG 544

Query: 612 RLPMTWYPESFT-KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFL 670
           RLP+T YP S+T + P  +M +R  SS  YPGR+YR+Y    V+ FGHGL YTN+S   L
Sbjct: 545 RLPVTVYPSSYTNQAPYTNMALRPSSS--YPGRTYRWYK-DPVFPFGHGLHYTNFSVAPL 601

Query: 671 SAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF-------HVQISVTNA 723
             P+  +I+                        D + SC  + +        V +SVTN 
Sbjct: 602 DFPATFSIA------------------------DLLASCKGVTYLELCPFPSVSVSVTNT 637

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G     +VV+ F          P K L  + RV  V  G  + +      E L+  +  G
Sbjct: 638 GSRASDYVVLGFLAGDFGPTPRPIKSLATYKRVFDVQPGKTQSAELDWKLESLARVDGKG 697

Query: 784 RRILPLGNHVLMVGE 798
            R+L  G + L++ +
Sbjct: 698 NRVLYPGTYTLLLDQ 712


>gi|291240563|ref|XP_002740191.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 747

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 252/748 (33%), Positives = 358/748 (47%), Gaps = 109/748 (14%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLH 104
           +PF NTSL  S R   L+  LTL+E + Q+S         A  I RLGI  Y W +E LH
Sbjct: 25  FPFRNTSLPWSERVDDLVGRLTLEEIVLQMSRGGTGSNGPAPPIDRLGIGPYSWNTECLH 84

Query: 105 GIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL------- 157
           G  + GP           TSFPQ    AA+F+  L   I +A A E RA YN        
Sbjct: 85  GDVAAGPA----------TSFPQAFGLAATFDAVLIEQIANATAYEVRAKYNNYAKHKEY 134

Query: 158 -GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
               GL+ ++P INI R P WGR QET GEDP +    A  +V   QG + +        
Sbjct: 135 GDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASYVNGLQGNHPRY------- 187

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                               +  +A CKH  AY   +    SR +F+A ++++D   TF 
Sbjct: 188 --------------------VTANAGCKHFDAYAGPEDIPSSRSTFDAKVSDRDLRMTFL 227

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
           P F  CI+ G  S +MCSYN +NGVPAC    L     R EW F GY+ SD  AV  +++
Sbjct: 228 PAFHECIQAGTHS-LMCSYNSINGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYD 286

Query: 336 YQNYTKTHEDSAAGVLKAGMDI----NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
             +YTK   D+A   + +G+++    N    ++  T  A+ +G V  K +   +  LF  
Sbjct: 287 AHHYTKDMLDTAIACVNSGLNLELSSNLEDNVMMQTTKAVKQGNVTMKTVKARVSPLFYT 346

Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
           ++RLG F+  P    Y KL    + + EH++L+L AA +  VLLKN+ +FLPL K  +  
Sbjct: 347 RMRLGEFD-PPEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENRFLPL-KEKIDK 404

Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLR-GLEAYVSKTHYASGCHDVPCNS-DAGFHE 509
           LA++GPL +N+  + G Y+  P +     R GL      T YASGC +  C   D+G  +
Sbjct: 405 LAVVGPLADNVDALYGDYSATPNNYTVTPRNGLARLAGNTSYASGCDNPKCRKYDSG--Q 462

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
                  AD V+V  G     E+E  DR  L LPG+Q+SL+    +   +PVIL+L   G
Sbjct: 463 VKSAVSGADMVVVCVGTGTDIESEGNDRHELALPGKQLSLLQDAVKFGTKPVILLLFNAG 522

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG---DFNPGGRLPMTWYPESFTKV- 625
           PLDVS+A  +  + +I+   +P +A   AL  +      + NP GRLPMTW P S  +V 
Sbjct: 523 PLDVSWAVENPAVQTIVACFFPAQATGDALYRMFMNTSPESNPAGRLPMTW-PRSMEQVP 581

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
           PM D  M+        GR+YR+     ++ FG GLSYT + Y   SA   +         
Sbjct: 582 PMTDYTMK--------GRTYRYSDADPLFPFGFGLSYTLFKYYNTSASPTV--------- 624

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
                               + SC ++   + ++VTN GD  G  V+ ++      S   
Sbjct: 625 --------------------IKSCDTVT--IPLTVTNVGDFPGDEVMQVYISWSNASVTV 662

Query: 746 PEKQLIGFDRVHTV-AKGSKEISFGVDP 772
           P+ QL+GF RV  +    S  + F V P
Sbjct: 663 PKLQLVGFRRVREIEPSASATVHFAVLP 690


>gi|7671419|emb|CAB89360.1| beta-glucosidase-like protein [Arabidopsis thaliana]
 gi|9758998|dbj|BAB09525.1| unnamed protein product [Arabidopsis thaliana]
          Length = 411

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 190/416 (45%), Positives = 263/416 (63%), Gaps = 10/416 (2%)

Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
           +RLG F+G+P+   YG LGP DVCT E+++LA++ ARQGIVLLKN    LPL+ +A+ +L
Sbjct: 1   MRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGSLPLSPSAIKTL 60

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
           A+IGP  N    M G Y G+ C   + L+GLE  V  T Y  GC +V C ++A    A  
Sbjct: 61  AVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTC-TEADLDSAKT 119

Query: 513 IAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
           +A  AD  ++V G D T E E  DR+ L LPG+Q  LVT VA+ ++ PV+LV+  GG  D
Sbjct: 120 LAASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVVLVIMSGGGFD 179

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMN 631
           ++FA+ D +I+SI+W+GYPGEAG  A+A++IFG  NP G+LPMTWYP+S+  KVPM +MN
Sbjct: 180 ITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQSYVEKVPMTNMN 239

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
           MR D S  Y GR+YRFY G  VY FG GLSYTN+S++ + AP  +    SL     ++  
Sbjct: 240 MRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFV----SLNLDESQSCR 295

Query: 692 QQTGSRLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
                 LD +  H ++     S  F VQ+ V N GD +G+  V LF   P+V  G+P KQ
Sbjct: 296 SPECQSLDAIGPHCEKAVGERS-DFEVQLKVRNVGDREGTETVFLFTTPPEV-HGSPRKQ 353

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           L+GF+++    K    + F VD C+ L + ++ G+R L LG+H+L VG L+HS  I
Sbjct: 354 LLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGSLKHSFNI 409


>gi|302669556|ref|YP_003829516.1| beta-xylosidase [Butyrivibrio proteoclasticus B316]
 gi|302394029|gb|ADL32934.1| beta-xylosidase Xyl3A [Butyrivibrio proteoclasticus B316]
          Length = 709

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 245/764 (32%), Positives = 380/764 (49%), Gaps = 118/764 (15%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RAK L++ +T++EK  QL  +A AI RLGIPAY WW+E+LHG+A  G          + T
Sbjct: 9   RAKELVAKMTVEEKASQLRYDAPAIDRLGIPAYNWWNEALHGVARAG----------TAT 58

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +  AA+F+  L S +G  +A EARA YN            GLTFWAPN+NIFRDP
Sbjct: 59  MFPQAIGLAAAFDEELMSEVGEVIAEEARAKYNEQSKREDRDIYKGLTFWAPNVNIFRDP 118

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP + S  AV FVK+ QG+                            G+
Sbjct: 119 RWGRGHETYGEDPFLTSRLAVPFVKAMQGD----------------------------GE 150

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            +  +AC KH   +   +     R+ F+A  +++D E+T+ P F + +++ +   +M +Y
Sbjct: 151 YMKAAACAKHFAVHSGPEG---ERHFFDAKASKKDLEETYLPAFEALVKEAEVEAVMGAY 207

Query: 296 NQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N+ NG P C    L     R +WGF+G+  SDC A+    E    T + E+SA   L+ G
Sbjct: 208 NRTNGEPCCANKPLMVDTLRGKWGFQGHFVSDCWAIKDFHENHKVTSSPEESAKLALEMG 267

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            D+NCG C  +   + +  G + EK I  +   LF+ +  LG+F+    K ++ ++  + 
Sbjct: 268 CDLNCG-CTYQSIMNGVRAGLIDEKLITESCERLFTTRFLLGMFD----KTEFDEIPYEK 322

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           V   EH  +A  AAR+ +VLLKND   LPLNK+++ ++ ++GP  N+   + G Y G   
Sbjct: 323 VECKEHLAVAKRAARESVVLLKNDG-LLPLNKDSIKTIGVVGPNANSRLSLIGNYHGTSS 381

Query: 475 SPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGF---------HEAVRIAKKADFVIV 522
              ++L G++  V    +  Y+ GC D+  N+ +            EA  +A  +D V+V
Sbjct: 382 RYITVLEGIQDKVGDDVRVLYSEGC-DIFQNNISNLADPNLPDRLSEAQAVADHSDVVVV 440

Query: 523 VAGLDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           V GLD   E E+ D         +++L LP  Q  L+ +V    K P I++   G  +D+
Sbjct: 441 VVGLDENLEGEEGDAGNQFASGDKINLNLPLSQRQLLNAVLDCGK-PTIVIDMAGSAIDL 499

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
           S  +A  + +++L   YPG  G   +A+I+FGD +P G+LP+T+Y  +       D +M+
Sbjct: 500 S--KAQDEANAVLQAFYPGARGGADVADILFGDVSPSGKLPVTFYKSADDLPDFKDYSMK 557

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
                    R+Y+++TGT +Y FG+GL+Y +                 +K   D N+   
Sbjct: 558 --------NRTYKYFTGTPLYPFGYGLTYGD---------------CYVKPDYDFNV--- 591

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                 Y   D+V+        + ++V N G +D   VV L+ +       T    L+GF
Sbjct: 592 -----KYADADKVSGA-----EITVTVVNDGKLDTDEVVQLYIKDMDSYFATTNPSLVGF 641

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            RVH  A G   ++  V   +  +  N+ G R +   N  L  G
Sbjct: 642 KRVHVPAGGETRVTLTVSE-KAFTSVNEEGERAVFGKNFRLYAG 684


>gi|365135698|ref|ZP_09343911.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363612160|gb|EHL63713.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 643

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 232/671 (34%), Positives = 356/671 (53%), Gaps = 90/671 (13%)

Query: 61  ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
            + RA++L++ +TL+EK+ Q+  +A AI RLGIPAY WW+E LHG+  +G          
Sbjct: 4   FAQRARALVAQMTLEEKVSQMRYDAPAIERLGIPAYNWWNECLHGVGRSG---------- 53

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQA----GLTFWAPNINIF 172
           + T FPQ +  AASF+ SL  ++  A++ EARA YN     G+     GLTFW+PNIN+F
Sbjct: 54  TATVFPQPIGMAASFDESLLEHVAQAISDEARAKYNQYKTFGETGIYQGLTFWSPNINLF 113

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRG ET GEDP++       F++  Q                          E +
Sbjct: 114 RDPRWGRGHETYGEDPLLTGRMGTAFIRGLQ--------------------------EGE 147

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                 L A  KH  A+   + G   R+SFNA ++ +D  D++   FR CIE  K + +M
Sbjct: 148 DSQYRKLDATVKHFAAHSGPEAG---RHSFNAEVSAEDMADSYLWAFRYCIEHAKPAAVM 204

Query: 293 CSYNQVNGVPAC-----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSA 347
            +YN++NG PAC     L+G L++    EW F GY+ SDC A+  I E  + TK  ++SA
Sbjct: 205 GAYNRINGEPACASSTYLKGVLYE----EWKFDGYVVSDCGAIQDINENHHVTKNEKESA 260

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  +  G  +NCG       ++A++ G + E  +  A+  LF  + RLG+F+ D     Y
Sbjct: 261 ALAVNNGCQLNCGKAY-HWVKAAVEDGLISEDTVTCAVERLFEARFRLGMFDSD---CVY 316

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             +  + +   +H++L    A++ IVLLKN+   LPLN     ++A+IGP  ++ + + G
Sbjct: 317 DSIPMNVIECRKHRELNRKMAQESIVLLKNN-GILPLNPE--KTIAVIGPNADDKTVLLG 373

Query: 468 GYTGIPCSPKSLLRGLEAYV-SKTHYASGCHDVPCN----SDAGFHEAVRIAKKADFVIV 522
            Y G P    +LLRG++     + +YA G   V       ++   HEA+  AK AD V++
Sbjct: 374 NYNGTPSHWTTLLRGIQDQARGEVYYARGSVLVEKEALPWAEKPLHEAIYTAKAADVVVL 433

Query: 523 VAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
             GL    E E+          DR  + LP  Q  L+ ++  T K PV+LV   GG +D+
Sbjct: 434 CLGLSPLLEGEEGDAYNGADSGDRKDISLPDIQQQLLCAILDTEK-PVVLVNVSGGCVDL 492

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
              +AD + ++IL   YPG  G  ALA+I+FG  +P GRLP+T+Y       P  D +M+
Sbjct: 493 R--QADERCAAILQCFYPGAEGGNALADILFGRVSPSGRLPVTFYRTVEDLPPFTDYSMK 550

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ- 692
                   GR+YRF+ G  +Y FGHGL+Y +   ++ + P  + +  + +  +  +IL+ 
Sbjct: 551 --------GRTYRFFDGKPLYPFGHGLTYADIKEQW-TDPYTVRVKNASETATGYSILRY 601

Query: 693 QTGSRLDYVHI 703
           + G+  D+ HI
Sbjct: 602 ENGALADFKHI 612


>gi|402493386|ref|ZP_10840139.1| beta-glucosidase [Aquimarina agarilytica ZC1]
          Length = 734

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 255/773 (32%), Positives = 386/773 (49%), Gaps = 124/773 (16%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           ++ + +T+ S   RAK+L++ LTL+EKI  + D ++ I RL IP Y WW+E LHG+A NG
Sbjct: 38  NFEWFDTNKSFEKRAKALVASLTLEEKISLMVDQSAPIDRLNIPEYNWWNECLHGVARNG 97

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQ----AGL 162
                       T FPQ +  AA+F++ L   +  A++ EARA +N    +G     AGL
Sbjct: 98  ----------RATVFPQAIGLAATFDQDLIFKVADAISTEARAKFNASIAIGNRGKYAGL 147

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           TFW PNINIFRDPRWGRGQET GEDP + S   V FVK  QG + K              
Sbjct: 148 TFWTPNINIFRDPRWGRGQETYGEDPYLTSQIGVNFVKGLQGNHPKY------------- 194

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
                         L  +AC KH   Y +       R+ F+A+ +++D  +T+ P F + 
Sbjct: 195 --------------LKSAACAKH---YAVHSGPEELRHEFDAIASKKDMAETYLPAFEAL 237

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++ K   +M +YN+VNG  AC    L +K  ++ WGFKGYI SDC A++ + ++   T+
Sbjct: 238 VKEAKVEGVMGAYNRVNGEGACASPYLLEKLLKDTWGFKGYIVSDCWALSDLHKFHKVTQ 297

Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
           T E+SAA  L  G+++NCG  +      AI +G   EK +D  L +    + +LG F  D
Sbjct: 298 TAEESAAAALNVGLNVNCGN-VYPALDGAIKQGLTSEKQLDNVLQHQLLTRFKLGFF--D 354

Query: 402 PRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
           P     Y K+  D V +  H+ +AL+AA++ IVLLKN+   LPL K+ + S+ + GP   
Sbjct: 355 PSNNNPYNKITTDVVDSEAHRAIALEAAQKSIVLLKNNNNLLPLKKD-LKSVYVAGPNAA 413

Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
               + G Y G+    +++L G+ + VS     +Y  G      N +        I+ +A
Sbjct: 414 REDVLLGNYYGVTSKTQTILDGIVSKVSAGTSINYKQGLLPFQKNVNPIDWSTGEIS-RA 472

Query: 518 DFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSV-ARTSKRPVILVLTG 567
           D  I+V GL    E E+          DRV + LP  Q+  +  + A+ +  P++LVLTG
Sbjct: 473 DVGIIVMGLSGNYEGEEGEAIASESKGDRVDIRLPQNQIDYIKKIKAKNTGNPLVLVLTG 532

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-P 626
           G P  ++  E    + +I++  YPGE G +A+A+I+FGD  P G+LP+T +P+S   + P
Sbjct: 533 GSP--IAMPEVYDLVDAIVFAWYPGEEGGQAVADILFGDVVPSGKLPIT-FPKSVDDLPP 589

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
            ND  M+        GR+Y++ T T  + FG GLSYT++ Y  L    E           
Sbjct: 590 YNDYAMK--------GRTYKYMTKTPQFPFGFGLSYTSFKYDNLKVYKE----------- 630

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                                           S+TN G+VD   V  ++   P   +G P
Sbjct: 631 ----------------------------KASFSITNNGNVDAEEVAQVYVSSPNAGKGDP 662

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA--NKHGRRILPLGNHVLMVG 797
              L+GF RV   A  +K++S    P  + +    +  G+ I   G + + VG
Sbjct: 663 LNTLVGFTRVSLKAGATKQVSI---PFSKKAFVQFDSDGKEITRKGTYTIHVG 712


>gi|291518645|emb|CBK73866.1| Beta-glucosidase-related glycosidases [Butyrivibrio fibrisolvens
           16/4]
          Length = 713

 Score =  370 bits (951), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 238/761 (31%), Positives = 369/761 (48%), Gaps = 114/761 (14%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RAK L+S +T++EK  Q+  +A AI RLGIP Y WW+E+LHG+A  G            T
Sbjct: 8   RAKELVSQMTIEEKCSQMLHHAEAIDRLGIPKYCWWNEALHGVARAG----------DAT 57

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +   A+F+  L   +    + E RA YN            GLT+WAPN+NIFRDP
Sbjct: 58  VFPQAIGLGATFDEELVEKVADVTSTEGRAKYNEFTKHGDRDIYKGLTYWAPNVNIFRDP 117

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP +     + +V+  QG++                            D
Sbjct: 118 RWGRGHETYGEDPYLTGQLGMAYVRGLQGDDL---------------------------D 150

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
               +AC KH   +   +     R+ F+A + +QD  DT+   F+  ++  K   +M +Y
Sbjct: 151 NPKSAACAKHFAVHSGPE---AERHHFDAKVNDQDLYDTYLYAFKRLVKDAKVEAVMGAY 207

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N+VNG PAC    L +   R +WGF+G++ SDC A+    E    T    +SAA  +  G
Sbjct: 208 NRVNGEPACGSKRLLKDILRGDWGFEGHVVSDCWAIRDFHENHKVTGCEVESAALAVNNG 267

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            D+NCG C+      A     V E+ I  ++  L  ++LRLG      R+ KY  +  + 
Sbjct: 268 CDLNCG-CVYEKLLYAYKANLVTEETITESVERLIELRLRLGTL--PERRSKYDDIPYEV 324

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           V   EHK+LA++AA++ +VLLKND   LPL K+ + ++ +IGP  N+   + G Y GI  
Sbjct: 325 VECKEHKELAIEAAKRSMVLLKNDG-LLPLKKDEIKTIGVIGPNSNSRMALVGNYEGISS 383

Query: 475 SPKSLLRGLEAYVS---KTHYASGC------HDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
              ++L G++ YV    +  ++ G         V   +   F EA+ +A+ +D V++  G
Sbjct: 384 EYITVLEGIQQYVGDDVRVFHSDGTPLWKDRMHVLSEARDTFAEAMAVAEHSDVVVLAMG 443

Query: 526 LDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
           LD T E E+ D         +  L LPG Q  L+  +    K PV+L++  G  +D+S+A
Sbjct: 444 LDSTIEGEEGDAGNEFGSGDKKGLKLPGLQQELLEKITAIGK-PVVLLVLAGSAMDLSWA 502

Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
             +  +++I+   YPG  G KA+A+++FG+ +P G+LP+T+Y       P  D +M    
Sbjct: 503 --NENVNAIMHCWYPGARGGKAIAQVLFGEDSPSGKLPLTFYKSDADLPPFEDYSME--- 557

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
                GR+YR++ GT +Y FG+GLSY++  Y                AG           
Sbjct: 558 -----GRTYRYFKGTPLYPFGYGLSYSDIQYS--------------NAG----------- 587

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
                 ID+       +F V+++V NAGD      V ++ +  + S       L    +V
Sbjct: 588 ------IDKTEGAIGDKFTVKVTVKNAGDYKAHETVQVYVKDVEASTRVANCSLRKIAKV 641

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             +   SKE+S  +      +I ++ G  I+  G   + VG
Sbjct: 642 ELLPGESKEVSLEL-SARDFAIIDEKGHCIVEPGKFKVFVG 681


>gi|150019782|ref|YP_001312036.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
           8052]
 gi|149906247|gb|ABR37080.1| glycoside hydrolase, family 3 domain protein [Clostridium
           beijerinckii NCIMB 8052]
          Length = 709

 Score =  370 bits (950), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 242/760 (31%), Positives = 370/760 (48%), Gaps = 117/760 (15%)

Query: 65  AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
           AK L+S +TLQEK +QL+  + AI  L +P Y WW+E LHG+A  G          + T 
Sbjct: 16  AKELVSKMTLQEKAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAG----------TATV 65

Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPR 176
           FPQ +  AA F+      + + +A E RA YN            GLT+W+PNINIFRDPR
Sbjct: 66  FPQAIGLAAIFDDEFLGKVANIIATEGRAKYNEYSKKDDRGIYKGLTYWSPNINIFRDPR 125

Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
           WGRG ET GEDP + S   V F+K  QGE                            G  
Sbjct: 126 WGRGHETYGEDPYLTSRLGVAFIKGLQGE----------------------------GKY 157

Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
           L L+AC KH   +   +     R+ FNA++ ++D  +T+ P F +C+++     +M +YN
Sbjct: 158 LKLAACAKHFAVHSGPEG---LRHEFNAVVNKKDLYETYLPAFEACVKEANVESVMGAYN 214

Query: 297 QVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
           + NG P C    L +   R +WGFKG++ SDC A+A    +   T T  +S A  ++ G 
Sbjct: 215 RTNGEPCCGSKTLLKDILRGKWGFKGHVVSDCWALADFHLHHMVTSTATESVALAIENGC 274

Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
           D+NCG  M  +   A  +G V E+ I  A   L + + +LG+F+    + +Y K+  +  
Sbjct: 275 DLNCGN-MYLNLLLAYKEGLVTEEQITTAAERLMTTRFKLGMFD---EECEYNKIPYEVN 330

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
            + EH ++AL A+R+ +VLLKN+   LPL+K+ + S+A+IGP  N+   + G Y+G    
Sbjct: 331 DSREHNEVALIASRKSMVLLKNNGT-LPLDKSNLKSIAVIGPNANSEIMLKGNYSGTASK 389

Query: 476 PKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
             ++L G+   V    + +Y+ GCH      +     D    EA+ +A+++D V++  GL
Sbjct: 390 YTTILEGIHDAVGNDVRVYYSEGCHLFKDKVEDLARPDDRLSEAISVAERSDVVVLCLGL 449

Query: 527 DLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D T E E           D+ +L LPG+Q +L+  V    K PVI+VL  G  L ++ AE
Sbjct: 450 DSTIEGEQGDAGNSYGAGDKENLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSALTLNGAE 508

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
              + ++IL   YPG  G  A+A+I+FG  +P G+LP+T+Y ++       D +M+    
Sbjct: 509 --EKCAAILNAWYPGSHGGTAVADILFGKCSPSGKLPVTFYKDTAKLPDFTDYSMK---- 562

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
               GR+YR+     +Y FG+GL+Y+      L  PS               + Q  GS 
Sbjct: 563 ----GRTYRYLGHESLYPFGYGLTYSTVELSNLQVPS---------------VKQGFGS- 602

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
                           F + I + N G+ D   VV  + +  +         L GF RV 
Sbjct: 603 ----------------FDISIEIKNTGEYDIEEVVQCYVKDIESKYAVLNHSLAGFKRVS 646

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
                SK ++  ++  +   + N  G R+L      L VG
Sbjct: 647 LKKGESKIVTIKLNK-KSFEVVNDDGERLLDSKKFKLFVG 685


>gi|333995841|ref|YP_004528454.1| beta-glucosidase [Treponema azotonutricium ZAS-9]
 gi|333737309|gb|AEF83258.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Treponema
           azotonutricium ZAS-9]
          Length = 706

 Score =  370 bits (950), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 257/769 (33%), Positives = 384/769 (49%), Gaps = 123/769 (15%)

Query: 63  TRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSV 122
           +R K +IS +TL+EK+ QLS +A A+   GIP Y WW+E LHG+A  G            
Sbjct: 5   SRIKEMISKMTLEEKVSQLSYDAPAVESAGIPKYNWWNECLHGVARAG----------LA 54

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRD 174
           T FPQ +  AA+F+ +   ++  A++ E RA YN            GLTFW PN+NIFRD
Sbjct: 55  TVFPQAIALAATFDEAFIRSVADAISDEGRAKYNEAVKRGNRSQYYGLTFWTPNVNIFRD 114

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRGQET GEDP +     + F+K  QG                           D  
Sbjct: 115 PRWGRGQETYGEDPYLTGRIGLAFMKGLQG---------------------------DDT 147

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
           + L ++AC KH   Y +       R++F+A+++++D  +T+ P F+  +E G    +M +
Sbjct: 148 EHLKVAACAKH---YAVHSGPEKLRHTFDAVVSKKDLFETYLPAFKLLVENG-VEAVMGA 203

Query: 295 YNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN+  G P      L ++  R  WGFKG++TSDC A+    E    TK+ E+SAA  L A
Sbjct: 204 YNRTLGEPCGGSTYLLKEILRGRWGFKGHVTSDCWAIRDFHENHKVTKSPEESAAMALNA 263

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
           G D+NCG C   +   +  KG V ++ ID AL  L   + +LGLF+  P +  Y  LG D
Sbjct: 264 GCDLNCG-CTYPYLTVSHKKGLVTDETIDTALTRLLRTRFKLGLFD-PPEQDPYRNLGND 321

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            V   +H+ LAL+AA++ IVLLKND   LPL+ +A   L ++GP   NI  +   Y G+ 
Sbjct: 322 IVGCEKHRNLALEAAQKSIVLLKNDSNILPLDDSARKIL-LMGPGAANILTLLANYYGMS 380

Query: 474 CSPKSLLRGL-EAYVSKT----HYASGC--------HDVPCNSDAGFHEA-VRIAKKADF 519
               ++L GL E   +KT     Y  G          +VP  S     EA +    + D 
Sbjct: 381 SRLVTILEGLAEKIKTKTAISFEYRQGSLMYEPNHLSNVPFGSTGVDAEAPIYGLDEIDL 440

Query: 520 VIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           VI V GLD + E E+          DR ++ LP  Q++ +  + +  K+ V+L+LTGG P
Sbjct: 441 VIAVYGLDGSMEGEEGDSIASDANGDRDTIELPSWQLNFLRRIRKAGKK-VVLILTGGSP 499

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMND 629
           +      AD    ++L+  YPGE G  A+A+I+FGD +P G+LP+T +P+S  ++ P +D
Sbjct: 500 IAFPEDLAD----AVLFAWYPGEQGGNAVADILFGDVSPSGKLPIT-FPQSTAQLPPYDD 554

Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
             ++        GR+YR+   T +Y FG GLSYT  S++F S   EL+ S          
Sbjct: 555 YALK--------GRTYRYMKETPLYPFGFGLSYT--SFRFDSV--ELSSS---------- 592

Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
                          ++++  S++  VQ+S  N G  D   VV L+      S+  P   
Sbjct: 593 ---------------KISAGNSVKAKVQVS--NTGKRDAEEVVQLYIAKDNRSEDEPASS 635

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           L GF R+  +A  S  +   + P       N  G  +L  G++ ++  +
Sbjct: 636 LRGFRRLKILAGKSASVEIEL-PASAFETINAEGASVLIPGSYTVIAAD 683


>gi|320161274|ref|YP_004174498.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
 gi|319995127|dbj|BAJ63898.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
          Length = 712

 Score =  369 bits (947), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 246/766 (32%), Positives = 379/766 (49%), Gaps = 108/766 (14%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N    +  R   LIS +TL+EKI Q+ ++ +AIPRLGIPAY++WSE+LHG+A NG   
Sbjct: 8   YLNPDAPLEERVNDLISRMTLEEKISQMCNSCAAIPRLGIPAYDYWSEALHGVARNG--- 64

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN-----LGQA----GLTF 164
                    T FPQ +  AA+++  L   +  A+A EARA ++      G+     GLT 
Sbjct: 65  -------KATVFPQAIGMAATWDTELIERVADAIASEARAKFHETLRKFGKTDIYQGLTM 117

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP +       FV+  QG              ++   L
Sbjct: 118 WSPNINIFRDPRWGRGQETWGEDPYLTGEMGAAFVRGLQG--------------KDPHYL 163

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
           K              +AC KH   Y +       R++FNA++T ++  DT+ P F+  + 
Sbjct: 164 K-------------TAACAKH---YTVHSGPEKERHTFNAIVTRRELFDTYLPAFKKLVT 207

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
           + K   +M +YN+  G P C    L ++  RN+WGFKG++ SDC A+     +   TK  
Sbjct: 208 EAKVEAVMGAYNRTLGEPCCGSPYLLKEILRNQWGFKGHVVSDCGAINDFHLHHQVTKDG 267

Query: 344 EDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
            +SAA  +K G D+ C  T    +   A+++G + E+DID AL N    + +LGLF  DP
Sbjct: 268 AESAALGIKNGCDMACICTYSYENLTEALNRGLITEEDIDHALRNTLRTRFKLGLF--DP 325

Query: 403 R-KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           + K  Y  +    V    H+KLA + A +  VLLKN    LP+ K  V S+ I+GP   N
Sbjct: 326 QEKVPYAHISMSVVGCEAHRKLAYETAVKSAVLLKNHNHILPV-KPDVKSILIVGPNAGN 384

Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
           +  + G Y G+  S  + + GL   + +                       +A  A F +
Sbjct: 385 VHVLLGNYYGLSDSMTTFMEGLVGRLPEGVRMEFMPGSLLTDSKKIKNDWSVASAASFDL 444

Query: 522 VVAGLDLTQETE----------DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
           V+A + L+   E          + DR  + LP  Q   +  +A T  + ++LVLTGG  +
Sbjct: 445 VIAFMGLSPLLEGEEGEAILSDNGDREDIALPKAQQEYIRDLAATGAK-IVLVLTGGSAI 503

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
            ++  E    + +ILW+GYPG+ G +A+A++IFGD +P G+LP+T +P S  ++P     
Sbjct: 504 ALNGIE--DLVEAILWVGYPGQEGGRAIADLIFGDHSPSGKLPIT-FPVSTDQLP----P 556

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
            R  S ++   R+YR+ T + ++ FG GLSYT + YK L     +     L AG      
Sbjct: 557 FREYSMKE---RTYRYMTSSPLFPFGFGLSYTQFEYKNLQLEHPV-----LSAGE----- 603

Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
                              +LR   +++  N G+ +G  VV ++    + S   P ++LI
Sbjct: 604 -------------------ALRGTFELA--NVGEYEGEEVVQVYLSDLEASTIVPLQKLI 642

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            F RV      + ++SF + P E + + +  G ++L  G   L +G
Sbjct: 643 SFQRVRLKPGETVQLSFAIQP-EAMMMIDDEGNQVLEPGKFKLTIG 687


>gi|410648100|ref|ZP_11358515.1| beta-glucosidase [Glaciecola agarilytica NO2]
 gi|410132388|dbj|GAC06914.1| beta-glucosidase [Glaciecola agarilytica NO2]
          Length = 733

 Score =  369 bits (946), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 242/776 (31%), Positives = 379/776 (48%), Gaps = 103/776 (13%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
             P   +  P+ +T L    R   LI  +TL+EK  QL +   AI RLG+P Y++W+E+L
Sbjct: 16  ATPALANDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEAL 75

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---- 159
           HG+A NG            T FPQ +  AA+F++ L     S ++ EARA +N+      
Sbjct: 76  HGVARNG----------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGN 125

Query: 160 ----AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
               +GLTFW PNINIFRDPRWGRGQET GEDP + +      V   QG++ K       
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKY------ 179

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                                L  +A  KH   + +       R+ F+A+ + +D  +T+
Sbjct: 180 ---------------------LKTAAAAKH---FAVHSGPEALRHEFDAIASPKDMYETY 215

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
            P F + + +     +M +YN+VNG PA     L     R++WGF G++ SDC  +A   
Sbjct: 216 FPAFEALVTEANVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFH 275

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           +Y   T    +SAA  +  G D+NCG  +      A++ G V EK ID+ L  + + + +
Sbjct: 276 QYHKVTANAVESAALAINTGTDLNCG-AVYNALPDAVEAGLVDEKTIDKRLSKVLATKFK 334

Query: 395 LGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           LG F  DP+    Y  +  D V +  H ++A + A + IVLL+N    LPL++N + +L 
Sbjct: 335 LGFF--DPKDDNPYNNISADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDRN-IRNLY 391

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEA 510
           + GP  ++   + G Y G+     ++L G+ A VS     +Y  G      N +      
Sbjct: 392 VTGPFASSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTT 451

Query: 511 VRIAKKADFVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPV 561
               +  D +I V GL    E E+          DR+SL LP  Q++ +  + + + +PV
Sbjct: 452 GEAKQMGDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPV 511

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           I+VLT G P  V+  E      +I++  YPG+ G KA+A+I+FG+ +P GRLP+T +P+S
Sbjct: 512 IVVLTAGTP--VNLTEIAELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPIT-FPKS 568

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
             ++P  D     D S Q  GR+YR+ T   +Y FG GLSY    +  ++         +
Sbjct: 569 EAQLPPYD-----DYSMQ--GRTYRYMTQEPMYPFGFGLSYAQVKFDNITL-------GN 614

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
            +A + KN LQ+  +                   V ++VTN G+ +   VV L+ + P  
Sbjct: 615 TQALASKNELQENMT-------------------VTVNVTNTGEREFEEVVQLYLKTPDA 655

Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
               P   L GF R+   A  ++++ F + P + L   N+ G+ +L  G + ++VG
Sbjct: 656 GVSQPLHSLKGFTRIKLAAGQTEQVLFNI-PKKHLYSINEQGKPVLLKGQYSVIVG 710


>gi|410723195|ref|ZP_11362440.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
           Maddingley MBC34-26]
 gi|410603399|gb|EKQ57833.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
           Maddingley MBC34-26]
          Length = 709

 Score =  368 bits (944), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 240/760 (31%), Positives = 365/760 (48%), Gaps = 117/760 (15%)

Query: 65  AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
           AK L+S +TLQE+ +QL+  + AI  L +P Y WW+E LHG+A  G          + T 
Sbjct: 16  AKELVSKMTLQERAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAG----------TATV 65

Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPR 176
           FPQ +  AA F+      I   ++ E RA YN            GLT+W+PN+NIFRDPR
Sbjct: 66  FPQAIGLAAIFDEEFLGEIADIISTEGRAKYNEYSKKDDRGIYKGLTYWSPNVNIFRDPR 125

Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
           WGRG ET GEDP + S   V F+K  QGE                            G  
Sbjct: 126 WGRGHETYGEDPYLTSRLGVAFIKGLQGE----------------------------GKY 157

Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
           L L+AC KH   +   +     R+ FNA++ ++D  +T+ P F +C+++     +M +YN
Sbjct: 158 LKLAACAKHFAVHSGPEG---LRHEFNAVVEKKDLYETYLPAFEACVKEANVESVMGAYN 214

Query: 297 QVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
           + NG P C    L +   R +WGFKG++ SDC A+A    +   T T  +S A  ++ G 
Sbjct: 215 RTNGEPCCGSKTLLKDILRGKWGFKGHVVSDCWALADFHLHHMITSTATESVALAIENGC 274

Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
           D+NCG  M  +   A  +G V E+ I  A   L + + +LG+F+ D    +Y ++  +  
Sbjct: 275 DLNCGN-MYLNLLLAYKEGLVTEEQITTAAERLMTTRFKLGMFDEDC---EYNRIPYEVN 330

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
              EH ++AL A+R+ +VLLKND   LPL+K+++ S+A+IGP  N+   + G Y+G    
Sbjct: 331 DCKEHNEIALIASRKSMVLLKNDGT-LPLDKSSLKSIAVIGPNANSEIMLKGNYSGTASK 389

Query: 476 PKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
             ++L G+   V    + +Y+ GCH      +     D    EA+ +A+++D VI+  GL
Sbjct: 390 YTTILEGIHNAVGDNIRVYYSEGCHLFKDKVEDLAGPDDRLSEAISVAERSDVVILCLGL 449

Query: 527 DLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D T E E  D         + SL LPG+Q +L+  V    K PVI+VL  G  L  +F  
Sbjct: 450 DSTIEGEQGDAGNSYGAGDKESLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSAL--TFNG 506

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
           A+ + ++IL   YPG  G  A+A+I+FG  +P G+LP+T+Y ++       D +M+    
Sbjct: 507 AEEKCAAILNAWYPGSHGGTAVADILFGKCSPSGKLPVTFYKDTANLPEFTDYSMK---- 562

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
               GR+YR+     +Y FG+GL+Y+      L  P    + A  ++             
Sbjct: 563 ----GRTYRYLEHESLYPFGYGLTYSKVELSNLQVP---FVKADFES------------- 602

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
                           F + I + N G+     VV  + +  K         L GF RV 
Sbjct: 603 ----------------FDISIDIRNTGNYGIEEVVQCYVKDLKSKYAVLNHSLAGFKRVS 646

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
                SK ++  +         N  G R+L   +  L VG
Sbjct: 647 LKKGESKTVTIELSK-RSFEAVNNDGERLLDSKSFKLFVG 685


>gi|109897152|ref|YP_660407.1| beta-glucosidase [Pseudoalteromonas atlantica T6c]
 gi|109699433|gb|ABG39353.1| Beta-glucosidase [Pseudoalteromonas atlantica T6c]
          Length = 733

 Score =  368 bits (944), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 243/776 (31%), Positives = 380/776 (48%), Gaps = 103/776 (13%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
             P   + +P+ +T L  + R +SLI  +TL+EK  QL +   AI RLG+P Y++W+E+L
Sbjct: 16  ASPTLANDHPWFDTQLPTNERIESLIDAMTLKEKASQLVNGNVAIERLGLPEYDFWNEAL 75

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---- 159
           HG+A NG            T FPQ +  AA+F++ L     + ++ EARA +N+      
Sbjct: 76  HGVARNG----------RATVFPQAIGMAATFDQDLLLQAATVISDEARAKFNVSSEIGN 125

Query: 160 ----AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
               +GLTFW PNINIFRDPRWGRGQET GEDP + +      V   QG++ K       
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKY------ 179

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                                L  +A  KH   + +       R+ F+A+ +E+D  +T+
Sbjct: 180 ---------------------LKTAAAAKH---FAVHSGPEALRHEFDAIASEKDMYETY 215

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
            P F + + +     +M +YN+VNG PA     L     R++WGF G+I SDC  +A   
Sbjct: 216 FPAFEALVTEADVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHIVSDCWGLADFH 275

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           EY   T    +SAA  +  G D+NCG+ +      A++ G V EK ID  L  + + + +
Sbjct: 276 EYHKVTANAVESAALAINTGTDLNCGS-VYTALPDAVEAGLVDEKTIDTRLHKVLATKFK 334

Query: 395 LGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           LG F  DP+    Y  +  D V +  H  +A + A + IVLL+N+ + LPL+KN + ++ 
Sbjct: 335 LGFF--DPKDDNPYNSISADVVNSDAHADVAYEMAVKSIVLLQNENQVLPLDKN-IRNVY 391

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEA 510
           + GP  ++   + G Y G+     ++L G+ A VS     +Y  G      N +      
Sbjct: 392 VTGPFASSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTT 451

Query: 511 VRIAKKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPV 561
               +  D +I V GL    E E+          DR+SL LP  Q+  +  + + + +PV
Sbjct: 452 GEAKQMGDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIEFLRKLRKDNDKPV 511

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           I+VLT G P++V+  E      +I++  YPG+ G KA+A+I+FG+ +P GRLP+T +P+S
Sbjct: 512 IVVLTAGTPVNVT--EIAQLADAIVFAWYPGQEGGKAVADILFGERSPSGRLPIT-FPKS 568

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
             ++P  D     D S Q  GR+YR+ T   +Y FG GLSY    +  ++  +   +S+ 
Sbjct: 569 EAQLPPYD-----DYSMQ--GRTYRYMTEEPMYPFGFGLSYATVKFDNITLGNAEALSS- 620

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
                                    T        V ++VTN G  +   VV L+ + P  
Sbjct: 621 -------------------------TDGQKGTLDVSVNVTNTGTRELEEVVQLYLKTPNA 655

Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
               P + L GF R+      + ++SF V   +  SI N  G+ +L  G++ ++VG
Sbjct: 656 GIDQPIQSLKGFQRIKLAPGQTGQVSFTVSKKQLYSI-NAKGKPVLLEGDYHVIVG 710


>gi|386347261|ref|YP_006045510.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6578]
 gi|339412228|gb|AEJ61793.1| glycoside hydrolase family 3 domain protein [Spirochaeta
           thermophila DSM 6578]
          Length = 693

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 248/756 (32%), Positives = 362/756 (47%), Gaps = 113/756 (14%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           R  SL+S ++++EK   +   A  IPRLGIP Y WW+E+LHG+A++G            T
Sbjct: 6   RMTSLLSKMSIEEKAGLMLHRAKGIPRLGIPHYNWWNEALHGVANSG----------EAT 55

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN-LG-------QAGLTFWAPNINIFRDP 175
            FPQ +  AA+F+  L   +  A++ EARA +N +G       + GLTFW+PNINI+RDP
Sbjct: 56  VFPQAIGLAATFDPDLVRRVAEAISTEARAKFNAIGKERAAEYERGLTFWSPNINIYRDP 115

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRGQET GEDP + S   V FVK  QG                           D   
Sbjct: 116 RWGRGQETYGEDPFLTSKIGVSFVKGLQG---------------------------DHPY 148

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            + ++AC KH   Y +       R+ F+A ++E+D  +T+ P F + ++ G    +M +Y
Sbjct: 149 YMRVAACAKH---YAVHSGPEGLRHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAY 204

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N+VNG PAC    L  +  R  WGFKG++ SDC A+A    +   TK   +S A  L+AG
Sbjct: 205 NRVNGEPACGSKRLLDEILRKRWGFKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAG 264

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            D+NCG     H   A+  G V E+ +DR++  L S   RLGLF  D     Y +L   D
Sbjct: 265 CDLNCGNTY-EHLLDAVKAGVVSEELVDRSVARLLSTLDRLGLFTDD---HPYARLSLSD 320

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           +    H+ LA +AA + +VLLKN+   LP ++  +  + + GP   N   + G Y G+  
Sbjct: 321 IDWEAHRALAREAAEKSVVLLKNNG-ILPFDRQKLRYIYVTGPNAANPVALLGNYAGVSS 379

Query: 475 SPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
              ++L G+  Y        Y  GC  +  N       A  +A+ AD  + V G D T E
Sbjct: 380 RLVTVLEGITGYAGPGITVTYKIGC-PLQGNKINPIDWASGVARYADVTVAVMGRDSTVE 438

Query: 532 TEDRDRV---------SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
            E+ D +          L LP +Q+  +  +    K P+++VL  G P  V   E +   
Sbjct: 439 GEEGDAIFSDNYGDLSDLDLPREQIEYLRRIKEIGK-PLVVVLLSGAP--VCSPELEELA 495

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYP 641
            +I++  YPGE G  A+A ++FG+ +P GRLP+T +P    ++P   D +M         
Sbjct: 496 DAIVYAWYPGEEGGNAIARVLFGEISPSGRLPIT-FPRGVDQLPPFTDYSME-------- 546

Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV 701
           GR+YR+     +Y FG GLSY  +SY+ L                     Q + SR D  
Sbjct: 547 GRTYRYMREEPLYPFGFGLSYATFSYRGL---------------------QSSASRWDKR 585

Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
              E+             V N   +    VV L+ R        P   L GF RV   A 
Sbjct: 586 ETLELVC----------EVENTSSIPADEVVQLYVRWEDAPFRVPLWSLKGFTRVSLGAG 635

Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             K++ F + P E+LS  ++ GR++LP G     VG
Sbjct: 636 ERKQVRFVLSP-EELSFIDEEGRKVLPEGRLHFHVG 670


>gi|160881137|ref|YP_001560105.1| glycoside hydrolase family 3 [Clostridium phytofermentans ISDg]
 gi|160429803|gb|ABX43366.1| glycoside hydrolase family 3 domain protein [Clostridium
           phytofermentans ISDg]
          Length = 717

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 248/773 (32%), Positives = 369/773 (47%), Gaps = 122/773 (15%)

Query: 57  TSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN 116
           T      RA  L+  +TL+EK+ Q   +A +IPRL I AY +W+E+LHG+A  G      
Sbjct: 6   TQQEFQQRATELVKKMTLEEKVFQTLHSAPSIPRLDIKAYNYWNEALHGVARAG------ 59

Query: 117 GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPN 168
                 T FPQ +  AA+F+  L   I   ++ E R  +N  Q         GLTFW+PN
Sbjct: 60  ----VATVFPQAIGLAATFDEDLIEEIADTISTEGRGKFNAQQKYGDHDIYKGLTFWSPN 115

Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
           +NIFRDPRWGRG ET GEDP +       FV   QG               ++  LK   
Sbjct: 116 VNIFRDPRWGRGHETFGEDPFLSGTLGGRFVDGIQG--------------HDETYLKA-- 159

Query: 229 EESDRGDELMLSACCKHLIAY----DLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                      +AC KH   +    D+       R+SFNA ++EQD  +T+ P F+  ++
Sbjct: 160 -----------AACAKHFAVHSGPEDI-------RHSFNAEVSEQDLRETYLPAFKKLVK 201

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
           + K   +M +YN+ NG P C    L +   R EW F G++TSDC A+    E+   T   
Sbjct: 202 EHKVEAVMGAYNRTNGEPCCGSKTLLEDILRGEWEFVGHVTSDCWAIKDFHEHHMVTSNA 261

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
            +S A  +  G D+NCG   +   Q A+  G V+E+ ID AL+ LF+ +++LGLF+ +  
Sbjct: 262 VESVALAMNRGCDLNCGNLYVNLLQ-AVRDGLVEEETIDTALIRLFTTRMKLGLFDKE-E 319

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
              +  +  D V T   K+L + A+++ +VLLKN+   LPLN   ++S+ +IGP  NN +
Sbjct: 320 SIPFNTITYDQVDTKSSKELNIKASKKCVVLLKNEDNILPLNPKKITSVGVIGPNANNRN 379

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIA------ 514
            + G Y G      ++L G++  V    + +++ GCH           E  RIA      
Sbjct: 380 ALVGNYEGTASEYITVLEGIKQVVPEDVRVYFSEGCHLFKNKLSNLSQENDRIAEVRAVC 439

Query: 515 KKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
           + +D VI   GLD   E E+          D+ +L LPG Q  ++ ++    K PVIL+L
Sbjct: 440 EHSDVVIACLGLDPGLEGEEGDQGNQFASGDKKTLALPGIQEDVLKTIYECGK-PVILIL 498

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
             G  L V +  AD  I +IL   YPG  G +A+AE+IFGD NP G+LP+T+Y  +    
Sbjct: 499 LSGSALAVPW--ADEHIPAILQGWYPGAQGGRAIAELIFGDGNPEGKLPVTFYRTTEELP 556

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
              D  M+         R+YR+     +Y FG+GLSYT + +  L               
Sbjct: 557 EFTDYAMK--------NRTYRYMKNEALYPFGYGLSYTTFEHTLL--------------- 593

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
                         YV+ D +   +++   V+  V N GD +GS  V   A V  V +  
Sbjct: 594 --------------YVNTDTLGKGSNVECMVR--VKNTGDYEGS--VTTQAYVKYVGEDA 635

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           P  QL G  +V  +    K+I   +D      + N+ G  IL  G + L + +
Sbjct: 636 PNCQLKGLKKVSLLPGEEKDIMIELDD-RAFGLYNEEGEFILNQGEYELYLSD 687


>gi|189201569|ref|XP_001937121.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187984220|gb|EDU49708.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 756

 Score =  367 bits (941), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 240/764 (31%), Positives = 373/764 (48%), Gaps = 61/764 (7%)

Query: 40  PDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           PD  C      S   C+ + S + RA +L++ +  QEK+  L   +  + RLG+PAY WW
Sbjct: 22  PD--CVNGPLKSNAICDVTASPAKRAAALVAAMQTQEKLDNLVSKSKGVARLGLPAYNWW 79

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
            E+LHG+A   PG+NF G   + TSFP  L+ +A+F+  L   I   +  EARA  N G 
Sbjct: 80  GEALHGVAG-APGINFTGPYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGI 138

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           A + FW P+IN FRDPRWGRG ETPGED + +  Y    +   +G+              
Sbjct: 139 APVDFWTPDINPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGDK------------A 186

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
           +++++                A CKH + YD+E W    R+SF+A IT QD  + F PPF
Sbjct: 187 QRKII----------------ATCKHYVGYDMEDWNGTDRHSFDAKITTQDLAEYFMPPF 230

Query: 280 RSCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFE 335
           + C    K    MCSYN VNGVP C    +  D+ +K  N      YITSDC+AV  I  
Sbjct: 231 QQCARDSKVGSFMCSYNAVNGVPTCADTYVLEDILRKHWNWTDSNNYITSDCEAVKDISL 290

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
              Y  T +++ A     GMD++C          A  +G +    IDRAL   +   +  
Sbjct: 291 RHKYVATLQEATAIAFNNGMDLSCEYSGSSDIPGAFSQGLLNVSVIDRALTRQYEGLVHA 350

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G F+G      Y  LG  D+ T E +KL L  A +G+ LLKND   LPL+  + S +A++
Sbjct: 351 GYFDG--AAATYANLGVQDINTPEAQKLVLQVAAEGLTLLKNDDT-LPLSLKSGSKVAMV 407

Query: 456 GPLVNNISQMGGGYTG-IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIA 514
           G   N+ S++ G Y+G  P     +  G +  +               +D    +A+  A
Sbjct: 408 GFWANDSSKLSGIYSGPAPYLHNPVYAGNKLGLDMAVATGPILQKSGAADNWTTKALDAA 467

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           KK+D ++   GLD +   E  DR  +  P  Q+ L+T +A   K  V++ L  G  +D  
Sbjct: 468 KKSDTILYFGGLDPSAAAEGSDRTDISWPSAQIDLITKLAALGKPLVVIAL--GDMVDHM 525

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
                  ++S++W  +PG+ G  A+ ++I G+    GRLP+T YP  +T++ M DMN+R 
Sbjct: 526 PILNMKGVNSLIWANWPGQDGGTAVMQVITGEHAIAGRLPITQYPAKYTQLSMLDMNLRP 585

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
             +   PGR+YR+Y    V  FG GL YT ++ KF S  S LT++        ++I++  
Sbjct: 586 GGNN--PGRTYRWYN-ESVQPFGFGLHYTKFAAKFGSN-SSLTVNI-------QDIMKSC 634

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
               D+  + +V         ++++VTN G+     + + F +     +  P K L+ + 
Sbjct: 635 TK--DHPDLCDVPP-------IEVAVTNKGNRTSDFIALAFIKGEVGPKPYPLKTLVSYA 685

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           R+  ++    + +        LS  ++ G  +   G + L++ E
Sbjct: 686 RLRDISGSQTKTASLALTLGTLSRVDQSGNLVAYPGEYTLLLDE 729


>gi|219887077|gb|ACL53913.1| unknown [Zea mays]
 gi|224035251|gb|ACN36701.1| unknown [Zea mays]
 gi|413919685|gb|AFW59617.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 405

 Score =  367 bits (941), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 194/414 (46%), Positives = 264/414 (63%), Gaps = 12/414 (2%)

Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
           +RLG F+GDPR+  +G LGP DVCT  +++LA +AARQGIVLLKN  K LPL+  ++ S+
Sbjct: 1   MRLGFFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKNTGK-LPLSATSIKSM 59

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAV 511
           A+IGP  N    M G Y G PC   + L+GL A V+ T Y  GC +V C+ ++     A 
Sbjct: 60  AVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVA-TVYQPGCTNVGCSGNSLQLDAAT 118

Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
           + A  AD  ++V G D + E E  DR SLLLPGQQ  LV++VA  S  P ILV+  GGP 
Sbjct: 119 KAAASADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPF 178

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
           D+SFA++  +I++ILW+GYPGEAG  A+A+++FG  NP GRLP+TWYPESFTKVPM DM 
Sbjct: 179 DISFAKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTKVPMTDMR 238

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
           MR D S  YPGR+YRFYTG  VY FG GLSYT++++  +SAP +L    +L+       L
Sbjct: 239 MRPDPSTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQL----ALQLAEGHACL 294

Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
            +    ++     E   C  L F V + V NAG+  G H V LF+  P V    P K L+
Sbjct: 295 TEQCPSVEA----EGAHCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPAV-HNAPAKHLL 349

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           GF++V      +  ++F VD C+ LS+ ++ G R + LG+H L VG+L+H+L +
Sbjct: 350 GFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGDLKHTLNL 403


>gi|332307852|ref|YP_004435703.1| glycoside hydrolase family protein [Glaciecola sp. 4H-3-7+YE-5]
 gi|332175181|gb|AEE24435.1| glycoside hydrolase family 3 domain protein [Glaciecola sp.
           4H-3-7+YE-5]
          Length = 733

 Score =  366 bits (939), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 242/776 (31%), Positives = 377/776 (48%), Gaps = 103/776 (13%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
             P   +  P+ +T L    R   LI  +TL+EK  QL +   AI RLG+P Y++W+E+L
Sbjct: 16  ATPALANDQPWFDTQLPTQERIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEAL 75

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---- 159
           HG+A NG            T FPQ +  AA+F++ L     S ++ EARA +N+      
Sbjct: 76  HGVARNG----------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGN 125

Query: 160 ----AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
               +GLTFW PNINIFRDPRWGRGQET GEDP + +      V   QG++ K       
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKY------ 179

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                                L  +A  KH   + +       R+ F+A+ + +D  +T+
Sbjct: 180 ---------------------LKTAAAAKH---FAVHSGPEALRHEFDAIASPKDMYETY 215

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
            P F + I +     +M +YN+VNG PA     L     R++WGF G++ SDC  +A   
Sbjct: 216 FPAFEALITEANVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFH 275

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           +Y   T    +SAA  +  G D+NCG  +      A++ G V EK ID+ L  + + + +
Sbjct: 276 QYHKVTANAVESAALAINTGTDLNCG-AVYNALPDAVEAGLVDEKTIDKRLSKVLATKFK 334

Query: 395 LGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           LG F  DP+    Y  +  D V +  H ++A + A + IVLL+N    LPL++N + +L 
Sbjct: 335 LGFF--DPKDDNPYNNISADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDRN-IRNLY 391

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEA 510
           + GP  ++   + G Y G+     ++L G+ A VS     +Y  G      N +      
Sbjct: 392 VTGPFASSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTT 451

Query: 511 VRIAKKADFVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPV 561
               +  D +I V GL    E E+          DR+SL LP  Q++ +  + + + +PV
Sbjct: 452 GEAKQMGDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPV 511

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           I+VLT G P  V+  E      +I++  YPG+ G KA+A+I+FG+ +P GRLP+T +P+S
Sbjct: 512 IVVLTAGTP--VNLTEIAELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPIT-FPKS 568

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
             ++P  D     D S Q  GR+YR+ T   +Y FG GLSY    +  ++         +
Sbjct: 569 EAQLPPYD-----DYSMQ--GRTYRYMTQEPMYPFGFGLSYAQVKFDNITL-------GN 614

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
            +A + KN  Q+                      V ++VTN G+ +   VV L+ + P  
Sbjct: 615 TQALASKNEPQE-------------------NMTVTVNVTNTGEREFEEVVQLYLKTPDA 655

Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
               P   L GF R+   A  ++++ F + P + L   N+ G+ +L  G + ++VG
Sbjct: 656 GVSQPLHSLKGFTRIKLAAGQTEQVLFSI-PKKHLYSINEQGKPVLLKGQYSVIVG 710


>gi|345519864|ref|ZP_08799275.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
 gi|254836262|gb|EET16571.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
          Length = 736

 Score =  365 bits (937), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 243/777 (31%), Positives = 368/777 (47%), Gaps = 115/777 (14%)

Query: 50  DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
           ++ PF N  L +  R K L++ LTL+EK+  +  ++ AIPRLGIPAY+WW+E+LHG+A  
Sbjct: 22  ENLPFRNADLPLEVRVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVAR- 80

Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQAG---- 161
                   T+  VT FPQ +  AA+F+      +G   + E RA++N     G+ G    
Sbjct: 81  --------TLEKVTVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYR 132

Query: 162 -LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
            LT+W PNINIFRDPRWGRGQET GEDP + +      V+  +GE              +
Sbjct: 133 GLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLEGE--------------D 178

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
              LK               AC KH   Y +     ++R+SF+A  +  D  DT+ P FR
Sbjct: 179 PHYLKSV-------------ACAKH---YAVHSGPEYNRHSFDARPSVFDLWDTYMPAFR 222

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
             + + K   +MC+YN++NG P C    L     RN+W F GY+TSDC A+    E+   
Sbjct: 223 ELVTKAKVHGVMCAYNRLNGQPCCGNDPLLVDILRNQWHFDGYVTSDCWALKDFAEFHKT 282

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
              H  + +  L AG D+ CG  +       + KG   E+DI+ +L  LF++  ++G+F 
Sbjct: 283 HPEHTIAMSDALLAGTDLECGN-LYHLLAEGVKKGLHSERDINVSLSRLFTILFKIGMF- 340

Query: 400 GDP-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
            DP  +  Y  +G + +    HK+ A   A++ IVLL+N    LPL+ + + S+A+IGP 
Sbjct: 341 -DPAERVPYSSIGREVLECEAHKQHAERMAKESIVLLENKNHILPLDASKIKSIALIGPN 399

Query: 459 VNNISQMGGGYTGIP---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIA 514
            +N       Y G P    +P   L+       K +Y  G   V    DA  F +    A
Sbjct: 400 ADNGQTQLANYFGTPSEIVTPYMSLKRRLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKA 459

Query: 515 KKADFVIVVAGLDLTQE-------------TEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
            ++D ++ V+G+    E                 DR ++ LP  Q+ L+  + +T  RP+
Sbjct: 460 AQSDVIVFVSGISADYEGEAGDAGAAGYGGFASGDRTTMQLPLVQIELLKKLKKTG-RPL 518

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           I+V   G  +  SF        ++L   Y G+A   A+ +++FG  NP GR+P+T Y   
Sbjct: 519 IIVNMSGSVM--SFEWESQNADALLQAWYGGQAAGDAIVDVLFGHCNPAGRMPLTTYKSD 576

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
               P  + +M         GR+YR++ G   Y FG+GLSYT ++Y              
Sbjct: 577 NDLPPFENYSML--------GRTYRYFKGEPRYPFGYGLSYTTFAYS------------- 615

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
                            D   +DE  +  + R  V ++V+N GD DG  VV L+   P+ 
Sbjct: 616 -----------------DVQCVDETHTGDTAR--VTVTVSNTGDCDGDEVVQLYVVHPQD 656

Query: 742 SQG-TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            +   P   L GF R+H     S  +SF + P E+L++    G  +   G   L VG
Sbjct: 657 GRKQIPLCALKGFKRIHLKRGESTSVSFTLTP-EELALTETDGNLVEKNGQVTLFVG 712


>gi|402074909|gb|EJT70380.1| hypothetical protein GGTG_11406 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 793

 Score =  364 bits (935), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 255/775 (32%), Positives = 389/775 (50%), Gaps = 84/775 (10%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+ SLS S RA +L++ L + EK+  L  NA+   R+G+P Y WWSE+LHG+A   PG  
Sbjct: 44  CDRSLSPSERAAALVAALNVTEKMANLVSNANGSARIGLPKYNWWSEALHGVAY-APGTQ 102

Query: 115 FN---GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           F    G  +S TSFP  L+ AASF+ SL   IG  +  E+RA  N   +GL +W PN+N 
Sbjct: 103 FRRGPGDFNSSTSFPMPLLLAASFDDSLIEKIGDVIGTESRAFGNGRWSGLDYWTPNVNP 162

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           F+DPRWGRG ETPGED + +  YA   +K  +G + +          +E+RV+       
Sbjct: 163 FKDPRWGRGSETPGEDILRIKRYAASMIKGLEGPHPE----------KERRVV------- 205

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                    + CKH  A D E W   SR+ F+A I+ QD  + +  PF+ C    +   I
Sbjct: 206 ---------STCKHYAANDFEDWNGTSRHDFDARISAQDLAEYYLMPFQQCARDSRVGSI 256

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSA 347
           MC+YN VNGVP+C    L     R  WG+ G   Y+TSDC+AV  +     Y +T+ +  
Sbjct: 257 MCAYNAVNGVPSCANSYLLDTVLRKHWGWTGHNNYVTSDCEAVLDVSAGHKYARTNAEGT 316

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A   +AG D +C        + A  +G ++E+ +DRALL L+   +R+G F+G+     +
Sbjct: 317 AMCFEAGTDTSCEYTPSSDIRGAYAQGLLREETMDRALLRLYEGLVRVGYFDGN--SSAF 374

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF-LPLNKNAVS-----------SLAII 455
             +   DV     + L+L +A +GIV+LKND    LPL     S            LA+I
Sbjct: 375 SDISWADVNAPAAQDLSLQSAVEGIVMLKNDGTLPLPLGAKCSSKSKKRSSSGGPKLAMI 434

Query: 456 GPLVNNISQMGGGYTGIPC---SPKSLLR--GLEAYVSKTHYASGCHDVPCNSDAGFHEA 510
           G   +   ++ GGY+G      +P    R  GL+   +      G       +D     A
Sbjct: 435 GFWADAPEKLRGGYSGTAAYLRTPAYAARQMGLDVVTAGGPVLQGAAAA--AADNWTAPA 492

Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           +  A+ AD+++   GLD T   E++DR  +  PG Q++LV  +A   K P+++V  G   
Sbjct: 493 LAAAEGADYIVYFGGLDETAAGENKDRWDVEWPGAQLALVKRLAALGK-PLVVVQMGDQ- 550

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK-VPMND 629
           LD +   A++ + ++LW  +PG+ G  A+  ++ G  +P GRLP+T YP ++T+ VPM +
Sbjct: 551 LDGTPLLANAGVGAVLWASWPGQDGGPAVMRLLSGAASPAGRLPVTQYPANYTRLVPMTE 610

Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF-----LSAPSELTISASLKA 684
           M +R  +S   PGR+YR+Y+ T V  FG GL YTN++        L+A S +T S+ L+A
Sbjct: 611 MALRPSASGSRPGRTYRWYS-TPVLPFGFGLHYTNFTPAVTVPPALAAASGVTTSSLLEA 669

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
             D +               E  +   LR    ++V N G     +V + F       + 
Sbjct: 670 CRDPH--------------PERCALPPLR----VAVANTGRRASDYVALAFVSGDYGPRP 711

Query: 745 TPEKQLIGFDRVHTV-AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
            P K L  + R+  V A GS E          ++  ++ G  +L  G + + + E
Sbjct: 712 RPIKTLAAYARLRGVRAGGSAEADLAWT-LGDIARHDEDGNTVLYPGTYKVQIDE 765


>gi|359409694|ref|ZP_09202159.1| Beta-glucosidase [Clostridium sp. DL-VIII]
 gi|357168578|gb|EHI96752.1| Beta-glucosidase [Clostridium sp. DL-VIII]
          Length = 723

 Score =  364 bits (935), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 244/771 (31%), Positives = 371/771 (48%), Gaps = 120/771 (15%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           NT  +    AK L++ +TLQEK +QL+ N+ A+ RL IP Y WW+E LHG+A  G     
Sbjct: 20  NTHEAAVKAAKELVAKMTLQEKAEQLTYNSPAVKRLNIPEYNWWNEGLHGVARAG----- 74

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAP 167
                + T FPQ +  AA F+      +   +A E RA YN            GLT+W+P
Sbjct: 75  -----TATVFPQAIGLAAMFDEEFLGKVAGIIATEGRAKYNENSKKEDRDIYKGLTYWSP 129

Query: 168 NINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGF 227
           N+NIFRDPRWGRG ET GEDP + S   V FVK  QG+                      
Sbjct: 130 NVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQGD---------------------- 167

Query: 228 GEESDRGDELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQPPFRSCIEQG 286
                 G  L LSAC KH   +     G  S R+ FNA+++++D  +T+ P F +C+++ 
Sbjct: 168 ------GKYLKLSACAKHFAVHS----GPESLRHEFNAVVSQKDLHETYLPAFEACVKEA 217

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
               +M +YN+ NG P C    L +   R +WGFKG++ SDC A+A    +   T T  +
Sbjct: 218 NVESVMGAYNRTNGEPCCGSKALLKDILRGKWGFKGHVVSDCWALADFHMHHKVTSTATE 277

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           S A  ++ G D+NCG  M  +   A  +G V E+ I  A   L + + +LG+F+ D    
Sbjct: 278 SVALAIENGCDLNCGN-MYLNLLLAYKEGLVTEEQITTAAERLMTTRFKLGMFDEDC--- 333

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
           +Y ++  +     EH +++L+A+R+ +VLLKN+   LPL+K+ + ++A+IGP  N+   +
Sbjct: 334 EYNQIPYEVNDCKEHNQVSLEASRKSMVLLKNN-GILPLDKSKLKAVAVIGPNANSEIML 392

Query: 466 GGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKK 516
            G Y+G      ++L G+   +    + +Y+ GCH      +     D    EAV +A++
Sbjct: 393 KGNYSGTASKYTTILDGIHDVLDDDVRVYYSEGCHLYKEKVEDLARRDDRLAEAVSVAER 452

Query: 517 ADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
           AD VI+  GLD T E E           D++ L LPG Q  L+  V  T K PV++VL  
Sbjct: 453 ADVVILCLGLDSTIEGEQGDAGNGYGAGDKLDLNLPGIQQELLEKVLETGK-PVVVVLGT 511

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G  L ++ AE   + ++IL   YPG  G  A A+I+FG  +P G+LP+T+Y ++      
Sbjct: 512 GSGLTLNGAE--ERCAAILNAWYPGSHGGTAAADILFGKCSPSGKLPVTFYKDTDKLPEF 569

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQ-VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
            D  M+        GR+YR+   +  +Y FG+GL+Y+      L  P+            
Sbjct: 570 TDYAMK--------GRTYRYMDESNCLYPFGYGLTYSTVELSNLQVPA------------ 609

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                     R ++  ID           + + + N G  D   VV  + +  +      
Sbjct: 610 ---------VRGEFDGID-----------ISVEIENTGSYDIEEVVQCYIKDLESKYAVL 649

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
              L GF RV      SK ++  ++        +  G RIL      L VG
Sbjct: 650 NHSLAGFKRVSLKKGESKTVTMKLNR-RAFEAVDDAGERILDSKKFKLFVG 699


>gi|358380569|gb|EHK18247.1| glycoside hydrolase family 3 protein, partial [Trichoderma virens
           Gv29-8]
          Length = 722

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 252/745 (33%), Positives = 373/745 (50%), Gaps = 81/745 (10%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS-------SVTS 124
           LTL EK   L +NA  + RLG+P YEW +E+LHG+A   PG   N T +       S T 
Sbjct: 12  LTLDEKAANLVNNAPGVKRLGLPPYEWRNEALHGLAGVSPGQGINSTFTQGNVAFNSSTQ 71

Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETP 184
           FP  +V  A+F+  L  +I +AV+ EARA  N  +AGL +WAPNIN +RDPRWGRGQETP
Sbjct: 72  FPSPIVLGAAFDDHLVHDIATAVSTEARAFSNHLKAGLDYWAPNINPYRDPRWGRGQETP 131

Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
           GEDP  V+ YA  +V   +G          G G  + +V+                + CK
Sbjct: 132 GEDPYHVAQYAYNYVVGLKG----------GVGPAKSKVV----------------STCK 165

Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
           H   YD+E      R S+NA+I+ QD  + + P FRSC    K   +MCSYN VNG P+C
Sbjct: 166 HFAGYDIEDSDGVVRGSYNAIISTQDLAEYYLPSFRSCFRDAKTGAVMCSYNAVNGHPSC 225

Query: 305 LRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG 360
               +     R+ WG+     ++T DC AV  +F   +  ++     A  +  G D++CG
Sbjct: 226 ANSYMLDTVLRDHWGWGSSAHWVTGDCGAVDGVFNQHHVGQSAAQGVAFAINNGTDLDCG 285

Query: 361 TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSE 419
           T    +  SA+      E  +D+AL  L+S  + LG F  DP +G+ Y  LG  DV T  
Sbjct: 286 TAYASNIASAVQNNYTTEAQLDQALSRLYSSLIVLGYF--DPPEGQEYRTLGVSDVNTPS 343

Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS-QMGGGYTGIPCSPKS 478
            +KLA  A  +GI +L       P+ +    ++  +GP  NN S  M G Y G+  +P  
Sbjct: 344 TQKLAYTALVEGINIL----PIRPMGQ----TVLFVGPWANNASVSMFGNYNGV--APYK 393

Query: 479 LL----RGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED 534
            +        AY     Y+ G   V  N  + F  AV  A++AD V+ + G+D   E E 
Sbjct: 394 TIPVPTANSSAYNWNVTYSQGLQYVLSNDTSQFAAAVSAAQEADVVVYIGGIDEQVEAEA 453

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR S+  PG Q++L+  +A  + +PV++V  GGG +D S    +  +  +LW+GYPG+ 
Sbjct: 454 HDRTSIDWPGAQLNLIKQLA--AVKPVVVVQVGGGQVDDSSLLQNKNVKGLLWMGYPGQE 511

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
               L +I+ G   P GRLP+T YP ++ T+VPM D ++R  SS   PGR+YR+Y G+ V
Sbjct: 512 FGSGLIDILSGASAPAGRLPVTQYPANYITQVPMTDQSLRPSSSN--PGRTYRWYNGS-V 568

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
             FG G+ YT ++  + +             GS +     T    D+++ ++        
Sbjct: 569 IPFGTGIHYTKFNISWKT------------GGSGRG----TYDTADFINAEDPKDLAEFD 612

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVS-QGTPEKQLIGFDRVHTVAKG-SKEISFGVD 771
              QI+V N G     +V +LF +      Q  P K L+ + R H    G + +I   V+
Sbjct: 613 V-FQINVENVGSTTSDYVALLFVKSSDSGPQPYPLKTLVSYARAHGTQPGETTKIDLRVN 671

Query: 772 PCEQLSIANKHGRRILPLGNHVLMV 796
              Q++  +  G  +L  G + L +
Sbjct: 672 -VGQIARNDSSGNLVLYPGAYTLEI 695


>gi|325192664|emb|CCA27085.1| unnamed protein product [Albugo laibachii Nc14]
          Length = 2278

 Score =  362 bits (930), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 254/791 (32%), Positives = 386/791 (48%), Gaps = 106/791 (13%)

Query: 34  KPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNAS---AIPR 90
           K +L+     C+ P FD +PFCN+SLS+  R + L+  L L EK++ L+  AS   +IPR
Sbjct: 17  KLILSSDAHACEYP-FDYFPFCNSSLSLDLRVEDLLQRLQLDEKVRMLTARASTHGSIPR 75

Query: 91  LGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVE 150
           LG+P Y W +  +HG+ S        GT    TSFP  +   A F+ +    +   +  E
Sbjct: 76  LGVPEYNWGANCVHGVQSTC------GT-HCATSFPNPVNLGAIFDPNEIYKMAQVIGKE 128

Query: 151 ARAM--------YNLG-QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKS 201
            RA+        Y  G   GL  W+PNINI RDPRWGR  ETP EDP V + Y V + K 
Sbjct: 129 LRALRLEGARENYARGPHIGLDCWSPNININRDPRWGRAMETPSEDPYVNAKYGVAYTKG 188

Query: 202 FQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYS 261
            Q       +G      ++ R L+                  KH +AY  E +G   R  
Sbjct: 189 LQ-------EG------QDSRFLQAV-------------VTLKHYLAYSYENYGGTDRTQ 222

Query: 262 FNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKG 321
           F+A+++  D  DT+ P F + +  GKA  IMCSYN +NG+P C    L Q  R++  F G
Sbjct: 223 FDAIVSAYDFADTYFPAFEASVVDGKAKGIMCSYNSLNGIPTCANKWLNQLLRDDLEFDG 282

Query: 322 YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDI 381
           YITSD  A+  IF+   YTKT  ++    +++G+DI  G       +   +        I
Sbjct: 283 YITSDTGAIQGIFDGHKYTKTLCEATKIAMESGVDICSGNAYWNCLKQLANSTNFS-ASI 341

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           D A+     ++ +LGLF+     G     GP+DV T++  +L+LD AR+ IVLL+N    
Sbjct: 342 DEAIRRTLKLRFQLGLFDA---IGDQPHFGPEDVRTAKSLQLSLDLARKSIVLLQNHGNT 398

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC-----SPKSLLRGLEAYVS-----KTH 491
           LPL       +A+IGP       + G Y G  C       + +   LEA  S      TH
Sbjct: 399 LPLRLGL--RIAVIGPHSMTRRGIMGNYYGQLCHGDYDEVRCIQSPLEAIQSVNGRNNTH 456

Query: 492 YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
           + +GC  +   S A F +A++  + AD  ++  G+D++ E E +DR ++ +P  Q+ L+ 
Sbjct: 457 HVNGC-GINDTSTAEFDDALQAVRTADVAVLFLGIDISIERESKDRDNIDVPHIQLELLK 515

Query: 552 SVARTSKRPVILVLTGGGPLDVS--FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNP 609
           ++ R + +P ++VL  GG L +      ADS + +     YPG  GA+A+AEI+FG  NP
Sbjct: 516 AI-RVAGKPTVVVLFNGGILGIEKLILYADSVLEAF----YPGFFGAQAIAEILFGSINP 570

Query: 610 GGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF 669
            G+LP+T Y  +F    +ND++M++ S   YPGRSYR+YT   VY FG GLSYT +S + 
Sbjct: 571 SGKLPVTMYRSNF----INDVDMKSMSMTLYPGRSYRYYTEVPVYSFGWGLSYTTFSIQS 626

Query: 670 LSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGS 729
           + +     ++  L A                                +I +TN G   G 
Sbjct: 627 IDSHDTRAMNHVLTAQPKM---------------------------YRILITNNGKYYGE 659

Query: 730 HVVMLFARVPKVSQGTP----EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR 785
            V+  F R   +    P    ++QL  + RV       +E+   V   E L++ +++G  
Sbjct: 660 EVLFAFFRPLDIHATGPVESLQQQLFNYTRVRLDPGDMREVPLHVKD-ENLALHDRNGNL 718

Query: 786 ILPLGNHVLMV 796
            +  G + L++
Sbjct: 719 CVFEGFYELII 729


>gi|410639677|ref|ZP_11350222.1| beta-glucosidase [Glaciecola chathamensis S18K6]
 gi|410140558|dbj|GAC08409.1| beta-glucosidase [Glaciecola chathamensis S18K6]
          Length = 733

 Score =  362 bits (930), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 240/776 (30%), Positives = 375/776 (48%), Gaps = 103/776 (13%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
             P   +  P+ +T L    R   LI  +TL+EK  QL +   AI RLG+P Y++W+E+L
Sbjct: 16  ATPALANDQPWFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEAL 75

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---- 159
           HG+A NG            T FPQ +  AA+F++ L     S ++ EARA +N+      
Sbjct: 76  HGVARNG----------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGN 125

Query: 160 ----AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
               +GLTFW PNINIFRDPRWGRGQET GEDP + +      V   QG++ K       
Sbjct: 126 RSKYSGLTFWTPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGDHPKY------ 179

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                                L  +A  KH   + +       R+ F+A+ + +D  +T+
Sbjct: 180 ---------------------LKTAAAAKH---FAVHSGPEALRHEFDAIASPKDMYETY 215

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
            P F + + +     +M +YN+VNG PA     L     R++WGF G++ SDC  +A   
Sbjct: 216 FPAFEALVTEANVETVMAAYNRVNGHPAGGSDFLLNTVLRDKWGFSGHVVSDCWGLADFH 275

Query: 335 EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           +Y   T    +SAA  +  G D+NCG  +      A++ G V EK ID+ L  + + + +
Sbjct: 276 QYHKVTANAVESAALAINTGTDLNCG-AVYNALPDAVEAGLVDEKTIDKRLSKVLATKFK 334

Query: 395 LGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           LG F  DP+    Y  +  D V +  H ++A + A + IVLL+N    LPL++N + +L 
Sbjct: 335 LGFF--DPKDDNPYNNISADVVNSEAHAQVAYEMAVKSIVLLQNKNNILPLDRN-IRNLY 391

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEA 510
           + GP  ++   + G Y G+     ++L G+ A VS     +Y  G      N +      
Sbjct: 392 VTGPFASSSEVLLGNYYGLSGKTTNILDGITANVSVGTTINYKQGILPYQANVNPIDWTT 451

Query: 511 VRIAKKADFVIVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPV 561
               +  D +I V GL    E E+          DR+SL LP  Q++ +  + + + +PV
Sbjct: 452 GEAKQMGDVIIAVMGLSGAYEGEEGEAIASPHKGDRLSLDLPEHQIAFLRKLRKDNDKPV 511

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           I+VLT G P  V+  E      +I++  YPG+ G KA+A+I+FG+ +P GRLP+T +P+S
Sbjct: 512 IVVLTAGTP--VNLTEIAELADAIVFAWYPGQEGGKAVADILFGERSPSGRLPIT-FPKS 568

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
             ++P  D     D S Q   R+YR+ T   +Y FG GLSY    +  ++         +
Sbjct: 569 EAQLPPYD-----DYSMQE--RTYRYMTQEPMYPFGFGLSYAQVKFDNITL-------GN 614

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
            +A + KN  Q+                      V ++VTN G+ +   VV L+ + P  
Sbjct: 615 TQALASKNEPQEN-------------------MTVTVNVTNTGEREFEEVVQLYLKTPDA 655

Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
               P   L GF R+   A  ++++ F + P + L   N  G+ +L  G + ++VG
Sbjct: 656 GVSQPLHSLKGFTRIKLAAGQTEQVLFNI-PKKHLYSINAQGKPVLLKGQYSVIVG 710


>gi|367032987|ref|XP_003665776.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347013048|gb|AEO60531.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 835

 Score =  362 bits (930), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 233/669 (34%), Positives = 342/669 (51%), Gaps = 46/669 (6%)

Query: 25  VNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN 84
           ++ +A+    P    PD  C  P       C+ +L  + RA +L++ LT +EK+Q L   
Sbjct: 1   MSAVAYGLDGPFQTYPD--CTKPPLSDIKVCDRTLPEAERAAALVAALTDEEKLQNLVSK 58

Query: 85  ASAIPRLGIPAYEWWSESLHGIASNGPGVNFN---GTVSSVTSFPQVLVSAASFNRSLWS 141
           A   PR+G+PAY WWSE+LHG+A + PG  F    G  +S TSFP  L+ AA+F+  L  
Sbjct: 59  APGAPRIGLPAYNWWSEALHGVA-HAPGTQFRDGPGDFNSSTSFPMPLLMAAAFDDELIE 117

Query: 142 NIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKS 201
            +G  +  EARA  N G +GL +W PN+N FRDPRWGRG ETPGED + +  YA   ++ 
Sbjct: 118 AVGDVIGTEARAFGNAGWSGLDYWTPNVNPFRDPRWGRGSETPGEDVVRLKRYAASMIRG 177

Query: 202 FQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYS 261
            +G +  S     G G    RV+                + CKH    D E W   +R+ 
Sbjct: 178 LEGRSSSSSSCSFGSGGEPPRVI----------------STCKHYAGNDFEDWNGTTRHD 221

Query: 262 FNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGF- 319
           F+A+I+ QD  + +  PF+ C    +   +MC+YN VNGVP+C    L     R  W + 
Sbjct: 222 FDAVISAQDLAEYYLAPFQQCARDSRVGSVMCAYNAVNGVPSCANSYLMNTILRGHWNWT 281

Query: 320 --KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQ 377
               Y+TSDC+AV  +  + +Y  T+ +      +AGMD +C          A   G + 
Sbjct: 282 EHDNYVTSDCEAVLDVSAHHHYADTNAEGTGLCFEAGMDTSCEYEGSSDIPGASAGGFLT 341

Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
              +DRAL  L+   +R+G F+G   +  +  LG  DV   E ++LAL AA +GIVLLKN
Sbjct: 342 WPAVDRALTRLYRSLVRVGYFDGP--ESPHASLGWADVNRPEAQELALRAAVEGIVLLKN 399

Query: 438 DKKFLPL---------NKNAVSSLAIIGPLVNNISQMGGGYTGIPC---SPKSLLRGLEA 485
           D   LPL                +A+IG   +   ++ GGY+G P    SP S  R L  
Sbjct: 400 DNDTLPLPLPDDVVVTADGGRRRVAMIGFWADAPDKLFGGYSGAPPFARSPASAARQLGW 459

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
            V+         D     D     AV  A  AD+++   GLD +   E +DR+++  P  
Sbjct: 460 NVTVAGGPVLEGDSDEEEDTWTAPAVEAAADADYIVYFGGLDTSAAGETKDRMTIGWPAA 519

Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
           Q++L++ +AR  K PV++V  G    D    E D  + ++LW  +PG+ G  A+  ++ G
Sbjct: 520 QLALISELARLGK-PVVVVQMGDQLDDTPLFELDG-VGAVLWANWPGQDGGTAVVRLLSG 577

Query: 606 DFNPGGRLPMTWYPESFTK-VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTN 664
             +P GRLP+T YP ++T  VP+ DM +R  ++   PGR+YR+Y  T V  FG GL YT 
Sbjct: 578 AESPAGRLPVTQYPANYTDAVPLTDMTLRPSATN--PGRTYRWYP-TPVRPFGFGLHYTT 634

Query: 665 YSYKFLSAP 673
           +  +F   P
Sbjct: 635 FRAEFGPHP 643


>gi|291548352|emb|CBL21460.1| Beta-glucosidase-related glycosidases [Ruminococcus sp. SR1/5]
          Length = 697

 Score =  362 bits (929), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 242/751 (32%), Positives = 361/751 (48%), Gaps = 124/751 (16%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +A++L++ +TL+EK  QL  +A AI RLGIPAY WW+E LHG+A  G            T
Sbjct: 9   KAEALVARMTLEEKASQLRYDAPAIKRLGIPAYNWWNEGLHGVARAG----------QAT 58

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +  AA+F+R   + +   VA E RA YN            GLTFW+PN+NIFRDP
Sbjct: 59  VFPQAIGMAAAFDRKSVAEMAGIVATEGRAKYNAYSVNGDRDIYKGLTFWSPNVNIFRDP 118

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP +     V FVK+ QG                             GD
Sbjct: 119 RWGRGHETYGEDPYLTKELGVSFVKALQG----------------------------NGD 150

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            +  +AC KH   +   +     R+ F+A  + +D E+T+ P F   +++ K   +M +Y
Sbjct: 151 TMKAAACAKHFAVHSGPE---ALRHEFDAEASAKDMEETYLPAFEGLVKEAKVEAVMGAY 207

Query: 296 NQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
           N+ NG P C    L +K R EW F+G+  SDC A+    E+   T T  +SAA  +  G 
Sbjct: 208 NRTNGEPCCGSPTLQKKLRGEWKFQGHFVSDCWAIRDFHEHHMVTDTAVESAALAINNGC 267

Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
           D+NCG   L H   A +KG V E+ I RA + LF+ +  LGLF+G     +Y  L   +V
Sbjct: 268 DLNCGNTYL-HIMKAYEKGLVTEETITRAAVRLFTTRYLLGLFDG----SEYDNLSYMEV 322

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
            +  H   A  AA +  VLLKN+   LPL+K  + ++ IIGP  ++   + G Y G    
Sbjct: 323 ESPRHLDAAEKAAEKSFVLLKNN-GILPLDKEKLKTIGIIGPNADSRQALIGNYHGTASR 381

Query: 476 PKSLLRGLEAYVS---KTHYASGC-------HDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
             ++  G++ YV    +   + GC         +    D    EA  +A+ +D VI+  G
Sbjct: 382 YITIQEGIQDYVGDDVRILTSRGCDLFRDRTEHLAFTRDR-IAEAKVVAENSDVVILCMG 440

Query: 526 LDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
           LD T E E+          D+  + LPG Q  L+ ++A T K PV+  L  G  LD+ +A
Sbjct: 441 LDETLEGEEGDTGNSYVSGDKEDIELPGVQRELMEAIADTGK-PVVFCLLAGSDLDLKYA 499

Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRAD 635
                   +LW  YPG  G KA A+++FG+ +P G+LP+T+Y ES  ++P   D +M+  
Sbjct: 500 AEKFDAVMMLW--YPGCQGGKAAAKVLFGEISPSGKLPVTFY-ESLEELPDFTDYSMK-- 554

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
                 GR+YR+      + FG+GL+Y+                   K   DK       
Sbjct: 555 ------GRTYRYMERKAQFPFGYGLTYS-------------------KVAVDKA------ 583

Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
                    EV +C   + +V++ V N G  D   VV ++ +        P   L GF R
Sbjct: 584 ---------EVKTCGQ-KINVEVEVQNNGAYDTEDVVQIYVKNIDSKNAIPNPMLAGFQR 633

Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRI 786
           +   A   ++I   +   +  ++ ++ G+R+
Sbjct: 634 IFLKAGECRKIEIPIWE-KAFTVVDETGKRM 663


>gi|255284060|ref|ZP_05348615.1| beta-glucosidase [Bryantella formatexigens DSM 14469]
 gi|255265405|gb|EET58610.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Marvinbryantia formatexigens DSM 14469]
          Length = 700

 Score =  362 bits (928), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 240/754 (31%), Positives = 366/754 (48%), Gaps = 130/754 (17%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA++L++ +T++EK  QL  +A AI RLGIPAY WW+E+LHG+A  G            T
Sbjct: 9   RAEALVAQMTVEEKASQLKYDAPAIKRLGIPAYNWWNEALHGVARAG----------QAT 58

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +   A+F+ +L   I   +A E RA YN   A        GLTFW+PN+NIFRDP
Sbjct: 59  VFPQAIGLGATFDEALLGEIADVIATEGRAKYNAYAAKEDRDIYKGLTFWSPNVNIFRDP 118

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP + S   V FVK  QG+                            G+
Sbjct: 119 RWGRGHETYGEDPCLTSRLGVAFVKGLQGD----------------------------GE 150

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            +  +AC KH   + +       R+ FNA  + +D E+T+ P F + +++     +M +Y
Sbjct: 151 TMKAAACAKH---FAVHSGPEAVRHEFNAEASAKDMEETYLPAFEALVKEADVEAVMGAY 207

Query: 296 NQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
           N+ NG   C    L +  R +WGF+G+  SDC A+    E+   T T ++SAA  + +G 
Sbjct: 208 NRTNGEACCASPVLQKILREDWGFEGHFVSDCWAIRDFHEHHMLTATAKESAAMAINSGC 267

Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
           D+NCG   L H   A   G V E+ I  A + LF+ +  LGLF+G     +Y  +    V
Sbjct: 268 DLNCGNTYL-HILHAYRDGLVSEETITEAAVRLFTTRFLLGLFDGS----EYDDIPYTVV 322

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
            + EH  LA  AA +  VLLKN+   LPL K  + ++ +IGP  ++ + + G Y G    
Sbjct: 323 ESKEHLALAEKAALESAVLLKNN-GILPLKKERLRTVGVIGPNADSRAALAGNYHGTASR 381

Query: 476 PKSLLRGLEAYVS---KTHYASGCHDVPCNSD----AG--FHEAVRIAKKADFVIVVAGL 526
            +++ +GL+ Y+    +   + GC      ++    AG    EA  +A+ +D VI+  GL
Sbjct: 382 YETIQQGLQDYLGEDVRVLTSVGCALSEDRTEKLALAGDRLAEAQIVAENSDVVILCLGL 441

Query: 527 DLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D T E E+          D+ +LLLP  Q  L+ +VA T K PV+L +  G  LD+S+A 
Sbjct: 442 DETLEGEEGDTGNSYASGDKETLLLPEAQRDLMEAVAATGK-PVVLCMMSGSDLDMSYAA 500

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADS 636
                 +IL + YPG  G  A A+++FG+ +P G+LP+T+Y E+  ++P   D +M+   
Sbjct: 501 --EHFDAILQLWYPGSQGGSAAAKLLFGEVSPSGKLPVTFY-ETLEELPAFEDYSMK--- 554

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSY-----TNYSYKFLSAPSELTISASLKAGSDKNIL 691
                GR+YR+      Y FG GL+Y     T+ + +  SA  +LT++            
Sbjct: 555 -----GRTYRYMGHPAQYPFGFGLTYGDVRVTDANIRGASAEGDLTLA------------ 597

Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
                                     ++  NAG+     V+ ++ +    +   P   L 
Sbjct: 598 --------------------------VTAENAGNAVTDEVLQIYVKCTDSANAVPNPALA 631

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR 785
            F R+H  A   K I   V P    ++ ++ G R
Sbjct: 632 AFGRIHLEAGEKKTIEMTV-PARAFTVVDEAGVR 664


>gi|339499234|ref|YP_004697269.1| beta-glucosidase [Spirochaeta caldaria DSM 7334]
 gi|338833583|gb|AEJ18761.1| Beta-glucosidase [Spirochaeta caldaria DSM 7334]
          Length = 699

 Score =  359 bits (921), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 252/757 (33%), Positives = 367/757 (48%), Gaps = 120/757 (15%)

Query: 66  KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSF 125
           ++LIS ++L+EKI  +   A  IPRLGIP Y WW+E+LHG+A+NG            T F
Sbjct: 13  ETLISNMSLEEKIGLMIHRAKGIPRLGIPDYNWWNEALHGVANNG----------EATVF 62

Query: 126 PQVLVSAASFNRSLWSNIGSAVAVEARAMYN-LGQA-------GLTFWAPNINIFRDPRW 177
           PQ +   A+F+  L   +  A+++EARA +N +G+        GLTFWAPNINIFRDPRW
Sbjct: 63  PQAIALGATFDEDLVHRVAEAISIEARAKFNAVGKEKAEQYHRGLTFWAPNINIFRDPRW 122

Query: 178 GRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDEL 237
           GRGQET GEDP++ S     +V+  QG +                              L
Sbjct: 123 GRGQETYGEDPVLTSRLGTAYVRGLQGSD---------------------------PYYL 155

Query: 238 MLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQ 297
             +AC KH   +   +     R++FNA ++++D E+T+ P F++ ++ G  S +M +YN+
Sbjct: 156 RAAACAKHFAVHSGPE---GLRHTFNAEVSQKDLEETYLPAFKALVKSGVES-VMGAYNR 211

Query: 298 VNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMD 356
           VNG PAC    L  QK R EW F+G++ SDC A+    +    T    +S A  L++G D
Sbjct: 212 VNGEPACGSTYLLKQKLREEWQFQGHVVSDCWAICDFHKNHKVTNDILESIALALRSGCD 271

Query: 357 INCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVC 416
           +NCG     +   A+ KG V E DI+RA++ L     +LGL + D   G Y  +    + 
Sbjct: 272 LNCGDAY-NYLAEAVLKGYVTEDDINRAVVRLLITLDKLGLIHDD---GPYQGITIHQID 327

Query: 417 TSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSP 476
             +H  LAL+AA + IVLLKN+   LPL K+ +S + + GP   N   + G Y G+    
Sbjct: 328 WKKHDSLALEAAEKSIVLLKNN-GVLPLKKDKISYIYVTGPNATNSDALLGNYAGV---S 383

Query: 477 KSLLRGLEAYVSK------THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
             LL  LEA V +        Y  GC       +     A  + K AD  I V G D + 
Sbjct: 384 SRLLTVLEAIVEEAGPEITVTYKKGCPLAERRVNPN-DWASGVTKYADVTIAVMGRDTSV 442

Query: 531 ETEDRDRV---------SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
           E E+ D +          L L  +Q+S +  + + S +P+I+VL GG P  +   E    
Sbjct: 443 EGEEGDAILSSTYGDFEDLNLNDEQLSYLHKL-KESGKPLIVVLMGGAP--ICSPELHEI 499

Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQY 640
             +IL   YPG+AG  A++ I+FG  NP G+LP+T +P+S  ++P   + +M+       
Sbjct: 500 ADAILVAWYPGQAGGTAVSNIVFGKTNPSGKLPVT-FPKSVRQLPEFENYSMQ------- 551

Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDY 700
            GR+YR+ T   +Y FG GLSYT   +K ++                         R   
Sbjct: 552 -GRTYRYMTEEPLYPFGFGLSYTKMEFKHVTG------------------------RWKS 586

Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
              DE+   T L         N G +DG  VV L+          P   LI F RV   A
Sbjct: 587 PEKDELIVSTELY--------NQGTIDGEEVVQLYYHWKDAPFAVPNWSLIDFKRVLVAA 638

Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             S    F + P E+L   +  G+ ++P G     VG
Sbjct: 639 GASCICEFKI-PLEKLQCIDPSGKGVIPTGTLQFYVG 674


>gi|288870210|ref|ZP_06113312.2| beta-glucosidase [Clostridium hathewayi DSM 13479]
 gi|288868024|gb|EFD00323.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
          Length = 730

 Score =  358 bits (919), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 213/630 (33%), Positives = 321/630 (50%), Gaps = 79/630 (12%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +A+ L+  +TL+EK+ Q  + A AI RLGI AY WW+E LHG+A  G            T
Sbjct: 24  KAEYLVKQMTLEEKVFQTMNQAPAIERLGIKAYNWWNEGLHGVARAG----------VAT 73

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIFRDP 175
            FPQ +  AA+F+  L   +G AV+ EARA Y++ Q         GLT WAPNINIFRDP
Sbjct: 74  IFPQAIGLAATFDEDLIETVGEAVSTEARAKYHMQQRYGDTDIYKGLTLWAPNINIFRDP 133

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP + S   + +++  QG + K                           
Sbjct: 134 RWGRGHETYGEDPWLTSRLGIRYIRGLQGSHEKY-------------------------- 167

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L  +AC KH   +   +     R+SF+A ++E+D  +T+ P F +C++ G    +M +Y
Sbjct: 168 -LKTAACVKHFAVHSGPE---ELRHSFDAEVSEKDLRETYLPAFEACVKDGDVEAVMGAY 223

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N+VNGVP C    L +   R EWGF G++ SDC A+    E    T +  +S +  +  G
Sbjct: 224 NRVNGVPCCGNEYLLETILRKEWGFHGHVVSDCWAIKDFHEGHGVTDSPVESVSMAMNHG 283

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            D+NCG  +  +   A+ +GKV+E+ +D A++ LF+ +L+LG          Y  +   +
Sbjct: 284 CDLNCGN-LFTYLIQAVKEGKVKEERLDEAVIRLFTTRLKLGALGKMEEDDPYAGISYLE 342

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           V +   KKL   AA + +VLLKN +  LP++     ++ +IGP  ++   + G Y G   
Sbjct: 343 VDSPAMKKLNRSAAGKSVVLLKNTEGLLPIDTKRYKTIGVIGPNADSRRALVGNYEGTAS 402

Query: 475 SPKSLLRGLEAYV---SKTHYASGCHDVPCN------SDAGFHEAVRIAKKADFVIVVAG 525
              ++L G+       ++  Y+ GCH    N       +    E   I +++D VI   G
Sbjct: 403 EYVTVLEGIREAAEPEARVLYSEGCHLYKSNVSGLGARNDRLSEVKGICRESDIVIACMG 462

Query: 526 LDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
           LD T E E           D+  L+LPG Q  ++ + A  S +PV+LVL  G  + V++ 
Sbjct: 463 LDSTLEGEQGDTGNIYAGGDKPDLMLPGLQQKILET-AYDSGKPVVLVLLAGSAMAVTW- 520

Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
            AD  + +IL   YPG  G + +A+++FG  NP GRLP+T+Y  +       + +M    
Sbjct: 521 -ADEHLPAILTAWYPGAEGGRGVADVLFGTVNPEGRLPVTFYRTTEELPDFTNYSME--- 576

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYS 666
                GR+YRF     +Y FG GLSYT +S
Sbjct: 577 -----GRTYRFMKQKALYPFGFGLSYTEFS 601


>gi|336408348|ref|ZP_08588841.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
 gi|423248801|ref|ZP_17229817.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
           CL03T00C08]
 gi|423253750|ref|ZP_17234681.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
           CL03T12C07]
 gi|335937826|gb|EGM99722.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
 gi|392655379|gb|EIY49022.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
           CL03T12C07]
 gi|392657742|gb|EIY51373.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
           CL03T00C08]
          Length = 722

 Score =  358 bits (919), Expect = 7e-96,   Method: Compositional matrix adjust.
 Identities = 229/745 (30%), Positives = 365/745 (48%), Gaps = 97/745 (13%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           + S  ++ R ++LI  +TL EK+ QL   + +IPRL +PAY +W+E LHG+A  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V FVK  QG                           D   
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E     +R+S ++ I  +   + + P + +C+++  A  +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NGVP      L     R EWGF G++ SDC A+  +        + E++AA  + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
            D+ CGT        A+++G + E  IDRAL  + + + +LG F  DP +   Y      
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            +   +  +LA +AA + +VLLKND   LPLNK  + S+A++GP  +      GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426

Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
               SLL+G++  + K    +  + +  ++D+      ++ K AD V+V  G D     E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
           + D  S+ LP +Q  L+  + + + R ++LV   G PL   +  AD+ I +I+   YPG+
Sbjct: 483 NHDMPSIYLPEEQEKLLKEIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
              +ALA ++FG+ NP G+LPMT Y        + D +M       + GR+YR+  G  +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           YGFGHGLSYT++ +  +                            D +  D +  C+   
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQSDAILQCS--- 624

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
               + ++N+G + G  VV ++         T P K+L+ F +V   +   K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
            E LS+      R+L  G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703


>gi|365120422|ref|ZP_09338009.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363647477|gb|EHL86692.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 735

 Score =  358 bits (918), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 247/757 (32%), Positives = 357/757 (47%), Gaps = 112/757 (14%)

Query: 50  DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
           +++PF N  LS   R   L+S LTL+EKI Q+ + A AI RLGIPAY+WW+E LHG+   
Sbjct: 25  NTFPFQNPDLSFEKRVDDLVSRLTLEEKISQMLNKAPAIERLGIPAYDWWNECLHGVGR- 83

Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------G 161
                   T   VT FPQ +  AA+++ +L+  + S++A E RA+Y+   +        G
Sbjct: 84  --------TPYKVTVFPQAIGMAATWDDALFQQVASSIADEGRAIYHDAISKGVHEIYHG 135

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LT+W PNINIFRDPRWGRGQET GEDP +       FV   QG++ K             
Sbjct: 136 LTYWTPNINIFRDPRWGRGQETYGEDPYLTGTLGKAFVNGLQGDDPKY------------ 183

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
                          L  SAC KH   Y +      SR+ FN  ++  D  DT+ P FR 
Sbjct: 184 ---------------LKASACAKH---YAVHSGPEISRHFFNTEVSMYDLWDTYLPAFRD 225

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT 340
            +   K S +MC+YN + G P C    L Q   R +W F GY+TSDC A+    +++ + 
Sbjct: 226 LVVDAKVSSVMCAYNALAGQPCCGNDLLMQDILRKQWKFTGYVTSDCGAIDDFLKHKTHA 285

Query: 341 KTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
                SA  VL  G D+ CG  +      A+ +G + E  ID ++  LF  + RLGLF+ 
Sbjct: 286 DAAHASADAVLH-GTDLECGQNIYVKLVDAVKQGLITEAQIDESVKRLFMTRFRLGLFDP 344

Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
             R  KY       +   EHK LAL  +R+ +VLLKND   LPL KN +  +A+IGP  +
Sbjct: 345 ADRV-KYADTPLSVLECDEHKALALKMSRESVVLLKND-NVLPLRKN-LKKIAVIGPNAD 401

Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVSKTH---YASGCHDVPCNSDAGFHEAVRIAKKA 517
           + + + G Y G P    + L  + + V K     Y      V  + +   +  +   K  
Sbjct: 402 DSTVVLGNYNGFPSKVITPLEAIRSKVGKRTQVIYDRAIDCVKPSDEKTLNALIERLKGV 461

Query: 518 DFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
           D VI V G+    E E+           DR ++ LP  Q  L+  + + +  PVI V+  
Sbjct: 462 DQVIFVGGISPRLEGEELPISVDGFRGGDRTTIALPEVQTELMKKM-KEAGLPVIFVMMT 520

Query: 568 GGPLDVSFAEADSQ-ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
           G  L +   E +SQ I +IL   Y G+   +A+A+++FGD+NP G+LP+T+Y       P
Sbjct: 521 GSALGI---EWESQNIPAILNAWYGGQFAGQAIADVLFGDYNPSGKLPVTFYRSDSDLPP 577

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
               +M          R+YR++ G  +Y FG GLSYT + Y      S   +   +K   
Sbjct: 578 FGAFSM--------ANRTYRYFKGEALYPFGFGLSYTMFDYSVPQVVSGGKVGEPIK--- 626

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                                        V + V N G  +G  VV L+     V +  P
Sbjct: 627 -----------------------------VSVKVKNIGKKNGDEVVQLYLSHEGVEKA-P 656

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
              L GF RV+  A   K +SF + P   +S+ + +G
Sbjct: 657 ITALKGFKRVYLKAGEEKTLSFEISP-RDMSLPDDNG 692


>gi|375357164|ref|YP_005109936.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
 gi|301161845|emb|CBW21389.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
          Length = 722

 Score =  358 bits (918), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 229/745 (30%), Positives = 365/745 (48%), Gaps = 97/745 (13%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           + S  ++ R ++LI  +TL EK+ QL   + +IPRL +PAY +W+E LHG+A  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V FVK  QG                           D   
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E     +R+S ++ I  +   + + P + +C+++  A  +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NGVP      L     R EWGF G++ SDC A+  +        + E++AA  + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
            D+ CGT        A+++G + E  IDRAL  + + + +LG F  DP +   Y      
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            +   +  +LA +AA + +VLLKND   LPLNK  + S+A++GP  +      GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426

Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
               SLL+G++  + K    +  + +  ++D+      ++ K AD V+V  G D     E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
           + D  S+ LP +Q  L+  + + + R ++LV   G PL   +  AD+ I +I+   YPG+
Sbjct: 483 NHDMPSIYLPEEQEKLLKKIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
              +ALA ++FG+ NP G+LPMT Y        + D +M       + GR+YR+  G  +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           YGFGHGLSYT++ +  +                            D +  D +  C+   
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
               + ++N+G + G  VV ++         T P K+L+ F +V   +   K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
            E LS+      R+L  G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703


>gi|423258868|ref|ZP_17239791.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
           CL07T00C01]
 gi|423264161|ref|ZP_17243164.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
           CL07T12C05]
 gi|387776448|gb|EIK38548.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
           CL07T00C01]
 gi|392706427|gb|EIY99550.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
           CL07T12C05]
          Length = 722

 Score =  358 bits (918), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 229/745 (30%), Positives = 365/745 (48%), Gaps = 97/745 (13%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           + S  ++ R ++LI  +TL EK+ QL   + +IPRL +PAY +W+E LHG+A  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V FVK  QG                           D   
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E     +R+S ++ I  +   + + P + +C+++  A  +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NGVP      L     R EWGF G++ SDC A+  +        + E++AA  + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
            D+ CGT        A+++G + E  IDRAL  + + + +LG F  DP +   Y      
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            +   +  +LA +AA + +VLLKND   LPLNK  + S+A++GP  +      GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426

Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
               SLL+G++  + K    +  + +  ++D+      ++ K AD V+V  G D     E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
           + D  S+ LP +Q  L+  + + + R ++LV   G PL   +  AD+ I +I+   YPG+
Sbjct: 483 NHDMPSIYLPEEQEKLLKEIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
              +ALA ++FG+ NP G+LPMT Y        + D +M       + GR+YR+  G  +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           YGFGHGLSYT++ +  +                            D +  D +  C+   
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
               + ++N+G + G  VV ++         T P K+L+ F +V   +   K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
            E LS+      R+L  G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703


>gi|265765457|ref|ZP_06093732.1| beta-xylosidase [Bacteroides sp. 2_1_16]
 gi|263254841|gb|EEZ26275.1| beta-xylosidase [Bacteroides sp. 2_1_16]
          Length = 722

 Score =  358 bits (918), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 229/745 (30%), Positives = 365/745 (48%), Gaps = 97/745 (13%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           + S  ++ R ++LI  +TL EK+ QL   + +IPRL +PAY +W+E LHG+A  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V FVK  QG                           D   
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E     +R+S ++ I  +   + + P + +C+++  A  +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NGVP      L     R EWGF G++ SDC A+  +        + E++AA  + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
            D+ CGT        A+++G + E  IDRAL  + + + +LG F  DP +   Y      
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            +   +  +LA +AA + +VLLKND   LPLNK  + S+A++GP  +      GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426

Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
               SLL+G++  + K    +  + +  ++D+      ++ K AD V+V  G D     E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
           + D  S+ LP +Q  L+  + + + R ++LV   G PL   +  AD+ I +I+   YPG+
Sbjct: 483 NHDMPSIYLPEEQEKLLKKIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
              +ALA ++FG+ NP G+LPMT Y        + D +M       + GR+YR+  G  +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           YGFGHGLSYT++ +  +                            D +  D +  C+   
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
               + ++N+G + G  VV ++         T P K+L+ F +V   +   K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
            E LS+      R+L  G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703


>gi|451996250|gb|EMD88717.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
           C5]
          Length = 763

 Score =  358 bits (918), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 234/754 (31%), Positives = 364/754 (48%), Gaps = 67/754 (8%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
            C+ +     RA +L++ +  QEK+  L   +  + RLG+PAY WW E+LHG+A   PG+
Sbjct: 36  ICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEALHGVAG-APGI 94

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFR 173
            F     + TSFP  ++ +A+F+  L   I + +  EARA  N G A + +W P+IN  R
Sbjct: 95  KFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVAPMDYWTPDINPVR 154

Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           D RWGR  E+PGED   +  Y    +   +G+              +++++         
Sbjct: 155 DIRWGRASESPGEDIRRIKGYTKALLAGLEGDQ------------AQRKII--------- 193

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
                  A CKH + YD+E WG + R++F+A IT QD  + + PPF+ C    K    MC
Sbjct: 194 -------ATCKHYVGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMC 246

Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           SYN VNGVP C    + Q   R+ W +     YITSDC+AVA I E   Y +T     A 
Sbjct: 247 SYNAVNGVPTCADTYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYVETLAQGTAL 306

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
               GMD++C          A  +G +    ID+AL   +   +  G F+G   K  Y  
Sbjct: 307 AFAKGMDLSCEYSGSSDIPGAWSQGLLNLSVIDKALTRQYEGLVHAGYFDG--AKATYAN 364

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKF-LPLNKNAVSSLAIIGPLVNNISQMGGG 468
           L  +D+ T E ++L+L    +G+V+LKND    LPL K   S +A+IG   N+ S++ G 
Sbjct: 365 LSYNDINTPEARQLSLQVTSEGLVMLKNDHTLPLPLTKG--SKVAMIGFWANDSSKLQGI 422

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASG----CHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
           Y+G P    S +   E        A G       VP N       A+  A+K+D+++   
Sbjct: 423 YSGPPPYRHSPVFAGEQMGLDMAIAWGPMIQNSSVPDNWTT---NALDAAEKSDYILYFG 479

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           G D T   E  DR ++  P  Q+ L+  +A+  K  V++ L  G   D S   +   I+S
Sbjct: 480 GQDWTVAQEGYDRTTISFPQVQIDLLAKLAKLGKPLVVITL--GDMTDHSPLLSMEGINS 537

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRS 644
           I+W  +PG+ G  A+  +I G   P GRLP+T YP  + K+ M DMN+R  +  + PGR+
Sbjct: 538 IIWANWPGQDGGPAILNVISGVHAPAGRLPITEYPADYVKLSMLDMNLRPHA--ESPGRT 595

Query: 645 YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHID 704
           YR++    V  FG GL YT +   F    SE  ++  ++   D    Q       Y  + 
Sbjct: 596 YRWFN-ESVQPFGFGLHYTTFEAGFA---SEEGLTYDIQETLDSCTQQ-------YKDLC 644

Query: 705 EVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK 764
           EV         ++++V N G+     V + F +     +  P K LI + R+  +  G+K
Sbjct: 645 EVAP-------LEVTVANKGNRTSDFVALAFIKGEVGPKPYPLKTLITYGRLRDIHGGAK 697

Query: 765 EISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           + +       +L+  ++ G  ++  G + L++ E
Sbjct: 698 KSASLPLTLGELARVDQSGNTVIYPGEYTLLLDE 731


>gi|291530120|emb|CBK95705.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum 70/3]
          Length = 689

 Score =  357 bits (917), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 249/775 (32%), Positives = 376/775 (48%), Gaps = 143/775 (18%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +  LS   RA +L   L+ +E+ QQL  +A AI + G+P+Y WW+E LHG+A  G   
Sbjct: 4   YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAG--- 60

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                  + T FPQ +  AA+F++ +   +G  V+ EARAMYN            GLT W
Sbjct: 61  -------TATVFPQAIALAAAFDKDMMCRVGEVVSTEARAMYNSAAKHGDTDIYKGLTLW 113

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           APNINIFRDPRWGRG ET GEDP + S   V FVK  QGE              EK    
Sbjct: 114 APNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGE--------------EKY--- 156

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQPPFRSCIE 284
                      L  +AC KH   +     G  S R+ F+A ++E+D E+T+ P F++ ++
Sbjct: 157 -----------LRAAACAKHFAVHS----GPESLRHEFDARVSEKDLEETYLPAFKALVK 201

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           +G+   +M +YN+VNG P+C    L  K R EWGF GY  SDC A+         T T  
Sbjct: 202 EGRVEGVMGAYNRVNGEPSCASEKLMGKLR-EWGFDGYFVSDCGAIRDFHTNHKITDTAP 260

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            SAA  LKAG D+NCG   L H  +A+++G + ++DI  A ++    ++RLG  + +   
Sbjct: 261 QSAAMALKAGCDVNCGNTYL-HILAALEEGLITKQDIRTACIHALRTRIRLGQLDDN--- 316

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
            ++  L  D +    +K L+L+AA + +VLL ND   LPL+K+ +SS+A+IGP  ++ + 
Sbjct: 317 -EFDDLPFDIIACDGNKALSLEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAA 374

Query: 465 MGGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH---------DVPCNSDAGFHEAVRIA 514
           + G Y G P    + L G++ A+  + +YA GC           +P +    + EAV   
Sbjct: 375 LLGNYEGTPDRSVTFLEGIQDAFDGRVYYAEGCQLFRDRTQGLALPGDR---YAEAVAAC 431

Query: 515 KKADFVIVVAGLDLTQETED-------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
           + AD  +V  GLD T E E+        D+  L LP  Q  L+  +  T K P+I+VL  
Sbjct: 432 EAADVTVVCVGLDSTLEGEEGDTENKSGDKPDLRLPEVQRVLLQKLKDTGK-PLIIVLAA 490

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G  ++    E ++ I++  W  YPG+ G KALAEI+FG+ +P G+LP+T+Y  +      
Sbjct: 491 GSSVNTE-CEGNALINA--W--YPGQYGGKALAEILFGEVSPSGKLPVTFYKSADMLPDF 545

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
            D +M+         R+YRF       +Y FG+GL+Y+++                    
Sbjct: 546 TDYSMK--------NRTYRFCDDESNVLYPFGYGLTYSHF-------------------- 577

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFH---VQISVTNAGDVDGSHVVMLFARVPKVS 742
                                  C  + +    + ++VTN G      V+ ++ R     
Sbjct: 578 ----------------------ECGDISYKDNTLAVNVTNTGSRSAEDVLQVYIR---SE 612

Query: 743 QGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            G     L  F+RV      S+ IS  + P       + +G R +  G + L  G
Sbjct: 613 NGVKNHSLCAFERVSLFDGESRTISINI-PEGAFETVDDNGVRAVRSGRYTLYAG 666


>gi|291544853|emb|CBL17962.1| Beta-glucosidase-related glycosidases [Ruminococcus champanellensis
           18P13]
          Length = 697

 Score =  357 bits (916), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 224/634 (35%), Positives = 336/634 (52%), Gaps = 87/634 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N SL+   RA+ L   LT++E+  QL  +A  IPRLGIPAY WW+E LHG+A  G   
Sbjct: 9   YLNPSLTPDERAEDLADRLTVEEQASQLRYDALPIPRLGIPAYNWWNEGLHGVARAG--- 65

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFW 165
                  + T FPQ +  AA+F+ +L   IG   A EARA +   +         GLT W
Sbjct: 66  -------TATMFPQAIGMAATFDTALLHQIGEITATEARAKHMAAREHGDFDIYKGLTLW 118

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           APNIN+FRDPRWGRG ET GEDP + +   V FVK  QGE                +VLK
Sbjct: 119 APNINLFRDPRWGRGHETYGEDPFLTARLGVAFVKGMQGEG---------------KVLK 163

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                         +AC KH   + +       R+SF+A ++ +D E+++ P F + + +
Sbjct: 164 A-------------AACAKH---FAVHSGPEALRHSFDAQVSPKDLEESYLPAFHALVAE 207

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
            K   +M +YN+VNG P+C    L  K  ++WGF GY  SDC A+    ++   TK   +
Sbjct: 208 AKVEGVMGAYNRVNGEPSCASPMLMDKL-HQWGFAGYFVSDCWAIQDFHKHHGVTKNVTE 266

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           SAA  L+ G D+NCG   L +  +A+++G +   DI RA + +   ++RLGLF+ +P   
Sbjct: 267 SAALALRTGCDLNCGNTYL-YVLAALEEGLIDAADIRRACIRVLRTRIRLGLFDPEPH-- 323

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            +     D + +  HK ++L  A + +VLLKND   LPL+ + + ++A+IGP  ++ + +
Sbjct: 324 -FAACTYDTIASPAHKAVSLSCAEKSMVLLKND-GILPLDLSKLHAIAVIGPNADSRAAL 381

Query: 466 GGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH---DVPCN---SDAGFHEAVRIAKKAD 518
            G Y G      + L G++ A+  + HYA GCH   D   N   +D  + EA+  A+ +D
Sbjct: 382 EGNYCGTADRYVTFLEGIQDAFPGRVHYAQGCHLYKDRTSNLAMADDRYAEALAAAEASD 441

Query: 519 FVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
            VI+  GLD T E E+          D+  L LP  Q  L+  +    K PVILVL  G 
Sbjct: 442 VVILCLGLDATLEGEEGDTGNEFSSGDKADLRLPPPQCKLLEKLHAVGK-PVILVLAAGS 500

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MN 628
            L+   +      +++L   YPG+ G +ALA I+FG  +P G+LP+T+Y E+  ++P   
Sbjct: 501 ALNPEIS-----CNAVLQAWYPGQCGGQALAHILFGKVSPSGKLPVTFY-ETAEQLPDFT 554

Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           D +M+         R+YR+     +Y FG+GL+Y
Sbjct: 555 DYSMQ--------NRTYRYARNNVLYPFGYGLTY 580


>gi|451851086|gb|EMD64387.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
          Length = 763

 Score =  357 bits (916), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 240/792 (30%), Positives = 376/792 (47%), Gaps = 70/792 (8%)

Query: 16  FRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQ 75
              L   VV+     S S   +  PD  C      +   C+ +     RA +L++ +  Q
Sbjct: 1   MHFLSTAVVLGASLISKSHAAIG-PD--CTNGPLSTNAICDVNAPPHERAAALVAAMEPQ 57

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           EK+  L   +  + RLG+PAY WW E+LHG+A   PG+ F     + TSFP  ++ +A+F
Sbjct: 58  EKLDNLVSKSKGVSRLGLPAYNWWGEALHGVAG-APGIKFVEPYKNATSFPMPILMSAAF 116

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
           +  L   I + +  EARA  N G A + +W P+IN  RD RWGR  E+PGED   +  Y 
Sbjct: 117 DDDLIFKIANIIGNEARAFGNGGVAPVDYWTPDINPVRDIRWGRASESPGEDIRRIKGYT 176

Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
              +   +G+              +++++                A CKH + YD+E WG
Sbjct: 177 KALLAGLEGDQ------------AQRKII----------------ATCKHYVGYDMEAWG 208

Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
            + R++F+A IT QD  + + PPF+ C    K    MCSYN VNG+P C    + Q   R
Sbjct: 209 GYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGIPTCADTYVLQTILR 268

Query: 315 NEWGF---KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAI 371
           + W +     YITSDC+AVA I E   Y +T     A     GMD++C          A 
Sbjct: 269 DHWNWTDSNNYITSDCEAVADISENHKYVETLAQGTALAFAKGMDLSCEYTGSSDIPGAW 328

Query: 372 DKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQG 431
            +G +    ID+AL   +   +  G F+G   K  Y  L   D+ T E ++L+L    +G
Sbjct: 329 AQGLLNISVIDKALTRQYEGLVHAGYFDG--AKATYANLSYKDINTPEARQLSLQVTSEG 386

Query: 432 IVLLKNDKKF-LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKT 490
           +V+LKND    LPL K   S +A+IG   N+ S++ G Y+G P    S +   E      
Sbjct: 387 LVMLKNDHTLPLPLTKG--SKVAMIGFWANDSSKLQGIYSGPPPYRHSPVFAGEQMGLDM 444

Query: 491 HYASG----CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
             A G       VP N       A+  A+K+D+++   G D T   E  DR ++  P  Q
Sbjct: 445 AIAWGPMIQNSSVPDNWTT---NALDAAEKSDYILYFGGQDWTVAQEGYDRTTISFPQVQ 501

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
           + L+T +A+  K  V++ L  G   D S   +   ++SI+W  +PG+ G  A+  ++ G 
Sbjct: 502 IDLLTKLAKLGKPLVVITL--GDMTDHSPLLSMEGVNSIIWANWPGQDGGPAILNVVSGA 559

Query: 607 FNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYS 666
             P GRLP+T YP  + K+ M DMN+R  +  + PGR+YR++  + V  FG GL YT + 
Sbjct: 560 HAPAGRLPITEYPADYVKLSMLDMNLRPHT--ESPGRTYRWFNES-VQPFGFGLHYTTFE 616

Query: 667 YKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDV 726
             F    SE  ++  ++   D       G    Y  + EV         ++++V N G+ 
Sbjct: 617 ASFA---SEEGLTYDIEEILD-------GCTQQYKDLCEVAP-------LEVTVANKGNR 659

Query: 727 DGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRI 786
               V + F +     +  P K LI + R+  +  G+K+ +       +L+  ++ G  +
Sbjct: 660 TSDFVALAFIKGEVGPKPYPLKTLITYGRLRDIHGGAKKSASLPLTLGELARVDQSGNTV 719

Query: 787 LPLGNHVLMVGE 798
           +  G + L++ E
Sbjct: 720 IYPGEYTLLLDE 731


>gi|317474362|ref|ZP_07933636.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909043|gb|EFV30723.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 723

 Score =  357 bits (916), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 252/780 (32%), Positives = 378/780 (48%), Gaps = 136/780 (17%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N SLS + RA  L+S LTL+EKI  + +N+SA+ RLGI  YEWW+E+LHG+A NG  
Sbjct: 24  PYQNKSLSPTERAADLVSRLTLEEKITLMQNNSSAVKRLGIKPYEWWNEALHGVARNG-- 81

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTF 164
                     T +PQ +   ASFN +L   + ++++ EAR  Y   +         GLTF
Sbjct: 82  --------LATVYPQAIGMGASFNDTLLYQVFTSISDEARVKYRQAREAGNYKRYTGLTF 133

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKRV 223
           W PNINIFRDPRWGRGQET GEDP + S   +  V   QG +N K +             
Sbjct: 134 WTPNINIFRDPRWGRGQETYGEDPYLTSRMGLSVVNGLQGPQNTKYNK------------ 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                            AC KH   +   +W   +R+SFNA  I  +D  +T+ P F+  
Sbjct: 182 ---------------THACAKHYAVHSGPEW---NRHSFNAENINPRDLWETYLPAFQDL 223

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           + QG    +MC+YN+  G P C    L     RNEW +KG + SDC A+   F ++   +
Sbjct: 224 VIQGNVKEVMCAYNRFEGDPCCGSDRLLINILRNEWNYKGLVVSDCGAIDN-FYFKGRHE 282

Query: 342 THE---DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH+   D++A  + +G D+ CG        SA+ +G + E  ID++L  L   +  LG  
Sbjct: 283 THKNKADASAAAVLSGTDLECGRSYTGLI-SAVKEGLINESAIDQSLCRLMKARFELGEM 341

Query: 399 NGDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           +       + +L PD + +   H++LAL  AR+ + LL+N K  LPL+K    ++A+IGP
Sbjct: 342 DD---TTPWDQL-PDSLLSCHAHQQLALQMARESMTLLQNHKNILPLDKEM--TVALIGP 395

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGCHDVPCNSDAGFH-----E 509
             N+       Y G P    +LL GL  Y+ +    +      +V       ++      
Sbjct: 396 NANDSVMQWANYNGFPVHTITLLEGLTQYLPQERLIYIPQKNIEVQKYPWVNYYPNDIQA 455

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRD----------RVSLLLPGQQMSLVTSVARTSKR 559
            +  A KAD +I   G+  + E E+ D          R ++ LP  Q  LV ++  T K 
Sbjct: 456 VINQAAKADVIIYAGGISASLEGEEMDVDAEGFRGGDRTTIELPNVQRKLVKALKATGK- 514

Query: 560 PVILVLTGGGPLDVSFAEADSQI-SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
           P++ V   G  + +   + +SQI  +IL   YPG+AG  A+AE++FGD+NP GRLP+T+Y
Sbjct: 515 PIVFVNFSGCAMGL---QPESQICDAILQAWYPGQAGGTAIAEVLFGDYNPAGRLPITFY 571

Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
            +        D NM+        GR+YR+     +Y FGHGLSYT +SY   S P     
Sbjct: 572 KKDNQLPDFEDYNMQ--------GRTYRYLNYEPLYPFGHGLSYTTFSY---STP----- 615

Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
                         + G                    +++ VTN+G+ +G  V+ L+ + 
Sbjct: 616 ------------FIENGK-------------------LKVKVTNSGNYNGDEVIQLYIKR 644

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL-GNHVLMVG 797
                G P K L GF R+H  A  + E+SF +   +  +  +K    + PL G + ++VG
Sbjct: 645 YDDPDG-PLKTLRGFQRIHIPAGQTSEVSFPLT-SDTFTWWDKDSNTVHPLQGRYKILVG 702


>gi|383117083|ref|ZP_09937830.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
 gi|251947612|gb|EES87894.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
          Length = 722

 Score =  357 bits (916), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 229/745 (30%), Positives = 365/745 (48%), Gaps = 97/745 (13%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           + S  ++ R ++LI  +TL EK+ QL   + +IPRL +PAY +W+E LHG+A  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V FVK  QG                           D   
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E     +R+S ++ I  +   + + P + +C+++  A  +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NGVP      L     R EWGF G++ SDC A+  +        + E++AA  + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
            D+ CGT        A+++G + E  IDRAL  + + + +LG F  DP +   Y      
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEVAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            +   +  +LA +AA + +VLLKND   LPLNK  + S+A++GP  +      GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426

Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
               SLL+G++  + K    +  + +  ++D+      ++ K AD V+V  G D     E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
           + D  S+ LP +Q  L+  + + + R ++LV   G PL   +  AD+ I +I+   YPG+
Sbjct: 483 NHDMPSIYLPEEQEKLLKKIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
              +ALA ++FG+ NP G+LPMT Y        + D +M       + GR+YR+  G  +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           YGFGHGLSYT++ +  +                            D +  D +  C+   
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
               + ++N+G + G  VV ++         T P K+L+ F +V   +   K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
            E LS+      R+L  G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703


>gi|443695317|gb|ELT96258.1| hypothetical protein CAPTEDRAFT_179825 [Capitella teleta]
          Length = 750

 Score =  357 bits (915), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 249/782 (31%), Positives = 380/782 (48%), Gaps = 107/782 (13%)

Query: 16  FRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQ 75
            R L++ VV   +  + S    N   F       DS+PF N SL I TR   LIS LT++
Sbjct: 1   MRALRVTVVCLTLISTASFSQDNVLRFAPSSHALDSFPFRNVSLPIETRLNDLISRLTIE 60

Query: 76  EKIQQLSDN----ASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           + I Q           I RLGI   E+ +E L G+              + T FPQ L  
Sbjct: 61  DAINQTVARYGKFTPGIERLGIKPIEYITECLRGVRR-----------ENATGFPQALGL 109

Query: 132 AASFNRSLWSNIGSAVAVEARAMYN-------LGQAGLTFWAPNINIFRDPRWGRGQETP 184
           AASF+R L   + +AV+VE RA YN        G  G+T ++P INI R P WGR QET 
Sbjct: 110 AASFSRDLMQRVATAVSVEVRAFYNHDIQRETYGAHGITCFSPVINILRHPLWGRNQETY 169

Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
           GEDP +    A ++V   QG++ +                            L +SA CK
Sbjct: 170 GEDPYLSGELASQYVSGLQGDDPRY---------------------------LRVSAGCK 202

Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
           H  A+         ++ F+A I E+D + TF P F+ CI   K   +MCS+N +NGVP+C
Sbjct: 203 HFDAHGGPDTIPVRKFGFDAKIEERDLQMTFLPAFKKCI-AAKPYNVMCSFNSINGVPSC 261

Query: 305 LRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN-CGTC 362
               L     R +WG++G++ SD  AV  IF   +Y  + E +A   +K+G ++   G  
Sbjct: 262 ANKRLLTDVLRAQWGYEGFVVSDDAAVEYIFTEHHYNSSFETAAVEAIKSGCNMELVGKF 321

Query: 363 MLRHTQ--SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEH 420
              + Q   A+++  + + ++   +  +F  +  LG F+  P    + ++  D V ++EH
Sbjct: 322 DPSYWQLTKALNEHLITKDELMENVRPVFLTRFLLGEFD-PPALNPFNQITKDVVLSAEH 380

Query: 421 KKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKS 478
           ++LAL+AA +  VLLKND+ FLPL KN++ ++A++GP+ N    + G Y+    P    +
Sbjct: 381 QRLALEAAVKSFVLLKNDRNFLPLLKNSLKTVAVVGPMSNYTDGLIGDYSTDTDPSLILT 440

Query: 479 LLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRV 538
            L G++       +ASGC +  C +D    +       A  V V  G     E E+ DR 
Sbjct: 441 PLHGIKKLAPNVQFASGCSNSTC-TDYRATDVAAAVDGAQVVFVALGTGFIVEAENNDRS 499

Query: 539 SLLLPGQQMSLVT-SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAK 597
            ++LPG Q+ L+  +V   + RPV+L+L  GGPLDV+FA+  S I SI+   +P     +
Sbjct: 500 DIVLPGAQLQLLKDAVYHANGRPVVLLLFNGGPLDVTFAQLTSGIVSIVECFFPAMMTGE 559

Query: 598 ALAEIIF---GDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
           A+  ++    G  +P GRLP+TW P    +VP + D  M+        GR+YR+YT   +
Sbjct: 560 AIYRMLINNEGISSPAGRLPLTW-PAYLNQVPNITDYTMK--------GRTYRYYTEDPL 610

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           Y FG+GLSYT + Y      S+L ++                         EVT    +R
Sbjct: 611 YPFGYGLSYTQFKY------SDLKVTPL-----------------------EVTKGQEIR 641

Query: 714 FHVQISVTNAG--DVDGSHVVMLFARV--PKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
             V++ VTN G  D D   ++++ A V  PK     P  QL+ FDR+H  +  S+ +   
Sbjct: 642 --VKVKVTNIGLYDADEVRIIVVQAYVSWPKTEIPVPRWQLVAFDRIHIASGKSETVELT 699

Query: 770 VD 771
           ++
Sbjct: 700 IE 701


>gi|268610157|ref|ZP_06143884.1| glycoside hydrolase family 3 protein [Ruminococcus flavefaciens
           FD-1]
          Length = 690

 Score =  356 bits (914), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 252/775 (32%), Positives = 371/775 (47%), Gaps = 140/775 (18%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + SLS   RA+ L + LTL+E+  QL  +A A+ RL IPAY WWSE LHG+A  G   
Sbjct: 4   YKDKSLSAQERAEDLTNRLTLEEQASQLKYDAPAVDRLDIPAYNWWSEGLHGVARAG--- 60

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                  + T FPQ +  AA F+    + +GS +  EARA YN   A        GL  W
Sbjct: 61  -------TATMFPQAIGLAAMFDEEAMNKVGSIIGDEARAKYNEYSAHGDHDIYKGLCLW 113

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PN+NIFRDPRWGRGQET GEDP + +   V F K  QGE                    
Sbjct: 114 SPNVNIFRDPRWGRGQETYGEDPYLTTRLGVAFAKGLQGE-------------------- 153

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                   G+ L  +AC KHL  +   +     R+ F+A+ + +D E+T+ P F + +++
Sbjct: 154 --------GEVLKTAACAKHLAVHSGPE---AIRHEFDAVASPKDMEETYLPAFEALVKE 202

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
            K   +M +YN+VNG PAC    L  K  +EWGF GY  SDC A+         TKT  +
Sbjct: 203 AKVEGVMGAYNRVNGEPACASKFLMGKL-DEWGFDGYFVSDCWAIRDFHTNHMVTKTAPE 261

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           SAA  LK G D+NCG   L H   A ++G + ++DI +A  +L   ++RLG+F+ +    
Sbjct: 262 SAAMALKLGCDLNCGNTYL-HLLHAYNEGLINDEDIKKACTHLMRTRVRLGMFDDET--- 317

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
           +Y KL    V   E+K  A   + + +V+LKN+   LPL+ + + ++ +IGP  ++   +
Sbjct: 318 EYDKLDYSIVANEENKAYARKCSERSMVMLKNN-GILPLDPSKIKTIGVIGPNADSRPAL 376

Query: 466 GGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCHDVP------CNSDAGFHEAVRIAKKAD 518
            G Y G      + L G++ A+  +  Y+ G H           +D    EA  + + +D
Sbjct: 377 EGNYNGRADRYITFLEGIQDAFGGRVLYSEGSHLYKDRCMGLAVADDRLSEAEIVTEHSD 436

Query: 519 FVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
            V++  GLD T E E+          D+  L LP  Q  LV +V R  K PVI+V   G 
Sbjct: 437 VVVLCVGLDATIEGEEGDTGNEFSSGDKNDLRLPEAQRKLVETVMRKGK-PVIIVTAAGS 495

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MN 628
            ++V   EAD       W  YPG+ G  ALA+I+FG  +P G+LP+T+Y ++ TK+P   
Sbjct: 496 AINV---EADCDALIHAW--YPGQFGGTALADILFGKISPSGKLPVTFYTDT-TKLPEFT 549

Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
           D +M+        GR+YR+     +Y FG+GL+Y+                         
Sbjct: 550 DYSMK--------GRTYRYTQDNILYPFGYGLTYSK------------------------ 577

Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFH---VQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
                             T  + L+F      + VTN GD D   VV  + +  + S   
Sbjct: 578 ------------------TEVSDLKFENGKASVKVTNTGDFDTEDVVQFYIK-GEGSDYV 618

Query: 746 PEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           P   L GF RV  + KG     E++ G    E +   +++GRR       +L  G
Sbjct: 619 PFYSLCGFRRVF-LKKGESTVVEVTLGDSAFEAV---DENGRRSRSAKGFILYAG 669


>gi|423281966|ref|ZP_17260851.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
           615]
 gi|404582453|gb|EKA87147.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
           615]
          Length = 722

 Score =  356 bits (914), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 229/745 (30%), Positives = 364/745 (48%), Gaps = 97/745 (13%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           + S  ++ R ++LI  +TL EK+ QL   + +IPRL +PAY +W+E LHG+A  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V FVK  QG                           D   
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E     +R+S ++ I  +   + + P + +C+++  A  +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NGVP      L     R EWGF G++ SDC A+  +        + E++AA  + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
            D+ CGT        A+++G + E  IDRAL  + + + +LG F  DP +   Y      
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            +   +  +LA +AA + +VLLKND   LPLNK  + S+A++GP  +      GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426

Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
               SLL+G++  + K    +  + +  ++D+      ++ K AD V+V  G D     E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
           + D  S+ LP +Q  L+  + + + R + LV   G PL   +  AD+ I +I+   YPG+
Sbjct: 483 NHDMPSIYLPEEQEKLLKEIYQVNPR-IALVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
              +ALA ++FG+ NP G+LPMT Y        + D +M       + GR+YR+  G  +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           YGFGHGLSYT++ +  +                            D +  D +  C+   
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
               + ++N+G + G  VV ++         T P K+L+ F +V   +   K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
            E LS+      R+L  G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703


>gi|402308386|ref|ZP_10827395.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400375830|gb|EJP28725.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 721

 Score =  356 bits (914), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 240/764 (31%), Positives = 365/764 (47%), Gaps = 121/764 (15%)

Query: 61  ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
           +S  AK +I+ +T+ EKI QL + + AI  LGI  Y+WWSE LHG+  +G          
Sbjct: 30  MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDG---------- 79

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIF 172
             T FPQ +   A+F+ +L   IG AVA E RA +N+ Q        AGLTFW+PN+NIF
Sbjct: 80  RATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVAQKLKNYSRNAGLTFWSPNVNIF 139

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRG ET GEDP++       +V+  QG                           D
Sbjct: 140 RDPRWGRGMETYGEDPLLSGMLGTAYVRGLQG---------------------------D 172

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
               L   AC KH   Y +      +R+  +   + +D  +T+ P F+  ++QG+   +M
Sbjct: 173 DAFYLKTGACAKH---YAVHSGPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVM 229

Query: 293 CSYNQVNGVPA----CLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
            +YN+V G P      L  D+ +K+   WGF G+I SDCDA+   +    Y KT E++ A
Sbjct: 230 SAYNRVYGEPCGGSKYLLTDILRKS---WGFNGHIVSDCDAINDFYGGHRYVKTPEEACA 286

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             +KAG+++ CG    +  Q A+D+G + E D+DRAL  L   +L+LG+   D     Y 
Sbjct: 287 AAIKAGLNVECGHT-FKAMQGALDQGLLAEADLDRALFPLVMTRLKLGILEPD-SACPYN 344

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
                ++C+  H  LAL AA + +VLLKN+   LPL+KN + +L + GP  ++   + G 
Sbjct: 345 SYDESEICSPAHTALALRAADEAMVLLKNN-GILPLDKN-IRTLFVAGPGASDAFYLMGN 402

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC-----NSDAGFHEAVRIAKKADFVIVV 523
           Y G+     + L+G+   VS+    +  +  P            + AV  A  A+  IVV
Sbjct: 403 YFGLSNRYSTYLQGI---VSRVSSGTSVNFRPAFMQITEELNDMNWAVNEACAAEVAIVV 459

Query: 524 AGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
            G +   E E+          DRV + LP  Q++ +  V       +++VLTGG P+D+ 
Sbjct: 460 MGNNGNMEGEEGEAIASASRGDRVGIGLPASQLNYLRRVKARKGGRIVVVLTGGSPIDLR 519

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMR 633
                +    + W  YPG+ G +AL +++FGD N  GRLP+T +P     +P  +D +M 
Sbjct: 520 KISKLADAVVMAW--YPGQEGGEALGDLLFGDKNFSGRLPIT-FPADVDSLPAFDDYSMN 576

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
                   GR+Y++ +G  +Y FG+GLSY   +Y      ++  +   +K G        
Sbjct: 577 --------GRTYKYMSGNVMYPFGYGLSYGRVTY------TDARVVGRIKKGEP------ 616

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                                 V++ +TN GD     V   +   P   +G+P   L+GF
Sbjct: 617 --------------------LAVEVVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGF 656

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            RV    K S +  F + P E+L      G   L  GN+ L +G
Sbjct: 657 RRVSIPPKSSVKAVFKIVP-ERLMTIQSDGSSKLLKGNYTLTIG 699


>gi|319641744|ref|ZP_07996426.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
 gi|317386631|gb|EFV67528.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
          Length = 702

 Score =  356 bits (914), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 239/763 (31%), Positives = 361/763 (47%), Gaps = 115/763 (15%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           R K L++ LTL+EK+  +  ++ AIPRLGIPAY+WW+E+LHG+A          T+  VT
Sbjct: 2   RVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVAR---------TLEKVT 52

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQAG-----LTFWAPNINIFRD 174
            FPQ +  AA+F+      +G   + E RA++N     G+ G     LT+W PNINIFRD
Sbjct: 53  VFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPNINIFRD 112

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRGQET GEDP + +      V+  +GE              +   LK         
Sbjct: 113 PRWGRGQETYGEDPYLTAKMGAAIVRGLEGE--------------DPHYLKSV------- 151

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
                 AC KH   Y +     ++R+SF+A  +  D  DT+ P FR  + + K   +MC+
Sbjct: 152 ------ACAKH---YAVHSGPEYNRHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCA 202

Query: 295 YNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN++NG P C    L     RN+W F GY+TSDC A+    E+      H  + +  L A
Sbjct: 203 YNRLNGQPCCGNDPLLVDILRNQWHFDGYVTSDCWALKDFAEFHKTHPEHTIAMSDALLA 262

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP-RKGKYGKLGP 412
           G D+ CG  +       + KG   E+DI+ +L  LF++  ++G+F  DP  +  Y  +G 
Sbjct: 263 GTDLECGN-LYHLLAEGVKKGLHSERDINVSLSRLFTILFKIGMF--DPAERVPYSSIGR 319

Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
           + +    HK+ A   A++ IVLL+N    LPL+ + + S+A+IGP  +N       Y G 
Sbjct: 320 EVLECEAHKQHAERMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGT 379

Query: 473 P---CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVIVVAGLDL 528
           P    +P   L+       K +Y  G   V    DA  F +    A ++D ++ V+G+  
Sbjct: 380 PSEIVTPYMSLKRRLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISA 439

Query: 529 TQE-------------TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
             E                 DR ++ LP  Q+ L+  + +T  RP+I+V   G  +  SF
Sbjct: 440 DYEGEAGDAGAAGYGGFASGDRTTMQLPLVQIELLKKLKKTG-RPLIIVNMSGSVM--SF 496

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
                   ++L   Y G+A   A+ +++FG  NP GR+P+T Y       P  + +M   
Sbjct: 497 EWESQNADALLQAWYGGQAAGDAIVDVLFGHCNPAGRMPLTTYKSDNDLPPFENYSML-- 554

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
                 GR+YR++ G   Y FG+GLSYT ++Y                            
Sbjct: 555 ------GRTYRYFKGEPRYPFGYGLSYTTFAYS--------------------------- 581

Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG-TPEKQLIGFD 754
              D   +DE  +  + R  V ++V+N GD DG  VV L+   P+  +   P   L GF 
Sbjct: 582 ---DVQCVDETHTGDTAR--VTVTVSNTGDCDGDEVVQLYVVHPQDGRKQIPLCALKGFK 636

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           R+H     S  +SF + P E+L++    G  +   G   L VG
Sbjct: 637 RIHLKRGESTSVSFTLTP-EELALTETDGNLVEKNGQVTLFVG 678


>gi|288924872|ref|ZP_06418809.1| beta-glucosidase [Prevotella buccae D17]
 gi|288338659|gb|EFC77008.1| beta-glucosidase [Prevotella buccae D17]
          Length = 721

 Score =  356 bits (913), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 240/764 (31%), Positives = 366/764 (47%), Gaps = 121/764 (15%)

Query: 61  ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
           +S  AK +I+ +T+ EKI QL + + AI  LGI  Y+WWSE LHG+  +G          
Sbjct: 30  MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDG---------- 79

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIF 172
             T FPQ +   A+F+ +L   IG AVA E RA +N+ +        AGLTFW+PN+NIF
Sbjct: 80  RATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIF 139

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRG ET GEDP++       +V+  QG                           D
Sbjct: 140 RDPRWGRGMETYGEDPLLSGMLGTAYVRGLQG---------------------------D 172

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
               L   AC KH   Y +      +R+  +   + +D  +T+ P F+  ++QG+   +M
Sbjct: 173 DAFYLKTGACAKH---YAVHSGPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVM 229

Query: 293 CSYNQVNGVPA----CLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
            +YN+V G P      L  D+ +K+   WGF G+I SDCDA+   +    Y KT E++ A
Sbjct: 230 SAYNRVYGEPCGGSKYLLTDILRKS---WGFNGHIVSDCDAINDFYGGHRYVKTPEEACA 286

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             +KAG+++ CG    +  Q A+D+G + E D+DRAL  L   +L+LG+   D     Y 
Sbjct: 287 AAIKAGLNVECGHT-FKAMQGALDQGLLAEADLDRALFPLVMTRLKLGILEPD-SACPYN 344

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
                ++C+  H  LAL AA + +VLLKN+   LPL+KN + +L + GP  ++   + G 
Sbjct: 345 SYDESEICSPAHTALALRAADEAMVLLKNN-GILPLDKN-IRTLFVAGPGASDAFYLMGN 402

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC-----NSDAGFHEAVRIAKKADFVIVV 523
           Y G+     + L+G+   VS+    +  +  P            + AV  A  A+  IVV
Sbjct: 403 YFGLSNRYSTYLQGI---VSRVSSGTSVNFRPAFMQITEELNDMNWAVNEACAAEVAIVV 459

Query: 524 AGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
            G +   E E+          DRV + LP  QM+ +  V       +++VLTGG P+D+ 
Sbjct: 460 MGNNGNMEGEEGEAIASASRGDRVGIGLPASQMNYLRRVKARKGGRIVVVLTGGSPIDLR 519

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMR 633
             E      +++   YPG+ G +AL +++FGD N  GRLP+T +P     +P  +D +M 
Sbjct: 520 --EISKLADAVVMAWYPGQEGGEALGDLLFGDKNFSGRLPIT-FPADVDSLPAFDDYSMN 576

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
                   GR+Y++ +G  +Y FG+GLSY   +Y      ++  +   +K G        
Sbjct: 577 --------GRTYKYMSGNVMYPFGYGLSYGRVTY------TDARVVGRIKKGEP------ 616

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                                 V++ +TN GD     V   +   P   +G+P   L+GF
Sbjct: 617 --------------------LAVEVVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGF 656

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            RV    K S +  F + P E+L      G   L  GN+ L +G
Sbjct: 657 RRVSIPPKSSVKAVFKIVP-ERLMTIQSDGSSKLLKGNYTLTIG 699


>gi|53712125|ref|YP_098117.1| beta-xylosidase [Bacteroides fragilis YCH46]
 gi|52214990|dbj|BAD47583.1| beta-xylosidase [Bacteroides fragilis YCH46]
          Length = 722

 Score =  356 bits (913), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 229/745 (30%), Positives = 364/745 (48%), Gaps = 97/745 (13%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           + S  ++ R ++LI  +TL EK+ QL   + +IPRL +PAY +W+E LHG+A  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V FVK  QG                           D   
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E     +R+S ++ I  +   + + P + +C+++  A  +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NGVP      L     R EWGF G++ SDC A+  +        + E++AA  + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
            D+ CGT        A+++G + E  IDRAL  + + + +LG F  DP +   Y      
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            +   +  +LA +AA + +VLLKND   LPLNK  + S+A++GP  +      GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426

Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
               SLL+G++  + K    +  + +  ++D+      ++ K AD V+V  G D     E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
           + D  S+ LP  Q  L+  + + + R ++LV   G PL   +  AD+ I +I+   YPG+
Sbjct: 483 NHDMPSIYLPEGQEKLLKEIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
              +ALA ++FG+ NP G+LPMT Y        + D +M       + GR+YR+  G  +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           YGFGHGLSYT++ +  +                            D +  D +  C+   
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
               + ++N+G + G  VV ++         T P K+L+ F +V   +   K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
            E LS+      R+L  G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703


>gi|60680313|ref|YP_210457.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
 gi|60491747|emb|CAH06504.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
          Length = 722

 Score =  356 bits (913), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 228/745 (30%), Positives = 364/745 (48%), Gaps = 97/745 (13%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           + S  ++ R ++LI  +TL EK+ QL   + +IPRL +PAY +W+E LHG+A  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V FVK  QG                           D   
Sbjct: 163 RWGRNEETYGEDPHLTSRLGVAFVKGLQG---------------------------DHPT 195

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E     +R+S ++ I  +   + + P + +C+++  A  +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NGVP      L     R EWGF G++ SDC A+  +        + E++AA  + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
            D+ CGT        A+++G + E  IDRAL  + + + +LG F  DP +   Y      
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            +   +  +LA +AA + +VLLKND   LPLNK  + S+A++GP  +      GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426

Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
               SLL+G++  + K    +  + +  ++D+      ++ K AD V+V  G D     E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
           + D  S+ LP +Q   +  + + + R ++LV   G PL   +  AD+ I +I+   YPG+
Sbjct: 483 NHDMPSIYLPEEQEKFLKKIYQVNPR-IVLVFHTGNPLTSEW--ADTHILAIMQAWYPGQ 539

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
              +ALA ++FG+ NP G+LPMT Y        + D +M       + GR+YR+  G  +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           YGFGHGLSYT++ +  +                            D +  D +  C+   
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
               + ++N+G + G  VV ++         T P K+L+ F +V   +   K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
            E LS+      R+L  G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703


>gi|326791674|ref|YP_004309495.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
 gi|326542438|gb|ADZ84297.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
          Length = 696

 Score =  355 bits (912), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 241/750 (32%), Positives = 356/750 (47%), Gaps = 126/750 (16%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +AK+L++ +TL+E+  QL  ++ AI RLG+PAY WW+E+LHG+A  G            T
Sbjct: 9   KAKALVAEMTLEERASQLKYDSPAIKRLGVPAYNWWNEALHGVARAG----------VAT 58

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIFRDP 175
           SFPQ +  AA+F+  L   +   +A E RA YN            GLTFW+PN+NIFRDP
Sbjct: 59  SFPQAIGMAATFDDELLKRVAEVIAEEGRAKYNAYSQEGDRDIYKGLTFWSPNVNIFRDP 118

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP + S   V FVK  QGE                             +
Sbjct: 119 RWGRGHETYGEDPYLTSRLGVAFVKGLQGE-----------------------------E 149

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L  +AC KH   + +       R+ F+A ++++D  +T+ P F + +++ +   +M +Y
Sbjct: 150 GLKTAACAKH---FAVHSGPEADRHHFDARVSQKDLWETYLPAFEALVKEAEVESVMGAY 206

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N+ NG P C    L +   R +WGF+G+  SDC A+    E+   T T ++SAA  LK+G
Sbjct: 207 NRTNGEPCCGSPTLMKDILREKWGFQGHYVSDCWAIKDFHEHHMVTSTAQESAALALKSG 266

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            D+NCG   L H   A   G V E++I  A   LF+ +  LGLF+G      Y  +  + 
Sbjct: 267 CDLNCGNTYL-HILMAYQNGLVTEEEITTAAERLFTTRYLLGLFDGS----TYDAIPYEV 321

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           V +  H  +A +A  + IVLLKN+   LPLNK ++ ++ +IGP  N+   + G Y G   
Sbjct: 322 VESKPHLSVADEATAKSIVLLKNN-GLLPLNKESIKTIGVIGPNANSRKALIGNYHGTSS 380

Query: 475 SPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIVVAG 525
              ++L GL+  V    +  Y+ G H      +          EA  +AK +D VIV  G
Sbjct: 381 QYITILEGLQKEVGDEVRILYSEGSHLYADRVEPLAYQRDRLSEAKIVAKHSDVVIVCVG 440

Query: 526 LDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
           LD T E E+          D+  L LP  Q  LV ++A+  K PVIL L+ G  +D+ + 
Sbjct: 441 LDETLEGEEGDTGNAYASGDKRDLALPEPQQELVEAMAKMGK-PVILCLSAGSAIDLQY- 498

Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
            AD+   ++L   YPG  G + +A+ + G+  P G+LP+T+Y +        D +M+   
Sbjct: 499 -ADAHYDAVLQAWYPGARGGQVIAKALLGEIVPSGKLPVTFYRDLSGLPAFEDYSMQ--- 554

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
                GR+YR+     +Y FG+GL+Y                                  
Sbjct: 555 -----GRTYRYMQEEALYPFGYGLTYGK-------------------------------C 578

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
           R++    D+     SLR    + V N  D     VV L+ +        P   L GF RV
Sbjct: 579 RIEEASYDQ----GSLR----VLVHNEVDFKLEEVVQLYIKNLDSEFAVPNHSLCGFKRV 630

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRI 786
              A  +KEI   V P     + N+ G  I
Sbjct: 631 SLEAGETKEIQINVSP-NAFKVVNEQGEWI 659


>gi|291556907|emb|CBL34024.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a]
          Length = 691

 Score =  355 bits (911), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 245/777 (31%), Positives = 376/777 (48%), Gaps = 145/777 (18%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +  LS   RA +L   L+ +E+ QQL  +A AI + G+P+Y WW+E LHG+A  G   
Sbjct: 4   YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAG--- 60

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                  + T FPQ +  AA+F++ +   +G  ++ EARAMYN            GLT W
Sbjct: 61  -------TATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLW 113

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           APNINIFRDPRWGRG ET GEDP + S   V FVK  QGE                    
Sbjct: 114 APNINIFRDPRWGRGHETYGEDPYLTSRLGVSFVKGIQGEE------------------- 154

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQPPFRSCIE 284
                    + L  +AC KH   +     G  S R+ F+A ++E+D E+T+ P F++ ++
Sbjct: 155 ---------EYLRAAACAKHFAVHS----GPESLRHEFDARVSEKDMEETYLPAFKALVK 201

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           +G+   +M +YN+VNG P+C    L  K R EWGF GY  SDC A+         T T  
Sbjct: 202 EGRVEGVMGAYNRVNGEPSCASEKLMGKLR-EWGFDGYFVSDCWAIRDFHTTHKITDTAP 260

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            SAA  LKAG D+NCG   L H  +A+++G + ++DI  A ++    ++RLG  + +   
Sbjct: 261 QSAAMALKAGCDVNCGNTYL-HILAALEEGLITKQDIRTACIHALRTRIRLGQLDDN--- 316

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
            ++  L  D +    +K L+L+AA + +VLL ND   LPL+K+ +SS+A+IGP  ++ + 
Sbjct: 317 -EFDDLPFDIIACDGNKALSLEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAA 374

Query: 465 MGGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH---------DVPCNSDAGFHEAVRIA 514
           + G Y G P    + L G++ A+  + +YA GC           +P +    + EAV   
Sbjct: 375 LLGNYNGTPDRSVTFLEGIQDAFDGRVYYAEGCQLFRDRTQGLALPGDR---YAEAVAAC 431

Query: 515 KKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
           + AD  ++  GLD T E E+          D+  L LP  Q  L+ ++  T K P+I+VL
Sbjct: 432 EAADVTVICVGLDATLEGEEGDTGNEFASGDKPDLRLPEVQRVLLQNLKDTGK-PLIIVL 490

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
             G  ++    E ++ I++  W  YPG+ G KALAEI+FG+ +P G+LP+T+Y  +    
Sbjct: 491 AAGSSVNTE-CEGNALINA--W--YPGQYGGKALAEILFGEVSPSGKLPVTFYKSADMLP 545

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
              D +M+         R+YRF       +Y FG+GL+Y+++                  
Sbjct: 546 DFTDYSMK--------NRTYRFCDDESNVLYPFGYGLTYSHF------------------ 579

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH---VQISVTNAGDVDGSHVVMLFARVPK 740
                                    C  + +    + ++VTN G      V+ ++    K
Sbjct: 580 ------------------------ECGDVSYKDNTLAVNVTNTGSRSAEDVLQVYI---K 612

Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
              G     L  F+RV      S+ IS  + P       + +G R +  G + L  G
Sbjct: 613 SENGVKNHSLCAFERVSLFDGESRTISINI-PEGAFETVDDNGIRAVRSGRYTLYAG 668


>gi|326789672|ref|YP_004307493.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
 gi|326540436|gb|ADZ82295.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
          Length = 704

 Score =  355 bits (910), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 235/766 (30%), Positives = 370/766 (48%), Gaps = 124/766 (16%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +A  L++ + L EK   L  ++ AI RLG+P Y WWSE+LHG+A  G            T
Sbjct: 8   KAGQLVAQMDLLEKASMLRYDSPAIKRLGVPTYNWWSEALHGVARAGVA----------T 57

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +  AA F+      I   +A EARA YN            G+T WAPNINIFRDP
Sbjct: 58  VFPQAIGMAAMFDEEYLYEIADIIATEARAKYNEFAKKEDRDIYKGMTLWAPNINIFRDP 117

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGEN----WKSDDGGIGFGFREKRVLKGFGEES 231
           RWGRG ET GEDP + S   V F+   QG+     WK+                      
Sbjct: 118 RWGRGHETYGEDPYLTSRLGVAFIHGLQGDENHHYWKA---------------------- 155

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                   +AC KH   + +       R+ F+A+++++D  +T+ P F + + +GK + +
Sbjct: 156 --------AACAKH---FAVHSGPEEERHHFDAVVSKKDLYETYLPAFEAAVTKGKVAGM 204

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
           M +YN+VNG PAC    L Q   + EWGF GY+ SDC A+         T T  +SAA  
Sbjct: 205 MGAYNRVNGEPACGSKVLLQDILKEEWGFDGYVVSDCWAIRDFHTEHMVTHTATESAALA 264

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
           +  G  +NCG   L H   A  +G V E+ I ++   L +++++LGLF+   +  +Y K+
Sbjct: 265 INNGCQLNCGNTYL-HMLQAYKEGLVTEETITKSAQKLMAIRMKLGLFD---KNCEYNKI 320

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
             +      H+ +ALD AR+ +VLLKN+   LPLN     ++ +IGP  N+ + + G Y 
Sbjct: 321 PYEVNDCKVHRDIALDVARRSMVLLKNNG-ILPLNLKQTKAIGVIGPTANSRTVLQGNYF 379

Query: 471 GIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFH-------EAVRIAKKADFV 520
           G      + L G++ YV   ++ +YA GCH +  NS +G         EA+ +A+++D V
Sbjct: 380 GTASRYTTFLEGIQDYVGDAARVYYAEGCH-LFKNSISGLSWENDRLSEALIVAEQSDVV 438

Query: 521 IVVAGLDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
           I+  GLD + E E  D         +  L L G+Q  L+  V +  K P IL+L+ G  +
Sbjct: 439 ILCLGLDASIEGEQGDTGNAFAAGDKSDLNLIGRQQLLLEEVLKIGK-PTILILSSGSAM 497

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
            +  A+      +IL   YPG++G KALA+++FG+++P G+LP+T+Y  +       D +
Sbjct: 498 AIHTAQ--EYCEAILETWYPGQSGGKALAQLLFGEYSPSGKLPITFYKTTEELPDFRDYS 555

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
           M         GR+YR+     +Y FG+GL+Y     K           A +K   ++NI 
Sbjct: 556 M--------AGRTYRYMKNEALYPFGYGLNYAKVEVK----------DAVIK---ERNIE 594

Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
            +                  + + +Q+ VTN  +V    VV ++ +  +     P   L 
Sbjct: 595 NE------------------IIYEIQLQVTNQSEVCTYDVVQVYIKDMESRWAVPNYSLC 636

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            +  ++  A    +I+  +       I ++ G+R +   +  L +G
Sbjct: 637 AYKSIYLAAYDEPQITLQIKQS-AFEIVDEEGKRYIDSHHFKLFIG 681


>gi|121700633|ref|XP_001268581.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
 gi|119396724|gb|EAW07155.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
          Length = 743

 Score =  355 bits (910), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 238/751 (31%), Positives = 363/751 (48%), Gaps = 105/751 (13%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+T  S   RA +LISL TL+E +    + +  +PRLG+P Y+ W+E+LHG+  +     
Sbjct: 63  CDTLTSPYDRAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGL--DRAYFT 120

Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRD 174
             G  S  TSFP  +++ ++ NR+L + + S ++ + RA  N G+ GL  ++PNIN FR 
Sbjct: 121 DEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRYGLDVYSPNINSFRH 180

Query: 175 PRWGRGQETPGEDPMVVS-AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           P WGRGQETPGED   +S AYA E++   QG            G   K            
Sbjct: 181 PVWGRGQETPGEDAYCLSSAYAYEYITGIQG------------GVDPK------------ 216

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
              L L A  KH   YD+E W   SR   +  IT+QD  + + P F       K   +MC
Sbjct: 217 --SLKLVATAKHYAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMC 274

Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGF--KGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
           SYN VNGVP+C      Q   R+ +GF   GYI+SDCD+   +F    Y      +AA  
Sbjct: 275 SYNAVNGVPSCANSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSSAAADS 334

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
           ++AG DI+CGT    +   A     V +  + RA +    ++L   L           +L
Sbjct: 335 IRAGTDIDCGTTYQYYFDEA-----VDQNLLSRADIERGVIRLYSNLM----------RL 379

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
           G  DV                                        GP +N  +Q+ G Y 
Sbjct: 380 GYFDV----------------------------------------GPWMNVSTQLQGNYF 399

Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
           G      S L          +YA G  ++  NS  GF +A+  AKK+D +I   G+D + 
Sbjct: 400 GPAPYLISPLDAFRDSHLDVNYAFGT-NISSNSTDGFSKALSAAKKSDAIIFAGGIDNSL 458

Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
           E E  DR+++  PG+Q+ L+  +++  K P+I++  GGG +D S  +++  ++S++W GY
Sbjct: 459 EAETLDRMNITWPGKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGY 517

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYT 649
           PG++G +AL +II G   P GRL +T YP  + T+ P  DM++R   +   PG++Y +YT
Sbjct: 518 PGQSGGQALLDIITGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGNN--PGQTYMWYT 575

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
           GT VY FGHGL YT +    +S    + I  +      +++L Q      Y+H++++   
Sbjct: 576 GTPVYEFGHGLFYTTFR---VSHARAVKIKPTYNI---QDLLAQ--PHPGYIHVEQM--- 624

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
             L F V I  TN G     +  MLFA         P+K L+GFDR+ T+   + ++   
Sbjct: 625 PFLNFTVDI--TNTGKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTI 682

Query: 770 VDPCEQLSIANKHGRRILPLGNHVLMVGELR 800
                 ++  ++ G R+L  G + L +   R
Sbjct: 683 PVTINSMARTDELGNRVLYPGKYELALNNER 713


>gi|405955586|gb|EKC22647.1| Putative beta-D-xylosidase 2 [Crassostrea gigas]
          Length = 745

 Score =  354 bits (909), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 237/779 (30%), Positives = 378/779 (48%), Gaps = 112/779 (14%)

Query: 48  HFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNAS--------AIPRLGIPAYEWW 99
           H   YPF NTSL    R K L+  LT++E + Q+S   S        A+PRLG+  + W 
Sbjct: 22  HVQDYPFRNTSLPWDARVKDLVDRLTIEEIVVQMSRGGSGPRASPAPAVPRLGVGPFSWN 81

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           +E L G    G          + TSFPQ L  AA+F+  +  ++ SA ++E RA +N  Q
Sbjct: 82  TECLRGDVYAG----------NATSFPQALGLAATFSTEVICDVASATSIEVRAKFNDYQ 131

Query: 160 --------AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
                    G++ ++P INI R P WGR QET GEDP +    A  FVK  QG       
Sbjct: 132 RRKIYGDHKGISCFSPVINIMRHPLWGRNQETYGEDPFLSGELAAIFVKCLQG------- 184

Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
                               D    +  +A CKH   +   +    SR+SF+A ++E+D 
Sbjct: 185 --------------------DDPTYIRANAGCKHFDVHGGPENIPVSRFSFDAKVSERDW 224

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAV 330
             TF P F+ C++ G  S +MCS+N++NGVPAC    L     R EWGF GY+ SD +A+
Sbjct: 225 RLTFLPAFKRCVQAGSYS-LMCSFNRINGVPACGNKRLLTDILRTEWGFTGYVVSDQEAI 283

Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQ----SAIDKGKVQEKDIDRALL 386
             I  Y +YT    D+AA  +KAG ++   T  ++ T      A+  GK+ ++D+ +++ 
Sbjct: 284 ENIMTYHHYTNNSVDTAALCVKAGCNLELSTNEVKPTYFYIIDALKAGKLDKEDLVKSVS 343

Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
            LF  ++RLG F+  P    Y  +    + + EH+ ++L+AA +  VLLKN   FLP+ K
Sbjct: 344 PLFYTRMRLGEFD-PPDHNPYNFIDLSVIQSEEHRAISLNAAMKSFVLLKNKGGFLPITK 402

Query: 447 NAVSSLAIIGPLVNNISQMGGGYTG--IPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSD 504
               +++++GP+ +N  Q  G Y    +P    + L+GL     +  YA+GC+D  C S 
Sbjct: 403 -LFDTISVLGPMADNKYQQIGSYAPDVMPSYTTTPLQGLSKLSKRVQYAAGCNDNAC-SK 460

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKR--PVI 562
               E  R    +D   V  G     E ED DR S+ LPGQQ  L+      S +  P++
Sbjct: 461 YNRTEIQRAVNSSDIFFVCLGTGPMIENEDHDRASMELPGQQAQLLKDAIMFSAKGVPIV 520

Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF---GDFNPGGRLPMTWYP 619
           L+L  GGP+++++A+   ++ +I+   +P +   +A+  ++       NP GRLP TW P
Sbjct: 521 LLLFNGGPVNITWADRSDRVVAIMECFFPAQETGEAVLRVVTNTGNSSNPAGRLPYTW-P 579

Query: 620 ESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
           +   ++P M + +M         GR+YR++ G  +Y FG+GLSY+ +++      +   +
Sbjct: 580 KYQDQIPSMVNYSME--------GRTYRYFHGDPLYPFGYGLSYSTFNF------TNAWM 625

Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
           +  +  G D                            V++ V N G  DG  V+ ++ + 
Sbjct: 626 NPIISQGQD--------------------------LTVRVEVCNEGPTDGDEVIQVYLKW 659

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
              ++  P  QL+GF+RV   AK +      V   E +++ N+     +  G + L +G
Sbjct: 660 LDTNETMPIHQLVGFERVSLRAKETLSWLITVR-AENMAVWNESRGFYIEPGRYRLYIG 717


>gi|423269271|ref|ZP_17248243.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
           CL05T00C42]
 gi|423273165|ref|ZP_17252112.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
           CL05T12C13]
 gi|392701693|gb|EIY94850.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
           CL05T00C42]
 gi|392708197|gb|EIZ01305.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
           CL05T12C13]
          Length = 722

 Score =  354 bits (909), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 228/745 (30%), Positives = 364/745 (48%), Gaps = 97/745 (13%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           + S  ++ R ++LI  +TL EK+ QL   + +IPRL +PAY +W+E LHG+A  G     
Sbjct: 53  DLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GE+P + S   V FVK  QG                           D   
Sbjct: 163 RWGRNEETYGEEPHLTSRLGVAFVKGLQG---------------------------DHPT 195

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E     +R+S ++ I  +   + + P + +C+++  A  +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAY 251

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NGVP      L     R EWGF G++ SDC A+  +        + E++AA  + +G
Sbjct: 252 NAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
            D+ CGT        A+++G + E  IDRAL  + + + +LG F  DP +   Y      
Sbjct: 312 CDLECGTTYKEKLVQAVEQGLISEAAIDRALTRVLTARFKLGEF--DPMELVPYNHYDKK 369

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            +   +  +LA +AA + +VLLKND   LPLNK  + S+A++GP  +      GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKND-ALLPLNKEKIKSVAVVGPFAD--YNYLGGYSGQP 426

Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
               SLL+G++  + K    +  + +  ++D+      ++ K AD V+V  G D     E
Sbjct: 427 PYSVSLLKGVKELIGKKGKVTYLNGMGTSADS----IAQVVKGADIVLVALGSDEKMARE 482

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
           + D  S+ LP  Q  L+  + + + R ++LV   G PL   +  AD+ I +I+   YPG+
Sbjct: 483 NHDMPSIYLPEGQEKLLKEIYQVNPR-IVLVFHTGNPLTSEW--ADTHIPAIMQAWYPGQ 539

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
              +ALA ++FG+ NP G+LPMT Y        + D +M       + GR+YR+  G  +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           YGFGHGLSYT++ +  +                            D +  D +  C+   
Sbjct: 593 YGFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
               + ++N+G + G  VV ++         T P K+L+ F +V   +   K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
            E LS+      R+L  G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703


>gi|307719075|ref|YP_003874607.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6192]
 gi|306532800|gb|ADN02334.1| glycoside hydrolase family 3 [Spirochaeta thermophila DSM 6192]
          Length = 693

 Score =  354 bits (908), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 245/756 (32%), Positives = 361/756 (47%), Gaps = 113/756 (14%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           R  SL+S ++++EK   +   A  +PRLGIP Y WW+E+LHG+A++G            T
Sbjct: 6   RMTSLLSRMSIEEKAGLMVHRAKGVPRLGIPNYNWWNEALHGVANSG----------EAT 55

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN-LG-------QAGLTFWAPNINIFRDP 175
            FPQ +  AA+F+  L   +  A++ EARA +N +G       + GLTFW+PNINI+RDP
Sbjct: 56  VFPQAIGLAATFDPDLVRRVADAISREARAKFNAVGKERAAEYERGLTFWSPNINIYRDP 115

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRGQET GEDP + S   V FVK  QG                           D   
Sbjct: 116 RWGRGQETYGEDPFLTSKIGVAFVKGLQG---------------------------DHPY 148

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L ++AC KH   Y +       R+ F+A ++E+D  +T+ P F + ++ G    +M +Y
Sbjct: 149 YLRVAACAKH---YAVHSGPEGLRHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAY 204

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N+VNG PAC    L ++  R +WGFKG++ SDC A+A    +   TK   +S A  L+AG
Sbjct: 205 NRVNGEPACGSKRLLEEILRKKWGFKGHVVSDCWAIADFHLHHKVTKDPIESIAMALEAG 264

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            D+NCG     H   A+  G V E+ +DR++  L S   RLGLF  D     Y +L   D
Sbjct: 265 CDLNCGNTY-EHLLDAVKAGAVSEELVDRSVARLLSTLDRLGLFTDD---HPYVRLSLAD 320

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           +    H+ LA +AA + +VLLKN+   LPL++  +  + + GP   N   + G Y G+  
Sbjct: 321 IDWEAHRALAREAAEKSVVLLKNN-GILPLDRRKLRYIYVTGPNAANPVALLGNYAGVSS 379

Query: 475 SPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
              ++L G+  Y        Y  GC  +  N       A  +A+ AD  + V G D   E
Sbjct: 380 RLVTVLEGITGYAGPGITVTYKIGC-PLQGNKINPIDWASGVARYADVTVAVMGRDSAVE 438

Query: 532 TEDRDRV---------SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
            E+ D +          L L  +Q+  +  +    K P+++VL  G P  V   E +   
Sbjct: 439 GEEGDAIFSDNYGDLSDLNLSREQIDYLRRIKEIGK-PLVVVLLSGAP--VCSPELEELA 495

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYP 641
            +I++  YPGE G  A+A ++FG+ +P GRLP+T +P+   ++ P  D +M         
Sbjct: 496 DAIVYAWYPGEEGGNAIARVLFGEVSPSGRLPIT-FPKGVDQLPPFTDYSME-------- 546

Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV 701
           GR+YR+     +Y FG GLSY  +SY+    P                  + + SR D  
Sbjct: 547 GRTYRYMKEEPLYPFGFGLSYATFSYR---DP------------------KSSASRWDKR 585

Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
              EV             V N   +    VV L+ R        P   L GF RV     
Sbjct: 586 ETLEVVC----------EVENTSSIPADEVVQLYVRWEDAPFRVPLWSLKGFTRVSLGTG 635

Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
              ++ F + P E LS  ++ GR++LP G     VG
Sbjct: 636 ERIQVRFVLSP-EDLSFIDEKGRKVLPEGRLRFHVG 670


>gi|167751044|ref|ZP_02423171.1| hypothetical protein EUBSIR_02029 [Eubacterium siraeum DSM 15702]
 gi|167655962|gb|EDS00092.1| glycosyl hydrolase family 3 C-terminal domain protein [Eubacterium
           siraeum DSM 15702]
          Length = 691

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 245/777 (31%), Positives = 375/777 (48%), Gaps = 145/777 (18%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +  LS   RA +L   L+ +E+ QQL  +A AI + G+P+Y WW+E LHG+A  G   
Sbjct: 4   YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAG--- 60

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                  + T FPQ +  AA+F++ +   +G  ++ EARAMYN            GLT W
Sbjct: 61  -------TATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLW 113

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           APNINIFRDPRWGRG ET GEDP + S   V FVK  QGE                    
Sbjct: 114 APNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEE------------------- 154

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFS-RYSFNAMITEQDTEDTFQPPFRSCIE 284
                    + L  +AC KH   +     G  S R+ F+A ++E+D E+T+ P F++ ++
Sbjct: 155 ---------EYLRAAACAKHFAVHS----GPESLRHEFDARVSEKDMEETYLPAFKALVK 201

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           +G+   +M +YN+VNG P+C    L  K R EWGF GY  SDC A+         T T  
Sbjct: 202 EGRVEGVMGAYNRVNGEPSCASEKLMGKLR-EWGFDGYFVSDCWAIRDFHTTHKITDTAP 260

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            SAA  LKAG D+NCG   L H  +A+++G + +++I  A ++    ++RLG  + +   
Sbjct: 261 QSAAMALKAGCDVNCGNTYL-HILAALEEGLITKQNIRTACIHALRTRIRLGQLDDN--- 316

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
            ++  L  D +    +K L+L+AA + +VLL ND   LPL+K+ +SS+A+IGP  ++ + 
Sbjct: 317 -EFDDLPFDIIACDGNKALSLEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAA 374

Query: 465 MGGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH---------DVPCNSDAGFHEAVRIA 514
           + G Y G P    + L G++ A+  + +YA GC           +P +    + EAV   
Sbjct: 375 LLGNYNGTPDRSVTFLEGIQDAFDGRVYYAEGCQLFRDRTQGLALPGDR---YAEAVAAC 431

Query: 515 KKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
           + AD  +V  GLD T E E+          D+  L LP  Q  L+  +  T K P+I+VL
Sbjct: 432 EAADVTVVCVGLDATLEGEEGDTGNEFASGDKPDLRLPEVQRVLLQKLKDTGK-PLIIVL 490

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
             G  ++    E ++ I++  W  YPG+ G KALAEI+FG+ +P G+LP+T+Y  +    
Sbjct: 491 AAGSSVNTE-CEGNALINA--W--YPGQYGGKALAEILFGEVSPSGKLPVTFYKSADMLP 545

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
              D +M+         R+YRF       +Y FG+GL+Y+++                  
Sbjct: 546 DFTDYSMK--------NRTYRFCDDESNVLYPFGYGLTYSHF------------------ 579

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH---VQISVTNAGDVDGSHVVMLFARVPK 740
                                    C  + +    + ++VTN G      V+ ++    K
Sbjct: 580 ------------------------ECGDISYKDNTLAVNVTNTGSRSAEDVLQVYI---K 612

Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
              G     L  F+RV      S+ IS  + P       + +G R +  G + L  G
Sbjct: 613 SENGVKNHSLCAFERVSLFDGESRTISINI-PEGAFETVDDNGVRAVISGRYTLYAG 668


>gi|373852136|ref|ZP_09594936.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
 gi|372474365|gb|EHP34375.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
          Length = 740

 Score =  353 bits (905), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 246/764 (32%), Positives = 361/764 (47%), Gaps = 117/764 (15%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           PF +  L++  R + L+S LTL EK+ Q+   A+AIPRLGIPAY +W+E LHG+A NG  
Sbjct: 22  PFRDPDLALDHRVRDLVSRLTLAEKVSQMEHAAAAIPRLGIPAYNYWNECLHGVARNG-- 79

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-----------G 161
                     T FPQ++  AA+++  L   + +A++ EARA ++   A           G
Sbjct: 80  --------RATVFPQIIGLAATWDTDLVYRVATAISDEARAKHHAALARQGFAQTQQYQG 131

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LTFW PNIN+FRDPRWGRGQET GEDP + +  A  FV+  QG+   +            
Sbjct: 132 LTFWTPNINLFRDPRWGRGQETWGEDPHLTARLAAAFVRGLQGDTPDT------------ 179

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
                          L L+AC KH   Y +       R++FNA +T  D  D++ P F  
Sbjct: 180 --------------HLKLAACAKH---YAVHSGPENERHTFNARVTPHDLWDSYLPAFEH 222

Query: 282 CIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYT 340
            +   +   +M +YN+    P C  +  L    R  WGF+G++ SDC A+  I E    T
Sbjct: 223 LVRHARVESVMGAYNRTLDEPCCASQFLLLDILRERWGFEGHVVSDCWALRDIHETHRIT 282

Query: 341 KTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN- 399
               +SAA  L  G D+ CGT        A+ +G + E DIDRAL      + +LG+F+ 
Sbjct: 283 TDPVESAALALTKGCDLACGT-TFELLGEAVQRGLITEADIDRALSRHLRARFKLGMFDP 341

Query: 400 -GDPRKGKYGKLGPDDVCT-SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
             D R        P+ + T + H  LA +AA    VLL+N    LPL  + V S+ I GP
Sbjct: 342 ADDNRNPWSNPPAPEAIVTCAAHTALACEAAVASCVLLQNHNHILPLRPD-VRSIYITGP 400

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIA 514
           L      + G Y G+P    +LL GL A +    +  Y  G        +A    A    
Sbjct: 401 LAATQDALLGNYYGLPPRAITLLDGLAAALPEGIRADYRPGALLSTPKQNA-LEWAEFDC 459

Query: 515 KKADFVIVVAGLDLTQETE-----------DRDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
              D  I   GL    E E           DRD +SL  P Q++ L + + R ++  VI+
Sbjct: 460 ASCDVTIACLGLTALLEGEEGEAIASSLHGDRDDISLP-PPQRLFLESLIQRGAR--VIV 516

Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
           +L GG  L  S      ++ +ILW GYPG+ G +ALA+I+ G  +P GRLP+T+Y     
Sbjct: 517 ILFGGSAL--SLGPLADKVEAILWAGYPGQEGGRALADILLGRASPSGRLPITFYENIND 574

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
             P  + +MR        GR++R++ GT  + FG GL+YT ++Y      S+L +S    
Sbjct: 575 LPPYANYSMR--------GRTHRWFDGTPAWPFGFGLTYTRFTY------SDLRVSDVYS 620

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF-ARVPKVS 742
            G+D  +                  C S      + +TN GD + + +V ++        
Sbjct: 621 PGNDSPL------------------CGS------VLLTNTGDHEAAEIVQIYLTDFDAPG 656

Query: 743 QG-TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR 785
            G  P + L  F RV      S+ + F + P E + + + +GRR
Sbjct: 657 NGPVPRENLADFHRVTLAPGQSRRVEFSIPP-EHILLVDTNGRR 699


>gi|451821678|ref|YP_007457879.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
 gi|451787657|gb|AGF58625.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
          Length = 710

 Score =  352 bits (904), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 242/762 (31%), Positives = 364/762 (47%), Gaps = 118/762 (15%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +AK L+S +TLQE+ +QL+  A AI  L I  Y WW+E LHG+A  G          + T
Sbjct: 15  KAKELVSKMTLQERAEQLTYKAPAIKHLNISRYNWWNEGLHGVARAG----------TAT 64

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +  AA F+  L   I   +A E RA YN            GLTFW+PN+NIFRDP
Sbjct: 65  VFPQAIGLAAIFDDELLEKIAGIIATEGRAKYNENSKKEDKDIYKGLTFWSPNVNIFRDP 124

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP + S   V FVK  QG+               ++ LK          
Sbjct: 125 RWGRGHETYGEDPYLTSRLGVAFVKGLQGD---------------EKYLK---------- 159

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
              ++AC KH   +   +     R+ FNA+++++D  +T+ P F +C+++     +M +Y
Sbjct: 160 ---IAACAKHFAVHSGPEG---LRHEFNAVVSKKDLYETYLPAFEACVKEADVEAVMGAY 213

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N+ N  P C    L +   R +W FKG++ SDC A+A    Y   T T  +SAA  +K G
Sbjct: 214 NRTNDEPCCGSSLLLKDILRGKWQFKGHVVSDCWAIADFHLYHGVTSTATESAALAIKNG 273

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            D+NCG   L+    A  +G V E+DI RA   L + ++RLG+F+ +    K      D 
Sbjct: 274 CDLNCGNVYLQMLL-AYKEGLVTEEDITRAAERLMATRIRLGMFDEECEFNKIPYTMND- 331

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
               EH +++L A+R+ IV+L+N+   LPL+K+ + S+ IIGP  ++   + G Y G   
Sbjct: 332 --CKEHHEVSLMASRKSIVMLRNN-GLLPLDKSKLKSIGIIGPNADSELMLKGNYFGTAS 388

Query: 475 SPKSLLRGLEAYVS----KTHYASGCHDVP------CNSDAGFHEAVRIAKKADFVIVVA 524
              ++L G+   V     +  Y+ GCH            D    EAV +A+ +D VI+  
Sbjct: 389 KYITVLEGIHEAVDSENIRIFYSEGCHLYKDRVQDLAEPDDRMAEAVTVAEHSDVVILCL 448

Query: 525 GLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           GLD + E E           D+++L LPG+Q  L+  V  T K PVI+VL  G  L +  
Sbjct: 449 GLDSSIEGEQGDAGNSDGAGDKLNLNLPGKQQELLEKVIATGK-PVIVVLGAGSALTLQG 507

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
            E +   ++IL   YPG  G +A+A++IFG  +P G+LP+T+Y  +       D +M+  
Sbjct: 508 QEENC--AAILNAWYPGSFGGRAIADLIFGKCSPSGKLPVTFYKTTEELPEFTDYSMK-- 563

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
                  R+YR+     +Y FG GL+Y+          S+L++S   K            
Sbjct: 564 ------NRTYRYMKNESLYPFGFGLTYSKVQL------SDLSVSDISK------------ 599

Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
              D+  ++           V I ++N G+ D   V+  + +  +         L  F R
Sbjct: 600 ---DFEGVE-----------VSIKISNVGNFDIEEVLQCYIKDLESKYAVDNHSLSAFKR 645

Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           V      SK +   ++      + N  G RIL      L VG
Sbjct: 646 VALNKGESKVVKMTINK-RAFEVVNDEGDRILDSKKFKLFVG 686


>gi|348684866|gb|EGZ24681.1| hypothetical protein PHYSODRAFT_325770 [Phytophthora sojae]
          Length = 805

 Score =  352 bits (903), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 257/753 (34%), Positives = 376/753 (49%), Gaps = 102/753 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPR-----LGIPAYEWWSESLHGIA 107
           PFCNTSLS + R + L+S L LQEK   L+  AS  PR     +G+P Y W +  +HG+ 
Sbjct: 36  PFCNTSLSTADRVEDLLSRLPLQEKATLLTARAS--PRGNMSSIGLPEYNWGANCVHGVQ 93

Query: 108 SNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------- 158
           S   G N        TSFP  +   A F+  +  ++   +  E RA++  G         
Sbjct: 94  STC-GTN------CPTSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEGATENYKGGP 146

Query: 159 QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
             GL  W+PNINI RDPRWGR  ETP EDP+V S Y V + +  Q               
Sbjct: 147 HLGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQ--------------- 191

Query: 219 REKRVLKGFGEESDRGDELMLSACC--KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                      E  R D   L A    KH  AY  E +G  +R  F+A+++  D  DT+ 
Sbjct: 192 -----------EGKRQDPRFLQAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYF 240

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFE 335
           P FRS +  G A  +MCSYN VNG+P C   +L +   R   GF GY+TSD  AV  I +
Sbjct: 241 PAFRSSVVDGNAKGVMCSYNSVNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISD 300

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
             +Y  +  ++A   + AG DIN G       ++ +D  +++EK +D AL +   ++  L
Sbjct: 301 MHHYADSQCEAARLAILAGTDINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFEL 360

Query: 396 GLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           GLF  DP   + Y  + P +V T+  K L+L+A R+ +V+L+N+   LPL K     LA+
Sbjct: 361 GLF--DPIDDQPYWNVTPSEVNTAAAKALSLNATRKSLVMLQNNASVLPLQKGV--KLAV 416

Query: 455 IGP--------LVNNISQM-GGGYTGIPC--SPKSLLRGLEAYVSKTHYASGCHDVPCNS 503
           +GP        L N + QM  G Y  + C  +P   +R      S T +A GC  +  NS
Sbjct: 417 LGPHAKSKRGLLGNYLGQMCHGDYDEVGCVQTPLDAIRAANG-ASNTTFAEGC-GISGNS 474

Query: 504 DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
            AGF +AV  AK+AD V++  G+D + E E  DR ++ LP  QM L+  V     RP ++
Sbjct: 475 TAGFEKAVAAAKEADAVVLFLGIDKSIEGEVGDRNNIDLPNIQMQLLQRV-HAVGRPTVV 533

Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
           VL  GG +     E   +  +++   YPG  GA+A+A+++FGD NP G+LP+T Y   + 
Sbjct: 534 VLINGGVIGAE--EIIERTDALVEAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYV 591

Query: 624 -KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASL 682
            +V M  M+M A     +PGR+YR++ G  V+ FG GLSYT +S         L++ +  
Sbjct: 592 DQVEMKSMDMTA-----HPGRTYRYFKGEPVFPFGWGLSYTTFS---------LSVDSGT 637

Query: 683 KAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
            + S  N    +G         EV+   ++   + + V N G+V G  VV+ F R    +
Sbjct: 638 NSSSHSNNAAFSGG--------EVSDTANVT--ISVVVKNDGEVAGDEVVLAFFRPVNSN 687

Query: 743 QGTP----EKQLIGFDRVHTVAKGSKEISFGVD 771
              P     +QL  + RV      S E+SF ++
Sbjct: 688 VTGPATLLNEQLFDYQRVSLGPLDSTEVSFTIE 720


>gi|333494646|gb|AEF56854.1| putative glycosyl hydrolase [synthetic construct]
          Length = 743

 Score =  352 bits (902), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 256/777 (32%), Positives = 379/777 (48%), Gaps = 128/777 (16%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + +LS   RA+ L+S +TL+EKI Q+   A +I RLG+PAY WW+E+LHG+A  G   
Sbjct: 30  YRDENLSFEERARDLVSRMTLEEKIAQMQHEAPSIERLGVPAYNWWNEALHGVARAG--- 86

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFW 165
                VS  T FPQ +  AA+F+  L       ++ E RA Y+  Q         GLTFW
Sbjct: 87  -----VS--TMFPQAIGMAATFDAELIEKTADVISTEGRARYHEFQRKGDRDIYKGLTFW 139

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +P INI RDPRWGRGQET GEDP + S  AV F++  QG                     
Sbjct: 140 SPTINIDRDPRWGRGQETYGEDPYLTSRLAVSFIRGIQG--------------------- 178

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                  RG  L  +AC KH   +   +     R+ FNA ++++D  +T+ P F + +++
Sbjct: 179 -------RGRYLKAAACAKHFAVHSGPE---SERHQFNAEVSQKDLWETYLPAFEASVKE 228

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
            K + +M +YN+VNG P C  G L     R EW F GY+TSDC A+  I E    TKT E
Sbjct: 229 AKVAGVMGAYNRVNGEPCCGSGTLLGDVLRGEWEFGGYVTSDCWAIKDINEGHGVTKTIE 288

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           +S+A  +K+G D+NCG C       A   G + EK+ID A+  L   ++RLG+F+  P K
Sbjct: 289 ESSALAVKSGCDLNCG-CAYASLVKAYRAGLIGEKEIDTAVHRLMLTRMRLGMFDA-PEK 346

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             Y  +  +    +EH+  AL+ A + +VLL+N   FLPL+++ + S+A+IGP  ++   
Sbjct: 347 VPYSSIPYEKNDCAEHRAFALEVAEKSLVLLRNRSGFLPLDRSRIRSVAVIGPNADSRVA 406

Query: 465 MGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHE-------AVRIA 514
           + G Y G      ++L G+   V   ++ +YA G H +  NS  G  +       A   A
Sbjct: 407 LEGNYNGTASEYVTVLDGIREAVGDRARVYYAEGSH-LFRNSMGGLSQKNDRLAEAAAAA 465

Query: 515 KKADFVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
           ++AD  +V  GL+   E E+          D+  L LPG Q  L+ +V  T   PV+LVL
Sbjct: 466 ERADVAVVCLGLNRDIEGEEGDPSNEYPAGDKRDLRLPGLQEELLETVKATGT-PVVLVL 524

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
             G  L V++A+ ++      W  YPG A A+     +FG   P G  P           
Sbjct: 525 LSGSALAVNWADENADAVVQAW--YPG-AQAEGRRGALFGIIRPAGGFP----------- 570

Query: 626 PMNDMNMRADSSRQYPGRSYR----FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
             +   +R  +SR + G  +        G  +Y FG+GLSYT + Y       +L ++AS
Sbjct: 571 --SRSTVRTRTSRIF-GTIHENRLPLLQGDPLYPFGYGLSYTKFQY------GDLKLAAS 621

Query: 682 -LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
            + AG D                            V ++V NAG+ D   VV L+ +  +
Sbjct: 622 EIPAGEDA--------------------------EVSVTVRNAGERDSDEVVQLYLQDLE 655

Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            S   P+ QL GF RVH     S  + F V    Q+++ ++ GR +L  G   +  G
Sbjct: 656 SSVPVPKWQLAGFRRVHLKPGESAGVRFTV-AARQMALIDEDGRCVLEPGGFRVYAG 711


>gi|315607899|ref|ZP_07882892.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315250368|gb|EFU30364.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 721

 Score =  351 bits (901), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 238/764 (31%), Positives = 365/764 (47%), Gaps = 121/764 (15%)

Query: 61  ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
           +S  AK +I+ +T+ EKI QL + + AI  LGI  Y+WWSE LHG+  +G          
Sbjct: 30  MSRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDG---------- 79

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIF 172
             T FPQ +   A+F+ +L   IG AVA E RA +N+ +        AGLTFW+PN+NIF
Sbjct: 80  RATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIF 139

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RD RWGRG ET GEDP++       +V+  QG                           D
Sbjct: 140 RDLRWGRGMETYGEDPLLSGMLGTAYVRGLQG---------------------------D 172

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
               L   AC KH   Y +      +R+  +   + +D  +T+ P F+  ++QG+   +M
Sbjct: 173 DAFYLKTGACAKH---YAVHSGPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVM 229

Query: 293 CSYNQVNGVPA----CLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
            +YN+V G P      L  D+ +K+   WGF G+I SDCDA+   +    Y KT E++ A
Sbjct: 230 SAYNRVYGEPCGGSKYLLTDILRKS---WGFNGHIVSDCDAINDFYGGHRYVKTPEEACA 286

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
             +KAG+++ CG    +  Q A+D+G + E D+DRAL  L   +L+LG+   D     Y 
Sbjct: 287 AAIKAGLNVECGHT-FKAMQGALDQGLLAEADLDRALFPLVMTRLKLGILEPD-SACPYN 344

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
                ++C+  H  LAL AA + +VLLKN+   LPL+KN + +L + GP  ++   + G 
Sbjct: 345 SYDESEICSPAHTALALRAADEAMVLLKNN-GILPLDKN-IRTLFVAGPGASDAFYLMGN 402

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPC-----NSDAGFHEAVRIAKKADFVIVV 523
           Y G+     + L+G+   VS+    +  +  P            + AV  A  A+  IVV
Sbjct: 403 YFGLSNRYSTYLQGI---VSRVSSGTSVNFRPAFMQITEELNDMNWAVNEACAAEVAIVV 459

Query: 524 AGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
            G +   E E+          DRV + LP  Q++ +  V       +++VLTGG P+D+ 
Sbjct: 460 MGNNGNMEGEEGEAIASASRGDRVGIGLPASQLNYLRRVKARKGGRIVVVLTGGSPIDLR 519

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMR 633
             E      +++   YPG+ G +AL +++FGD N  GRLP+T +P     +P  +D +M 
Sbjct: 520 --EISKLADAVVMAWYPGQEGGEALGDLLFGDKNFSGRLPIT-FPADVDSLPAFDDYSMN 576

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
                   GR+Y++ +G  +Y FG+GLSY   +Y      ++  +   +K G        
Sbjct: 577 --------GRTYKYMSGNVMYPFGYGLSYGRVTY------TDARVVGRIKKGEP------ 616

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                                 V++ +TN GD     V   +   P   +G+P   L+GF
Sbjct: 617 --------------------LAVEVVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGF 656

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            RV    K S +  F + P E+L      G   L  GN+ L +G
Sbjct: 657 RRVSIPPKSSVKAVFKIVP-ERLMTVQSDGSSKLLKGNYTLTIG 699


>gi|409385818|ref|ZP_11238358.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
 gi|399206850|emb|CCK19273.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
          Length = 695

 Score =  350 bits (897), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 221/658 (33%), Positives = 337/658 (51%), Gaps = 90/658 (13%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
            A  ++S +TL EKI Q+  +ASAI RL IP Y +W+E LHG+A  G            T
Sbjct: 11  EAIKIVSQMTLAEKISQIDFDASAIERLNIPHYNYWNEGLHGVARAGVA----------T 60

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +  AA+F+  L  +I   +++E RA YN            GLTFW+PNIN+FRDP
Sbjct: 61  VFPQAIGLAATFDTELVKHIAEVISIEGRAKYNAYTKHGDRDIYKGLTFWSPNINLFRDP 120

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRGQET GEDP + +   V F+K  QGE                            G 
Sbjct: 121 RWGRGQETYGEDPFLTAQIGVAFIKGLQGE----------------------------GK 152

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L L+AC KH   + +       R+ F+A++  +D  + + P F++ IE+      M +Y
Sbjct: 153 YLRLAACTKH---FAVHSGPEADRHYFDAVVNPKDLNEFYLPQFKAAIEEADVESFMGAY 209

Query: 296 NQVNGVPACLRGDLFQKAR-NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N +NG PAC+  +L  K    +WGF+G++ SD  A+  + E  +YT+T  ++ A  +K G
Sbjct: 210 NAINGQPACVNEELIAKTLLGKWGFEGHVVSDYAALEDVHENHHYTQTAAETMALAMKIG 269

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            ++ C   +      A+ KG V E +I  +++ L++  +RLG+F  D     Y  +  + 
Sbjct: 270 TNL-CAGKISDALFEAVGKGLVTETEITASVVKLYTTHVRLGMFAED---NDYDTIPYEV 325

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
             ++EH+ L+L AA + +VLLKND  FLPL+++ + S+A+IGP   NI  + G Y G   
Sbjct: 326 NASAEHEMLSLKAAEKSMVLLKNDN-FLPLSQSEIKSVAVIGPTARNIGALEGNYAGTAN 384

Query: 475 SPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFH-------EAVRIAKKADFVIVVA 524
             ++ + G++  +S   +  YA GCH    ++++          EA+  A+ AD  ++  
Sbjct: 385 HYETFVSGIQQALSNQARVTYALGCHLYADHAESSLSRANERESEAIIAAEHADIAVLCV 444

Query: 525 GLDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           GLD T E E  D         + SL LPGQQ  L+  V  T K  VILVLT G  L +  
Sbjct: 445 GLDPTIEGEQGDAGNVYGSGDKPSLSLPGQQKRLIEKVLETGKT-VILVLTSGSALSLEG 503

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
            E  + + +I+   YPG  G  ALA I+ G  +P G+LP+T+  ++      +D +M   
Sbjct: 504 LEKHTGVKAIIQAWYPGAHGGTALANILLGKVSPSGKLPVTFCKDTQGLPDFSDYSM--- 560

Query: 636 SSRQYPGRSYRFYTGTQ---VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
                   + R Y  TQ   +Y FG+GL+Y +   K L    +LT+S + +   D +I
Sbjct: 561 --------AERTYQNTQLEVLYPFGYGLTYGHAEIKTLQL-DDLTLSVTAENKGDYDI 609


>gi|350295750|gb|EGZ76727.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
          Length = 839

 Score =  348 bits (892), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 258/821 (31%), Positives = 381/821 (46%), Gaps = 118/821 (14%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+ + +   RA SL+  LT+ EK+  L D A    R+G+P Y WWSE LHG+A + PGV 
Sbjct: 37  CDVTGTAPERAASLVDQLTIDEKLVNLVDQALGASRIGLPKYAWWSEGLHGVAGS-PGVT 95

Query: 115 FNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           FN T    S  TSF   +   ASF+  L   +G+A++ EARA  N G  GL +W PN+N 
Sbjct: 96  FNTTGYPFSYATSFANAINLGASFDDDLVYEVGTAISTEARAFANFGFGGLDYWTPNVNP 155

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGF-GEE 230
           ++DPRWGRG ETPGEDP+ +  Y                          K +L G  G E
Sbjct: 156 YKDPRWGRGAETPGEDPLHIKGYV-------------------------KAMLAGLEGNE 190

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
           + R     + A CKH  AYDLE+W   +RY F A++T QD  + + PPF+ C    K   
Sbjct: 191 TVR----KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKVGS 246

Query: 291 IMCSYNQV------------------NGVPACLRGDLFQKARNEWGF---KGYITSDCDA 329
           IMCSYN +                     PAC    L    R+ W +     YITSDC+A
Sbjct: 247 IMCSYNALTIRDMAGGSKPDEIINLTTAQPACANTYLMTILRDHWNWTEHNNYITSDCNA 306

Query: 330 VATIF-EYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQEKDIDRALL 386
           +     +  N+++T  ++AA   KAG D  C      L     A ++  + E  ID AL 
Sbjct: 307 ILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVGAYNQSLLPEAVIDTALR 366

Query: 387 NLFSVQLRLGLFN-----------GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLL 435
            L+   +R G  +           G      Y  L  +DV T   ++LAL +A +GIVLL
Sbjct: 367 RLYEGLIRAGYLDHGRSAVAGGDGGSFSSPAYDALNWNDVNTPSTQELALRSATEGIVLL 426

Query: 436 KNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASG 495
           KN    LPL+ +    +A+IG   N    M G Y+GIP    + L   +       YA+G
Sbjct: 427 KNSGSLLPLDFSG-KKVALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSLSYANG 485

Query: 496 CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVAR 555
                 + D     A+  A+ AD V+   G D T  +ED DR S+  P  QM L++ +A 
Sbjct: 486 PVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIAWPEAQMKLLSELAG 545

Query: 556 TSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPM 615
             K  V++ L  G  +D S    +  +SSILW+GYPG++G  A+ +++ G   P GRLP+
Sbjct: 546 LGKPVVVIQL--GDQVDDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAPAGRLPV 603

Query: 616 TWYPESFT-KVPMNDMNMR-----------------------------ADSSRQYPGRSY 645
           T YPE +  +VP+ +M +R                              + +   PGR+Y
Sbjct: 604 TQYPEGYVDEVPLTEMALRPFNHSSSNLEEEVSVQGGASLTIQARSTPGNKTLSSPGRTY 663

Query: 646 RFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
           ++Y+ T V  FG+GL YT ++     + +  + S S+ +     +L    +     H+D 
Sbjct: 664 KWYS-TPVLPFGYGLHYTTFNVSLSLSSNASSPSFSIPS-----LLTPCTA----THLDL 713

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
                S    + +S+TN G     +V +LF       +  P K L+ + RV  +  G + 
Sbjct: 714 CPFSPSANSALSVSITNTGTHTSDYVALLFLSGEFGPEPYPLKTLVSYKRVKDIKPG-ET 772

Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           ++    P    +I+   G      GN VL  G  R  + ++
Sbjct: 773 VTVKDVPVSLGAISRVDGD-----GNTVLYPGTYRFVVDVD 808


>gi|348684865|gb|EGZ24680.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 769

 Score =  347 bits (890), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 267/793 (33%), Positives = 388/793 (48%), Gaps = 110/793 (13%)

Query: 16  FRVLQLIVVVNVIAFSNSKPVLNKPDFP--CKPPHFDSYPFCNTSLSISTRAKSLISLLT 73
            R LQLI     I   N   V    DFP  C        PFCNTSLS + R + L+S L 
Sbjct: 1   MRALQLI-----ITLFNWVAVYGDADFPRACVGKTNQELPFCNTSLSTADRVEDLLSRLP 55

Query: 74  LQEKIQQLSDNASAIPR-----LGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQV 128
           LQEK   L+  AS  PR     +G+P Y W +  +HG+ S   G N        TSFP  
Sbjct: 56  LQEKATLLTARAS--PRGNMSSIGLPEYNWGANCVHGVQSTC-GTN------CPTSFPNP 106

Query: 129 LVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRDPRWGR 179
           +   A F+  +  ++   +  E RA++  G           GL  W+PNINI RDPRWGR
Sbjct: 107 VNLGAIFDPQVVFDMAQVIGWELRALWLEGATENYKGGPHLGLDCWSPNININRDPRWGR 166

Query: 180 GQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELML 239
             ETP EDP+V S Y V + +  Q                          E  R D   L
Sbjct: 167 NTETPSEDPLVNSKYGVAYTRGLQ--------------------------EGKRQDPRFL 200

Query: 240 SACC--KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQ 297
            A    KH  AY  E +G  +R  F+A+++  D  DT+ P FRS +  G A  +MCSYN 
Sbjct: 201 QAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGNAKGVMCSYNS 260

Query: 298 VNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMD 356
           VNG+P C   +L +   R   GF GY+TSD  AV  I +  +Y  +  ++A   + AG D
Sbjct: 261 VNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHYADSQCEAARLAILAGTD 320

Query: 357 INCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDV 415
           IN G       ++ +D  +++EK +D AL +   ++  LGLF  DP   + Y  + P +V
Sbjct: 321 INSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLF--DPIDDQPYWNVTPSEV 378

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP--------LVNNISQM-G 466
            T+  K L+L+A R+ +V+L+N+   LPL K     LA++GP        L N + QM  
Sbjct: 379 NTAAAKALSLNATRKSLVMLQNNASVLPLQKGV--KLAVLGPHAKSKRGLLGNYLGQMCH 436

Query: 467 GGYTGIPC--SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
           G Y  + C  +P   +R      S T +A GC  +  NS AGF +AV  AK+AD V++  
Sbjct: 437 GDYDEVGCVQTPLDAIRAANG-ASNTTFAEGC-GISGNSTAGFEKAVAAAKEADAVVLFL 494

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           G+D + E E  DR ++ LP  QM L+  V     RP ++VL  GG +     E   +  +
Sbjct: 495 GIDKSIEGEVGDRNNIDLPNIQMQLLQRV-HAVGRPTVVVLINGGVIGAE--EIIERTDA 551

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGR 643
           ++   YPG  GA+A+A+++FGD NP G+LP+T Y   +  +V M  M+M A     +PGR
Sbjct: 552 LVEAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSMDMTA-----HPGR 606

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
           +YR++ G  V+ FG GLSYT +S         L++ +   + S  N    +G        
Sbjct: 607 TYRYFKGEPVFPFGWGLSYTTFS---------LSVDSGTNSSSHSNNAAFSGG------- 650

Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVV--MLFARVPKVSQGTPEKQLIGFDRVH--TV 759
            EV+   ++   + + V N G+V G  V+  +    V  ++    E  L+ F   +   V
Sbjct: 651 -EVSDTANVT--ISVVVKNDGEVAGDEVLGPLDSTEVSTLALPDEEGNLVSFPGSYEVIV 707

Query: 760 AKGSKE-ISFGVD 771
           + G KE + F V+
Sbjct: 708 SNGVKERLRFSVE 720


>gi|373460527|ref|ZP_09552278.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
 gi|371955145|gb|EHO72949.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
          Length = 699

 Score =  347 bits (889), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 254/770 (32%), Positives = 380/770 (49%), Gaps = 134/770 (17%)

Query: 61  ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
           +  +A+ LI+++TL EKI Q+ +    IPRLGI  Y+WW+E LHG+  +G          
Sbjct: 9   MQQKARRLINMMTLDEKISQMMNETPGIPRLGIKPYDWWNEGLHGVGRDG---------- 58

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIF 172
             T FPQ +   A+FN +L   IG A+A E RA YN+ Q         GLTFW+PNINIF
Sbjct: 59  RATVFPQPIGMGATFNPALIRQIGDAIATEGRAKYNVAQRNNNYARYTGLTFWSPNINIF 118

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRG ET GEDP +     + +V+  QG +                          
Sbjct: 119 RDPRWGRGMETYGEDPFLTGTLGIAYVQGMQGND-------------------------- 152

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
               L ++AC KH   Y +      +R+  N   T++D  +T+ P F+  ++QG    IM
Sbjct: 153 -PFYLKVAACGKH---YAVHSGPEATRHEANVSPTKRDLFETYLPAFKMLVQQGHVEAIM 208

Query: 293 CSYNQVNGVPAC-----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSA 347
            +YN+V G  AC     L  D+ +K   +WGF+G+I SDCDAVA I       KT  ++ 
Sbjct: 209 GAYNRVYG-EACSGSKYLLTDVLRK---QWGFRGHIVSDCDAVADIHAGHKIVKTEAEAC 264

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  +KAG++I CG       + A+ +  + E++IDRALL L   +L+LG+   D  +  Y
Sbjct: 265 AIAIKAGLNIECGHT-FEAMKQAVAQKLLTEQEIDRALLPLMMTRLKLGILEYDA-ECPY 322

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            ++   ++C+ EH  LA  AA + +VLLKN+   LPL+KN + +L I GP  ++   + G
Sbjct: 323 NEVKETEICSPEHIALARKAATESMVLLKNN-GILPLDKN-LHTLFIAGPGASDSFWLMG 380

Query: 468 GYTGIPCSPKSLLRGLEAYVSK----THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVV 523
            Y GI     + L+G+   VS         +     P  +   +  A+  A  A+  IVV
Sbjct: 381 NYFGISNRYCTYLQGIADKVSSGTAVNFRPAFGESTPTKNTINW--ALDEAIAAEKTIVV 438

Query: 524 AGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
            G +   E E+          DRVS+ LP  QM  +  + +  K  +++VLTGG P+DV 
Sbjct: 439 MGNNGNLEGEEGESIASETRGDRVSMRLPASQMKFLRDL-KARKNGIVVVLTGGSPIDVR 497

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV-PMNDMNMR 633
             E      +++   YPG+ G  ALA+++FGD N  GRLP+T +PES   + P  D  M+
Sbjct: 498 --EISRLADAVVMAWYPGQEGGYALADLLFGDENFSGRLPVT-FPESTDALPPFEDYAMK 554

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY---KFLSAPSE---LTISASLKAGSD 687
                   GR+Y++ T    Y FG+GLSYT  +Y   K  + P +   +T+SA       
Sbjct: 555 --------GRTYKYQTAHIQYPFGYGLSYTTVTYAHAKVETMPQKGRGMTVSA------- 599

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
             +L+ TG++     +DEV         V +S   AG       ++ F R+  +  G  E
Sbjct: 600 --VLKNTGNKA----VDEVA-------QVYLSAPGAGTTAALASLVAFKRI-GLQPG--E 643

Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           +QL+ FD                 P ++L    + G   L  GN+ + VG
Sbjct: 644 QQLVRFD----------------IPFDRLLTVQEDGTAQLLKGNYTITVG 677


>gi|424661938|ref|ZP_18098975.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
           616]
 gi|404578249|gb|EKA82984.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
           616]
          Length = 722

 Score =  345 bits (886), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 226/745 (30%), Positives = 362/745 (48%), Gaps = 97/745 (13%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           + S  I+ R K+LI  +TL EK  QL   + +IPRL +PAY +W+E LHG+A  G     
Sbjct: 53  DLSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V FVK  QG                           D   
Sbjct: 163 RWGRNEETYGEDPYLTSRLGVAFVKGLQG---------------------------DHPA 195

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E     +R+S ++ I  +   + + P + +C+++     +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEADVQSVMTAY 251

Query: 296 NQVNGVP-ACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NGVP +  R  L +  R EWGF G++ SDC A+  +        + E++AA  + +G
Sbjct: 252 NAFNGVPPSGSRWLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
            D+ CGT        A+ +G + E  ID+AL  + + + +LG F  DP +   Y      
Sbjct: 312 CDLECGTTYKEKLVQAVKQGLISEATIDQALTRVLTARFKLGEF--DPMELVPYNHYDKK 369

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            +   +  +LA +AA + +VLLKN+   LPL+K    S+A++GP  ++     GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKNE-NLLPLSKEKTKSVAVVGPFADH--NYLGGYSGQP 426

Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
               +LL+G++  + K    +  + +  + D+     V   K  D V+V  G D     E
Sbjct: 427 PYSITLLKGVKDLMGKRGKVNYLNGIGASRDS----IVAAVKGVDVVLVALGSDEKMARE 482

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
           + D  S+ LP +Q  L+ ++ + + R ++LV   G PL   +  AD+ I +I+   YPG+
Sbjct: 483 NHDMTSIYLPEEQEKLLKAIYQVNPR-IVLVFHSGNPLTSEW--ADTHIPAIMQAWYPGQ 539

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
              +ALA ++FG+ NP G+LPMT Y        + D +M       + GR+YR+  G  +
Sbjct: 540 EAGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDM-------WKGRTYRYMKGEPL 592

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           Y FGHGLSYT++ +  +                            D +  D +  C+   
Sbjct: 593 YSFGHGLSYTSFEFDNIQGN-------------------------DTLQPDAILQCS--- 624

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
               + ++N+G + G  VV ++         T P K+L+ F +V   +   K++ F + P
Sbjct: 625 ----VELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTIAP 680

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
            E LS+      R+L  G + L +G
Sbjct: 681 RE-LSVWEDGKWRMLS-GKYTLFIG 703


>gi|301090543|ref|XP_002895482.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
 gi|262098232|gb|EEY56284.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
          Length = 809

 Score =  345 bits (885), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 255/758 (33%), Positives = 371/758 (48%), Gaps = 103/758 (13%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASA---IPRLGIPAYEWWS 100
           C  P    + FCN SLS + R + L+  L L EK+  L+  AS    +  +G+P Y W +
Sbjct: 27  CSSPEHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGA 86

Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG-- 158
             +HG+ S   G N        TSFP  +   A F+     ++   V  E RA++  G  
Sbjct: 87  NCVHGVQSTC-GTN------CATSFPNPVNLGAIFDPRAVFDMAQVVGWELRALWLEGAR 139

Query: 159 -------QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
                    GL  W+PNINI RDPRWGR  ETP EDP+V S Y V + K  Q       +
Sbjct: 140 ENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQ-------E 192

Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
           G      ++KR L+                  KH  AY  E +    R +FNA+++  D 
Sbjct: 193 G------KDKRFLQAV-------------VTLKHYAAYSYEHYDGIDRMAFNAVVSRYDF 233

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAV 330
            DT+ P F + +  GKA  +MCSYN VNG+P C    L  K  R+  GF GYITSD  A+
Sbjct: 234 ADTYLPAFEASVVHGKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAI 293

Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
           A I+  ++YTKT  ++    + +G D+N G+   +     +  G++ EK +D A+     
Sbjct: 294 AGIYHQRHYTKTLCEAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLK 353

Query: 391 VQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
           ++  LGLF  DP   + Y  + P++V T+E K+L+LD +R+ IVLL+N    LPL K   
Sbjct: 354 LRFELGLF--DPIDDQPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKG-- 409

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG-----LEAYV-----SKTHYA--SGCH 497
             LA+IGP       + G Y G  C    L  G     LEA       S T YA  SG +
Sbjct: 410 KKLAVIGPHAAAKRALLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGIN 469

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
           D    S AGF EA   A+KA+ V++  G+D + E E  DR ++ +P  QM L+  V R  
Sbjct: 470 DT---STAGFDEAEAAARKAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAG 526

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           K P ++VL  GG   V   E       ++   YPG  GA+A+++I+FGD  P G+LP+T 
Sbjct: 527 K-PTVVVLFNGGV--VGAEELILHTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTM 583

Query: 618 YPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           YP ++ T V M  M+M      +YPGRSYR+Y    V+ FG GLSYT ++    S     
Sbjct: 584 YPSNYVTSVDMKSMSM-----TKYPGRSYRYYKEVPVFPFGWGLSYTRFTMALDS----- 633

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
             S+ +   S+  ++ +         +D+          V + ++N G++ G  VV  F 
Sbjct: 634 --SSGVTDPSEPIVVTR--------QLDQT---------VTVILSNDGNLVGDEVVFAFF 674

Query: 737 RVPKV----SQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
           R  KV    +     +QL  + RV       +++ F +
Sbjct: 675 RPLKVNATGNAALLNEQLFDYRRVSLRPTQYRKLKFRI 712


>gi|363742357|ref|XP_003642627.1| PREDICTED: probable beta-D-xylosidase 5-like [Gallus gallus]
          Length = 748

 Score =  345 bits (884), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 250/755 (33%), Positives = 356/755 (47%), Gaps = 114/755 (15%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLH 104
           +PF + +L    R + L+  LT  E + Q++         A  IPRLGI  Y W +E L 
Sbjct: 27  FPFRDPTLPWHRRLEDLLGRLTPAEMVLQMARGGALGNGPAPPIPRLGIAPYNWNTECLR 86

Query: 105 GIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL------- 157
           G A   PG          T+FPQ L  AA+F+  L   + +A A E RA +N        
Sbjct: 87  GDA-EAPG--------WATAFPQALGLAAAFSPELVYRVANATATEVRAKHNSFVAAGRY 137

Query: 158 -GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
               GL+ ++P +NI R P WGR QET GEDP + +  A  FV+  QG++          
Sbjct: 138 DDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPYLTAELATSFVQGLQGQH---------- 187

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                R +K              SA CKH   +   +    SR SF+A + E+D   TF 
Sbjct: 188 ----PRYIKA-------------SAGCKHFSVHGGPENIPVSRLSFDAKVLERDWHTTFL 230

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
           P F++C+  G  S  MCSYN++NGVPAC    L     R EWGF+GY+ SD  AV  I  
Sbjct: 231 PQFQACVRAGSYS-FMCSYNRINGVPACANKKLLTDILRGEWGFEGYVVSDEGAVELILL 289

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDIDRALLNLFSV 391
              YT T  ++A   + AG+++     M      H   A+  G +  + +   +  LF  
Sbjct: 290 GHRYTHTFLETAIASVNAGLNLELSYGMRNNVFMHIPKALAMGNITLEMLRDRVRPLFYT 349

Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
           +LRLG F+  P    Y  L    V +SEH+ L+L+AA +  VLLKN +  LPL +     
Sbjct: 350 RLRLGEFD-PPAMNPYNALELSVVQSSEHRNLSLEAAIKSFVLLKNQRDTLPLRELHGKR 408

Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLL---RGLEAYVSKTHYASGCHDVPC--NSDAG 506
           LA++GP  +N   + G Y  +P  P+ +    RGL+   +   +A+GC +  C   S   
Sbjct: 409 LAVVGPFADNPRVLFGDYAPVP-EPQYIYTPRRGLQTLPANVSFAAGCREPRCWVYSRDE 467

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK-RPVILVL 565
              AVR    AD V+V  G  +  E E RDR  L LPG Q+ L+    R +   PVIL+L
Sbjct: 468 VENAVR---GADVVLVCLGTGIDVEMEARDRKDLSLPGHQLQLLQDAVRAAAGHPVILLL 524

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD--FNPGGRLPMTWYPESFT 623
              GPLDVS+A+    + +IL   +P +A   A+A ++ G    +P GRLP TW P    
Sbjct: 525 FNAGPLDVSWAQLHDGVGAILACFFPAQATGLAIASVLLGKQGASPAGRLPATW-PAGMH 583

Query: 624 KV-PMNDMNMRADSSRQYPGRSYRFY-TGTQVYGFGHGLSYTNYSYK-FLSAPSELTISA 680
           +V PM +  M         GR+YR+Y     +Y FG+GLSYT + Y+  + +P  L I A
Sbjct: 584 QVPPMENYTME--------GRTYRYYGQEAPLYPFGYGLSYTTFHYRDLVLSPPVLPICA 635

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
           +L                                 V + + N G  D   VV L+ R  +
Sbjct: 636 NLS--------------------------------VSVVLENTGPRDSEEVVQLYLRWEQ 663

Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
            S   P  QL+ F RV   A G+ ++SFGV   ++
Sbjct: 664 PSVPVPRWQLVAFRRVAVPAGGATKLSFGVTAAQR 698


>gi|423279990|ref|ZP_17258903.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
           610]
 gi|404584326|gb|EKA88991.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
           610]
          Length = 722

 Score =  345 bits (884), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 228/745 (30%), Positives = 360/745 (48%), Gaps = 97/745 (13%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           + S  I+ R K+LI  +TL EK  QL   + +IPRL +PAY +W+E LHG+A  G     
Sbjct: 53  DLSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAG----- 107

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDP
Sbjct: 108 -----EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDP 162

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V FVK  QG                           D   
Sbjct: 163 RWGRNEETYGEDPYLTSRLGVAFVKGLQG---------------------------DHPA 195

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E     +R+S ++ I  +   + + P + +C+++     +M +Y
Sbjct: 196 YLKTVATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAY 251

Query: 296 NQVNGVP-ACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NGVP +  R  L +  R EWGF G++ SDC A+  +        + E++AA  + +G
Sbjct: 252 NAFNGVPPSGSRWLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSG 311

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
            D+ CGT        A+ +G + E  ID+AL  + + + +LG F  DP +   Y      
Sbjct: 312 CDLECGTTYKEKLVQAVKQGLISEATIDQALTRVLTARFKLGEF--DPMELVPYNHYDKK 369

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            +   +  +LA +AA + +VLLKN+   LPL+K    S+A++GP  ++     GGY+G P
Sbjct: 370 LLAGKKFAELAYEAAVKSVVLLKNE-NLLPLSKEKTKSVAVVGPFADH--NYLGGYSGQP 426

Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
               +LL+G++  + K    +  + +  + D+     V   K  D V+V  G D     E
Sbjct: 427 PYSVTLLKGVKDLMGKRGKVNYLNGIGASRDS----IVAAVKGVDVVLVALGSDEKMARE 482

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
           + D  S+ LP +Q  L+ ++ + + R ++LV   G PL   +  AD  I +I+   YPG+
Sbjct: 483 NHDMTSIYLPEEQEKLLKAIYQVNPR-IVLVFHSGNPLTSEW--ADVHIPAIMQAWYPGQ 539

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
              +ALA+++FG+ NP G+LPMT Y        + D +M       + GR+YR+     +
Sbjct: 540 EAGRALADLLFGNENPSGKLPMTIYRAEDQLPDILDFDM-------WKGRTYRYMKEDPL 592

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           YGFGHGLSYT++ +  +        S +LK+G+                          R
Sbjct: 593 YGFGHGLSYTSFGFDGIQG------SDTLKSGA--------------------------R 620

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDP 772
               + ++N G   G  VV ++         T P K+L+ F +V       K + F + P
Sbjct: 621 LQCSVELSNTGKWTGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNIPP 680

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
            E LS+      R+L  G + L +G
Sbjct: 681 RE-LSVWENGNWRML-TGKYTLFIG 703


>gi|340369765|ref|XP_003383418.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 748

 Score =  344 bits (883), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 246/781 (31%), Positives = 363/781 (46%), Gaps = 115/781 (14%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLH 104
           +PF + SL I  R K ++  L+L + ++Q++         A  IP+  I  Y+W +E L 
Sbjct: 27  FPFRDPSLPIEERVKDIVDQLSLDQLVEQMAHGGAGSNGPAPGIPKFNIKPYQWGTECLS 86

Query: 105 GIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG------ 158
           G  + G            TSFP  +  AASFN  L   + +A A E RA           
Sbjct: 87  GDVNAG----------DATSFPMSIGMAASFNYDLLKQVSNATAYEVRAKNTAAVLNGSY 136

Query: 159 --QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
               GL+ W+P +NI RDPRWGR QET GEDP +       FV   QG            
Sbjct: 137 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAFVTGLQG------------ 184

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                          D    ++ +A CKH   +   +     R SF+A +T  D   TF 
Sbjct: 185 ---------------DDPTYVIANAGCKHFDVHGGPEDTPLPRASFDANVTMIDWRMTFL 229

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
           P F++C+E G  S +MCSYN++NGVPAC    L     RNEW FKGY+ SD  A+  I  
Sbjct: 230 PQFKACVEAGALS-LMCSYNRINGVPACANKKLLTDILRNEWNFKGYVVSDQGALENIVT 288

Query: 336 YQNY-----TKTHEDSAAGV-LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
             +Y     T   + + AG  L+ G     G  +  +   A++KG V    +  A+  LF
Sbjct: 289 QHHYAPDFVTAAADAANAGTCLEDGNSEGKGGNVFDNLDDAVEKGLVSVDTLKDAVSRLF 348

Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK---FLPLNK 446
            V+ +LG F+       Y  +    + + EH KL++ AA + IVL+KND     FLPL  
Sbjct: 349 YVRTKLGEFDPPDNNNPYANIPLSIIQSDEHIKLSIQAAMETIVLMKNDNDGSPFLPLAA 408

Query: 447 NAVSSLAIIGPLVNNISQMGGGY-----TGIPCSPKSLLRGLEAYVSKTHYASGCHDVP- 500
           +      ++GP + N   M G Y     T    +P + ++  +      +Y  GC D P 
Sbjct: 409 DDFKKACVVGPFIENADTMFGDYSPTMMTDYIVTPLAGIKTTQIGSDLLNYEDGCTDGPA 468

Query: 501 CNSDAGFHEAVRIA-KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS-K 558
           C    G+   VR A +  D VIV AGL    E E  D   + LPG QMSL+T     S  
Sbjct: 469 CEIYDGYK--VRTACEGVDLVIVTAGLSRYLEHEGHDISDIYLPGHQMSLLTDAESASGS 526

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
            P+IL+L    PLD+S+A+++ + ++IL   YPG+    A+A ++ G +NP GRLP TW 
Sbjct: 527 APIILLLFNANPLDISYAKSNPRFAAILEAYYPGQEAGVAIANVLTGSYNPAGRLPNTW- 585

Query: 619 PESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
           P S  +VP M D  M+         R+YR++T   +Y FG+GLS+T ++Y      S+L 
Sbjct: 586 PASLDQVPDMIDYTMKE--------RTYRYFTQEPLYPFGYGLSFTTFNY------SDLN 631

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
           ++++     + +I                         V ++V N G +DG  V   + +
Sbjct: 632 VASTANTNGEGSIA------------------------VSVTVMNTGTMDGDEVTQAYVK 667

Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSIANKHGRRILPLGNHVLMV 796
              V++  P  QL+G  R       S  +SF + P + Q+ I    G+  +P G + L V
Sbjct: 668 WDNVAEA-PNIQLVGVSRKFISKGQSITVSFTIKPEQLQVWINGDDGKWSIPGGTYSLFV 726

Query: 797 G 797
           G
Sbjct: 727 G 727


>gi|301118693|ref|XP_002907074.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
 gi|262105586|gb|EEY63638.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
          Length = 809

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 254/758 (33%), Positives = 370/758 (48%), Gaps = 103/758 (13%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASA---IPRLGIPAYEWWS 100
           C  P    + FCN SLS + R + L+  L L EK+  L+  AS    +  +G+P Y W +
Sbjct: 27  CSSPEHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGA 86

Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG-- 158
             +HG+ S   G N        TSFP  +   A F+     ++   V  E RA++  G  
Sbjct: 87  NCVHGVQSTC-GTN------CATSFPNPVNLGAIFDPRAVFDMAQVVGWELRALWLEGAR 139

Query: 159 -------QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
                    GL  W+PNINI RDPRWGR  ETP EDP+V S Y V + K  Q       +
Sbjct: 140 ENYATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQ-------E 192

Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
           G      ++KR L+                  KH  AY  E +    R +FNA+++  D 
Sbjct: 193 G------KDKRFLQAV-------------VTLKHYAAYSYEHYDGIDRMAFNAVVSRYDF 233

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAV 330
            DT+ P F + +  GKA  +MCSYN VNG+P C    L  K  R+  GF GYITSD  A+
Sbjct: 234 ADTYLPAFEASVVHGKAKGVMCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAI 293

Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
           A I+  ++YTKT  ++    + +G D+N G+   +     +  G++ EK +D A+     
Sbjct: 294 AGIYHQRHYTKTLCEAGRLAILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLK 353

Query: 391 VQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
           ++  LGLF  DP   + Y  + P++V T+E K+L+LD +R+ IVLL+N    LPL K   
Sbjct: 354 LRFELGLF--DPIDDQPYWHVAPNEVNTAESKQLSLDLSRKSIVLLQNHGNILPLAKG-- 409

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG-----LEAYV-----SKTHYA--SGCH 497
             LA+IGP       + G Y G  C    L  G     LEA       S T YA  SG +
Sbjct: 410 KKLAVIGPHAAAKRALLGNYLGQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGIN 469

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
           D    S  GF EA   A+KA+ V++  G+D + E E  DR ++ +P  QM L+  V R  
Sbjct: 470 DT---STGGFDEAEAAARKAETVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAG 526

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           K P ++VL  GG   V   E       ++   YPG  GA+A+++I+FGD  P G+LP+T 
Sbjct: 527 K-PTVVVLFNGGV--VGAEELILHTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTM 583

Query: 618 YPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           YP ++ T V M  M+M      +YPGRSYR+Y    V+ FG GLSYT ++    S     
Sbjct: 584 YPSNYVTSVDMKSMSM-----TKYPGRSYRYYKEVPVFPFGWGLSYTRFTMALDS----- 633

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
             S+ +   S+  ++ +         +D+          V + ++N G++ G  VV  F 
Sbjct: 634 --SSGVTDPSEPIVVTR--------QLDQT---------VTVILSNDGNLVGDEVVFAFF 674

Query: 737 RVPKV----SQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
           R  KV    +     +QL  + RV       +++ F +
Sbjct: 675 RPLKVNATGNAALLNEQLFDYRRVSLRPTQYRKLKFRI 712


>gi|325679939|ref|ZP_08159508.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
           albus 8]
 gi|324108377|gb|EGC02624.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
           albus 8]
          Length = 691

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 218/633 (34%), Positives = 330/633 (52%), Gaps = 85/633 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + SLS   RA++L   +T +E+  QL  +A AI RLGIPAY WW+E +HG+A +G   
Sbjct: 4   YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSG--- 60

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AA F+  L        + EARA YN            GLT W
Sbjct: 61  -------VATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           APNINIFRDPRWGRG ET GEDP + +      V+  QG+                +V+K
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTAQNGKAVVRGLQGDG---------------KVMK 158

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                         +AC KH   + +       R+SF+A    +D E+T+ P F + +++
Sbjct: 159 A-------------AACAKH---FAVHSGPEALRHSFDAKADAKDMEETYLPAFEALVKE 202

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
            K   +M +YN+VNG PAC    L +K + EW F GY  SDC A+    E+   T    +
Sbjct: 203 AKVESVMGAYNRVNGEPACASDYLMEKLK-EWEFDGYFVSDCWAIRDFHEHHMVTANAVE 261

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           SAA  LKAG D+NCG C  ++  +A+DKG + ++ I  A ++L   ++RLG+F+   +  
Sbjct: 262 SAAMALKAGCDVNCG-CTYQNLLAALDKGLITKEQIRTACVHLMRTRIRLGMFD---KHT 317

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            +  +    V  +EHK ++L+ A + +VLLKN+   LPL+     ++A+IGP  ++ + +
Sbjct: 318 DFDDIPYSKVACAEHKAVSLECAEKSLVLLKNN-GILPLDDKKYKTIAVIGPNADSRTAL 376

Query: 466 GGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH----DVPCNSDAG--FHEAVRIAKKAD 518
            G Y G+     + L G++  +  +  +A GCH     +   + AG  + EAV  AK AD
Sbjct: 377 EGNYNGLSDRYTTFLNGIQDRFEGRVIFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNAD 436

Query: 519 FVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
            VI+  GLD T E E+          D+  L LP  Q  LV  +    K PV+ V+  G 
Sbjct: 437 LVIMCVGLDATIEGEEGDTGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGS 495

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
            ++      +SQ  +++   YPG  G+KALAE++FGD +P G+LP+T+Y ++       D
Sbjct: 496 AINT-----ESQPDALIHAFYPGAEGSKALAEVLFGDVSPSGKLPVTFYEDTDKLPEFTD 550

Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
            +M+        GR+YR+ T   ++ FG+GL+Y
Sbjct: 551 YSMK--------GRTYRYTTDNILFPFGYGLTY 575


>gi|291240559|ref|XP_002740189.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 745

 Score =  343 bits (879), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 243/758 (32%), Positives = 355/758 (46%), Gaps = 110/758 (14%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLH 104
           +PF NTSL  + R + L+  L L+E + Q+S         A  I RL I  Y W +E L 
Sbjct: 25  FPFRNTSLPWNKRVEDLVGRLKLEEIVLQMSRGGRYSNGPAPPIDRLNIGPYSWNTECLR 84

Query: 105 GIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL------- 157
           G  S GP           TSFPQ    AA+F+  L   I +A A E RA YN        
Sbjct: 85  GDLSAGPA----------TSFPQAFGLAATFDAVLIKQIANATAYEVRAKYNNYTKHKEY 134

Query: 158 -GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
               GL+ ++P INI R P WGR QET GEDP +    A  FV   QG + +      G 
Sbjct: 135 GDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASFVTGLQGNHPRYVTANAG- 193

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                                     CKH  AY   +    SR +F+A ++++D   TF 
Sbjct: 194 --------------------------CKHFDAYAGPENIPSSRSTFDAKVSDRDLRMTFL 227

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
           P F  CI+ G  S +MCSYN +NGVPAC    L     R EW F GY+ SD  AV  +++
Sbjct: 228 PAFHECIQAGTYS-LMCSYNSINGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYD 286

Query: 336 YQNYTKTHEDSAAGVLKAGMDI----NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
             +YTK   D+A   + +G+++    N    ++  T  A+ +G V  K +   +  LF  
Sbjct: 287 AHHYTKDMLDTAIACVNSGLNLELSSNLTDNVMMQTTKAVKQGNVTMKTVKARVSPLFYT 346

Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
           ++RLG F+  P    Y KL    + + EH++L+L AA +  VLLKN+ +FLPL K  +  
Sbjct: 347 RMRLGEFD-PPEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENRFLPL-KEKIDK 404

Query: 452 LAIIGPLVNNISQMGGG----YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           LA++GP  +N  ++ G      + +  +P+  L  +      T +ASGC    C ++   
Sbjct: 405 LAVVGPFGDNPIEIYGSKSPDVSNLTVTPRYGLSKIARLA--TTFASGCLSPAC-TEYDP 461

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT-SVARTSKRPVILVLT 566
               +   + D V+V  G     E E  DR  L LPGQQ+ L+  +V   + +PVIL+L 
Sbjct: 462 KSTKQAIDRVDMVVVCLGTGNEVENEAHDRSELTLPGQQLRLLQDAVTFAADKPVILLLF 521

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD--FNPGGRLPMTWYPESFTK 624
             GPLD+++A ++  I  I+   +P +    AL  +       NPGGRLP+TW P+S ++
Sbjct: 522 NAGPLDITWAVSNPAIPVIVECFFPAQTTGTALYHLFVNSPGSNPGGRLPITW-PKSMSQ 580

Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
           VP M D  M         GR+YR++ G  ++ FG+GLSYT + Y      S+L I+ S  
Sbjct: 581 VPPMEDYTME--------GRTYRYFNGDPLFPFGYGLSYTTFHY------SDLLITPS-- 624

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
                                 +  C+S+  ++ + + N GDV G  V   +      S 
Sbjct: 625 --------------------TPIKPCSSI--NIDVFLENTGDVTGDEVTQFYLSWKNASI 662

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
             P+ QL+G  R    +K    I+  V P       NK
Sbjct: 663 PVPKWQLVGVSRTQLQSKTFANIAIIVPPRLMAVYTNK 700


>gi|325970053|ref|YP_004246244.1| beta-glucosidase [Sphaerochaeta globus str. Buddy]
 gi|324025291|gb|ADY12050.1| Beta-glucosidase [Sphaerochaeta globus str. Buddy]
          Length = 698

 Score =  342 bits (878), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 243/767 (31%), Positives = 366/767 (47%), Gaps = 120/767 (15%)

Query: 59  LSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGT 118
           ++   RA+ L+  + L + + QL  +A AI  LGIPAY WW+E LHG A +G        
Sbjct: 1   MTARQRAQELVERMNLPQMMSQLRHDAPAIESLGIPAYNWWNEGLHGSARSG-------- 52

Query: 119 VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNIN 170
             + T FPQ +  A+ F+      + S V+ E RA YNL           GLT W+PN+N
Sbjct: 53  --TATVFPQAIGLASLFDPDFLYAVASVVSTEQRAKYNLFTHENDRDIYKGLTVWSPNVN 110

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           IFRDPRWGRGQET GEDP + +  AV F++  QGE                         
Sbjct: 111 IFRDPRWGRGQETFGEDPYLTARLAVAFIRGLQGE------------------------- 145

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
              G  L  ++C KH  A+   +     R+ FNA++ ++D E+T+ P F S +++ KA  
Sbjct: 146 ---GPVLKTASCVKHFAAHSGPE---PLRHGFNAVVGKKDLEETYLPAFASAVKEAKADA 199

Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +M +Y+ +N  P C    L ++  R  WGF+G   SDC A+         TK  E+SAA 
Sbjct: 200 VMGAYSALNDEPCCASSFLMEETLRLRWGFEGMYISDCWAIRDFHLNHKVTKNEEESAAL 259

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            LK G D+ CG C  +  + A  KG +  + I +A + + + + +LG F+   +   Y  
Sbjct: 260 ALKRGCDLACG-CEYQSLEKAFQKGLITREQIKKAAIRVMTTRFKLGQFD---QGTAYDT 315

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG + + + EH  LA +A+ + +VLLKND   LPL K AVS LA+IGP  ++   + G Y
Sbjct: 316 LGLESLDSDEHAALAFEASCRSLVLLKND-ALLPLKKEAVSCLAVIGPNADSRQALWGNY 374

Query: 470 TGIPCSPKSLLRGLEAYV---SKTHYASGCH------DVPCNSDAGFHEAVRIAKKADFV 520
            G      ++L GL  YV   ++  Y+ G +      +     D    EAV +AK +D V
Sbjct: 375 HGTSSRYVTILEGLRDYVGSSTRILYSEGSNLTKNKVERLAKDDDRLSEAVFMAKASDVV 434

Query: 521 IVVAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
           ++  GL+ T E E           D+  L LP  Q  L+ +VA T K P+I+VL  GG L
Sbjct: 435 VLCLGLNETVEGEMHDDGNGGWAGDKDDLRLPLCQRKLLKAVAETGK-PIIVVLLSGGSL 493

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
           D    E  + + +++   YPG+ G KA+A +++G   P G+LP+T+Y       P  D +
Sbjct: 494 DPEI-EQYANVKALIQAWYPGQEGGKAIAHLLYGALCPSGKLPVTFYKAEAKLPPFTDYS 552

Query: 632 MRADSSRQYPGRSYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
           +          R+YR+     V Y FG GLSY ++S+  LSA  E          +++N 
Sbjct: 553 LIR--------RTYRYCDDPDVLYPFGFGLSYASFSF-CLSAAQE----------TEQNG 593

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
           +  T                       + V N   +D   VV L+  + +     P   L
Sbjct: 594 VAAT-----------------------VLVRNTSALDARTVVQLYLAM-EGKDLPPHPVL 629

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            G   VH  A    +I+F ++  +Q +   + G R    G + L  G
Sbjct: 630 CGMKSVHLKAGEETQITFILEE-KQFTAVQEDGNRYAVRGGYTLYAG 675


>gi|332638085|ref|ZP_08416948.1| glycoside hydrolase family 3 protein [Weissella cibaria KACC 11862]
          Length = 713

 Score =  342 bits (878), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 225/763 (29%), Positives = 370/763 (48%), Gaps = 121/763 (15%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +AK ++  +T+ EKI Q+   A AI RL IP Y +W+E+LHG+A  G            T
Sbjct: 13  QAKVIVDQMTIDEKIGQIKYEAPAIERLNIPEYNYWNEALHGVARAG----------VAT 62

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +  AA+F+  L ++I   +  E RA YN            GLTFW+PN+NIFRDP
Sbjct: 63  VFPQAIGLAATFDDQLINDIADVIGTEGRAKYNEFTKHEDRDIYKGLTFWSPNVNIFRDP 122

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP + S + V F+K  QG+                              
Sbjct: 123 RWGRGHETYGEDPFLTSKFGVAFIKGLQGQ----------------------------AK 154

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L L+A  KH   +   +     R+ F+A+++++D  +T+ P F++ +E+     IM +Y
Sbjct: 155 YLKLAATAKHFAVHSGPEG---LRHGFDAVVSDKDLYETYLPAFKAAVEEADVESIMTAY 211

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N V+GVPA +   L +    ++W F+G++ SD  A   + E   YTK   ++    +KAG
Sbjct: 212 NAVDGVPASVSEMLLRDILHDKWSFEGHVVSDYMAPEDVHENHKYTKDAAETMGLAIKAG 271

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
           +++  G  + +    A+++G V E++I  A+++L++ ++RLG+F  D    +Y  +  + 
Sbjct: 272 LNLVAG-HIEQSLHEALNRGLVTEEEITNAVISLYATRVRLGMFATD---NEYDAIPYEA 327

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
             T  H  L+  AA +  VLLKND   LPL K  + ++A++GP  ++   + G Y G P 
Sbjct: 328 NDTKAHNNLSEIAAEKSFVLLKNDG-VLPLRKETMEAIAVVGPNAHSEIALLGNYFGTPS 386

Query: 475 SPKSLLRGLEAYVS---KTHYASG-------CHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
              ++L G++  +    + HY+ G         +    +D    EA+  A+ +D ++ V 
Sbjct: 387 RSYTILEGIQERLGDDVRVHYSIGSGVFQDHAAEPLAKADERESEAIIAAEHSDVIVAVL 446

Query: 525 GLDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           GLD T E E+ D         + +L LPG+Q  L+  +    K PV+++L  G  L +  
Sbjct: 447 GLDSTIEGEEGDAGNSQGAGDKPNLSLPGRQRQLLERLLAVGK-PVVVLLASGSSLQLDG 505

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
            E    + +I+ I YPG  G  A+A+++FG  +P G+LP+T+Y  +       D NM   
Sbjct: 506 LENHPNLRAIMQIWYPGARGGLAVADVLFGTVSPSGKLPVTFYKNTDNLPAFEDYNM--- 562

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
                 GR+YR+ T   +Y FG+GL+Y++     L   S                     
Sbjct: 563 -----AGRTYRYMTEEALYPFGYGLTYSSVELSDLQVKS--------------------- 596

Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
                   +E  + T       +++ N G+ D   VV ++ +  +     P  QL GF R
Sbjct: 597 -------YEETATAT-------VTIQNTGNFDTDEVVQVYVKDLESEFAVPNAQLKGFKR 642

Query: 756 VHTVAKGSKE-ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           V  + KGSK+ I+F + P +   + ++ G   +      + VG
Sbjct: 643 VF-LGKGSKQTITFDLRP-QDFEVFDEQGHNFIDSNRFEISVG 683


>gi|313145345|ref|ZP_07807538.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
 gi|313134112|gb|EFR51472.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
          Length = 722

 Score =  342 bits (878), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 230/740 (31%), Positives = 361/740 (48%), Gaps = 97/740 (13%)

Query: 61  ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
           I+ R K+LI  +TL EK  QL   + +IPRL +PAY +W+E LHG+A  G          
Sbjct: 58  IAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAG---------- 107

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
            VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDPRWGR 
Sbjct: 108 EVTVFPQAINLASTWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRN 167

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
           +ET GEDP + S   V FVK  QG                           D    L   
Sbjct: 168 EETYGEDPYLTSRLGVAFVKGLQG---------------------------DHPAYLKTV 200

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           A  KH +A + E     +R+S ++ I  +   + + P + +C+++     +M +YN  NG
Sbjct: 201 ATIKHFVANNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNG 256

Query: 301 VP-ACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
           VP +  R  L +  R EWGF G++ SDC A+  +        + E++AA  + +G D+ C
Sbjct: 257 VPPSGSRWLLGEVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGVNSGCDLEC 316

Query: 360 GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPDDVCTS 418
           GT        A+ +G + E  ID+AL  + + + +LG F  DP +   Y       +   
Sbjct: 317 GTTYKEKLVQAVKQGLISEATIDQALTRVLTARFKLGEF--DPMELVPYNHYDKKLLAGK 374

Query: 419 EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKS 478
           +  +LA +AA + +VLLKN+   LPL+K    S+A++GP  ++     GGY+G P    +
Sbjct: 375 KFAELAYEAAVKSVVLLKNE-NLLPLSKEKTKSVAVVGPFADH--NYLGGYSGQPPYSVT 431

Query: 479 LLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRV 538
           LL+G++  + K    +  + +  + D+     V   K  D V+V  G D     E+ D  
Sbjct: 432 LLKGVKDLMGKRGKVNYLNGIGASRDS----IVAAVKGVDVVLVALGSDEKMARENHDMT 487

Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
           S+ LP +Q  L+ ++ + + R ++LV   G PL   +  AD  I +I+   YPG+   +A
Sbjct: 488 SIYLPEEQEKLLKAIYQVNPR-IVLVFHSGNPLTSEW--ADVHIPAIMQAWYPGQEAGRA 544

Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
           LA+++FG+ NP G+LPMT Y        + D +M       + GR+YR+     +YGFGH
Sbjct: 545 LADLLFGNENPSGKLPMTIYRAEDQLPDILDFDM-------WKGRTYRYMKEDPLYGFGH 597

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT++ +  +        S +LK+G                        T+L+  V++
Sbjct: 598 GLSYTSFGFDGIQG------SDTLKSG------------------------TTLQCSVEL 627

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLS 777
           S  N G   G  VV ++         T P K+L+ F +V       K + F + P E LS
Sbjct: 628 S--NTGKWTGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNIPPRE-LS 684

Query: 778 IANKHGRRILPLGNHVLMVG 797
           +      R+L  G + L +G
Sbjct: 685 VWENGNWRML-TGKYTLFIG 703


>gi|317057539|ref|YP_004106006.1| glycoside hydrolase family protein [Ruminococcus albus 7]
 gi|315449808|gb|ADU23372.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7]
          Length = 691

 Score =  342 bits (876), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 219/633 (34%), Positives = 331/633 (52%), Gaps = 85/633 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + +LS   RA++L   +T +E+  QL  +A A+ RLGIPAY WW+E +HG+A +G   
Sbjct: 4   YLDETLSAQERAEALTDEMTTEEQASQLRYDAPAVERLGIPAYNWWNEGIHGLARSG--- 60

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AA F+  L        + EARA YN            GLT W
Sbjct: 61  -------VATMFPQAIGLAAMFDDELTKKTAEVTSEEARAKYNAYSGEEDRDIYKGLTLW 113

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           APNINIFRDPRWGRG ET GEDP + +   +  V+  QG      DG         +V+K
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTTKNGMAVVRGLQG------DG---------KVIK 158

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                         +AC KH   + +       R+SF+A    +D E+T+ P F + +++
Sbjct: 159 A-------------AACAKH---FAVHSGPEAIRHSFDAKANAKDMEETYLPAFEALVKE 202

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
            K   +M +YN+VNG PAC    L  K + EW F GY  SDC A+    E    T    +
Sbjct: 203 AKVESVMGAYNRVNGEPACASNFLMDKLK-EWEFDGYFVSDCWAIRDFHENHMVTANAIE 261

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           S A  LKAG D+NCG C  ++   A++KG V ++DI  A ++L   ++RLG+F+   +K 
Sbjct: 262 STAMALKAGCDVNCG-CTYQNLLVALEKGAVTKEDIRTACVHLMRTRIRLGMFD---KKT 317

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
           +Y  +  D V   EHK ++L+ A + +V+L+N+   LP++ +   ++A+IGP  ++ + +
Sbjct: 318 EYDDIPYDKVACKEHKAISLECAEKSLVMLENN-GILPVDTSKYKTIAVIGPNADSRTAL 376

Query: 466 GGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH---DVPCN-SDAG--FHEAVRIAKKAD 518
            G Y G+     + L G++  +  +  +A GCH   D   N + AG  + EAV  AK AD
Sbjct: 377 EGNYNGLSDRYTTFLNGIQDRFDGRVIFAEGCHLYKDRVSNLAQAGDRYAEAVAAAKFAD 436

Query: 519 FVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
             I+  GLD T E E+          D+  L LP  Q  LV  +    K PV+ V+  G 
Sbjct: 437 MTILCLGLDATIEGEEGDTGNEFSSGDKNGLTLPPPQRELVKKIMAVGK-PVVTVVCAGS 495

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
            ++      +S+  +++   YPG  G KALAE++FGD +P G+LP+T+Y ++       D
Sbjct: 496 AINT-----ESKPDALIHAFYPGAEGGKALAEVLFGDVSPSGKLPVTFYEDTDKLPEFTD 550

Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
            +M+        GR+YR+ T   +Y FG+GL+Y
Sbjct: 551 YSMK--------GRTYRYTTENVLYPFGYGLTY 575


>gi|253579611|ref|ZP_04856880.1| glycoside hydrolase, family 3 domain-containing protein
           [Ruminococcus sp. 5_1_39B_FAA]
 gi|251849112|gb|EES77073.1| glycoside hydrolase, family 3 domain-containing protein
           [Ruminococcus sp. 5_1_39BFAA]
          Length = 706

 Score =  342 bits (876), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 234/757 (30%), Positives = 356/757 (47%), Gaps = 121/757 (15%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +A+ L+S +TL EK  QL  +A+ + RLG+PAY +W+E+LHG+A  G            T
Sbjct: 14  KAEKLVSQMTLLEKASQLKYDAAPVKRLGVPAYNYWNEALHGVARAG----------VAT 63

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +  AA F+      +G  +A E RA YN   A        GLTFW+PN+NIFRDP
Sbjct: 64  MFPQAIAMAAVFDDEEMKKVGDIIATEGRAKYNAYSAKEDRDIYKGLTFWSPNVNIFRDP 123

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP + S   V+FV+  QG+                            G 
Sbjct: 124 RWGRGHETYGEDPYLTSRLGVKFVEGIQGD----------------------------GP 155

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            +  +AC KH   Y +       R+ F+A  + +D  +T+ P F + + +     +M +Y
Sbjct: 156 VMKAAACAKH---YAVHSGPESLRHEFDAQASMKDMWETYLPAFEALVTEADVEAVMGAY 212

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N+ NG P C    L +   R +W F+G+ TSDC A+    E+   T T   SAA  L AG
Sbjct: 213 NRTNGEPCCAHKYLMEDVLRGKWKFEGHYTSDCWAIRDFHEHHMVTSTPRQSAAMALNAG 272

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            D+NCG   L H   A   G V E+ I  + + L + +  LGLF+G     +Y K+    
Sbjct: 273 CDLNCGNTYL-HMMGAYQDGLVTEEKITESAVRLLTTRYLLGLFDG----SEYDKIPYSV 327

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           V   EH   AL  AR+  VLLKND   LP++K  V+++ +IGP  ++ + + G Y G   
Sbjct: 328 VECKEHIDEALKMARKSCVLLKND-GVLPIDKTKVNTIGVIGPNADSRAALIGNYHGTSS 386

Query: 475 SPKSLLRGLEAYVS---KTHYASGC-------HDVPCNSDAGFHEAVRIAKKADFVIVVA 524
              ++L G+        +  Y+ GC        ++  + D    EAV  A+ +D VI+  
Sbjct: 387 EYITVLEGIREEAGDDVRILYSQGCDLYKDKVENLAWDQDR-ISEAVITAENSDVVILCV 445

Query: 525 GLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           GL+ T E E+          D+V L LP  Q  L+  V    K P I+VL  G  +D+++
Sbjct: 446 GLNETLEGEEGDTGNSDASGDKVDLHLPKVQEELIEKVTAVGK-PTIVVLMAGSAIDLNY 504

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
           A+     + IL   YPG  G +A+A+++FG  +P G+LP+T+Y +        D +M+  
Sbjct: 505 AQ--DNCNGILLAWYPGARGGRAIADLLFGKESPSGKLPITFYKDLEGMPEFTDYSMK-- 560

Query: 636 SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
                  R+YR+     +Y FG+GL+Y++      +  +E  +   + A SD        
Sbjct: 561 ------NRTYRYMEKEALYPFGYGLTYSD------TCVTEAEVVGEVSAESD-------- 600

Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
                               ++ +V N G VD   VV ++ +            L GF R
Sbjct: 601 ------------------IVLKATVKNNGTVDTDEVVQVYIKDLDSPLAVRNYSLCGFKR 642

Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNH 792
           V   A   K + F +   + ++I ++ G R +  G H
Sbjct: 643 VSLKAGEEKSVEFTISN-KAMNIVDEDGNRYIA-GKH 677


>gi|291240561|ref|XP_002740190.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 763

 Score =  341 bits (875), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 245/756 (32%), Positives = 355/756 (46%), Gaps = 118/756 (15%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIP--RLGIPAYEWWSESLHGIAS 108
           +YPF NTSLS   R   L+S LTL E + Q++  + A P  RLGI  Y W SE LHG+  
Sbjct: 25  AYPFQNTSLSWEERVDDLVSRLTLDEMVLQMARTSPAPPIDRLGIKPYVWNSECLHGVVP 84

Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQA 160
                         T+FPQ +  AASF+  L S++  A+ +E RA +N            
Sbjct: 85  PD---------GLATAFPQSIGLAASFSPDLLSDVAKAIGLEVRAKHNDYVQRGVYQEHT 135

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GL+ ++P INI R P WGR QET GEDP ++      +V+  QG                
Sbjct: 136 GLSCFSPVINIARHPLWGRNQETYGEDPFLIGELGSAYVRGLQG---------------- 179

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
                      D    ++ +A CKH   +   +    SR+SF+A + E+D + TF P F 
Sbjct: 180 -----------DHPRYVLANAGCKHFDVHGGPEDIPVSRFSFDAKVFERDWQMTFLPAFH 228

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
            C++ G  S +MCSYN++N VPAC    L     R EWGF GY+ SD  AV  I    +Y
Sbjct: 229 ECVKAGVYS-VMCSYNRINEVPACANTRLLTDILRKEWGFDGYVVSDEGAVEFIMTSHHY 287

Query: 340 TKTHEDSAAGVLKAGMDIN----CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
           T +  D+ A  + AG +++     G  M      A+  GK++EK +   +  LF  ++RL
Sbjct: 288 TDSIVDTVASAVNAGCNLDLAFPVGDGMYIKIGDAVTAGKIKEKTVVERVKPLFYTRMRL 347

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ-----GIVLLKNDKKFLPLNKNAVS 450
           G F+  P    Y  L    V + EH++LA+ AA Q       VLLK + + LPL+   V+
Sbjct: 348 GEFD-PPELNPYANLNLSVVQSEEHRELAVKAALQSFVLLNFVLLKREGRVLPLD-TLVN 405

Query: 451 SLAIIGPLVNNISQMGGGYTGIPCSPKSLL----RGLEAYVSKTHYASGCHDVPCNSDAG 506
            LA+IGP  +N S + G Y+  P   K  +    +GL      T    GC   PC +   
Sbjct: 406 KLAVIGPFADNPSYLFGDYS--PNPDKEFVVTPCKGLSNAARDTRCTPGCLTAPCTT--Y 461

Query: 507 FHEAVRIA-KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK-RPVILV 564
           F E V+ A   AD ++V  G  +  E E  DR  L LPG+Q  L+  V + +  +P+IL+
Sbjct: 462 FSEMVKAAVTGADLIVVCLGTGVKIEAEFVDRSDLSLPGKQFQLLQDVVKYANGKPIILL 521

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF-------GDFNPGGRLPMTW 617
           L   GPLD+ +A  +  I  I+   +P +A   AL  +         G+ NPGGRLP+TW
Sbjct: 522 LFNAGPLDIVWAVENPAIQVIVACFFPSQATGDALYRMFMNTHGVDTGNGNPGGRLPITW 581

Query: 618 YPESFTKV-PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
            P S  +V PM +  M         GR+YR++ G  ++ FG+GLSY ++SY      S L
Sbjct: 582 -PRSMNQVPPMTNYTME--------GRTYRYFNGDPLFPFGYGLSYGSFSY------SSL 626

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            I  S                        + +C  ++  V +     G   G  V  ++ 
Sbjct: 627 VIWPS-----------------------TIPACNGVKVSVTVYKLGPG---GDEVTQVYM 660

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
                S   P+ QL+ F R +    G  E+ F + P
Sbjct: 661 SWNNASVVVPKLQLVAFKRFYLETNGVTEVHFTIAP 696


>gi|332377068|gb|AEE64772.1| Xyl3A [Ruminococcus albus 8]
          Length = 691

 Score =  340 bits (872), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 217/633 (34%), Positives = 328/633 (51%), Gaps = 85/633 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + SLS   RA++L   +T +E+  QL  +A AI RLGIPAY WW+E +HG+A +G   
Sbjct: 4   YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSG--- 60

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AA F+  L        + EARA YN            GLT W
Sbjct: 61  -------VATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           APNINIFRDPRWGR  ET GEDP + +      V+  QG+                +V+K
Sbjct: 114 APNINIFRDPRWGRSHETFGEDPYLTAQNGKAVVRGLQGDG---------------KVMK 158

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                         +AC KH   + +       R+SF+A    +D E+T+ P F + +++
Sbjct: 159 A-------------AACAKH---FAVHSGPEALRHSFDAKADAKDMEETYLPAFEALVKE 202

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
            K   +M +YN+VNG PAC    L +K + EW F GY  SDC A+    E+   T    +
Sbjct: 203 AKVESVMGAYNRVNGEPACASDYLMEKLK-EWEFDGYFVSDCWAIRDFHEHHMVTANAVE 261

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           SAA  LKAG D+NCG C  ++  +A+DKG + ++ I  A ++L   ++RLG+F+   +  
Sbjct: 262 SAAMALKAGCDVNCG-CTYQNLLAALDKGLITKEQIRTACVHLMRTRIRLGMFD---KHT 317

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            +  +    V  +EHK ++L+ A + +VLLKN+   LPL+     ++A+IGP  ++ + +
Sbjct: 318 DFDDIPYSKVACAEHKAVSLECAEKSLVLLKNN-GILPLDDKKYKTIAVIGPNADSRTAL 376

Query: 466 GGGYTGIPCSPKSLLRGLE-AYVSKTHYASGCH----DVPCNSDAG--FHEAVRIAKKAD 518
            G Y G+     + L G++  +  +  +A GCH     +   + AG  + EAV  AK AD
Sbjct: 377 EGNYNGLSDRYTTFLNGIQDRFEGRVIFAEGCHLYKKSISGLAQAGDRYAEAVAAAKNAD 436

Query: 519 FVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
            VI+  GLD T E E+          D+  L LP  Q  LV  +    K PV+ V+  G 
Sbjct: 437 LVIMCVGLDATIEGEEGDTGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGS 495

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
            ++      +SQ  +++   YPG  G KALAE++FGD +P G+LP+T+Y ++       D
Sbjct: 496 AINT-----ESQPDALIHAFYPGAEGGKALAEVLFGDVSPSGKLPVTFYEDTDKLPEFTD 550

Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
            +M+        GR+YR+ T   ++ FG+GL+Y
Sbjct: 551 YSMK--------GRTYRYTTDNILFPFGYGLTY 575


>gi|413925161|gb|AFW65093.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 323

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 163/322 (50%), Positives = 206/322 (63%), Gaps = 31/322 (9%)

Query: 36  VLNKPDFPCKPPHFDS---YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLG 92
           V   P F C P   ++     FC+ +L+ + RA  L+S LT  EKI QL D A  +PRLG
Sbjct: 25  VAGDPPFSCGPSSAEASEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLG 84

Query: 93  IPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR 152
           +P Y+WW+E+LHG+A++G G++F+  V + TSFPQVL++AA+F+  LW  IG A+  EAR
Sbjct: 85  VPGYKWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREAR 144

Query: 153 AMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
           A++N+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V S YAV FV+  QG +  S  
Sbjct: 145 ALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQGNSSSS-- 202

Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
                                    L  SACCKH  AYDLE W   +RYSF A +TEQD 
Sbjct: 203 ------------------------LLQTSACCKHATAYDLEDWNGVARYSFVARVTEQDL 238

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAV 330
           EDTF PPFRSC+ + KASC+MC+Y  +NGVPAC   DL     R +WG  GY+ SDCDAV
Sbjct: 239 EDTFNPPFRSCVVEAKASCVMCAYTAINGVPACANSDLLTGTVRGDWGLDGYVASDCDAV 298

Query: 331 ATIFEYQNYTKTHEDSAAGVLK 352
           A + + Q Y  T ED+ A  LK
Sbjct: 299 AIMRDAQRYAPTPEDAVAVSLK 320


>gi|358061481|ref|ZP_09148135.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
           WAL-18680]
 gi|356700240|gb|EHI61746.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
           WAL-18680]
          Length = 695

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 209/626 (33%), Positives = 323/626 (51%), Gaps = 85/626 (13%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +A  L+  +TL+E+  Q+  +A A+PRLGIPAY WW E LHG+A  G          + T
Sbjct: 9   KAVRLVEQMTLEERASQMRYDAPAVPRLGIPAYNWWGEGLHGVARAG----------TAT 58

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFWAPNINIFRDP 175
            FPQ +  AA F+  L   I + V+ E RA YN            GLTFW+PN+NIFRDP
Sbjct: 59  MFPQAIAMAAMFDVELTEEIANVVSTEGRAKYNQFCEEGDRDIYKGLTFWSPNVNIFRDP 118

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP + S     FV+  QG+                            G+
Sbjct: 119 RWGRGHETYGEDPYLTSRLGTAFVRGLQGD----------------------------GE 150

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L ++AC KH   + +       R+ F A  +++D  +T+ P F +C+++     +M +Y
Sbjct: 151 HLKIAACAKH---FAVHSGPEALRHEFWADTSKKDLWETYLPAFEACVKEAHVESVMGAY 207

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  +G P C    L ++  R +WGF+G+  SDC A+         T T  +SAA  +K G
Sbjct: 208 NSYHGEPCCANTLLMEEILRGQWGFEGHFVSDCWAIRDFHMNYMVTDTAMESAALAVKKG 267

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
            D+NCG   L+    A ++G + +  +  A++ LF+ +  LG+      + +Y  +  + 
Sbjct: 268 CDLNCGNTYLQ-VLKACEEGLLDDACVTEAVVRLFTTRYLLGM----GEETEYDDIPYEV 322

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           V   EH++LA++AAR+ +VLLKND   LPL+   ++++A+IGP  +N + + G Y G   
Sbjct: 323 VECKEHRELAVEAARRSMVLLKNDG-LLPLHAEKLNTIAVIGPNADNRTALIGNYHGTSS 381

Query: 475 SPKSLLRGLEAYVS---KTHYASGCH----DVPCNSDAG--FHEAVRIAKKADFVIVVAG 525
              ++L G++  V    +  YA GCH     V   + AG    EA  +AK +D V++  G
Sbjct: 382 CYTTILEGIQDAVGEDVRVLYAEGCHLFKDRVEHLAVAGDRLSEARIVAKHSDVVVLCVG 441

Query: 526 LDLTQETEDRD---------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFA 576
           LD T E E+ D         +  LLLP  Q  L+  +    K PV++    G  +D+S A
Sbjct: 442 LDETLEGEEGDTGNSHASGDKKDLLLPESQRRLMEEILNLGK-PVVVCNMSGSAIDLSLA 500

Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
           +   +  +++ + YPG  G +ALA+++FG  +P G+LP+T+Y +     P  D +M    
Sbjct: 501 Q--EKAGAVIQVWYPGAEGGRALADLLFGKASPSGKLPVTFYKDLENLPPFEDYSM---- 554

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSY 662
                GR+YR+ T   +Y FG GL+Y
Sbjct: 555 ----DGRTYRYLTAEPLYPFGFGLTY 576


>gi|429738050|ref|ZP_19271875.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
           F0055]
 gi|429161155|gb|EKY03583.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
           F0055]
          Length = 722

 Score =  339 bits (869), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 241/760 (31%), Positives = 366/760 (48%), Gaps = 121/760 (15%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +AKS+IS LTL EKI QL+ +A  I RLGI  Y W +E+LHG+  +G            T
Sbjct: 34  KAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDG----------RAT 83

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIFRDP 175
            FPQ +   A+F+  +   IG A+A E RA + + Q        AGLTFWAPN+NIFRDP
Sbjct: 84  VFPQPINLGATFDPKIVHQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRDP 143

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP +       FVK  QG                           D   
Sbjct: 144 RWGRGMETYGEDPFLTGTLGTAFVKGMQG---------------------------DDPF 176

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L  +AC KH   + +      +R++ N   T++D  +T+ P F+  +++GK   IM +Y
Sbjct: 177 YLKAAACGKH---FAVHSGPERTRHTANVEPTKRDLYETYLPAFKMLVQKGKVESIMGAY 233

Query: 296 NQVNGVPAC-----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
            ++ G  +C     L  D+    R +WGFKG++ SDC AV  ++E     K+  ++ A  
Sbjct: 234 QRLYG-ESCSGSKYLLTDIL---RKDWGFKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFA 289

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
           +KAG+++ CG  M R  + AI +  + EKD+D+ALL L   +L+LG+   D     Y + 
Sbjct: 290 IKAGLNLECGNSM-RTMKDAIQQKLITEKDLDKALLPLMMTRLKLGILQPDA-ACPYNEF 347

Query: 411 GPDDVCTSE-HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
            P+ V  SE ++K+A  AA + +VLLKN+   LP+ K+ + +L + GP   +   + G Y
Sbjct: 348 -PESVIGSEANRKIAEQAAEESMVLLKNN-GVLPIAKD-IRTLFVTGPGATDAYYLMGNY 404

Query: 470 TGIPCSPKSLLRGLEAYVSK---THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
            G+     + L G+   VS     +Y  G   V  N +   + +V  ++ A+  I++ G 
Sbjct: 405 FGLSNRYSTYLEGIVGKVSNGTSVNYKQGFMQVFKNLN-DVNWSVSESRGAEVSILIMGN 463

Query: 527 ---------DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
                    D     E  DRV+L LP  QM  +  V++     +++VLTGG P+DV   E
Sbjct: 464 SGNTEGEEGDAIASAERGDRVNLRLPDSQMEYLREVSKDRTNKLVVVLTGGSPIDVK--E 521

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADS 636
                 +++   YPG+ G  ALA ++FGD N  GRLP+T +PES  ++P  +D +M+   
Sbjct: 522 ITELADAVVMAWYPGQEGGVALANLLFGDANFSGRLPVT-FPESADRLPAFDDYSMK--- 577

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
                GR+Y++ T   +Y FG+GLSY+  +Y   +     T +  +              
Sbjct: 578 -----GRTYKYMTDNILYPFGYGLSYSKVTYSNAAVTKMPTKTTPMT------------- 619

Query: 697 RLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
                              V + VTN GD+    VV ++   P     +P + LIGF RV
Sbjct: 620 -------------------VYVDVTNNGDMPVDEVVQVYLSTPGAGNTSPIESLIGFKRV 660

Query: 757 HTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
                 +    F + P E L      G   L  G + + +
Sbjct: 661 KIYPHITVTKDFQI-PMELLETVQADGTSKLLKGEYQIKI 699


>gi|323344407|ref|ZP_08084632.1| beta-glucosidase [Prevotella oralis ATCC 33269]
 gi|323094534|gb|EFZ37110.1| beta-glucosidase [Prevotella oralis ATCC 33269]
          Length = 722

 Score =  338 bits (866), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 237/759 (31%), Positives = 368/759 (48%), Gaps = 119/759 (15%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +AKS+IS LTL EKI QL+ +A  I RLGI  Y W +E+LHG+  +G            T
Sbjct: 34  KAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDG----------RAT 83

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFWAPNINIFRDP 175
            FPQ +   A+F+  +   IG A+A E RA + + Q        AGLTFWAPN+NIFRDP
Sbjct: 84  VFPQPISLGATFDPEIVQQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRDP 143

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRG ET GEDP +       FVK  QG +           F                 
Sbjct: 144 RWGRGMETYGEDPFLTGVLGTAFVKGMQGND----------PFY---------------- 177

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L  +AC KH   + +      +R++ N   T+ D  +T+ P F+  ++QGK   IM +Y
Sbjct: 178 -LKAAACGKH---FAVHSGPERTRHTANVEPTKHDLYETYLPAFKMLVQQGKVESIMGAY 233

Query: 296 NQVNGVPAC-----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
            ++ G  +C     L  D+    R +WGFKG++ SDC AV  ++E     K+  ++ A  
Sbjct: 234 QRLYG-ESCSGSKYLLTDIL---RKDWGFKGHVVSDCGAVTDMYEGHKLVKSEAEAVAFA 289

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKL 410
           +KAG+++ CG  M R  + A+ +  + EKD+D+ALL L   +L+LG+   D     Y + 
Sbjct: 290 IKAGLNLECGNSM-RTMKDALKQKLITEKDLDKALLPLMMTRLKLGILQPD-VACPYNEF 347

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
               + + +++ +A  AA + +VLLKND   LP+ K+ + +L + GP   +   + G Y 
Sbjct: 348 PESVIGSIDNRNIAQRAAEESMVLLKND-GVLPIAKD-IRTLFVTGPGATDAYYLMGNYF 405

Query: 471 GIPCSPKSLLRGLEAYVSK---THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL- 526
           G+     + L G+   VS     +Y  G   V  N +   + +V  ++ A+  I++ G  
Sbjct: 406 GLSDRYSTYLEGIVGKVSNGTSVNYKQGFMQVFKNLN-DVNWSVSESRGAEVSIIIMGNS 464

Query: 527 --------DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
                   D    +E  DRV L LP  QM  +  V++     +++VLTGG P+DV   E 
Sbjct: 465 GNTEGEEGDAIASSERGDRVDLRLPEPQMQYLREVSKDRTNKLVVVLTGGSPIDVK--EI 522

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSS 637
                +++   YPG+ G  ALA ++FGD N  GRLP+T +PE+  K+P  +D +M+    
Sbjct: 523 TELADAVVMAWYPGQEGGVALANLLFGDANFSGRLPVT-FPETTDKLPSFDDYSMK---- 577

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
               GR+Y++ T   +Y FG+GLSY   +Y                          T ++
Sbjct: 578 ----GRTYKYMTDNILYPFGYGLSYGKVAYG-----------------------NATVTK 610

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
           L   H    +S T     V + ++N G++    VV ++   P     +P + L+ F RV 
Sbjct: 611 LPTKH----SSMT-----VSVDLSNDGNMPVDEVVQVYLSTPSAGVTSPIESLVAFKRVK 661

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
                +    F + P E+L    + G   L  G + +M+
Sbjct: 662 IAPHATVTTDFEI-PVERLETVQEDGTSKLLKGEYRVMI 699


>gi|405968899|gb|EKC33925.1| Putative beta-D-xylosidase 5 [Crassostrea gigas]
          Length = 748

 Score =  337 bits (864), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 234/760 (30%), Positives = 356/760 (46%), Gaps = 108/760 (14%)

Query: 42  FPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN--------ASAIPRLGI 93
           F   P    ++PF N SLS S R   L+  LTL + +QQL+          A AI  LGI
Sbjct: 14  FALTPLASSNFPFQNVSLSWSERVDDLVGRLTLDQIVQQLARGGAGLNGGPAPAIENLGI 73

Query: 94  PAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA 153
             Y+W +E L G    G          + TSFPQ +  AA+F++ L  N+  A A E RA
Sbjct: 74  GPYQWNTECLRGDVEAG----------NATSFPQAIGLAAAFSKDLIFNVSKAAATEVRA 123

Query: 154 MYN--------LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGE 205
            +N            GL+ ++P +NI R P WGR QET GEDP +   YA  FV+  QG+
Sbjct: 124 KHNDFVKRGIFTDHTGLSCFSPVVNIMRHPLWGRNQETYGEDPYLSGTYASYFVQGLQGD 183

Query: 206 NWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
           +                         DR   +  +A CKH  A+   +    SR  F+A 
Sbjct: 184 H-------------------------DR--YIQANAGCKHFDAHGGPEDIPESRMGFDAK 216

Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYIT 324
           ++ +D   TF P F+ C++ G A  +MCSYN +NGVPAC    L     R EW F GY+ 
Sbjct: 217 VSMRDLRLTFLPAFQKCVQAG-AYSLMCSYNSINGVPACSNKLLMMDILRGEWNFTGYVV 275

Query: 325 SDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI----NCGTCMLRHTQSAIDKGKVQEKD 380
           SD  A+     + +Y    ED+AAG + AG ++    N    +      A+  GK++E  
Sbjct: 276 SDEGAIENQISFHHYYNNSEDAAAGSVNAGCNLELSGNLTEPVFMKIGDAVKSGKLEESV 335

Query: 381 IDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
           +   +  LF  ++RLG F+  P    Y  +    + + EH+ L+L AA + +VLLK   K
Sbjct: 336 VRNRVKPLFYTRMRLGEFD-PPEMNPYSSVNLSVIQSEEHRNLSLTAAAKSLVLLKRPSK 394

Query: 441 FLP---LNKNAVSSLAIIGPLVNNISQMGGGY--TGIPCSPKSLLRGLEAYVSKTHYASG 495
           F     +       +A+IGP+ NN  Q+ G Y  T  P   K+ L+GL       +YA+G
Sbjct: 395 FSKRHLIGGFPSERMAVIGPMANNTDQIFGDYSPTTDPRFVKTPLKGLTELNFSMNYAAG 454

Query: 496 CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVAR 555
           C D     +    +       AD V+V  G     E+E+ DR  ++LPG+Q+ L+  V  
Sbjct: 455 CVDGTRCLNYSQDDVKTALVGADLVVVCLGTGKDLESENVDRKDMMLPGKQLQLLQDVVS 514

Query: 556 TSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF---GDFNPGGR 612
            + + V L++   GP+++++A+   ++  IL   YP ++   A+ + +    G FNP GR
Sbjct: 515 MTNKAVYLLVFSAGPVNITWAQESERVLIILQCFYPAQSAGDAITQALIMRDGRFNPAGR 574

Query: 613 LPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
           LP TWY  +     M D +M          ++YR++TG  +Y FG+GLSY+ + +     
Sbjct: 575 LPYTWYRYTEQIPEMTDYSMAR--------KTYRYFTGVPLYPFGYGLSYSTFVF----- 621

Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
            S+L     + AG D N+                         VQ+ V N G  DG  V+
Sbjct: 622 -SKLYFLPKVNAG-DPNV-------------------------VQVRVFNEGPFDGDEVL 654

Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
            ++ +     +  P  QL+ F+RV   ++   +IS  + P
Sbjct: 655 QVYIKWMSTKERMPRVQLVAFERVFIRSQQYVDISTAITP 694


>gi|115436096|ref|NP_001042806.1| Os01g0296700 [Oryza sativa Japonica Group]
 gi|113532337|dbj|BAF04720.1| Os01g0296700, partial [Oryza sativa Japonica Group]
          Length = 522

 Score =  337 bits (863), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 202/527 (38%), Positives = 293/527 (55%), Gaps = 36/527 (6%)

Query: 298 VNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIF---EYQNYTKTHEDSAAGVLKA 353
           +NGVPAC    L  +  R +W   GYI SDCD+V  +    ++  YT     +AA  +KA
Sbjct: 1   INGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVEATAAA--MKA 58

Query: 354 GMDINCGTC-------MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK 406
           G+D++CG            +   A+ +GK++E  +D AL NL+   +RLG F+G P   +
Sbjct: 59  GLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFFDGIP---E 115

Query: 407 YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL--VNNISQ 464
              LG  DVCT EHK+LA DAARQG+VLLKND   LPL+   V+S+A+ G L  +N    
Sbjct: 116 LESLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQHINATDV 175

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
           M G Y G PC   +   G+   VS T     C    C++      A   AK  D  IVVA
Sbjct: 176 MLGDYRGKPCRVVTPYDGVRKVVSSTS-VHACDKGSCDT------AAAAAKTVDATIVVA 228

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           GL+++ E E  DR  LLLP  Q S + +VA  S  P++LV+   G +DVSFA+ + +I +
Sbjct: 229 GLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDNPKIGA 288

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP-ESFTKVPMNDMNMRADSSRQYPGR 643
           ++W GYPGE G  A+A+++FG +NPGGRLP+TWY  E  +K+PM  M +R D+   YPGR
Sbjct: 289 VVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAEHGYPGR 348

Query: 644 SYRFYTGTQV-YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT-----GSR 697
           +Y+FY G  V Y FGHGLSYTN++Y   +A + +T+    K G+ +   Q T      S 
Sbjct: 349 TYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTV----KVGAWEYCKQLTYKAGVSSP 404

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
                ++  +          ++V N G  DG+HVV ++   P    G P KQL+ F RV 
Sbjct: 405 PACPAVNVASHACQEEVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFRRVR 464

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLT 804
             A  + E++F ++ C+  +I  +    ++P G   ++VG+   SL+
Sbjct: 465 VAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLS 511


>gi|340368019|ref|XP_003382550.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 742

 Score =  336 bits (862), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 244/787 (31%), Positives = 364/787 (46%), Gaps = 115/787 (14%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLH 104
           +PF NTSLSI  R K ++  LTL+E ++Q++         A  IPRL I  Y+W +E L 
Sbjct: 24  FPFQNTSLSIEDRVKDIVDNLTLEELVEQMAHGGATLNGPAPGIPRLHINPYQWGTECLS 83

Query: 105 GIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG------ 158
           G  S G            TSFP  +  AASFN  L   + +A A E RA +         
Sbjct: 84  GNVSAG----------DATSFPMPIGMAASFNYDLLKRVTNATAYEVRAKHAAAVKDGSY 133

Query: 159 --QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
               GL+ W+P +NI RDPRWGR QET GEDP +       +V   QG N +        
Sbjct: 134 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAYVNGLQGNNSRY------- 186

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                               ++ +A CKH   +   +    SR+SF+A ++ +D   TF 
Sbjct: 187 --------------------IIANAGCKHFDVHGGPENIPTSRFSFDAKVSMRDWRMTFL 226

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
           P F++C+E G  S +MCSYN++NGVPAC    L     RNEW FKGY+ SD  A+  I  
Sbjct: 227 PQFKACVEAGALS-LMCSYNRINGVPACANKALLTDILRNEWDFKGYVVSDQGALEFIVI 285

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTC------MLRHTQSAIDKGKVQEKDIDRALLNLF 389
             +Y      +AA    AG  +  G        +  H   A+    V    +  A+  LF
Sbjct: 286 EHHYAPDFMKAAADAANAGTCLEDGNIGRKFFNVFEHLVDAVKNNLVSVDTLKNAVSRLF 345

Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND----KKFLPLN 445
            V+++LG F+  P    Y  +    + +  H  L+L AA + IVL+KND      FLP+ 
Sbjct: 346 YVRMKLGEFD-PPDNNPYANIPLSVIQSDAHINLSLQAAMESIVLMKNDDGFRSPFLPIT 404

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYT-----GIPCSPKSLLRGLEAYVSKTHYASGCHDVP 500
            N V    ++GP  ++   + G Y+         +  + L+         +YA GC D P
Sbjct: 405 -NEVKKACMVGPFSDDPEVLFGDYSPTLMRDYVITSLAGLKNANIGTDTLNYAVGCEDGP 463

Query: 501 CNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP 560
              +    +        + +IV AGL    E+E +D   + LPG Q+ L+      SK  
Sbjct: 464 ACRNYDSAKVRSACDGVELIIVTAGLSKHLESEGKDLSDINLPGHQLDLMQDAEAASKNA 523

Query: 561 -VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            VIL+L    PLD+ +A+ D +I  IL   YPG+   KA+A ++ G++NP GRLP TW P
Sbjct: 524 SVILILFNASPLDIRYAKTDPRIVGILEAYYPGQTAGKAIANVLTGEYNPSGRLPNTW-P 582

Query: 620 ESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
            S  +VP + +  M+         R+YR++T   +Y FG+GLSYT + Y      S L I
Sbjct: 583 ASLDQVPGITNYTMKE--------RTYRYFTQEPLYPFGYGLSYTTFHY------SNLNI 628

Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
           S++  A     I                         V + VTN G +DG+ V  ++   
Sbjct: 629 SSTATASGAGMIA------------------------VSVLVTNTGSMDGTEVTQVYVWC 664

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
             +S   P+ QL+G ++       + E+SF + P EQL +     + ++P+  + L V  
Sbjct: 665 -NISYA-PKLQLVGVNKDFISKGKTLEVSFSIKP-EQLQVWTDDDKWVIPVSTYDLSVSG 721

Query: 799 LRHSLTI 805
            + + T+
Sbjct: 722 QQPNQTV 728


>gi|374316077|ref|YP_005062505.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
           str. Grapes]
 gi|359351721|gb|AEV29495.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
           str. Grapes]
          Length = 701

 Score =  335 bits (860), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 217/638 (34%), Positives = 316/638 (49%), Gaps = 89/638 (13%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           +AK L++ ++L+E   QL   A AIPRLG+P Y WW+E+LHG A +G          + T
Sbjct: 9   QAKQLVAHMSLKEMFSQLLHEAPAIPRLGLPRYNWWNEALHGAARSG----------TAT 58

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDP 175
            FPQ +  AA F+      I + ++ E RA YN   A        GLT W+PN+NIFRDP
Sbjct: 59  VFPQAIGLAAMFDDVFLKEIATVISTEQRAKYNTFSALGDRGIYKGLTLWSPNVNIFRDP 118

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRGQET GEDP + S   V F++  QG+                            G 
Sbjct: 119 RWGRGQETYGEDPYLASQLGVSFIQGLQGD----------------------------GP 150

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L  +AC KH   + +       R+ FNA+++ +D  +T+ P F +C+++G+ + +M +Y
Sbjct: 151 YLKTAACVKH---FAVHSGPEPLRHDFNAIVSRKDLYETYLPAFEACVKEGEVNAVMGAY 207

Query: 296 NQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
           + VNG P C    L  D+    RN+WGF+G   SDC A+         TK   DS A  L
Sbjct: 208 SAVNGEPCCGSPFLITDIL---RNDWGFEGMYISDCWAIRDFHLNHAVTKNQVDSVALAL 264

Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
            AG D+NCG C     + A  +G +  K I +A + + + +  LGLF+ D     Y  +G
Sbjct: 265 NAGCDLNCG-CEYLSLEKAYQQGLIDRKTITQACIRVMTTRFALGLFSED---CTYSNIG 320

Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
            +   T EH+K+A  A+   +VLLKND   LPL+  ++ ++AIIGP  ++   + G Y G
Sbjct: 321 YEQNDTEEHRKVAFKASCNSLVLLKND-GMLPLDSRSLHAIAIIGPNADSREALWGNYHG 379

Query: 472 IPCSPKSLLRGLEAYVS---KTHYASGCH------DVPCNSDAGFHEAVRIAKKADFVIV 522
              +  ++L G    +    K  Y+ G        +     +    EA+ +A  +D +I+
Sbjct: 380 TSSTYTTVLEGFRKTLGESVKVKYSQGSAIQKEKLERLAEPNDRIAEAIAVATVSDTIIL 439

Query: 523 VAGLDLTQETEDR---------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
             G D T E E           D+  L LP  Q +L+ +VA T K P++LVL  GG +D 
Sbjct: 440 CLGYDETVEGEMHDDGNGGWAGDKQDLRLPPCQRALLKAVASTGK-PIVLVLLSGGAIDP 498

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
              E    + ++L   YPG+ G  A+A  I G  NP G LP+T+Y          D  M 
Sbjct: 499 EI-ERFPNVKALLQGWYPGQEGGLAIAHTILGLNNPSGHLPVTFYRSETVLPDFCDYRME 557

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS 671
                   GR+YR+     +Y FG GLSYT +SY  LS
Sbjct: 558 --------GRTYRYVQEKVLYPFGFGLSYTTFSYGNLS 587


>gi|413925165|gb|AFW65097.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 412

 Score =  334 bits (856), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 168/346 (48%), Positives = 210/346 (60%), Gaps = 25/346 (7%)

Query: 14  ILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLT 73
           +L  ++ L+V   ++    +      P F C        PFCNT L  + RA  L+S +T
Sbjct: 58  LLAILMHLLVPALLLMRVVAVVAAGAPPFSCGGGPSLGLPFCNTKLPAAQRAADLVSRMT 117

Query: 74  LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN-GTVSSVTSFPQVLVSA 132
             EK  QL D A+ +PRLG+P+Y+WW+E+LHG+A +G G++ + G V S TSFPQVL++A
Sbjct: 118 PAEKASQLGDVANGVPRLGVPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTA 177

Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
           ASFN +LW  IG A   EARA YN+GQA GLT W+PN+NIFRDPRWGRGQETPGEDP V 
Sbjct: 178 ASFNDNLWFRIGQATGKEARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVA 237

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S YA  FV+  QG +                              L+ SACCKH  AYDL
Sbjct: 238 SRYAAAFVRGLQGSS---------------------SNTKSVPPVLLTSACCKHATAYDL 276

Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
           E W   +RYSF A +T QD  DTF PPFRSC+  GKASC+MC+Y  VNGVP+C   DL  
Sbjct: 277 EDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKASCVMCAYTSVNGVPSCANADLLT 336

Query: 312 KA-RNEWGFKG-YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
           K  R  WG  G Y+ +DCDAV+ +   Q Y  T ED+ A  LKAGM
Sbjct: 337 KTFRGSWGLDGRYVAADCDAVSIMRNSQFYRPTAEDTVATTLKAGM 382


>gi|372209036|ref|ZP_09496838.1| glycoside hydrolase [Flavobacteriaceae bacterium S85]
          Length = 859

 Score =  333 bits (854), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 244/753 (32%), Positives = 366/753 (48%), Gaps = 100/753 (13%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           N + SI  R   L++ +TL+EKI         I RLGIP +EW+ E+LHGI S       
Sbjct: 27  NPNASIEDRVNDLLANMTLEEKISYCGSRIPEIKRLGIPYFEWYGEALHGIIS------- 79

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                + T FPQ +   A++N  L  ++ +A++ EARA+ N G+  +  ++P +N+ RDP
Sbjct: 80  ----WNCTQFPQNIAMGATWNPDLMFDVATAISNEARALKNAGKKEVMMFSPTVNMARDP 135

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR  E   EDP ++S  A  +V+  QG + K                           
Sbjct: 136 RWGRNGECYAEDPHLMSEMARMYVRGMQGNDPKY-------------------------- 169

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            +      KH +A ++E      R   ++ I ++D  + + P +++CI   +A+ IM + 
Sbjct: 170 -VKTVTTVKHYVANNVET----KREWIHSNIGKKDLYEYYFPAYKTCIVDEEATGIMTAL 224

Query: 296 NQVNGVPACLRGDLFQKA--RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           N +NG+P C   D       RNEWGFKGY+ +D  AV  + +   Y  +   +AA  +KA
Sbjct: 225 NGLNGIP-CSAHDWLVNGVLRNEWGFKGYVIADWAAVQGLEKRMKYASSQAQAAAMAIKA 283

Query: 354 GMDINC------GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           G+D  C         M++    A+ +G + EK++D  +  L  ++   G F+ DP    Y
Sbjct: 284 GVDQECFRNKVRQAPMVQALPDALQQGLITEKELDVTVKRLLRLRFMTGDFD-DPSLNPY 342

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             +    +    HK+LAL AA Q IVLLKND   LPL K+ + S+A+IGP  +      G
Sbjct: 343 SAIPTSVLECDAHKQLALKAAEQSIVLLKNDA-VLPLKKD-LKSIAMIGPFADRCWM--G 398

Query: 468 GYTGIPCSPKSLLRGLEAYV-SKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVIVVAG 525
            Y+G P S  S L G++AY  +K  +A GC       D     EAV +AKK++ VI+V G
Sbjct: 399 IYSGHPKSKVSPLDGIKAYTNAKVSFAQGCEVTAKEDDEQKIAEAVALAKKSEQVILVVG 458

Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
            D T  TE+ DR S+ LPG Q  L+ +V   +K  VILVL   GP  V++ + +  I  I
Sbjct: 459 NDETTSTENTDRKSIKLPGNQHQLIKAVQAVNKN-VILVLVPSGPTAVTWEQKN--IPGI 515

Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSY 645
           +     G+    ALA+++FGD NPGG+L  TWY         +D  M   +      R+Y
Sbjct: 516 VCAWPNGQEQGTALAKVLFGDVNPGGKLNATWYQSDKDLPNFHDYKMAGGN------RTY 569

Query: 646 RFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
            ++ G  +Y FG+GLSYTN++                   SD +I ++T    +YV    
Sbjct: 570 MYFKGKPLYPFGYGLSYTNFTI------------------SDVSINKKTLQANEYV---- 607

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
                     V+  V N G V G  VV ++ R  K  + TP K L GF R+   A  SK 
Sbjct: 608 ---------TVKAKVNNTGAVAGDEVVQVYIRDVKSKEKTPLKALKGFQRISVAAGASKW 658

Query: 766 ISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVG 797
           +   + P E  S  N K    ++  G   ++VG
Sbjct: 659 VEIKI-PYEAFSHYNTKKEALMVAKGEFEILVG 690


>gi|336463686|gb|EGO51926.1| hypothetical protein NEUTE1DRAFT_125528 [Neurospora tetrasperma
           FGSC 2508]
          Length = 788

 Score =  332 bits (851), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 248/791 (31%), Positives = 362/791 (45%), Gaps = 120/791 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSA 146
           R+G+P Y WWSE LHG+A + PGV FN T    S  TSF   +   ASF+  L   +G+A
Sbjct: 13  RIGLPKYAWWSEGLHGVAGS-PGVTFNTTGYPFSYATSFANAINLGASFDDDLVYEVGTA 71

Query: 147 VAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN 206
           ++ EARA  N G  GL +W PN+N ++DPRWGRG ETPGEDP+ +  Y            
Sbjct: 72  ISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYV----------- 120

Query: 207 WKSDDGGIGFGFREKRVLKGF-GEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
                         K +L G  G E+ R     + A CKH  AYDLE+W   +RY F A+
Sbjct: 121 --------------KAMLAGLEGNETVR----KVIATCKHYAAYDLERWHGLTRYEFEAI 162

Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQV-----------------NGVPACLRGD 308
           +T QD  + + PPF+ C    K   IMCSYN +                    PAC    
Sbjct: 163 VTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDMAGGNPDEIINLTTAQPACANTY 222

Query: 309 LFQKARNEWGF---KGYITSDCDAVATIF-EYQNYTKTHEDSAAGVLKAGMDINCGT--C 362
           L    R+ W +     YITSDC+A+     +  N+++T  ++AA   KAG D  C     
Sbjct: 223 LMTILRDHWNWTEHNNYITSDCNAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGS 282

Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN------------GDPRKGKYGKL 410
            L     A ++  + E  ID AL  L+   +R G  +            G      Y  L
Sbjct: 283 PLTDVVGAYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDAL 342

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
             +DV T   ++LAL +A +GIVLLKN    LPL+ ++   +A+IG   N    M G Y+
Sbjct: 343 NWEDVNTPSTQELALRSATEGIVLLKNSGSLLPLDFSSGKKVALIGHWANATGTMRGPYS 402

Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
           GIP    + L   +       YA+G      + D     A+  A+ AD V+   G D T 
Sbjct: 403 GIPPFYHNPLYAAQQLNLSFSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTV 462

Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
            +ED DR S+  P  QM L++ +A   K  V++ L  G  +D SF   +  +SSILW+GY
Sbjct: 463 ASEDLDRESIAWPKAQMKLLSELAGLGKPLVVIQL--GDQVDDSFLLENGNVSSILWVGY 520

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMR---------------- 633
           PG++G  A+ +++ G   P GRLP+T YPE +  +VP+ +M +R                
Sbjct: 521 PGQSGGTAVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNHSSSTSSSSNPEE 580

Query: 634 ------------------ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
                              + +   PGR+Y++Y+   V  FG+GL YT ++     + + 
Sbjct: 581 EVSVQGSGSLTIQPRSTPGNKTLSSPGRTYKWYS-NPVLPFGYGLHYTTFNVSLSLSSNA 639

Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
            + S S    S       T       H+D      S    + IS+TN G     +V +LF
Sbjct: 640 SSPSPSFSIPSLLTPCTAT-------HLDLCPFSPSANSALSISITNTGTHTSDYVALLF 692

Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
                  +  P K L+ + RV  +  G + ++    P    +I+   G      GN VL 
Sbjct: 693 LSGEFGPKPYPLKTLVSYKRVKDIKPG-ETVTVKDVPVSLGAISRVDGD-----GNTVLY 746

Query: 796 VGELRHSLTIE 806
            G  R ++ ++
Sbjct: 747 PGTYRFAVDVD 757


>gi|348684872|gb|EGZ24687.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 805

 Score =  331 bits (848), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 247/757 (32%), Positives = 358/757 (47%), Gaps = 105/757 (13%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASA---IPRLGIPAYEWWS 100
           C       +PFC+ SLS S R + L+  L L EK+  L+  AS    +  +G+P Y W +
Sbjct: 26  CSDTEHQKFPFCDASLSTSERVEDLLRRLPLDEKVTLLTARASPKGNMSSIGLPEYNWGA 85

Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG-- 158
             +HG+ S   G N        TSFP  +   A F+     ++   +  E RA++  G  
Sbjct: 86  NCVHGVQSTC-GTN------CATSFPNPVNLGAIFDPQAVFDMAQVIGWELRALWLEGAR 138

Query: 159 -------QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
                    GL  W+PNINI RDPRWGR  ETP EDP+V S Y V + +  Q       +
Sbjct: 139 ENYAAGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTRGLQ-------E 191

Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
           G      ++KR L+                  KH  AY  E +    R +FNA ++  D 
Sbjct: 192 G------KDKRFLQAV-------------VTLKHYAAYSYEHYDGIDRMAFNAQVSRYDF 232

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAV 330
            DT+ P F + + +GKA  +MCSYN VNG+P C    L  K  R   GF GYITSD  A+
Sbjct: 233 ADTYLPAFHASVVEGKAKGVMCSYNSVNGMPMCANEQLNTKLLREALGFDGYITSDSGAI 292

Query: 331 ATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFS 390
             I+  ++YTK+  ++    + +G D+N G+   +     +  G++ EK +D A+     
Sbjct: 293 EGIYRQRHYTKSLCEAGRLAIMSGTDVNSGSVYKKCLADLVTSGQLPEKAVDDAMRRTLK 352

Query: 391 VQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
           ++  LGLF  DP   + Y  + P +V  +E K+L+L+  R+ IVLL+N    LPL K   
Sbjct: 353 LRFELGLF--DPIDDQPYWHVAPSEVGKTESKQLSLELTRKSIVLLQNHGNVLPLRKG-- 408

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG-----LEAYV-----SKTHYA--SGCH 497
             LA+IGP       + G Y G  C    L  G     LEA       S T YA  SG +
Sbjct: 409 KKLAVIGPHAKAKRALLGNYLGQMCHGDYLEVGCVQTPLEAITAANGASNTVYAKGSGIN 468

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
           D    S A F  A   A+ AD V++  G+D + E E  DR ++ +P  QM L+  V R  
Sbjct: 469 DT---STADFDAAEAAARGADAVVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAG 525

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           K P ++VL  GG   V   E       +    YPG  GA+A+++I+FGD  P G+LP+T 
Sbjct: 526 K-PTVVVLFNGGV--VGAEELILHTDGVAEAFYPGFFGAQAVSDILFGDAIPSGKLPVTM 582

Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
           YP ++    +N ++M++ S  +YPGRSYR+Y    V+ FG GLSYT ++           
Sbjct: 583 YPSNY----INSVDMKSMSMTKYPGRSYRYYKEVPVFPFGWGLSYTKFT----------- 627

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
                        L   G   D    D +     L   V + V+N GD+ G  VV  F R
Sbjct: 628 -------------LALDGEMPD----DPIVITRDLDQTVTVIVSNDGDLVGDEVVFAFFR 670

Query: 738 VPKVS----QGTPEKQLIGFDRVHTVAKGSKEISFGV 770
              V+         +QL  + RV       ++++F +
Sbjct: 671 PLNVNATGDAALLNEQLFDYRRVSLRPTQYRKLTFRI 707


>gi|157676888|emb|CAP07659.1| beta-xylosidase [uncultured rumen bacterium]
          Length = 761

 Score =  330 bits (847), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 244/823 (29%), Positives = 368/823 (44%), Gaps = 169/823 (20%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C P       + + S     RAK+L+  L+L+EK   +  N+ A+ RLGI AY WWSE+L
Sbjct: 24  CAPQEIS---YTDKSQPAELRAKALLPKLSLEEKAGLVQYNSPAVERLGIKAYNWWSEAL 80

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---- 159
           HG+A NG          S T FPQ +  AASF+      + +AV+ EAR    +      
Sbjct: 81  HGVARNG----------SATVFPQPIGMAASFDVEKIETVFTAVSDEARVKNRIAAEDGR 130

Query: 160 ----AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
               AGL+FW PNINIFRDPRWGRG ET GEDP ++    +  V+  QG+          
Sbjct: 131 VYQYAGLSFWTPNINIFRDPRWGRGMETYGEDPYLMGQLGMAVVRGLQGD---------- 180

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                              D L   AC KH   Y +      +R+ F+A ++E+D  +T+
Sbjct: 181 ----------------PDADVLKTHACAKH---YAVHSGLESNRHRFDAQVSERDLRETY 221

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF 334
            P F+  + +     +M +YN+  G P      L QK  R EWG+KG + SDC A+   F
Sbjct: 222 LPAFKDLVTKAGVKEVMTAYNRFRGYPCAASEYLVQKILREEWGYKGLVVSDCWAIPDFF 281

Query: 335 E--YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQ 392
           E     +  T E++AA  +  G+D+ CG+   +   +AID+G ++E+D+DR LL + + +
Sbjct: 282 EPGRHGFVATGEEAAALAVANGLDVECGSTFSK-IPAAIDQGLLKEEDLDRNLLRVLTER 340

Query: 393 LRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
            RLG  +G+     +  L P  V   EH+ L+LD AR+ +VLL+N+   LPL   A   +
Sbjct: 341 FRLGEMDGE---SPWDDLDPAIVEGPEHRALSLDIARETMVLLRNN-GVLPL--KAGEKI 394

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN---------- 502
           A+IGP  ++     G Y  +P S  +LL+ ++A V    Y   C  +             
Sbjct: 395 ALIGPNADDAQMQWGNYNPVPKSTITLLQAMQARVPGLVYDRACGILDAEYAPQGSAYAN 454

Query: 503 ----SDAGFHEAVR-------------------------------IAKK---ADFVIVVA 524
               S+A    A R                               + KK    D V+   
Sbjct: 455 LIGASEAQLEAAARRYAVSVNDIKNYIRRDEEQRRSFMPALDEAAVLKKLEGVDVVVFAG 514

Query: 525 GLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           G+    E E+           DR  + LPG Q  L+ ++    K+ V++  +G     + 
Sbjct: 515 GISPRLEGEEMRVQVPGFSGGDRTDIELPGVQRRLLKALHDAGKKVVLVNFSGCA---IG 571

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
                    +IL   YPG+ G  A+A+++FGD NP G+LP+T+Y        + D NM  
Sbjct: 572 LVPETESCDAILQAWYPGQEGGTAIADVLFGDVNPSGKLPVTFYKNVDQLPDVEDYNME- 630

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
                  G +YR++ G  +Y FG+GLSYT++++       E  +         KN+    
Sbjct: 631 -------GHTYRYFRGEPLYPFGYGLSYTSFAF------GEPKVKG-------KNL---- 666

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
                                 +I VTN G V G+ VV L+ R P  + G P K L  F 
Sbjct: 667 ----------------------EIDVTNTGSVAGTEVVQLYVRKPDDTAG-PVKTLRAFR 703

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           RV   A  + ++S  +D    L  + K    +   G + L+ G
Sbjct: 704 RVSVPAGQTVKVSIPLDKETFLWWSEKDQDMVPVRGRYELLCG 746


>gi|164428543|ref|XP_964543.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
 gi|157072187|gb|EAA35307.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
          Length = 786

 Score =  330 bits (845), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 249/787 (31%), Positives = 367/787 (46%), Gaps = 114/787 (14%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSA 146
           RLG+P Y WWSE LHG+A + PGV FN T    S  TSF   +   ASF+  L   +G+A
Sbjct: 13  RLGLPKYAWWSEGLHGVAGS-PGVKFNTTGYPFSYATSFANAINLGASFDDDLVYEVGTA 71

Query: 147 VAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN 206
           ++ EARA  N G  GL +W PN+N ++DPRWGRG ETPGEDP+ +  Y            
Sbjct: 72  ISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYV----------- 120

Query: 207 WKSDDGGIGFGFREKRVLKGF-GEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
                         K +L G  G E+ R     + A CKH  AYDLE+W   +RY F A+
Sbjct: 121 --------------KAILAGLEGNETVR----KVIATCKHYAAYDLERWHGLTRYEFEAI 162

Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQV-----------------NGVPACLRGD 308
           +T QD  + + PPF+ C    K   IMCSYN +                    PAC +  
Sbjct: 163 VTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDMASGKPDEEINLTTAQPACAKPY 222

Query: 309 LFQKARNEWGF---KGYITSDCDAVATIF-EYQNYTKTHEDSAAGVLKAGMDINCGT--C 362
           L    R+ W +     YITSDC+A+     +  N+++T  ++AA   KAG D  C     
Sbjct: 223 LMTILRDHWNWTEHNNYITSDCNAILDFLPDNHNFSQTPAEAAAAAYKAGTDTVCEVSGS 282

Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN------------GDPRKGKYGKL 410
            L     A ++  + E  ID AL  L+   +R G  +            G      Y  L
Sbjct: 283 PLTDVVGAYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDAL 342

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
             +DV T   ++LAL +A +GIVLLKN    LPL+ +    +A+IG   N    M G Y+
Sbjct: 343 NWEDVNTPSTQELALRSATEGIVLLKNAGSLLPLDFSG-KKVALIGHWANATGTMRGPYS 401

Query: 471 GIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
           GIP    + L   +       YA+G      + D     A+  A+ AD V+   G D T 
Sbjct: 402 GIPPFYHNPLYAAQQLNLSFSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTV 461

Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
            +ED DR S+  P  QM L++ +A   K  V++ L  G  +D S    +  +SSILW+GY
Sbjct: 462 ASEDLDRESIAWPETQMQLLSELAGLGKPLVVIQL--GDQVDDSSLLNNGNVSSILWVGY 519

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMR---------------- 633
           PG++G  A+ +++ G   P GRLP+T YPE +  +VP+ +M +R                
Sbjct: 520 PGQSGGTAVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNYSSSSNLEQEVSV 579

Query: 634 --------------ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
                          + +   PGR+Y++Y+ + V  FG+GL YT ++     + S  + S
Sbjct: 580 QGRGSLTIQPRSTPGNKTLSSPGRTYKWYS-SPVLPFGYGLHYTTFNVSLSLSSSNASSS 638

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
           +S  + S  ++L    +     H+D      S    + +S+TN G     +VV+LF    
Sbjct: 639 SSSPSFSIPSLLTPCTA----THLDLCPFSPSANSALSVSITNTGTHTSDYVVLLFLSGE 694

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
              +  P K L+ + RV  +  G + ++    P    +I+   G      GN VL  G  
Sbjct: 695 FGPKPYPLKTLVSYKRVKDIKPG-ETVTVKDVPVSLGAISRVDGD-----GNTVLYPGTY 748

Query: 800 RHSLTIE 806
           R  + ++
Sbjct: 749 RFVVDVD 755


>gi|390956994|ref|YP_006420751.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
 gi|390411912|gb|AFL87416.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
          Length = 742

 Score =  328 bits (842), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 243/779 (31%), Positives = 365/779 (46%), Gaps = 115/779 (14%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + S  I  R   LI   TLQEK  QL+     +PRLG+P +  W+++LHG+ S  P  
Sbjct: 38  YRDMSRPIEDRITDLIKRFTLQEKAMQLNHTNRGVPRLGLPMWGGWNQTLHGVWSKQP-- 95

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG------LTFWAP 167
                    T FP      A+++  L   +  A++ EARA+YN    G      L + +P
Sbjct: 96  --------TTLFPIPTAMGATWDPELVHTVADAMSDEARALYNAHAEGPRTPHGLVYRSP 147

Query: 168 NINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGF 227
            INI RDPRWGR QE   EDP++     V +V+  QG++ +                   
Sbjct: 148 VINISRDPRWGRIQEVFSEDPLLTGRMGVAYVRGLQGDDLQ------------------- 188

Query: 228 GEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGK 287
                    L L+A  KH    ++E      R   NA + E++  + + P +R+ I +  
Sbjct: 189 --------HLKLAATVKHFAVNNVES----GRQHLNADVDERNLFEFWLPHWRAAIMEAH 236

Query: 288 ASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVA---------TIFEYQ 337
           A  +M SYN +NG+P  +   L     R +WGF G++T D  AVA         T    Q
Sbjct: 237 AQSVMSSYNAINGMPDAVNHWLLTDVLRKKWGFDGFVTDDLGAVALLSGTRATNTSEPGQ 296

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           ++++    +AA  ++AG D +       +   A+ +G + EKD+D AL N+  V  RLG 
Sbjct: 297 HFSEDPVVAAAAAIRAGNDSD-DVEFETNLPLAVQRGLLTEKDVDGALRNVLRVGFRLGA 355

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           ++  P+  KY ++G D V +  H+ L+   A + + LL N ++FLPL ++ V S+A+IGP
Sbjct: 356 YD-PPQASKYSRIGMDVVRSQAHRDLSQRVAEESMTLLLNRRQFLPLQRDQVKSVAVIGP 414

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIA 514
                 +  G Y G P    S+  GL A +    K  Y  G   V    D     A  +A
Sbjct: 415 AGGEAYET-GNYYGTPAVKTSVTEGLRALLGSGVKVEYEKGAGYVDLADDKEIERAANLA 473

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           +K+D V++  G +L  E E RDR  L LPG Q  L+ +V   + + V LVL   GPL V+
Sbjct: 474 RKSDVVVLCLGTNLQVEAEGRDRRDLNLPGAQQRLLEAVYAANPK-VALVLMNAGPLGVT 532

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA 634
           +A     + +IL   YPGE G  A+A  +FG  NPGG LP T Y       P N+     
Sbjct: 533 WAH--DHVPAILSAWYPGELGGAAIARTLFGLNNPGGHLPYTVYANLDGVPPQNEY---- 586

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           D SR Y   +Y+++ G  +Y FGHGLSYT++ Y      S+L ++ +  +G   N+    
Sbjct: 587 DVSRGY---TYQYFKGVPLYPFGHGLSYTHFDY------SKLKVTQT--SGDHANVT--- 632

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
                                V  ++TN G   G+ V  L++   K S+  P + L GF+
Sbjct: 633 ---------------------VSFTLTNTGQSAGAEVTQLYSHQVKSSEVQPLRTLRGFE 671

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANK--HGRRILPLGNHVLMVG------ELRHSLTI 805
           RV      SK ++  + P   L   +   H  R+ P G    MVG       LR ++TI
Sbjct: 672 RVTLQPGESKAVAISI-PTSALGWYDTAVHNFRVEP-GAFNFMVGSSSEDIRLRDTVTI 728


>gi|167537541|ref|XP_001750439.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771117|gb|EDQ84789.1| predicted protein [Monosiga brevicollis MX1]
          Length = 834

 Score =  327 bits (838), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 231/764 (30%), Positives = 355/764 (46%), Gaps = 94/764 (12%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASA-IPRLGIPAYEWWSESLHGIASN 109
           SYPFC+T LS+  R K L+S ++  +   QL    SA I  +G+PAY W + ++HG+ + 
Sbjct: 106 SYPFCDTKLSVDDRLKDLVSRVSTADAATQLRARESAQIDNIGLPAYYWGTNAIHGMQNT 165

Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG-QAGLTFWAPN 168
               +        TSFP     +A+FN SL  ++G  +  E RA YN     GL  W+P 
Sbjct: 166 ACLADGQ----CPTSFPAPNGLSATFNYSLVKDMGRIIGRELRAYYNTKFHNGLDTWSPT 221

Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
           IN  RDPRWGR  E+PGE P V   Y   + +  Q                         
Sbjct: 222 INPSRDPRWGRNVESPGESPFVCGQYGAAYTEGLQ------------------------- 256

Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
              D  D        KH +AY +E + N +RY +NA+++E D  DT+ P +   ++  K 
Sbjct: 257 -NGDDKDYTQAVVTLKHWVAYSVEDYDNVTRYEYNAIVSEYDLMDTYFPGWEYVVKNAKP 315

Query: 289 SCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
             +MCSYN +NGVP C    L    R +WGF+GYITSD D++  I+   +Y      +  
Sbjct: 316 LGVMCSYNSLNGVPTCGNPALTAYLREDWGFEGYITSDSDSIHCIWADHHYESNAVLATR 375

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR-KGKY 407
             L  G DI+ G     + ++A+++  V    +D AL N + ++  LGLF  DP     Y
Sbjct: 376 DGLLGGCDIDSGDTYADNLEAAVNQSLVNRSAVDAALTNSYRMRFNLGLF--DPNVTNAY 433

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            ++  D+V  S  ++ +L AAR+ + LLKND + LP        +A+IG   N+   + G
Sbjct: 434 DRISADEVGMSSSQETSLLAARKSMTLLKNDGQTLPFATG--KKVAVIGKSSNSAEDILG 491

Query: 468 GYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSD-AGFHEAVRIAKKADFVIVVAGL 526
            Y G P  P      ++        A+       + D A  + A+++A  AD V++    
Sbjct: 492 NYVG-PICPSGAFDCVQTLYQGVAAANQGGATTLSDDVADINTAIQLAMDADQVVLTIS- 549

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
           +  Q  E +DR  + L   Q  LV +V +  K P  +V+  GG + + + + ++Q  +IL
Sbjct: 550 NYGQAGEGKDRTYIGLDTDQQELVAAVLKVGK-PTAIVMLNGGLISLDWIKDEAQ--AIL 606

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK------------VPMNDMNMRA 634
               PG  G +A+AE IFG  NPGG+LP+T Y   +                ++ MN+  
Sbjct: 607 VAFAPGVHGGQAVAETIFGANNPGGKLPVTMYASDYVNDVDFLNMSMQAVAVLHLMNVNG 666

Query: 635 DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
           +     PGRSY++YTG  +Y F +GLSYT ++  +  AP   T +++L++ +    +  T
Sbjct: 667 ERDDTGPGRSYKYYTGEPLYPFAYGLSYTTFNLSWSPAPPMTTFTSTLRSTTYTATVTNT 726

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK--------VSQGTP 746
           GS                             V G  VV  F + PK        V    P
Sbjct: 727 GS-----------------------------VGGDEVVFAFYK-PKSESLKTLPVGNPVP 756

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLG 790
            K++ GF RV      S +++F ++  E L+     G R L  G
Sbjct: 757 IKEIFGFQRVALGPGQSTQVTFELN-AETLAQVTLDGHRELHSG 799


>gi|348688508|gb|EGZ28322.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 701

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 231/653 (35%), Positives = 333/653 (50%), Gaps = 108/653 (16%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPR-----LGIPAYEWWSESLHGIAS 108
           FCNTSL +S R + L++ L L EK   L+  AS  PR     +G+P Y W +  +HG+ S
Sbjct: 34  FCNTSLPVSARVEDLLARLPLDEKAILLTARAS--PRGNMSSIGLPEYNWGANCVHGVRS 91

Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPN 168
              G N        TSFP  +                          NL          +
Sbjct: 92  TC-GTN------CPTSFPNPV--------------------------NL----------S 108

Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
           I+  RDPRWGR  ETP EDP+V S Y V + K  Q                         
Sbjct: 109 IHRRRDPRWGRNTETPSEDPLVNSKYGVAYTKGLQ------------------------- 143

Query: 229 EESDRGDELMLSACC--KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
            E    D   L A    KH +AY  E +G  +R +FNA+++  D  DT+ P FRS I  G
Sbjct: 144 -EGKHEDPRYLQAVVTLKHYVAYSYENYGGGNRKTFNAIVSPYDFADTYFPAFRSSIVDG 202

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
            A  +MCSYN VNGVPAC   +L  K  R   GF GYITSD  A+  I ++ +Y  T  +
Sbjct: 203 NAKGVMCSYNSVNGVPACANNELENKLLRGMLGFDGYITSDSGAIEAISDWLHYVPTRCE 262

Query: 346 SAAGVLKAGMDINCGTCM--LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
           +A   + AG D+N G     +   +  ++  ++  K +D  L +   ++  LGLF  DP 
Sbjct: 263 AARLAILAGTDVNSGRGFGYMACLKELVESNQLDVKVVDDVLRHTLKLRFELGLF--DPI 320

Query: 404 KGK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP----- 457
           + + Y K+ P+DV T   KKL+LD AR+ IVLL+N++  LPL +     LA++GP     
Sbjct: 321 EDQPYWKVTPNDVNTDAAKKLSLDLARKSIVLLQNNQPVLPLRRGV--KLAVVGPHAQAK 378

Query: 458 ---LVNNISQM-GGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAV 511
              L N + QM  G Y  + C  K+    + A    S T YA GC +V  NS AGF EAV
Sbjct: 379 RALLGNYLGQMCHGDYNEVGCI-KTPFEAVSASNGDSSTTYALGC-NVTGNSTAGFVEAV 436

Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
           +  + A+ V++  G+D + E E RDR ++ LP  Q+ L+  V R   +P ++VL  GG L
Sbjct: 437 KAVQGAEAVVLFLGIDKSVEAEVRDRNNIDLPAIQVQLLQRV-RAVGKPTVVVLMNGGVL 495

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
                +   Q  +++   YPG  GA+A+ +I+FGD NPGG+LP+T Y   +    +N ++
Sbjct: 496 TAE--DIIGQTDALVEAFYPGFFGAQAMTDILFGDANPGGKLPVTMYRSDY----VNTVD 549

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF--LSAPSELTISASL 682
           M++ +   YPGRSYR++ G  V+ FG GLSYT++S K    +A +  ++SA++
Sbjct: 550 MKSMNVTAYPGRSYRYFKGEPVFPFGWGLSYTSFSLKADDATATTAKSVSATM 602


>gi|443692971|gb|ELT94448.1| hypothetical protein CAPTEDRAFT_221920 [Capitella teleta]
          Length = 757

 Score =  326 bits (835), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 244/730 (33%), Positives = 353/730 (48%), Gaps = 110/730 (15%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLS-----DNASAIPRLGIPAYEWWSESLHGI 106
           +PF + SLS   RA  L++ LTL+E   Q        +  AI RLGI  Y W +E L G 
Sbjct: 20  FPFQDPSLSWDDRADDLVARLTLEEIAPQTQASYGGQHTPAIERLGIKPYVWITECLAG- 78

Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA------ 160
                 VN N T     ++PQ +  AASF+  L  N+   ++ E RA +N  +A      
Sbjct: 79  -----QVNTNAT-----AYPQPIGMAASFSEELLFNVSRDISYEVRAHWNANRAVGKYST 128

Query: 161 --GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGF 218
             GL+ ++P INI R P WGR QET GEDP++    A  FV+  QG+             
Sbjct: 129 KVGLSCFSPVINIMRHPLWGRNQETYGEDPLLSGTLAQSFVRGLQGD------------- 175

Query: 219 REKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPP 278
            + R L+              +A CKH   +   +    SR+SF+A +  +D   TF P 
Sbjct: 176 -DPRYLRA-------------NAGCKHFDVHGGPEDIPVSRFSFDAKVNMRDWRMTFLPQ 221

Query: 279 FRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQ 337
           F+ C++ G  S +MCSYN++NG+PAC    L     R+EWGF GYI SD  A++ I E  
Sbjct: 222 FKMCVDAGSYS-LMCSYNRINGIPACANKQLLTDITRDEWGFHGYIVSDSGAISNIKEQH 280

Query: 338 NYTKTHEDSAAGVLKAG--MDINCGTCMLRHTQ-SAIDKGKVQEKDIDRALLNLFSVQLR 394
           +YT +   +    +KAG  +++  G+ M    Q  A+ +G + EK+I   +  L   +LR
Sbjct: 281 HYTNSTVATVVAAIKAGTNLELGGGSNMYYPKQLDAMKQGLLTEKEIRDNVRPLLYTRLR 340

Query: 395 LGLFNGDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
           LG F  DP     Y K+G D + + EH++ A+ AA  G VLLKN    LP+ K   S LA
Sbjct: 341 LGEF--DPEAMVDYNKIGVDVIQSPEHREQAVKAAYMGFVLLKNHNNLLPIKKQ-YSKLA 397

Query: 454 IIGPLVNNISQMGGGYTGIPCSP--KSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAV 511
           I+GP  N  S++ G Y+         ++  GL      T  A+GC +  C+         
Sbjct: 398 IVGPFTNATSELFGTYSSEVNLKFTSTIFEGLSPLGGSTRSANGCTNSACSGYVRDDVET 457

Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT-SVARTSKRPVILVLTGGGP 570
            +A  AD VIV  G     E+E  DR  L L G Q+ ++  +V  ++  PVILVL   GP
Sbjct: 458 AVA-GADLVIVALGSGQRFESEGNDRAYLDLHGHQLDILKDAVFFSNGAPVILVLINAGP 516

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIF---GDFNPGGRLPMTWYPESFTKVP- 626
           LD+++A+ D  +++IL  GYP ++  +AL   +        P GRL  TW P +  +VP 
Sbjct: 517 LDITWAKLDPGVTAILSCGYPAQSTGEALRRSLTMSEPQAAPAGRLQATW-PLNLDQVPK 575

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAG 685
           + D  M+        GR+YR+Y G  +Y FG GLSYT++SY  LS +PS +T        
Sbjct: 576 ITDYTMQ--------GRTYRYYVGEPLYPFGFGLSYTSFSYTRLSISPSVIT-------- 619

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
                             D VT        V++ + N G  D   VV ++   P+     
Sbjct: 620 ----------------QGDNVT--------VEVCLKNTGSYDSDEVVQVYMSWPQTPFPL 655

Query: 746 PEKQLIGFDR 755
           P+  L  F R
Sbjct: 656 PKWTLAAFAR 665


>gi|198425898|ref|XP_002119549.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 754

 Score =  325 bits (834), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 247/770 (32%), Positives = 365/770 (47%), Gaps = 126/770 (16%)

Query: 50  DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLS-----DN--ASAIPRLGIPAYEWWSES 102
           + +PF N SL I  R + L++ LT++E I QLS     DN  A AI RLGI  Y+W +E 
Sbjct: 23  EEFPFRNFSLPIEERLEDLVNRLTIEEVILQLSRGGVRDNGPAPAITRLGIGPYQWNTEC 82

Query: 103 LHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN------ 156
           L G A NG            T FPQ +  AA+F++ L   +   VA+EARA +N      
Sbjct: 83  LRGYAMNG----------DATCFPQPIGLAATFDQGLIYKMAKTVALEARAKHNNFTKNG 132

Query: 157 -LG-QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
             G   GL+ ++P INI R P WGR QET GEDP++ S  A  +V   QG          
Sbjct: 133 NFGDHTGLSCFSPVINILRHPLWGRNQETYGEDPVLTSLMARAYVTGLQG---------- 182

Query: 215 GFGFREKRVLKGFGEESDRGDELML--SACCKHLIAYDLEKWGNFSRYSFNAMITEQDTE 272
                               DE+ L  +A CKH +AY   +    +R+SF+A +++ D  
Sbjct: 183 --------------------DEIYLPATAVCKHFVAYGGPENIPTTRFSFSANVSDHDIG 222

Query: 273 DTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVAT 332
            TF P FR C+  G A  +MCSYN +NGVP+C    L    R ++ F GY+ SD +A+  
Sbjct: 223 TTFYPAFRECVHAG-AQGVMCSYNAINGVPSCANPMLETTLRKKFHFDGYVVSDENALEN 281

Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDINC----GTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           I  Y N+TK+  ++AA  L AG+D+       T        A+++G V E  + R+   L
Sbjct: 282 IDLYFNFTKSKLETAAVALNAGVDLELTGFGKTNRYSLLNQAVEQGLVTEAALRRSAKRL 341

Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           F  ++ LG F  DP +  +    P DV  S  H+K A++ A +  VLLKND   LPL K 
Sbjct: 342 FRTRMALGEF--DPPEFNHWLNVPIDVVQSLAHRKQAVEVAAKSFVLLKND-GILPL-KQ 397

Query: 448 AVSSLAIIGPLVNNISQMGGGYTG-----IPCSPKSLLRGLEAYVSKTHYASGC-----H 497
               ++I+GP +NN   + G Y          SP      L +      + +GC      
Sbjct: 398 LYDKVSIVGPFINNSEALTGDYPAEFNLKYFSSPLFAANSLSS-SGVARFTTGCVGTNNQ 456

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTS 557
           ++P  +         +   +D V+V  G     E E  DR  + LPG+Q+ L+  V + +
Sbjct: 457 NLPICATYNSTNVKEVVTGSDIVLVTLGTGRGVEAESNDRRDINLPGKQLQLIQDVVKYA 516

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
             PVI+VL   GPLDVS+   ++  ++++   +  +   +A+ E++ G  NP GRLP TW
Sbjct: 517 NGPVIVVLFNAGPLDVSWVMGNT--AAVIACHFSAQMTGEAMLEVLTGVVNPAGRLPNTW 574

Query: 618 YPESFTKV-PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
            P S  +V PM D +M          R+YR+ T + ++ FG+GLSYT + Y         
Sbjct: 575 -PASMEQVPPMTDYSMHE--------RTYRYSTSSPLFPFGYGLSYTKFWY--------- 616

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                L A  +   +Q+    +                 V++ + N G +DG  VV ++ 
Sbjct: 617 -----LDAVVEPTTIQRCQIPV-----------------VRVLIQNTGHLDGEEVVQIYM 654

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLS--IANKHGR 784
              K       +QL+ F RV    K  +E+S  + P   LS  I  KH R
Sbjct: 655 TSKKKRDRELLRQLVAFQRVPI--KAGEEVSISL-PIPHLSRAIWMKHNR 701


>gi|365118446|ref|ZP_09337032.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363649697|gb|EHL88801.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1283

 Score =  323 bits (829), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 247/764 (32%), Positives = 366/764 (47%), Gaps = 121/764 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + N ++ I  R   L+  LTL+EK+ QLSD+  +  I RL IPA    +E LHG      
Sbjct: 72  YLNPNIPIEERIDDLLPRLTLEEKVIQLSDSWGSKGIARLKIPAM-LKTEGLHG------ 124

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   +  T FP  +   ++F+  L   +G A A+EA+A  NL  +    W+P +++
Sbjct: 125 ----QSYATGSTIFPHGINMGSTFDTELIQEVGKATAIEAKAA-NLRVS----WSPVLDV 175

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            RD RWGR +ET GEDP +V    V ++K FQGE+                         
Sbjct: 176 ARDARWGRVEETYGEDPYLVGRIGVAWIKGFQGEH------------------------- 210

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                  + AC KH   +     G   R S +  ++++   +    PFR  I++  A  +
Sbjct: 211 -------MFACPKHFAGHGQPVGG---RDSHDYGLSDRVMRNIHLAPFRDVIKEANAFGV 260

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
           M +Y   NGVP     +L QK  R EWGF+G++ SDC     I   Q+   T E++AA  
Sbjct: 261 MAAYGLWNGVPDNGSKELLQKILREEWGFEGFVVSDCSGPENIQRKQSVVGTMEEAAAMA 320

Query: 351 LKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGK 409
           ++AG+DI CG+   +   SA+ KG ++E ++D  L  +F  ++RLGLF+    +   + K
Sbjct: 321 VRAGVDIECGSAYKKALASAVKKGIIKESELDANLRRVFRAKMRLGLFDRPSIENMVWNK 380

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           L   D  T EH+ LA   A +  VLLKN+   LPL+KN + ++A+IGP  N      G Y
Sbjct: 381 LPEYD--TPEHRALARKVAVKSTVLLKNENNLLPLDKN-IKTIAVIGP--NADQGQTGDY 435

Query: 470 TG--IPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
           +    P    S+L G++ +VS   K  YA GC  +  ++  GF EAV IAK+AD VI+V 
Sbjct: 436 SAKYAPGQIISVLEGVKNHVSPSTKVLYAQGCTQLDMDT-TGFAEAVNIAKQADAVILVV 494

Query: 525 GLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           G +  +           E+ D  +L +PG Q  L+ +V  T K PV+LVL  G P  +++
Sbjct: 495 GDNSNRHENGNKKSTTGENVDGATLEIPGVQRQLIKAVEATGK-PVVLVLVNGKPFTLTW 553

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRAD 635
              D  I SIL   YPGE G  A A+IIFGD NP GRLP++ +P    ++P+        
Sbjct: 554 --EDENIESILETWYPGEEGGNATADIIFGDENPSGRLPIS-FPRHPGQLPLW------- 603

Query: 636 SSRQYPGRSYRFYTG--TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            + +  GR+Y +Y    T +Y FGHGLSYT + Y          + A+ K+G    +   
Sbjct: 604 YNYETSGRNYDYYDMPFTPLYRFGHGLSYTTFRYS--------NLKATTKSGDPGFVT-- 653

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                                 V + + N G   G  V  L+      S  T    L GF
Sbjct: 654 ----------------------VSVDIENTGKRPGEEVAQLYITDLVASVNTAVIDLKGF 691

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            RV       K ++F ++P   LS+ N   +R+L  G   + VG
Sbjct: 692 KRVFLKPGEKKTVTFELNPY-LLSLLNPDMKRVLEAGKFRMHVG 734


>gi|390630430|ref|ZP_10258413.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
 gi|390484359|emb|CCF30761.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
          Length = 674

 Score =  320 bits (820), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 217/736 (29%), Positives = 350/736 (47%), Gaps = 121/736 (16%)

Query: 91  LGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVE 150
           + IP Y +W+E+LHG+A  G            T FPQ +  AA+F+  L + I   +  E
Sbjct: 1   MNIPEYNYWNEALHGVARAG----------VATVFPQAIGLAATFDDHLINEIADVIGTE 50

Query: 151 ARAMYNLGQA--------GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSF 202
            RA YN            GLTFW+PN+NIFRDPRWGRG ET GEDP + S + V F+K  
Sbjct: 51  GRAKYNEFTKHDDRDIYKGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGL 110

Query: 203 QGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSF 262
           QG+                               L L+A  KH   +   +     R+ F
Sbjct: 111 QGQ----------------------------AKYLKLAATAKHFAVHSGPEG---LRHGF 139

Query: 263 NAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKG 321
           +A+++++D  +T+ P F++ +E+     IM +YN V+GVPA +   L +    ++W F+G
Sbjct: 140 DAVVSDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSEMLLKDILHDKWSFEG 199

Query: 322 YITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDI 381
           ++ SD  A   + E   YTK   ++    +KAG+++  G  + +    A+D+G V E++I
Sbjct: 200 HVVSDYMAPEDVHENHKYTKDAAETMGLAIKAGLNLVAG-HIEQSLHEALDRGLVTEEEI 258

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
             A+++L++ ++RLG+F  D    +Y  +  +   T  H  L+  AA +  VLLKND   
Sbjct: 259 TNAVISLYATRVRLGMFATD---NEYDAIPYEANDTKAHNNLSEIAAEKSFVLLKNDG-V 314

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASG--- 495
           LPL K  + ++A++GP  ++   + G Y G P    ++L G++  +    + HY+ G   
Sbjct: 315 LPLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGL 374

Query: 496 ----CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRD---------RVSLLL 542
                 +    +D    EAV  A+ +D V+ V GLD T E E+ D         + +L L
Sbjct: 375 FQDHAAEPLAKADERESEAVIAAEHSDVVVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSL 434

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
           PG+Q  L+  +    K PV+++L  G  L +   E    + +I+ I YPG  G  A+A++
Sbjct: 435 PGRQRQLLERLLAVGK-PVVVLLASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADV 493

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FG  +P G+LP+T+Y          D NM         GR+YR+ T   +Y FG+GL+Y
Sbjct: 494 LFGAVSPSGKLPVTFYKNVDNLPAFEDYNM--------AGRTYRYMTDEALYPFGYGLTY 545

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           ++     L   S                           + D  T        V  ++ N
Sbjct: 546 SSVELSDLQVKS---------------------------YEDTAT--------VTATIQN 570

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE-ISFGVDPCEQLSIANK 781
            G+ D   VV ++ +        P  QL GF RV+ + KG+K+ I+F + P +   + + 
Sbjct: 571 TGNFDTDEVVQVYVKDLGSEFAVPNAQLKGFKRVY-LGKGAKQTITFDLRP-QDFEVFDA 628

Query: 782 HGRRILPLGNHVLMVG 797
            GR  +      + VG
Sbjct: 629 QGRNFIDSDRFEISVG 644


>gi|308080460|ref|NP_001183746.1| uncharacterized protein LOC100502339 [Zea mays]
 gi|238014360|gb|ACR38215.1| unknown [Zea mays]
          Length = 344

 Score =  320 bits (819), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 151/328 (46%), Positives = 209/328 (63%), Gaps = 1/328 (0%)

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDL 528
           Y G PC   + L+GL++YV+   + +GC+   C+  A   +AV +A   D+V +  GL  
Sbjct: 7   YFGPPCESTTPLKGLQSYVNDVRFLAGCNSAACDV-AATDQAVALAGSEDYVFLFMGLSQ 65

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
            QE+E +DR SLLLPG Q SL+T+VA  SKRPVILVL  GGP+D++FA+++ +I +ILW 
Sbjct: 66  KQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGAILWA 125

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
           GYPG+AG  A+A+++FGD NP GRLP+TWYPE FTKVPM DM MRAD +  YPGRSYRFY
Sbjct: 126 GYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTKVPMTDMRMRADPTSGYPGRSYRFY 185

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
            G  VY FG+GLSY+ +S + +   S   +S++L  G  + +  Q G R  +V       
Sbjct: 186 QGNTVYKFGYGLSYSTFSRRLVHGTSVPALSSTLLTGLRETMTPQDGDRSYHVDAIGTEG 245

Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
           C  L+F   + V N G +DG H V++F R P   QG P  QLIGF   H  A  + ++ F
Sbjct: 246 CEQLKFPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQLIGFRSQHLKAGETAKLRF 305

Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMV 796
            + PC+  S     GR+++ +G+H LMV
Sbjct: 306 DISPCKHFSRVRADGRKVIDIGSHFLMV 333


>gi|297740661|emb|CBI30843.3| unnamed protein product [Vitis vinifera]
          Length = 401

 Score =  319 bits (818), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 172/435 (39%), Positives = 248/435 (57%), Gaps = 41/435 (9%)

Query: 373 KGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGI 432
           +GK +E+D+D +L NL+ V  ++G F+G P    Y  L   D+CT EH +LA DAARQGI
Sbjct: 2   QGKAREEDVDTSLRNLYIVLTQVGFFDGIP---SYESLDKKDLCTKEHIELAADAARQGI 58

Query: 433 VLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHY 492
           VLLKN  + LPL+   + +LA+IGP  N   +M G Y G+PC   S L G  AY  K  Y
Sbjct: 59  VLLKNINETLPLDPAKLKNLALIGPHANATIEMLGNYAGVPCQYSSPLDGFSAY-GKVTY 117

Query: 493 ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTS 552
             GC++V C++      AV  +K AD  I++ GLD T E E  DR  LLLPG Q  L+  
Sbjct: 118 EMGCNNVTCDNKTFIMPAVEASKNADATILLVGLDKTVEGEGLDRNDLLLPGYQTELILQ 177

Query: 553 VARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGR 612
           V   SK P+ILV+  G  +D+SF++ D ++ +ILW GYPGE G +A+A++++G +NPGGR
Sbjct: 178 VIVASKGPIILVIMSGSAVDISFSKTDDRVKAILWAGYPGEEGGRAIADVVYGKYNPGGR 237

Query: 613 LPMTWYPESF-TKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS 671
           LP+TW+   + + +PM  M++R      YPGR+Y+F+ G+ VY FGHGLSYT ++Y   S
Sbjct: 238 LPLTWHQNDYLSMLPMTSMSLRP--VNNYPGRTYKFFNGSVVYPFGHGLSYTKFNYTLRS 295

Query: 672 APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
           +                                   SC    F + I V N G   G+ V
Sbjct: 296 SN---------------------------------MSCKD-HFELDIEVKNIGAKHGNEV 321

Query: 732 VMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGN 791
           V+++++ P    GT  KQ+IGF RV   A GS+ + F  + C+ L I   +  ++LP G 
Sbjct: 322 VLVYSKPPTGIVGTHAKQVIGFKRVFVPAGGSQNVKFEFNVCKSLGIVGYNAYKLLPSGE 381

Query: 792 HVLMVGELRHSLTIE 806
           H +++G+   SL I+
Sbjct: 382 HKIIIGDSPTSLPID 396


>gi|5690010|emb|CAB51937.1| Family 3 Glycoside Hydrolase [Ruminococcus flavefaciens]
          Length = 690

 Score =  318 bits (815), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 241/784 (30%), Positives = 351/784 (44%), Gaps = 139/784 (17%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + +LS   RA+ +   L+ +EK +Q   +A A  RLG  AY WWSE LHG+A  G   
Sbjct: 6   YLDEALSDLERAEDITDRLSTEEKAEQQKYDAPAEERLGKDAYNWWSEGLHGVARAG--- 62

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                  + T FPQ +  AA F+       G   + EARA YN   A        GLT W
Sbjct: 63  -------TATMFPQTIGMAAMFDDEAVHRAGETTSREARAKYNEYSAHDDRDIYKGLTLW 115

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PN+NIFRDPRWGRGQET GEDP + S   V + K  QG+                    
Sbjct: 116 SPNVNIFRDPRWGRGQETYGEDPYLTSCLGVAYAKGLQGD-------------------- 155

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                   G  L  +AC KH   +   +    +R+ F+A    +D  +T+   F + ++ 
Sbjct: 156 --------GKVLRTAACAKHFAVHSGPE---ATRHEFDAKANMKDMTETYIAAFEALVKD 204

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
            K   +M +YN+VNG PAC   D       EWGF G+  SDC A+         TKT  +
Sbjct: 205 AKVESVMGAYNRVNGEPACA-SDFVMNKLEEWGFDGHFVSDCWAIRDFHTNHGVTKTAPE 263

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           SAA  LK G D+NCG   L H  +A ++G + E+D+ R+ + L   ++RLG+F+   +  
Sbjct: 264 SAALALKKGCDLNCGNTYL-HLLAAFNEGLINEEDLRRSCIKLMRTRVRLGMFD---KST 319

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
           +Y  L  D V   EHK+ +L  + + +VLLKN+   LPL+ +   ++ +IGP  +++  +
Sbjct: 320 EYDGLDYDIVACDEHKEFSLRCSERSMVLLKNN-GILPLDGSKYKTIGVIGPNADSVPAL 378

Query: 466 GGGYTGIPCSPKSLLRGL-EAYVSKTHYASGCH--DVPCNS----DAGFHEAVRIAKKAD 518
            G Y G      + L G+ EA+  +  Y  G H     C      D    EA  I +   
Sbjct: 379 EGNYNGKADEYITFLSGIREAHDGRVLYTEGSHLYKDRCMGLALPDDRLSEAEIITRTLR 438

Query: 519 FVIVVAGLDLTQETED---------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
               +  LD T E E+          D+  L LP  Q  LV +V    K PVI+V   G 
Sbjct: 439 CSGSLCWLDATIEGEEGDTGNEFSSGDKNDLRLPESQRKLVKTVMAKGK-PVIIVTAAGS 497

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MN 628
            ++V  A+ D+ I +  W  YPG+ G +ALA I+FG  +P G+LP+T+Y E  +K+P  +
Sbjct: 498 AINVE-ADCDALIQA--W--YPGQLGGRALANILFGKVSPSGKLPVTFY-EDASKLPDFS 551

Query: 629 DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
           D +M+         R+YR+  G  ++ FG+GL+Y+                         
Sbjct: 552 DYSMK--------NRTYRYSEGNILFPFGYGLTYSE------------------------ 579

Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFH---VQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
                             T C+ L F      + VTN G      VV ++ +        
Sbjct: 580 ------------------TECSELSFENGVATVKVTNTGSRFTEDVVQIYIK-GYSENAV 620

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-----ELR 800
           P   L GF RV   A  S+ +   + P       N+ G  I       L  G     EL 
Sbjct: 621 PNHSLCGFKRVALDAGESRIVQITL-PERAFMAVNEKGEWIKEGSEFTLYAGTSQPDELS 679

Query: 801 HSLT 804
             LT
Sbjct: 680 KKLT 683


>gi|390340546|ref|XP_001186857.2| PREDICTED: probable beta-D-xylosidase 2-like [Strongylocentrotus
           purpuratus]
          Length = 623

 Score =  318 bits (814), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 221/632 (34%), Positives = 320/632 (50%), Gaps = 84/632 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEWWSESLHG 105
           PF N SL    R   L+S L + +   QL+         A AI RL I  Y W +E L G
Sbjct: 31  PFWNQSLPWDQRLDDLLSRLKVDDMTYQLARGGADPNGPAPAIGRLQIGKYVWNTECLRG 90

Query: 106 IASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-------- 157
            A  G          + T+FPQ L  +A+F+R L   + +A   E RA YN         
Sbjct: 91  DAQAG----------NATAFPQALGLSAAFSRDLLFEVANATGYEVRAKYNYYLQKGDFN 140

Query: 158 GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
              GL  ++P INI R P WGR QET GEDP +    A  FV   QG + +         
Sbjct: 141 NHQGLNCFSPVINIMRHPYWGRNQETYGEDPYLTGELAKSFVWGLQGNHPRY-------- 192

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                              L+ +A CKH  AY   +    SR+SF+A ++++D + TF P
Sbjct: 193 -------------------LLTNAGCKHFAAYSGPENYPSSRFSFDAKVSDKDLQVTFFP 233

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
            F+ CI+ G  S +MCSYN VNG+PAC    L     R EWGFKGY+ SD  A+      
Sbjct: 234 AFKECIKAGTYS-VMCSYNSVNGIPACANSYLLNDVLRTEWGFKGYVVSDQRALELEELA 292

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGT---CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
            NYT ++ D+A   LKAG +++ GT    +  +   A++ G +  +D+  ++  LF  +L
Sbjct: 293 HNYTTSYLDTAIKSLKAGCNLDLGTTKPAVYDYLAEAVELGMLTAQDLRDSIAPLFYTRL 352

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSL 452
           RLG F+  P    Y KL  D V  S EH+++AL AA +  VL+KND   LP+ +  + +L
Sbjct: 353 RLGEFD-PPDHNPYVKLNVDQVVESPEHQEIALKAALKSFVLVKNDGSTLPI-EGTIHTL 410

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPK---SLLRGLEAYVSKTHYASGCHDVPCNS--DAGF 507
           A++GP  NN   + G Y   P  P+   ++L GL    +KT +ASGC    C +    G 
Sbjct: 411 AVVGPFANNSKLLFGDYAPNP-DPRFVTTVLEGLSPMATKTRHASGCPSPKCVTYDQQGV 469

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVAR-TSKRPVILVLT 566
             AV     AD V+V  G  +  E+E  DR  +LLPG+Q  L+   AR  + +PVIL+L 
Sbjct: 470 LNAV---TGADVVVVCLGTGIELESEGNDRRDMLLPGKQEQLLQDAARYAAGKPVILLLF 526

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD---FNPGGRLPMTWYPESFT 623
             GPL++++A +   + +I+   +P +A   AL  ++F +    NPGGRLP TW P +  
Sbjct: 527 NAGPLNITWALSSPSVQAIVECFFPAQATGVAL-RMMFQNAPGANPGGRLPSTW-PATVA 584

Query: 624 KV-PMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
           ++ PM + +M         GR+YR++  TQ +
Sbjct: 585 QIPPMENYSMD--------GRTYRYFMATQCF 608


>gi|289670678|ref|ZP_06491753.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 886

 Score =  317 bits (813), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 177/445 (39%), Positives = 246/445 (55%), Gaps = 56/445 (12%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA +L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG            T
Sbjct: 37  RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------HAT 86

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
            FPQ +  AAS+N +L   +G+ V+ EARA +N            AGLT W+PNINIFRD
Sbjct: 87  VFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 146

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRG ET GEDP +    AV F++  QGE+               R +          
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI---------- 185

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
                 A  KHL  +   + G   R+ F+  ++  D E T+ P FR+ + QG+A  +MC+
Sbjct: 186 ------ATPKHLAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCA 236

Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  S+A  LKA
Sbjct: 237 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKA 296

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
           G D+NCG    R   +AI++G V E  +D++L+ LF+ + RLG     PRK  Y +LG  
Sbjct: 297 GHDLNCGHAY-RELGTAIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAK 354

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
           DV  + H+ LAL AA + IVLLKND   LPL  NA + LA+IGP  + ++ +   Y G  
Sbjct: 355 DVDNAAHRALALQAAAESIVLLKNDANTLPL--NAGTRLAVIGPNADALAALEANYQGTS 412

Query: 474 CSPKSLLRGLEAY--VSKTHYASGC 496
            +P + L GL       +  YA G 
Sbjct: 413 SAPVTPLLGLRQRFGAQQVRYAQGA 437



 Score =  119 bits (297), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 89/292 (30%), Positives = 145/292 (49%), Gaps = 54/292 (18%)

Query: 516 KADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
           ++D V+   GL    E E+           DR  + LP  Q +L+   A+ S +P+++VL
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 672

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
             G  + +++A+  +      W  YPG++G  A+A ++ GD NPGGRLP+T+Y       
Sbjct: 673 MSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFY------- 723

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
             +  ++ A  S    GR+YR++ G  ++ FG+GLSYT ++Y    AP +L+ S +L+AG
Sbjct: 724 -RSTKDLPAYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAP-QLS-STTLQAG 777

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
              N LQ                       V  +V N G   G  V  ++ + P   Q +
Sbjct: 778 ---NPLQ-----------------------VTTTVRNTGTHAGDEVAQVYLQYPDRPQ-S 810

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           P + L+GF RVH  A   + ++F +D    LS  ++ G+R +  GN+ L VG
Sbjct: 811 PLRSLVGFQRVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVG 861


>gi|427385932|ref|ZP_18882239.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726971|gb|EKU89834.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
           12058]
          Length = 732

 Score =  317 bits (811), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 228/782 (29%), Positives = 372/782 (47%), Gaps = 115/782 (14%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAY-EWWSESLHGIASNGPG 112
           F N  +S+  R   L+S LTL++K Q L+     +   G     + W++ LHG+    P 
Sbjct: 33  FLNQEMSMEARVADLMSRLTLEQKAQLLNHRGKTVVVDGFSIRADQWNQCLHGVKWTEP- 91

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---------AGLT 163
                     T+FP  +   A+++  L   + + ++ EARA+YN  +          GL 
Sbjct: 92  ---------TTNFPTSIALGATWDTELIHRVATVISDEARAIYNGWKQDPEFRGEHKGLI 142

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           + +P INI R+P WGR  E  GEDP       V +VK  QG                   
Sbjct: 143 YRSPVINISRNPYWGRINEIFGEDPYHTGRMGVAYVKGLQG------------------- 183

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D    L L++  KH    ++E      R   +A + E+   + + P F+ CI
Sbjct: 184 --------DDSHYLKLASTLKHYAVNNVE----VDRMKLSAQVPERMLYEYWLPHFKDCI 231

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
            +GKA  +M SYN +NGVP  +   L     +N+WG +G++ SD   V T+ E  +  + 
Sbjct: 232 VEGKAQSVMASYNAINGVPNNINKLLLTDILKNQWGHEGFVVSDLGGVKTMVEGHHQRQI 291

Query: 343 HEDSAAG-VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
             + A G  + AG D +      ++   A+ KG + E+ ++ AL  +  V+ RLG F+ D
Sbjct: 292 SCEEAVGRSIMAGCDFSDAEYE-KYIPDALRKGYLTEERLNDALRRVLLVRFRLGEFD-D 349

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
            +   Y ++ PD +   EH+ L+L+AAR+ IVLLKN+KK LP++++ +  +A+IGP  + 
Sbjct: 350 FKSVPYSRISPDVIGCKEHRNLSLEAARKSIVLLKNEKKLLPIDRSIIKRVAVIGPYADL 409

Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPC------------NSDAG 506
            +Q  G Y G+P  P + L+G++  V    +  Y  G    P             + +A 
Sbjct: 410 FNQ--GNYGGVPKDPVTPLQGIKNAVGNNVEVLYCKGAQITPVKVRKGQPIPPRFDKEAE 467

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
             +AV +A+ +D V +  G     E E RDR +L+LPG Q  LV +V   +K+ V++VL 
Sbjct: 468 MKKAVEMARNSDVVFLFVGTTADIEVEGRDRKTLVLPGNQNELVKAVYEVNKK-VVVVLM 526

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
             GP  V+  E    I ++L   +PG+ G  A+A+++FGD+NPGG+LP T Y  S  +VP
Sbjct: 527 SAGP--VAVPEVKKNIPAVLQAWWPGDEGGNAIADVLFGDYNPGGKLPYTMYA-SDEQVP 583

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
             D     D S+   G +Y +     ++ FGHGLSY+ + Y      S+L IS+ +    
Sbjct: 584 STD---EYDISK---GFTYMYLKKKPLFAFGHGLSYSKFHY------SDLQISSPV---- 627

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                         V +++  S       V + V N G   G  VV L+ R  K     P
Sbjct: 628 --------------VSVNDTVS-------VVLKVKNMGKRTGEEVVQLYVRDVKAKVVRP 666

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH-GRRILPLGNHVLMVGELRHSLTI 805
            K+L GF R+       +EI   + P + L+  ++  G  ++  G+  +++G     + +
Sbjct: 667 TKELRGFKRIALQPNEEQEIRLML-PVKSLAFYDESIGDFLVEPGSFEILLGSASDDIRL 725

Query: 806 ET 807
           ++
Sbjct: 726 QS 727


>gi|289664871|ref|ZP_06486452.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 886

 Score =  317 bits (811), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 177/445 (39%), Positives = 246/445 (55%), Gaps = 56/445 (12%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA +L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG            T
Sbjct: 37  RAAALVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------HAT 86

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
            FPQ +  AAS+N +L   +G+ V+ EARA +N            AGLT W+PNINIFRD
Sbjct: 87  VFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 146

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRG ET GEDP +    AV F++  QGE+               R +          
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI---------- 185

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
                 A  KHL  +   + G   R+ F+  ++  D E T+ P FR+ + QG+A  +MC+
Sbjct: 186 ------ATPKHLAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCA 236

Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  S+A  LKA
Sbjct: 237 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKA 296

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
           G D+NCG    R   +AI++G V E  +D++L+ LF+ + RLG     PRK  Y +LG  
Sbjct: 297 GHDLNCGHAY-RELGTAIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAK 354

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
           DV  + H+ LAL AA + IVLLKND   LPL  NA + LA+IGP  + ++ +   Y G  
Sbjct: 355 DVDNAAHRALALQAAAESIVLLKNDANTLPL--NAGTRLAVIGPNADALAALEANYQGTS 412

Query: 474 CSPKSLLRGLEAY--VSKTHYASGC 496
            +P + L GL       +  YA G 
Sbjct: 413 SAPVTPLLGLRQRFGAQQVRYAQGA 437



 Score =  119 bits (297), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 89/292 (30%), Positives = 145/292 (49%), Gaps = 54/292 (18%)

Query: 516 KADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
           ++D V+   GL    E E+           DR  + LP  Q +L+   A+ S +P+++VL
Sbjct: 614 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 672

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
             G  + +++A+  +      W  YPG++G  A+A ++ GD NPGGRLP+T+Y       
Sbjct: 673 MSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARMLAGDDNPGGRLPVTFY------- 723

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
             +  ++ A  S    GR+YR++ G  ++ FG+GLSYT ++Y    AP +L+ +A L+AG
Sbjct: 724 -RSTKDLPAYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAY---DAP-QLSTTA-LQAG 777

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
              N LQ                       V  +V N G   G  V  ++ + P   Q +
Sbjct: 778 ---NPLQ-----------------------VTTTVRNTGTRAGDEVAQVYLQYPDRPQ-S 810

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           P + L+GF RVH  A   + ++F +D    LS  ++ G+R +  GN+ L VG
Sbjct: 811 PLRSLVGFQRVHLAAGEQRTLTFNLD-ARALSDVDRSGQRAVEAGNYTLFVG 861


>gi|282877070|ref|ZP_06285912.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
 gi|281300752|gb|EFA93079.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
          Length = 721

 Score =  316 bits (809), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 220/749 (29%), Positives = 343/749 (45%), Gaps = 122/749 (16%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           +YPF +  LS   RA  L   LTL+EK   + +N+  +PRLGI  ++WW E+LHG A  G
Sbjct: 23  TYPFQDARLSFEQRADDLCKRLTLEEKAGLMQNNSKPVPRLGIKQFQWWGEALHGSARTG 82

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GL 162
                       T FPQ +  AASF+  L   + +  + EARA YN+            +
Sbjct: 83  ----------LATVFPQTIGMAASFDDELLLQVFNIASTEARAKYNVAAKKGYFDTSWSV 132

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           + W PN+NIFRDPRWGRGQET GEDP + S      V+  QG            G    +
Sbjct: 133 SLWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVEGLQG------------GKGPHK 180

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRS 281
             K F             AC KH   +   +W   +R+S +   ++ +D  +T+ P F+ 
Sbjct: 181 YYKAF-------------ACAKHFAVHSGPEW---NRHSISIDDVSPRDFHETYLPAFKH 224

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYT 340
            ++ G    +MC+YN ++G P C    L ++  R+EWGFKG + SDC A+  I+  + + 
Sbjct: 225 LVQVGGVKEVMCAYNSIDGEPCCSDQRLLEQLLRDEWGFKGIVVSDCGAIDDIWR-KGFH 283

Query: 341 KTHEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           +   D+A   A  +K G D++CG         A+  GKV E+ ID++L  L   +++LG 
Sbjct: 284 EVEPDAAHASARAVKGGTDMSCGQT-YGSLPEAVRLGKVTEERIDKSLKRLIVGRMQLGE 342

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           F+ D    ++  +   DV T   +++AL  AR+ + LL N    LPL+K  +  + ++GP
Sbjct: 343 FDPDSIT-RWNAISMKDVSTPASREVALKMARETMTLLHNPMHALPLSKQ-LKQVVVMGP 400

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVP----CNSDAGFHEAV 511
             N+   M G Y G P    ++L G+   +   +  +  GC  V      N      + V
Sbjct: 401 NANDSVMMWGNYNGTPHHTVTILDGIRRKIGAQRVKFIEGCGLVEPHRRGNQALTTQQLV 460

Query: 512 RIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPV 561
                   VI V G+    E E            DRV++ LP  Q  ++ ++    K+ +
Sbjct: 461 EEVGDNKTVIFVGGISPQLEGEQLEVEAKGFKGGDRVTIELPQVQREMIAALHAAGKQVI 520

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           ++  +G     +      +   +IL   YPGE G +A+A+++FGD+NP G+LP+T+Y + 
Sbjct: 521 MVNCSGSA---IGLVPEVTHTDAILQAWYPGERGGEAVADVLFGDYNPAGKLPVTFYRDD 577

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
                  D NMR         R+YR++ G  ++ FGHGLSYT++              A 
Sbjct: 578 SQLPDYLDYNMR--------NRTYRYFKGKPLFPFGHGLSYTSFKIG----------KAK 619

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
           ++ G                              + +SV N G  DG  VV L+      
Sbjct: 620 MRNGK-----------------------------LTVSVKNTGKRDGEEVVQLYISCLDD 650

Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
             G P K L GF R+   A   + ++  +
Sbjct: 651 PNG-PIKSLRGFKRMALQAGEQRTVTLNL 678


>gi|58581402|ref|YP_200418.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|58425996|gb|AAW75033.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
          Length = 889

 Score =  316 bits (809), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 178/453 (39%), Positives = 247/453 (54%), Gaps = 56/453 (12%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           NT  +   RA  L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG     
Sbjct: 32  NTDATPEQRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----- 86

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWA 166
                  T FPQ +  AAS+N  L   +G+ V+ EARA +N            AGLT W+
Sbjct: 87  -----YATVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGNDHKRYAGLTIWS 141

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           PNINIFRDPRWGRG ET GEDP +    AV F+   QGE+                    
Sbjct: 142 PNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQGEDL------------------- 182

Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
                   D     A  KHL  +   + G   R+ F+  ++ +D E T+ P FR+ I +G
Sbjct: 183 --------DHPRTIATPKHLAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAAIVEG 231

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
           +A  +MC+YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  
Sbjct: 232 QAGAVMCAYNSLHGTPACAADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAG 291

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           S+A  LKAG D+NCG    R   +AI +G+V E  +D++L+ LF+ + RLG     PRK 
Sbjct: 292 SSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKD 349

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            Y +LG  DV  ++H+ LAL AA + IVLLKN+   LPL  NA + LA+IGP  + ++ +
Sbjct: 350 PYARLGAKDVDNAQHRALALQAAAESIVLLKNNANTLPL--NAGTRLAVIGPNADALAAL 407

Query: 466 GGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
              Y G   +P + L GL       +  YA G 
Sbjct: 408 EANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 440



 Score =  117 bits (292), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 80/262 (30%), Positives = 133/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y         +  ++ A  S    GR+YR++ G  ++ 
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 755

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP +L+ S +++AGS                             
Sbjct: 756 FGYGLSYTRFAY---DAP-QLS-STAVQAGST--------------------------LQ 784

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 785 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 842

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  GN+ L VG
Sbjct: 843 LSDVDRSGQRAVEAGNYTLFVG 864


>gi|294462719|gb|ADE76904.1| unknown [Picea sitchensis]
          Length = 304

 Score =  315 bits (807), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 152/303 (50%), Positives = 208/303 (68%), Gaps = 9/303 (2%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           F EAV  AKK D VIVV GLD TQE E+RDR+ L LPGQQ +LV  V+R +KRPV+ V+ 
Sbjct: 2   FGEAVETAKKVDEVIVVVGLDQTQEKEERDRIKLTLPGQQENLVYQVSRAAKRPVVFVIL 61

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            GGP+DVSFA  D QISSI+W GYPG+AG +ALAEIIFGD+NPGGRLPMTWYP+ F K+P
Sbjct: 62  SGGPVDVSFAVNDPQISSIIWAGYPGQAGGQALAEIIFGDYNPGGRLPMTWYPQDFVKIP 121

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP-SELTISASLKAG 685
           M DMNMR +    YPGR+YRFYTG +V+ FG GLSY+ YSY F S    ++ ++ +++  
Sbjct: 122 MTDMNMRPNHDTGYPGRTYRFYTGKKVFEFGQGLSYSAYSYNFSSTTIQKIDLNVTMEH- 180

Query: 686 SDKNILQQTGSR-LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
                 +  G+R   +V + E T C  L+F   I V N   +DG H V+L+++ P   +G
Sbjct: 181 -----FEALGNRGKGHVRV-ENTPCRKLKFRSSIFVRNHDKMDGRHAVLLYSKSPATHKG 234

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLT 804
            P+KQLIGF  VH   K + +++F V PC+  S   ++G+R+L +G+H L+VG+ ++ ++
Sbjct: 235 APQKQLIGFRSVHVQGKHTAKVTFVVKPCDHFSTVEENGQRLLAIGSHSLIVGDTQYPVS 294

Query: 805 IET 807
           + T
Sbjct: 295 LVT 297


>gi|325925754|ref|ZP_08187127.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
 gi|325543811|gb|EGD15221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
          Length = 874

 Score =  315 bits (806), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 177/453 (39%), Positives = 248/453 (54%), Gaps = 56/453 (12%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           NT  +   RA +L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG     
Sbjct: 17  NTDATPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----- 71

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWA 166
                  T FPQ +  AAS+N  L   +G+ V+ EARA +N            AGLT W+
Sbjct: 72  -----YATVFPQAIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWS 126

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           PNINIFRDPRWGRG ET GEDP +    AV F++  QGE+               R +  
Sbjct: 127 PNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI-- 173

Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
                         A  KH+  +   + G   R+ F+  ++ +D E T+ P FR+ I +G
Sbjct: 174 --------------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAAIVEG 216

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
           +A  +MC+YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  
Sbjct: 217 QAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAG 276

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           S+A  LKAG D+NCG    R   +AI +G+V E  +D++L+ LF+ + RLG     PRK 
Sbjct: 277 SSAAALKAGHDLNCGHA-YRELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKD 334

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            Y +LG  DV  + H+ LAL AA + IVLLKND   LPL   A + LA+IGP  + ++ +
Sbjct: 335 PYARLGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--KAGTRLAVIGPNADALAAL 392

Query: 466 GGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
              Y G   +P + L GL       +  YA G 
Sbjct: 393 EANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 425



 Score =  119 bits (298), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 80/262 (30%), Positives = 130/262 (49%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 632 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 688

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y         +  ++ A  S    GR+YR++ G  ++ 
Sbjct: 689 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFA 740

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP   T   +L+AGS                             
Sbjct: 741 FGYGLSYTRFAY---DAPQLSTT--TLQAGSS--------------------------LQ 769

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 770 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 827

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  GN+ L VG
Sbjct: 828 LSDVDRSGQRAVEAGNYTLFVG 849


>gi|336275603|ref|XP_003352555.1| hypothetical protein SMAC_01389 [Sordaria macrospora k-hell]
 gi|380094444|emb|CCC07823.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 833

 Score =  315 bits (806), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 242/830 (29%), Positives = 365/830 (43%), Gaps = 146/830 (17%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+++ S   RA SL+  LT+ EK+  L D +   PRLG+P Y WWSE LHG+A + PGV 
Sbjct: 37  CDSTASAPDRAASLVEQLTIDEKLVNLVDQSKGAPRLGLPPYAWWSEGLHGVAGS-PGVV 95

Query: 115 FNGT---VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
           FN +    S  TSF  V+   A+ +  L   +G+A++ EARA    G  GL +W PNIN 
Sbjct: 96  FNTSGYPFSYATSFANVITLGAALDDDLVYEVGTAISTEARAFAKFGFGGLDYWTPNINP 155

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
           ++DPRWGRG ETPGEDP+ +  Y    V   +G                           
Sbjct: 156 YKDPRWGRGAETPGEDPLRIKGYVKAMVAGLEGN-------------------------- 189

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
             G    + A CKH  AYDLE+W   +RY F+A+++ QD  + + PPF+ C    +   I
Sbjct: 190 --GTVRKVIATCKHFAAYDLERWRGLTRYDFDAVVSLQDLSEYYLPPFQQCARDSRVGSI 247

Query: 292 MC--------------------------------SYNQVNGVPACLRGDLFQKA-RNEWG 318
           MC                                SYN +NG PAC    L     R+ W 
Sbjct: 248 MCRYVSFFLPPFPSFPRLVTRQSGNQVDIVDNFRSYNALNGTPACASTYLMTNILRDHWN 307

Query: 319 F---KGYITSDCDAVATIF-EYQNYTKTHEDSAAGVLKAGMDINC---GTCMLRHTQSAI 371
           +     YITSDC+A+     +  N+++T  ++AA    AG D  C   G         A 
Sbjct: 308 WTNHNNYITSDCNAIQDFLPDNHNFSQTPAEAAAAAYIAGTDTVCEVSGWPPYTDVVGAY 367

Query: 372 DKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQG 431
           ++  + E  ID AL  L+   +R G  +     G+     PD    S             
Sbjct: 368 NQSLLSESVIDTALRRLYEGLIRAGYLD----HGRPASSSPDKAPFSS------------ 411

Query: 432 IVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH 491
                    FLPL+     ++A+IG   N    + G Y+G+P    + +  +       +
Sbjct: 412 -------PDFLPLDLTG-KTVALIGHWANATRTIRGPYSGLPPFYHNPMYAVRQLKLSFY 463

Query: 492 YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
           YA+G      ++D     A+  A+ AD V+   G D T  +ED DR S+  P  Q++L+ 
Sbjct: 464 YANGPVVNSTDADTWTAAAMLAAESADVVLYFGGTDTTVASEDLDRESIAWPKTQLTLIE 523

Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
            +A+  K  V++ L  G  +D +    +  ISSILW+GYPG++G  A+ +++ G     G
Sbjct: 524 KLAQVGKPMVVIQL--GDQVDDTPLLNNKNISSILWVGYPGQSGGTAVFDVLTGKKASAG 581

Query: 612 RLPMTWYPESFT-KVPMNDMNM--------------------------------RADSSR 638
           RLP+T YP  +  +VP+ +M +                                R + + 
Sbjct: 582 RLPVTQYPAGYVDEVPLTEMGLRPFNHSSSTTSSDVSQSGVEEGNGLTIQTRSTRGNKTL 641

Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG-SDKNILQQTGSR 697
             PGR+Y++Y    V  FG+GL YT ++     + S    S +     S +++L  T   
Sbjct: 642 SSPGRTYKWYP-RPVLPFGYGLHYTPFNISLSLSTSSNASSTTDNTSISIRSLL--TSQT 698

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
              +H+D    C    F V  S+TN G     +V +LF       +  P K L+G+ RV 
Sbjct: 699 CTAIHLD---LCPFSPFSV--SITNTGSHTSDYVALLFLSGKFGPKPDPLKTLVGYKRVK 753

Query: 758 TVAKGSKEISFGVD-PCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            +  G   +  G D P    ++A   G      GN VL  G  +  L ++
Sbjct: 754 DIKPGETRVVGGEDIPVNLAAVARVDGN-----GNTVLYPGTYKFRLDVD 798


>gi|449489074|ref|XP_002195511.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Taeniopygia guttata]
          Length = 685

 Score =  315 bits (806), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 236/722 (32%), Positives = 340/722 (47%), Gaps = 110/722 (15%)

Query: 85  ASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIG 144
           A  IPRLGI  Y W +E L G     PG          T+FPQ L  AA+F+  L   + 
Sbjct: 6   APPIPRLGIAPYNWNTECLRG-DGEAPG--------WATAFPQALGLAAAFSPELIYRVA 56

Query: 145 SAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAV 196
           +A A E RA +N   A        GL+ ++P +NI R P WGR QET GEDP +    A 
Sbjct: 57  NATATEVRAKHNSFAAAGRYSDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPFLSGELAR 116

Query: 197 EFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGN 256
            FV+  QG +               R +K              SA CKH   +     G+
Sbjct: 117 SFVQGLQGPH--------------PRYVKA-------------SAGCKHFSVHG----GH 145

Query: 257 FSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RN 315
            +   +   + E+D   TF P F++C+  G  S  MCSYN++NGVPAC    L     R 
Sbjct: 146 ENILLYLLTVLERDWRMTFLPQFQACVRAGSYS-FMCSYNRINGVPACANKKLLTDILRG 204

Query: 316 EWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAI 371
           EWGF GY+ SD  AV  I    +YT++  ++A   + AG ++     M          A+
Sbjct: 205 EWGFDGYVVSDEGAVELIMLGHHYTRSFLETAVASVNAGCNLELSYGMRNNVFMRIPEAL 264

Query: 372 DKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQG 431
             G +  + +   +  LF  ++RLG F+  P    Y  L    V + EH+ L+L+AA + 
Sbjct: 265 AMGNITLQMLRDRVRPLFYTRMRLGEFD-PPAMNPYSSLDLSVVQSPEHRNLSLEAAVKS 323

Query: 432 IVLLKNDKKFLPLNKNAVSS--LAIIGPLVNNISQMGGGYTGIPCSPKSLL---RGLEAY 486
            VLLKN +  LPL    +SS  LA++GP  +N   + G Y  +P  P+ +    RGLE  
Sbjct: 324 FVLLKNVRGTLPLKAQDLSSQHLAVVGPFADNPRVLFGDYAPVP-EPRYIYTPRRGLEML 382

Query: 487 VSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
            +   +A+GC +  C   +   E V++   AD V+V  G  +  ETE +DR  L LPG Q
Sbjct: 383 GANVSFAAGCSEPRCQRYSR-AELVKVVGAADVVLVCLGTGVDVETEAKDRSDLSLPGHQ 441

Query: 547 MSLVT-SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
           + L+  +V   + RPVIL+L   GPLDVS+A+A   + +IL   +P +A   A+A ++ G
Sbjct: 442 LELLQDAVQAAAGRPVILLLFNAGPLDVSWAQAHDGVGAILACFFPAQATGLAIARVLLG 501

Query: 606 D--FNPGGRLPMTWYPESFTKV-PMNDMNMRADSSRQYPGRSYRFY-TGTQVYGFGHGLS 661
           +   +P GRLP TW P    +V PM +  M         GR+YR+Y     +Y FG+GLS
Sbjct: 502 EAGASPAGRLPATW-PAGMHQVPPMENYTME--------GRTYRYYGQEAPLYPFGYGLS 552

Query: 662 YTNYSYK-FLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           YT + Y+  + +P  L + A+L                                 V + +
Sbjct: 553 YTTFRYRDLVLSPPVLPLCANLS--------------------------------VSVVL 580

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
            N G  D   VV L+ R    S   P  QL+ F RV   A    ++SF V   EQ ++  
Sbjct: 581 ENTGLRDSEEVVQLYLRWEHSSVPVPRWQLVAFRRVAVPAGREAKLSFQVL-AEQRAVWA 639

Query: 781 KH 782
           +H
Sbjct: 640 QH 641


>gi|84623339|ref|YP_450711.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188577358|ref|YP_001914287.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|84367279|dbj|BAE68437.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188521810|gb|ACD59755.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 889

 Score =  314 bits (805), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 177/453 (39%), Positives = 247/453 (54%), Gaps = 56/453 (12%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           NT  +   RA  L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG     
Sbjct: 32  NTDATPEQRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----- 86

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWA 166
                  T FPQ +  AAS+N  L   +G+ V+ EARA +N            AGLT W+
Sbjct: 87  -----YATVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWS 141

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           PNINIFRDPRWGRG ET GEDP +    AV F+   QG++                    
Sbjct: 142 PNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDL------------------- 182

Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
                   D     A  KHL  +   + G   R+ F+  ++ +D E T+ P FR+ I +G
Sbjct: 183 --------DHPRTIATPKHLAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAAIVEG 231

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
           +A  +MC+YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  
Sbjct: 232 QAGAVMCAYNSLHGTPACAADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAG 291

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           S+A  LKAG D+NCG    R   +AI +G+V E  +D++L+ LF+ + RLG     PRK 
Sbjct: 292 SSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKD 349

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            Y +LG  DV  ++H+ LAL AA + IVLLKN+   LPL  NA + LA+IGP  + ++ +
Sbjct: 350 PYARLGAKDVDNAQHRALALQAAAESIVLLKNNANTLPL--NAGTRLAVIGPNADALAAL 407

Query: 466 GGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
              Y G   +P + L GL       +  YA G 
Sbjct: 408 EANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 440



 Score =  117 bits (292), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 80/262 (30%), Positives = 133/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y         +  ++ A  S    GR+YR++ G  ++ 
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 755

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP +L+ S +++AGS                             
Sbjct: 756 FGYGLSYTRFAY---DAP-QLS-STAVQAGST--------------------------LQ 784

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 785 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 842

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  GN+ L VG
Sbjct: 843 LSDVDRSGQRAVEAGNYTLFVG 864


>gi|346725879|ref|YP_004852548.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346650626|gb|AEO43250.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 889

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 177/453 (39%), Positives = 248/453 (54%), Gaps = 56/453 (12%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           NT  +   RA +L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG     
Sbjct: 32  NTDATPDQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----- 86

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWA 166
                  T FPQ +  AAS+N  L   +G+ V+ EARA +N            AGLT W+
Sbjct: 87  -----YATVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWS 141

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           PNINIFRDPRWGRG ET GEDP +    AV F++  QGE+               R +  
Sbjct: 142 PNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI-- 188

Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
                         A  KH+  +   + G   R+ F+  ++ +D E T+ P FR+ I +G
Sbjct: 189 --------------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAAIVEG 231

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
           +A  +MC+YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  
Sbjct: 232 QAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAG 291

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           S+A  LKAG D+NCG    R   +AI +G+V E  +D++L+ LF+ + RLG     PRK 
Sbjct: 292 SSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFATRYRLGELEA-PRKD 349

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            Y +LG  DV  + H+ LAL AA + IVLLKND   LPL   A + LA+IGP  + ++ +
Sbjct: 350 PYARLGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--KAGTRLAVIGPNADALAAL 407

Query: 466 GGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
              Y G   +P + L GL       +  YA G 
Sbjct: 408 EANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 440



 Score =  119 bits (298), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 80/262 (30%), Positives = 130/262 (49%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y         +  ++ A  S    GR+YR++ G  ++ 
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFA 755

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP   T   +L+AGS                             
Sbjct: 756 FGYGLSYTRFAY---DAPQLSTT--TLQAGSS--------------------------LQ 784

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 785 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 842

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  GN+ L VG
Sbjct: 843 LSDVDRSGQRAVEAGNYTLFVG 864


>gi|78048767|ref|YP_364942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
 gi|78037197|emb|CAJ24942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 889

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 177/453 (39%), Positives = 248/453 (54%), Gaps = 56/453 (12%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           NT  +   RA +L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG     
Sbjct: 32  NTDATPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----- 86

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWA 166
                  T FPQ +  AAS+N  L   +G+ V+ EARA +N            AGLT W+
Sbjct: 87  -----YATVFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWS 141

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           PNINIFRDPRWGRG ET GEDP +    AV F++  QGE+               R +  
Sbjct: 142 PNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI-- 188

Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
                         A  KH+  +   + G   R+ F+  ++ +D E T+ P FR+ I +G
Sbjct: 189 --------------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAAIVEG 231

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
           +A  +MC+YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  
Sbjct: 232 QAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAG 291

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           S+A  LKAG D+NCG    R   +AI +G+V E  +D++L+ LF+ + RLG     PRK 
Sbjct: 292 SSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKD 349

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            Y +LG  DV  + H+ LAL AA + IVLLKND   LPL   A + LA+IGP  + ++ +
Sbjct: 350 PYARLGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--KAGTRLAVIGPNADALAAL 407

Query: 466 GGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
              Y G   +P + L GL       +  YA G 
Sbjct: 408 EANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 440



 Score =  119 bits (298), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 80/262 (30%), Positives = 130/262 (49%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y         +  ++ A  S    GR+YR++ G  ++ 
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFA 755

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP   T   +L+AGS                             
Sbjct: 756 FGYGLSYTRFAY---DAPQLSTT--TLQAGSS--------------------------LQ 784

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 785 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 842

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  GN+ L VG
Sbjct: 843 LSDVDRSGQRAVEAGNYTLFVG 864


>gi|294627323|ref|ZP_06705909.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292598405|gb|EFF42556.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 886

 Score =  312 bits (800), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 176/449 (39%), Positives = 245/449 (54%), Gaps = 56/449 (12%)

Query: 60  SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
           S   RA +L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG         
Sbjct: 33  SPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG--------- 83

Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
              T FPQ +  AAS+N SL   +G+ V+ EARA +N            AGLT W+PNIN
Sbjct: 84  -YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNIN 142

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           IFRDPRWGRG ET GEDP +    AV F++  QGE+                        
Sbjct: 143 IFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDL----------------------- 179

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
               D     A  KH+  +   + G   R+ F+  ++  D E T+ P FR+ + +G+A  
Sbjct: 180 ----DHPRTIATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 232

Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MC+YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  S+A 
Sbjct: 233 VMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 292

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            LKAG D+NCG    R   +AI++G V E  +D++L+ LF+ + RLG     PRK  Y +
Sbjct: 293 ALKAGHDLNCGHAY-RDLGTAIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYAR 350

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DV  + H+ LAL AA + IVLLKND   LPL   A + LA+IGP  + ++ +   Y
Sbjct: 351 LGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--KAGTRLAVIGPNADALAALEANY 408

Query: 470 TGIPCSPKSLLRGLEAY--VSKTHYASGC 496
            G   +P + L GL       +  YA G 
Sbjct: 409 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 644 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 700

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y         +  ++ A  S    GR+YR++ G  ++ 
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 752

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP   T   +L+AG   N LQ                       
Sbjct: 753 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 781

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 782 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFHLD-ARA 839

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  G++ L VG
Sbjct: 840 LSDVDRSGQRAVEAGDYTLFVG 861


>gi|294665226|ref|ZP_06730524.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292605014|gb|EFF48367.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 886

 Score =  312 bits (800), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 176/449 (39%), Positives = 245/449 (54%), Gaps = 56/449 (12%)

Query: 60  SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
           S   RA +L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG         
Sbjct: 33  SPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG--------- 83

Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
              T FPQ +  AAS+N SL   +G+ V+ EARA +N            AGLT W+PNIN
Sbjct: 84  -YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNIN 142

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           IFRDPRWGRG ET GEDP +    AV F++  QGE+                        
Sbjct: 143 IFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGEDL----------------------- 179

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
               D     A  KH+  +   + G   R+ F+  ++  D E T+ P FR+ + +G+A  
Sbjct: 180 ----DHPRTIATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 232

Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MC+YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  S+A 
Sbjct: 233 VMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 292

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            LKAG D+NCG    R   +AI++G V E  +D++L+ LF+ + RLG     PRK  Y +
Sbjct: 293 ALKAGHDLNCGHA-YRDLGTAIERGDVDEALLDQSLVRLFAARYRLGELEA-PRKDPYAR 350

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DV  + H+ LAL AA + IVLLKND   LPL   A + LA+IGP  + ++ +   Y
Sbjct: 351 LGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--KAGTRLAVIGPNADALAALEANY 408

Query: 470 TGIPCSPKSLLRGLEAY--VSKTHYASGC 496
            G   +P + L GL       +  YA G 
Sbjct: 409 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437



 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 644 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 700

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y         +  ++ A  S    GR+YR++ G  ++ 
Sbjct: 701 GTAMARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 752

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP   T   +L+AG   N LQ                       
Sbjct: 753 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 781

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 782 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFHLD-ARA 839

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  G++ L VG
Sbjct: 840 LSDVDRSGQRAVEAGDYTLFVG 861


>gi|381169747|ref|ZP_09878910.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
 gi|380689765|emb|CCG35397.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
          Length = 874

 Score =  310 bits (795), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 176/456 (38%), Positives = 247/456 (54%), Gaps = 56/456 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P    + S   RA +L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG  
Sbjct: 14  PPAKPAGSPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG-- 71

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLT 163
                     T FPQ +  AAS+N SL   +G+ V+ EARA +N            AGLT
Sbjct: 72  --------YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLT 123

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W+PNINIFRDPRWGRG ET GEDP +    AV F++  QGE+               R 
Sbjct: 124 IWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRT 172

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
           +                A  KH+  +   + G   R+ F+  ++  D E T+ P FR+ +
Sbjct: 173 I----------------ATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAAL 213

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
            +G+A  +MC+YN ++G P C    L   + R +WGFKG++ SDCDAV  + ++  +   
Sbjct: 214 VEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPD 273

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
           +  S+A  LKAG D+NCG    R   +AI +G+V E  +D++L+ LF+ + RLG     P
Sbjct: 274 NAGSSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-P 331

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
           RK  Y +LG  DV  + H+ LAL AA + IVLLKND   LPL   A + LA+IGP  + +
Sbjct: 332 RKDPYARLGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--RAGTRLAVIGPNADAL 389

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
           + +   Y G   +P + L GL       +  YA G 
Sbjct: 390 AALEANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 425



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 81/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 632 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKMHADAIVAAW--YPGQSG 688

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y  +    P    +M+        GR+YR++ G  ++ 
Sbjct: 689 GTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEPLFP 740

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP   T   +L+AG   N LQ                       
Sbjct: 741 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 769

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 770 VTATVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFHLD-ARA 827

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  G++ L VG
Sbjct: 828 LSDVDRSGQRAVEAGDYTLFVG 849


>gi|21243803|ref|NP_643385.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21109396|gb|AAM37921.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 886

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 175/449 (38%), Positives = 245/449 (54%), Gaps = 56/449 (12%)

Query: 60  SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
           S   RA +L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG         
Sbjct: 33  SPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG--------- 83

Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
              T FPQ +  AAS+N SL   +G+ V+ EARA +N            AGLT W+PNIN
Sbjct: 84  -YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNIN 142

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           IFRDPRWGRG ET GEDP +    AV F++  QGE+               R +      
Sbjct: 143 IFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI------ 185

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                     A  KH+  +   + G   R+ F+  ++  D E T+ P FR+ + +G+A  
Sbjct: 186 ----------ATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 232

Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MC+YN ++G P C    L   + R +WGFKG++ SDCDAV  + ++  +   +  S+A 
Sbjct: 233 VMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 292

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            LKAG D+NCG    R   +AI +G+V E  +D++L+ LF+ + RLG     PRK  Y +
Sbjct: 293 ALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYAR 350

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DV  + H+ LAL AA + IVLLKND   LPL   A + LA+IGP  + ++ +   Y
Sbjct: 351 LGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--RAGTRLAVIGPNADALAALEANY 408

Query: 470 TGIPCSPKSLLRGLEAY--VSKTHYASGC 496
            G   +P + L GL       +  YA G 
Sbjct: 409 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 81/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 644 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKMHADAIVAAW--YPGQSG 700

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y  +    P    +M+        GR+YR++ G  ++ 
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEPLFP 752

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP   T   +L+AG   N LQ                       
Sbjct: 753 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 781

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 782 VTATVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFHLD-ARA 839

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  G++ L VG
Sbjct: 840 LSDVDRSGQRAVEAGDYTLFVG 861


>gi|118489157|gb|ABK96385.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 343

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 155/344 (45%), Positives = 223/344 (64%), Gaps = 11/344 (3%)

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
           M G Y G+ C   + L+G+  Y +KT + SGC+DV CN +  F+ A   A+ AD  I+V 
Sbjct: 1   MIGNYAGVACGYTTPLQGIRRY-AKTVHLSGCNDVFCNGNQQFNAAEVAARHADATILVM 59

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           GLD + E E RDR  LLLPG Q  LV+ VAR S+ P ILVL  GGP+DVSFA+ D +I +
Sbjct: 60  GLDQSIEAEFRDRKGLLLPGYQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRIGA 119

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGR 643
           ILW+GYPG+AG  A+A+++FG  NPGG+LPMTWYP  +  KVPM +M MRAD SR YPGR
Sbjct: 120 ILWVGYPGQAGGAAIADVLFGTANPGGKLPMTWYPHDYLAKVPMTNMGMRADPSRGYPGR 179

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVH 702
           +YRFY G  V+ FGHG+SYT +++  + AP E+++  ASL      ++ + T    + + 
Sbjct: 180 TYRFYKGPVVFPFGHGMSYTTFAHSLVQAPREVSVPLASL------HVSRNTTGASNAIR 233

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           +    +C +L   V I V N GD+DG+H +++F+  P     T +KQLIGF++VH V   
Sbjct: 234 VSH-ANCEALALGVHIDVKNTGDMDGTHTLLVFSSPPGGKWST-QKQLIGFEKVHLVTGS 291

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            K +   +  C+ LS+ ++ G R +P G H L +G+L+HS++++
Sbjct: 292 QKRVKIDIHVCKHLSVVDRFGIRRIPNGEHYLYIGDLKHSISLQ 335


>gi|418519424|ref|ZP_13085476.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410704868|gb|EKQ63347.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
          Length = 886

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 175/449 (38%), Positives = 245/449 (54%), Gaps = 56/449 (12%)

Query: 60  SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
           S   RA +L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG         
Sbjct: 33  SPDQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG--------- 83

Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
              T FPQ +  AAS+N SL   +G+ V+ EARA +N            AGLT W+PNIN
Sbjct: 84  -YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNIN 142

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           IFRDPRWGRG ET GEDP +    AV F++  QGE+               R +      
Sbjct: 143 IFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI------ 185

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                     A  KH+  +   + G   R+ F+  ++  D E T+ P FR+ + +G+A  
Sbjct: 186 ----------ATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 232

Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MC+YN ++G P C    L   + R +WGFKG++ SDCDAV  + ++  +   +  S+A 
Sbjct: 233 VMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 292

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            LKAG D+NCG    R   +AI +G+V E  +D++L+ LF+ + RLG     PRK  Y +
Sbjct: 293 ALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYAR 350

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DV  + H+ LAL AA + IVLLKND   LPL   A + LA+IGP  + ++ +   Y
Sbjct: 351 LGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--RAGTRLAVIGPNADALAALEANY 408

Query: 470 TGIPCSPKSLLRGLEAY--VSKTHYASGC 496
            G   +P + L GL       +  YA G 
Sbjct: 409 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 644 DRNDIALPAPQQTLLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 700

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y         +  ++ A  S    GR+YR++ G  ++ 
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 752

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP   T   +L+AG   N LQ                       
Sbjct: 753 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 781

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 782 VTATVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFHLD-ARA 839

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  G++ L VG
Sbjct: 840 LSDVDRSGQRAVEAGDYTLFVG 861


>gi|390992294|ref|ZP_10262532.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372552957|emb|CCF69507.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 886

 Score =  310 bits (793), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 175/449 (38%), Positives = 245/449 (54%), Gaps = 56/449 (12%)

Query: 60  SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
           S   RA +L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG         
Sbjct: 33  SPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG--------- 83

Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
              T FPQ +  AAS+N SL   +G+ V+ EARA +N            AGLT W+PNIN
Sbjct: 84  -YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNIN 142

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           IFRDPRWGRG ET GEDP +    AV F++  QGE+               R +      
Sbjct: 143 IFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI------ 185

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                     A  KH+  +   + G   R+ F+  ++  D E T+ P FR+ + +G+A  
Sbjct: 186 ----------ATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 232

Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MC+YN ++G P C    L   + R +WGFKG++ SDCDAV  + ++  +   +  S+A 
Sbjct: 233 VMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 292

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            LKAG D+NCG    R   +AI +G+V E  +D++L+ LF+ + RLG     PRK  Y +
Sbjct: 293 ALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYAR 350

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DV  + H+ LAL AA + IVLLKND   LPL   A + LA+IGP  + ++ +   Y
Sbjct: 351 LGAKDVDNAAHRALALQAAAESIVLLKNDANTLPL--RAGTRLAVIGPNADALAALEANY 408

Query: 470 TGIPCSPKSLLRGLEAY--VSKTHYASGC 496
            G   +P + L GL       +  YA G 
Sbjct: 409 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437



 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 82/262 (31%), Positives = 132/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 644 DRNDIALPAAQQTLLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 700

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y         +  ++ A  S    GR+YR++ G  ++ 
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 752

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP   T   +L+AG   N LQ                       
Sbjct: 753 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 781

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 782 VTATVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 839

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  GN+ L VG
Sbjct: 840 LSDVDRSGQRAVEAGNYTLFVG 861


>gi|384420163|ref|YP_005629523.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353463076|gb|AEQ97355.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 889

 Score =  309 bits (792), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 175/453 (38%), Positives = 245/453 (54%), Gaps = 56/453 (12%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
            T  +   RA  L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG     
Sbjct: 32  TTDATPEQRAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----- 86

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWA 166
                  T FPQ +  AAS+N  L   +G+ V+ EARA +N            AGLT W+
Sbjct: 87  -----YATVFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWS 141

Query: 167 PNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKG 226
           PNINIFRDPRWGRG ET GEDP +    AV F+   QG++                    
Sbjct: 142 PNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIHGLQGDDL------------------- 182

Query: 227 FGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
                   D     A  KHL  +   + G   R+ F+  ++ +D E T+ P FR+ I +G
Sbjct: 183 --------DHPRTIATPKHLAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAAIVEG 231

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
           +A  +MC+YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  
Sbjct: 232 QAGAVMCAYNSLHGTPACAADWLINGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAG 291

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           S+A  LKAG D+NCG    R   +AI +G+V E  +D++L+ LF+ + RLG     PRK 
Sbjct: 292 SSAAALKAGHDLNCGHAY-RELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKD 349

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
            Y +LG  DV  ++H+ LAL AA + IVLLKN+   LPL   A + LA+IGP  + ++ +
Sbjct: 350 PYARLGAKDVDNAQHRALALQAAAESIVLLKNNANTLPL--KAGTRLAVIGPNADALAAL 407

Query: 466 GGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
              Y G   +P + L GL       +  YA G 
Sbjct: 408 EANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 440



 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 80/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y         +  ++ A  S    GR+YR++ G  ++ 
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 755

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP +L+ S +++AGS                             
Sbjct: 756 FGYGLSYTCFAY---DAP-QLS-STAVQAGST--------------------------LQ 784

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 785 VTTTVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFNLD-ARA 842

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  +  G+R +  GN+ L VG
Sbjct: 843 LSDVDPSGQRAVEAGNYTLFVG 864


>gi|325922365|ref|ZP_08184139.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
 gi|325547147|gb|EGD18227.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
          Length = 889

 Score =  308 bits (790), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 178/474 (37%), Positives = 253/474 (53%), Gaps = 66/474 (13%)

Query: 35  PVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIP 94
           P++  P F  K          +T+ +   RA +L++ ++ +EK+ Q  ++A AIPRLGIP
Sbjct: 21  PLIGAPAFAAK----------STNETPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIP 70

Query: 95  AYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
           AYEWWSE LHGIA NG            T FPQ +  AAS+N  L   +G+ V+ EARA 
Sbjct: 71  AYEWWSEGLHGIARNG----------YATVFPQAIGLAASWNTQLMQQVGTVVSTEARAK 120

Query: 155 YNLG---------QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGE 205
           +N            AGLT W+PNINIFRDPRWGRG ET GEDP +    AV F++  QG+
Sbjct: 121 FNQAGGPGKDHKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGD 180

Query: 206 NWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
           +                            D     A  KH+  +   + G   R+SF+  
Sbjct: 181 DL---------------------------DHPRTIATPKHIAVHSGPEPG---RHSFDVD 210

Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYIT 324
           ++ +D E T+ P FR+ +  G+A  +MC+YN ++G PAC    L   + R +WGFKG++ 
Sbjct: 211 VSPRDVEATYTPAFRAALIDGQAGSVMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVV 270

Query: 325 SDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRA 384
           SDCDAV  + ++  +   +  S+A  LKAG D+NCG    R   +AI++G+V E  +D++
Sbjct: 271 SDCDAVDDMTQFHYFRPDNAGSSAASLKAGHDLNCGYAY-RALGTAIERGEVDEALLDQS 329

Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
           L+ LF+ + RLG     P K  Y  LG  D+  + ++ LAL AA Q IVLLKND   LPL
Sbjct: 330 LVRLFAARYRLGELEA-PHKDPYATLGAKDIDNTANRALALKAAAQSIVLLKNDANTLPL 388

Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
              A   LA+IGP  + ++ +   Y G   +P + L GL     V +  YA G 
Sbjct: 389 KAGA--RLAVIGPNADALAALEANYQGTSSTPVTPLLGLRQRFGVHQVSYAQGA 440



 Score =  115 bits (289), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 79/262 (30%), Positives = 133/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 647 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 703

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y  +    P    +M+        GR+YR++ G  ++ 
Sbjct: 704 GTAIARMLAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEPLFP 755

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT+++Y    AP +L+ S +L+AGS                             
Sbjct: 756 FGYGLSYTSFAY---GAP-QLS-STTLQAGST--------------------------LQ 784

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH      + ++F +D    
Sbjct: 785 VTTTVRNTGTRAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLKPGEQRTLTFTLD-ARA 842

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  G++ L VG
Sbjct: 843 LSDVDRTGQRAVEAGDYTLFVG 864


>gi|256394020|ref|YP_003115584.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
 gi|256360246|gb|ACU73743.1| glycoside hydrolase family 3 domain protein [Catenulispora
           acidiphila DSM 44928]
          Length = 1212

 Score =  308 bits (788), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 248/808 (30%), Positives = 370/808 (45%), Gaps = 135/808 (16%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN-ASAIPRLGIPAYEWWSESLHGIASNGPG 112
           +  T  S + RA  L+S +TL EK+ QL  N A AIPRLG+ +Y +WSE  HGI   G  
Sbjct: 45  YLQTRYSFAERAADLVSRMTLPEKVAQLHTNSAPAIPRLGVQSYTYWSEGQHGINLLGAD 104

Query: 113 VNFNGTVSS--VTSFPQVLVSAASFNRSLWSNIGSAVAVEAR----------AMYNLGQA 160
            N  G       TSFP  L S  S++ +L     +A++ E R          A  N+G +
Sbjct: 105 SNNGGAAGGPHATSFPTNLSSTMSWDPALVYQETTAISDEVRGELDKSLWGVAQNNIGPS 164

Query: 161 G-----LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
                 LT+WAP +N+ RDPRWGR  E  GEDP +V   A  FV  +QGE   + DG   
Sbjct: 165 ADDYGSLTYWAPTVNMDRDPRWGRTDEAFGEDPYLVGKMAGAFVAGYQGE---TIDG--- 218

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                          +     L ++A  KH    + E      R++ +A  +E D  D +
Sbjct: 219 ---------------TPTSPYLKVAATAKHFALNNNEN----DRHADSADASESDIRDYY 259

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK---ARNEWGFKGYITSDCDAVAT 332
              FRS +E    + +M SYN +NG P+    D +     A+  WGF GYITSDC AV  
Sbjct: 260 TAQFRSLVEDSHVAGLMTSYNAINGTPSP--ADTYTTDALAQRTWGFDGYITSDCGAVGD 317

Query: 333 I--------------FEYQNYTKTHEDSAAGV------------LKAGMDINC--GTCML 364
           +                  N T T  ++A GV            L+AG D NC  G   L
Sbjct: 318 VTASSSHDWAPPGWTVSVVNGTSTWTNTATGVQVPADAGGQAYALRAGTDANCTGGDATL 377

Query: 365 RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP-RKGKYGKLGPDDVCTSEHKKL 423
            + ++AI    + E  ID AL+ LF+V+++ G F  DP  K  Y ++    + + EH+ L
Sbjct: 378 GNIEAAIKAEILSEGVIDHALVQLFTVRMQTGEF--DPANKVAYTRITKAQIQSPEHQAL 435

Query: 424 ALDAARQGIVLLKND------KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK 477
           A   A   +VLLKND       K LP N  +++++ ++G L N ++   GGY+G P    
Sbjct: 436 AEKVAANSLVLLKNDPMPGSAAKVLPANPASLNNVVVVGDLANTVTL--GGYSGDPTLQV 493

Query: 478 SLLRGLEAYVSKTH-YASGCHDVPCNSDAGFHEAVRIA------KKADFVIVVAGLDLTQ 530
           + ++G+ + V   +  A+   D    S      A   A      K AD V+V  G D + 
Sbjct: 494 NAVQGITSAVKAANPNATVTFDACGTSTTATAAASCSAATQAAIKTADLVVVFVGTDGST 553

Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
             E  DR SL +PG   SL++ VA       +L +   GP+D+   + D    +I++  Y
Sbjct: 554 AGESNDRASLAMPGNYDSLISQVAALGNPRTVLSMQTDGPVDIENVKGD--FPAIVYSAY 611

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR-ADSSRQYPGRSYRFYT 649
            GE+   ALA+++FG  NP G L  TWY +      + +  +  AD+     GR+Y+++T
Sbjct: 612 NGESQGTALADVLFGKQNPSGHLDFTWYKDDSQLPSIKNYGLNPADTGGL--GRTYQYFT 669

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
           GT  Y FG+GLSYT+++Y  + A                             H D     
Sbjct: 670 GTPTYPFGYGLSYTDFAYSKVQATD---------------------------HADAQGKA 702

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT--PEKQLIGFDRVHTVAKG-SKEI 766
           T     V+  VTN G   G+ V  L+   P V  GT  P +QL GF +   +  G ++ +
Sbjct: 703 T-----VRFDVTNTGKTPGATVAQLYITPPSV-PGTQQPAEQLEGFAKTAVLKPGQTQHL 756

Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVL 794
           S  V+  +  +   ++ +  +  G + L
Sbjct: 757 SVSVNIADLATWDAQNAKNAVTDGTYTL 784


>gi|418518029|ref|ZP_13084183.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|410705279|gb|EKQ63755.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 886

 Score =  306 bits (784), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 173/449 (38%), Positives = 243/449 (54%), Gaps = 56/449 (12%)

Query: 60  SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
           S   RA +L++ ++ +EK+ Q  ++A AIPRLGIPAYEWWSE LHGIA NG         
Sbjct: 33  SPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG--------- 83

Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
              T FPQ +  AAS+N SL   +G+ V+ EARA +N            AGLT W+PNIN
Sbjct: 84  -YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNIN 142

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           IFRDPRWGRG ET GEDP +    AV F++  QGE+               R +      
Sbjct: 143 IFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQGED-----------LNHPRTI------ 185

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                     A  KH+  +   + G   R+ F+  ++  D E T+ P FR+ + +G+A  
Sbjct: 186 ----------ATPKHIAVHSGPEPG---RHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 232

Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MC+YN ++G P C    L   + R +WGFKG++ SDCDA+  + ++  +   +  S+  
Sbjct: 233 VMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAIDDMTQFHYFRPDNAGSSVA 292

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
            LKAG D+NCG    R   +AI +G+V E  +D++L+ LF+ + RLG     PRK  Y +
Sbjct: 293 ALKAGHDLNCGHA-YRELGTAIARGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYAR 350

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           LG  DV    H+ LAL AA + IVLLKND   LPL   A + LA+IGP  + ++ +   Y
Sbjct: 351 LGAKDVDNVAHRALALQAAAESIVLLKNDANTLPL--RAGTRLAVIGPNADALAALEANY 408

Query: 470 TGIPCSPKSLLRGLEAY--VSKTHYASGC 496
            G   +P + L GL       +  YA G 
Sbjct: 409 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437



 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 81/262 (30%), Positives = 132/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 644 DRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIMAAW--YPGQSG 700

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A ++ GD NPGGRLP+T+Y         +  ++ A  S    GR+YR++ G  ++ 
Sbjct: 701 GTAIARMLAGDDNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFP 752

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP   T   +L+AG   N LQ                       
Sbjct: 753 FGYGLSYTRFAY---DAPQLSTT--TLQAG---NPLQ----------------------- 781

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH  A   + ++F +D    
Sbjct: 782 VIATVRNTGARAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLAAGEQRTLTFHLD-ARA 839

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  G++ L VG
Sbjct: 840 LSDVDRSGQRAVEAGDYTLFVG 861


>gi|21232323|ref|NP_638240.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|21114093|gb|AAM42164.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
          Length = 888

 Score =  306 bits (783), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 171/445 (38%), Positives = 245/445 (55%), Gaps = 56/445 (12%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA +L++ ++ +EK+ Q  + A AIPRLGIPAYEWWSE LHGIA NG            T
Sbjct: 39  RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
            FPQ +  AAS+N  L   +G+ V+ EARA +N            AGLT W+PNINIFRD
Sbjct: 89  VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRG ET GEDP +    AV F++  QG++               R +          
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQGDD-----------LEHPRTI---------- 187

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
                 A  KH+  +   + G   R+ F+  ++ +D E T+ P FR+ + +G+A  +MC+
Sbjct: 188 ------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCA 238

Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  S+A  LKA
Sbjct: 239 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKA 298

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
           G D+NCGT   R   +AI++G+V E  +D++L+ LF+ + RLG     PRK +Y +LG  
Sbjct: 299 GHDLNCGTAY-RALGTAIERGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAK 356

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
           D+  + ++ LAL AA + IVLLKN    LPL   A + LA+IGP  + ++ +   Y G  
Sbjct: 357 DIDNAGNRALALQAAAESIVLLKNANATLPL--KAGTRLAVIGPNADALAALEANYQGTS 414

Query: 474 CSPKSLLRGLEAY--VSKTHYASGC 496
             P + L GL       +  YA G 
Sbjct: 415 SQPVTPLLGLRQRFGAQQVRYAQGA 439



 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 79/262 (30%), Positives = 131/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 646 DRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 702

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A  + GD NPGGRLP+T+Y  +    P    +M+        GR+YR++ G  ++ 
Sbjct: 703 GTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEALFP 754

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT+++Y    AP +L+ S +L+AGS                             
Sbjct: 755 FGYGLSYTSFAY---DAP-QLS-STTLQAGSP--------------------------LQ 783

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH      + ++F +D    
Sbjct: 784 VTTTVRNTGTRAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLQPGEQRTLTFTLD-ARA 841

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G R +  G++ L VG
Sbjct: 842 LSDVDRTGTRAVEAGDYRLFVG 863


>gi|188990656|ref|YP_001902666.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
 gi|167732416|emb|CAP50610.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
          Length = 888

 Score =  305 bits (782), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 171/445 (38%), Positives = 245/445 (55%), Gaps = 56/445 (12%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA +L++ ++ +EK+ Q  + A AIPRLGIPAYEWWSE LHGIA NG            T
Sbjct: 39  RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
            FPQ +  AAS+N  L   +G+ V+ EARA +N            AGLT W+PNINIFRD
Sbjct: 89  VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRG ET GEDP +    AV F++  QG++               R +          
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQGDD-----------LEHPRTI---------- 187

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
                 A  KH+  +   + G   R+ F+  ++ +D E T+ P FR+ + +G+A  +MC+
Sbjct: 188 ------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCA 238

Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  S+A  LKA
Sbjct: 239 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKA 298

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
           G D+NCGT   R   +AI++G+V E  +D++L+ LF+ + RLG     PRK +Y +LG  
Sbjct: 299 GHDLNCGTAY-RALGTAIERGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAK 356

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
           D+  + ++ LAL AA + IVLLKN    LPL   A + LA+IGP  + ++ +   Y G  
Sbjct: 357 DIDNAGNRALALQAAAESIVLLKNANATLPL--KAGTRLAVIGPNADALAALEANYQGTS 414

Query: 474 CSPKSLLRGLEAY--VSKTHYASGC 496
             P + L GL       +  YA G 
Sbjct: 415 SQPVTPLLGLRQRFGAQQVRYAQGA 439



 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 76/262 (29%), Positives = 131/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 646 DRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 702

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A  + GD NPGGRLP+T+Y  +    P    +M+        GR+YR++ G  ++ 
Sbjct: 703 GTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEALFP 754

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y+       L+++ +L+AGS                             
Sbjct: 755 FGYGLSYTRFAYET----PRLSVT-TLQAGSP--------------------------LQ 783

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G+  G  V  ++ + P   Q +P + L+GF RVH      + ++F +D    
Sbjct: 784 VTTTVRNTGERAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLQPGEQRTLTFTLD-ARA 841

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G R++  G++ L VG
Sbjct: 842 LSDVDRTGTRVVEAGDYRLFVG 863


>gi|384428895|ref|YP_005638255.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
 gi|341937998|gb|AEL08137.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
          Length = 888

 Score =  305 bits (782), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 170/445 (38%), Positives = 245/445 (55%), Gaps = 56/445 (12%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA +L++ ++ +EK+ Q  + A AIPRLGIPAYEWW+E LHGIA NG            T
Sbjct: 39  RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWNEGLHGIARNG----------YAT 88

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
            FPQ +  AAS+N  L   +G+ V+ EARA +N            AGLT W+PNINIFRD
Sbjct: 89  VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRG ET GEDP +    AV F++  QG++               R +          
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQGDD-----------LEHPRTI---------- 187

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
                 A  KH+  +   + G   R+ F+  ++ +D E T+ P FR+ + +G+A  +MC+
Sbjct: 188 ------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCA 238

Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  S+A  LKA
Sbjct: 239 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKA 298

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
           G D+NCGT   R   +AI++G+V E  +D++L+ LF+ + RLG     PRK +Y +LG  
Sbjct: 299 GHDLNCGTAY-RALGTAIERGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAK 356

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
           D+  + ++ LAL AA + IVLLKN    LPL   A + LA+IGP  + ++ +   Y G  
Sbjct: 357 DIDNAGNRALALQAAAESIVLLKNANATLPL--KASTRLAVIGPNADALAALEANYQGTS 414

Query: 474 CSPKSLLRGLEAY--VSKTHYASGC 496
             P + L GL       +  YA G 
Sbjct: 415 SQPVTPLLGLRQRFGAQQVRYAQGA 439



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/292 (27%), Positives = 139/292 (47%), Gaps = 54/292 (18%)

Query: 516 KADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
           ++D V+   GL    E E+           DR  + LP  Q +L+   A+ S +P+++VL
Sbjct: 616 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVL 674

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
             G  + +++A+  +      W  YPG++G  A+A  + GD NPGGRLP+T+Y  +    
Sbjct: 675 MSGSAVALNWAKTHADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFYRSTKDLP 732

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
           P    +M+        GR+YR++ G  ++ FG+GLSYT ++Y+     +    + +L+AG
Sbjct: 733 PYVSYDMK--------GRTYRYFKGEALFPFGYGLSYTRFAYE-----TPRLSATTLQAG 779

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
           S                             V  +V N G+  G  V  ++ + P+  Q +
Sbjct: 780 SP--------------------------LQVTTTVRNTGERAGDEVAQVYLQYPERPQ-S 812

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           P + L+GF RVH      + ++F +D    LS  ++ G R +  G++ L VG
Sbjct: 813 PLRSLVGFQRVHLQPGEQRTLTFTLD-ARALSDVDRTGTRAVEAGDYRLFVG 863


>gi|256393466|ref|YP_003115030.1| glycoside hydrolase family 3 [Catenulispora acidiphila DSM 44928]
 gi|256359692|gb|ACU73189.1| glycoside hydrolase family 3 domain protein [Catenulispora
           acidiphila DSM 44928]
          Length = 1343

 Score =  305 bits (782), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 234/776 (30%), Positives = 350/776 (45%), Gaps = 133/776 (17%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN-ASAIPRLGIPAYEWWSESLHGIASNGPG 112
           + +T  S + RA  L+S +TL EK  QL  N A AIPRLG+  Y +WSE  HG+ + G  
Sbjct: 49  YLDTHYSFAERAADLVSRMTLPEKAAQLQTNSAPAIPRLGVQEYTYWSEGQHGVNTLGAD 108

Query: 113 VNFNGTVSSV--TSFPQVLVSAASFNRSLWSNIGSAVAVEAR-----AMYNLGQ------ 159
            N       V  TSFP    +  S++ +L     +AV+ E R     +++  GQ      
Sbjct: 109 SNRGDVTGGVHATSFPVNFAATMSWDPALTYKETTAVSDEVRGFLDKSLWGTGQNNLGPS 168

Query: 160 ----AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
                 LTFWAPN+N+ RDP WGR  E+ GEDP + S  A  FV  +QG++         
Sbjct: 169 ASDYGALTFWAPNVNMDRDPLWGRTNESFGEDPYLTSTMAGAFVDGYQGQSMT------- 221

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
            G ++   LK             ++A  KH    ++E     SR++ ++  T+ +  D +
Sbjct: 222 -GQQQTPYLK-------------VAATAKHYSLNNIED----SRHTGSSDTTDANIRDYY 263

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLR----GDLFQKARNEWGFKGYITSDCDAVA 331
              F S +     S IM SYN VNG P+        +L Q     +GF GY TSDC A+ 
Sbjct: 264 TKQFASLVRDAHVSGIMTSYNAVNGTPSPADTYTVDELLQA---TYGFAGYTTSDCGAIG 320

Query: 332 TIFE------------------YQNYTKTHEDSAAG----VLKAGMDINC--GTCMLRHT 367
            ++                     N T     +AAG     ++AG  +NC  G    ++ 
Sbjct: 321 DVYGAASHGWAPPGWTSNGTSWTNNATGRQISAAAGGQAFAIRAGTQLNCAGGEMTAQNI 380

Query: 368 QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR-KGKYGKLGPDDVCTSEHKKLALD 426
            +AID G +    +D  L  LF+V++  G F  DP  K  Y K+  D + +  H+ LA  
Sbjct: 381 SAAIDLGLLSNGVVDATLTRLFTVRMETGEF--DPAGKVGYTKITKDQIESPAHQALAEQ 438

Query: 427 AARQGIVLLKN------DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLL 480
            A   IVLL+N        K LP++     S+ I+G L N ++   GGY+G P    + +
Sbjct: 439 VAANDIVLLQNGAVSGTSAKLLPVDPAKTDSVVIVGDLANKVTL--GGYSGEPTHEVNAV 496

Query: 481 RGLEAYV-----SKTHYASGCH-----DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
           +G+ A V     S T     C        P +  A    A+   K A  V+VVAG DL+ 
Sbjct: 497 QGITAAVQAANPSATVTFDACGTGTQITTPASCSAATQAAI---KSASLVLVVAGSDLSV 553

Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
             E  DR +L LPG   SL++ V+        LV+   GP D+  A+ D    +I++ GY
Sbjct: 554 ADEANDRSTLALPGNYDSLISQVSALGNPRTALVMQADGPYDIQDAQKD--FPAIVFSGY 611

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
            G++   ALA+++FG  NP G L  TWY       PM++  +    +    GR+Y+++TG
Sbjct: 612 NGQSQGTALAQVLFGQQNPAGHLDFTWYSGDSQLAPMDNYGLTPSQTGGL-GRTYQYFTG 670

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
           T  Y FG+G SY++++Y                                +V +    +  
Sbjct: 671 TPTYPFGYGQSYSSFAYS-------------------------------HVQVGPQNTNA 699

Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKV-SQGTPEKQLIGFDRVHTVAKGSKE 765
               HV   V N G V G+ V  L+A  P   +  T  +QL GF + +T+  G  +
Sbjct: 700 DGTVHVSFDVKNTGTVAGTTVAQLYAAPPGAGTNDTTREQLAGFQKTNTLKPGQSQ 755


>gi|66767544|ref|YP_242306.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|66572876|gb|AAY48286.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 888

 Score =  305 bits (781), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 171/445 (38%), Positives = 245/445 (55%), Gaps = 56/445 (12%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA +L++ ++ +EK+ Q  + A AIPRLGIPAYEWWSE LHGIA NG            T
Sbjct: 39  RAAALVAQMSREEKVAQSMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
            FPQ +  AAS+N  L   +G+ V+ EARA +N            AGLT W+PNINIFRD
Sbjct: 89  VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRG ET GEDP +    AV F++  QG++               R +          
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQGDD-----------LEHPRTI---------- 187

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
                 A  KH+  +   + G   R+ F+  ++ +D E T+ P FR+ + +G+A  +MC+
Sbjct: 188 ------ATPKHIAVHSGPEPG---RHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCA 238

Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  S+A  LKA
Sbjct: 239 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAASLKA 298

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
           G D+NCGT   R   +AI++G+V E  +D++L+ LF+ + RLG     PRK +Y +LG  
Sbjct: 299 GHDLNCGTAY-RALGTAIERGEVDEALLDQSLVRLFAARYRLGELQA-PRKDRYARLGAK 356

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
           D+  + ++ LAL AA + IVLLKN    LPL   A + LA+IGP  + ++ +   Y G  
Sbjct: 357 DIDNAGNRALALQAAAESIVLLKNANATLPL--KAGTRLAVIGPNADALAALEANYQGTS 414

Query: 474 CSPKSLLRGLEAY--VSKTHYASGC 496
             P + L GL       +  YA G 
Sbjct: 415 SQPVTPLLGLRQRFGAQQVRYAQGA 439



 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 79/262 (30%), Positives = 131/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 646 DRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKTHADAIVAAW--YPGQSG 702

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A  + GD NPGGRLP+T+Y  +    P    +M+        GR+YR++ G  ++ 
Sbjct: 703 GTAIARALAGDDNPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEALFP 754

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT+++Y    AP +L+ S +L+AGS                             
Sbjct: 755 FGYGLSYTSFAY---DAP-QLS-STTLQAGSP--------------------------LQ 783

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  V  ++ + P   Q +P + L+GF RVH      + ++F +D    
Sbjct: 784 VTTTVRNTGTRAGDEVAQVYLQYPDRPQ-SPLRSLVGFQRVHLQPGEQRTLTFTLD-ARA 841

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G R +  G++ L VG
Sbjct: 842 LSDVDRTGTRAVEAGDYRLFVG 863


>gi|443717728|gb|ELU08656.1| hypothetical protein CAPTEDRAFT_228276 [Capitella teleta]
          Length = 731

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 233/756 (30%), Positives = 348/756 (46%), Gaps = 116/756 (15%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQE----KIQQLSDNASAIPRLGIPAYEWWSESLHGIA 107
           +PF + +LS   R   L+  LT++E     + Q   +   + RLG+  Y++ +E + G+ 
Sbjct: 20  FPFEDVTLSWDKRVDDLVQRLTIEEVVNISVAQYGKSTIPVDRLGVKPYQFINECITGV- 78

Query: 108 SNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN-------LGQA 160
                        + T+FPQ +   ASF+  L  N+  A+A E R  YN        G  
Sbjct: 79  ----------RWENSTAFPQAIGLGASFSPDLAFNMSQAIARELRGFYNTEVKSQIYGHR 128

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           G+  + P INI R P WGR QET GEDP +    +V FVK  QG                
Sbjct: 129 GVNCFTPVINIMRHPLWGRNQETYGEDPWLSGQLSVGFVKGLQG---------------- 172

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
                      D    +  S  CKH   ++  +    SR+ F+A ++E+D   TF P F+
Sbjct: 173 -----------DHPRYIQASGGCKHFDVHNGPENIPVSRFGFDAKVSERDWRMTFLPQFK 221

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
           +C+E G  + IMCSYN++NGVPAC    L     R EWGF GY+ SD  A+  I  +  Y
Sbjct: 222 TCVEAGSIN-IMCSYNRINGVPACANKKLLTDILRKEWGFNGYVISDSGAIENIVYHHKY 280

Query: 340 TKTHEDSAAGVLKAGMDINC-----GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           TKT  ++AA  +KAG ++             +  +A+ +  + E+++   L      ++R
Sbjct: 281 TKTLAEAAADSVKAGCNVELTGATGSGVAYFNLLNAVKQNLISEEELRENLKKPMYSRMR 340

Query: 395 LGLFNGDP-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
            G F  DP     + K+    V + EH+ LA+ A+    VL+KN  + LPL K     LA
Sbjct: 341 QGEF--DPVDMNPFTKIDMSVVLSQEHQDLAVKASAMSFVLMKNLNRVLPLKKR-FDRLA 397

Query: 454 IIGPLVNNISQMGGGYTGIP-CSPKSL---LRGLEAYVSKTHYASGCHDVPC-NSDAGFH 508
           IIGP  +N   + G Y  IP   PK +     GL++      YASGC D  C N D    
Sbjct: 398 IIGPFADNAETLFGDY--IPNWDPKFVSTPYEGLKSLGDDVRYASGCDDPSCTNYDPKAI 455

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKR-PVILVLTG 567
           E  +  K A FV V  G+    E E  DR  L LPG Q+ ++      S+  P++LVL  
Sbjct: 456 E--KAVKGAQFVFVCLGVGSNLEREGHDRADLDLPGYQLQILKDAEFFSREAPLVLVLFN 513

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG---DFNPGGRLPMTWYPESFTK 624
            GP+D+++ +   ++  I+   YP     KAL +++     D  P  RLP TW P    +
Sbjct: 514 AGPVDLTWPKLSPEVDGIIECFYPAMGTGKALYQVVTATGDDGVPAARLPSTW-PAQLHQ 572

Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQ-VYGFGHGLSYTNYSYKFLSAPSELTISASL 682
           VP + D NM         G +YR++ G   +Y FG+GLSYT++ Y+ +S         S+
Sbjct: 573 VPSITDYNMT--------GHTYRYFDGGDPLYPFGYGLSYTSFHYQTVSVS-----PTSV 619

Query: 683 KAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
           +AG +  +  Q  +R  Y                     NA +V   ++  + A VP   
Sbjct: 620 RAGGNVTVTVQVLNRGPY---------------------NADEVTQVYMSWMEATVP--- 655

Query: 743 QGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
              P   L+GF R       S  +SF V   EQ+++
Sbjct: 656 --VPRWTLVGFKRHRHTVNQSSSLSFVVS-AEQMAV 688


>gi|325914134|ref|ZP_08176487.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325539637|gb|EGD11280.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 874

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 172/454 (37%), Positives = 247/454 (54%), Gaps = 56/454 (12%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
            + S +   RA +L++ ++  EK+ Q  ++A AIPRL IPAYEWWSE LHGIA NG    
Sbjct: 16  ADRSGTPEQRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNG---- 71

Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFW 165
                   T FPQ +  AAS+N +L   +G+ V+ EARA +N            AGLT W
Sbjct: 72  ------YATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIW 125

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRG ET GEDP +    AV F++  QG++               R + 
Sbjct: 126 SPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQGDD-----------LNHPRTI- 173

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                          A  KH+  +   + G   R+ F+  ++ +D E T+ P FR+ +  
Sbjct: 174 ---------------ATPKHIAVHSGPEPG---RHGFDVDVSPRDMEATYTPAFRAALVD 215

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           G+A  +MC+YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   + 
Sbjct: 216 GQAWSVMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNA 275

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            S+A  LKAG D+NCG    R   +AI++G+V E  +D++L+ LF+ + RLG     PRK
Sbjct: 276 GSSAAALKAGHDLNCGHAY-RELGTAIERGEVDEALLDQSLVRLFAARYRLGELEA-PRK 333

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             Y +LG  DV  + H+ LAL AA + IVLLKN    LPL   A + LA+IGP  + ++ 
Sbjct: 334 DPYARLGAKDVDNAAHRALALQAAAESIVLLKNTATTLPL--KAGTRLAVIGPNADALAA 391

Query: 465 MGGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
           +   Y G   +P + L GL  +    +  YA G 
Sbjct: 392 LEANYQGTSATPITPLLGLRQHFGAQQVRYAQGA 425



 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 87/292 (29%), Positives = 143/292 (48%), Gaps = 54/292 (18%)

Query: 516 KADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVL 565
           ++D V+   GL    E E+           DR  + LP  Q +L+   A+ S +P+++VL
Sbjct: 602 QSDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVL 660

Query: 566 TGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
             G  + +++A+A++      W  YPG++G  A+A  + GD NPGGRLP+T+Y       
Sbjct: 661 MSGSAVALNWAKANADAIVAAW--YPGQSGGTAIARALAGDDNPGGRLPVTFY------- 711

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
             +  ++ A  S    GR+YR++ G  ++ FG+GLSYT+++Y    AP   T   +L+AG
Sbjct: 712 -RSTKDLPAYVSYDMKGRTYRYFKGEPLFPFGYGLSYTSFAY---DAPRLST--RTLQAG 765

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
              N LQ                       V  +V N G   G  V  ++ + P   Q +
Sbjct: 766 ---NPLQ-----------------------VTTTVRNTGSRAGDEVAQVYLQYPDRPQ-S 798

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           P + L+GF RVH      +E++F +D    LS  ++ G+R +  G + + VG
Sbjct: 799 PLRSLVGFQRVHLKPGEQRELTFTLD-ARALSDVDRSGQRAVEAGEYRVFVG 849


>gi|167524198|ref|XP_001746435.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775197|gb|EDQ88822.1| predicted protein [Monosiga brevicollis MX1]
          Length = 834

 Score =  302 bits (774), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 235/762 (30%), Positives = 349/762 (45%), Gaps = 118/762 (15%)

Query: 46  PPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN-------ASAIPRLGIPAYEW 98
           PP    YPF N  L  + R   L+  LTL+EK+QQL          A A+ RLGI  + W
Sbjct: 31  PPE---YPFRNPDLPWAARLDDLVGRLTLEEKLQQLQHGGAAQMTPAPAVERLGIGPFVW 87

Query: 99  WSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
            SE + G+ ++G            T+FPQ L  AA+F+ +L       +A+E RA  N  
Sbjct: 88  GSECVTGLGTDG-------NDPHGTAFPQPLGMAATFDPALLKRAAGTIALELRAQRNFD 140

Query: 159 QA--------GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSD 210
           +         GL+ W+P +NI R P WGR  ET GE P++ S  A  FV+  QG + +  
Sbjct: 141 RENGVVKFHHGLSCWSPVVNINRHPLWGRNDETFGECPVLSSFMARSFVEGIQGNHTRY- 199

Query: 211 DGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQD 270
                                        +A CKHL   D+    +  RY F+A +++ D
Sbjct: 200 --------------------------YAAAAACKHL---DVYGGPDNLRYVFDADVSQAD 230

Query: 271 TEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC--LRGDLFQKARNEWGFKGYITSDCD 328
              TF   F  C   G     MCSYN + GVPAC   R   F  AR +WGF+GY+ SD  
Sbjct: 231 LTGTFLMAFEECAAAGVMG-YMCSYNSIRGVPACANYRTMTF-FAREQWGFEGYVVSDQG 288

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDI----NCGTCMLRHTQSAIDKGKVQEKDIDRA 384
           AV  I E  NYT      A   L AG D+    +       +   A+D        ID +
Sbjct: 289 AVFRITESHNYTANQTLGAVAALNAGCDMEDSDDAQHVAYYNLSLALDLKLTDMATIDAS 348

Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
           +  LF V++RLG F+  P    +  L    V +  H ++A D A   IVLLKN  + LPL
Sbjct: 349 VSRLFYVRMRLGEFD-PPENDPWRSLNMSIVSSPAHVEMARDVATASIVLLKNQNETLPL 407

Query: 445 NKNAV-SSLAIIGPLVNNISQMGGGY-----TGIPCSPKSLLRGLEAYVSKT---HYASG 495
           +  A  +S  ++GP  +N   M G Y     T +  + ++ L       S+T    Y  G
Sbjct: 408 SAAAKNASYCLLGPFADNADLMMGKYSPHGSTNVTVTYRAGLAAALQNASQTASFQYLEG 467

Query: 496 CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV-- 553
           C    C+          I +  D V++  G     E+E  DR ++  PG Q +LV +V  
Sbjct: 468 CTGPFCDGLDTAAVTTFIQQGCDTVLLAVGTSYHVESESLDRSNMSFPGAQPTLVQTVLE 527

Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
           A  +K+ ++L+++  GP+D++  E D+++++IL + Y G+    ALA+I+ G+ +P GRL
Sbjct: 528 ALGTKQRLVLLVSTAGPVDLAALEQDTRVAAILDLIYLGQTAGTALADILLGETSPSGRL 587

Query: 614 PMTWYPESFTKV-PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
           P +W P   + V P++D  M+        GR+YRF     ++ FG+GLSYT ++   L+A
Sbjct: 588 PFSW-PNKVSDVPPIDDYTMQ--------GRTYRFAQADVLFPFGYGLSYTQFNLSHLAA 638

Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
           P  L +                              C +LR  V  +VTN G + G+  +
Sbjct: 639 PYILPV------------------------------CQALRLSV--NVTNTGRLSGAIPL 666

Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
            ++   P  + G P +QL    RV   A  SK +   + P E
Sbjct: 667 QVYVEWPN-AVGGPIRQLATTTRVFVDAASSKTVQLSIRPRE 707


>gi|389736853|ref|ZP_10190363.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
 gi|388438821|gb|EIL95541.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
          Length = 868

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 170/445 (38%), Positives = 242/445 (54%), Gaps = 55/445 (12%)

Query: 63  TRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSV 122
           +RA +L++ +TL EK+ Q+ ++A AIPRLG+PAY+WWSE LHGIA NG            
Sbjct: 31  SRAVALVAKMTLPEKVAQMQNDAPAIPRLGVPAYDWWSEGLHGIARNG----------YA 80

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL---GQA-----GLTFWAPNINIFRD 174
           T FPQ +  AAS++ SL   +G+ ++ EARA +N    G+A     GLT W+PNINIFRD
Sbjct: 81  TVFPQAIGLAASWDTSLLHAVGTVISTEARAKFNASGSGRAHGLFQGLTLWSPNINIFRD 140

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRGQET GEDP +    AV FV+  QG++ +             R +          
Sbjct: 141 PRWGRGQETYGEDPYLTGQLAVAFVRGIQGDDPQ-----------HPRAI---------- 179

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
                 A  KH +A+   + G   R SF+  ++  D EDT+ P FR+ +  G A  +MC+
Sbjct: 180 ------ATPKHFVAHSGPEAG---RDSFDVDVSPHDLEDTYLPAFRTAVVDGHAGSVMCA 230

Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN ++G PAC    L   + R +WGF GY+ SDCDAV  I  Y  +      ++   ++A
Sbjct: 231 YNALHGTPACANAGLLDTRLRKDWGFAGYVVSDCDAVGDIASYHYFKPDDVQASVAAVQA 290

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
           G D++CG       Q A+ +G + E  +D +L+ LF+ + RLG   G      Y ++G D
Sbjct: 291 GTDLDCGHTYASLAQ-AVRQGDIAESALDASLVRLFTARYRLGEL-GSRGNDPYARIGAD 348

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            + +  H+KLAL AA + +VLLKN    LPL  +A   LA+IGP  + +  +   Y G  
Sbjct: 349 QIDSPAHRKLALQAALESLVLLKNAHSTLPL--HAGMRLAVIGPDADALETLEANYHGTA 406

Query: 474 CSPKSLLRGLEAYVSKTH--YASGC 496
             P + L+GL A     H  YA G 
Sbjct: 407 RHPVTPLQGLRARFGADHVAYAQGA 431



 Score =  123 bits (308), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 92/296 (31%), Positives = 138/296 (46%), Gaps = 54/296 (18%)

Query: 512 RIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPV 561
           R+   AD V+   GL    E E            DR  + LP  Q +L+   AR S +P+
Sbjct: 593 RVLHDADAVVAFIGLSPDVEGEQLRIDVPGFDGGDRTDIGLPAPQRALLER-ARASGKPL 651

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           I+VL  G  + + +A+      +IL   YPG+AG  A+A+++ GD+NPGGRLP+T+Y  +
Sbjct: 652 IVVLLSGSAVALDWAQ--QHADAILAAWYPGQAGGTAIAQVLAGDYNPGGRLPVTFYRST 709

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
               P     M+        GR+YR++ G  +Y FG+GLSYT ++Y   +AP+    +A+
Sbjct: 710 RDLPPYVSYAMQ--------GRTYRYFDGRPLYPFGYGLSYTRFTY---AAPT--LSAAT 756

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
           LKAG                              V   V NAG   G  VV ++   P  
Sbjct: 757 LKAGG--------------------------TLQVSAEVRNAGQRAGDEVVQVYLDTPP- 789

Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           S   P   L+GF R+H  A   + + F + P  QLS  +  G R +  G + + +G
Sbjct: 790 SPLAPRHALVGFRRIHLAAGEQRLVRFTLAP-RQLSSVDAAGARAVEPGQYRVFIG 844


>gi|323451996|gb|EGB07871.1| hypothetical protein AURANDRAFT_71699 [Aureococcus anophagefferens]
          Length = 1202

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 235/694 (33%), Positives = 330/694 (47%), Gaps = 104/694 (14%)

Query: 47  PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
           P   +YP+C+ +L I  R   L +  T+ E I Q+   A+A+PRLG+PA  +  E+LHG+
Sbjct: 336 PSAAAYPYCDRALPIRARVADLAARFTVNETISQMGTMAAAVPRLGLPALNYGGEALHGV 395

Query: 107 ASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ------- 159
            S              T FP      ASF+R LW  +G+A  +EARA++   Q       
Sbjct: 396 WST------CAAGRCPTQFPAPHAMGASFDRDLWRAVGAASGLEARALFRWNQRHNASDC 449

Query: 160 -------AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDG 212
                   GLTF+APN+N+ RDPRWGR +E P EDP++   Y  EFV+ FQG+       
Sbjct: 450 ARSLEGCLGLTFYAPNVNLARDPRWGRIEEVPSEDPLLNGVYGAEFVRGFQGD------- 502

Query: 213 GIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLE---------KWGNFS----- 258
                                G   + +A  KH   Y+LE          W   +     
Sbjct: 503 ---------------------GAYRVANAVVKHFAVYNLEVDVEDTPPADWCGSAACAPP 541

Query: 259 --RYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RN 315
             R+SF+A ++ +D E+T+  PF +      A+  MCSYN VNG PAC  G L + A R 
Sbjct: 542 NDRHSFDARVSPRDFEETYVGPFVA-PVAAGAAAAMCSYNAVNGEPACTDGALLRGALRG 600

Query: 316 EWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGK 375
              F G + +DC A+           T  ++AA  + AG+D NCG  +      A+  G 
Sbjct: 601 ALNFTGVLATDCGALEDAVARHKRYATEAEAAAAAIAAGVDSNCGKVLTSALPEALAAGL 660

Query: 376 VQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLL 435
           V+   +   L  L   +LRLGL +         +   D V +  H+ LAL AAR+G+VLL
Sbjct: 661 VRPDALRPPLERLLEARLRLGLLDDWDADAPVPRPDVDAVDSPAHRALALRAAREGLVLL 720

Query: 436 KNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG----LEAYV--SK 489
           +N  + LPL+     +LA+IGP  N    +  GY G P  P  LLR     LEA     K
Sbjct: 721 QNPNQILPLDGR--GTLAVIGPNANASMNLLSGYHGTP--PPDLLRSPLQELEARWRGGK 776

Query: 490 THYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL------------DLT----QETE 533
             YA GC +    + A   EAV +AK AD V++  GL            D T     E E
Sbjct: 777 VVYAVGC-NASGAATAALDEAVDLAKTADVVVLGLGLCGDNYGGGPPKEDATCFSIDEAE 835

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILV-LTGGGPLDVSFAEADSQISSILWIGYPG 592
             DR SL LPG Q +L + +    K   + V L   G +D SFA+     +++L  GY G
Sbjct: 836 SVDRTSLKLPGAQEALFSKIWALGKPVAVAVFLVSAGAVDASFAK---DKAALLLAGYGG 892

Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ 652
           E G  A+A+ + G +NPGG L  T  P++    P  DM MR   S   PGR+YRF    +
Sbjct: 893 EFGGVAVADALLGAYNPGGALTATMLPDAGLP-PFRDMAMR--PSAASPGRTYRFLDERR 949

Query: 653 V---YGFGHGLSYTNYSYKFLSAPSELTISASLK 683
           V   + FG GLSYT ++   L+ P+ +   A+ +
Sbjct: 950 VAPLWRFGFGLSYTAFAVS-LAGPTRVPRRAATR 982


>gi|116621778|ref|YP_823934.1| glycoside hydrolase family 3 protein [Candidatus Solibacter
           usitatus Ellin6076]
 gi|116224940|gb|ABJ83649.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 850

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 173/457 (37%), Positives = 253/457 (55%), Gaps = 56/457 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           PF +  LS   RA  L++ +TL EK+ Q+ ++A AIPRLGIPAY+WW+E+LHG+A  G  
Sbjct: 25  PFMDPDLSAERRAADLVARMTLDEKVLQMQNSAPAIPRLGIPAYDWWNEALHGVARAG-- 82

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTF 164
                     T FPQ +  AA+++ +L   I   ++ EARA YN            GLTF
Sbjct: 83  --------LATVFPQAIGLAATWDATLMHRIAETISTEARAKYNEAIRNDDHSRYRGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + S  AV F+K  QGE+                  
Sbjct: 135 WSPNINIFRDPRWGRGQETYGEDPFLTSRMAVAFIKGMQGED------------------ 176

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                         + A  KH   Y +      SR+ F+   + +D  DT+ P FR+ I 
Sbjct: 177 ---------PHYYKVIATAKH---YAVHSGPESSRHQFDVKPSPRDLADTYLPAFRASIV 224

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
           + +A  +MC+YN+V+G+PAC   DL +K  R EWGF+G++ SDC AV+ IF   +Y    
Sbjct: 225 EARADSLMCAYNRVDGIPACASTDLLEKRLRGEWGFQGFVVSDCGAVSDIFRGHHYQPDA 284

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
             ++A  +KAG D+ CG    R    A+  G + E +I+R+L  LF  + +LG+F+  P 
Sbjct: 285 ASASAVAVKAGTDLTCGN-EYRALVDAVKTGLITEPEINRSLERLFVARFKLGMFD-PPE 342

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
           +  +  +   +V ++ H+K+AL+AAR+ IVLLKND   LPL K+++  +A+IGP  ++  
Sbjct: 343 RVPFSNIPYSEVDSAGHRKIALEAARKSIVLLKNDGT-LPL-KSSIKKIAVIGPAADDAE 400

Query: 464 QMGGGYTGIPCSPKSLLRGLE---AYVSKTHYASGCH 497
            + G Y G      + L G+E   A  ++  YA G +
Sbjct: 401 ALLGNYNGFSSLQVTPLAGIEHQWAGKAEVRYALGAN 437



 Score = 95.1 bits (235), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 78/282 (27%), Positives = 120/282 (42%), Gaps = 60/282 (21%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           A+     AD  +   GL+ + E E+           DR +L LP  Q  L+ +   T K 
Sbjct: 597 AIEAVSNADVTLAFVGLNPSLEGEEMPVSVPGFQGGDRTNLELPEPQEKLIEAAIATGK- 655

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
           PV++VL  G  + ++FA      S++L   Y GE    A+A+ + G  NP GRLP+T+Y 
Sbjct: 656 PVVVVLASGSAVAMNFAA--QHASALLETWYNGEETGTAIADTLAGINNPSGRLPVTFYR 713

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
                 P  +  M+        GR+YR++ G  +Y FG GLSY+ + Y  L         
Sbjct: 714 SVDQLPPFEEYAMK--------GRTYRYFNGDALYSFGFGLSYSKFQYSALK-------- 757

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            + +AGS   +  +                          V NA  ++G  VV L+    
Sbjct: 758 -TRRAGSGTIVASR--------------------------VRNASSIEGDEVVQLYVNG- 789

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
             + G P + L GF R+H     S+E+ F   P  Q  + N+
Sbjct: 790 SGADGDPIRSLRGFQRIHLRPGESREVHF---PLGQEDLTNR 828


>gi|389737578|ref|ZP_10190998.1| beta-glucosidase [Rhodanobacter sp. 115]
 gi|388434298|gb|EIL91245.1| beta-glucosidase [Rhodanobacter sp. 115]
          Length = 898

 Score =  299 bits (765), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 170/457 (37%), Positives = 249/457 (54%), Gaps = 55/457 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +T+ S   RA  L+S +TL EK+ Q+ ++A AIPRLG+PAY+WW+E+LHG+A  G   
Sbjct: 43  YLDTAHSFQERAADLVSRMTLAEKVAQMQNSAPAIPRLGVPAYDWWNEALHGVARAG--- 99

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTFW 165
                    T FPQ +  AA+F+ +L  +  +A++ EARA YN  Q         GLTFW
Sbjct: 100 -------EATVFPQAIGLAATFDPALLHHEATAISDEARAKYNDFQRRGMRGRYEGLTFW 152

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PN NIFRDPRWGRGQET GEDP + S   V FV+  +G++                   
Sbjct: 153 SPNTNIFRDPRWGRGQETYGEDPYLTSRMGVAFVRGLEGDDPTYQK-------------- 198

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                        L A  KH   + +       R+ F+   +E+D  +T+ P F++ ++Q
Sbjct: 199 -------------LDATAKH---FAVHSGPESERHRFDVHPSERDLHETYLPAFQALVQQ 242

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           G    +M +YN+V+GVPA     L Q   R +WGFKGY+ SDCDAVA I+++     T E
Sbjct: 243 GGVDAVMGAYNRVDGVPATASHRLLQDILRRDWGFKGYVVSDCDAVADIYQFHKVVPTAE 302

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  +  G D+NCGT      + A+  G V E  ID A+  L   + RLG+F+  P +
Sbjct: 303 QAAALAVNNGDDLNCGTTYATLVK-AVHDGLVNEHTIDTAVTRLMLARFRLGMFD-PPGR 360

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             +  L    V + +H  LAL  A++ +VLLKND   LPL+ N V  +A+IGP  +N++ 
Sbjct: 361 VPWSTLPMSVVQSPQHDALALRTAQESMVLLKND-GLLPLSHN-VRRIAVIGPTADNVTA 418

Query: 465 MGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDV 499
           + G Y G P +P ++L+G+   V  ++  Y  G   V
Sbjct: 419 LLGNYHGTPKAPVTILQGIREAVPNAQVTYVQGTELV 455



 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 96/337 (28%), Positives = 145/337 (43%), Gaps = 62/337 (18%)

Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
           L+  +AY  +  Y    HD           + + F  A+  A+ AD VI   GL    E 
Sbjct: 589 LKAGKAYAIRLDYLRATHDAGVRLAWRMPGAKSPFEAALDAARHADVVIFAGGLSSDLEG 648

Query: 533 EDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           E+           DR +L LP  Q  L+ ++  T K PV+LVLT G  L + +A+    +
Sbjct: 649 EEMPVDYPGFAGGDRTTLALPATQRKLLQALQVTGK-PVVLVLTTGSALAIDWAK--QHL 705

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
            +IL   YPG+ G  A+A+ +FG+ +P GRLP+T+Y  +    P +D  M+        G
Sbjct: 706 PAILLAWYPGQDGGHAVADALFGNVDPAGRLPVTFYKSARQLPPFDDYAMK--------G 757

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
           R+YR++TG  ++ FG GLSYT ++Y  L                                
Sbjct: 758 RTYRYFTGQPLFPFGFGLSYTRFAYSDL-------------------------------Q 786

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           +D  T   S R  + + V N G   G  VV L+ R  +       K L GF R+      
Sbjct: 787 LDRDTLGPSDRMRISLRVKNTGQRAGDEVVQLYLRPLRAPHARAIKSLRGFQRISLKPGE 846

Query: 763 SKEISFGVDPCEQLSIAN--KHGRRILPLGNHVLMVG 797
            + +SF + P   L   +   H   + P G + + VG
Sbjct: 847 ERSVSFDISPQTDLKYYDVAHHAYAVAP-GRYQVQVG 882


>gi|333377782|ref|ZP_08469515.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
           22836]
 gi|332883802|gb|EGK04082.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
           22836]
          Length = 727

 Score =  297 bits (761), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 229/752 (30%), Positives = 343/752 (45%), Gaps = 127/752 (16%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF NTSLS   R  +L+S++T+ EKI  LS N   +PRLGI      SE LHG+A  GP
Sbjct: 24  YPFQNTSLSDEKRLDNLLSIMTIDEKINALSTNL-GVPRLGIRN-TGHSEGLHGMALGGP 81

Query: 112 G-------VNFNGT--VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---LGQ 159
           G       VN+     V   T+FPQ      +++  L   +    A E R         +
Sbjct: 82  GNWGGFKMVNYQRVPDVYPTTTFPQAYGLGETWDTELIKKVADIEATEIRYYTQNERYTK 141

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN---WKSDDGGIGF 216
            GL   APN ++ RDPRWGR +E+ GEDP +VS  AV F+K  QGEN   WKS       
Sbjct: 142 GGLVMRAPNADLARDPRWGRTEESFGEDPFLVSEMAVAFIKGLQGENPRYWKS------- 194

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                                  ++  KH +A   E   + +  +F+  +  +     + 
Sbjct: 195 -----------------------ASLMKHFLANSNEDGRDSTSSNFDNRLFHE----YYS 227

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEY 336
            PFR  IE+G +   M +YN  N +P  +   + +K R +W FKG I +D  A+  + + 
Sbjct: 228 YPFRKGIEKGGSQAFMAAYNSWNEIPMTIH-PILKKIRKDWNFKGIICTDGGALDLLIKA 286

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
                TH + +A ++KAG+          +   A++KG + E +ID+A+   F + L+LG
Sbjct: 287 HKTFPTHTEGSAAIVKAGVG-QFLDNFRPYIYQALEKGMLTEAEIDKAIRGNFYIALKLG 345

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR----QGIVLLKNDKKFLPLNKNAVSSL 452
           L +GD  K  Y  +G  D  +    K   D  R    + +VLLKN+KK LPLNK  +  +
Sbjct: 346 LLDGDQTKLPYAHIGVTDTVSVWRNKEIQDFVRLVTAKSVVLLKNEKKLLPLNKGNIKRI 405

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVR 512
           A+IGP  N +  +   Y+G P    S+L+G++  V          +V   S     +A  
Sbjct: 406 AVIGPRANEV--LLDWYSGTPPYTVSILQGIKNAVGNN------VEVIYESSNEIDKAYL 457

Query: 513 IAKKADFVIVVAGLDLT---------------QETEDRDRVSLLLPGQQMSLVTSVARTS 557
            A+KAD  IV  G  +                +E  DR  +SL    +Q  LV  V + +
Sbjct: 458 AAQKADIAIVCVGNHVYGTDPKWKYSPVPSDGREAVDRKALSL----EQEDLVKIVHKAN 513

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
               ++VL    P  +++++ +  I +IL I    +     LA++IFG++NP GR   TW
Sbjct: 514 PN-TVMVLVSSFPFAINWSQEN--IPAILHITNNSQELGNGLADVIFGNYNPAGRTNQTW 570

Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
                   PM D ++R        GR+Y +     +Y FG+GLSYTN++Y      S++ 
Sbjct: 571 VKSIADLPPMMDYDIRN-------GRTYMYAKEKPLYPFGYGLSYTNFTY------SDMA 617

Query: 678 ISAS-LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
           +S+S L  G +                            V ++V N GD+DG  V  L+ 
Sbjct: 618 LSSSALSKGKN--------------------------LKVSVNVKNTGDMDGEEVAQLYV 651

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
             P+     P KQL GFDR+      SK   F
Sbjct: 652 SFPQSKVVRPIKQLKGFDRISIKKGESKTFEF 683


>gi|224068504|ref|XP_002302759.1| predicted protein [Populus trichocarpa]
 gi|222844485|gb|EEE82032.1| predicted protein [Populus trichocarpa]
          Length = 273

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 136/273 (49%), Positives = 174/273 (63%), Gaps = 29/273 (10%)

Query: 42  FPCKPPHFDS--YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWW 99
           F C P    S  +PFC   L I +R   LI  +TLQEK+  L ++A+A+PRLGI  YEWW
Sbjct: 27  FACDPEDGTSRNFPFCQVKLPIQSRVSDLIGRMTLQEKVGLLVNDAAAVPRLGIKGYEWW 86

Query: 100 SESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ 159
           SE+LHG+++ GPG  F G     TSFPQV+ +AASFN +LW  IG  V+ EARAM+N G 
Sbjct: 87  SEALHGVSNVGPGTQFGGAFPGATSFPQVITTAASFNATLWEAIGRVVSDEARAMFNGGV 146

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
           AGLT+W+PN+NIFRDPRWGRGQETPGEDP+V   YA  +V+  QG +             
Sbjct: 147 AGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGND------------- 193

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
                         GD L ++ACCKH  AYDL+ W    R+ FNA +++QD EDTF  PF
Sbjct: 194 --------------GDRLKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDMEDTFDVPF 239

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
           R C+++GK + +MCSYNQVNG+P C    L +K
Sbjct: 240 RMCVKEGKVASVMCSYNQVNGIPTCADPKLLKK 272


>gi|380512525|ref|ZP_09855932.1| beta-glucosidase [Xanthomonas sacchari NCPPB 4393]
          Length = 885

 Score =  296 bits (757), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 169/442 (38%), Positives = 237/442 (53%), Gaps = 58/442 (13%)

Query: 68  LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQ 127
           L++ +T  EKI Q  + A AIPRLG+PAYEWWSE LHGIA NG            T FPQ
Sbjct: 40  LVAKMTRAEKIAQAMNAAPAIPRLGVPAYEWWSEGLHGIARNG----------EATVFPQ 89

Query: 128 VLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRDPRWG 178
            +  AA++N  L  ++G+  + EARA +NL           AGLT W+PNINIFRDPRWG
Sbjct: 90  AIGLAATWNPELLHDVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWG 149

Query: 179 RGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELM 238
           RG ET GEDP +    AV F+   QG++               R +              
Sbjct: 150 RGMETYGEDPYLTGRLAVGFIHGLQGDDPA-----------HPRTI-------------- 184

Query: 239 LSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQV 298
             A  KHL  +   + G   R+ F+  ++  D E T+ P FR+ I  G+A  +MC+YN +
Sbjct: 185 --ATPKHLAVHSGPEPG---RHGFDVDVSPHDFEATYSPAFRAAIVDGQAGSVMCAYNSL 239

Query: 299 NGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
           +G PAC    L   + R +WGFKG++ SDCDA+  + ++  Y   +  S+A  LKAG D+
Sbjct: 240 HGTPACAADWLIDGRVRGDWGFKGFVVSDCDAIDDMTQFHYYRPDNAGSSAAALKAGHDL 299

Query: 358 NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVC 416
           NCGT   R    A D+G+  E  +DR+L+ LF+ + RLG     PR+   Y +LG  D+ 
Sbjct: 300 NCGTAY-RELGIAFDRGEADEALLDRSLVRLFAARYRLGELQ--PRRNDPYARLGARDID 356

Query: 417 TSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSP 476
           ++ H+ LAL AA+Q +VLLKN    LPL       LA++GP  + ++ +   Y G    P
Sbjct: 357 SAAHRALALQAAQQSLVLLKNANATLPLRPGL--RLAVLGPNADALAALEANYQGTSVQP 414

Query: 477 KSLLRGLEAY--VSKTHYASGC 496
            + L+GL      ++  YA G 
Sbjct: 415 VTPLQGLRTRFGAAQVAYAQGA 436



 Score =  124 bits (310), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 82/262 (31%), Positives = 133/262 (50%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  L LP  Q +L+   A+ S +P+++VL  G  + +++AE  +      W  YPG++G
Sbjct: 643 DRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAEQHADAIIAAW--YPGQSG 699

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A+ + GD NPGGRLP+T+Y  +    P    +M+        GR+YR++ G  ++ 
Sbjct: 700 GTAIAQALAGDINPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEPLFP 751

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y    AP   T   +L+AG                              
Sbjct: 752 FGYGLSYTQFAY---DAPQLSTT--TLQAGQP--------------------------LQ 780

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  VV ++ + P+ +Q +P + L+GF RVH     ++ +SF +D   Q
Sbjct: 781 VSTTVRNTGARAGDEVVQVYLQYPQRAQ-SPLRSLVGFQRVHLQPGEARTLSFALD-ARQ 838

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  G++ L VG
Sbjct: 839 LSDVDRSGQRAVEAGDYRLFVG 860


>gi|285018984|ref|YP_003376695.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
 gi|283474202|emb|CBA16703.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
           PC73]
          Length = 904

 Score =  295 bits (755), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 175/446 (39%), Positives = 241/446 (54%), Gaps = 58/446 (13%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA +L++ +T  EKI Q  ++A AIPRLGIPAYEWWSE LHGIA NG            T
Sbjct: 55  RATALVAKMTRAEKIAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------EAT 104

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
            FPQ +  AAS+N  L   +G+  + EARA +NL            GLT W+PNINIFRD
Sbjct: 105 VFPQAIGLAASWNTDLLHAVGTVTSTEARAKFNLAGGPGKNHARYGGLTIWSPNINIFRD 164

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRG ET GEDP +    AV F+   QG++               R +          
Sbjct: 165 PRWGRGMETYGEDPYLTGQLAVGFIHGLQGDDPT-----------HPRTI---------- 203

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
                 A  KHL  +   + G   R+ F+  ++  D E T+ P FR+ I +G A  +MC+
Sbjct: 204 ------ATPKHLAVHSGPESG---RHGFDVDVSPHDFEATYSPAFRAAIVEGHAGSVMCA 254

Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN ++G+PAC    L   + R  WGFKG++ SDCDA+  + ++  Y   +  SAA  LKA
Sbjct: 255 YNALHGIPACAADWLIDGRVRGNWGFKGFVVSDCDAIDDMTQFHYYRADNAGSAAAALKA 314

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR-KGKYGKLGP 412
           G D+NCG    R   +A+D+G+ +E  +DR+L+ LF+ + RLG     PR K  Y +LG 
Sbjct: 315 GHDLNCGYAY-RDLGTALDRGEAEEAMLDRSLVRLFAARYRLGELQ--PRSKDPYARLGA 371

Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
            D+ +  H+ LAL AA+Q +VLL+N    LPL       LA+IGP  + ++ +   Y G 
Sbjct: 372 KDIDSPTHRALALQAAQQSLVLLQNRNDTLPLRPGL--RLAVIGPNADALAALEANYQGT 429

Query: 473 PCSPKSLLRGLEAYVSKT--HYASGC 496
             +P + L+GL A    T  HY  G 
Sbjct: 430 SVAPVTPLQGLRARFGTTQVHYTQGA 455



 Score =  122 bits (307), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 84/262 (32%), Positives = 139/262 (53%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  L LP  Q +L+   A+ S +P+I+VL  G  + +++A+      +IL   YPG++G
Sbjct: 662 DRNDLSLPAAQQALLER-AKASGKPLIVVLMSGSAVALNWAK--QHADAILAAWYPGQSG 718

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A+ + GD NPGGRLP+T+Y  +    P    +M+        GR+YR++ G  ++ 
Sbjct: 719 GTAIAQALAGDINPGGRLPVTFYRSTKDLPPYVSYDMK--------GRTYRYFKGEALFP 770

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT+++Y   +AP +L+ S +L+AG                             H
Sbjct: 771 FGYGLSYTHFAY---TAP-QLS-STTLQAGDT--------------------------LH 799

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V  +V N G   G  VV ++ + P  +Q +P + L+GF RV      ++ +SF ++P  Q
Sbjct: 800 VTTTVRNTGARAGDEVVQVYLQYPPRAQ-SPLRALVGFQRVSLQPGEARTLSFALEP-RQ 857

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  G++ L VG
Sbjct: 858 LSDVDRSGQRAVEAGDYRLFVG 879


>gi|361127339|gb|EHK99311.1| putative exo-1,4-beta-xylosidase bxlB [Glarea lozoyensis 74030]
          Length = 569

 Score =  294 bits (753), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 181/564 (32%), Positives = 276/564 (48%), Gaps = 75/564 (13%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+T+   + RA +L+  +   EK+Q +   ++ + RLG+P Y WWSE+LHG+A   PG+ 
Sbjct: 65  CDTTAPPADRAAALVKAMQSSEKLQNIISKSAGVSRLGLPPYNWWSEALHGVA-GAPGIQ 123

Query: 115 FNGTV--SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F+ +   +  TS P  ++ AA+F+  L   +G+ +  EARA  N   +G+ FW PNIN F
Sbjct: 124 FSSSSPWNYATSLPMPILMAAAFDDDLIEKVGTLIGTEARAFGNGNHSGIDFWTPNINPF 183

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRG ETPGED + +  Y    ++  +G               ++R++        
Sbjct: 184 KDPRWGRGSETPGEDTLRLKGYVAALLRGLEGNK------------AQRRII-------- 223

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                   A CKH  A DLE W   +R+ F+A I+ QD  + +  PF+ C    K    M
Sbjct: 224 --------ATCKHYAANDLESWNGVTRHDFDAKISMQDLAEYYLQPFQQCARDSKVGSFM 275

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAA 348
           CSYN VNGVPAC    L Q   R+ W +     Y+TSDC+AV  I    +Y  T+    A
Sbjct: 276 CSYNSVNGVPACANKYLLQTILRDHWNWTSENQYVTSDCEAVQDISLNHHYASTNAAGTA 335

Query: 349 GVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
               AG D +C                                    G F+G   K  Y 
Sbjct: 336 LAFNAGTDSSC----------------------------------EAGYFDGS--KALYS 359

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            LG  DV T + ++LAL A   GIV+LKND   LPL  ++ S +A+IG   ++ S++ GG
Sbjct: 360 SLGWSDVNTPQAQQLALQATVDGIVMLKNDGT-LPLKLDSKSKVAMIGFWASDSSKLQGG 418

Query: 469 YTGIPCSPKSLLRGLEAYVSKTHYASG-CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
           Y+G     ++ +   +      + A+G        +D     A+  A K+D+++   GLD
Sbjct: 419 YSGKAPYLRTPVYAAQQLGFTPNVATGPVQQSASATDNWTTNALAAASKSDYILYFGGLD 478

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            +   E  DR SL  P  Q++L+  ++   K P+I++   G  +D +    +  +SSILW
Sbjct: 479 TSAAAEGVDRTSLEWPSAQLALIKKLSALGK-PLIIIQE-GDQMDNTPLLTNKGVSSILW 536

Query: 588 IGYPGEAGAKALAEIIFGDFNPGG 611
             +PG+ G  A+ +II G  +P G
Sbjct: 537 ASWPGQDGGPAVMQIISGAKSPAG 560


>gi|256393789|ref|YP_003115353.1| beta-glucosidase [Catenulispora acidiphila DSM 44928]
 gi|256360015|gb|ACU73512.1| Beta-glucosidase [Catenulispora acidiphila DSM 44928]
          Length = 1548

 Score =  292 bits (748), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 232/799 (29%), Positives = 357/799 (44%), Gaps = 153/799 (19%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNAS-AIPRLGIPAYEWWSESLHGIASNGPG 112
           + +T  S   RA  L+S +TL EK++QLS N+  AIPRLG+  Y +WSE  HG+ + G  
Sbjct: 64  YLDTHYSAQARAADLVSRMTLPEKVEQLSTNSGPAIPRLGVQQYTYWSEGQHGVNTLGAN 123

Query: 113 VNFNGTVSSV--TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY----------NLGQA 160
            +  G   +V  TSFP    S  S++ SL     +A++ EAR             NLG +
Sbjct: 124 QDNGGNGGAVHSTSFPTNFASTMSWDPSLIYQETTAISDEARGFLDKSLFGVNQNNLGPS 183

Query: 161 G-----LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
                 LTFWAP +N+ RDPRWGR  E  GEDP + S  A  FV  ++G           
Sbjct: 184 AADYGSLTFWAPTVNMDRDPRWGRTDEAFGEDPYLTSTMAGAFVNGYEGNTPTG------ 237

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
                         +S  G  L ++A  KH    D+E+     R   ++ +++ D  D +
Sbjct: 238 --------------QSKTG-TLKVAATAKHYALNDVEQ----DRTGISSNVSDTDLHDYY 278

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITSDCDAVAT 332
              F S IE    S +M SYN +NG P+    D +   Q A+ ++GF GY+TSDC A+ T
Sbjct: 279 TKQFASLIENAHVSGLMTSYNAINGTPSVA--DTYTANQLAQRQFGFNGYVTSDCGAIGT 336

Query: 333 IFEY----------------QNYTKTHEDSAAGV------------LKAGMDINC--GTC 362
            ++                 ++ T T  ++A G             L+AG D+NC  G  
Sbjct: 337 AYQSFPSGHDWAPPGWTTDGKSSTGTWTNTATGATVPAQAGGQAYALRAGTDLNCAGGEN 396

Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR-KGKYGKLGPDDVCTSEHK 421
                 +AI  G + E  ID AL+ +F+V++  G F  DP     Y  +    + +  H+
Sbjct: 397 TYAQITAAISAGVLSEGVIDNALVKIFTVRVETGEF--DPAGSNPYTGITKAQIQSPAHQ 454

Query: 422 KLALDAARQGIVLLKND-------------------------KKFLPLNKNAVSSLAIIG 456
            LA   A   +VLLKN                          K  LPL+  A + + I+G
Sbjct: 455 ALATKVADNSLVLLKNQPPAASGTSTTPPAASSAASSAAAAAKPLLPLSAAATAKIVIVG 514

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK-------THYASGCHDVPCNSDAGFHE 509
            + N ++   G Y+  P    S ++G+ A V K       T  A G       + +   +
Sbjct: 515 DMANAVTL--GNYSSDPALKVSPVQGITAAVRKANPGASVTFDACGTSTTASAAASCSAQ 572

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
            +     AD VIV  G +     E +DR S+ +PG   SL++ VA      ++L +  GG
Sbjct: 573 TLADVAGADAVIVFVGTNQQIADEGKDRTSIAMPGNYDSLISQVAAVGNPRMVLAVQSGG 632

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
           P+ +   + D   +SI++ G+ GE+   ALA+++FG  NP G L  TWY +      M++
Sbjct: 633 PVRIDDVQKD--FASIVFSGFNGESQGTALADVLFGAQNPDGHLDFTWYADDSQLPAMSN 690

Query: 630 MNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
             +    +    GR+Y ++TGT  Y FG+GLSY+ +S+  + A              +  
Sbjct: 691 YGLTPAQTGGL-GRTYMYFTGTPTYPFGYGLSYSTFSFSGVHA--------------EGR 735

Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS---QGTP 746
            +   GS+                  V ++V N G   GS V  L+A+ PK +   Q  P
Sbjct: 736 SVDANGSQ-----------------SVSVTVKNTGKTAGSTVAQLYAQ-PKFTVAGQTFP 777

Query: 747 EKQLIGFDRVHTVAKGSKE 765
            +QL+GF +   +  G  +
Sbjct: 778 NEQLVGFAKSKVLKPGESQ 796


>gi|255545660|ref|XP_002513890.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223546976|gb|EEF48473.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 336

 Score =  292 bits (748), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 156/344 (45%), Positives = 209/344 (60%), Gaps = 19/344 (5%)

Query: 467 GGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
           G Y G PC   + L+GL+ Y+  T Y  GC+ V C S A   EAV+IA++ D V++V  L
Sbjct: 5   GNYAGPPCKTVTPLQGLQNYIKNTKYHRGCNTVAC-SKATIREAVQIAREVDQVVLVMEL 63

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
           D TQE E  D ++L LPG Q  L+ SVAR + +PV+LVL  GG +DVSFA  + +I SIL
Sbjct: 64  DQTQEAERIDLLNLRLPGNQQKLIISVARAANKPVVLVLICGGLVDVSFAITEPKIGSIL 123

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
           W GYPGEAG  ALAEIIFGD NPGG+LP+TWYP+ +TK+PM D+ MR   +  YPGRSYR
Sbjct: 124 WAGYPGEAGGTALAEIIFGDHNPGGKLPVTWYPQQYTKIPMTDVRMRPQIASGYPGRSYR 183

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSE---LTISASLKA--GSDKNILQQTGSRLDYV 701
           FY G +V+ FG+GLSY+N SY+  S P +   L   +S+K    S   ++ + G  L   
Sbjct: 184 FYEGKKVFEFGYGLSYSNCSYEIASIPQDKIFLRSPSSIKGVKTSSYTLVSELGKEL--- 240

Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
                  C   +F V + V N G + G H V++F R PK   G P K+L+ F  V   A 
Sbjct: 241 -------CERSKFSVTVKVKNEGKIIGKHPVLVFLRQPKPGSGRPVKKLVAFQTVRLNAG 293

Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
            + EI   + PCE L+ AN+ G  ++  G   L+VGE  + +TI
Sbjct: 294 QNAEIQRKLSPCEPLTRANEDGSMVIDGG---LVVGEKPYQITI 334


>gi|116181370|ref|XP_001220534.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
 gi|88185610|gb|EAQ93078.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
          Length = 549

 Score =  292 bits (747), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 195/565 (34%), Positives = 286/565 (50%), Gaps = 56/565 (9%)

Query: 19  LQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
           L L + +    F+N       PD  C          C+   +   RA +L+  L ++EK+
Sbjct: 5   LSLRLALASALFANGVIGFQYPD--CANGPLADNTVCDPKATPPERAAALVKALNIEEKL 62

Query: 79  QQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN----GTVSSVTSFPQVLVSAAS 134
           Q L D +    RLG+PAY WWSE+LHG+A++ PGV FN    G  SS TSF   +  +A+
Sbjct: 63  QNLVDMSKGAERLGLPAYAWWSEALHGVAAS-PGVRFNRTAGGRFSSATSFANSITLSAA 121

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
           F+  L   +   ++ EARA  N G AGL +W PNIN ++DPRWGRG ETPGEDP+ +  Y
Sbjct: 122 FDDELVYKVADTISTEARAFANAGLAGLDYWTPNINPYKDPRWGRGHETPGEDPVRIKGY 181

Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
               +   +G     DD  I      ++V+                A CKH  AYDLE+W
Sbjct: 182 VKALLAGLEG-----DDPSI------RKVV----------------ATCKHYAAYDLERW 214

Query: 255 GNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA- 313
              +R+ F+A+++ QD  + + PPF+ C    K    MCSYN +NG PAC    L     
Sbjct: 215 QGTTRHRFDAVVSLQDLSEYYLPPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDIL 274

Query: 314 RNEWGF---KGYITSDCDAVATIF---EYQNY--TKTHEDSAAGVLKAGMDINC---GTC 362
           R  WG+     YITSDC+A+       ++ N+  T+T  ++AA   +AG D  C   G  
Sbjct: 275 RKHWGWTEHNNYITSDCNAIQDFLPGPKWHNFSSTQTEAEAAAVAYQAGTDTVCEVPGWP 334

Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDVCTSEHK 421
                  A ++  + E+ ID AL  L+   +R+G F  DP  G  Y  +G +DV T E +
Sbjct: 335 PYTDVIGAYNQTLLSEEVIDTALKRLYEGLVRVGYF--DPASGSPYRSIGWEDVNTPEAQ 392

Query: 422 KLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN--NISQMGGGYTGIPCSPKSL 479
           +LAL +   G+VLLKND   LPLN     ++A+IG   N  N  ++ GGY+G P    S 
Sbjct: 393 ELALQSGTDGLVLLKNDGT-LPLNLED-KTVALIGFWANSTNGGRILGGYSGFPPYIHSP 450

Query: 480 LRGLEAYVSKTHYASGC---HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRD 536
           +   E      HYASG    +      D    +A+  AKK++ ++   G D +   ED D
Sbjct: 451 VDAAEKLNLTYHYASGPLAENITQAAIDDWVAKALEPAKKSNVILYFGGTDTSIAAEDLD 510

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPV 561
           R S+  P  Q++++ +++   + P 
Sbjct: 511 RDSIAWPEIQLAVIEALSALRQAPA 535


>gi|296081549|emb|CBI20072.3| unnamed protein product [Vitis vinifera]
          Length = 333

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 158/342 (46%), Positives = 214/342 (62%), Gaps = 12/342 (3%)

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
           M G Y G P    + L+GL A V+ T Y  GC +V C + A   EA +IA  AD  +++ 
Sbjct: 1   MIGNYEGTPGKYTTPLQGLTALVATT-YLPGCSNVACGT-AQIDEAKKIAAAADATVLIV 58

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           G+D + E E RDRV++ LPGQQ  L+T VA+ SK  VILV+  GG  D+SFA+ D +I+S
Sbjct: 59  GIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITS 118

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGR 643
           ILW+GYPGEAG  A+A++IFG +NP GRLP TWYP+S+  KVPM +MNMR D +  YPGR
Sbjct: 119 ILWVGYPGEAGGAAIADVIFGFYNPSGRLPTTWYPQSYVDKVPMTNMNMRPDPASGYPGR 178

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
           +YRFYTG  +Y FG GLSYT +++  + AP     S S+      +        +D V  
Sbjct: 179 TYRFYTGETIYTFGDGLSYTQFNHHLIQAPK----SVSIPIEEGHSCHSSKCKSVDAVQ- 233

Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
               SC +L F + + V NAG++ GSH V LF+  P V   +P+K L+GF++V   AK  
Sbjct: 234 ---ESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPSV-HNSPQKHLLGFEKVFVTAKAE 289

Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
             + F VD C+ LSI ++ G R + LG HVL VG L+HSL +
Sbjct: 290 ALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGNLKHSLNV 331


>gi|433679952|ref|ZP_20511614.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
 gi|430814928|emb|CCP42243.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
          Length = 909

 Score =  289 bits (739), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 165/437 (37%), Positives = 231/437 (52%), Gaps = 56/437 (12%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           +T +EK+ Q  + A AIPRLG+PAYEWW+E LHGIA NG            T FPQ +  
Sbjct: 68  MTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIGL 117

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRDPRWGRGQE 182
           AA++N +L   +G+  + EARA +NL           AGLT W+PNINIFRDPRWGRG E
Sbjct: 118 AATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGME 177

Query: 183 TPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSAC 242
           T GEDP +    AV F++  QG++               R +                A 
Sbjct: 178 TYGEDPYLTGQLAVGFIRGLQGDD-----------LTHPRTI----------------AT 210

Query: 243 CKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP 302
            KHL  +   + G   R+ F+  ++  D E T+ P FR+ I  G+A  +MC+YN ++G P
Sbjct: 211 PKHLAVHSGPEPG---RHGFDVDVSPHDLEATYTPAFRAAIVDGRAGAVMCAYNSLHGTP 267

Query: 303 ACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT 361
           AC    L   + R +WGF G++ SDCDAV  + ++  +   +  S+A  LKAG D+NCG 
Sbjct: 268 ACAADWLLNGRLRGDWGFTGFVVSDCDAVDDMTQFHYFRADNAGSSAAALKAGHDLNCGY 327

Query: 362 CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHK 421
              R    AI +G   E  +D++L+ LF+ + RLG      RK  Y +LG  DV ++ H+
Sbjct: 328 AY-RDLGKAIARGDADEAVLDQSLVRLFAARYRLGELQPQ-RKDPYARLGAKDVDSAAHR 385

Query: 422 KLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLR 481
            LAL AA+Q IVLL+N    LPL       LA+IGP  + ++ +   Y G   +P + L 
Sbjct: 386 ALALQAAQQSIVLLQNRNATLPLRPGL--RLAVIGPNADALAALEANYQGTSAAPVTPLL 443

Query: 482 GLEAY--VSKTHYASGC 496
           GL      +   YA G 
Sbjct: 444 GLRERFGAANLRYAQGA 460



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 78/262 (29%), Positives = 137/262 (52%), Gaps = 44/262 (16%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  L LP  Q +L+   A+ S +P+++VL  G  + +++A+  +      W  YPG++G
Sbjct: 667 DRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALNWAKQHADAIVAAW--YPGQSG 723

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A+++ GD NPGGRLP+T+Y         +  ++ A  S    GR+YR++ G  ++ 
Sbjct: 724 GTAIAQVLAGDVNPGGRLPVTFY--------RSTKDLPAYVSYDMKGRTYRYFKGEPLFA 775

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG GLSYT ++Y    A  +L+ + +L+AG++                            
Sbjct: 776 FGSGLSYTRFTY----AAPQLSAT-TLQAGAN--------------------------LQ 804

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V+  V N+G   G  VV ++ + P+ +Q +P + L+GF RV      ++E+ F + P  Q
Sbjct: 805 VRTQVRNSGTRAGDEVVQVYLQPPQGAQ-SPLRTLVGFQRVTLQPGEAREVGFELTP-RQ 862

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  ++ G+R +  G++ + VG
Sbjct: 863 LSDVDRAGQRAVQPGDYRVFVG 884


>gi|371777036|ref|ZP_09483358.1| glycoside hydrolase [Anaerophaga sp. HS1]
          Length = 890

 Score =  288 bits (737), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 173/455 (38%), Positives = 245/455 (53%), Gaps = 55/455 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + +L    RA  L+S +TL+EK+ Q+   A AI RLGIP Y WW+E LHG+       
Sbjct: 40  YLDPTLPFEERAADLVSKMTLEEKVSQMQHAAPAIERLGIPEYNWWNECLHGV------- 92

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQ----AGLTFW 165
              G     T FPQ +  AA ++      I +AV+ EARA ++     G+     GLTFW
Sbjct: 93  ---GRAGIATVFPQAIGMAAMWDDEEMYRIATAVSDEARAKHHDFARRGKRGIYQGLTFW 149

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
            PNINIFRDPRWGRG ET GEDP +    AV+++K  QG+    DD          R LK
Sbjct: 150 TPNINIFRDPRWGRGMETYGEDPFLTGELAVDYIKGLQGD----DD----------RYLK 195

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                        L A  KH + +   +     R+ F+A  + +D+  T+ P F+  I++
Sbjct: 196 -------------LVATSKHFLVHSGPE---PDRHHFDARTSARDSLMTYTPHFKKTIQE 239

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFE--YQNYTKTH 343
                +MC+YN+ NG+P C    +    RNEWGFKGYI SDC AVA  ++  +     T 
Sbjct: 240 AGVYSVMCAYNRYNGLPCCGSKPVENLLRNEWGFKGYIVSDCWAVADFYKKGHHEVVPTV 299

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
           E++AA  +KAG D+NCG         A+ +G V E++ID  +  L   +LRLG+F+  P 
Sbjct: 300 EEAAAMAVKAGTDLNCGNSYPALV-DAVKQGLVSEEEIDVLVKRLMEARLRLGMFD-PPE 357

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
              Y  +    V + EH++LAL AAR+ +VLLKND   LPL+KN V ++A+IGP  NN+ 
Sbjct: 358 MVPYTNIPYSVVDSKEHRELALIAARKSMVLLKNDNNTLPLDKN-VKNVAVIGPNANNLD 416

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
            +   Y G P +P + L G+   +  +   YA GC
Sbjct: 417 VLLANYNGYPSNPVTPLDGIRQKLPNANVQYALGC 451



 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 103/299 (34%), Positives = 153/299 (51%), Gaps = 55/299 (18%)

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
           EA++IA  +D V++  GL    E E+           DRV + LP  Q  LV ++    K
Sbjct: 609 EAIQIAAASDVVLMFMGLSPNLEGEEMPVNVPGFSGGDRVDIKLPQIQTDLVKAIMSLGK 668

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
            PV+LVL  G  L +++ EA++ + +IL   YPG+AG  A+A+++FGD+NP GRLP+T+Y
Sbjct: 669 -PVVLVLLNGSALAINW-EAEN-VPAILEAWYPGQAGGTAIADVLFGDYNPAGRLPVTFY 725

Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
                  P  D +M         GR+Y+++ G  ++ FG+GLSYT++ Y  L  P +   
Sbjct: 726 KSVTQLPPFEDYSMD--------GRTYQYFKGEALFPFGYGLSYTSFKYDNLVVPDK--- 774

Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
              L+AG                   EVT        V + VTN G+ DG  VV L+   
Sbjct: 775 ---LEAGK------------------EVT--------VHVDVTNTGNRDGDEVVQLYVSH 805

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           P V +  P + L GFDR+   A  +K +SF + P EQL++       ++P GN  L VG
Sbjct: 806 PDV-ESAPIRSLQGFDRIALKAGETKTVSFTLKP-EQLAVYQPQNGLVVPAGNLKLSVG 862


>gi|392537607|ref|ZP_10284744.1| Beta-glucosidase [Pseudoalteromonas marina mano4]
          Length = 870

 Score =  288 bits (736), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 173/479 (36%), Positives = 250/479 (52%), Gaps = 68/479 (14%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N S SI  R   L++ LTL+EK+ QL D + AI RL IP Y WW+E+LHG+A  G   
Sbjct: 34  YLNESASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNIPEYNWWNEALHGVARAG--- 90

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AA+F+  L   +G+A++ E RA ++   A        GLT+W
Sbjct: 91  -------KATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLAENNRSMYTGLTYW 143

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP + +  AV F+   QG+N                   
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDN------------------- 184

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                    + L   A  KH   Y +      SR+S +   +++D  +T+ P F+  I Q
Sbjct: 185 --------TEYLKSVATLKH---YAVHSGPEVSRHSDDYTASKKDLAETYLPAFKDVIAQ 233

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQ--NYTKT 342
            K + +MC+YN VNG PAC   +L Q K R+E+ F GYI SDC A+A  ++ +  N   T
Sbjct: 234 TKVASVMCAYNSVNGTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNT 293

Query: 343 HEDSAAGVLKAGMDINCGT---CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
              +AA  LK G D+NCG        +   A+ +G V+EKD+D+AL  L   + +LG+F+
Sbjct: 294 EAKAAAMALKTGTDLNCGDHHGNTYSYLSQAVKEGLVEEKDVDKALKRLMYARFKLGMFD 353

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            +P    Y     D V +++H  L  +AA++ +VLLKN+ + LPL  N    +A+IGP  
Sbjct: 354 -NPENVPYSDTSIDIVGSNKHLALTQEAAKKSLVLLKNE-QVLPLKGN--EKVALIGPNA 409

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASG--------CHDVPCNSDAGFH 508
           +N + + G Y G+P  P +    LE  + K +  Y +G         H  P  +   FH
Sbjct: 410 DNEAILLGNYNGMPIVPITPKLALEQRLGKNNLTYTAGSSITGEVYTHYQPIKAKHFFH 468



 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 79/301 (26%), Positives = 140/301 (46%), Gaps = 57/301 (18%)

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
            +A+  A +AD ++ V G+    E E+           DR ++ LP  Q++L+  + +T 
Sbjct: 597 QQALNNANEADVIVFVGGISANLEGEEMPLQIDGFSHGDRTNINLPKSQLNLLKKLKQTG 656

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           K P++LV   G  + +++   +  I +I+   YPGEA   AL  +++G+++P G+LP+T+
Sbjct: 657 K-PIVLVNMSGSAMALNWE--NENIDAIIQGFYPGEAAGSALVSLLYGEYSPSGKLPITF 713

Query: 618 YPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           Y +S + +P   D +M+         R+Y++Y G  +Y FG GLSY ++ YK        
Sbjct: 714 Y-KSVSDLPDFKDYSMK--------NRTYKYYEGEVLYPFGFGLSYADFKYK-------- 756

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
               S+ AGS                            ++  ++TN        VV ++ 
Sbjct: 757 NTRHSIDAGSGD-------------------------LNLTTTITNQSSFSADDVVQVYV 791

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
            +P     TP KQL+GF  +    +   +I F + P  +LS  N+ G  +   G  ++ V
Sbjct: 792 SMPDAPIKTPNKQLVGFKHITLKNESKNDIKFTI-PKNKLSYINEQGIAVAYKGRLIITV 850

Query: 797 G 797
           G
Sbjct: 851 G 851


>gi|332665860|ref|YP_004448648.1| beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
 gi|332334674|gb|AEE51775.1| Beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
          Length = 887

 Score =  288 bits (736), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 175/500 (35%), Positives = 257/500 (51%), Gaps = 81/500 (16%)

Query: 2   SRPFMSIQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSI 61
           +R   S++    I F  L  I++ NV                C+   F  +P  +T+LS 
Sbjct: 6   TRRLCSVKAHRAIFFLSLLTILIANV----------------CQAQSFKGFPMWDTNLSF 49

Query: 62  STRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSS 121
             R K L+S LTL+EK+ Q+ + A AIPRLGIPAY+WW+E LHG+A          T   
Sbjct: 50  EVRVKDLVSRLTLEEKVGQMLNAAPAIPRLGIPAYDWWNEVLHGVAR---------TPFH 100

Query: 122 VTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GLTFWAPNINIF 172
           VT +PQ +  AA ++ +  + +    A+E RA++N   A         GLT+W PNINIF
Sbjct: 101 VTVYPQAIGMAAGWDSTSLAMMAHYSALEGRAVFNKATALGRNNERYLGLTYWTPNINIF 160

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           RDPRWGRGQET GEDP + S     FV+  QG++ K                        
Sbjct: 161 RDPRWGRGQETYGEDPFLTSMLGRAFVRGLQGDDPKY----------------------- 197

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
               L  +AC KH   +   +    SR+S N   +  D  DT+ P F+  + + K   +M
Sbjct: 198 ----LKAAACAKHFAVHSGPE---PSRHSDNFSPSNYDLWDTYLPAFKELVTKAKVEGVM 250

Query: 293 CSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
           C+YN  +G P C    L  D+ +K   +W FKGY+TSDC A+   F++    KTH D+ +
Sbjct: 251 CAYNAFHGQPCCGSDVLMNDILRK---QWQFKGYVTSDCWAIDDFFKFH---KTHPDATS 304

Query: 349 GVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP-RK 404
             + A   G D+ CGT + +     + KG + E  +D +L+ LF+ + RLG+F  DP   
Sbjct: 305 ASVDAVLHGTDVECGTDVYKSLLDGVKKGMIAEAQLDISLIRLFTTRYRLGMF--DPVSM 362

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
            KY +     + T+EHK  +L  A+Q IVLLKN+   LPL+KN +  +A++GP  +N   
Sbjct: 363 VKYAQTPESILETAEHKAHSLKMAQQSIVLLKNEGNTLPLSKN-IKKIAVLGPNADNRIV 421

Query: 465 MGGGYTGIPCSPKSLLRGLE 484
           + G Y G P    + L+G++
Sbjct: 422 VLGNYNGQPSEIITALQGIK 441



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 90/297 (30%), Positives = 136/297 (45%), Gaps = 54/297 (18%)

Query: 511 VRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRP 560
           V   K AD ++ V G+    E E+           DR S+LLP  Q  L+  +  T K  
Sbjct: 613 VNRVKDADAIVYVGGISPQLEGEEMRVDFPGFNGGDRTSILLPAVQTELLKMLKGTGKPL 672

Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
           V +V+TG     ++    D  I +I+   Y G++   A+A+++FGD+NP GRLP+T+Y  
Sbjct: 673 VFVVMTGSA---IALPYEDQNIPAIVNAWYGGQSAGTAIADVLFGDYNPAGRLPVTFYKA 729

Query: 621 SFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
                   D ++    S     R+YR++ G  +Y FGHGLSYT++ Y  L  P ++   A
Sbjct: 730 --------DSDLPDFKSYDMNNRTYRYFKGDALYPFGHGLSYTSFQYSKLKTPGKIKSGA 781

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
           S                                F V  ++TN G  DG  VV L+   P+
Sbjct: 782 S--------------------------------FKVSATLTNTGKKDGDEVVQLYLAYPE 809

Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           V+   P + L GF+R+   A  SK +SF + P EQ  + N+ G    P G   + +G
Sbjct: 810 VAGKAPIRALKGFNRIRLKAGESKTVSFTLSP-EQCQLVNEEGALYQPKGKMEISLG 865


>gi|217967241|ref|YP_002352747.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
 gi|217336340|gb|ACK42133.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
           DSM 6724]
          Length = 762

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 229/756 (30%), Positives = 359/756 (47%), Gaps = 128/756 (16%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           + L  KIQ+     +   RLGIPA     E L G  + G            T FPQ +  
Sbjct: 72  VELANKIQKFLKEKT---RLGIPAI-IHEECLSGFMAKG-----------ATVFPQAIGM 116

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
           A++F   L   +   +    RA  N+ Q GL+   P ++I RDPRWGR +ET GEDP +V
Sbjct: 117 ASTFEPELIRRVSDVIRQHMRAA-NVHQ-GLS---PVLDIPRDPRWGRTEETFGEDPYLV 171

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S  A E+VK  QGE+W+                              + A  KH  AY +
Sbjct: 172 SRMAAEYVKGLQGEDWREG----------------------------IIATVKHFTAYGI 203

Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
            +    +R    A + E++  + F  PF   I++G+A  +M +Y++++GVP      L  
Sbjct: 204 SEG---ARNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLT 260

Query: 312 KA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQ 368
           K  R EWGFKGY+ SD  A+  +  +    K  +++A   L+AG+DI   +  C      
Sbjct: 261 KILRWEWGFKGYVVSDYIAIRMLENFHRVAKDAKEAAVLALEAGIDIELPSVDCYGEPLI 320

Query: 369 SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAA 428
            A+ +G + E+ I+ ++  +   +  LGLF+GD  K    K   D     E ++L+ + A
Sbjct: 321 QAVKEGLISEEVINASVERVLRAKFMLGLFDGDLEKDP--KKVYDIFDKPEFRELSREVA 378

Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT---------------GIP 473
           R+ IVLLKND   LPL+KN + ++A+IGP  +N   + G Y+                IP
Sbjct: 379 RRSIVLLKNDG-ILPLSKN-IRTVAVIGPNADNPRNLHGDYSYTAHIPSVSETLEGVKIP 436

Query: 474 --CSPK--SLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
             C+ +  S+L G++  VS   +  YA GC ++  +S  GF EA+ IAK+AD +I V G 
Sbjct: 437 EECAVRTVSILEGIKNKVSAETQVLYAKGC-EILSDSKEGFDEAIEIAKRADVIIAVMGE 495

Query: 526 ----LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ 581
                      E  DR +L L G Q  L+  + +  K P++LVL  G P  + +   +  
Sbjct: 496 ESGLFHRGISGEGNDRTTLELFGIQRDLLRELHKLGK-PIVLVLVNGRPQALKWEHEN-- 552

Query: 582 ISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYP 641
           +++IL   YPGE G  A+A++IFGD+NP G+LP++ +P    +VP+   N +  +   Y 
Sbjct: 553 LNAILEAWYPGEEGGDAVADVIFGDYNPSGKLPIS-FPAVTGQVPVY-YNRKPSAFTDYV 610

Query: 642 GRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV 701
             S +      +Y FGHGLSYT + Y  L                              +
Sbjct: 611 EESAK-----PLYPFGHGLSYTTFEYSNLK-----------------------------I 636

Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
           H ++V +    +  +  ++ N G  +G  VV L+      S   P K+L GF ++H    
Sbjct: 637 HPEKVNALE--KVEISFTIKNTGVREGEEVVQLYVHDQVASLERPVKELKGFKKIHLKPG 694

Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            SK ++F + P EQL+  ++  R ++  G   +M+G
Sbjct: 695 ESKRVTFILYP-EQLAFYDEFMRFVVEKGIFEIMIG 729


>gi|28199699|ref|NP_780013.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
 gi|182682443|ref|YP_001830603.1| beta-glucosidase [Xylella fastidiosa M23]
 gi|417557804|ref|ZP_12208815.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
 gi|28057820|gb|AAO29662.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
 gi|182632553|gb|ACB93329.1| Beta-glucosidase [Xylella fastidiosa M23]
 gi|338179587|gb|EGO82522.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
          Length = 882

 Score =  287 bits (734), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 172/450 (38%), Positives = 239/450 (53%), Gaps = 58/450 (12%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
            A +L++ +T QEKI Q  + A AIPRLGIPAY+WWSE LHGIA NG            T
Sbjct: 33  HAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YAT 82

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
            FPQ +  AAS+N  L  ++G+  + EARA +NL           AGLT W+PNINIFRD
Sbjct: 83  VFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFRD 142

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRG ET GEDP + S  AV F++  QG                           D  
Sbjct: 143 PRWGRGMETYGEDPYLTSQLAVSFIRGLQG---------------------------DTP 175

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
           D     A  KH   +   + G   R+SF+  ++  D E T+ P FR+ I  G A  +MC+
Sbjct: 176 DHPRTIATPKHFAVHSGPEQG---RHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCA 232

Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN ++G PAC    L   + RN+WGF G++ SDCDA+  +  +  + + +  ++A  LK+
Sbjct: 233 YNALHGTPACASDWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAAALKS 292

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGP 412
           G D+NCG    R    AI +G + E  +D+AL+ LF+ + RLG     PR+   Y  +G 
Sbjct: 293 GNDLNCGNTY-RDLNQAIARGDIDESTLDQALIRLFTARQRLGTLQ--PREHDPYAAIGI 349

Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
             + T  H+ LAL AA Q +VLLKN    LPL     ++LA++GP  ++++ +   Y G 
Sbjct: 350 KHIDTPAHRALALQAAAQSLVLLKNSGNTLPLPPE--TTLAVLGPDADSLTALEANYQGT 407

Query: 473 PCSPKSLLRGLEAY--VSKTHYASGCHDVP 500
             +P + L GL      +K HYA G    P
Sbjct: 408 SSTPVTPLTGLRTRFGTAKVHYAQGASLAP 437



 Score =  122 bits (307), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 95/299 (31%), Positives = 148/299 (49%), Gaps = 54/299 (18%)

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
           EA R    AD ++   GL    E E+           DR ++ LP  Q +L+  V +T+ 
Sbjct: 603 EAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHV-KTTG 661

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
           +P+I+VL  G  + +++A+      +IL   YPG++G  A+A+ + GD NPGGRLP+T+Y
Sbjct: 662 KPLIVVLMSGSAVALNWAQ--HHADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFY 719

Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
             +    P    +M         GR+YR++ G  +Y FG+GLSYT ++Y+   AP   T 
Sbjct: 720 RSTQDLPPYISYDMT--------GRTYRYFKGQPLYPFGYGLSYTQFAYE---APQLST- 767

Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
            A+LKAG                      +  ++  HV+ + T AGD     VV L+   
Sbjct: 768 -ATLKAG----------------------NTLTVTTHVRNTGTRAGD----EVVQLYLEP 800

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           P   Q  P + L+GF RV      S+ ++F +D   QLS   + G+R +  G++ L VG
Sbjct: 801 PYSPQA-PLRSLVGFKRVTLRPGESRLLTFTLD-ARQLSSVQQTGQRSVEAGHYHLFVG 857


>gi|423258860|ref|ZP_17239783.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
           CL07T00C01]
 gi|423264169|ref|ZP_17243172.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
           CL07T12C05]
 gi|387776440|gb|EIK38540.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
           CL07T00C01]
 gi|392706435|gb|EIY99558.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
           CL07T12C05]
          Length = 805

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 234/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A+
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +  A     V+ FQGE    
Sbjct: 188 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGAMGTALVRGFQGETL-- 240

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           +DG        K V+                A  KH  +Y    W         A I E+
Sbjct: 241 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 273

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G P    R  L    ++ W FKG++ SD  
Sbjct: 274 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 332

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           AV  + E+      +E +A   + AG+D + GT +      +A+ +G V    ID+A+  
Sbjct: 333 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 391

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           + S++ ++GLF+ DP   +  K     V +SEH  LA + ARQ IVLLKN  K LPL K+
Sbjct: 392 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 448

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVSKTH---YASGCHDVPCN 502
            + +LA+IGP  +N+  M G YT         ++L G+   VSK     YA GC  V  +
Sbjct: 449 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 506

Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
           S  GF +A+  A+ AD V++V G     D + E E+                    DR +
Sbjct: 507 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 566

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L L G+Q+ L+  ++R  K PV+LVL  G PL +  A  +++     W  YPG  G  A+
Sbjct: 567 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 623

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           A+++FGD+NP GRL ++  P S  ++P+   N R   +R      Y    GT  Y FG+G
Sbjct: 624 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 677

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT +SY          +   +  GSD                     C   R  V ++
Sbjct: 678 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 706

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           + N G  DG  V  L+ R    S  TP KQL  F R+H  A  S+E++F +D  + L++ 
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 765

Query: 780 NKHGRRILPLGNHVLMVG 797
            + G  ++  G   +MVG
Sbjct: 766 MQEGEWVVEPGRFTIMVG 783


>gi|440731995|ref|ZP_20911965.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
 gi|440370332|gb|ELQ07251.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
          Length = 913

 Score =  286 bits (732), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 161/462 (34%), Positives = 247/462 (53%), Gaps = 45/462 (9%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +T  S   RA  L++ +TL+EK  Q+ + A AIPRLG+PAY+WW+E+LHG+A  G   
Sbjct: 37  YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFW 165
                    T FPQ +  AA+F+  L   + +A++ EARA ++            GLTFW
Sbjct: 94  -------GATVFPQAIGMAATFDVPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R+     
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEAYRK----- 201

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                        L A  KH   +   +     R+ F+A  +++D  +T+ P F + +++
Sbjct: 202 -------------LDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVKE 245

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           GK   +M +YN+V G  A     L +   R+ WGF GY+ SDC A+  I++      T E
Sbjct: 246 GKVDAVMGAYNRVYGESASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATRE 305

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           ++AA  +K G ++ CG        SA+ KG + E D+D+AL  L   ++RLG+F+  P K
Sbjct: 306 EAAALAVKHGTELECG-AEYSTLPSAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPEK 363

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             + ++      + EH  LA   AR+ +VLLKND   LPL++  +  +A++GP  ++   
Sbjct: 364 LAWAQIPLSANQSPEHDALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMA 422

Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
           + G Y G P +P ++L+G+   A  ++  YA G   V    D
Sbjct: 423 LLGNYYGTPAAPVTVLQGIREAAPDAEVLYARGADLVEGRDD 464



 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 96/302 (31%), Positives = 147/302 (48%), Gaps = 53/302 (17%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             +A+  A++AD V+ V GL    E E+           DR  L LP  Q +L+ ++  T
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGT 687

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV+ VLT G  L + +A+    + +IL   YPG+ G  A+A+++FGD NPGGRLP+T
Sbjct: 688 GK-PVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVT 744

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y ES T    +D  MR        GR+YR++ GT +Y FGHGLSYT ++Y      S+L
Sbjct: 745 FYKESETLPAFDDYAMR--------GRTYRYFAGTPLYPFGHGLSYTQFAY------SDL 790

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            +        D++ L   G                 R H  + V N G   G  VV L+ 
Sbjct: 791 RL--------DRSKLAADG-----------------RLHATLKVKNTGQRAGDEVVQLYL 825

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR-RILPLGNHVLM 795
           +     +    K L GF R+      ++E+ F + P   L + ++  +  ++  G++ L 
Sbjct: 826 QPLSPQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKGYVVDPGDYELQ 885

Query: 796 VG 797
           VG
Sbjct: 886 VG 887


>gi|359450637|ref|ZP_09240068.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
 gi|358043611|dbj|GAA76317.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
          Length = 468

 Score =  286 bits (731), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 168/458 (36%), Positives = 244/458 (53%), Gaps = 60/458 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N S SI  R   L++ LTL+EK+ QL D + AI RL +P Y WW+E+LHG+A  G   
Sbjct: 34  YLNKSASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNMPEYNWWNEALHGVARAG--- 90

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL--------GQAGLTFW 165
                    T FPQ +  AA+F+  L   +G+A++ E RA ++            GLT+W
Sbjct: 91  -------KATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLEENNRSMYTGLTYW 143

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP + +  AV F+   QG+N                   
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGDN------------------- 184

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                    + L   A  KH   Y +      SR+S +   +E+D  +T+ P F+  I Q
Sbjct: 185 --------AEYLKSVATLKH---YAVHSGPEVSRHSDDYTASEKDLAETYLPAFKDVIAQ 233

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQ--NYTKT 342
            K + +MC+YN VNG PAC   +L Q K R+E+ F GYI SDC A+A  ++ +  N   T
Sbjct: 234 TKVASVMCAYNSVNGTPACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNT 293

Query: 343 HEDSAAGVLKAGMDINCGT---CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
              +AA  LK G D+NCG        +   A+ +G V+EKD+D+AL  L   + +LG+F+
Sbjct: 294 GAKAAAMALKTGTDLNCGDHHGNTYSYLTQAVKEGLVEEKDVDKALKRLMYARFKLGMFD 353

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            +P    Y     D V +++H  L  +AA++ +VLLKN+ + LPL  N    +A+IGP  
Sbjct: 354 -NPENVPYSDTSIDVVGSNKHLALTQEAAQKSLVLLKNE-QVLPLKGN--EKIALIGPNA 409

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASG 495
           +N + + G Y G+P  P +    LE  + K +  Y +G
Sbjct: 410 DNEAILLGNYNGMPIVPITPKLALEQRLGKNNLTYTAG 447


>gi|390991557|ref|ZP_10261819.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372553724|emb|CCF68794.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 901

 Score =  286 bits (731), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 164/468 (35%), Positives = 248/468 (52%), Gaps = 45/468 (9%)

Query: 48  HFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA 107
           H  + P+ +T  S   RA  L+S +TL+EK  Q+ + A AIPRL +PAY+WW+E+LHG+A
Sbjct: 28  HAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVA 87

Query: 108 SNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------ 159
             G            T FPQ +  AA+F+  L   + +A++ EARA ++  L Q      
Sbjct: 88  RAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARY 137

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
            GLTFW+PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R
Sbjct: 138 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYR 197

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
           +                  L A  KH   +   +     R+ F+A  +++D  +T+ P F
Sbjct: 198 K------------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAF 236

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
            + ++ GK   +M +YN+V G  A     L Q   R++WGF+GY+ SDC A+  I+++  
Sbjct: 237 EALVKDGKVDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHK 296

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
              T E +AA  +K G ++ CG        +A+ +G + E  ID AL  L + ++RLG+F
Sbjct: 297 IVATREQAAALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMF 355

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +  P +  +  +      +  H  LA   AR+ +VLLKND   LPL++  +  +A+IGP 
Sbjct: 356 D-PPGQLPWSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPT 413

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSD 504
            ++   + G Y G P +P ++L+G+ A   K    YA G   V    D
Sbjct: 414 ADDTMALLGNYYGTPAAPVTVLQGIRAAAPKAQVLYARGADLVEGRDD 461



 Score =  133 bits (334), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 98/320 (30%), Positives = 144/320 (45%), Gaps = 59/320 (18%)

Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
           L+   AY  +  Y  G  D           +     EA+ +A+ AD V+ V GL    E 
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 650

Query: 533 EDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           E+           DR  L LP  Q  L+ ++  T  RPV+ VLT G  L + +A+    +
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATG-RPVVAVLTTGSALAIDWAQ--QHL 707

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
            +IL   YPG+ G  A+A+ +FGD NPGGRLP+T+Y ES T    +D  MR        G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
           R+YR++ GT +Y FGHGLSYT ++Y                          +G RLD   
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY--------------------------SGLRLDRTT 793

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           I    S T+      ++V N G   G  VV L+       +    K+L GF R+      
Sbjct: 794 IATDGSLTAT-----VTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGE 848

Query: 763 SKEISFGVDPCEQLSIANKH 782
            +E+ F ++  + L + ++ 
Sbjct: 849 QRELGFTINAKDALRLYDEQ 868


>gi|21244948|ref|NP_644530.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21110666|gb|AAM39066.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 901

 Score =  286 bits (731), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 164/468 (35%), Positives = 250/468 (53%), Gaps = 45/468 (9%)

Query: 48  HFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA 107
           H  + P+ +T  S   RA  L+S +TL+EK  Q+ + A AIPRL +PAY+WW+E+LHG+A
Sbjct: 28  HAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVA 87

Query: 108 SNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------ 159
             G            T FPQ +  AA+F+  L   + +A++ EARA ++  L Q      
Sbjct: 88  RAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARY 137

Query: 160 AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFR 219
            GLTFW+PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R
Sbjct: 138 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYR 197

Query: 220 EKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPF 279
           +                  L A  KHL  +   +     R+ F+A  +++D  +T+ P F
Sbjct: 198 K------------------LDATAKHLAVHSGPE---ADRHHFDARPSQRDLYETYLPAF 236

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
            + +++GK   +M +YN+V G  A     L Q   R++WGF+GY+ SDC A+  I+++  
Sbjct: 237 EALVKEGKVDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHK 296

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
              T E +AA  +K G ++ CG        +A+ +G + E  ID AL  L + ++RLG+F
Sbjct: 297 IVATREQAAALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMF 355

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +  P +  +  +      +  H  LA   AR+ +VLLKND   LPL++     +A+IGP 
Sbjct: 356 D-PPGQLPWSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKFKRIAVIGPT 413

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
            ++   + G Y G P +P ++L+G+ A    ++  YA G   V    D
Sbjct: 414 ADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461



 Score =  132 bits (333), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 95/320 (29%), Positives = 140/320 (43%), Gaps = 59/320 (18%)

Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
           L+   AY  +  Y  G  D           +     EA+ +A+ AD V+ V GL    E 
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGARPPLQEALDVARSADVVVFVGGLTGDVEG 650

Query: 533 EDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           E+           DR  L LP  Q  L+ ++  T  RPV+ VLT G  L + +A+    +
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATG-RPVVAVLTTGSALAIDWAQ--QHL 707

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
            +IL   YPG+ G  A+A+ +FGD NPGGRLP+T+Y ES T    +D  MR        G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
           R+YR++ GT +Y FGHGLSYT ++Y  L                                
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAYSGL-------------------------------R 788

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           +D  T  T       ++V N G   G  VV L+       +    K+L GF R+      
Sbjct: 789 LDRTTIATDGSLAATVTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGE 848

Query: 763 SKEISFGVDPCEQLSIANKH 782
            +E+ F ++  + L + ++ 
Sbjct: 849 QRELGFTINAKDALRLYDEQ 868


>gi|289668505|ref|ZP_06489580.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 902

 Score =  286 bits (731), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 163/463 (35%), Positives = 248/463 (53%), Gaps = 45/463 (9%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +T  S   RA  L+S +TL+EK  Q+ + A AIPRLG+ AY+WW+E+LHG+A  G  
Sbjct: 34  PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
                     T FPQ +  AA+F+  L   + +A++ EARA ++  L Q       GLTF
Sbjct: 92  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 143

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R+    
Sbjct: 144 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRK---- 199

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                         L A  KH   +   +     R+ F+A  +++D  +T+ P F + ++
Sbjct: 200 --------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVK 242

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
            GK   +M +YN+V G  A     L Q   R +WGFKGY+ SDC A+  I+++     T 
Sbjct: 243 DGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATR 302

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
           E +AA  +K G ++ CG        +A+ +G ++E  ID +L  L + ++RLG+F+  P 
Sbjct: 303 EQAAALAVKHGTELECGE-EYSTLPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPG 360

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
           +  + K+      +  H  LA   AR+ +VLLKND   LPL++  +  +A+IGP  ++  
Sbjct: 361 QLPWSKIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTM 419

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
            + G Y G P +P ++L+G+ A    ++  YA G   V    D
Sbjct: 420 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 462



 Score =  129 bits (325), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 134/286 (46%), Gaps = 52/286 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             EA+ +A+ A+ V+ V GL    E E+           DR  L LP  Q  L+ ++  T
Sbjct: 626 LQEALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQAT 685

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV+ VLT G  L + +A+    + +IL   YPG+ G  A+A+ +FGD NPGGRLP+T
Sbjct: 686 GK-PVVAVLTAGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y ES      +D  M         GR+YR++ GT +Y FGHGLSYT ++Y      S+L
Sbjct: 743 FYKESEALPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAY------SDL 788

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            +        D+N +   GS                 F   ++V N G   G  V  L+ 
Sbjct: 789 RL--------DRNTVAADGS-----------------FTATVTVKNTGQRAGDEVAQLYL 823

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
                 +    K+L GF RV       +E+ F ++  E L I ++ 
Sbjct: 824 HPLTPQRERAGKELRGFQRVALHPGEQRELRFPINAKEALRIYDEQ 869


>gi|265765465|ref|ZP_06093740.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
 gi|263254849|gb|EEZ26283.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
          Length = 814

 Score =  285 bits (730), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 234/738 (31%), Positives = 347/738 (47%), Gaps = 124/738 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A+
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +  A     V+ FQGE    
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGAMGTALVRGFQGETL-- 249

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           +DG        K V+                A  KH  +Y    W         A I E+
Sbjct: 250 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 282

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G P    R  L    ++ W FKG++ SD  
Sbjct: 283 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 341

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           AV  + E+      +E +A   + AG+D + GT +      +A+ +G V    ID+A+  
Sbjct: 342 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 400

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           + S++ ++GLF+ DP   +  K     V +SEH  LA + ARQ IVLLKN  K LPL K+
Sbjct: 401 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 457

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
            + +LA+IGP  +N+  M G YT      +  ++L G+   VSK     YA GC  V  +
Sbjct: 458 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 515

Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
           S  GF +A+  A+ AD V++V G     D + E E+                    DR +
Sbjct: 516 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 575

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L L G+Q+ L+  ++R  K PV+LVL  G PL +  A  +++     W  YPG  G  A+
Sbjct: 576 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 632

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           A+++FGD+NP GRL ++  P S  ++P+   N R   +R      Y    GT  Y FG+G
Sbjct: 633 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 686

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT +SY          +   +  GSD                     C   R  V ++
Sbjct: 687 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 715

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           + N G  DG  V  L+ R    S  TP KQL  F R+H  A  S+E++F +D  + L++ 
Sbjct: 716 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 774

Query: 780 NKHGRRILPLGNHVLMVG 797
            + G  ++  G   +MVG
Sbjct: 775 MQEGEWVVEPGRFTIMVG 792


>gi|333380553|ref|ZP_08472244.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826548|gb|EGJ99377.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 957

 Score =  285 bits (730), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 232/764 (30%), Positives = 368/764 (48%), Gaps = 120/764 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + N +L + +R + L+S++T+++K++ L +      IP LG+PA     E++HG +    
Sbjct: 170 YMNPNLPLESRVEDLLSVMTVEDKMELLREGWGIPGIPHLGVPAIHK-VEAIHGFSYG-- 226

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   S  T FPQ +   A++N+ L      A+  E  +  N  QA    W+P +++
Sbjct: 227 --------SGATIFPQSIGMGATWNKRLIEAAAMAIGDETVSA-NAVQA----WSPVLDV 273

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            +D RWGR +ET GEDP++V+     ++K +Q +                          
Sbjct: 274 AQDARWGRCEETYGEDPVLVTEIGGAWIKGYQSKG------------------------- 308

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                  L    KH  A+     G   R S +  ++E++  +    PFR   ++ K   I
Sbjct: 309 -------LMTTPKHFAAHGAPLGG---RDSHDIGLSEREMREIHLVPFRDIYKKYKYQSI 358

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE-DSAAG 349
           M SY+   GVP     +L +   R+EWGF G+I SDC A+  +   ++YT   + ++A  
Sbjct: 359 MMSYSDFLGVPVAKSKELLKGILRDEWGFDGFIVSDCGAIGNLTARKHYTAVDKVEAARQ 418

Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
            L AG+  NCG T       +A  +G++   D+D     L     R GLF  +P K   +
Sbjct: 419 ALAAGIATNCGDTYNDPDVIAAAKRGELNMDDLDFTCKTLLRTLFRNGLFENNPCKPLDW 478

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            K+ P    + EH+ LA   A++ IVLL+N    LPL+K ++ ++A+IGP  +N+    G
Sbjct: 479 NKIYPG-WNSPEHQALARKTAQESIVLLENKGNILPLSK-SLKTIAVIGPGADNLQP--G 534

Query: 468 GYTGIP--CSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
            YT  P     KS+L G++A V   +K  Y  GC  +         +AV+ A+ AD  ++
Sbjct: 535 DYTSKPQPGQLKSVLTGIKAAVNSSTKVLYEEGCRFIGTEG-TDIAKAVKAAENADVAVL 593

Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           V G   T E          E+ D  +L+LPG+Q  L+ +V +T K PV+L+L  G P ++
Sbjct: 594 VLGDCSTSEALKGITNTSGENHDLATLILPGEQQKLLEAVCKTGK-PVVLILQAGRPYNL 652

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
           S+A  + Q   + W+  PG+ G  A A+++FGD+NP GRLPMT +P    ++P+   N +
Sbjct: 653 SYAAENCQAVLVNWL--PGQEGGYATADVLFGDYNPAGRLPMT-FPRDAAQLPLY-YNFK 708

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S R Y      +Y    +Y FG+GLSYT+++Y      S+L IS           L++
Sbjct: 709 T-SGRVYDYVDMPYYP---LYQFGYGLSYTSFNY------SDLNIS-----------LEK 747

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G+                   V  +VTN G V G  VV L+      S  T   +L  F
Sbjct: 748 NGN-----------------VSVNATVTNTGKVAGDEVVQLYITDMYASVKTRVMELKDF 790

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           DRV+     SK++SF + P  QLS+ N    R++  G   +MVG
Sbjct: 791 DRVYLNPGESKKVSFVLTPY-QLSLLNDEMDRVVEKGLFKIMVG 833


>gi|71731103|gb|EAO33170.1| Beta-glucosidase [Xylella fastidiosa subsp. sandyi Ann-1]
          Length = 882

 Score =  285 bits (730), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 172/454 (37%), Positives = 239/454 (52%), Gaps = 58/454 (12%)

Query: 60  SISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTV 119
           S    A +L++ +T QEKI Q  + A AIPRLGIPAY+WWSE LHGIA NG         
Sbjct: 29  SPEQHAAALVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG--------- 79

Query: 120 SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNIN 170
              T FPQ +  AAS+N  L  ++G+  + EARA +NL           AGLT W+PNIN
Sbjct: 80  -YATVFPQAIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNIN 138

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           IFRDPRWGRG ET GEDP +    AV F++  QG                          
Sbjct: 139 IFRDPRWGRGMETYGEDPYLTGQLAVSFIRGLQG-------------------------- 172

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
            D  D     A  KH   +   + G   R+SF+  ++  D E T+ P FR+ I  G A  
Sbjct: 173 -DTPDHPRTIATPKHFAVHSGPEQG---RHSFDVDVSAYDLEATYTPAFRAAIVDGHAGS 228

Query: 291 IMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +MC+YN ++G PAC    L   + RN+WGF G++ SDCDA+  +  +  + + +  ++A 
Sbjct: 229 VMCAYNALHGTPACASDWLLNTRLRNDWGFNGFVVSDCDAIEDMTRFHFFRQDNASASAA 288

Query: 350 VLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYG 408
            LK+G D+NCG    R    AI +G + E  +D+AL+ LF+ + RLG     PR+   Y 
Sbjct: 289 ALKSGDDLNCGNTY-RDLNQAIARGDIDESTLDQALIRLFTARQRLGTLQ--PREHDPYA 345

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
            +G   + T  H+ LAL AA Q +VLLKN    LPL     ++LA++GP  ++++ +   
Sbjct: 346 AIGIKHIDTPAHRALALQAAAQSLVLLKNSGNTLPLPPE--TTLAVLGPDADSLTALEAN 403

Query: 469 YTGIPCSPKSLLRGLEAY--VSKTHYASGCHDVP 500
           Y G   +P + L GL      +K HYA G    P
Sbjct: 404 YQGTSSTPVTPLTGLRTRFGTAKVHYAQGASLAP 437



 Score =  123 bits (308), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 98/299 (32%), Positives = 148/299 (49%), Gaps = 54/299 (18%)

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
           EA R    AD ++   GL    E E+           DR ++ LP  Q +L+  V +T+ 
Sbjct: 603 EAERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHV-KTTG 661

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
           +P+I+VL  G  + +++A+      +IL   YPG++G  A+A+ + GD NPGGRLP+T+Y
Sbjct: 662 KPLIVVLMSGSAVALNWAQ--HHADAILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFY 719

Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
             +    P    +M         GR+YR++ G  +Y FG+GLSYT ++Y+   AP   T 
Sbjct: 720 RSTQDLPPYISYDMT--------GRTYRYFKGQPLYPFGYGLSYTQFAYE---APQLST- 767

Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
            A+LKAG   N L  T                    HV+ + T AGD     VV L+   
Sbjct: 768 -ATLKAG---NTLTVTA-------------------HVRNTGTRAGD----EVVQLYLEP 800

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           P   Q  P + L+GF RV      S+ ++F +D   QLS   + G+R +  G++ L VG
Sbjct: 801 PYSPQA-PLRSLVGFKRVTLRPGESRLLTFTLD-ARQLSGVQQTGQRSVEAGHYHLFVG 857


>gi|289666226|ref|ZP_06487807.1| beta-glucosidase precursor [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 902

 Score =  285 bits (729), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 163/463 (35%), Positives = 248/463 (53%), Gaps = 45/463 (9%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +T  S   RA  L+S +TL+EK  Q+ + A AIPRLG+ AY+WW+E+LHG+A  G  
Sbjct: 34  PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG-- 91

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
                     T FPQ +  AA+F+  L   + +A++ EARA ++  L Q       GLTF
Sbjct: 92  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 143

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R+    
Sbjct: 144 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYRK---- 199

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                         L A  KH   +   +     R+ F+A  +++D  +T+ P F + ++
Sbjct: 200 --------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVK 242

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
            GK   +M +YN+V G  A     L Q   R +WGFKGY+ SDC A+  I+++     T 
Sbjct: 243 DGKVDAVMGAYNRVYGESASASKFLLQDLLRQQWGFKGYVVSDCWAIVDIWKHHKIVATR 302

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
           E +AA  +K G ++ CG        +A+ +G ++E  ID +L  L + ++RLG+F+  P 
Sbjct: 303 EQAAALAVKHGTELECGE-EYSTLPAAVHQGLIEEAQIDTSLQTLMTARMRLGMFD-PPG 360

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
           +  + K+      +  H  LA   AR+ +VLLKND   LPL++  +  +A+IGP  ++  
Sbjct: 361 QLPWSKIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTM 419

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
            + G Y G P +P ++L+G+ A    ++  YA G   V    D
Sbjct: 420 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 462



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 92/286 (32%), Positives = 135/286 (47%), Gaps = 52/286 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             EA+ +A+ A+ V+ V GL    E E+           DR  L LP  Q  L+ ++  T
Sbjct: 626 LQEALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQAT 685

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV+ VLT G  L + +A+    + +IL   YPG+ G  A+A+ +FGD NPGGRLP+T
Sbjct: 686 GK-PVVAVLTAGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y ES      +D  M         GR+YR++ GT +Y FGHGLSYT ++Y      S+L
Sbjct: 743 FYKESEALPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAY------SDL 788

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            +        D+N +   GS                 F   ++V N G   G  V  L+ 
Sbjct: 789 RL--------DRNTVAADGS-----------------FTATVTVKNTGQRAGDEVAQLYL 823

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
                 +    K+L GF RV       +E+SF ++  E L I ++ 
Sbjct: 824 HPLTPQRERAGKELRGFQRVALHPGEQRELSFPINAKEALRIYDEQ 869


>gi|218186207|gb|EEC68634.1| hypothetical protein OsI_37026 [Oryza sativa Indica Group]
          Length = 1241

 Score =  285 bits (729), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 157/342 (45%), Positives = 202/342 (59%), Gaps = 34/342 (9%)

Query: 146  AVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGE 205
            AV+ EARAMYN+G+ GLT+W+PNIN+ RDPRWGR  ETPGEDP VV  YAV FV+  Q  
Sbjct: 917  AVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQ-- 974

Query: 206  NWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
                D  G          +   G+ + R   L  SACCKH  AYDL+ W N +R+ F+A 
Sbjct: 975  ----DIPG-------HEAVAAGGDPNTR--PLKTSACCKHYAAYDLDDWHNHTRFEFDAR 1021

Query: 266  ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYIT 324
            + E+D  +TFQ PF  C+  G  S +MCSYN+VNG+PAC    L  Q  R +WG  GYI 
Sbjct: 1022 VDERDMVETFQRPFEMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIV 1081

Query: 325  SDCDAVATIFEYQNYT-KTHEDSAAGVLKAGMDINCGTCMLRHT-------------QSA 370
            SDCDAV  + +   +   T  +++A  LKAG+D++CG      T               A
Sbjct: 1082 SDCDAVRVMTDNATWLGYTGAEASAAALKAGLDLDCGESWKNETDGHPLMDFLTTYGMEA 1141

Query: 371  IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
            ++KGK++E DID AL N +   +RLG F+      +Y  LG  D+CT +HK LALD ARQ
Sbjct: 1142 VNKGKMRESDIDNALTNQYMTLMRLGYFDD---IAQYSSLGRQDICTDQHKTLALDGARQ 1198

Query: 431  GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ-MGGGYTG 471
            GIVLLKND K LPL+ N V  + + GP V    + M G YTG
Sbjct: 1199 GIVLLKNDNKLLPLDANKVGFVNVRGPHVQAPEKIMDGDYTG 1240


>gi|58584046|ref|YP_203062.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84625823|ref|YP_453195.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|58428640|gb|AAW77677.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84369763|dbj|BAE70921.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 904

 Score =  285 bits (728), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 163/463 (35%), Positives = 247/463 (53%), Gaps = 45/463 (9%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+  T  S   RA  L+S +TL+EK  Q+ + A AIPRLG+PAY+WW+E+LHG+A  G  
Sbjct: 36  PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
                     T FPQ +  AA+F+  L   + +A++ EARA ++  L Q       GLTF
Sbjct: 94  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTF 145

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R+    
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRK---- 201

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                         L A  KH   +   +     R+ F+A  +++D  +T+ P F + ++
Sbjct: 202 --------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVK 244

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
            GK   +M +YN+V G  A     L Q   R +WGFKGY+ SDC A+  ++++     T 
Sbjct: 245 DGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATR 304

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
           E +AA  +  G ++ CG        +A+ +G + E  ID AL  L + ++RLG+F+  P 
Sbjct: 305 EQAAALAVTHGTELECGE-EYSTLPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPG 362

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
           +  + K+      +  H  LA   AR+ +VLLKND   LPL++  +  +A+IGP  ++  
Sbjct: 363 QLPWSKIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTM 421

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
            + G Y G P +P ++L+G+ A    ++  YA G   V   +D
Sbjct: 422 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRND 464



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 92/282 (32%), Positives = 134/282 (47%), Gaps = 52/282 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             EA+ +A+ AD V+ V GL    E E+           DR  L LP  Q  L+ ++  T
Sbjct: 628 LQEALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQAT 687

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV+ VLT G  L V +A+    + +IL   YPG+ G  A+A+ +FGD NPGGRLP+T
Sbjct: 688 GK-PVVAVLTAGSALAVDWAQ--QHVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 744

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y ES T    +D  M         GR+YR++ GT +Y FGHGLSYT ++Y      S+L
Sbjct: 745 FYKESETLPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAY------SDL 790

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            +        D++ L   G+                     ++V N G   G  VV L+ 
Sbjct: 791 RL--------DRSTLTADGA-----------------LTATVAVKNTGQRAGDEVVQLYL 825

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
              K  +    K+L GF R+       +E+ F ++  + L I
Sbjct: 826 HPLKPQRERAGKELRGFQRLALQPGQQRELRFTINAKDALRI 867


>gi|188574621|ref|YP_001911550.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|188519073|gb|ACD57018.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 904

 Score =  285 bits (728), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 163/463 (35%), Positives = 247/463 (53%), Gaps = 45/463 (9%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+  T  S   RA  L+S +TL+EK  Q+ + A AIPRLG+PAY+WW+E+LHG+A  G  
Sbjct: 36  PYLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG-- 93

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
                     T FPQ +  AA+F+  L   + +A++ EARA ++  L Q       GLTF
Sbjct: 94  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTF 145

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R+    
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRK---- 201

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                         L A  KH   +   +     R+ F+A  +++D  +T+ P F + ++
Sbjct: 202 --------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVK 244

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
            GK   +M +YN+V G  A     L Q   R +WGFKGY+ SDC A+  ++++     T 
Sbjct: 245 DGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATR 304

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
           E +AA  +  G ++ CG        +A+ +G + E  ID AL  L + ++RLG+F+  P 
Sbjct: 305 EQAAALAVTHGTELECGE-EYSTLPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPG 362

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
           +  + K+      +  H  LA   AR+ +VLLKND   LPL++  +  +A+IGP  ++  
Sbjct: 363 QLPWSKIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTM 421

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
            + G Y G P +P ++L+G+ A    ++  YA G   V   +D
Sbjct: 422 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRND 464



 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 91/282 (32%), Positives = 134/282 (47%), Gaps = 52/282 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             EA+ +A+ AD V+ V GL    E E+           DR  L LP  Q  L+ ++  T
Sbjct: 628 LQEALDVARSADVVVFVGGLTGDVEGEEMKVSYPGFAGGDRTDLRLPKPQRELLEALQAT 687

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV+ VLT G  L + +A+    + +IL   YPG+ G  A+A+ +FGD NPGGRLP+T
Sbjct: 688 GK-PVVAVLTAGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 744

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y ES T    +D  M         GR+YR++ GT +Y FGHGLSYT ++Y      S+L
Sbjct: 745 FYKESETLPAFDDYAMH--------GRTYRYFGGTPLYPFGHGLSYTQFAY------SDL 790

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            +        D++ L   G+                     ++V N G   G  VV L+ 
Sbjct: 791 RL--------DRSTLTADGA-----------------LTATVAVKNTGQRAGDEVVQLYL 825

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
              K  +    K+L GF R+       +E+ F ++  + L I
Sbjct: 826 HPLKPQRERAGKELRGFQRLALQPGQQRELRFTINAKDALRI 867


>gi|60680320|ref|YP_210464.1| beta-glucosidase [Bacteroides fragilis NCTC 9343]
 gi|60491754|emb|CAH06512.1| putative beta-glucosidase [Bacteroides fragilis NCTC 9343]
          Length = 814

 Score =  285 bits (728), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 233/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A+
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +        V+ FQGE    
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 249

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           +DG        K V+                A  KH  +Y    W         A I E+
Sbjct: 250 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 282

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G P    R  L    ++ W FKG++ SD  
Sbjct: 283 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 341

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           AV  + E+      +E +A   + AG+D + GT +      +A+ +G V    ID+A+  
Sbjct: 342 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 400

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           + S++ ++GLF+ DP   +  K     V +SEH  LA + ARQ IVLLKN  K LPL K+
Sbjct: 401 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 457

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
            + +LA+IGP  +N+  M G YT      +  ++L G+   VSK     YA GC  V  +
Sbjct: 458 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGC-TVRDS 515

Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
           S  GF +A+  A+ AD V++V G     D + E E+                    DR +
Sbjct: 516 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 575

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L L G+Q+ L+  ++R  K PV+LVL  G PL +  A  +++     W  YPG  G  A+
Sbjct: 576 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 632

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           A+++FGD+NP GRL ++  P S  ++P+   N R   +R      Y    GT  Y FG+G
Sbjct: 633 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 686

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT +SY          +   +  GSD                     C   R  V ++
Sbjct: 687 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 715

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           + N G  DG  V  L+ R    S  TP KQL  F R+H  A  S+E++F +D  + L++ 
Sbjct: 716 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 774

Query: 780 NKHGRRILPLGNHVLMVG 797
            + G  ++  G   +MVG
Sbjct: 775 MQEGEWVVEPGRFTIMVG 792


>gi|423269263|ref|ZP_17248235.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
           CL05T00C42]
 gi|423273173|ref|ZP_17252120.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
           CL05T12C13]
 gi|392701685|gb|EIY94842.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
           CL05T00C42]
 gi|392708205|gb|EIZ01313.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
           CL05T12C13]
          Length = 805

 Score =  285 bits (728), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 233/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A+
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +        V+ FQGE    
Sbjct: 188 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 240

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           +DG        K V+                A  KH  +Y    W         A I E+
Sbjct: 241 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 273

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G P    R  L    ++ W FKG++ SD  
Sbjct: 274 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 332

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           AV  + E+      +E +A   + AG+D + GT +      +A+ +G V    ID+A+  
Sbjct: 333 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 391

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           + S++ ++GLF+ DP   +  K     V +SEH  LA + ARQ IVLLKN  K LPL K+
Sbjct: 392 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 448

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
            + +LA+IGP  +N+  M G YT      +  ++L G+   VSK     YA GC  V  +
Sbjct: 449 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 506

Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
           S  GF +A+  A+ AD V++V G     D + E E+                    DR +
Sbjct: 507 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 566

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L L G+Q+ L+  ++R  K PV+LVL  G PL +  A  +++     W  YPG  G  A+
Sbjct: 567 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 623

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           A+++FGD+NP GRL ++  P S  ++P+   N R   +R      Y    GT  Y FG+G
Sbjct: 624 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 677

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT +SY          +   +  GSD                     C   R  V ++
Sbjct: 678 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 706

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           + N G  DG  V  L+ R    S  TP KQL  F R+H  A  S+E++F +D  + L++ 
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 765

Query: 780 NKHGRRILPLGNHVLMVG 797
            + G  ++  G   +MVG
Sbjct: 766 MQEGEWVVEPGRFTIMVG 783


>gi|336408356|ref|ZP_08588849.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
 gi|335937834|gb|EGM99730.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
          Length = 805

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 233/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A+
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +        V+ FQGE    
Sbjct: 188 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 240

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           +DG        K V+                A  KH  +Y    W         A I E+
Sbjct: 241 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 273

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G P    R  L    ++ W FKG++ SD  
Sbjct: 274 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 332

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           AV  + E+      +E +A   + AG+D + GT +      +A+ +G V    ID+A+  
Sbjct: 333 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 391

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           + S++ ++GLF+ DP   +  K     V +SEH  LA + ARQ IVLLKN  K LPL K+
Sbjct: 392 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 448

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
            + +LA+IGP  +N+  M G YT      +  ++L G+   VSK     YA GC  V  +
Sbjct: 449 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 506

Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
           S  GF +A+  A+ AD V++V G     D + E E+                    DR +
Sbjct: 507 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 566

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L L G+Q+ L+  ++R  K PV+LVL  G PL +  A  +++     W  YPG  G  A+
Sbjct: 567 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 623

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           A+++FGD+NP GRL ++  P S  ++P+   N R   +R      Y    GT  Y FG+G
Sbjct: 624 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 677

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT +SY          +   +  GSD                     C   R  V ++
Sbjct: 678 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 706

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           + N G  DG  V  L+ R    S  TP KQL  F R+H  A  S+E++F +D  + L++ 
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAAESREVTFTLDK-KSLALY 765

Query: 780 NKHGRRILPLGNHVLMVG 797
            + G  ++  G   +MVG
Sbjct: 766 MQEGEWVVEPGRFTIMVG 783


>gi|375357172|ref|YP_005109944.1| putative beta-glucosidase [Bacteroides fragilis 638R]
 gi|301161853|emb|CBW21397.1| putative beta-glucosidase [Bacteroides fragilis 638R]
          Length = 814

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 233/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A+
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +        V+ FQGE    
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 249

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           +DG        K V+                A  KH  +Y    W         A I E+
Sbjct: 250 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 282

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G P    R  L    ++ W FKG++ SD  
Sbjct: 283 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 341

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           AV  + E+      +E +A   + AG+D + GT +      +A+ +G V    ID+A+  
Sbjct: 342 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 400

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           + S++ ++GLF+ DP   +  K     V +SEH  LA + ARQ IVLLKN  K LPL K+
Sbjct: 401 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 457

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
            + +LA+IGP  +N+  M G YT      +  ++L G+   VSK     YA GC  V  +
Sbjct: 458 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 515

Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
           S  GF +A+  A+ AD V++V G     D + E E+                    DR +
Sbjct: 516 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 575

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L L G+Q+ L+  ++R  K PV+LVL  G PL +  A  +++     W  YPG  G  A+
Sbjct: 576 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 632

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           A+++FGD+NP GRL ++  P S  ++P+   N R   +R      Y    GT  Y FG+G
Sbjct: 633 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 686

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT +SY          +   +  GSD                     C   R  V ++
Sbjct: 687 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 715

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           + N G  DG  V  L+ R    S  TP KQL  F R+H  A  S+E++F +D  + L++ 
Sbjct: 716 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 774

Query: 780 NKHGRRILPLGNHVLMVG 797
            + G  ++  G   +MVG
Sbjct: 775 MQEGEWVVEPGRFTIMVG 792


>gi|325918730|ref|ZP_08180824.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325535054|gb|EGD06956.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 391

 Score =  284 bits (726), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 156/392 (39%), Positives = 219/392 (55%), Gaps = 52/392 (13%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           RA +L++ ++  EK+ Q  ++A AIPRL IPAYEWWSE LHGIA NG            T
Sbjct: 35  RAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSEGLHGIARNG----------YAT 84

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTFWAPNINIFRD 174
            FPQ +  AAS+N +L   +G+ V+ EARA +N            AGLT W+PNINIFRD
Sbjct: 85  VFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 144

Query: 175 PRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRG 234
           PRWGRG ET GEDP +    AV F++  QG++               R +          
Sbjct: 145 PRWGRGMETYGEDPFLTGQLAVGFIRGLQGDD-----------LNHPRTI---------- 183

Query: 235 DELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
                 A  KH+  +   + G   R+ F+  ++ +D E T+ P FR+ +  G+A  +MC+
Sbjct: 184 ------ATPKHIAVHSGPEPG---RHGFDVDVSPRDMEATYTPAFRAALVDGQAWSVMCA 234

Query: 295 YNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           YN ++G PAC    L   + R +WGFKG++ SDCDAV  + ++  +   +  S+A  LKA
Sbjct: 235 YNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAAALKA 294

Query: 354 GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
           G D+NCG    R   +AI++G+V E  +D++L+ LF+ + RLG     PRK  Y +LG  
Sbjct: 295 GHDLNCGHA-YRELGTAIERGEVDEALLDQSLVRLFAARYRLGELEA-PRKDPYARLGAK 352

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
           DV  + H+ LAL AA + IVLLKN    LPL 
Sbjct: 353 DVDNAAHRALALQAAAESIVLLKNTATTLPLK 384


>gi|323451833|gb|EGB07709.1| hypothetical protein AURANDRAFT_64764 [Aureococcus anophagefferens]
          Length = 819

 Score =  284 bits (726), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 229/763 (30%), Positives = 346/763 (45%), Gaps = 132/763 (17%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + SL  + R   L   + L++ I QL + A A+  + +PAY W +++ HG+       
Sbjct: 71  YLDASLPEADRLAWLADNVPLEDMIGQLVNAAPAVDAVDLPAYNWLNDNEHGV------- 123

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN-----LGQA-------- 160
              GT  + T +P      AS++  L   +G+A+  E+RA +N      G A        
Sbjct: 124 --KGTAHA-TVYPMGASLGASWSVDLAWRVGAAIGNESRATHNGLADKSGNACGSTSTGE 180

Query: 161 ------GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
                 G+T +APN+N+ RDPRWGR +E  GEDP + +  AV  V   QG          
Sbjct: 181 VVANGCGITLYAPNVNLVRDPRWGRAEEVYGEDPHLTAELAVGMVTGLQGN--------- 231

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAY-------DLEKWGNFSRYSFNAMIT 267
                              G  L+  ACCKH  A+       DL       R   +A ++
Sbjct: 232 ----------AEGSTSGPGGGPLVTGACCKHFAAHFAVYQNEDLPA----DRMVLDANVS 277

Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSD 326
            +D  +T+ P  ++C+ + KA+        VNG P C   +L     R  WGF G++ SD
Sbjct: 278 SRDLWETYLPVMKACVVRAKAT-------HVNGKPTCAHPELLNDVLRESWGFDGFVVSD 330

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG---TCMLRHTQSAIDKGKVQEKDIDR 383
            DA + +     Y  T E++AA  + AGMD   G      +     A+  G V    + R
Sbjct: 331 YDAWSNLVTTHKYVSTWEEAAAAGINAGMDQEGGFGDYSPVDALPDAVRNGTVAAATVRR 390

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLAL--DAARQGIVLLKNDKKF 441
           +   L  V+LRLG+F+       YG+    D       KLAL  +AAR+GIVL KN    
Sbjct: 391 SFERLMRVRLRLGMFDPPASTAVYGEAYQCDYQCETAAKLALAREAAREGIVLFKNAGGA 450

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYT-----GIPCSPKSLLRGLEAYVSKTHYASGC 496
           LPL K A   +A++GP V++   + G        G   +P ++ +GLEA V+    A+GC
Sbjct: 451 LPLAKGA--RIALVGPQVDDWRVLLGAVNYAFEDGPDVAPVTIQKGLEA-VANVSVAAGC 507

Query: 497 HDVPCNSDAGFHEAVRIAKK--------ADFVIVVAGLDLTQ-------ETEDRDRVSLL 541
             V C +      A R+A           D      G  L +       E+E  DR ++ 
Sbjct: 508 DSVACAALVDVDGAKRLAAAADATVVVLGDSFGATDGWPLCRGTRDDGCESESHDRATIE 567

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
           LPG+Q++LV ++   S R ++ VL  GG + +  A  D      LW+  PG+ G  ALA+
Sbjct: 568 LPGEQVALVAALRAASSR-LVCVLVHGGAVALGAAADDCDAVLDLWV--PGQMGGAALAD 624

Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG-TQVYGFGHGL 660
           ++FGD++P GR P+T Y  +    PM   +  A  S    G +YR+Y G    Y FG GL
Sbjct: 625 VLFGDYSPAGRSPITMYAATSDLPPMGVFDEYAGESSN--GTTYRYYAGPAPTYAFGDGL 682

Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           SY ++SY + +AP                                V +C ++R   +++V
Sbjct: 683 SYASFSYAWAAAPPT-----------------------------TVDACGAIRL--RVAV 711

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
           TN G V    VV ++ARVP  +   P  +L+ FDRV  +A G+
Sbjct: 712 TNTGSVASDEVVQVYARVPDATVPAPAIRLVAFDRVRAIAPGA 754


>gi|433677589|ref|ZP_20509555.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
 gi|430817300|emb|CCP39963.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
          Length = 913

 Score =  284 bits (726), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 160/462 (34%), Positives = 246/462 (53%), Gaps = 45/462 (9%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +T  S   RA  L++ +TL+EK  Q+ + A AIPRLG+PAY+WW+E+LHG+A  G   
Sbjct: 37  YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFW 165
                    T FPQ +  AA+F+  L   + +A++ EARA ++            GLTFW
Sbjct: 94  -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE+        G  +R+     
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEDVDVPKNAQGEAYRK----- 201

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                        L A  KH   +   +     R+ F+A  +++D  +T+ P F + +++
Sbjct: 202 -------------LDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVKE 245

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           GK   +M +YN+V G  A     L +   R+ WGF GY+ SDC A+  I++      T E
Sbjct: 246 GKVDAVMGAYNRVYGESASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVATRE 305

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           ++AA  +K G ++ CG        +A+ KG + E D+D AL  L   ++RLG+F+  P K
Sbjct: 306 EAAALAVKHGTELECG-AEYSTLPTAVRKGLISEADVDNALQKLMYSRMRLGMFD-PPEK 363

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             + ++      + EH  LA   AR+ +VLLKND   LPL++  +  +A++GP  ++   
Sbjct: 364 LAWAQIPLSANQSPEHDALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMA 422

Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
           + G Y G P +P ++L+G+   A  ++  YA G   V    D
Sbjct: 423 LLGNYYGTPAAPVTVLQGIREAAPDAEVLYARGADLVEGRDD 464



 Score =  140 bits (352), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 96/302 (31%), Positives = 147/302 (48%), Gaps = 53/302 (17%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             +A+  A++AD V+ V GL    E E+           DR  L LP  Q +L+ ++  T
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGT 687

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV+ VLT G  L + +A+    + +IL   YPG+ G  A+A+++FGD NPGGRLP+T
Sbjct: 688 GK-PVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVT 744

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y ES T    +D  MR        GR+YR++ GT +Y FGHGLSYT ++Y      S+L
Sbjct: 745 FYKESETLPAFDDYAMR--------GRTYRYFAGTALYPFGHGLSYTQFAY------SDL 790

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            +        D++ L   G                 R H  + V N G   G  VV L+ 
Sbjct: 791 RL--------DRSKLAADG-----------------RLHATLKVKNTGQRAGDEVVQLYL 825

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR-ILPLGNHVLM 795
           +     +    K L GF R+      ++E+ F + P   L + ++  +  ++  G++ L 
Sbjct: 826 QPLSPQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKAYVVDPGDYELQ 885

Query: 796 VG 797
           VG
Sbjct: 886 VG 887


>gi|395803818|ref|ZP_10483061.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
 gi|395434089|gb|EJG00040.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
          Length = 875

 Score =  284 bits (726), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 162/448 (36%), Positives = 236/448 (52%), Gaps = 53/448 (11%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           +PF NT L+   R ++L+S LTL+EK+ Q+ + A AIPRLGIPAY+WW+E+LHG+A    
Sbjct: 27  FPFQNTDLTFEERVENLVSQLTLEEKVAQMLNAAPAIPRLGIPAYDWWNETLHGVAR--- 83

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGL 162
                 T    T FPQ +  AA+F+++    +    A+E RA+YN             GL
Sbjct: 84  ------TPFKTTVFPQAIAMAATFDKNSLFKMADYSALEGRAIYNKAVELNRTKERYLGL 137

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           T+W PNINIFRDPRWGRGQET GEDP + +     FVK  QG++ K              
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDDPKY------------- 184

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
                         L  +AC KH   Y +       R++F+  +T  +  DT+ P F+  
Sbjct: 185 --------------LKAAACAKH---YAVHSGPESLRHTFDVDVTPYELWDTYLPAFKKL 227

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +   K + +MC+YN     P C    L     RN+W F GY+TSDC A+   F+      
Sbjct: 228 VTNSKVAGVMCAYNAFRTQPCCASDILMNDILRNQWKFTGYVTSDCWAIDDFFKNHKTHP 287

Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
               ++A  +  G DI+CGT   +    A+  G++ EK ID ++  LF ++ RLG+F  D
Sbjct: 288 DAASASADAVLHGTDIDCGTDAYKSLVQAVKNGQITEKQIDVSVKRLFMIRFRLGMF--D 345

Query: 402 P-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
           P    KY +     + + EHK+ AL  ARQ IVLLKN+K  LPL+K  +  + ++GP  +
Sbjct: 346 PVSMVKYAQTPSSVLESEEHKEHALKMARQSIVLLKNEKNTLPLSKK-LKKIVVLGPNAD 404

Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVS 488
           N   + G Y G P    ++L+G++  +S
Sbjct: 405 NSISILGNYNGTPSKLTTVLQGIKEKIS 432



 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 84/287 (29%), Positives = 133/287 (46%), Gaps = 54/287 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
           F   +   K AD  I   G+    E E+           DR S+LLP  Q  L+ ++ ++
Sbjct: 596 FANLIERHKNADAFIFAGGISPQLEGEEMPVDAPGFNGGDRTSILLPEVQTRLLKAL-QS 654

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
           S +PV+ ++  G  + V + EA++ I +IL I Y G++   A A++IFGD+NP GRLP+T
Sbjct: 655 SGKPVVFLIMTGSAIAVPW-EAEN-IPAILNIWYGGQSAGTASADVIFGDYNPAGRLPVT 712

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y          D ++ +    +   ++YR++ G  +YGFG+GLSYT + Y  L  P + 
Sbjct: 713 FY--------KGDSDLSSFVDYKMDNKTYRYFKGIPLYGFGYGLSYTEFKYSGLKTPDK- 763

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                +K G    I                           + VTN G ++G  V  L+ 
Sbjct: 764 -----IKKGQPVTI--------------------------SVKVTNTGKMEGEEVAQLYL 792

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
             P  S  +P K L GF+R +     S  ++F + P E LS   + G
Sbjct: 793 INPNTSIKSPLKSLKGFERFNLKPGQSTVVNFTLSP-EDLSYVTESG 838


>gi|189467437|ref|ZP_03016222.1| hypothetical protein BACINT_03826 [Bacteroides intestinalis DSM
           17393]
 gi|189435701|gb|EDV04686.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 863

 Score =  284 bits (726), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 169/446 (37%), Positives = 230/446 (51%), Gaps = 55/446 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           F N  L I  R   L+  LTL+EKI Q+ +NA AI RLGIPAY WW+E LHG+A      
Sbjct: 26  FLNPELPIVERVNDLVGRLTLEEKISQMLNNAPAIDRLGIPAYNWWNECLHGVAR----- 80

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
               +   VTSFPQ +  AA+++      +    + E RA+Y+            GLT+W
Sbjct: 81  ----SPYPVTSFPQAIAMAATWDTESVHQMAVYASDEGRAIYHDATRKGTPGIFRGLTYW 136

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP + ++  V FVK  QG                     
Sbjct: 137 SPNINIFRDPRWGRGQETYGEDPFLTASIGVSFVKGLQG--------------------- 175

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                 D    L  SAC KH   +   +W   +R++++A +   D  DT+ P F+  + +
Sbjct: 176 ------DDPVYLKSSACAKHYAVHSGPEW---NRHTYDAKVNNHDLWDTYLPAFKELVVE 226

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           GK + +MC+YN   G P C    L     RN W F GY+TSDC AV    ++ N  KTH+
Sbjct: 227 GKVTGVMCAYNSFFGQPCCGNDLLMMDILRNHWKFGGYVTSDCGAVE---DFYNTHKTHQ 283

Query: 345 DSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
           D+AA    A   G D  CG    R    A+ +G + EK ID +L  LF ++ RLG+F+ D
Sbjct: 284 DAAAASADAVLHGTDCECGNGAYRALADAVLRGLITEKQIDESLKKLFEIRFRLGMFDPD 343

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
            R   Y  +    +    HK  AL  ARQ IVLLKN  + LPLNKN +  +A++GP  ++
Sbjct: 344 DRV-PYSNIPLSVLECDAHKAHALKIARQSIVLLKNQDQLLPLNKNKIKKIAVVGPNADD 402

Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAYV 487
            S +   Y G P    + L G++  V
Sbjct: 403 KSVLLANYYGYPSHITTALEGIQKKV 428



 Score =  119 bits (299), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 91/311 (29%), Positives = 143/311 (45%), Gaps = 56/311 (18%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           YV     A    D+     A + +     K AD +I V GL    E E+           
Sbjct: 568 YVQHADNAEIKFDMGILRKADYKQTAAAVKDADVIIFVGGLSAKVEGEEMGVEIEGFKRG 627

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR S+ +P  Q +L+  +  T K PV+ V+  G  L + +  A   + +IL   Y G+AG
Sbjct: 628 DRTSISIPSVQQNLLKELYATGK-PVVFVMMTGSALGLEWESA--HLPAILNAWYGGQAG 684

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
            +A+A+++FGD+NP GRLP+T+Y        +ND+    D S +   R+YR++TGT VY 
Sbjct: 685 GQAIADVLFGDYNPSGRLPLTFYKS------VNDLPDFEDYSME--NRTYRYFTGTPVYP 736

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT + Y           S  L+   DK  ++ T                     
Sbjct: 737 FGYGLSYTTFQYS----------SLKLQPSPDKRSVKVTA-------------------- 766

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
               +TN G ++G  V  L+   P+    TP + L GF R++     S+ + F V   ++
Sbjct: 767 ---KITNTGKMEGEEVAQLYVSNPR-DFVTPIRALKGFKRINLKPGESQTVEF-VLTSKE 821

Query: 776 LSIANKHGRRI 786
           LS+ +  G+ +
Sbjct: 822 LSVVDISGKSV 832


>gi|423281958|ref|ZP_17260843.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
           615]
 gi|404582445|gb|EKA87139.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
           615]
          Length = 805

 Score =  283 bits (725), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 233/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A+
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +        V+ FQGE    
Sbjct: 188 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 240

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           +DG        K V+                A  KH  +Y    W         A I E+
Sbjct: 241 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 273

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G P    R  L    ++ W FKG++ SD  
Sbjct: 274 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 332

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           AV  + E+      +E +A   + AG+D + GT +      +A+ +G V    ID+A+  
Sbjct: 333 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 391

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           + S++ ++GLF+ DP   +  K     V +SEH  LA + ARQ IVLLKN  K LPL K+
Sbjct: 392 ILSLKFQMGLFD-DPFVDE--KQAAQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 448

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
            + +LA+IGP  +N+  M G YT      +  ++L G+   VSK     YA GC  V  +
Sbjct: 449 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 506

Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
           S  GF +A+  A+ AD V++V G     D + E E+                    DR +
Sbjct: 507 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 566

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L L G+Q+ L+  ++R  K PV+LVL  G PL +  A  +++     W  YPG  G  A+
Sbjct: 567 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 623

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           A+++FGD+NP GRL ++  P S  ++P+   N R   +R      Y    GT  Y FG+G
Sbjct: 624 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYIEEPGTPRYPFGYG 677

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT +SY          +   +  GSD                     C   R  V ++
Sbjct: 678 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 706

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           + N G  DG  V  L+ R    S  TP KQL  F R+H  A  S+E++F +D  + L++ 
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 765

Query: 780 NKHGRRILPLGNHVLMVG 797
            + G  ++  G   +MVG
Sbjct: 766 MQEGEWVVEPGLFTIMVG 783


>gi|121308314|dbj|BAF43576.1| arabinofuranosidase/xylosidase homolog [Prunus persica]
          Length = 349

 Score =  283 bits (725), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 148/346 (42%), Positives = 215/346 (62%), Gaps = 12/346 (3%)

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVA 524
           M G Y G+ C   + L+G+  Y    H A GC DV CN +  F  A   A++AD  ++V 
Sbjct: 6   MIGNYAGVACGYTTPLQGIGRYTRTIHQA-GCTDVHCNGNQLFGAAEAAARQADATVLVM 64

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           GLD + E E  DR  LLLPG Q  LV+ VAR S+ P ILVL  GGP+DV+FA+ D +IS+
Sbjct: 65  GLDQSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISA 124

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGR 643
           I+W+GYPG+AG  A+A+++FG  NPGG+LPMTWYP+++ T +PM DM MRAD +R YPGR
Sbjct: 125 IIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGR 184

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVH 702
           +YRFY G  V+ FG GLSYT +++     P+ +++   SLKA ++  +L +    +   H
Sbjct: 185 TYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKA---VRVSH 241

Query: 703 IDEVTSCTSLR-FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
            D    C +L    V + V N G +DG+H +++F   P   +    KQL+GF ++H  A 
Sbjct: 242 AD----CNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPD-GKWASSKQLMGFHKIHIAAG 296

Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
             K +   V  C+ LS+ ++ G R +PLG H L +G+L H ++++T
Sbjct: 297 SEKRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGDLSHHVSLQT 342


>gi|384421334|ref|YP_005630694.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353464247|gb|AEQ98526.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 904

 Score =  283 bits (725), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 162/463 (34%), Positives = 248/463 (53%), Gaps = 45/463 (9%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +T+ S   RA  L+S +TL+EK  Q+ + A AIPRL +PAY+WW+E+LHG+A  G  
Sbjct: 36  PYLDTARSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 93

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
                     T FPQ +  AA+F+  L   + +A++ EARA ++  L Q       GLTF
Sbjct: 94  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTF 145

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R+    
Sbjct: 146 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRK---- 201

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                         L A  KH   +   +     R+ F+A  +++D  +T+ P F + ++
Sbjct: 202 --------------LDATAKHFAVHSGPE---AERHHFDARPSQRDLYETYLPAFEALVK 244

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
            GK   +M +YN+V G  A     L Q   R +WGFKGY+ SDC A+  ++++     T 
Sbjct: 245 DGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVATR 304

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
           E +AA  +  G ++ CG        +A+ +G + E  ID AL  L + ++RLG+F+  P 
Sbjct: 305 EQAAALAVTHGTELECGE-EYSTLPAAVHQGLIDEAQIDTALQTLMTARMRLGMFD-PPG 362

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
           +  + K+      +  H  LA   AR+ +VLLKND   LPL++  +  +A+IGP  ++  
Sbjct: 363 QLPWSKIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTM 421

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
            + G Y G P +P ++L+G+ A    ++  YA G   V   +D
Sbjct: 422 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRND 464



 Score =  132 bits (331), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 91/282 (32%), Positives = 134/282 (47%), Gaps = 52/282 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             EA+ +A+ AD V+ V GL    E E+           DR  L LP  Q  L+ ++  T
Sbjct: 628 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQAT 687

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV+ VLT G  L + +A+    + +IL   YPG+ G  A+A+ +FGD NPGGRLP+T
Sbjct: 688 GK-PVVAVLTAGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 744

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y ES T    +D  M         GR+YR++ GT +Y FGHGLSYT ++Y      S+L
Sbjct: 745 FYKESETLPAFDDYTMH--------GRTYRYFGGTPLYPFGHGLSYTQFAY------SDL 790

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            +        D++ L   G+                     ++V N G   G  VV L+ 
Sbjct: 791 RL--------DRSTLTADGA-----------------LTATVAVKNTGQRAGDEVVQLYL 825

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
              K  +    K+L GF R+       +E+ F ++  + L I
Sbjct: 826 HPLKPQRERAGKELRGFQRLALQPGEQRELRFTINATDALRI 867


>gi|346726970|ref|YP_004853639.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346651717|gb|AEO44341.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 902

 Score =  283 bits (725), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 163/462 (35%), Positives = 246/462 (53%), Gaps = 45/462 (9%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +T  S   RA  L+S +TL+EK  Q+ + A AIPRLG+PAY+WW+E+LHG+A  G   
Sbjct: 35  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTFW 165
                    T FPQ +  AA+F+  L   + +A++ EARA ++  L Q       GLTFW
Sbjct: 92  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R+     
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRK----- 199

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                        L A  KH   +   +     R+ F+A  +++D  +T+ P F + ++ 
Sbjct: 200 -------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKD 243

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           GK   +M +YN+V G  A     L Q   R +WGFKGY+ SDC A+  I+++     T E
Sbjct: 244 GKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATRE 303

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  +K G ++ CG        +A+ +G + E  ID AL  L + ++RLG+F+  P +
Sbjct: 304 QAAALAVKHGTELECGE-EYSTLPAAVHQGLIDEAQIDTALKTLMTARMRLGMFD-PPGQ 361

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             +  +      +  H  LA   AR+ +VLLKND   LPL++  +  +A+IGP  ++   
Sbjct: 362 LPWSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMA 420

Query: 465 MGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
           + G Y G P +P ++L+G+ A    ++  YA G   V    D
Sbjct: 421 LLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 462



 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 93/282 (32%), Positives = 131/282 (46%), Gaps = 52/282 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             EA+ +A+ AD V+ V GL    E E+           DR  L LP  Q  L+ ++  T
Sbjct: 626 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQAT 685

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV+ VLT G  L + +A+    + +IL   YPG+ G  A+A+ +FGD NPGGRLP+T
Sbjct: 686 GK-PVVAVLTTGSALAIDWAQ--QHLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y ES T    +D  MR        GR+YR++ GT +Y FGHGLSYT ++Y         
Sbjct: 743 FYKESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAY--------- 785

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                            +G RLD   I    S T+      ++V N G   G  VV L+ 
Sbjct: 786 -----------------SGLRLDRTTIAADGSLTAT-----VTVKNTGQRAGDEVVQLYL 823

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
                 +    K+L GF R+       + + F +D    L I
Sbjct: 824 HPLTPQRERAGKELHGFQRIALQPGEQRALHFTLDAKNALRI 865


>gi|326427096|gb|EGD72666.1| hypothetical protein PTSG_04397 [Salpingoeca sp. ATCC 50818]
          Length = 614

 Score =  283 bits (725), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 198/607 (32%), Positives = 297/607 (48%), Gaps = 84/607 (13%)

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINI RDPRWGR QE P EDP++   +   +    Q                      
Sbjct: 11  SPNININRDPRWGRNQEVPSEDPLLNGEFGKLYTMGLQQ--------------------- 49

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
             GE+S R  +++++   KH  AY LE    F+R++F+A ++     DT+ P FR  + +
Sbjct: 50  --GEDS-RYTKVVVT--LKHWDAYSLEDSDGFTRHNFDAKVSNFALMDTYWPAFRKAVME 104

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED 345
           G A  +MCSYN +NG P C    L +  R+ W F GY+TSD  A+  I+   +YT     
Sbjct: 105 GNAKGVMCSYNALNGRPTCTHPLLTKVLRDIWKFDGYVTSDTGAIEDIYAKHHYTANASA 164

Query: 346 SAAGVLKAG-MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
           + A  L+ G  D++ G         A++ G+    D+DRAL N   ++  LGLF  DP +
Sbjct: 165 AVAAALRDGRCDMDSGAVYHDALLDAVNSGECSMDDVDRALYNTLKLRFELGLF--DPIE 222

Query: 405 GK-YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
            + Y ++    + T+  + L +    + ++LL+N    LP  K     +A+IGP +N   
Sbjct: 223 DQPYWRINASSINTTYAQDLNMKITLESMILLQNHNNALPFKKG--RKVAVIGPHINAQE 280

Query: 464 QMGGGYTGIPCSPKSL------LRGLEAY--VSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
            + G Y G  C   S       L  +EA   +S T  A G   + C +DA   EAV +AK
Sbjct: 281 ALVGNYLGQLCPDDSFDCITSPLAAIEAINGMSNTVSAMGSGVLAC-TDASIQEAVNVAK 339

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
            AD+V+++ G++ T E E  DR S+ LP  Q  L  ++A  +K    +++ GG    ++ 
Sbjct: 340 DADYVVLLIGINDTIEAESNDRTSIDLPQCQHKLTAAIAHLNKTTAAVLINGGM---LAI 396

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNP-GGRLPMTWYPESFT-KVPMNDMNMR 633
            +   Q+ +I+  GYPG  G  A+A+ IFGD N  GG+LP T YP  +  K+ M+DM M 
Sbjct: 397 EQEKKQLPAIIEAGYPGFYGGAAIAKTIFGDNNHLGGKLPYTVYPADYIHKINMSDMEM- 455

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
                  PGRSYR+YTG  ++ FG GL+YT +S +    PS  T +              
Sbjct: 456 ----TNSPGRSYRYYTGQPLWPFGFGLAYTTFSVQS-PGPSASTFA-------------- 496

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF---ARVPKVSQGTPEKQL 750
           TGS              +  F + + V N G   G  VV ++     +P  S    +KQL
Sbjct: 497 TGS--------------NTSFSLPVHVVNTGKRTGDTVVQVYMAPVSLPHRSFSL-KKQL 541

Query: 751 IGFDRVH 757
           I F+RVH
Sbjct: 542 IAFERVH 548


>gi|304406707|ref|ZP_07388362.1| glycoside hydrolase family 3 domain protein [Paenibacillus
           curdlanolyticus YK9]
 gi|304344240|gb|EFM10079.1| glycoside hydrolase family 3 domain protein [Paenibacillus
           curdlanolyticus YK9]
          Length = 733

 Score =  283 bits (724), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 203/687 (29%), Positives = 342/687 (49%), Gaps = 93/687 (13%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
           T FP  L  AA++N  +     +A + EA         G+T+ +AP I++ RDPRWGR  
Sbjct: 112 TVFPIPLAMAAAWNPEVARQTSAAASTEALT------DGVTWVFAPMIDVSRDPRWGRIA 165

Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
           E+ GEDP + +AY   +V+                           G + D G     ++
Sbjct: 166 ESIGEDPYLTAAYGRAWVE---------------------------GSQIDNGPGRATAS 198

Query: 242 CCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           C KH   Y + + G      +N + +++++  D   PPF+  +E G  S IM S+N++NG
Sbjct: 199 CPKHFAGYGMAEAGR----DYNTVDLSDRELRDIILPPFQDAVEAGALS-IMASFNEING 253

Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
           +PAC    L +   R+EWGF+G + SD +A+  +          E++    + AG D++ 
Sbjct: 254 IPACANEYLLKTILRDEWGFEGVVASDYNALVELI-VHGVAANEEEACEMTVLAGCDMDM 312

Query: 360 GTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS 418
            + +  R     +  G+V E  +D ++  + +++++LGL   +  K    +        S
Sbjct: 313 HSGIFTRQLPKLVRAGRVPESVVDDSVRRILAMKIKLGLL--EQSKSDVSQSAATQPLKS 370

Query: 419 EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG-----GYTGIP 473
           E+ +LA +AARQ IVLL+N ++ LPL+K A +S+A+IGPL +N +   G     G +   
Sbjct: 371 EYVELAREAARQSIVLLQNKEQVLPLSK-AGASIAVIGPLADNATDPLGCWALDGRSDEV 429

Query: 474 CSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
            +    +R   A  +   YA GC D+  +S+ GF  A+  A+ +D V+++ G   T   E
Sbjct: 430 VTALEGIRQAAAEGTSIRYAQGC-DIDSDSEEGFEAALEAARSSDVVVMLLGESATMSGE 488

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
            R R +L LPG+Q +LV +VA+  K P++ V+  G PL  +FA    Q S+I+   + G 
Sbjct: 489 SRSRAALDLPGKQRALVEAVAKLGK-PIVAVILSGRPL--TFAWLPEQASAIVQAWHLGV 545

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY---TG 650
               A+A+++FGDFNP GRLP+T +P++  ++P+   + R  + R   G    +Y   T 
Sbjct: 546 QSGNAIADVLFGDFNPSGRLPVT-FPQNVGQIPI--YHYRKKTGRPPAGAYSSYYIDSTT 602

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
             +Y FG+GL+YT + Y  +      T  +S+  G+D+                      
Sbjct: 603 EPLYPFGYGLTYTEFEYGAIQ-----TSKSSI--GADE---------------------- 633

Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
             +  V +S+ N G++ G  VV  + R    S   P K+L+ F +V   A  S +++F +
Sbjct: 634 --QLDVTVSIRNVGNLAGEEVVQCYVRDEVASVTQPLKRLVAFRKVKVAAGESVDVTFTI 691

Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVG 797
              E L+I +KH +R +  G+  L +G
Sbjct: 692 GAAE-LAILDKHMKRTVEPGDFTLWIG 717


>gi|381170979|ref|ZP_09880130.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
 gi|380688543|emb|CCG36617.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
          Length = 901

 Score =  283 bits (724), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 162/463 (34%), Positives = 248/463 (53%), Gaps = 45/463 (9%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +T  S   RA  L+S +TL+EK  Q+ + A AIPRL +PAY+WW+E+LHG+A  G  
Sbjct: 33  PYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
                     T FPQ +  AA+F+  L   + +A++ EARA ++  L Q       GLTF
Sbjct: 91  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTF 142

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R+    
Sbjct: 143 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGAAAPKNAQGEPYRK---- 198

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                         L A  KH   +   +     R+ F+A  +++D  +T+ P F + ++
Sbjct: 199 --------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVK 241

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
           +GK   +M +YN+V G  A     L Q   R++WGF+GY+ SDC A+  I+++     T 
Sbjct: 242 EGKVDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATR 301

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
           E +AA  +K G ++ CG        +A+ +G + E  ID AL  L + ++RLG+F+  P 
Sbjct: 302 EQAAALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPG 359

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
           +  +  +      +  H  LA   AR+ +VLLKND   LPL++  +  +A+IGP  ++  
Sbjct: 360 QLPWSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTM 418

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
            + G Y G P +P ++L+G+ A    ++  YA G   V    D
Sbjct: 419 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461



 Score =  133 bits (334), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 98/320 (30%), Positives = 144/320 (45%), Gaps = 59/320 (18%)

Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
           L+   AY  +  Y  G  D           +     EA+ +A+ AD V+ V GL    E 
Sbjct: 591 LQAGRAYDVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 650

Query: 533 EDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           E+           DR  L LP  Q  L+ ++  T  RPV+ VLT G  L + +A+    +
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATG-RPVVAVLTTGSALAIDWAQ--QHL 707

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
            +IL   YPG+ G  A+A+ +FGD NPGGRLP+T+Y ES T    +D  MR        G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
           R+YR++ GT +Y FGHGLSYT ++Y                          +G RLD   
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY--------------------------SGLRLDRTT 793

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           I    S T+      ++V N G   G  VV L+       +    K+L GF R+      
Sbjct: 794 IATDGSLTAT-----VTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGE 848

Query: 763 SKEISFGVDPCEQLSIANKH 782
            +E+ F ++  + L + ++ 
Sbjct: 849 QRELGFTINAKDALRLYDEQ 868


>gi|294667502|ref|ZP_06732718.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292602731|gb|EFF46166.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 901

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 162/463 (34%), Positives = 248/463 (53%), Gaps = 45/463 (9%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +T  S   RA  L+S +TL+EK  Q+ + A AIPRL +PAY+WW+E+LHG+A  G  
Sbjct: 33  PYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
                     T FPQ +  AA+F+  L   + +A++ EARA ++  L Q       GLTF
Sbjct: 91  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTF 142

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R+    
Sbjct: 143 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGGDAPKNAQGERYRK---- 198

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                         L A  KH   +   +     R+ F+A  +++D  +T+ P F + ++
Sbjct: 199 --------------LDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVK 241

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
            GK   +M +YN+V G  A     L Q   R++WGF+GY+ SDC A+  I+++     T 
Sbjct: 242 DGKVDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATR 301

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
           E +AA  +K G ++ CG        +A+ +G + E  ID AL  L + ++RLG+F+  P 
Sbjct: 302 EQAAALAVKHGTELECGE-EYSTLPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPG 359

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
           +  + ++      +  H  LA   AR+ +VLLKND   LPL++  +  +A+IGP  ++  
Sbjct: 360 QLPWSQIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRARLKRIAVIGPTADDTM 418

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
            + G Y G P +P ++L+G+ A    ++  YA G   V    D
Sbjct: 419 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 144/320 (45%), Gaps = 59/320 (18%)

Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
           L+   AY  +  Y  G  D           +     EA+ +A+ A+ V+ V GL    E 
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSAEVVVFVGGLTGDVEG 650

Query: 533 EDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           E+           DR  L LP  Q  L+ ++  T K PV+ VLT G  L + +A+    +
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEALHATGK-PVVAVLTTGSALAIDWAQ--QHL 707

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
            +IL   YPG+ G  A+A+ +FGD NPGGRLP+T+Y ES T    +D  MR        G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
           R+YR++ GT +Y FGHGLSYT ++Y                          +G RLD   
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY--------------------------SGLRLDRTT 793

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           I    S T+      ++V N G   G  VV L+       +    K+L GF R+      
Sbjct: 794 IATDGSLTAT-----VTVKNTGQRAGDEVVQLYLHPLTPQRERAGKELHGFQRIALTPGE 848

Query: 763 SKEISFGVDPCEQLSIANKH 782
            +E+ F ++  + L + ++ 
Sbjct: 849 QRELGFTINAKDALRLYDEQ 868


>gi|146301622|ref|YP_001196213.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
           UW101]
 gi|146156040|gb|ABQ06894.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 875

 Score =  283 bits (723), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 164/448 (36%), Positives = 232/448 (51%), Gaps = 53/448 (11%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           + F N SLS   R   L+S LTL+EK+ Q+ +++  I RLGIPAY+WW+E+LHG+A    
Sbjct: 27  FQFQNPSLSFEQRVDDLVSRLTLEEKVSQMLNSSPEIARLGIPAYDWWNETLHGVAR--- 83

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGL 162
                 T    T +PQ +  AA+F+++    +    A+E RA+YN             GL
Sbjct: 84  ------TPFKTTVYPQAIGMAATFDKNSLFTMADYSALEGRAIYNKAVELKRTNERYLGL 137

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           T+W PNINIFRDPRWGRGQET GEDP + +     FVK  QG++ K              
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDDPKY------------- 184

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
                         L  +AC KH   Y +       R++F+  +T  +  DT+ P FR  
Sbjct: 185 --------------LKAAACAKH---YAVHSGPESLRHTFDVDVTPYELWDTYLPAFRKL 227

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           I +   + +MC+YN     P C    L     R EW F GY+TSDC A+   F+      
Sbjct: 228 ITESNVAGVMCAYNAFRTQPCCASDILMNDILRKEWKFDGYVTSDCWAIDDFFKNHKTHP 287

Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
             E +AA  +  G DI+CGT   +    A+  GK+ EK ID ++  LF ++ RLG+F  D
Sbjct: 288 DAESAAADAVFHGTDIDCGTDAYKALVQAVKNGKISEKQIDISVKRLFMIRFRLGMF--D 345

Query: 402 P-RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
           P    KY +     + + EH+  AL  ARQ IVLLKN+K  LPLNKN +  + ++GP  +
Sbjct: 346 PVSMVKYAQTPSSVLESKEHQLHALKMARQSIVLLKNEKNILPLNKN-LKKIVVLGPNAD 404

Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVS 488
           N   + G Y G P    ++L+G++  VS
Sbjct: 405 NAISILGNYNGTPSKLTTVLQGIKEKVS 432



 Score =  116 bits (290), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 84/287 (29%), Positives = 133/287 (46%), Gaps = 54/287 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
           F + +   K AD  I   G+    E E+           DR S+L P  Q  L+ ++ ++
Sbjct: 596 FADLIEHHKNADAFIFAGGISPQLEGEEMPVDFPGFKGGDRTSILFPEVQTKLLKAL-QS 654

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
           S +PV+  +  G  + + + EA++ I +IL I Y G++   A A++IFGD+NP GRLP+T
Sbjct: 655 SGKPVVFAMMTGSAIAIPW-EAEN-IPAILNIWYGGQSAGTAAADVIFGDYNPAGRLPVT 712

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y         ND ++ +    +   ++YR++ GT +YGFG+GLSYT++ Y  L  P   
Sbjct: 713 FYK--------NDSDLPSFVDYKMDNKTYRYFKGTPLYGFGYGLSYTSFKYSDLKTP--- 761

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                +K G   +IL                          + V N G  +G  V  L+ 
Sbjct: 762 ---VKIKKGQSVSIL--------------------------VKVANTGKTEGEEVAQLYL 792

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
                +  TP K L GF+R +     +K I+F + P E LS     G
Sbjct: 793 INQDTAIKTPLKSLKGFERFNLKPGENKTITFNLSP-EDLSYVTPEG 838


>gi|427386425|ref|ZP_18882622.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726465|gb|EKU89330.1| hypothetical protein HMPREF9447_03655 [Bacteroides oleiciplenus YIT
           12058]
          Length = 864

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 161/445 (36%), Positives = 228/445 (51%), Gaps = 49/445 (11%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           Y F N  L +  R   L+  LTL+EKI Q+ +NA AI RLGIPAY WW+E LHG+A    
Sbjct: 24  YKFQNPDLPVEERVNDLVGHLTLEEKISQMMNNAPAIERLGIPAYNWWNECLHGVAR--- 80

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
                 +   VTSFPQ +  AA+++      +    + E RA+Y+            GLT
Sbjct: 81  ------SPYPVTSFPQAIAMAATWDTKSVYQMAEYASDEGRAIYHDAARKGTPGIFRGLT 134

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           +W+PNINIFRDPRWGRGQET GEDP + +A  V FVK  QG                   
Sbjct: 135 YWSPNINIFRDPRWGRGQETYGEDPYLTAAIGVAFVKGLQG------------------- 175

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D    L  SAC KH   +   +W   +R+++NA ++  D  DT+ P FR  +
Sbjct: 176 --------DDPVYLKSSACAKHYAVHSGPEW---NRHTYNAEVSNHDLWDTYLPAFRELV 224

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
              K + +MC+YN     P C    L     RN+W F GY+TSDC A+   +   N  + 
Sbjct: 225 VDAKVTGVMCAYNSFFEQPCCGNDLLMMDILRNQWKFDGYVTSDCGAIEDFYNTHNTHED 284

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
             +++A  +  G D  CG    R    AI +G + E+ +D +L  LF ++ RLG+F+ D 
Sbjct: 285 AAEASADAVLHGTDCECGNGAYRALADAIVRGLITEEQVDVSLKKLFEIRFRLGMFDPDD 344

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
           R   Y  +    +    HK  AL  ARQ IVLLKN+K+ LPL+ N +  +A++GP  ++ 
Sbjct: 345 RV-PYSDIPISVLECDAHKAHALKMARQSIVLLKNEKQLLPLDMNKIKKIAVVGPNADDK 403

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYV 487
           S +   Y G P    ++L G++  V
Sbjct: 404 SVLLANYYGYPSCVTTVLEGIKGKV 428



 Score =  117 bits (294), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 105/376 (27%), Positives = 166/376 (44%), Gaps = 76/376 (20%)

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNS 503
           L  + ++ L I G   N +  +   Y      P +  +G ++Y+ + HY     +     
Sbjct: 527 LKADDMADLYIDGVKQNKVGNINSYY------PLNAEKG-KSYLMEIHYVQHADNAEIKF 579

Query: 504 DAG------FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
           D G      + +     K AD V+ V GL    E E+           DR S+ +P  Q 
Sbjct: 580 DMGTLRKADYRQTASSVKDADVVVFVGGLSAKVEGEEMKVEIDGFKRGDRTSISIPVVQQ 639

Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
           +L+  +  T K PVI +L  G  + + +      + +IL   Y G+AG +A+A+++FGD+
Sbjct: 640 NLLKELYATGK-PVIFILMTGSAVGLEWES--EHLPAILNAWYGGQAGGQAIADVLFGDY 696

Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
           NP GRLP+T+Y        +ND+    D S +   R+YR++TG  VY FG+GLSYT++ Y
Sbjct: 697 NPSGRLPLTFYKN------VNDLPDFEDYSMK--NRTYRYFTGIPVYPFGYGLSYTDFQY 748

Query: 668 KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
             +     L          DK                       L   V   V+N G  +
Sbjct: 749 NTIKVQPSL----------DK-----------------------LSVKVTAEVSNVGKYE 775

Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRIL 787
           G  VV L+   P+    TP + L GF R++     S+ + F V   ++LS+ +       
Sbjct: 776 GEEVVQLYVSNPR-DFVTPIRALKGFRRINLKPGESQMVEF-VLTSKELSVVDV------ 827

Query: 788 PLGNHVLMVGELRHSL 803
             GN V M GE++ SL
Sbjct: 828 -AGNFVPMKGEVQISL 842


>gi|423248809|ref|ZP_17229825.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
           CL03T00C08]
 gi|423253758|ref|ZP_17234689.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
           CL03T12C07]
 gi|392655387|gb|EIY49030.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
           CL03T12C07]
 gi|392657750|gb|EIY51381.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
           CL03T00C08]
          Length = 805

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 232/738 (31%), Positives = 346/738 (46%), Gaps = 124/738 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A+
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +        V+ FQGE    
Sbjct: 188 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 240

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           +DG        K V+                A  KH  +Y    W         A I E+
Sbjct: 241 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 273

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G P    R  L    ++ W FKG++ SD  
Sbjct: 274 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 332

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           AV  + E+      +E +A   + AG+D + GT +      +A+ +G V    ID+A+  
Sbjct: 333 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 391

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           + S++ ++GLF+ DP   +  K     V +SEH  LA + ARQ IVLLKN  K LPL K+
Sbjct: 392 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 448

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
            + +LA+IGP  +N+  M G YT      +  ++L G+   VSK     YA GC  V  +
Sbjct: 449 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 506

Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
           S  GF +A+  A+ AD V++V G     D + E E+                    DR +
Sbjct: 507 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 566

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L L G+Q+ L+  ++R  K PV+LVL  G PL +  A  +++     W  YPG  G  A+
Sbjct: 567 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 623

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           A+++FGD+NP GRL ++  P S  ++P+   N R   +R      Y    GT  Y FG+G
Sbjct: 624 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYVEEPGTPRYPFGYG 677

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT +SY          +   +  GSD                     C   R  V ++
Sbjct: 678 LSYTTFSYT--------DMKVQVTEGSD--------------------DC---RVDVTVT 706

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           + N G  DG  V  L+ +    S  TP KQL  F R+H  A  S+E++F +D  + L++ 
Sbjct: 707 IQNQGTADGDEVAQLYFQDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 765

Query: 780 NKHGRRILPLGNHVLMVG 797
            + G  ++  G   +MVG
Sbjct: 766 MQEGEWVVEPGRFTIMVG 783


>gi|391417909|gb|AFM44649.1| Xyl3A [Caldanaerobius polysaccharolyticus]
          Length = 789

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 222/756 (29%), Positives = 345/756 (45%), Gaps = 146/756 (19%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA     ES  G  + G            T FPQ +  A+++N  L   +GS +  
Sbjct: 98  RLGIPAL-VHEESCSGYMAKG-----------ATCFPQTIGVASTWNTELVKQMGSVIRE 145

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + +A+           AP +++ RD RWGR +ET GEDP ++S   V +++  QG N K 
Sbjct: 146 QMKAV-----GAHQALAPLMDVARDARWGRVEETFGEDPYLISEMGVSYIEGLQGGNIK- 199

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN---AMI 266
            DG                          + A  KH + Y       FS    N   A I
Sbjct: 200 -DG--------------------------IMATVKHFVGY------GFSEGGMNWAPAHI 226

Query: 267 TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITS 325
            E++  + F  PF + +++ K + +M +Y++++G+P      L  Q  RNEWGF G + S
Sbjct: 227 PERELREVFLLPFEAAVKKAKTASVMAAYHELDGIPCHGSKKLLTQILRNEWGFDGLVVS 286

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQEKDIDR 383
           D   V  ++EY +  +   ++A   L+AG+DI   +  C  +  + A+ KG V+E  ID 
Sbjct: 287 DYFGVNMLYEYHHVARDKGEAAKIALQAGVDIELPSRDCYGQPLKEAVQKGLVEEALIDE 346

Query: 384 ALLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
            +  +  ++   G+F     D  K        +   T + +KLA   A+Q IVLLKN   
Sbjct: 347 VVRRILRMKFLSGVFENPYVDVEKAA------EVFDTPDQRKLAYKLAQQSIVLLKNQGD 400

Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC----------------SP-------- 476
            LPL K+ + S+A+IGP  +++  + G Y   PC                +P        
Sbjct: 401 LLPLKKD-IKSIAVIGPNADSVRNIIGDYA-YPCHIESLVETKEQSNVFNTPVPDKVSLV 458

Query: 477 ------KSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-- 525
                 KS+L G++  +S   + HYA GC +V  +   GF EA+  AKK+D  IVV G  
Sbjct: 459 DNFVPIKSILEGIKGKISPETELHYAKGC-EVTGDDKGGFAEAIEAAKKSDVAIVVVGDK 517

Query: 526 ---LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
               D     E RDR  L LPG Q  LV ++  T   P ++VL  G PL +++      I
Sbjct: 518 AGLTDDCTSGESRDRADLNLPGVQQELVEAIYNTGT-PTVVVLVNGRPLSINWI--SRHI 574

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
            +I+    PGE GA A+A+++FGD+NPGG+LP++ +P S  +VP+   N +    R +  
Sbjct: 575 PAIIEAWLPGEEGAAAVADVLFGDYNPGGKLPVS-FPRSVGQVPVY-YNHKPSGGRSHWK 632

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYV 701
             Y   +   +Y FG+GLSYT + +  L  APSE+     ++                  
Sbjct: 633 GDYVEMSTKPLYPFGYGLSYTKFEFSNLEIAPSEVYDDGKVR------------------ 674

Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
                         + + V NAG ++G  VV L+ R    +   P K+L GF RV     
Sbjct: 675 --------------ISVDVQNAGKLEGDEVVQLYVRNEVSNVTRPVKELKGFKRVSLRPG 720

Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             K++ F +    QL   ++  R ++  G   +M+G
Sbjct: 721 EKKKVVFELS-VSQLGFYDEDMRYVVQPGTVKVMIG 755


>gi|418518550|ref|ZP_13084692.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|418522850|ref|ZP_13088880.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410700720|gb|EKQ59264.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410703176|gb|EKQ61671.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 901

 Score =  282 bits (722), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 162/463 (34%), Positives = 247/463 (53%), Gaps = 45/463 (9%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +T  S   RA  L+S +TL+EK  Q+ + A AIPRL +PAY+WW+E+LHG+A  G  
Sbjct: 33  PYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG-- 90

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTF 164
                     T FPQ +  AA+F+  L   + +A++ EARA ++  L Q       GLTF
Sbjct: 91  --------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTF 142

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R+    
Sbjct: 143 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGERYRK---- 198

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                         L A  KH   +   +     R+ F+A  +++D  +T+ P F + ++
Sbjct: 199 --------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVK 241

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
            GK   +M +YN+V G  A     L Q   R++WGF+GY+ SDC A+  I+++     T 
Sbjct: 242 DGKVDAVMGAYNRVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVATR 301

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
           E +AA  +K G ++ CG        +A+ +G + E  ID AL  L + ++RLG+F+  P 
Sbjct: 302 EQAAALAVKHGTELECGEEYAT-LPAAVRQGLIDEAQIDTALKTLMTARMRLGMFD-PPG 359

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
           +  +  +      +  H  LA   AR+ +VLLKND   LPL++  +  +A+IGP  ++  
Sbjct: 360 QLPWSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTM 418

Query: 464 QMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
            + G Y G P +P ++L+G+ A    ++  YA G   V    D
Sbjct: 419 ALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461



 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 98/320 (30%), Positives = 144/320 (45%), Gaps = 59/320 (18%)

Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
           L+   AY  +  Y  G  D           +     EA+ +A+ AD V+ V GL    E 
Sbjct: 591 LQAGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEG 650

Query: 533 EDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           E+           DR  L LP  Q  L+ ++  T K PV+ VLT G  L + +A+    +
Sbjct: 651 EEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK-PVVAVLTAGSALAIDWAQ--QHL 707

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
            +IL   YPG+ G  A+A+ +FGD NPGGRLP+T+Y ES T    +D  MR        G
Sbjct: 708 PAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYKESETLPAFDDYAMR--------G 759

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
           R+YR++ GT +Y FGHGLSYT ++Y                          +G RLD   
Sbjct: 760 RTYRYFGGTPLYPFGHGLSYTQFAY--------------------------SGLRLDRTT 793

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           I    S T+      ++V N G   G  VV L+       +    K+L GF R+      
Sbjct: 794 IATDGSLTAT-----VTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGE 848

Query: 763 SKEISFGVDPCEQLSIANKH 782
            +E+ F ++  + L + ++ 
Sbjct: 849 QRELGFTINAKDALRLYDEQ 868


>gi|78049893|ref|YP_366068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
 gi|78038323|emb|CAJ26068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 902

 Score =  282 bits (722), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 162/462 (35%), Positives = 246/462 (53%), Gaps = 45/462 (9%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +T  S   RA  L+S +TL+EK  Q+ + A AIPRLG+PAY+WW+E+LHG+A  G   
Sbjct: 35  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTFW 165
                    T FPQ +  AA+F+  L   + +A++ EARA ++  L Q       GLTFW
Sbjct: 92  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP + +   V FV+  +GE   +     G  +R+     
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLRGEGADAPKNAQGEPYRK----- 199

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                        L A  KH   +   +     R+ F+A  +++D  +T+ P F + ++ 
Sbjct: 200 -------------LDATAKHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKD 243

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           GK   +M +YN+V G  A     L Q   R +WGFKGY+ SDC A+  I+++     T E
Sbjct: 244 GKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATRE 303

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  +K G ++ CG        +A+ +G + E  ID AL  L + ++RLG+F+  P +
Sbjct: 304 QAAALAVKHGTELECGE-EYSTLPAAVRQGLIDEAQIDTALTTLMTARMRLGMFD-PPGQ 361

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             +  +      +  H  LA   AR+ +VLLKND   LPL++  +  +A+IGP  ++   
Sbjct: 362 LPWSTIPASVNQSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMA 420

Query: 465 MGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
           + G Y G P +P ++L+G+ A    ++  YA G   V    D
Sbjct: 421 LLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGADLVEGRDD 462



 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 94/282 (33%), Positives = 131/282 (46%), Gaps = 52/282 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             EA+ +A  AD V+ V GL    E E+           DR  L LP  Q  L+ ++  T
Sbjct: 626 LQEALDVASSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQAT 685

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV+ VLT G  L + +A+    + +IL   YPG+ G  A+A+ +FGD NPGGRLP+T
Sbjct: 686 GK-PVVAVLTTGSALAIDWAQ--QHLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 742

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y ES T    +D  MR        GR+YR++ GT +Y FGHGLSYT ++Y         
Sbjct: 743 FYKESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAY--------- 785

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                            +G RLD   I    S T+      ++V N G   G  VV L+ 
Sbjct: 786 -----------------SGLRLDRTTIAADGSLTAT-----VTVKNTGQRAGDEVVQLYL 823

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
                 +    K+L GF R+   A   + + F +D    L I
Sbjct: 824 HPLTPQRERAGKELHGFQRITLQAGEQRALHFILDAKNALRI 865


>gi|315607027|ref|ZP_07882031.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315251081|gb|EFU31066.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 866

 Score =  282 bits (721), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 156/457 (34%), Positives = 249/457 (54%), Gaps = 44/457 (9%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YP+ N  LS   RA+ L S LTL+EK + + +++ AIPRLGIP +EWWSE+LHGIA NG 
Sbjct: 23  YPYQNLQLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG- 81

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
                      T FPQ    AAS++  L   +  A + EA A  NL +         G++
Sbjct: 82  ---------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIKRYQGVS 132

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PNINIFRDPRWGRGQET GEDP + S   +  V   QG+ ++ D             
Sbjct: 133 IWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRD------------- 179

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSC 282
           ++ F E       L   AC KH   +   +W   +R+ F+   + E+D  +T+ P F+S 
Sbjct: 180 MRPFTERPRYYKTL---ACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSL 233

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFE--YQNY 339
           +++G    +MC+Y +++G P C     L Q  R EWG+ G + SDC A++  +   + + 
Sbjct: 234 VQEGNVREVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHV 293

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
            +T  +++A  ++AG D+ CG  +      A+++G +  + ID +++ L   +  +G F+
Sbjct: 294 VETPAEASAMGVRAGTDVECG-AVYATLPRAVEQGLISREAIDTSVVRLLKARFEVGDFD 352

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            + +   +   GP+ + +  H++LALD AR+ + LL+N  + LPL+KN +  +A++GP  
Sbjct: 353 SE-KLVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGL-RIAVMGPNA 410

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
           N+   + G YTG P S  ++L+G+ + V    +  GC
Sbjct: 411 NDSVMLWGNYTGYPISTTTILKGIRSKVPAARFVEGC 447



 Score =  105 bits (262), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 93/334 (27%), Positives = 143/334 (42%), Gaps = 65/334 (19%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           YV     A    D+   S     E    A  AD V+ V G+    E E+           
Sbjct: 580 YVQNKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFNGG 639

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR S+ LP  Q  ++  + +  K  V +  +GG    V  AEA     ++L   Y GEAG
Sbjct: 640 DRTSIELPEAQREVIRLLRQAGKLVVFVNCSGGAVALVPEAEA---CDAVLQAWYAGEAG 696

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
            +A+A+++FGD+NP G+LP+T+Y          D  M         GR+YR++ GT ++ 
Sbjct: 697 GQAVADVLFGDYNPSGKLPVTFYKSDADLPDFLDYRM--------TGRTYRYFRGTPLFP 748

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG GLSYT++ +                 G+ +    + G                    
Sbjct: 749 FGFGLSYTSFVF-----------------GTPR---YENG-------------------K 769

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           + + VTN G  DG+ VV ++ + P  + G P K L GF R+   A   + +   + P E+
Sbjct: 770 LYVEVTNTGKRDGAEVVQVYVKNPADADG-PVKTLRGFARIDLKAGERRRVEIAM-PRER 827

Query: 776 LS--IANKHGRRILPLGNHVLMVGELRHSLTIET 807
                A  +  R+ P GNH+LMVG       ++T
Sbjct: 828 FEGWDATTNTMRVKP-GNHLLMVGSSSRDADLQT 860


>gi|408369545|ref|ZP_11167326.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
 gi|407745291|gb|EKF56857.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
          Length = 881

 Score =  282 bits (721), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 168/443 (37%), Positives = 239/443 (53%), Gaps = 55/443 (12%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N  L  S R   L+  LT++EKI QL   + AI RLGIP Y WW+ESLHG+A  G 
Sbjct: 27  YPFQNPELDDSARVADLLERLTVEEKIDQLLYTSPAIERLGIPEYNWWNESLHGVARAG- 85

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQ----AGLT 163
                      T FPQ +  AA+++  L   +  A++ EARA ++     GQ     GLT
Sbjct: 86  ---------YATVFPQSITIAAAWDSDLLKEVADAISDEARAKHHEYIRRGQRGIYQGLT 136

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+PNINIFRDPRWGRG ET GEDP +     + +VK  QG +                 
Sbjct: 137 FWSPNINIFRDPRWGRGHETYGEDPYLTGQLGIAYVKGLQGND----------------- 179

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                      + L L A  KH   +   +     R+ F+   +++D  +T+ P FR  +
Sbjct: 180 ----------PNYLKLVATAKHFAVHSGPE---PLRHEFDVSPSKRDLWETYLPAFRYLV 226

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
           +QG    +M +YN+V G  A     LF   R+ W F GY+ SDC A++ I++Y    K  
Sbjct: 227 KQGDVKSVMTAYNRVYGEAASASDTLFTILRDYWDFDGYVVSDCFAISDIWKYHKIAKDA 286

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
            +++A  +  G D+NCG    +  Q A  +G V EKDID AL  L   +++LG+F  DP 
Sbjct: 287 AEASAMAVIEGCDLNCGDSYEKLNQ-AYQQGMVTEKDIDIALSRLMEARIKLGMF--DPE 343

Query: 404 K-GKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           +   Y ++ P +V TSE H +LAL AA++ IVLLKN    LPL+K+ + S+A+IGP  +N
Sbjct: 344 QLVPYAQI-PFNVNTSEKHNQLALKAAKESIVLLKNQGDLLPLSKD-LKSVAVIGPNADN 401

Query: 462 ISQMGGGYTGIPCSPKSLLRGLE 484
           I  + G Y G P  P ++L+G++
Sbjct: 402 IQSLWGNYNGNPKDPITVLQGIQ 424



 Score =  136 bits (343), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 84/263 (31%), Positives = 135/263 (51%), Gaps = 46/263 (17%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR +L LP  Q +L+  VA+T K P++LVL  G  L +++A  +  I +I+  GY G+ G
Sbjct: 631 DRTALDLPASQRTLLKEVAKTGK-PIVLVLLNGSALSINWAAEN--IPAIMTAGYAGQQG 687

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
             A+AE++FGD+NP  RLP+T+Y +S   +P   D NM         GR+YR++    +Y
Sbjct: 688 GNAVAEVLFGDYNPAARLPVTYY-KSVEDLPDFEDYNMD--------GRTYRYFEKEPLY 738

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT + Y     PS++ ++ S++                               
Sbjct: 739 PFGYGLSYTTFDYSKFQLPSKIDMNESIE------------------------------- 767

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
            + + VTN G  DG  VV ++    K S   P ++L+GF R+H     S+++ F ++P  
Sbjct: 768 -LSVEVTNTGAYDGDEVVQVYLTDEKGSTPRPIRELVGFKRIHLKKGESQKVQFTIEP-R 825

Query: 775 QLSIANKHGRRILPLGNHVLMVG 797
           QLS+ +  G  ++  G   + VG
Sbjct: 826 QLSMIDDKGDLVIEPGVFSISVG 848


>gi|383117091|ref|ZP_09937838.1| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
 gi|382973702|gb|EES87886.2| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
          Length = 805

 Score =  282 bits (721), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 233/738 (31%), Positives = 347/738 (47%), Gaps = 124/738 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A+
Sbjct: 140 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 187

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +        V+ FQGE    
Sbjct: 188 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 240

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           +DG        K V+                A  KH  +Y    W         A I E+
Sbjct: 241 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 273

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G P    R  L    ++ W FKG++ SD  
Sbjct: 274 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 332

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           AV  + E+      +E +A   + AG+D + GT +      +A+ +G V    ID+A+  
Sbjct: 333 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 391

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           + S++ ++GLF+ DP   +  K     V +SEH  LA + ARQ IVLLKN  K LPL K+
Sbjct: 392 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 448

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
            + +LA+IGP  +N+  M G YT      +  ++L G+   VSK     YA GC  V  +
Sbjct: 449 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 506

Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
           S  GF +A+  A+ AD V++V G     D + E E+                    DR +
Sbjct: 507 SRTGFKDAIETARNADTVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 566

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L L G+Q+ L+  ++R  K PV+LVL  G PL +  A  +++     W  YPG  G  A+
Sbjct: 567 LHLMGRQLELLEEISRLGK-PVVLVLIKGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 623

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           A+++FGD+NP GRL ++  P S  ++P+   N R   +R      Y    GT  Y FG+G
Sbjct: 624 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYVEEPGTPRYPFGYG 677

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT +SY                  +D  +    GS  D   +D           V ++
Sbjct: 678 LSYTTFSY------------------TDMKVQVTEGS--DDCWVD-----------VTVT 706

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           + N G  DG  V  L+ R    S  TP KQL  F R+H  A  S+E++F +D  + L++ 
Sbjct: 707 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 765

Query: 780 NKHGRRILPLGNHVLMVG 797
            + G  ++  G   +MVG
Sbjct: 766 MQEGEWVVEPGRFTIMVG 783


>gi|402304900|ref|ZP_10823963.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400380686|gb|EJP33499.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 866

 Score =  281 bits (720), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 156/457 (34%), Positives = 249/457 (54%), Gaps = 44/457 (9%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YP+ N  LS   RA+ L S LTL+EK + + +++ AIPRLGIP +EWWSE+LHGIA NG 
Sbjct: 23  YPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG- 81

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
                      T FPQ    AAS++  L   +  A + EA A  NL +         G++
Sbjct: 82  ---------FATVFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIKRYQGVS 132

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PNINIFRDPRWGRGQET GEDP + S   +  V   QG+ ++ D             
Sbjct: 133 IWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRD------------- 179

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSC 282
           ++ F E       L   AC KH   +   +W   +R+ F+   + E+D  +T+ P F+S 
Sbjct: 180 MRPFTERPRYYKTL---ACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSL 233

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFE--YQNY 339
           +++G    +MC+Y +++G P C     L Q  R EWG+ G + SDC A++  +   + + 
Sbjct: 234 VQEGNVREVMCAYQRIDGSPCCGNTRYLHQILRGEWGYNGLVVSDCGAISDFYREGHHHV 293

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
            +T  +++A  ++AG D+ CG  +      A+++G +  + ID +++ L   +  +G F+
Sbjct: 294 VETPAEASAMGVRAGTDVECG-AVYATLPRAVEQGLISREAIDTSVVRLLKARFEVGDFD 352

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            + +   +   GP+ + +  H++LALD AR+ + LL+N  + LPL+KN +  +A++GP  
Sbjct: 353 SE-KLVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGL-RIAVMGPNA 410

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
           N+   + G YTG P S  ++L+G+ + V    +  GC
Sbjct: 411 NDSVMLWGNYTGYPISTTTILKGIRSKVPAARFVEGC 447



 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 93/334 (27%), Positives = 142/334 (42%), Gaps = 65/334 (19%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           YV     A    D+   S     E    A  AD V+ V G+    E E+           
Sbjct: 580 YVQNKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFKGG 639

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR S+ LP  Q  ++  + +  K  V +  +GG    V  AEA     ++L   Y GEAG
Sbjct: 640 DRTSIELPEAQREVIRLLRQAGKLVVFVNCSGGAVALVPEAEA---CDAVLQAWYAGEAG 696

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
            +A+A+++FGD+NP G+LP+T+Y          D  M         GR+YR++ GT ++ 
Sbjct: 697 GQAVADVLFGDYNPSGKLPVTFYKSDADLPDFLDYRM--------TGRTYRYFRGTPLFP 748

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG GLSYT++++               K   +  +L                        
Sbjct: 749 FGFGLSYTSFAFG--------------KPRYENGML------------------------ 770

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
             + VTN G  DG+ VV ++ + P  + G P K L GF R+   A   + +   + P E+
Sbjct: 771 -YVEVTNTGKRDGAEVVQVYVKNPADADG-PVKTLRGFARIDLKAGERRRVEIAM-PRER 827

Query: 776 LS--IANKHGRRILPLGNHVLMVGELRHSLTIET 807
                A  +  R+ P GNH+LMVG       ++T
Sbjct: 828 FEGWDATANTMRVKP-GNHLLMVGSSSRDADLQT 860


>gi|424796589|ref|ZP_18222299.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422794891|gb|EKU23686.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 913

 Score =  281 bits (719), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 160/462 (34%), Positives = 245/462 (53%), Gaps = 45/462 (9%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +T  S   RA  L+S +TL+EK  Q+ + A AIPRLG+PAY+WW+E+LHG+A  G   
Sbjct: 37  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFW 165
                    T FPQ +  AA+F+  L   + +A++ EARA ++            GLTFW
Sbjct: 94  -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R+     
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGDAYRK----- 201

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                        L A  KH   +   +     R+ F+A  +++D  +T+ P F + +++
Sbjct: 202 -------------LDATAKHFAVHSGPE---ADRHHFDAHPSQRDLYETYLPAFEALVKE 245

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           GK   +M +YN+V G  A     L +   R+ WGF GY+ SDC A+  I++      T E
Sbjct: 246 GKVDAVMGAYNRVYGESASASKFLLRDVLRDTWGFDGYVVSDCWAIVDIWKNHKIVATRE 305

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  +  G ++ CG        +A+ KG + E D+D+AL  L   ++RLG+F+  P  
Sbjct: 306 QAAALAVNNGTELECGE-EYSTLPAAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPDT 363

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
            ++ ++      + EH  LA   AR+ +VLLKND   LPL++  +  +A+IGP  ++   
Sbjct: 364 LRWAQIPLSANQSPEHDALARRTARESLVLLKND-GVLPLSRGKIKRIAVIGPTADDTMA 422

Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
           + G Y G P +P ++L+G+   A  ++  YA G   V    D
Sbjct: 423 LLGNYYGTPAAPVTVLQGIREAAPDAEVLYARGADLVEGRDD 464



 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 96/302 (31%), Positives = 146/302 (48%), Gaps = 53/302 (17%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             +A+  A++AD V+ V GL    E E+           DR  L LP  Q  L+ ++  T
Sbjct: 628 LQDALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQGT 687

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV+ VLT G  L + +A+    + +IL   YPG+ G  A+A+++FGD NPGGRLP+T
Sbjct: 688 GK-PVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVT 744

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y ES      +D  MR        GR+YR++ GT +Y FGHGLSYT ++Y      S+L
Sbjct: 745 FYKESEKLPAFDDYAMR--------GRTYRYFAGTALYPFGHGLSYTQFAY------SDL 790

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            +        D++ L   GS                  H  + V N G   G  VV L+ 
Sbjct: 791 RL--------DRSKLATDGS-----------------LHATLKVKNTGQRAGDEVVQLYL 825

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR-ILPLGNHVLM 795
                 +    K+L GF R+      ++E+SF + P   L + ++  +  ++  G++ L 
Sbjct: 826 HPLSPQRERARKELRGFQRIALQPGETREVSFAISPQTDLRLYDEARKAYVVDPGDYELQ 885

Query: 796 VG 797
           VG
Sbjct: 886 VG 887


>gi|297738404|emb|CBI27605.3| unnamed protein product [Vitis vinifera]
          Length = 581

 Score =  281 bits (719), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 150/388 (38%), Positives = 212/388 (54%), Gaps = 49/388 (12%)

Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
           L +S+CCKH   YD++ W N         ++EQD ++TF  PF                 
Sbjct: 19  LKVSSCCKHYATYDIDSWLN---------VSEQDMKETFFSPFE---------------- 53

Query: 297 QVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNY-TKTHEDSAAGVLKAGM 355
                            R+EW   GYI SDC  +  I + QNY  ++  D+ A  L+AG+
Sbjct: 54  -----------------RDEWDLHGYIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAGL 96

Query: 356 DINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
           D+ CG         ++  GKV + ++DRAL N++ + +R+G F+G P    Y  LG  D+
Sbjct: 97  DLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIP---AYESLGLKDI 153

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
           C ++H +LA +AARQGIVLLKND + LPL       L ++GP  N    M G Y G+P  
Sbjct: 154 CAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKLVLVGPHANATEVMIGNYAGLPYK 211

Query: 476 PKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR 535
             S L    A +    YA+GC D  C++D  F EA   AK A+  I+  G DL+ E E  
Sbjct: 212 YVSPLEAFSA-IGNVTYATGCLDASCSNDTYFSEAKEAAKFAEVTIIFVGTDLSIEAEFV 270

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DRV  LLPG Q  L+  VA  S  PVILV+  G  +D++FA+ + +IS+ILW+G+PGE G
Sbjct: 271 DRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQG 330

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFT 623
             A+A+++FG +NPGGRLP+TWY   + 
Sbjct: 331 GHAIADVVFGKYNPGGRLPVTWYEADYV 358


>gi|397642422|gb|EJK75223.1| hypothetical protein THAOC_03061, partial [Thalassiosira oceanica]
          Length = 534

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 192/584 (32%), Positives = 291/584 (49%), Gaps = 105/584 (17%)

Query: 239 LSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSCIEQGK---------- 287
           ++A CKHL AY LE      R++F+A  I   D E T+ P F +C+   +          
Sbjct: 10  IAATCKHLAAYSLET----DRFNFSADGIDRTDWEGTYLPAFDACVHAERFLLEHYNASG 65

Query: 288 ---------ASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQ 337
                    A  +MCSYN ++GVPAC    L +   R +W F G + SDC AV  I    
Sbjct: 66  GGGGGQDRGALGVMCSYNAIDGVPACADPALLKDMLRRDWNFTGLVVSDCWAVDNIHSNH 125

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
            +  ++E++    L++G+D++CG       + A D+  + E DID AL  LF V + LG 
Sbjct: 126 RFVASYEEAVGLALRSGVDLDCGNTFQDFGRLAYDESLLDEDDIDEALSRLFRVLMDLGY 185

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN-----DKKFLPLNKNAVSSL 452
           F  D       K   D++   EH +LAL+AA Q IVLLKN     +   LPL+      +
Sbjct: 186 F--DETDEPDAKSSDDEM---EHDQLALEAALQSIVLLKNGINEDEPGPLPLSLAKHKEI 240

Query: 453 AIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAY---VSKTHYASGCHDVPCNSDAGFHE 509
           A+ GPL +N + + G Y G+P +  + L GL      V+    AS C          FH 
Sbjct: 241 ALFGPLADNQTVLLGNYHGLPSTIVTPLMGLAKMGVEVAFRQRASVCD---------FH- 290

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK---RPVILVLT 566
                      I+V GLD + E ED+DR +LLLP +Q  L+ +++R SK    PV+LV+ 
Sbjct: 291 ------GESATILVVGLDQSLEAEDQDRTTLLLPVEQRDLIKTISRCSKVRDLPVVLVVV 344

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKV 625
            GG +D+S  +  S I +++ + YPG+ G  ALA++++G +NP G+L  T YPES+  +V
Sbjct: 345 SGGMVDLSRYKNSSDIDAMIHMSYPGQNGGSALAQVLYGAYNPSGKLVGTMYPESYLNEV 404

Query: 626 PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY--KFLSAPSELTISASLK 683
            ++DM MR D   ++PGR++R+Y G  +Y FG+GLSYT++ Y  +FL    ++T+S    
Sbjct: 405 SLHDMRMRPDG--KFPGRTHRYYRGDVIYPFGYGLSYTSFRYAMEFLGGTVKVTVS---- 458

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV-S 742
                                                 N+G +DGS  V+LF   P+  +
Sbjct: 459 --------------------------------------NSGSMDGSVAVLLFHSAPQAGN 480

Query: 743 QGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRI 786
           +  P + LIGF++++     S+ +SF V        A  H  RI
Sbjct: 481 EQEPFRSLIGFEKIYVSVGDSQLVSFDVSKRMNPGEAGSHTFRI 524


>gi|317477144|ref|ZP_07936385.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316906687|gb|EFV28400.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 814

 Score =  280 bits (717), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 228/743 (30%), Positives = 350/743 (47%), Gaps = 134/743 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G A+A 
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRRMGRAIAT 196

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GED  +        VK FQGE    
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQGE---- 247

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                 F   + +V+                A  KH  AY    W        +A +  +
Sbjct: 248 ------FPRTKGKVI----------------ATLKHFAAYG---WTEGGHNGGSAHVGNR 282

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G+P     +L     +  W FKG++ SD  
Sbjct: 283 EMEEAIYPPFREAVAAG-ALSVMSSYNEIDGIPCTANSNLLTGLLKKRWQFKGFVVSDLY 341

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLN 387
           A+  + E+     T  ++A   + AG+D + GT +      +A+ +G VQE  I++A+  
Sbjct: 342 AIGGLREH-GVADTDYEAAVKAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSR 400

Query: 388 LFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLP 443
           + +++  +GLF+    D R+       P+ V  S EH +LA + ARQ I+LLKN  + LP
Sbjct: 401 ILALKFHMGLFDHPFVDERE-------PEQVVASTEHLELAREVARQSIILLKNKNELLP 453

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSL---LRGLEAYVSK-TH--YASGCH 497
           LNK  + ++A+IGP  +NI  M G YT  P S  S+   L G+   VS  TH  YA GC 
Sbjct: 454 LNKK-MKTIAVIGPNADNIYNMLGDYTA-PQSESSVVTVLDGIRQKVSNDTHIIYAKGCA 511

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR------------------ 535
            V  +S +GF EA+  A+++D V++V G     D + + E+                   
Sbjct: 512 -VRDSSKSGFQEAIEAARQSDVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEG 570

Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR +L L G+Q  L+  V + +K P++LVL  G PL +   EA  ++ +I+   YPG  
Sbjct: 571 YDRSTLELLGRQRELIREVGKLNK-PIVLVLIKGRPLLLEGIEA--EVDAIVDAWYPGMQ 627

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
           G  A+A+++FGD+NP GRL ++  P S  ++P+     R  +  +Y         GT  Y
Sbjct: 628 GGNAVADVLFGDYNPAGRLTIS-VPRSVGQLPVYYNTKRKGNRSKYIEEE-----GTPRY 681

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT+++Y  L A                               + V +  S   
Sbjct: 682 PFGYGLSYTSFNYSDLKA-------------------------------EVVEAEDSCLV 710

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
           ++ + V N G  DG  VV L+ R    S  TP KQL GF R+H     +KEI+F +D  +
Sbjct: 711 NISVKVRNEGSRDGDEVVQLYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-K 769

Query: 775 QLSIANKHGRRILPLGNHVLMVG 797
            L++  ++    +  G   LM+G
Sbjct: 770 SLALYMQNEEWAVEPGRFTLMLG 792


>gi|399029285|ref|ZP_10730258.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398072895|gb|EJL64089.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 871

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 162/459 (35%), Positives = 249/459 (54%), Gaps = 56/459 (12%)

Query: 50  DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
           +++ F N +L+   R   L+S +++ EKI QL D++ AI RLG+P Y WW+ESLHG+A  
Sbjct: 22  ENFAFKNPNLTTEQRVDDLVSRMSIDEKISQLMDSSPAIERLGVPEYNWWNESLHGVARA 81

Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----LGQ----AG 161
           G            T FPQ +  A+S++R L  ++ + ++ EARA ++     GQ     G
Sbjct: 82  G----------YATVFPQSISIASSWDRQLIFDVANVISDEARAKHHEYLRRGQHGMYQG 131

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LTFW+PN+NIFRDPRWGRG ET GEDP +     +++V   QG N K             
Sbjct: 132 LTFWSPNVNIFRDPRWGRGHETYGEDPFLTGQLGLKYVNGLQGTNEKY------------ 179

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
                          L + A  KH   Y +      SR+ FNA  ++ D  +T+ P FR+
Sbjct: 180 ---------------LKVIATAKH---YAVHSGPEPSRHLFNAETSDIDLYETYLPAFRT 221

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
            +++G    +M +YN+  G        LF   RN WGF GYI SDC AV  I++Y   T 
Sbjct: 222 LVKEGHVYSVMGAYNRFRGESCSASPFLFNILRNVWGFDGYIVSDCGAVTDIWKYHKITG 281

Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
               ++A  LK G+D+ CG+   +  + AID+  + E DID A+  LF+ + +LG+F  D
Sbjct: 282 DAATASALALKDGLDLECGSS-FKSLKEAIDRKLISEADIDIAVKRLFTARFKLGMF--D 338

Query: 402 PRK-GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
           P +   Y ++       S H  LA  A+++ IVLLKN    LPL+++ + ++A+IGP  N
Sbjct: 339 PEEIVSYAQIPYSVNNNSAHDWLARVASQKSIVLLKNQNNTLPLSRD-IKTVAVIGPNAN 397

Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
           ++  + G Y+G+P +P ++L+G++  +   +K  YA G 
Sbjct: 398 DVQSLWGNYSGVPSNPITVLKGIQNKLEPNTKVLYAKGT 436



 Score =  155 bits (393), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 98/301 (32%), Positives = 150/301 (49%), Gaps = 54/301 (17%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             EAV++A +AD +++V GL+   E E+           DR SL LP  Q  L+ ++  T
Sbjct: 589 LQEAVQVAGQADAIVLVLGLNERLEGEEMKVEADGFEGGDRTSLDLPSNQEELMKAMTAT 648

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PVILVL  G  L +++A  +  + +IL  GYPG+ G  A+A+++FGD+NP GRLP+T
Sbjct: 649 GK-PVILVLINGSALSINWA--NDHVPAILTAGYPGQQGGNAIADVLFGDYNPAGRLPVT 705

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y  +       + +M+        GR+YR++    +Y FG GLSYT + Y  L  P+ +
Sbjct: 706 YYKSTEQLPAFENYDMK--------GRTYRYFQKKPLYPFGFGLSYTKFKYSNLKLPTNV 757

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
           T                                    F + + VTN G+ DG  V+ L+ 
Sbjct: 758 TPEKD--------------------------------FEILVDVTNIGERDGDEVIELYL 785

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           +  K S   P  QL GF+RV+     +K + F + P  QLS+ NK G+R++  G   + V
Sbjct: 786 KDEKASTPRPILQLEGFERVNLKKGETKTVRFTITP-RQLSLINKKGQRVIEPGWFTISV 844

Query: 797 G 797
           G
Sbjct: 845 G 845


>gi|189464211|ref|ZP_03012996.1| hypothetical protein BACINT_00548 [Bacteroides intestinalis DSM
           17393]
 gi|189438001|gb|EDV06986.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 814

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 229/743 (30%), Positives = 349/743 (46%), Gaps = 134/743 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G A+A 
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRRMGRAIAT 196

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GED  +        VK FQGE    
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQGE---- 247

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                 F   + +V+                A  KH  AY    W        +A +  +
Sbjct: 248 ------FPRTKGKVI----------------ATLKHFAAYG---WTEGGHNGGSAHVGNR 282

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNE-WGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G+P     +L      E W FKG++ SD  
Sbjct: 283 EMEEAIYPPFREAVAAG-ALSVMSSYNEIDGIPCTANSNLLTGLLKERWQFKGFVVSDLY 341

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLN 387
           A+  + E+     T  ++A   + AG+D + GT +      +A+ +G VQE  I++A+  
Sbjct: 342 AIGGLREH-GVADTDYEAAVKAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSR 400

Query: 388 LFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLP 443
           + +++  +GLF+    D R+       P+ V  S EH +LA + ARQ I+LLKN  + LP
Sbjct: 401 ILALKFHMGLFDHPFVDERE-------PEQVVASTEHLELAREVARQSIILLKNKNELLP 453

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSL---LRGLEAYVSK-TH--YASGCH 497
           LNK    ++A+IGP  +NI  M G YT  P S  S+   L G+   VS  TH  YA GC 
Sbjct: 454 LNKK-TKTIAVIGPNADNIYNMLGDYTA-PQSESSVVTVLDGIRQKVSNDTHIIYAKGCA 511

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR------------------ 535
            V  +S +GF EA+  A+++D V++V G     D + + E+                   
Sbjct: 512 -VRDSSKSGFQEAIEAARQSDVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEG 570

Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR +L L G+Q  L+  V + +K P++LVL  G PL +   EA  ++ +I+   YPG  
Sbjct: 571 YDRSTLELLGRQRELIREVGKLNK-PIVLVLIKGRPLLLEGIEA--EVDAIVDAWYPGMQ 627

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
           G  A+A+++FGD+NP GRL ++  P S  ++P+     R  +  +Y         GT  Y
Sbjct: 628 GGNAVADVLFGDYNPAGRLTIS-VPRSVGQLPVYYNTKRKGNRSKYIEEE-----GTPRY 681

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT+++Y  L A                               + V +  S   
Sbjct: 682 PFGYGLSYTSFNYSDLKA-------------------------------EVVEAEDSCLV 710

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
           ++ + V N G  DG  VV L+ R    S  TP KQL GF R+H     +KEI+F +D  +
Sbjct: 711 NISVKVRNEGSRDGDEVVQLYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-K 769

Query: 775 QLSIANKHGRRILPLGNHVLMVG 797
            L++  ++    +  G   LM+G
Sbjct: 770 SLALYMQNEEWAVEPGRFTLMLG 792


>gi|225873993|ref|YP_002755452.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
 gi|225791521|gb|ACO31611.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
          Length = 894

 Score =  280 bits (715), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 165/454 (36%), Positives = 232/454 (51%), Gaps = 61/454 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N SL    RA+ L+S +TL+EK  QL + A AIPRL +PAY WWSE+LHG+A NG   
Sbjct: 39  YLNPSLPPVVRARDLVSRMTLKEKASQLVNAARAIPRLKVPAYNWWSEALHGVAVNG--- 95

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FP+ +   A+F+      +   +  E R +Y   +         GL FW
Sbjct: 96  --------TTEFPEPIGLGATFDVPAIHEMAVDIGTEGRVVYEENEKDGSSKIFHGLDFW 147

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           APN+NIFRDPRWGRGQET GEDP +     V FV   QG+N K             RV+ 
Sbjct: 148 APNLNIFRDPRWGRGQETYGEDPFLTGKMGVAFVSGMQGDNPKY-----------YRVI- 195

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                          A  KH   +D+      +R+  +  ++  D  DT++P FR+ I Q
Sbjct: 196 ---------------ATPKH---FDVHSGPEPTRHFADVDVSLHDQLDTYEPAFRAAIMQ 237

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQ-KARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           G A  +MCSYN +NG PAC      Q + R  WGFKGY+ SDCDAV  I+    Y  T  
Sbjct: 238 GHADSVMCSYNAINGQPACANQFTLQHQLRGAWGFKGYVVSDCDAVHDIYSGHKYRPTLA 297

Query: 345 DSAAGVLKAGMDINC-------GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
            +AA  ++ GMD +C       G    +    A+ +G + ++ +D AL+ LF+ +++LGL
Sbjct: 298 QAAAISMERGMDNDCADFAQPKGDDDYKAYIDAVQQGYLSQQAMDTALVRLFTARIKLGL 357

Query: 398 FNGDPR-KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
           F  DP+    Y      ++ +  H+  A   A + +VLLKND   LPL   +V S+A++G
Sbjct: 358 F--DPKGMDPYADTPHSELNSPAHRAYARKLADESMVLLKNDGT-LPLKPGSVHSIAVVG 414

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKT 490
           PL +  + + G Y G+P    S L GL A    T
Sbjct: 415 PLADQTAVLLGNYNGVPTHTVSFLEGLRAEYPNT 448



 Score =  113 bits (282), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 83/263 (31%), Positives = 129/263 (49%), Gaps = 46/263 (17%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR +L +P  + +LV +VA+T K PV++VL  G  L V++       +++L   Y GE G
Sbjct: 652 DRTNLQMPEPEEALVEAVAKTGK-PVVVVLMNGSALAVNWIS--QHANAVLEAWYSGEEG 708

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
             A+A+ + G  +P GRLP+T+Y +S  ++P   D +M          R+YR++ G  +Y
Sbjct: 709 GAAIADTLSGKNDPAGRLPVTFY-KSVNQLPNFEDYSME--------NRTYRYFKGKPLY 759

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT + Y  LS P      A++ AG                             
Sbjct: 760 PFGYGLSYTTFRYSDLSIP-----HATVDAGQP--------------------------V 788

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
               +VTN G V G  VV L+ + PKV  G P+  L GF R+H     S+++ F +    
Sbjct: 789 EASATVTNTGKVAGDEVVQLYLKFPKVD-GAPDIALRGFQRIHLEPGQSQQVHFELK-KR 846

Query: 775 QLSIANKHGRRILPLGNHVLMVG 797
            LS+    G+ I+  G++ L +G
Sbjct: 847 DLSMVTALGQIIVAQGDYTLSIG 869


>gi|285016879|ref|YP_003374590.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
 gi|283472097|emb|CBA14604.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
           PC73]
          Length = 914

 Score =  280 bits (715), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 159/462 (34%), Positives = 245/462 (53%), Gaps = 45/462 (9%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + ++  + + RA  L++ +TL+EK+ Q+ + A AIPRLG+PAY+WW+E LHG+A  G   
Sbjct: 34  YLDSQRTFAQRADDLVARMTLEEKVAQMQNAAPAIPRLGVPAYDWWNEGLHGVARAG--- 90

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFW 165
                    T FPQ +  AA+F+  L   + +A++ EARA ++            GLTFW
Sbjct: 91  -------GATVFPQAIGLAATFDLPLMHEVSTAISDEARAKHHEALRRGEHGRYQGLTFW 143

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP + +   V FV+  QGE   +     G  +R+     
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGMQGEGADAPKNAQGETYRK----- 198

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                        L A  KH   + +       R+ F+A  +++D  +T+ P F + +++
Sbjct: 199 -------------LDATAKH---FAVHSGPESERHHFDARPSQRDLYETYLPAFEALVKE 242

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           GK   +M +YN++ G  A     L +   R  WGF GY+ SDC A+  I++      T E
Sbjct: 243 GKVDAVMGAYNRLFGESASASKFLLRDVLRERWGFHGYVVSDCWAIVDIWKNHKIVATRE 302

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  +K G  + CG        +A+ +G + E DID AL  L + ++RLG+F+  P +
Sbjct: 303 QAAALAVKNGTQLECGQEYAT-LPAAVQQGLIGETDIDAALRTLMTARMRLGMFD-PPGQ 360

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
            ++ +L      + EH  LA   AR+ +VLLKND   LPL++     +A+IGP  ++   
Sbjct: 361 LRWAQLPISVNQSPEHDALARRTARESLVLLKND-GLLPLSRAKHKRIAVIGPTADDTMA 419

Query: 465 MGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSD 504
           + G Y G P +P ++L+G+ A    +   YA G   V   SD
Sbjct: 420 LLGNYYGTPATPVTILQGIRAAAPDADVLYARGADLVEGRSD 461



 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 98/304 (32%), Positives = 143/304 (47%), Gaps = 57/304 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             EA+  A++AD V+ V GL    E E+           DR  L LP  Q  L+ +++ T
Sbjct: 625 LQEALDTARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLQALSAT 684

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV+ VLT G  L + +A+    + +IL   YPG+ G  A+A+++FGD NPGGRLP+T
Sbjct: 685 GK-PVVAVLTTGSALAIDWAQ--EHVPAILLAWYPGQRGGSAVADVLFGDTNPGGRLPVT 741

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y  S T    +D  MR        GR+YR++ GT +Y FGHGLSYT ++Y  L      
Sbjct: 742 FYKASETLPAFDDYAMR--------GRTYRYFAGTPLYPFGHGLSYTQFAYSDL------ 787

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                               RLD   +         +    + VTN G   G  VV L+ 
Sbjct: 788 --------------------RLDRRKV-----AADGQLSATLKVTNTGTRAGDEVVQLYL 822

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI---ANKHGRRILPLGNHV 793
                ++    K+L GF R+      S+++ F + P   L I   A KH   ++  G++ 
Sbjct: 823 HPLAPTRARAIKELRGFQRIALAPGESRDVHFTISPQTDLRIYDEAQKH--YVVDPGDYE 880

Query: 794 LMVG 797
           L VG
Sbjct: 881 LQVG 884


>gi|150002739|ref|YP_001297483.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|294776994|ref|ZP_06742455.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
 gi|149931163|gb|ABR37861.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
 gi|294449242|gb|EFG17781.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
          Length = 788

 Score =  280 bits (715), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 222/736 (30%), Positives = 345/736 (46%), Gaps = 129/736 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L + IG    V
Sbjct: 138 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKKLIARIGE---V 182

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA+    LG   +  ++P ++I +DPRWGR  ET GEDP +V     + + S Q  N   
Sbjct: 183 EAKEAVALGYTNI--YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKYN--- 237

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                        L A  KH   Y +   G   +   +  +  +
Sbjct: 238 -----------------------------LVATPKHFAVYSIPIGGRDGKTRTDPHVAPR 268

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
           +    +  PFR   ++  A  +M SYN  +G P  + G    L +  R EWGFKGY+ SD
Sbjct: 269 EMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 326

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
            +AV  I        T+ED  A  + AG++I          +   + A+D GK+ ++ +D
Sbjct: 327 SEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFILPLRKAVDNGKISQETLD 386

Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
           + +  +  ++ RLGLF+ +P +G  GK     V + EH+ ++L+AARQ +VLLKN+   L
Sbjct: 387 KRVAEILRIKFRLGLFD-NPYRGN-GKQAEQIVHSKEHQAVSLEAARQSLVLLKNETNLL 444

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCH--- 497
           PL+K ++ S+A+IGP  N  +Q+   Y       K++ +G++  +  T   Y  GC    
Sbjct: 445 PLSK-SIRSIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHTEVIYKKGCDIID 503

Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
                    D P  ++      EA+R AK+A+ V++V G +     EDR R SL LPG+Q
Sbjct: 504 PHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQ 563

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             L+ +V  T K P+ILV+  G    +++A A   I +IL   +PGE   +A+AE +FGD
Sbjct: 564 EELLKAVCATGK-PIILVMLDGRASSINYAAA--HIPAILHAWFPGEFCGQAVAEALFGD 620

Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
           +NPGGRL +T +P+S  ++P        +D S           + T VYG    FGHGLS
Sbjct: 621 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDES-----------SSTSVYGALYPFGHGLS 668

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT ++Y      S+L IS S             G + D               HV   + 
Sbjct: 669 YTTFTY------SDLHISPS-----------HQGVQGD--------------IHVSCKIK 697

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G + G  VV L+ R    S  T  K L GF+R+   A   + + F + P + L + +K
Sbjct: 698 NTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP-QDLGLWDK 756

Query: 782 HGRRILPLGNHVLMVG 797
           +    + LG+  +M+G
Sbjct: 757 NMNFRVELGSFKVMLG 772


>gi|399030621|ref|ZP_10730998.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398071229|gb|EJL62496.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 876

 Score =  279 bits (714), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 160/445 (35%), Positives = 232/445 (52%), Gaps = 53/445 (11%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           F N  LS   R   L++ LTL+EK+ Q+ +++ AIPRL IPAY+WW+E+LHG+A      
Sbjct: 31  FQNPDLSFEKRVDDLVNRLTLEEKVSQMLNSSPAIPRLDIPAYDWWNETLHGVAR----- 85

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGLTF 164
               T   VT +PQ +  AA+F+++    +    A+E RA+YN             GLT+
Sbjct: 86  ----TPFKVTVYPQAIAMAATFDKNSLYKMADFSALEGRAIYNKAVESGRTNERYLGLTY 141

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PNINIFRDPRWGRGQET GEDP +       FVK  QG++ K                
Sbjct: 142 WTPNINIFRDPRWGRGQETYGEDPYLTGVLGDSFVKGLQGDDPKY--------------- 186

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                       L  +AC KH   Y +       R++F+  +T  +  DT+ P F+  + 
Sbjct: 187 ------------LKAAACAKH---YAVHSGPEPLRHTFDVDVTPYELWDTYLPAFQKLVT 231

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
           + K + +MC+YN     P C    L     RN+W F+GY+TSDC A+   F+        
Sbjct: 232 ESKVAGVMCAYNAFRTQPCCASDILMTDILRNQWKFEGYVTSDCWAIDDFFKNHKTHPDA 291

Query: 344 EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
           E ++A  +  G DI+CGT   +    A+  GK+ EK ID ++  LF ++ RLG+F  DP 
Sbjct: 292 ESASADAVFHGTDIDCGTDAYKALVQAVKDGKISEKQIDISVKRLFMIRFRLGMF--DPV 349

Query: 404 KG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
           +  KY +     +   EHK  AL  ARQ IVLL+N+ K LPL+K  +  + ++GP V+N 
Sbjct: 350 EMVKYAQTPTSVLENDEHKAHALKMARQSIVLLRNENKTLPLSKK-LKKIVVLGPNVDNA 408

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYV 487
             + G Y G P    ++L G++  V
Sbjct: 409 IAILGNYNGTPSKLTTVLEGIKEKV 433



 Score =  120 bits (300), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 92/307 (29%), Positives = 143/307 (46%), Gaps = 61/307 (19%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
           F+  V   K AD  + V G+    E E+           DR S+LLP  Q  L+ ++ +T
Sbjct: 598 FNTLVNRVKDADAFVFVGGISPQLEGEEMKVNFPGFKGGDRTSILLPKIQTDLMKAL-KT 656

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
           + +P++ V+  G  + + + EA++ I +I    Y G+A   A+A+++FG++NP GRLP+T
Sbjct: 657 TGKPIVFVMMTGSAIAIPW-EAEN-IPAIANAWYGGQAAGTAVADVLFGNYNPAGRLPVT 714

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y       P  D  M          R+YR++ G  +YGFG+GLSYT + Y        L
Sbjct: 715 FYKSDADLSPFVDYKMD--------NRTYRYFKGKPLYGFGYGLSYTTFKY------DNL 760

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            I+ S+  G  KN+                         + + VTN G V G  VV L+ 
Sbjct: 761 KIAPSVIKG--KNV------------------------PITVKVTNTGKVSGEEVVQLYV 794

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
                +   P K L GF+R+   A  SK I+F + P E LS        I   GNH    
Sbjct: 795 INQNTAIKAPLKTLKGFERISLKAGKSKTITFTLSP-EDLSY-------ITAEGNHQQYN 846

Query: 797 GELRHSL 803
           G+++ ++
Sbjct: 847 GKIKIAI 853


>gi|288927072|ref|ZP_06420962.1| beta-glucosidase [Prevotella buccae D17]
 gi|288336152|gb|EFC74543.1| beta-glucosidase [Prevotella buccae D17]
          Length = 866

 Score =  279 bits (714), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 155/457 (33%), Positives = 249/457 (54%), Gaps = 44/457 (9%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YP+ N  LS   RA+ L S LTL+EK + + +++ AIPRLGIP +EWWSE+LHGIA NG 
Sbjct: 23  YPYQNPRLSSQERAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG- 81

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
                      T FPQ    AAS++  L  ++  A + EA A  NL +         G++
Sbjct: 82  ---------FATVFPQTTAMAASWDDELLYHVFCAASDEAVAKNNLARKSGDIKRYQGVS 132

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PNINIFRDPRWGRGQET GEDP + S   +  V   QG+ ++ D             
Sbjct: 133 IWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRD------------- 179

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSC 282
           ++ F E       L   AC KH   +   +W   +R+ F+   + E+D  +T+ P F+S 
Sbjct: 180 MRPFTERPRYYKTL---ACAKHYAVHSGPEW---NRHVFDVERLPERDLWETYLPAFKSL 233

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFE--YQNY 339
           +++G    +MC+Y +++G P C     L Q  R EW + G + SDC A++  +   + + 
Sbjct: 234 VQEGNVREVMCAYQRIDGSPCCGNTRYLHQILRGEWEYNGLVVSDCGAISDFYREGHHHV 293

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
            +T  +++A  ++AG D+ CG  +      A+++G +  + ID +++ L   +  +G F+
Sbjct: 294 VETPAEASAMGVRAGTDVECG-AVYATLPRAVEQGLISREAIDTSVVRLLKARFEVGDFD 352

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            + +   +   GP+ + +  H++LALD AR+ + LL+N  + LPL+KN +  +A++GP  
Sbjct: 353 SE-KLVPWKLTGPEVIASETHRRLALDMARESMTLLQNRNRLLPLSKNGL-RIAVMGPNA 410

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
           N+   + G YTG P S  ++L+G+ + V    +  GC
Sbjct: 411 NDSVMLWGNYTGYPISTTTILKGIRSKVPAARFVEGC 447



 Score =  102 bits (254), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 91/334 (27%), Positives = 141/334 (42%), Gaps = 65/334 (19%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           YV     A    D+   S     E    A  AD V+ V G+    E E+           
Sbjct: 580 YVQNKAMAVCQFDIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFKGG 639

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR S+ LP  Q  ++  + +  K  V +  +GG    V   EA     ++L   Y GEAG
Sbjct: 640 DRTSIELPEAQREVIRLLRQAGKLVVFVNCSGGAVALVPETEA---CDAVLQAWYAGEAG 696

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
            +A+A+++FGD+NP G+LP+T+Y          D  M         GR+YR++ G  ++ 
Sbjct: 697 GQAVADVLFGDYNPSGKLPVTFYKSDADLPDFLDYRM--------TGRTYRYFRGIPLFP 748

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG GLSYT++++                 G  +    + G                    
Sbjct: 749 FGFGLSYTSFAF-----------------GKPR---YENGK------------------- 769

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           + + VTN G  DG+ VV ++ + P  + G P K L GF R+   A   + +   + P E+
Sbjct: 770 LYVEVTNTGKRDGAEVVQVYVKNPADADG-PVKTLRGFARIDLKAGERRRVEIAM-PRER 827

Query: 776 LS--IANKHGRRILPLGNHVLMVGELRHSLTIET 807
                A  +  R+ P GNH+LMVG       ++T
Sbjct: 828 FEGWDATTNTMRVKP-GNHLLMVGSSSRDADLQT 860


>gi|238620766|ref|YP_002915592.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.4]
 gi|238381836|gb|ACR42924.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.16.4]
          Length = 755

 Score =  278 bits (712), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 227/769 (29%), Positives = 357/769 (46%), Gaps = 142/769 (18%)

Query: 74  LQEKIQQ-LSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
           L  KIQ+ L +N     RL IPA     E L G+            V + T+FPQ +  A
Sbjct: 68  LTNKIQKFLMENT----RLKIPAI-VHEECLTGL-----------MVKTATAFPQAIGLA 111

Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVS 192
           ++++  L   + S +  +A+ +    Q      +P +++ RDPRWGR +ET GED  +V+
Sbjct: 112 STWDPDLIREVSSTIRYQAK-LIGTNQC----LSPVLDVCRDPRWGRCEETYGEDQYLVA 166

Query: 193 AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLE 252
           +  + +VK  QGEN                                L A  KH  A+   
Sbjct: 167 SIGLAYVKGLQGENE-------------------------------LIATVKHFAAHGFP 195

Query: 253 KWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
           + G   R      +  ++  + F  PF   I+ GKA  +M +Y++++G+P     +L  K
Sbjct: 196 EGG---RNIAPVHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTK 252

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQS 369
             R EWGF+G + SD DA+  +      +   +++A   L+AG+D       C       
Sbjct: 253 ILRQEWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDCFGEPLLE 312

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
           A+ +G + E  IDRA+  +  ++ +LGLFN D    +     P+ +  S+ ++LALD AR
Sbjct: 313 AVKEGLISESIIDRAVERVLRIKEKLGLFN-DHYINENNV--PEKLDNSKSRELALDVAR 369

Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT-------GIPCSPKSLLRG 482
           + IVLLKND   LPLNKN + ++A+IGP  N    + G YT        +     ++L G
Sbjct: 370 KSIVLLKNDN-ILPLNKN-IGTIAVIGPNANEPRNLLGDYTYTGHLNADVGIEVVTVLEG 427

Query: 483 LEAYVSKTH---YASGCHDVPCNSDAGFHEAVRIAKKADFVIVV----AGLDLTQET--- 532
           +   VS      YA GC D+   S  GF EA+ IAKK D +I V    +GL L+      
Sbjct: 428 IMRKVSNNTNVLYAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVPG 486

Query: 533 ------------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FAEA 578
                       E  DR SL LPG Q  L+  + +T K P+ILVL  G PL +S  F E 
Sbjct: 487 KDEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFNEV 545

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSR 638
           ++ I +  W  +PGE G  A+A++IFGD+NP GRLP++ +P    ++P+   N +  S  
Sbjct: 546 NAIIDA--W--FPGEEGGNAIADVIFGDYNPSGRLPIS-FPIDTGQIPIY-YNRKPSSL- 598

Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDKNILQQTGSR 697
               R Y       ++ FG+GLSYT + Y  L   P E+  S  +K              
Sbjct: 599 ----RPYVMMKSKPLFPFGYGLSYTEFKYSNLEVTPKEVNSSGKIK-------------- 640

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
                             + + V N G  +G   V L+          P K+L GF +V+
Sbjct: 641 ------------------ISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAKVY 682

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
                 ++I+F + P E L+  +++ R I+  G++ +++G+    + ++
Sbjct: 683 LKPNEKRKITFSL-PLEALAFYDQYMRLIIDTGDYEILIGKSSEDIVLK 730


>gi|227828570|ref|YP_002830350.1| glycoside hydrolase [Sulfolobus islandicus M.14.25]
 gi|229585800|ref|YP_002844302.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.27]
 gi|227460366|gb|ACP39052.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.14.25]
 gi|228020850|gb|ACP56257.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.16.27]
          Length = 755

 Score =  278 bits (711), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 229/771 (29%), Positives = 356/771 (46%), Gaps = 146/771 (18%)

Query: 74  LQEKIQQ-LSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
           L  KIQ+ L +N     RL IPA     E L G+            V + T+FPQ +  A
Sbjct: 68  LTNKIQKFLMENT----RLKIPAI-VHEECLTGL-----------MVKTATAFPQAIGLA 111

Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVS 192
           ++++  L   + S +  +A+ +    Q      +P +++ RDPRWGR +ET GED  +V+
Sbjct: 112 STWDPDLIREVSSTIRYQAK-LIGTNQC----LSPVLDVCRDPRWGRCEETYGEDQYLVA 166

Query: 193 AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLE 252
           +  + +VK  QGEN                                L A  KH  A+   
Sbjct: 167 SIGLAYVKGLQGENE-------------------------------LIATVKHFAAHGFP 195

Query: 253 KWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
           + G   R      +  ++  + F  PF   I+ GKA  +M +Y++++G+P     +L  K
Sbjct: 196 EGG---RNIAPVHVGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTK 252

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQS 369
             R EWGF+G + SD DA+  +      +   +++A   L+AG+D       C       
Sbjct: 253 ILRQEWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAILALEAGVDTEFPNIDCFGEPLLE 312

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
           A+ +G + E  IDRA+  +  ++ +LGLFN            P+ +  S+ ++LALD AR
Sbjct: 313 AVKEGLISESIIDRAVERVLRIKEKLGLFNNHYINENNV---PEKLDNSKSRELALDVAR 369

Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT---------GIPCSPKSLL 480
           + IVLLKND   LPLNKN + ++A+IGP  N    + G YT         GI     ++L
Sbjct: 370 KSIVLLKNDN-ILPLNKN-IGTIAVIGPNANEPRNLLGDYTYTGHLNADGGIEVV--TVL 425

Query: 481 RGLEAYVSKTH---YASGCHDVPCNSDAGFHEAVRIAKKADFVIVV----AGLDLTQET- 532
            G+   VS      YA GC D+   S  GF EA+ IAKK D +I V    +GL L+    
Sbjct: 426 EGIMRKVSNNTNVLYAKGC-DIAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDV 484

Query: 533 --------------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FA 576
                         E  DR SL LPG Q  L+  + +T K P+ILVL  G PL +S  F 
Sbjct: 485 PGKDEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVNGRPLALSSIFN 543

Query: 577 EADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADS 636
           E ++ I +  W  +PGE G  A+A++IFGD+NP GRLP++ +P    ++P+     R  S
Sbjct: 544 EVNAIIDA--W--FPGEEGGNAIADVIFGDYNPSGRLPIS-FPIDTGQIPI--YYNRKPS 596

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDKNILQQTG 695
           S     R Y       ++ FG+GLSYT + Y  L   P E+  S  +K            
Sbjct: 597 SL----RPYVMMKSKPLFPFGYGLSYTEFKYSNLEVTPKEVNSSGKIK------------ 640

Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
                               + + V N G  +G   V L+          P K+L GF +
Sbjct: 641 --------------------ISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGFAK 680

Query: 756 VHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           V+      ++I+F + P E L+  +++ R I+  G++ +++G+    + ++
Sbjct: 681 VYLKPNEKRKITFSL-PLEALAFYDQYMRLIIDTGDYEILIGKSSEDIVLK 730


>gi|206901280|ref|YP_002250567.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
 gi|206740383|gb|ACI19441.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
          Length = 762

 Score =  278 bits (710), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 228/760 (30%), Positives = 356/760 (46%), Gaps = 136/760 (17%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           + L  KIQ+     +   RLGIPA     E L G  + G            T FPQ +  
Sbjct: 72  VELANKIQKFLKEKT---RLGIPAI-IHEECLSGFMAQG-----------ATVFPQAIGM 116

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
           A++F   L   +   +    +A  N+ Q GL+   P ++I RDPRWGR +ET GEDP +V
Sbjct: 117 ASTFEPELIRRVSDVIRQHMKAA-NVHQ-GLS---PVLDIPRDPRWGRTEETFGEDPYLV 171

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S  A E+VK  QGE+W+                              + A  KH  AY +
Sbjct: 172 SRMATEYVKGLQGEDWREG----------------------------IVATVKHFTAYGI 203

Query: 252 EKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQ 311
            +    +R    A + E++  + F  PF   I++G+A  +M +Y++++GVP      L  
Sbjct: 204 SEG---ARNLGPAKVGERELREVFLFPFEVAIKEGQAGSLMNAYHEIDGVPCASSKFLLT 260

Query: 312 KA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQ 368
           K  R EWGFKGY+ SD  AV  +  +    +  +++A   L+AG+DI   +  C      
Sbjct: 261 KILRWEWGFKGYVVSDYIAVRMLENFHKVARDAKEAAVLALEAGIDIELPSVDCYGEPLI 320

Query: 369 SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVC----TSEHKKLA 424
            A+ +G + E+ I+ ++  +   +  LGLF+ +  K       P  V       E + L+
Sbjct: 321 QAVKEGLISEEVINASVERVLRAKFMLGLFDDNLEKD------PKKVYEVFDKPEFRDLS 374

Query: 425 LDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT-------------G 471
            + AR+ IVLLKND   LPL+KN +  +A+IGP  +N   + G Y+             G
Sbjct: 375 REVARRSIVLLKNDGT-LPLSKN-LKKVAVIGPNADNPRNLHGDYSYTAHIPSIAEGLEG 432

Query: 472 IPCSPK------SLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
           +    K      S+L G+   VS   +  YA GC D+  +S  GF EA+ +AK+AD +I 
Sbjct: 433 VKVEEKCVVRTVSILEGIRNKVSPETEVLYAKGC-DIISDSKDGFAEAIEMAKEADVIIA 491

Query: 523 VAG-----LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           V G            E  DR +L L G Q  L+  + +  K P++LVL  G P  + +  
Sbjct: 492 VMGEESGLFHRGISGEGNDRTTLELFGVQRDLLKELHKLGK-PIVLVLINGRPQALKWEH 550

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
            +  +++IL   YPGE G  A+A++IFGD+NP G+LP++ +P    ++P+   N +  + 
Sbjct: 551 EN--LNAILEAWYPGEEGGNAVADVIFGDYNPSGKLPIS-FPAVTGQIPVY-YNRKPSAF 606

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
             Y   S +      +Y FGHGLSYT + Y      S+L IS                  
Sbjct: 607 SDYIDESAK-----PLYPFGHGLSYTTFEY------SDLKISP----------------- 638

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
                 ++V S    +  +  ++ N G+ DG  VV L+      S   P K+L GF +++
Sbjct: 639 ------EKVNSLE--KVEISFTIKNTGNRDGEEVVQLYIHDQVASLERPVKELKGFKKIY 690

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
                SK ++F + P EQL+  ++  R I+  G   +M+G
Sbjct: 691 LKPGESKRVTFTLYP-EQLAFYDEFMRFIVEKGVFEVMIG 729


>gi|90021134|ref|YP_526961.1| Beta-glucosidase [Saccharophagus degradans 2-40]
 gi|89950734|gb|ABD80749.1| b-xylosidase-like protein [Saccharophagus degradans 2-40]
          Length = 893

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 172/483 (35%), Positives = 253/483 (52%), Gaps = 66/483 (13%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           +YPF + SLS+  R   L+S LT  EKI Q+ ++  AI RLGIPAY WW+ESLHG+A  G
Sbjct: 42  TYPFRDASLSVDARVDDLVSRLTTTEKIAQMFNDTPAIERLGIPAYNWWNESLHGVARAG 101

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGL 162
                       T +PQ +  A++F+  L   + ++++ E RA Y+  L +       GL
Sbjct: 102 ----------KATVYPQAIGLASTFDEDLMLRVATSISDEGRAKYHDFLSKDVRTIYGGL 151

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           TFW+PNINIFRDPRWGRGQET GEDP +    A+ FVK  QGEN                
Sbjct: 152 TFWSPNINIFRDPRWGRGQETYGEDPFLTGRMAINFVKGIQGEN---------------- 195

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
                    D  D L   A  KH   Y +      +R+S +   T +D  +T+ P FR  
Sbjct: 196 ---------DNSDYLKAVATIKH---YAVHSGPEKTRHSDDYHPTRKDLFETYLPAFRMA 243

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY-- 339
           I +     +MC+YN+V+G PAC   +L Q+  R + GF GY+ SDC A+A  +E +++  
Sbjct: 244 IAETNVQSLMCAYNRVDGAPACGNNELMQEILRGDMGFNGYVVSDCGAIADFYESRSHHV 303

Query: 340 TKTHEDSAAGVLKAGMDINCGTC---MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
             +  ++AA  +K+G D+NCG        +   A+ +G + E  ID A+  LF  +++LG
Sbjct: 304 VDSPAEAAAWAVKSGTDLNCGDSHGNTYTNLHYALQQGLITEDYIDIAVKRLFKARIKLG 363

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
           +F+   R   Y ++G D V + +H  L  +AA + IVLLKN+   LPL   A   +A+IG
Sbjct: 364 MFDEQDRV-PYSEIGMDVVGSPKHLALTQEAAEKSIVLLKNN-GVLPL--KAGVKVAVIG 419

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASG--------CHDVPCNSDAG 506
           P   +   + G Y G+P  P   L G+   V + +  YA G         H  P +++  
Sbjct: 420 PNAVDEDVLVGNYHGVPVKPVLPLEGIVNRVGEANVFYAPGSAQIADIYSHYEPISAENF 479

Query: 507 FHE 509
           +H+
Sbjct: 480 YHK 482



 Score =  112 bits (280), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 83/299 (27%), Positives = 133/299 (44%), Gaps = 54/299 (18%)

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
           EA+  A+KAD +I + G+D   E E+           DR  + LP  Q +L+  +  T K
Sbjct: 619 EALAAARKADVIIFMGGIDAHLEGEEMPLELDGFTHGDRTHINLPKVQTNLLKQLKATGK 678

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
            PV++V   G  + +++     ++ +IL   YPGEA   ALA I++GD +P GRLP+T+Y
Sbjct: 679 -PVVMVNFSGSAMALNWES--EKLDAILQAFYPGEATGTALANILWGDVSPSGRLPVTFY 735

Query: 619 PESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
                    ND +M          R+Y+FY G  +Y FGHGL Y +++Y      + L +
Sbjct: 736 KGVDDLPAFNDYHME--------NRTYKFYRGEPLYAFGHGLGYVDFAY------NNLVV 781

Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
           + + +AG                              + +SVTN G +    V  ++  +
Sbjct: 782 ANTAEAGK--------------------------ALPIAVSVTNTGKMQAEDVAQVYISL 815

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
                 TP + L  F R    A  S E+ F + P   L+  + +G+     G   + VG
Sbjct: 816 LDAPANTPIRDLKAFKRTKLAAGESTELEFNL-PARVLTYIDDNGKTQTYTGRVEVTVG 873


>gi|229580225|ref|YP_002838625.1| glycoside hydrolase family protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|229581131|ref|YP_002839530.1| glycoside hydrolase family protein [Sulfolobus islandicus
           Y.N.15.51]
 gi|228010941|gb|ACP46703.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|228011847|gb|ACP47608.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           Y.N.15.51]
          Length = 754

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 211/711 (29%), Positives = 336/711 (47%), Gaps = 123/711 (17%)

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
           S T+FPQ +  A+++N  L  +I S +  + R +  + Q      +P +++ +DPRWGR 
Sbjct: 100 SSTAFPQAIGLASTWNPELVMDIASVIRSQGR-LVGVNQC----LSPVLDVCKDPRWGRC 154

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
           +ET GEDP +V++  + ++   QG+N                                L 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------------------------------QLV 183

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           A  KH  A+   + G   R      +  ++  +TF  PF   ++ GK   IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240

Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
           +P      L     R EWGF G + SD D +  +           ++A   L++G+DI  
Sbjct: 241 IPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF 300

Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
            T  C      +A+ +G V E  IDRA+  +  ++ RLGL + +P   +     P+ +  
Sbjct: 301 PTIDCYGEPLVNALKEGLVPESLIDRAVERVLRIKDRLGLLD-NPFVNENSV--PEKLDD 357

Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
            + ++LAL  AR+ IVLLKN+   LPL+KN V+ +A+IGP  N+   M G YT       
Sbjct: 358 HKSRELALKTARESIVLLKNENNILPLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNI 416

Query: 471 --GIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
             GI     ++L+G+   V  SK  YA GC D+   S  GF EA+ IA++AD +I + G 
Sbjct: 417 DSGIEIV--TVLQGIVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAIMGE 473

Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
                   +D+  E E +          DR SL LPG Q  L+  + +T K P+ILVL  
Sbjct: 474 KSGLPLSWMDIPSEEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN 532

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PL +S     + + +++   +PGE G  A+A++IFGD+NPGGRLP+T +P    ++P+
Sbjct: 533 GRPLVLS--SIINYVKAVIEAWFPGEEGGNAIADVIFGDYNPGGRLPIT-FPMDTGQIPL 589

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
              N +  S      R Y     + ++ FG+GLSYT + Y  L   P E+        G 
Sbjct: 590 Y-YNRKPSSF-----RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEI--------GP 635

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
           + NI                         + I V N G ++G  VV L+      S   P
Sbjct: 636 NSNI------------------------AISIDVKNVGKMEGDDVVQLYVSKTFSSVARP 671

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            K+L GF ++H      + + F + P E L+  +   R ++  G + L++G
Sbjct: 672 VKELKGFAKIHLKPGEKRRVKF-ILPTEALAFYDSFMRLVVEKGEYQLLIG 721


>gi|325916103|ref|ZP_08178390.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325537647|gb|EGD09356.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 896

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 161/462 (34%), Positives = 242/462 (52%), Gaps = 54/462 (11%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +T L   TRA  L+S +TL+EK  Q+ + A AIPRL +PAY+WW+E+LHG+A  G  
Sbjct: 39  PYLDTQLPFETRAADLVSRMTLEEKAAQMQNAAPAIPRLRVPAYDWWNEALHGVARAG-- 96

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
                     T FPQ +  AA+F+  L + + +A++ EARA ++   A        GLTF
Sbjct: 97  --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARDEHKRYQGLTF 148

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  Q +                   
Sbjct: 149 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQ------------------- 189

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                   +G    L A  KH   Y +       R+ F+   +E+D  +T+ P F++ ++
Sbjct: 190 --------QGPYRKLDATAKH---YAVHSGPEADRHHFDVHPSERDLHETYLPAFQALVQ 238

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           +G  + +M +YN+VNG  A     L    R +WGF GYI SDC A+  I++      T E
Sbjct: 239 EGHVAAVMGAYNRVNGESASASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPE 298

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  +K G D++CG         A+  G + E  ID +L  L + ++RLG+F+  P K
Sbjct: 299 AAAALGVKHGTDLDCGDTYA-ALPKAVRAGLIDEATIDTSLKRLMTTRMRLGMFD-PPAK 356

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             + ++      + +H  LA   AR+ +VLLKND   LPL K  +  +A++GP  ++   
Sbjct: 357 VAWAQIPASVNQSPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMS 414

Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
           + G Y G P +P ++L+G+   A  ++  YA G   V    D
Sbjct: 415 LLGNYYGTPAAPVTILQGIRDAAPQAEVVYARGSDLVEGRED 456



 Score =  119 bits (297), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 86/286 (30%), Positives = 130/286 (45%), Gaps = 52/286 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRD----------RVSLLLPGQQMSLVTSVART 556
             EAV  A+ A+ V+ V GL    E E+ D          R    LP  Q  L+ ++  T
Sbjct: 620 LQEAVDAARNAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQAT 679

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
              PV+ VLT G  L V +A+    + +IL   YPG+ G  A+ +++FG  +PGGRLP+T
Sbjct: 680 GT-PVVAVLTTGSALAVDWAQ--QHVPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPIT 736

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y E+      +D  MR        GR+YR++TGT +Y FGHGLSYT ++Y      S+L
Sbjct: 737 FYKEAERLPAFDDYAMR--------GRTYRYFTGTALYPFGHGLSYTQFAY------SDL 782

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
            +        D+  L   G+                     + V N G   G  VV L+ 
Sbjct: 783 RL--------DRTTLGADGT-----------------LRATLKVRNTGKRAGDEVVQLYL 817

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
                 +    K+L GF R+       +E++F +   + L I ++ 
Sbjct: 818 HPLDPKRERAGKELRGFQRMTLQPGEQREVAFTLKAADALRIYDEQ 863


>gi|329850151|ref|ZP_08264997.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
 gi|328842062|gb|EGF91632.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
          Length = 877

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 168/471 (35%), Positives = 242/471 (51%), Gaps = 63/471 (13%)

Query: 46  PPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHG 105
           P    +  + +T+L    RA  L+S +TL+EK  QL   A AIPRLG+P Y WW+E LHG
Sbjct: 15  PADVAAMAYRDTALDPKARAADLVSRMTLEEKAAQLGHTAPAIPRLGVPKYNWWNEGLHG 74

Query: 106 IASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY---------N 156
           +A  G            T FPQ +  AA+++  + + +G  V+ E RA Y          
Sbjct: 75  VARAG----------VATVFPQAIGMAATWDEPMMTTVGDVVSTEFRAKYVERVHPDGGT 124

Query: 157 LGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGF 216
               GLT W+PNINIFRDPRWGRGQET GEDP + S   + ++   QG + K        
Sbjct: 125 DWYRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTSRIGIGYIHGLQGNDPKF------- 177

Query: 217 GFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQ 276
                                   A  KH   +   +    +R+  +   ++ D EDT+ 
Sbjct: 178 --------------------FKTVATSKHFAVHSGPE---SNRHKEDVYPSKFDLEDTYL 214

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIF- 334
           P FR+ + +GKA  +MC YN V GVP C    L  +K R  WGF G++ SDC A A IF 
Sbjct: 215 PAFRATVTEGKAYSVMCVYNAVYGVPGCASDFLMEEKLRQNWGFPGFVVSDCGAAANIFR 274

Query: 335 -EYQNYTKTHEDSAAGVLKAGMDINCG------TCMLRHTQSAIDKGKVQEKDIDRALLN 387
            +  +YTKT E+  A  LKAGMD+ CG      +  ++   +A+  G++    +D+AL+ 
Sbjct: 275 EDALHYTKTAEEGVAVGLKAGMDLICGDYRNKMSTEVQPIINAVKAGQLPIAVVDQALVR 334

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           LF  ++RLG+F+  P    +  +  DD  T  H  +ALD A++ +VLLKND   LPL K 
Sbjct: 335 LFEGRIRLGMFD-PPASLPFAHITADDSDTPAHHAVALDMAKKSMVLLKND-GLLPL-KA 391

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAY--VSKTHYASGC 496
              ++A+IGP  +++  + G Y G P  P ++L G+ A    +K  YA G 
Sbjct: 392 EPKTIAVIGPNADSLDALVGNYYGKPSKPVTVLDGIRARFPTAKIVYAEGT 442



 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 95/300 (31%), Positives = 141/300 (47%), Gaps = 58/300 (19%)

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
           +AV +AK ADFV+ V GL    E E+           DR S+ LP  Q  L+  V  T K
Sbjct: 590 QAVDVAKTADFVVFVGGLSARVEGEEMKVEAEGFAGGDRTSIDLPKPQQQLLEKVIGTGK 649

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
            P +LVL  G  L V++A  D  + +I+   YPG  G  A+A++I GD++P GRLP+T+Y
Sbjct: 650 -PTVLVLMSGSALGVNWA--DKHVPAIIEAWYPGGEGGHAVAQLIAGDYSPAGRLPVTFY 706

Query: 619 PESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
             S   +P  +D  M+         R+YR++ G  +Y FGHGLSYT ++Y   + P    
Sbjct: 707 -RSVDALPGFSDYTMK--------NRTYRYFNGEVLYPFGHGLSYTTFAY---ANPKVSA 754

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
            S +  +    ++       +D    DEV         VQ+ V++ G             
Sbjct: 755 ASVAAGSSVTVSVDVSNSGAMDS---DEV---------VQLYVSHPG------------- 789

Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
                 GT  + L GF RV      +K + F +D    LS+ ++HG R +  G   L +G
Sbjct: 790 ------GTAIRSLQGFQRVSLKKGETKTVQFKLD-DRALSVVDEHGGRKVQAGQVDLWIG 842


>gi|160901716|ref|YP_001567297.1| glycoside hydrolase family 3 protein [Petrotoga mobilis SJ95]
 gi|160359360|gb|ABX30974.1| glycoside hydrolase family 3 domain protein [Petrotoga mobilis
           SJ95]
          Length = 777

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 220/750 (29%), Positives = 351/750 (46%), Gaps = 141/750 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA+    E L G  + G            T FPQ++ +A+++   L   + +++  
Sbjct: 91  RLGIPAF-MHEECLSGYMTRG-----------ATIFPQMIGAASTWEPPLIERMTTSIRN 138

Query: 150 EARAMYNLG-QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           + +A   LG   GL+   P +++ RDPRWGR +ET GEDP +++   V +VK  Q ++ K
Sbjct: 139 QMKA---LGIHQGLS---PVVDVTRDPRWGRTEETFGEDPYLIAKMGVAYVKGLQSDDLK 192

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMIT 267
           +                             + A  KH + Y + + G N++     A I 
Sbjct: 193 NG----------------------------IVATLKHFVGYGVSEGGMNWAP----AHIP 220

Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSD 326
           E++ ++TF  PF + I++GK   +M +Y++++G+P      L ++  R EWGF G + SD
Sbjct: 221 ERELKETFLFPFEAAIKEGKVKSVMNAYHEIDGIPCGASETLLRRILREEWGFDGIVVSD 280

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQEKDIDRA 384
             A+ ++ EY       E++A   LKAG+D+   +  C     ++AI+ G+  E  ID++
Sbjct: 281 YFAINSLMEYHKIALNKEEAAIKALKAGIDVELPSFDCYKEPLKNAIENGEFSEAFIDKS 340

Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLG--PDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
           + N+  ++  +GLF     +  Y  L   PD++ T E +KLA + A++ IVLLKND   +
Sbjct: 341 VRNILRLKFEMGLF-----ENPYVDLEKVPDNLDTPEDRKLAYEIAKKSIVLLKND-GIV 394

Query: 443 PLNKNA-VSSLAIIGPLVNNISQMGGGYT-------------------GIPCSP------ 476
           PL KN+ +  +A+IGP  N+   + G YT                   GI  S       
Sbjct: 395 PLKKNSKIKKVAVIGPNANSARNLTGDYTYLTHLETLKQGAFGTSAMEGITFSESELPIK 454

Query: 477 ---KSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG------LD 527
              +SL   LE    +T YA GC ++  ++     EAV +A+ +D  ++V G      LD
Sbjct: 455 TIYESLKEKLEKLNVETSYAKGC-EINDDNKEMIKEAVELAENSDVALLVLGDKSGLTLD 513

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
            T   E RD  +L+LPG Q+ L+ SV  T   PVI+VL  G P  + +   +  +S+I  
Sbjct: 514 CTT-GESRDSSTLILPGVQLDLLKSVINTG-TPVIVVLVNGRPYSLDWVSKN--VSAIFE 569

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRF 647
              PGE G  ALA+II GD +P G+LP++ +P    ++P+   N +    R      Y  
Sbjct: 570 AWLPGEEGGNALADIILGDESPSGKLPIS-FPRHVGQIPVY-YNHKPSGGRSQWWGDYTD 627

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
                +Y FGHGLSYT + Y                     N+  +   R+  + +D   
Sbjct: 628 SPAKPLYPFGHGLSYTQFEY--------------------GNLQIENNDRIVKISMD--- 664

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
                       V N G+  G  +V L+      S   P K+L GF RV       K I 
Sbjct: 665 ------------VKNIGEETGDEIVQLYMNDEVASVTRPVKELKGFQRVTLKPSEKKRII 712

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           F + P E L++ N+    ++  G   +MVG
Sbjct: 713 FNL-PIETLALYNEKMEFLVEKGYFKVMVG 741


>gi|373951852|ref|ZP_09611812.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373888452|gb|EHQ24349.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 871

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 158/446 (35%), Positives = 228/446 (51%), Gaps = 59/446 (13%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YP+ N  L  +TR   L+  +TL+EK+ Q+ +++ AIPRL IPAY+WW+E LHG+A    
Sbjct: 26  YPYQNYHLDFTTRVNDLVKRMTLEEKVSQMLNSSPAIPRLKIPAYDWWNEVLHGVAR--- 82

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGL 162
                 T   VT +PQ +  AA+F+R   + +    A+E RA++N             GL
Sbjct: 83  ------TPFKVTVYPQAIAMAATFDRQSLNQMADYAALEGRAVHNKALQMRKPGEKYLGL 136

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           T+W PNINIFRDPRWGRGQET GEDP +  A    FV   QG + K              
Sbjct: 137 TYWTPNINIFRDPRWGRGQETYGEDPFLTGAMGSAFVSGLQGNDPKY------------- 183

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
                         L  +AC KH   Y +       R+ FNA I+  D  DT+ P F+  
Sbjct: 184 --------------LKAAACAKH---YAVHSGPEPLRHVFNADISTYDLWDTYLPAFKKL 226

Query: 283 IEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQN 338
           +   K + +MC+YN     P C    L  D+    RN+W F GY+TSDC  +   F+   
Sbjct: 227 VVDDKVAGVMCAYNAFKTQPCCGSDLLMVDIL---RNQWKFSGYVTSDCGGIDDFFKNHK 283

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
              T ED++   +  G DI CGT   +   +A+ +GK+ E  ID ++  LF ++ RLG+F
Sbjct: 284 THATAEDASTDAVLHGTDIECGTDAYKSLVAAVKEGKISETQIDISVKRLFMIRFRLGMF 343

Query: 399 NGDPRK-GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
             DP    KY +     + + EH+  AL  ARQ +VLLKN    LPL+K  +  + ++GP
Sbjct: 344 --DPSDVVKYAQTPVSVLESPEHQAHALKMARQSVVLLKNANHTLPLSKT-IRKIVVLGP 400

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGL 483
             +N   + G Y G P +  ++ +G+
Sbjct: 401 NADNPIAILGNYNGTPSNLTTVYQGI 426



 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 84/284 (29%), Positives = 141/284 (49%), Gaps = 56/284 (19%)

Query: 511 VRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRP 560
           V+    AD ++ V G+    E E+           DR S+ LP  Q +L+ ++  T K P
Sbjct: 597 VKRVADADAIVYVGGISPQLEGEEMQVNYPGFNGGDRTSIQLPAAQTNLMKTLQATGK-P 655

Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
           V+ V+  G  L   + EA++ I +I+   Y G+A   A+A+++FGD+NP GRLP+T+Y +
Sbjct: 656 VVFVMMTGSALATPW-EAEN-IPAIVNAWYGGQAAGTAVADVLFGDYNPAGRLPVTFY-K 712

Query: 621 SFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
           S T +P   D +M          R+YR++ G  +YGFG+GLSYT + Y       +L + 
Sbjct: 713 SDTDLPDFTDYSMT--------NRTYRYFKGIPLYGFGYGLSYTQFKY------DKLIVP 758

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
           A++K+G                             H+ ++VTN+G + G  VV ++ +  
Sbjct: 759 ATVKSGK--------------------------AIHLSVTVTNSGQIAGDEVVQIYMKHH 792

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
                 P K L GF RV+  A   + ++F + P + L++ + +G
Sbjct: 793 SQRIKVPLKALKGFARVYLKAGERRTLNFILSP-DDLAVTSSNG 835


>gi|397690575|ref|YP_006527829.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
 gi|395812067|gb|AFN74816.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
          Length = 860

 Score =  275 bits (704), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 161/458 (35%), Positives = 239/458 (52%), Gaps = 54/458 (11%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N +L    RA+ L+  L+L EKI  +   + AI RLGIP Y WW+E+LHG+A NG   
Sbjct: 23  YLNVNLPFEERAEDLLQRLSLDEKISLMVHQSPAIERLGIPEYNWWNEALHGVARNG--- 79

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG--------QAGLTFW 165
                    T FP  +  AA+++R L   I   ++ EARA YN            G++ W
Sbjct: 80  -------RATVFPMPIGLAATWDRDLIYRIADVISNEARAKYNSALKKNQRGIYQGISLW 132

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           APNINIFRDPRWGRG ET GEDP +    AV F+K  QG+              +K+ LK
Sbjct: 133 APNINIFRDPRWGRGMETYGEDPYLTGELAVSFIKGLQGQ--------------DKKYLK 178

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQ 285
                          A  KHL  +   +     R+ FNA+++  D  +T+ P F+  I +
Sbjct: 179 TI-------------ATPKHLAVHSGPE---PERHHFNALVSNYDLNETYLPHFKKSIMK 222

Query: 286 GKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           GKA  +MC+YN++ G   C    L     RN+WGF+G + SDC AV  IF       + E
Sbjct: 223 GKAYSVMCAYNRLRGKACCGHDTLLTDILRNKWGFEGIVVSDCWAVYDIFNSHKIVDSPE 282

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  + +G D+ CG   L   ++A   G + EK+ID AL  +   + +LG+F+  P  
Sbjct: 283 KAAALAVSSGTDLECGNTFL-SLKNAYRDGLITEKEIDSALRRVLLARFKLGMFD-PPEI 340

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             Y ++    +  S ++++AL+AAR+ IVLLKND K LPL+ ++++ +A+IGP  +N+  
Sbjct: 341 VSYSQIDESYLDNSYNREIALEAARKSIVLLKNDNKLLPLD-SSINKIAVIGPNADNLES 399

Query: 465 MGGGYTGIPCSPKSLLRGLEAYVS--KTHYASGCHDVP 500
           + G Y G P    + L+ +   +   +  Y  GC   P
Sbjct: 400 LLGNYHGFPSEYITPLQAIRRVLKNGEVFYEKGCDFAP 437



 Score =  103 bits (256), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 135/298 (45%), Gaps = 55/298 (18%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           A + A K+D VI+  GL    E E            DR+ L LP  Q+ L+  +  T K 
Sbjct: 591 AYKTALKSDAVIMFMGLCPRMEGEALKIKLDGFKGGDRLKLSLPANQLKLIKKIHSTGK- 649

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
           PVILVL  GGP  +S       I +IL   YPG+AG +A+ ++I+G +NP G+LP+T Y 
Sbjct: 650 PVILVLLNGGP--ISTVWESENIPAILEAWYPGQAGGRAITDVIWGKYNPSGKLPVTIYK 707

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
                 P  + +M         GR+YR++ G  +Y FG GL+YT+ +       S + +S
Sbjct: 708 SENDLPPFENYDME--------GRTYRYFKGEVLYPFGWGLNYTDITI------SNIELS 753

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
           A                       +E+    ++R  V + + N G++ G   V L+ +  
Sbjct: 754 A-----------------------NEIKDNDTIR--VVVKLKNNGNLAGEETVQLYTKAL 788

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           K ++    K L GF+++         + F +   +     +  G   +P G + ++VG
Sbjct: 789 KDNRTI--KTLRGFEKIKLEPGTEGMVEFYLSKSDLAVWVDGLGFETMP-GVYEIIVG 843


>gi|385774250|ref|YP_005646817.1| glycoside hydrolase family protein [Sulfolobus islandicus HVE10/4]
 gi|323478365|gb|ADX83603.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           HVE10/4]
          Length = 754

 Score =  275 bits (704), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 212/711 (29%), Positives = 335/711 (47%), Gaps = 123/711 (17%)

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
           S T+FPQ +  A+++N  L  +I S +  + R +  + Q      +P +++ +DPRWGR 
Sbjct: 100 SSTAFPQAIGLASTWNLELVMDIASVIRSQGR-LVGVNQC----LSPVLDVCKDPRWGRC 154

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
           +ET GEDP +V++  + ++   QG+N                                L 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------------------------------QLV 183

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           A  KH  A+   + G   R      +  ++  +TF  PF   ++ GK   IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240

Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
           +P      L     R EWGF G + SD D +  +           ++A   L++G+DI  
Sbjct: 241 IPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF 300

Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
            T  C      +A+ +G V E  IDRA+  +  ++ RLGL + +P   +     P+ +  
Sbjct: 301 PTIDCYSEPLVNALTEGLVPESLIDRAVERVLRIKDRLGLLD-NPFVNENSV--PEKLDD 357

Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
            + ++LAL  AR+ IVLLKN+   LPL+KN V+ +A+IGP  N+   M G YT       
Sbjct: 358 HKSRELALKTARESIVLLKNENNILPLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNI 416

Query: 471 --GIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
             GI     ++L+G+   V  SK  YA GC D+   S  GF EA+ IA++AD +I V G 
Sbjct: 417 DSGIEIV--TVLQGVVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAVMGE 473

Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
                    D+  E E +          DR SL LPG Q  L+  + +T K P+ILVL  
Sbjct: 474 KSGLPLSWTDIPSEEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN 532

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PL +S     + + +++   +PGE G  A+A++IFGD+NPGGRLP+T +P    ++P+
Sbjct: 533 GRPLVLS--PIINYVKAVIEAWFPGEEGGNAIADVIFGDYNPGGRLPIT-FPMDTGQIPL 589

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
              N +  S      R Y     + ++ FG+GLSYT + Y  L   P E+        G 
Sbjct: 590 Y-YNRKPSSF-----RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEI--------GP 635

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
           + NI                         + I V N G ++G  VV L+      S   P
Sbjct: 636 NSNI------------------------AISIDVKNVGKMEGDDVVQLYVSKTFSSVARP 671

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            K+L GF ++H      + + F + P E L+  +   R ++  G + L++G
Sbjct: 672 VKELKGFAKIHLKPGEKRRVKF-ILPTEALAFYDSFMRLVVEKGEYQLLIG 721


>gi|308208211|gb|ADO20356.1| putative beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen
           bacterium]
          Length = 780

 Score =  275 bits (704), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 219/761 (28%), Positives = 332/761 (43%), Gaps = 156/761 (20%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ + SL    RAK L+S LTL+EK       ++ +  LGI AY WWSE+LHG+A NG  
Sbjct: 45  PYKDRSLPPEERAKDLVSRLTLEEKASLSMHPSAPVEALGIKAYNWWSEALHGVARNG-- 102

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
                   + T FPQ +  AASF+  L   + +AV+ EAR  Y + +         G+TF
Sbjct: 103 --------AATVFPQPIGMAASFDEPLLYEVFTAVSDEARVKYKIAKESGHIGQYQGVTF 154

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PNINIFRDPRWGRG ET GEDP +     +  V+  QG +             +  VL
Sbjct: 155 WTPNINIFRDPRWGRGMETYGEDPYLTGQMGMAVVRGLQGPS-------------DSPVL 201

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
           K               AC KH   +   +W   +R+S++A ++E+D  +T+ P F+  + 
Sbjct: 202 KA-------------HACAKHYAVHSGPEW---NRHSYDAEVSERDLRETYLPAFKDLVT 245

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA--RNEWGFKGYITSDCDAVATIFEY--QNYT 340
           +     +M +YN+  G P C   D       R EWG+KG ITSDC AV   +      Y+
Sbjct: 246 KANVQEVMTAYNRFRGEP-CGASDYLINTILRGEWGYKGLITSDCWAVEDFYVQGRHGYS 304

Query: 341 KTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
                +AA  + AG+D  CG    RH   A+++G + EKD+DR L+ LF+ + +LG  + 
Sbjct: 305 PDVASAAAAAVHAGVDTECGQAY-RHIPEAVERGLLDEKDLDRNLIRLFTARYQLGEMDD 363

Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
                 +  L    +   EH  L+   A++ +VLL+N    LPL  +    +A++GP  +
Sbjct: 364 ---ISLWDDLPASILEGPEHLALSRKMAQESMVLLQNKGGILPLAPDV--RVALVGPNGD 418

Query: 461 NISQMGGGYTGIPCSPKSLLRGL------------------------------------- 483
           +     G Y  +P    +L   L                                     
Sbjct: 419 DREMQWGNYNPVPGRTVTLYDALKERFPGIKYVRGCGIVGAEFAPKPDPNNPLSQALGKS 478

Query: 484 --EAYVSKTHYASGCHDV------PCNSDAGF------HEAVRIAKKADFVIVVAGLDLT 529
             E       YA G  D+           A F         ++  +  D VI   G+   
Sbjct: 479 REEMEAIARQYAIGVQDILNYVRRQERMQASFLPELDVQSVLKELEGIDVVIFAGGISPR 538

Query: 530 QETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
            E E+           DR  + LP  Q  L+ ++    K+ +++  +G     +      
Sbjct: 539 FEGEEMPVNLPGFKGGDRTDIQLPQVQRDLMKALHDAGKKVILVNFSGCA---IGLVPET 595

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSR 638
               +IL   YPGE G  A+ +++FGD NP G+LP+T+Y  S   +P   + +M+     
Sbjct: 596 ESCDAILQAWYPGEEGGLAITDVLFGDVNPSGKLPVTFY-RSVEDLPDFENYDMK----- 649

Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRL 698
              G +YR++ G  ++ FG+GLSY+ + YK           A ++  S    ++ TG R 
Sbjct: 650 ---GHTYRYFKGKPLFPFGYGLSYSTFRYK----------RAKVRNNSLIIPVKNTGKR- 695

Query: 699 DYVHIDEVTSCTSLRFHVQISVTNAGDVDGS-HVVMLFARV 738
                 E T        VQ+ V   GD DG    +  F RV
Sbjct: 696 ------EATEV------VQVYVRRKGDPDGPVKTLRAFRRV 724


>gi|300777563|ref|ZP_07087421.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
 gi|300503073|gb|EFK34213.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
          Length = 896

 Score =  275 bits (702), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 160/447 (35%), Positives = 233/447 (52%), Gaps = 59/447 (13%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N  L ++ R ++L++LLT +EKI  + DN+ A+PRL IPAY WW+E+LHG+A  G 
Sbjct: 44  YPFRNPDLPVNERIENLLTLLTTEEKIGMMMDNSQAVPRLEIPAYGWWNEALHGVARAG- 102

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGL 162
                      T FPQ +  AA+++          ++ EARA YN             GL
Sbjct: 103 ---------IATVFPQAIGMAATWDVPEHFKTFEMISDEARAKYNRSFDEALKTGRYEGL 153

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           TFW PNINIFRDPRWGRGQET GEDP + S   V  VK  QG + K              
Sbjct: 154 TFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGNDPKF------------- 200

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
                             AC KH   +   +W   +R+S+NA I+++D  +T+ P F++ 
Sbjct: 201 --------------FKTHACAKHFAVHSGPEW---NRHSYNAEISKRDLYETYLPAFKAL 243

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++G    +MC+YN  +G P C    L  +  R +W + G + SDC A+A  F+ + Y  
Sbjct: 244 VQEGNVREVMCAYNAFDGQPCCANNTLLTEILRGKWKYDGMVVSDCWALADFFQ-KKYHG 302

Query: 342 THED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH D   +AA  LK   D+ CG     +   ++  G + EKDID ++  +      LG+ 
Sbjct: 303 THPDEKTTAADALKHSTDLECGDT-YNNLNKSLASGLITEKDIDESMRRILKGWFELGML 361

Query: 399 NGDPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
             DP+   +    P  V  S EHKK AL  A++ IVL+KN+K  LPLN+N +  +A++GP
Sbjct: 362 --DPKSSVHWNTIPYSVVDSEEHKKQALKMAQKSIVLMKNEKNILPLNRN-IKKIAVVGP 418

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLE 484
             ++     G Y G P S  ++L G++
Sbjct: 419 NADDGLMQLGNYNGTPSSIVTILDGIK 445



 Score = 94.0 bits (232), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 77/295 (26%), Positives = 127/295 (43%), Gaps = 47/295 (15%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           +  K  Y S   +V       F       K AD ++   GL  + E E+           
Sbjct: 592 HTQKGKYVSITFEVYRKDPVNFASVREKVKNADVIVFAGGLSPSLEGEEMMVNAEGFKGG 651

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           D+ S+ LP  Q  L+  + +T K PV+ VL  G  L +   + +    ++L   Y G++G
Sbjct: 652 DKTSIALPKVQRDLLAELRKTGK-PVVFVLCTGSALGLE--QDEKNYDALLNAWYGGQSG 708

Query: 596 AKALAEIIFGDFNPGGRLPMTWYP--ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
             A+A+++ GD+NP G+LP+T+Y   E                +    GR+YR+ T   +
Sbjct: 709 GTAVADVLAGDYNPSGKLPITFYKNLEQLDNALSKTSKHEGFENYDMQGRTYRYMTEKPL 768

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           Y FGHGLSY+    KF+   S+L+ ++                    + ++E  + T   
Sbjct: 769 YPFGHGLSYS----KFVYGDSKLSKNS--------------------ISVNENVTIT--- 801

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
               I VTN  + +G  VV ++ +    +Q  P K L  F+R    +K +K I  
Sbjct: 802 ----IPVTNISEREGEEVVQVYIKRNNDAQA-PVKTLRAFERTPIKSKETKNIQL 851


>gi|53712134|ref|YP_098126.1| beta-glucosidase [Bacteroides fragilis YCH46]
 gi|52214999|dbj|BAD47592.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
          Length = 812

 Score =  275 bits (702), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 231/738 (31%), Positives = 346/738 (46%), Gaps = 126/738 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A+
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +        V+ FQGE    
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTALVRGFQGETL-- 249

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           +DG        K V+                A  KH  +Y    W         A I E+
Sbjct: 250 NDG--------KSVI----------------ATLKHFASYG---WTEGGHNGGTAHIGER 282

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G P    R  L    ++ W FKG++ SD  
Sbjct: 283 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 341

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           AV  + E+      +E +A   + AG+D + GT +      +A+ +G V    ID+A+  
Sbjct: 342 AVGGLREHGVAGNDYE-AAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRR 400

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           + S++ ++GLF+ DP   +  K     V +SEH  LA + ARQ IVLLKN  K LPL K+
Sbjct: 401 ILSLKFQMGLFD-DPFVDE--KQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKD 457

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSKTH---YASGCHDVPCN 502
            + +LA+IGP  +N+  M G YT      +  ++L G+   VSK     YA GC  V  +
Sbjct: 458 -IRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGCA-VRDS 515

Query: 503 SDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------DRVS 539
           S  GF +A+  A+ AD V++V G     D + E E+                    DR +
Sbjct: 516 SRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRAT 575

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L L G+Q+ L+  ++R  K PV+L+   G PL +  A  +++     W  YPG  G  A+
Sbjct: 576 LHLMGRQLELLEEISRLGK-PVVLI--KGRPLLMEGAIQEAEAIVDAW--YPGMQGGNAV 630

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           A+++FGD+NP GRL ++  P S  ++P+   N R   +R      Y    GT  Y FG+G
Sbjct: 631 ADVLFGDYNPAGRLTLS-VPRSVGQLPVY-YNTRRKGNRS----RYVEEPGTPRYPFGYG 684

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT +SY                  +D  +    GS  D   +D           V ++
Sbjct: 685 LSYTTFSY------------------TDMKVQVTEGS--DDCWVD-----------VTVT 713

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           + N G  DG  V  L+ R    S  TP KQL  F R+H  A  S+E++F +D  + L++ 
Sbjct: 714 IQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALY 772

Query: 780 NKHGRRILPLGNHVLMVG 797
            + G  ++  G   +MVG
Sbjct: 773 MQEGEWVVEPGRFTIMVG 790


>gi|385776908|ref|YP_005649476.1| glycoside hydrolase family protein [Sulfolobus islandicus REY15A]
 gi|323475656|gb|ADX86262.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           REY15A]
          Length = 754

 Score =  275 bits (702), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 212/711 (29%), Positives = 335/711 (47%), Gaps = 123/711 (17%)

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
           S T+FPQ +  A+++N  L  +I S +  +AR +  + Q      +P +++ +DPRWGR 
Sbjct: 100 SSTAFPQAIGLASTWNLELVMDIASVIRSQAR-LVGVNQC----LSPVLDVCKDPRWGRC 154

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
           +ET GEDP +V++  + ++   QG+N                                L 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------------------------------QLV 183

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           A  KH  A+   + G   R      +  ++  +TF  PF   ++ GK   IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240

Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
           +P      L     R EWGF G + SD D +  +           ++A   L++G+DI  
Sbjct: 241 IPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF 300

Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
            T  C      +A+ +G V E  IDRA+  +  ++ RLGL + +P   +     P+ +  
Sbjct: 301 PTIDCYSEPLVNALTEGLVPESLIDRAVERVLRIKDRLGLLD-NPFVNENSV--PEKLDD 357

Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
            + ++LAL  AR+ IVLLKN+   LPL+KN V+ +A+IGP  N+   M G YT       
Sbjct: 358 HKSRELALKTARESIVLLKNENNILPLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNI 416

Query: 471 --GIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
             GI     ++L+G+   V  SK  YA GC D+   S  GF EA+ IA++AD +I V G 
Sbjct: 417 DSGIEIV--TVLQGVVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAVMGE 473

Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
                    D+  E E +          DR SL LPG Q  L+  + +T K P+ILVL  
Sbjct: 474 KSGLPLSWTDIPSEEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN 532

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PL +S     + + +++   +PGE G  A+A++IFGD+NP GRLP+T +P    ++P+
Sbjct: 533 GRPLVLS--PIINYVKAVIEAWFPGEEGGNAIADVIFGDYNPSGRLPIT-FPMDTGQIPL 589

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
              N +  S      R Y     + ++ FG+GLSYT + Y  L   P E+        G 
Sbjct: 590 Y-YNRKPSSF-----RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEI--------GP 635

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
           + NI                         + I V N G ++G  VV L+      S   P
Sbjct: 636 NSNI------------------------AISIDVKNVGKMEGDDVVQLYVSKTFSSVARP 671

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            K+L GF ++H      + + F + P E L+  +   R ++  G + L++G
Sbjct: 672 VKELKGFAKIHLKPGEKRRVKF-ILPTEALAFYDSFMRLVVEKGEYQLLIG 721


>gi|423313768|ref|ZP_17291703.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
           CL09T03C04]
 gi|392684303|gb|EIY77631.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
           CL09T03C04]
          Length = 788

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 220/736 (29%), Positives = 345/736 (46%), Gaps = 129/736 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L + IG    V
Sbjct: 138 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKKLIARIGE---V 182

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA+    LG   +  ++P ++I +DPRWGR  ET GEDP +V     + + S Q  N   
Sbjct: 183 EAKEAVALGYTNI--YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKYN--- 237

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                        L A  KH   Y +   G   +   +  +  +
Sbjct: 238 -----------------------------LVATPKHFAVYSIPIGGRDGKTRTDPHVAPR 268

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
           +    +  PFR   ++  A  +M SYN  +G P  + G    L +  R EWGFKGY+ SD
Sbjct: 269 EMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 326

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
            +AV  I        T+ED  A  + AG++I          +   + A+D GK+ ++ +D
Sbjct: 327 SEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFILPLRKAVDNGKISQETLD 386

Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
           + +  +  ++ RLGLF+ +P +G  GK     V + EH+ ++L+AARQ +VLLKN+   L
Sbjct: 387 KRVAEILRIKFRLGLFD-NPYRGN-GKQAEQIVHSKEHQAVSLEAARQSLVLLKNETNLL 444

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCH--- 497
           PL+K ++ S+A+IGP  N  +Q+   Y       K++ +G++  +  ++  Y  GC    
Sbjct: 445 PLSK-SIRSIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHAEVIYKKGCDIID 503

Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
                    D P  ++      EA+R AK+A+ V++V G +     EDR R SL LPG+Q
Sbjct: 504 PHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQ 563

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             L+ +V  T K PVILV+  G    +++A A   + +IL   +PGE   +A+AE +FGD
Sbjct: 564 EELLKAVCATGK-PVILVMLDGRASSINYAAA--HVPAILHAWFPGEFCGQAVAEALFGD 620

Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
           +NPGGRL +T +P+S  ++P        +D S           + T VYG    FGHGLS
Sbjct: 621 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDES-----------SSTSVYGALYPFGHGLS 668

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT ++Y      S+L IS S             G + D               HV   + 
Sbjct: 669 YTTFTY------SDLHISPS-----------HQGVQGD--------------IHVSCKIK 697

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G + G  VV L+ R    S  T  K L GF+R+   A   + + F + P + L + +K
Sbjct: 698 NTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP-QDLGLWDK 756

Query: 782 HGRRILPLGNHVLMVG 797
           +    +  G+  +M+G
Sbjct: 757 NMNFRVEPGSFKVMLG 772


>gi|383113360|ref|ZP_09934132.1| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
 gi|382948727|gb|EFS32364.2| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
          Length = 954

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 226/780 (28%), Positives = 361/780 (46%), Gaps = 130/780 (16%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + + SL +  R +SL++++T ++K++ + +      IP L +P      E++HG +    
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   S  T FPQ L   A++NR L   +   +  E  A  N  QA    W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            +D RWGR +ET GEDP++VS     ++K +Q                            
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 306

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
            RG    L    KH   +     G   R S +  ++E++  +    PFR  I       +
Sbjct: 307 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHAIRNYDCQSL 358

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG- 349
           M +Y+   G+P     +L Q+  R EWGF G+I SDC A+  +   ++YT   +  AA  
Sbjct: 359 MMAYSDYMGIPVAKSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQ 418

Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
            L AG+  NCG T   +    A   G++  +++D     + S   R  LF  +P K   +
Sbjct: 419 ALAAGIATNCGDTYNNKEVIQAAKDGRINMENLDNVCRTMLSTMFRNELFEKNPCKPLDW 478

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            K+ P    +  HK++A  AAR+ IV+L+N +  LPL KN + ++A++GP  +++    G
Sbjct: 479 KKIYPG-WNSDSHKEMARQAARESIVMLENKENLLPLTKN-LRTIAVLGPGADDLQP--G 534

Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
            YT   +P   KS+L G++  V   +K  Y  GC D     +    +AV+ A ++D V++
Sbjct: 535 DYTPKLLPGQLKSVLTGIKEAVGKQTKVLYEQGC-DFTNPDETNIPKAVKAASQSDVVVM 593

Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           V G   T E          E+ D  +L+LPG+Q  L+ +V  T K PVIL+L  G P D+
Sbjct: 594 VLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI 652

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
              +A     +IL    PG+ G  A+A+++FGD+NPGGRLPMT +P    ++P+   N +
Sbjct: 653 --LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPGGRLPMT-FPRHVGQLPLY-YNFK 708

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S R+Y      +Y    +Y FG GLSYT++ Y                  SD  I ++
Sbjct: 709 T-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEY------------------SDLKIQEK 746

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
               +                 VQ +V N G   G  V  L+      S  T   +L  F
Sbjct: 747 PNGNVT----------------VQATVKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDF 790

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHSL 803
           DR++     SK +SF + P + +S+ N H  R++  G   + VG          E++HS+
Sbjct: 791 DRIYLQPGESKTVSFELTPYD-ISLLNDHMDRVVEKGEFKICVGGMSPDYVAKNEIKHSV 849


>gi|15899739|ref|NP_344344.1| Beta-xylosidase [Sulfolobus solfataricus P2]
 gi|13816430|gb|AAK43134.1| Beta-xylosidase [Sulfolobus solfataricus P2]
          Length = 754

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 213/711 (29%), Positives = 332/711 (46%), Gaps = 123/711 (17%)

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
           S T+FPQ +  A+++N  L +N+ S +  + R +  + Q      +P +++ RDPRWGR 
Sbjct: 100 SSTAFPQAIGLASTWNPELLTNVASTIRSQGR-LIGVNQC----LSPVLDVCRDPRWGRC 154

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
           +ET GEDP +V++  + ++   QGE                                 L 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGET-------------------------------QLV 183

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           A  KH  A+   + G   R      +  ++  +TF  PF   ++ GK   IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240

Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
           VP      L     R EWGF G + SD D +  +           ++A   L++G+DI  
Sbjct: 241 VPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLEAIHKVASNKMEAAILALESGVDIEF 300

Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
            T  C      +AI +G V E  IDRA+  +  ++ RLGL + +P   +     P+ +  
Sbjct: 301 PTIDCYGEPLVTAIKEGLVSEAIIDRAVERVLRIKERLGLLD-NPFVDESAV--PERLDD 357

Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
            + ++LAL AAR+ IVLLKN+   LPL+KN ++ +A+IGP  N+   M G YT       
Sbjct: 358 RKSRELALKAARESIVLLKNENNMLPLSKN-INKIAVIGPNANDPRNMLGDYTYTGHLNI 416

Query: 471 --GIPCSPKSLLRGLEAYVS--KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
             GI     ++L+G+   V   K  YA GC D+   S  GF EA+ IAK+AD +I V G 
Sbjct: 417 DSGIEIV--TVLQGIAKKVGEGKVLYAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGE 473

Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
                    D+  E E +          DR SL L G Q  L+  + +T K P+ILVL  
Sbjct: 474 KSGLPLSWTDIPSEEEFKKYQAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLIN 532

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PL +S     + + +I+   +PGE G  A+A+IIFGD+NP GRLP+T +P    ++P+
Sbjct: 533 GRPLVLS--PIINYVKAIIEAWFPGEEGGNAIADIIFGDYNPSGRLPIT-FPMDTGQIPL 589

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
                   S +    R Y     + ++ FG+GLSYT + Y  L   P E+          
Sbjct: 590 ------YYSRKPSSFRPYVMLHSSPLFTFGYGLSYTQFEYSNLEVTPKEV---------- 633

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                      L Y+ I              + V N G+++G  VV L+      S   P
Sbjct: 634 ---------GPLSYITI-------------LLDVKNVGNMEGDEVVQLYISKSFSSVARP 671

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            K+L GF +VH      + + F + P E L+  +   R ++  G + +++G
Sbjct: 672 VKELKGFAKVHLKPGEKRRVKFAL-PMEALAFYDNFMRLVVEKGEYQILIG 721


>gi|284174578|ref|ZP_06388547.1| Beta-xylosidase [Sulfolobus solfataricus 98/2]
 gi|356934752|gb|AET42953.1| beta-xylosidase-like protein [Sulfolobus solfataricus 98/2]
          Length = 754

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 213/711 (29%), Positives = 332/711 (46%), Gaps = 123/711 (17%)

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
           S T+FPQ +  A+++N  L +N+ S +  + R +  + Q      +P +++ RDPRWGR 
Sbjct: 100 SSTAFPQAIGLASTWNPELLTNVASTIRSQGR-LIGVNQC----LSPVLDVCRDPRWGRC 154

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
           +ET GEDP +V++  + ++   QGE                                 L 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGET-------------------------------QLV 183

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           A  KH  A+   + G   R      +  ++  +TF  PF   ++ GK   IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240

Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
           VP      L     R EWGF G + SD D +  +           ++A   L++G+DI  
Sbjct: 241 VPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLEAIHKVASNKMEAAILALESGVDIEF 300

Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
            T  C      +AI +G V E  IDRA+  +  ++ RLGL + +P   +     P+ +  
Sbjct: 301 PTIDCYGEPLVTAIKEGLVSEAIIDRAVERVLRIKERLGLLD-NPFVDESAV--PERLDD 357

Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
            + ++LAL AAR+ IVLLKN+   LPL+KN ++ +A+IGP  N+   M G YT       
Sbjct: 358 RKSRELALKAARESIVLLKNENNMLPLSKN-INKIAVIGPNANDPRNMLGDYTYTGHLNI 416

Query: 471 --GIPCSPKSLLRGLEAYVS--KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
             GI     ++L+G+   V   K  YA GC D+   S  GF EA+ IAK+AD +I V G 
Sbjct: 417 DSGIEIV--TVLQGIAKKVGEGKVLYAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGE 473

Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
                    D+  E E +          DR SL L G Q  L+  + +T K P+ILVL  
Sbjct: 474 KSGLPLSWTDIPSEEEFKKYQAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLIN 532

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PL +S     + + +I+   +PGE G  A+A+IIFGD+NP GRLP+T +P    ++P+
Sbjct: 533 GRPLVLS--PIINYVKAIIEAWFPGEEGGNAIADIIFGDYNPSGRLPIT-FPMDTGQIPL 589

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
                   S +    R Y     + ++ FG+GLSYT + Y  L   P E+          
Sbjct: 590 ------YYSRKPSSFRPYVMLHSSPLFTFGYGLSYTQFEYSNLEVTPKEV---------- 633

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                      L Y+ I              + V N G+++G  VV L+      S   P
Sbjct: 634 ---------GPLSYITI-------------LLDVKNVGNMEGDEVVQLYISKSFSSVARP 671

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            K+L GF +VH      + + F + P E L+  +   R ++  G + +++G
Sbjct: 672 VKELKGFAKVHLKPGEKRRVKFAL-PMEALAFYDNFMRLVVEKGEYQILIG 721


>gi|333379224|ref|ZP_08470948.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
           22836]
 gi|332885492|gb|EGK05741.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
           22836]
          Length = 745

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 211/692 (30%), Positives = 337/692 (48%), Gaps = 97/692 (14%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
           T FP  L  + S++           A EA A       G+ + +AP ++I RDPRWGR  
Sbjct: 120 TIFPIPLAESCSWDLEAMERSARIAASEATA------EGINWIYAPMVDISRDPRWGRVA 173

Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
           E  GED  + S  A   VK FQG+N  + +                           + A
Sbjct: 174 EGAGEDVYLGSLIAAARVKGFQGDNLSAVN--------------------------TVVA 207

Query: 242 CCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           C KH  AY     G      +N + ++  +  +T+ PPF++ ++ G    IM S+N +NG
Sbjct: 208 CVKHYAAYGATMAGR----DYNTVDMSLNELWNTYLPPFKAALDAG-CGTIMTSFNDLNG 262

Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
           +PA     L +   R++W F G++ +D  ++  +  +  Y    + SA   + AG+D++ 
Sbjct: 263 IPATGNKYLLKDILRDKWNFNGFVVTDYTSINEMIPH-GYANDEKHSAEIAMNAGVDMDM 321

Query: 360 -GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS 418
            G   + H ++ I++GKV EKD+  A   +  ++ +LGLF  DP +         D+ T 
Sbjct: 322 QGGVYMNHLKTLIEEGKVSEKDVTEAARAILKIKYKLGLFE-DPYRYCDANREKTDILTP 380

Query: 419 EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI---PCS 475
            +K+ A D AR+ +VLLKNDK+ LPL +N    +A+IGPLV +  ++ G ++ +      
Sbjct: 381 ANKEAARDMARKSMVLLKNDKQTLPLKEN--KRVALIGPLVKDKYEILGCWSAMGNRDTI 438

Query: 476 PKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETE 533
           P S+  GL   +   K  YA GC D+      GF EAVR+A  +D V++V G       E
Sbjct: 439 PVSVYDGLVEAIGKDKISYAKGC-DIQSEDTKGFAEAVRVASASDVVVMVMGEFHNMSGE 497

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
           +  R +L LPG Q+ L+ ++ +T K PV+LVL  G PL +++ E D+ + +IL   +PG 
Sbjct: 498 NNSRTNLSLPGVQVDLLKAIKKTGK-PVVLVLMNGRPLTINW-EKDN-LDAILEAWFPGT 554

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ- 652
            G  A+A+++ G +NP G+L MT +P++  ++P+       ++ R Y     +F  G++ 
Sbjct: 555 MGGAAIADVLTGKYNPSGKLTMT-FPQNVGQIPL--FYNHKNTGRPYDPNVPQFAYGSRY 611

Query: 653 -------VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
                  +Y FG+GLSYT ++Y      S+LT+S+                        E
Sbjct: 612 WDVSNEPLYPFGYGLSYTTFTY------SDLTLSSK-----------------------E 642

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
           +T    L+  V + +TN+G+ DG  VV L+ R    S   P K+L GF +V   A  SK 
Sbjct: 643 ITKENPLK--VSVKLTNSGEYDGEEVVQLYTRDLVGSVTRPVKELKGFKKVFLKAGESKV 700

Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           I F +     L   N     +   G+  L VG
Sbjct: 701 IDFTLS-VNDLRFYNSQLEYVYEPGDFHLFVG 731


>gi|284998833|ref|YP_003420601.1| glycoside hydrolase family protein [Sulfolobus islandicus L.D.8.5]
 gi|284446729|gb|ADB88231.1| glycoside hydrolase, family 3 domain protein [Sulfolobus islandicus
           L.D.8.5]
          Length = 754

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 210/711 (29%), Positives = 335/711 (47%), Gaps = 123/711 (17%)

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
           S T+FPQ +  A+++N  L  +I S +  + R +  + Q      +P +++ +DPRWGR 
Sbjct: 100 SSTAFPQAIGLASTWNPELVMDIASVIRSQGR-LVGVNQC----LSPVLDVCKDPRWGRC 154

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
           +ET GEDP +V++  + ++   QG+N                                L 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------------------------------QLV 183

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           A  KH  A+   + G   R      +  ++  +TF  PF   ++ GK   IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240

Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
           +P      L     R EWGF G + SD D +  +           ++A   L++G+DI  
Sbjct: 241 IPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF 300

Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
            T  C      +A+ +G V E  IDRA+  +  ++ RLGL + +P   +     P+ +  
Sbjct: 301 PTIDCYGEPLVNALKEGLVPESLIDRAVERVLRIKDRLGLLD-NPFVNENSV--PEKLDD 357

Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
            + ++LAL  AR+ IVLLKN+   LPL+KN V+ +A+IGP  N+   M G YT       
Sbjct: 358 HKSRELALKTARESIVLLKNENNILPLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNI 416

Query: 471 --GIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
             GI     ++L+G+   V  SK  YA GC D+   S  GF EA+ IA++AD +I + G 
Sbjct: 417 DSGIEIV--TVLQGIVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAIMGE 473

Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
                   +D+  E E +          DR SL LPG Q  L+  + +T K P+ILVL  
Sbjct: 474 KSGLPLSWMDIPSEEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN 532

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PL +S     + + +++   +PGE G  A+A++IFGD+NP GRLP+T +P    ++P+
Sbjct: 533 GRPLVLS--SIINYVKAVIEAWFPGEEGGNAIADVIFGDYNPSGRLPIT-FPMDTGQIPL 589

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
              N +  S      R Y     + ++ FG+GLSYT + Y  L   P E+        G 
Sbjct: 590 Y-YNRKPSSF-----RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEI--------GP 635

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
           + NI                         + I V N G ++G  VV L+      S   P
Sbjct: 636 NSNI------------------------AISIDVKNVGKMEGDDVVQLYVSKTFSSVARP 671

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            K+L GF ++H      + + F + P E L+  +   R ++  G + L++G
Sbjct: 672 VKELKGFAKIHLKPGEKRRVKF-ILPTEALAFYDSFMRLVVEKGEYQLLIG 721


>gi|299149395|ref|ZP_07042452.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298512582|gb|EFI36474.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 950

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 226/780 (28%), Positives = 361/780 (46%), Gaps = 130/780 (16%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + + SL +  R +SL++++T ++K++ + +      IP L +P      E++HG +    
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 222

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   S  T FPQ L   A++NR L   +   +  E  A  N  QA    W+P +++
Sbjct: 223 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            +D RWGR +ET GEDP++VS     ++K +Q                            
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 302

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
            RG    L    KH   +     G   R S +  ++E++  +    PFR  I       +
Sbjct: 303 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHAIRNYDCQSL 354

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG- 349
           M +Y+   G+P     +L Q+  R EWGF G+I SDC A+  +   ++YT   +  AA  
Sbjct: 355 MMAYSDYMGIPVAKSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQ 414

Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
            L AG+  NCG T   +    A   G++  +++D     + S   R  LF  +P K   +
Sbjct: 415 ALAAGIATNCGDTYNNKEVIQAAKDGRINMENLDNVCRTMLSTMFRNELFEKNPCKPLDW 474

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            K+ P    +  HK++A  AAR+ IV+L+N +  LPL KN + ++A++GP  +++    G
Sbjct: 475 KKIYPG-WNSDSHKEMARQAARESIVMLENKENLLPLTKN-LRTIAVLGPGADDLQP--G 530

Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
            YT   +P   KS+L G++  V   +K  Y  GC D     +    +AV+ A ++D V++
Sbjct: 531 DYTPKLLPGQLKSVLTGIKEAVGKQTKVLYEQGC-DFTNPDETNIPKAVKAASQSDVVVM 589

Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           V G   T E          E+ D  +L+LPG+Q  L+ +V  T K PVIL+L  G P D+
Sbjct: 590 VLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI 648

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
              +A     +IL    PG+ G  A+A+++FGD+NPGGRLPMT +P    ++P+   N +
Sbjct: 649 --LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPGGRLPMT-FPRHVGQLPLY-YNFK 704

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S R+Y      +Y    +Y FG GLSYT++ Y                  SD  I ++
Sbjct: 705 T-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEY------------------SDLKIQEK 742

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
               +                 VQ +V N G   G  V  L+      S  T   +L  F
Sbjct: 743 PNGNVT----------------VQATVKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDF 786

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHSL 803
           DR++     SK +SF + P + +S+ N H  R++  G   + VG          E++HS+
Sbjct: 787 DRIYLQPGESKTVSFELTPYD-ISLLNDHMDRVVEKGEFKICVGGMSPDYVAKNEIKHSV 845


>gi|188993706|ref|YP_001905716.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
 gi|167735466|emb|CAP53681.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
          Length = 896

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 159/462 (34%), Positives = 242/462 (52%), Gaps = 54/462 (11%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ + +  +  RA  L+S +TL+EK  Q+ + A AIPRL +P Y+WW+E+LHG+A  G  
Sbjct: 39  PYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 96

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG--------LTF 164
                     T FPQ +  AA+F+  L + + +A++ EARA ++   AG        LTF
Sbjct: 97  --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKRYQGLTF 148

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  Q +                   
Sbjct: 149 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQ------------------- 189

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                   +G    L A  KH   Y +       R+ F+   +E+D  +T+ P F++ ++
Sbjct: 190 --------QGPYRKLDATAKH---YAVHSGPEADRHHFDVHPSERDLYETYLPAFQALVQ 238

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           +G  + +M +YN+VNG  A     L    R +WGF GYI SDC A+  I++      T E
Sbjct: 239 EGHVAAVMGAYNRVNGESASASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPE 298

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  +K G D++CG        +A+  G + E  IDR+L  L + +LRLG+F+  P K
Sbjct: 299 AAAALGVKHGTDLDCGDTYA-ALPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAK 356

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             + ++      + +H  LA   AR+ +VLLKND   LPL K  +  +A++GP  ++   
Sbjct: 357 VPWAQIPASANQSPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMS 414

Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
           + G Y G P +P ++L+G+   A  ++  YA G   V    D
Sbjct: 415 LLGNYYGTPAAPVTILQGIRDAAPQAEVVYARGSDLVEGRED 456



 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 85/286 (29%), Positives = 124/286 (43%), Gaps = 52/286 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRD----------RVSLLLPGQQMSLVTSVART 556
             EAV  A+ AD V+ V GL    E E+ D          R    LP  Q  L+ ++  T
Sbjct: 620 LQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQAT 679

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
              PV+ VLT G  L + +A+    + +IL   YPG+ G  A+ +++FG  +PGGRLP+T
Sbjct: 680 GT-PVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPIT 736

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y E       +D  MR        GR+YR++ G  +Y FGHGL+YT ++Y  L      
Sbjct: 737 FYKEDERLPAFDDYAMR--------GRTYRYFDGKPLYPFGHGLAYTQFAYSNL------ 782

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                                     +D  T          +SV N G   G  VV L+ 
Sbjct: 783 -------------------------RLDRTTVAADGTLRATVSVKNTGQRAGDEVVQLYL 817

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
                 +    K+L GF R+       +E+SF + P E L I ++ 
Sbjct: 818 HPLNPQRERARKELRGFQRITLQPGEHREVSFNITPREALRIYDEQ 863


>gi|336417087|ref|ZP_08597416.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
           3_8_47FAA]
 gi|335936712|gb|EGM98630.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
           3_8_47FAA]
          Length = 954

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 226/780 (28%), Positives = 361/780 (46%), Gaps = 130/780 (16%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + + SL +  R +SL++++T ++K++ + +      IP L +P      E++HG +    
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   S  T FPQ L   A++NR L   +   +  E  A  N  QA    W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            +D RWGR +ET GEDP++VS     ++K +Q                            
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 306

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
            RG    L    KH   +     G   R S +  ++E++  +    PFR  I       +
Sbjct: 307 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHAIRNYDCQSL 358

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG- 349
           M +Y+   G+P     +L Q+  R EWGF G+I SDC A+  +   ++YT   +  AA  
Sbjct: 359 MMAYSDYMGIPVAKSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQ 418

Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
            L AG+  NCG T   +    A   G++  +++D     + S   R  LF  +P K   +
Sbjct: 419 ALAAGIATNCGDTYNNKEVIQAAKDGRIDMENLDNVCRTMLSTMFRNELFEKNPCKPLDW 478

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            K+ P    +  HK++A  AAR+ IV+L+N +  LPL KN + ++A++GP  +++    G
Sbjct: 479 KKIYPG-WNSDSHKEMARQAARESIVMLENKENLLPLTKN-LRTIAVLGPGADDLQP--G 534

Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
            YT   +P   KS+L G++  V   +K  Y  GC D     +    +AV+ A ++D V++
Sbjct: 535 DYTPKLLPGQLKSVLTGIKEAVGKQTKVLYEQGC-DFTNPDETNIPKAVKAASQSDVVVM 593

Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           V G   T E          E+ D  +L+LPG+Q  L+ +V  T K PVIL+L  G P D+
Sbjct: 594 VLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI 652

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
              +A     +IL    PG+ G  A+A+++FGD+NPGGRLPMT +P    ++P+   N +
Sbjct: 653 --LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPGGRLPMT-FPRHVGQLPLY-YNFK 708

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S R+Y      +Y    +Y FG GLSYT++ Y                  SD  I ++
Sbjct: 709 T-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEY------------------SDLKIQEK 746

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
               +                 VQ +V N G   G  V  L+      S  T   +L  F
Sbjct: 747 PNGNVT----------------VQATVKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDF 790

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHSL 803
           DR++     SK +SF + P + +S+ N H  R++  G   + VG          E++HS+
Sbjct: 791 DRIYLQPGESKTVSFELTPYD-ISLLNDHMDRVVEKGEFKICVGGMSPDYVAKNEIKHSV 849


>gi|197106390|ref|YP_002131767.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
 gi|196479810|gb|ACG79338.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
          Length = 888

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 169/467 (36%), Positives = 238/467 (50%), Gaps = 71/467 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +T L    RA  L++ +TL+EK +Q+   A AIPRLG+PAY WW+E LHG+A  G   
Sbjct: 38  YRDTRLPAERRAADLVARMTLEEKSRQIGHTAPAIPRLGVPAYNWWNEGLHGVARAG--- 94

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY---------NLGQAGLTF 164
                    T FPQ +  AA+++          +  E RA Y              GLT 
Sbjct: 95  -------IATVFPQAIGMAATWDVDRMRGTADVIGTEFRAKYAERVHPDGSTDWYRGLTV 147

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP +     V F++  QG++           F+     
Sbjct: 148 WSPNINIFRDPRWGRGQETYGEDPYLTGRMGVAFIRGLQGQDPNF--------FKTIATA 199

Query: 225 KGF----GEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
           K +    G ES+R  E +      H  AYDL                    EDT+ P FR
Sbjct: 200 KHYAVHSGPESNRHREDV------HPSAYDL--------------------EDTYLPAFR 233

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQN- 338
           + + +GK   +MC+YN V+GVPAC   DL  Q+ R +WGF G++ SDC A A I+   + 
Sbjct: 234 AAVTEGKVQAVMCAYNAVDGVPACASEDLMDQRLRRDWGFSGHVVSDCGAAANIYREDSL 293

Query: 339 -YTKTHEDSAAGVLKAGMDINCG------TCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
            Y KT E+     L AGMD+ CG            T SA+ KG + E  +D AL+ LF+ 
Sbjct: 294 AYVKTPEEGITRALNAGMDLVCGDYRADWNTEAEATVSAVRKGMLDETVLDGALVRLFAD 353

Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
           ++RLGLF+  P +  + K+      T EH+ ++L+ A+  + LLKND   LPL K     
Sbjct: 354 RIRLGLFD-PPAEVPFSKITAAQNDTPEHRAMSLEMAKASMTLLKND-GVLPL-KGEPRR 410

Query: 452 LAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
           +A++GP  +++  + G Y G P +P ++L G+ A   K    YA G 
Sbjct: 411 IAVVGPNADSVDALIGNYYGTPSNPVTVLAGIRARFPKAEVVYAEGT 457



 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 95/290 (32%), Positives = 135/290 (46%), Gaps = 54/290 (18%)

Query: 518 DFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
           D V+ V GL    E E+           DR SL LP  Q  L+  +  T K PV+LVL  
Sbjct: 613 DLVVFVGGLTARVEGEEMKLQVPGFAGGDRTSLDLPAPQQDLLRRLHATGK-PVVLVLMN 671

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G  L V++A  D+ + +I+   YPG  G  A+A+++ GD++P GRLP+T+Y  +    P 
Sbjct: 672 GSALSVNWA--DANLPAIVEAWYPGGEGGHAVAQLLAGDYSPAGRLPVTFYRSAGDLPPF 729

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
            D  M+        GR+YR++ G  +Y FG+GLSYT +SY    AP     S S      
Sbjct: 730 ADYAMK--------GRTYRYFGGEVLYPFGYGLSYTRFSY---GAPQLSARSVSADG--- 775

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
                            E+T  T         VTN G +DG  VV L+   P    GTP 
Sbjct: 776 -----------------EITVTT--------QVTNTGGMDGEEVVQLYVSHPG-RDGTPI 809

Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           + L GF R+      ++ +SF +    QLS+ +  G R +  G   + VG
Sbjct: 810 RALQGFQRIGLKRGETRPVSFTLK-DRQLSVVDAEGNRRVEPGRVEVWVG 858


>gi|325919363|ref|ZP_08181395.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
 gi|325550152|gb|EGD20974.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
          Length = 876

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 159/462 (34%), Positives = 241/462 (52%), Gaps = 54/462 (11%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +T      RA  L++ +TL+EK  Q+ + A AIPRL +P Y+WW+E+LHG+A  G  
Sbjct: 19  PYLDTQRPFDARAADLVARMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 76

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
                     T FPQ +  AA+F+  L + + +A++ EARA ++   A        GLTF
Sbjct: 77  --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEYKRYQGLTF 128

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  Q +                   
Sbjct: 129 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQ------------------- 169

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                   +G    L A  KH   + +       R+ F+   +E+D  +T+ P F++ ++
Sbjct: 170 --------QGPYRKLDATAKH---FAVHSGPEADRHHFDVHPSERDLHETYLPAFQALVQ 218

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           +GK + +M +YN+VNG  A     L    R +WGF GYI SDC A+  I++      T E
Sbjct: 219 EGKVAAVMGAYNRVNGESASASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPE 278

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  +K G D++CG        +A+  G + E  ID AL  L + ++RLG+F+  P K
Sbjct: 279 AAAALGVKHGTDLDCGDTYA-ALPAAVRAGLIDEATIDTALKRLMTTRMRLGMFD-PPAK 336

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             + ++      + +H  LA   AR+ +VLLKND   LPL K  +  +A+IGP  ++   
Sbjct: 337 VPWAQIPASANQSPQHDALARRTARESLVLLKND-GVLPL-KPTLKRIAVIGPTADDPMS 394

Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
           + G Y G P +P ++L+G+   A  ++  YA G   V    D
Sbjct: 395 LLGNYYGTPAAPVTILQGIRDAAPQAQVIYARGSDLVEGRED 436



 Score =  113 bits (282), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 85/320 (26%), Positives = 132/320 (41%), Gaps = 59/320 (18%)

Query: 480 LRGLEAYVSKTHYASGCHDVPCN-------SDAGFHEAVRIAKKADFVIVVAGLDLTQET 532
           L G +AY  +  Y     D           +     EAV  A+ A+ V+ V GL    E 
Sbjct: 566 LEGGKAYDLRVEYYEATRDAGVRLAWRMPGAKPPLQEAVDAARDAEVVVFVGGLTGDVEG 625

Query: 533 EDRD----------RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI 582
           E+ D          R    LP  Q  L+ ++  T   PV+ VLT G  L + +A+    +
Sbjct: 626 EEMDVNYPGFAGGDRTDTRLPKPQRELLQALQATGT-PVVAVLTTGSALAIDWAQ--QHV 682

Query: 583 SSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG 642
            +IL   YPG+ G  A+ +++FG  +PGGRLP+T+Y E+      +D  MR        G
Sbjct: 683 PAILLAWYPGQRGGSAVGDVLFGQASPGGRLPVTFYKEAERLPAFDDYAMR--------G 734

Query: 643 RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVH 702
           R+YR++ G  +Y FGHGLSYT ++Y  L                                
Sbjct: 735 RTYRYFQGKPLYPFGHGLSYTQFAYSDL-------------------------------R 763

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
           +D  T          +++ N G   G  VV L+    K  +    K+L G  R+      
Sbjct: 764 LDRTTVAADGTLTATVTLKNTGQRAGDEVVQLYLHPLKPQRERALKELHGLQRITLQPGE 823

Query: 763 SKEISFGVDPCEQLSIANKH 782
            +++ F +   + L I ++ 
Sbjct: 824 QRQLRFTIKAQDALRIYDEQ 843


>gi|21233528|ref|NP_639445.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66770493|ref|YP_245255.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|21115383|gb|AAM43327.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66575825|gb|AAY51235.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 896

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 159/462 (34%), Positives = 241/462 (52%), Gaps = 54/462 (11%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ + +  +  RA  L+S +TL+EK  Q+ + A AIPRL +P Y+WW+E+LHG+A  G  
Sbjct: 39  PYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 96

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG--------LTF 164
                     T FPQ +  AA+F+  L + + +A++ EARA ++   AG        LTF
Sbjct: 97  --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKRYQGLTF 148

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  Q +                   
Sbjct: 149 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQ------------------- 189

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                   +G    L A  KH   Y +       R+ F+   +E+D  +T+ P F++ ++
Sbjct: 190 --------QGPYRKLDATAKH---YAVHSGPEADRHHFDVHPSERDLYETYLPAFQALVQ 238

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           +G  + +M +YN+VNG  A     L    R +WGF GYI SDC A+  I++      T E
Sbjct: 239 EGHVAAVMGAYNRVNGESASASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPE 298

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  +K G D++CG        +A+  G + E  IDR+L  L + +LRLG+F+  P K
Sbjct: 299 AAAALGVKHGTDLDCGDTYA-ALPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAK 356

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             + +       + +H  LA   AR+ +VLLKND   LPL K  +  +A++GP  ++   
Sbjct: 357 VPWAQTPASANQSPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMS 414

Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
           + G Y G P +P ++L+G+   A  ++  YA G   V    D
Sbjct: 415 LLGNYYGTPAAPVTILQGIRDAAPQAEVVYARGSDLVEGRED 456



 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 85/286 (29%), Positives = 124/286 (43%), Gaps = 52/286 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRD----------RVSLLLPGQQMSLVTSVART 556
             EAV  A+ AD V+ V GL    E E+ D          R    LP  Q  L+ ++  T
Sbjct: 620 LQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQAT 679

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
              PV+ VLT G  L + +A+    + +IL   YPG+ G  A+ +++FG  +PGGRLP+T
Sbjct: 680 GT-PVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPIT 736

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y E       +D  MR        GR+YR++ G  +Y FGHGL+YT ++Y  L      
Sbjct: 737 FYKEDERLPAFDDYAMR--------GRTYRYFDGKPLYPFGHGLAYTQFAYSNL------ 782

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                                     +D  T          +SV N G   G  VV L+ 
Sbjct: 783 -------------------------RLDRTTVAADGTLRATVSVKNTGQRAGDEVVQLYL 817

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
                 +    K+L GF R+       +E+SF + P E L I ++ 
Sbjct: 818 HPLNPQRERARKELRGFQRITLQPGEHREVSFNITPREALRIYDEQ 863


>gi|384430040|ref|YP_005639401.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
           756C]
 gi|341939144|gb|AEL09283.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
           756C]
          Length = 896

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 159/462 (34%), Positives = 241/462 (52%), Gaps = 54/462 (11%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ + +  +  RA  L+S +TL+EK  Q+ + A AIPRL +P Y+WW+E+LHG+A  G  
Sbjct: 39  PYLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG-- 96

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
                     T FPQ +  AA+F+  L + + +A++ EARA ++   A        GLTF
Sbjct: 97  --------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEHKRYQGLTF 148

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W+PNINIFRDPRWGRGQET GEDP + +   V FV+  Q +                   
Sbjct: 149 WSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQ------------------- 189

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                   +G    L A  KH   Y +       R+ F+   +E+D  +T+ P F++ ++
Sbjct: 190 --------QGPYRKLDATAKH---YAVHSGPEADRHHFDVHPSERDLYETYLPAFQALVQ 238

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           +G  + +M +YN+VNG  A     L    R +WGF GYI SDC A+  I++      T E
Sbjct: 239 EGHVAAVMGAYNRVNGESASASTRLEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVPTPE 298

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            +AA  +K G D++CG        +A+  G + E  IDR+L  L + +LRLG+F+  P K
Sbjct: 299 AAAALGVKHGTDLDCGDTYA-ALPAAVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAK 356

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             + +       + +H  LA   AR+ +VLLKND   LPL K  +  +A++GP  ++   
Sbjct: 357 VPWAQTPASANQSPQHDALARRTARESLVLLKND-GLLPL-KPTLKRIAVVGPTADDPMS 414

Query: 465 MGGGYTGIPCSPKSLLRGLE--AYVSKTHYASGCHDVPCNSD 504
           + G Y G P +P ++L+G+   A  ++  YA G   V    D
Sbjct: 415 LLGNYYGTPAAPVTILQGIRDAAPQAEVVYARGSDLVEGRED 456



 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/286 (29%), Positives = 123/286 (43%), Gaps = 52/286 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRD----------RVSLLLPGQQMSLVTSVART 556
             EAV  A+ AD V+ V GL    E E+ D          R    LP  Q  L+ ++  T
Sbjct: 620 LQEAVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELLQALQAT 679

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
              PV+ VLT G  L + +A+    + +IL   YPG+ G  A+ +++FG  +PGGRLP+T
Sbjct: 680 GT-PVVAVLTTGSALAIDWAQ--QHVPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPIT 736

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y E       +D  MR        GR+YR++ G  +Y FGHGL+YT ++Y  L      
Sbjct: 737 FYKEDERLPAFDDYAMR--------GRTYRYFDGKPLYPFGHGLAYTQFAYSNL------ 782

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                                     +D  T          + V N G   G  VV L+ 
Sbjct: 783 -------------------------RLDRTTVAADGTLRATVWVKNTGQRAGDEVVQLYL 817

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
                 +    K+L GF R+       +E+SF + P E L I ++ 
Sbjct: 818 HPLNPQRERARKELRGFQRITLQPGEHREVSFTITPREALRIYDEQ 863


>gi|227831319|ref|YP_002833099.1| glycoside hydrolase family protein [Sulfolobus islandicus L.S.2.15]
 gi|227457767|gb|ACP36454.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           L.S.2.15]
          Length = 754

 Score =  272 bits (696), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 209/711 (29%), Positives = 335/711 (47%), Gaps = 123/711 (17%)

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRG 180
           S T+FPQ +  A+++N  L  +I S +  + R +  + Q      +P +++ +DPRWGR 
Sbjct: 100 SSTAFPQAIGLASTWNPELVMDIASVIRSQGR-LVGVNQC----LSPVLDVCKDPRWGRC 154

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
           +ET GEDP +V++  + ++   QG+N                                L 
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQGDN-------------------------------QLV 183

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           A  KH  A+   + G   R      +  ++  +TF  PF   ++ GK   IM +Y++++G
Sbjct: 184 ATAKHFAAHGFPEGG---RNIAQVHVGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDG 240

Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
           +P      L     R EWGF G + SD D +  +           ++A   L++G+DI  
Sbjct: 241 IPCHGNPQLLTNILRQEWGFDGIVVSDYDGIRQLETIHRVASNKMEAAILALESGVDIEF 300

Query: 360 GT--CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
            T  C      +A+ +G V E  IDRA+  +  ++ RLGL + +P   +     P+ +  
Sbjct: 301 PTIDCYGEPLVNALKEGLVPESLIDRAVERVLRIKDRLGLLD-NPFVNENSV--PEKLDD 357

Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT------- 470
            + ++LAL  AR+ IVLLKN+   LPL+KN V+ +A+IGP  N+   M G YT       
Sbjct: 358 HKSRELALKTARESIVLLKNENNILPLSKN-VNKIAVIGPNANDPRNMLGDYTYTGHLNI 416

Query: 471 --GIPCSPKSLLRGLEAYV--SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG- 525
             GI     ++L+G+   V  SK  YA GC D+   S  GF EA+ IA++AD +I + G 
Sbjct: 417 DSGIEIV--TVLQGIVKKVGESKVLYAKGC-DIASESKEGFAEAIEIARQADVIIAIMGE 473

Query: 526 --------LDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
                   +D+  + E +          DR SL LPG Q  L+  + +T K P+ILVL  
Sbjct: 474 KSGLPLSWMDIPSKEEFKKYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN 532

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PL +S     + + +++   +PGE G  A+A++IFGD+NP GRLP+T +P    ++P+
Sbjct: 533 GRPLVLS--SIINYVKAVIEAWFPGEEGGNAIADVIFGDYNPSGRLPIT-FPMDTGQIPL 589

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGS 686
              N +  S      R Y     + ++ FG+GLSYT + Y  L   P E+        G 
Sbjct: 590 Y-YNRKPSSF-----RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEI--------GP 635

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
           + NI                         + I V N G ++G  VV L+      S   P
Sbjct: 636 NSNI------------------------AISIDVKNVGKMEGDDVVQLYVSKTFSSVARP 671

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            K+L GF ++H      + + F + P E L+  +   R ++  G + L++G
Sbjct: 672 VKELKGFAKIHLKPGEKRRVKF-ILPTEALAFYDSFMRLVVEKGEYQLLIG 721


>gi|380696432|ref|ZP_09861291.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 954

 Score =  271 bits (694), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 226/781 (28%), Positives = 363/781 (46%), Gaps = 132/781 (16%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + + SL +  R +SL++++T ++K++ + +      IP L +P      E++HG +    
Sbjct: 170 YMDVSLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   S  T FPQ L   A++N+ L   +   +  E  A  N  QA    W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNKKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            +D RWGR +ET GEDP++VS     ++K +Q                            
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQMGGAWIKGYQS--------------------------- 306

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
            RG    L    KH   +     G   R S +  ++E++  +    PFR  I       +
Sbjct: 307 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREIHLVPFRHAIRNYDCQSL 358

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
           M +Y+   GVP     +L Q+  R EWGF G+I SDC A+  +   ++YT K   ++A  
Sbjct: 359 MMAYSDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQ 418

Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
            L AG+  NCG T   +    A   G++  +D+D     + S   R  LF  +P K   +
Sbjct: 419 ALAAGIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLSTMFRNELFEKNPCKPLDW 478

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            K+ P    +  HK++A  AAR+ IV+L+N +  LPL+K  + ++A++GP  +++    G
Sbjct: 479 KKIYPG-WNSDSHKEMARQAARESIVMLENKENLLPLSK-TLRTIAVVGPGADDLQP--G 534

Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDA-GFHEAVRIAKKADFVI 521
            YT   +P   KS+L G+++ V   +K  Y  GC     N DA    +AV+ A ++D VI
Sbjct: 535 DYTPKLLPGQLKSVLTGIKSAVGKQTKVLYEQGCDFT--NPDATNIPKAVKTASQSDVVI 592

Query: 522 VVAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
           +V G   T E          E+ D  +L+LPG+Q  L+ +V  T K PVIL+L  G P D
Sbjct: 593 MVLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYD 651

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
           +   +A     +IL    PG+ G  A+A+++FGD+NP GRLPMT +P    ++P+   N 
Sbjct: 652 I--LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMT-FPRHVGQLPLY-YNF 707

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           +  S R+Y      +Y    +Y FG GLSYT++ Y  L           ++  ++ N+  
Sbjct: 708 KT-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEYSNL----------KIQEKANGNV-- 751

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
                                  VQ +V N G   G  V  L+      S  T   +L  
Sbjct: 752 ----------------------EVQATVKNVGSCAGDEVAQLYVTDMYASVKTRVMELKD 789

Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHS 802
           F R+H     SK +SF + P + +S+ N    R++  G   +M+G          E++HS
Sbjct: 790 FTRIHLQPGESKTVSFEMTPYD-ISLLNDRMDRVVEKGEFKIMIGGMSPDYVAKNEIKHS 848

Query: 803 L 803
           +
Sbjct: 849 V 849


>gi|397691073|ref|YP_006528327.1| beta-glucosidase [Melioribacter roseus P3M]
 gi|395812565|gb|AFN75314.1| beta-glucosidase [Melioribacter roseus P3M]
          Length = 923

 Score =  271 bits (694), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 162/448 (36%), Positives = 244/448 (54%), Gaps = 57/448 (12%)

Query: 57  TSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN 116
           T+ S   R   LISL+T +EKI+QL++ A +IPRLG+ AY +W+ESLHG+ + G      
Sbjct: 109 TTGSYKERLNDLISLMTTEEKIKQLTNQADSIPRLGLRAYNYWNESLHGVLAEG------ 162

Query: 117 GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPR 176
                 TSFPQ +   A+++  L + + +AV+ EARA+  L   GLT+W+P INI RDPR
Sbjct: 163 -----ATSFPQAIALGATWDPRLVNRVATAVSDEARALNRLYGKGLTYWSPTINIARDPR 217

Query: 177 WGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDE 236
           WGR +E+  EDP ++S   V F+K  QG                           D    
Sbjct: 218 WGRNEESYSEDPYLLSRMGVAFIKGMQG---------------------------DHPYY 250

Query: 237 LMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYN 296
           L   A  KH IA + E+     R++ ++ +  ++  + + P F+S I + +A  IM +YN
Sbjct: 251 LKTVATPKHFIANNEEE----RRHTGSSDVDMRNLYEYYLPAFKSAIVEARAYSIMGAYN 306

Query: 297 QVNGVPA----CLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
           ++N VP+     L  DL    R +WGF+GY+ SDC A+  +     + KT  ++ A  + 
Sbjct: 307 ELNHVPSNANMFLMTDLL---RRQWGFEGYVVSDCGAIHDMLYGHKFFKTGAEAVARSIL 363

Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
           AG D+NCG       + A+D+G ++EKDID AL  + S + RLG F+  P    Y  +G 
Sbjct: 364 AGCDLNCGQAYREFIKDALDEGLLREKDIDSALFRVLSARFRLGEFD-PPELVPYSSIGK 422

Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
           D + + E+++LALDAAR+ IVLLKN+   LP++K+ + S+A+IGP  N      G Y+G 
Sbjct: 423 DKLDSKENRRLALDAARKSIVLLKNN-DILPIDKSKIKSIAVIGP--NAREAQLGIYSGF 479

Query: 473 PCSPKSLLRGLEAYVS----KTHYASGC 496
           P    S L G++        +  Y  GC
Sbjct: 480 PNVLISPLEGIKNKADSLDIRVGYVKGC 507



 Score =  122 bits (306), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 83/271 (30%), Positives = 129/271 (47%), Gaps = 41/271 (15%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           F +A +IA + D VI+V G+      E+ DR  + LP  Q  LV   A  +   +++VL 
Sbjct: 658 FEKAKKIAAENDLVILVLGITPGISQEELDRKEIELPSVQRELVKQTAEVNPN-IVIVLV 716

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            GGP  V+ A A+    +I+   Y GE G +ALA+++FGD+NPGG+LP T+Y  +    P
Sbjct: 717 NGGP--VALAGAEKYAKAIVENWYNGEFGGQALADVLFGDYNPGGKLPQTFYASTEQLPP 774

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
           M+D ++  +       R+Y +     ++ FGHGLSYT + Y  L   S            
Sbjct: 775 MSDYDIINNP------RTYMYLNEQALFPFGHGLSYTTFKYDSLKIVS------------ 816

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
             N L +T +                   +Q  +TN G+ +G  VV ++A         P
Sbjct: 817 --NTLNETDT-----------------LSLQFRLTNVGNRNGDEVVQIYASCKDAKFKVP 857

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLS 777
            KQL  F R+      SK + F + P ++L+
Sbjct: 858 RKQLKRFRRLTLQTGESKVLEFKI-PVDELA 887


>gi|323344052|ref|ZP_08084278.1| beta-glucosidase [Prevotella oralis ATCC 33269]
 gi|323094781|gb|EFZ37356.1| beta-glucosidase [Prevotella oralis ATCC 33269]
          Length = 779

 Score =  271 bits (694), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 225/755 (29%), Positives = 345/755 (45%), Gaps = 142/755 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  L  AA+++  +    G  +A 
Sbjct: 128 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGLGMAATWSTDVIEQAGVIIAK 175

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R      Q G   + P +++  +PRW R +ET GEDP++    AV  VK         
Sbjct: 176 EIRL-----QGGHISYGPVLDLAHEPRWSRVEETMGEDPVLSGTIAVAQVK--------- 221

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
              G+G G              D        A  KH IAY + + G     S   +I  +
Sbjct: 222 ---GLGAG--------------DITKPFATIATLKHFIAYGIPESGQNGAPS---IIGTR 261

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           D  D F PPFR  I+ G A  +M SYN ++G+P    G L  +  RN+WGFKG++ SD  
Sbjct: 262 DLLDNFLPPFRRAIDAG-ALSVMTSYNSMDGIPCTSNGHLLTEILRNQWGFKGFVVSDLY 320

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I+   +   + +++    L+AG+D++ G         A+ +G+V E  ID A+L +
Sbjct: 321 SIDGIYGTHHTVSSLQEAGIEALRAGVDVDLGANAFALLCDAVRQGRVSEAAIDEAVLRI 380

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             +++ +GLF     +P+  K G      V T+E+ ++A   A + I LLKN  K LPL+
Sbjct: 381 LRMKIEMGLFEHPYVNPKTAKTG------VRTAENIQVAKRVAEESITLLKNSNKLLPLS 434

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTG--IPCSPKSLLRGLEAYVSKTH--YASGC--HDV 499
           KN    +A+IGP  +N   M G YT      + K++L G+ + +S +   Y  GC   D 
Sbjct: 435 KNI--KIAVIGPNADNRYNMLGDYTAPQQDSNVKTILDGIRSKLSPSQITYVKGCSIRDT 492

Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
             N      EAVR A++AD ++V  G     D     QET                E  D
Sbjct: 493 VFNE---IGEAVRAAREADVIVVAVGGSSARDFKTSYQETGAAITSSKVVSDMESGEGFD 549

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
           R SL L G Q  L+ S+  T K P++++   G PLD ++  A  Q  ++L   YPG+ G 
Sbjct: 550 RASLSLMGIQSRLLQSLKETGK-PMVVIYIEGRPLDKTW--ASEQADALLTAYYPGQEGG 606

Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
            A+A ++FGD+NP GRLP+T  P S  ++P+     R          +Y     T +Y F
Sbjct: 607 NAIANVLFGDYNPAGRLPIT-VPRSVGQLPVYYNKKRPVV------HNYVEMASTPLYPF 659

Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
           G+GLSYT++                                 DY H++ +T  +   + V
Sbjct: 660 GYGLSYTSF---------------------------------DYSHLN-ITKKSEEEYEV 685

Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
              + N+G+ DG  V  L+      S   P KQL GF R+H     +K I+  +   + L
Sbjct: 686 SFDIRNSGERDGDEVAQLYISDKVASVVQPVKQLKGFARIHLKKGETKRITL-ILKKDDL 744

Query: 777 SIANKHGRRILPLGNHVLMVG------ELRHSLTI 805
           SI +++  R++  G+  + +G       L+  LTI
Sbjct: 745 SITDRNMERVVEAGDFEIQIGSSSEDIRLKAKLTI 779


>gi|298387489|ref|ZP_06997041.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298259696|gb|EFI02568.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 950

 Score =  271 bits (693), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 225/780 (28%), Positives = 360/780 (46%), Gaps = 130/780 (16%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + + SL +  R +SL++++T ++K++ + +      IP L +P      E++HG +    
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 222

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   S  T FPQ L   A++NR L   +   +  E  A  N  QA    W+P +++
Sbjct: 223 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            +D RWGR +ET GEDP++VS     ++K +Q                            
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 302

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
            RG    L    KH   +     G   R S +  ++E++  +    PFR  I       +
Sbjct: 303 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREIHLVPFRHAIRNYDCQSL 354

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
           M +Y+   GVP     +L Q+  R EWGF G+I SDC A+  +   ++YT K   ++A  
Sbjct: 355 MMAYSDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQ 414

Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
            L AG+  NCG T   +    A   G++  +D+D     +     R  LF  +P K   +
Sbjct: 415 ALAAGIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNPCKPLDW 474

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            K+ P    +  HK++A  AAR+ IV+L+N +  LPL+K  + ++A++GP  +++    G
Sbjct: 475 KKIYPG-WNSDSHKEMARQAARESIVMLENKENLLPLSK-TLCTIAVLGPGADDLQP--G 530

Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
            YT   +P   KS+L G++  V   +K  Y  GC D     +    +AV+ A ++D VI+
Sbjct: 531 DYTPKLLPGQLKSVLTGIKGAVGKQTKVLYEQGC-DFTNPDETNIPKAVKAASQSDVVIM 589

Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           V G   T E          E+ D  +L+LPG+Q  L+ +V  T K PVIL+L  G P D+
Sbjct: 590 VLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI 648

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
              +A     +IL    PG+ G  A+A+++FGD+NP GRLPMT +P    ++P+   N +
Sbjct: 649 --LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMT-FPRHVGQLPLY-YNFK 704

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S R+Y      +Y    +Y FG GLSYT++ Y  L           ++  ++ N+   
Sbjct: 705 T-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEYSNL----------KIQEKANGNV--- 747

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                                 VQ +V N G   G  V  L+      S  T   +L  F
Sbjct: 748 ---------------------EVQATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDF 786

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHSL 803
            R+H     SK +SF + P + +S+ N    R++  G   +MVG          E++HS+
Sbjct: 787 ARIHLQPGESKTVSFEMTPYD-ISLLNDRMDRVVEKGEFKIMVGGMSPDYVAKNEIKHSV 845


>gi|325929067|ref|ZP_08190221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
 gi|325540562|gb|EGD12150.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
          Length = 850

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 157/444 (35%), Positives = 237/444 (53%), Gaps = 45/444 (10%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           +TL+EK  Q+ + A AIPRLG+PAY+WW+E+LHG+A  G            T FPQ +  
Sbjct: 1   MTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GATVFPQAIGM 50

Query: 132 AASFNRSLWSNIGSAVAVEARAMYN--LGQ------AGLTFWAPNINIFRDPRWGRGQET 183
           AA+F+  L   + +A++ EARA ++  L Q       GLTFW+PNINIFRDPRWGRGQET
Sbjct: 51  AATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRDPRWGRGQET 110

Query: 184 PGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACC 243
            GEDP + +   V FV+  QGE   +     G  +R+                  L A  
Sbjct: 111 YGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEPYRK------------------LDATA 152

Query: 244 KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA 303
           KH   +   +     R+ F+A  +++D  +T+ P F + ++ GK   +M +YN+V G  A
Sbjct: 153 KHFAVHSGPE---ADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 209

Query: 304 CLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC 362
                L Q   R +WGFKGY+ SDC A+  I+++     T E +AA  +K G ++ CG  
Sbjct: 210 SASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVATREQAAALAVKHGTELECGE- 268

Query: 363 MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKK 422
                 +A+ +G + E  ID AL  L + ++RLG+F+  P +  +  +      +  H  
Sbjct: 269 EYSTLPAAVRQGLIDEAQIDTALTTLMTARMRLGMFD-PPGQLPWSTIPASVNQSPAHDA 327

Query: 423 LALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRG 482
           LA   AR+ +VLLKND   LPL++  +  +A+IGP  ++   + G Y G P +P ++L+G
Sbjct: 328 LARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQG 386

Query: 483 LEAYV--SKTHYASGCHDVPCNSD 504
           + A    ++  YA G   V    D
Sbjct: 387 IRAAAPNAQVLYARGADLVEGRDD 410



 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 93/282 (32%), Positives = 131/282 (46%), Gaps = 52/282 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             EA+ +A+ AD V+ V GL    E E+           DR  L LP  Q  L+ ++  T
Sbjct: 574 LQEALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQAT 633

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV+ VLT G  L + +A+    + +IL   YPG+ G  A+A+ +FGD NPGGRLP+T
Sbjct: 634 GK-PVVAVLTTGSALAIDWAQ--QHLPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVT 690

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y ES T    +D  MR        GR+YR++ GT +Y FGHGLSYT ++Y         
Sbjct: 691 FYKESETLPAFDDYAMR--------GRTYRYFGGTPLYPFGHGLSYTQFAY--------- 733

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                            +G RLD   I    S T+      ++V N G   G  VV L+ 
Sbjct: 734 -----------------SGLRLDRTTIAADGSLTAT-----VTVKNTGQRAGDEVVQLYL 771

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
                 +    K+L GF R+       + + F +D    L I
Sbjct: 772 HPLTPQRERAGKELHGFQRIALQPGEQRALHFTLDAKNALRI 813


>gi|29347190|ref|NP_810693.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339089|gb|AAO76887.1| periplasmic beta-glucosidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 950

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 225/780 (28%), Positives = 359/780 (46%), Gaps = 130/780 (16%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + + SL +  R +SL++++T ++K++ + +      IP L +P      E++HG +    
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 222

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   S  T FPQ L   A++NR L   +   +  E  A  N  QA    W+P +++
Sbjct: 223 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            +D RWGR +ET GEDP++VS     ++K +Q                            
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 302

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
            RG    L    KH   +     G   R S +  ++E++  +    PFR  I       +
Sbjct: 303 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREIHLVPFRHAIRNYDCQSL 354

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
           M +Y+   GVP     +L Q+  R EWGF G+I SDC A+  +   ++YT K   ++A  
Sbjct: 355 MMAYSDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQ 414

Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
            L AG+  NCG T   +    A   G++  +D+D     +     R  LF  +P K   +
Sbjct: 415 ALAAGIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNPCKPLDW 474

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            K+ P    +  HK++A  AAR+ IV+L+N    LPL+K  + ++A++GP  +++    G
Sbjct: 475 KKIYPG-WNSDSHKEMARQAARESIVMLENKDNLLPLSK-TLRTIAVLGPGADDLQP--G 530

Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
            YT   +P   KS+L G++  V   +K  Y  GC D     +    +AV+ A ++D VI+
Sbjct: 531 DYTPKLLPGQLKSVLTGIKGAVGKQTKVLYEQGC-DFTNPDETNIPKAVKAASQSDVVIM 589

Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           V G   T E          E+ D  +L+LPG+Q  L+ +V  T K PVIL+L  G P D+
Sbjct: 590 VLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI 648

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
              +A     +IL    PG+ G  A+A+++FGD+NP GRLPMT +P    ++P+   N +
Sbjct: 649 --LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMT-FPRHVGQLPLY-YNFK 704

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S R+Y      +Y    +Y FG GLSYT++ Y  L           ++  ++ N+   
Sbjct: 705 T-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEYSNL----------KIQEKANGNV--- 747

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                                 VQ +V N G   G  V  L+      S  T   +L  F
Sbjct: 748 ---------------------EVQATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDF 786

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHSL 803
            R+H     SK +SF + P + +S+ N    R++  G   +MVG          E++HS+
Sbjct: 787 ARIHLQPGESKTVSFEMTPYD-ISLLNDRMDRVVEKGEFKIMVGGMSPDYVAKNEIKHSV 845


>gi|399025517|ref|ZP_10727513.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
 gi|398077894|gb|EJL68841.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
          Length = 875

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 164/474 (34%), Positives = 238/474 (50%), Gaps = 61/474 (12%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N +L +  R ++L+ LLT+ EKI  + DN+ A+PRL IPAY WW+E+LHG+A  G 
Sbjct: 23  YPFRNPNLPVEQRIENLLGLLTVDEKIGMMMDNSKAVPRLEIPAYGWWNEALHGVARAG- 81

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG---------QAGL 162
                    + T FPQ +  AA+++          ++ EARA YN             GL
Sbjct: 82  ---------TATVFPQAIGMAAAWDVPEHLKTFEMISDEARAKYNKSFDEASKTGRYEGL 132

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           TFW PNINIFRDPRWGRGQET GEDP + S   V  VK  QG + K              
Sbjct: 133 TFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGNDPKY------------- 179

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
                             AC KH   +   +W   +R+S+NA ++++D  +T+ P F+S 
Sbjct: 180 --------------FKTHACAKHFAVHSGPEW---NRHSYNAEVSKRDLYETYLPAFKSL 222

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           + +G    +MC+YN  +G P C    L  +  R +W + G + SDC A+A  ++ + Y  
Sbjct: 223 VLEGNVREVMCAYNAFDGQPCCASNTLLNEILRGKWKYDGMVVSDCWALADFYQ-EKYHG 281

Query: 342 THED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH D   +AA  LK   D+ CG     +   ++  G + EKDID ++  +      LG+ 
Sbjct: 282 THPDEKSTAADALKHSTDLECGDT-YNNLNKSLAGGLITEKDIDISMRRILKGWFELGML 340

Query: 399 NGDPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
             DP+        P  V  S EHKK AL  A++ IVL+KN+   LP NKN +  +A++GP
Sbjct: 341 --DPKSSVLWNQIPYSVVDSDEHKKQALKMAQKSIVLMKNENNILPFNKN-IKKIAVVGP 397

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHE 509
             ++     G Y G P S  ++L G++A    T   Y  G      +S A  ++
Sbjct: 398 NADDEMMQLGNYNGTPSSIVTILEGIKAKFPNTEIIYEKGSEVADPSSRASLYQ 451



 Score = 95.9 bits (237), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 78/297 (26%), Positives = 121/297 (40%), Gaps = 47/297 (15%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           +  K  Y S   +V       F       K AD ++   GL  + E E+           
Sbjct: 571 HTQKGKYVSISFEVYRKDPVNFASVKEKVKDADVIVFAGGLSPSLEGEEMLVNAEGFKGG 630

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           D+ S+ LP  Q  L+  + +T K PV+ VL  G  L +   E +  +    W G  G++G
Sbjct: 631 DKTSIELPKVQRELLAELRKTGK-PVVFVLCTGSSLGLEQDEKNYDVLLNAWYG--GQSG 687

Query: 596 AKALAEIIFGDFNPGGRLPMTWYP--ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
             A+A+++ GD+NP GRLP+T+Y   E            +   +    GR+YR+ T   +
Sbjct: 688 GTAVADVLAGDYNPSGRLPVTFYKNLEQLDNALSKTSKHQGFENYDMQGRTYRYMTENPL 747

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           Y FGHGLSY+ ++Y                 G+ K          D +            
Sbjct: 748 YAFGHGLSYSKFNY-----------------GNAKLSKNSISPNEDII------------ 778

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
             + + VTN  D DG  VV ++ +        P K L  F+RV   +K +K I   +
Sbjct: 779 --ITVPVTNISDRDGEEVVQVYVKRNN-DVLAPVKTLRAFERVLIRSKETKNIQLTI 832


>gi|441498970|ref|ZP_20981160.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
 gi|441437215|gb|ELR70569.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
          Length = 752

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 217/689 (31%), Positives = 335/689 (48%), Gaps = 117/689 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   + ++ +HG                 T FP  L SAAS++           A+
Sbjct: 104 RLGIPLL-FGADVIHGFK---------------TVFPIPLASAASWDLEAIEKAERVAAI 147

Query: 150 EARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           E+ A      AG+ F +AP ++I RDPRWGR  E  GEDP + S  A   V+ FQ     
Sbjct: 148 ESTA------AGINFNFAPMVDISRDPRWGRIAEGAGEDPFLGSEVAKARVRGFQ----- 196

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-IT 267
                                E    D   ++AC KH  AY     G      +N + ++
Sbjct: 197 ---------------------EQSLTDPQTMAACVKHFAAYGAPDGGR----DYNTVDMS 231

Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSD 326
           E+   + + PP+++ I+ G A+ IM S+N++NG+ A     L +   R EWGFKG + SD
Sbjct: 232 ERLLREMYLPPYKAGIDAG-AATIMTSFNELNGIAASGSQFLLRDILRKEWGFKGMVVSD 290

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRAL 385
             +V  +  +      + ++A   LKAG+D++  G   L      +++GK+  K +D A+
Sbjct: 291 WQSVNEMVAH-GNAANNAEAAMMALKAGVDMDMMGDVYLEEVPRLVNEGKLDIKFVDEAV 349

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
            N+  ++  LGLF+ DP +        +++   EH + A D A++ IVLLKN +K LPL 
Sbjct: 350 RNVLKLKYDLGLFD-DPYRYSDTIREKNNIRAVEHLEAARDVAKKSIVLLKNKEKLLPLK 408

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYVS---KTHYASGCHDVP 500
           K ++ ++A+IGPL +N + M G ++  G    P + L+G++  VS   +  YA GC+   
Sbjct: 409 K-SIGTIAVIGPLADNQADMNGTWSFFGEAQHPITFLQGIKDAVSGQSRVLYAEGCNLYD 467

Query: 501 CNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP 560
            + D  F EAV IAKKAD VI+  G       E   R  + LPG Q  LV  +A+T K P
Sbjct: 468 RSKDK-FAEAVNIAKKADVVILAVGESAVMNGEAGSRSDIRLPGIQPELVMEIAKTGK-P 525

Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
           V+ ++  G PLD+S+   D  I +IL +   G     A A+++FGD+NP G+LP+T +P 
Sbjct: 526 VVALVMSGRPLDLSW--LDENIPAILEVWTLGSEAGNAAADVLFGDYNPSGKLPVT-FPR 582

Query: 621 SFTKVPMNDMNMRADSSRQYPG------------RSYRFYTGTQVYGFGHGLSYTNYSYK 668
           +  +VP+       ++ R Y G              YR    + +Y FG+GLSY+ + Y 
Sbjct: 583 NVGQVPI--YYNHKNTGRPYEGDYSEPLSERIYRSKYRDVQNSPLYPFGYGLSYSTFEY- 639

Query: 669 FLSAPSELTISA-SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
                S++T+SA +L AG                  + +T+         +S+TN G  D
Sbjct: 640 -----SDITLSADTLNAG------------------ESITAS--------VSITNEGPYD 668

Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRV 756
           G  VV L+ R    S   P K+L GF ++
Sbjct: 669 GEEVVQLYIRDLVGSVTRPVKELKGFKKL 697


>gi|198425902|ref|XP_002120563.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 996

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 198/594 (33%), Positives = 294/594 (49%), Gaps = 81/594 (13%)

Query: 64  RAKSLISLLTLQEKIQQLS-----DN--ASAIPRLGIPAYEWWSESLHGIASNGPGVNFN 116
           R+ S +S LT++E + Q+S     DN  A AI RLGI  Y+W +E L G A NG      
Sbjct: 450 RSFSEVSRLTVKELVLQISRGGAGDNGPAPAITRLGIGPYQWNTECLRGYAMNG------ 503

Query: 117 GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL--------GQAGLTFWAPN 168
                 T FPQ +  AA+F++ L   +   +A+EARA +N            GL+ ++P 
Sbjct: 504 ----DATCFPQPIGLAATFDQGLIYKLAKTIALEARAKHNNFTKNGNFGDHTGLSCFSPV 559

Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
           INI R P WGR QET GEDP++ S  A  +V   QG                        
Sbjct: 560 INILRHPLWGRNQETFGEDPVMSSLMARAYVTGLQG------------------------ 595

Query: 229 EESDRGDELML--SACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQG 286
                 DE+    +A CKH  AYD  +    SR SFNA ++ +D   T+ P FR C+  G
Sbjct: 596 ------DEIYYPATANCKHFAAYDGPENIPSSRLSFNANVSIEDLGRTYFPAFRECVHSG 649

Query: 287 KASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDS 346
            A  I+CSYN +NG PAC    L    R+++ FKGY++SD  A+     Y  YTK++  S
Sbjct: 650 -AFGIVCSYNAINGEPACASSYLQTILRDKFNFKGYVSSDESAIEFFDIYFKYTKSNLLS 708

Query: 347 AAGVLKAGMDINCGTCMLRHTQS----AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
           A     AG+D+   +    +  S    A+++G V E  + R+   LF  ++ LG F  DP
Sbjct: 709 AVAAFDAGVDLELTSYGKNNRYSLLNQAVEQGLVTEAALRRSAKRLFRTRMALGEF--DP 766

Query: 403 RKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           ++  +    P DV  S  H+K A++ A +  VLLKND   LPLN +   ++AI+GP +NN
Sbjct: 767 QEFNHWLNVPIDVVQSLAHRKQAVEVAAKSFVLLKND-GVLPLN-HRYENVAIVGPFINN 824

Query: 462 ISQMGGGY-----TGIPCSPKSLLRGLEAYVSKTHYASGC-----HDVPCNSDAGFHEAV 511
              + G Y          SP      L +      + +GC      ++P  +        
Sbjct: 825 SEALTGDYHPNYNLKYFSSPLFAANSLSSS-GVARFTTGCVGTNNQNLPICATYNSTHVK 883

Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
            +   +D V+V  G     E E RDR+S+ LPG+Q+ ++  V + +  PVI+VL   GPL
Sbjct: 884 EVVTGSDIVLVTLGTGTGIEAEGRDRLSMELPGKQLDMIKDVVKYANGPVIVVLFNAGPL 943

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKV 625
           DVS+   ++  ++++   +  +   +A+ E++ G  NP GRLP TW P S  +V
Sbjct: 944 DVSWVMGNT--AAVIACHFSAQMTGEAMLEVLTGIVNPAGRLPNTW-PASMQQV 994


>gi|189464325|ref|ZP_03013110.1| hypothetical protein BACINT_00666 [Bacteroides intestinalis DSM
           17393]
 gi|189438115|gb|EDV07100.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 935

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 220/767 (28%), Positives = 353/767 (46%), Gaps = 120/767 (15%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIAS 108
           S  + + +L +  R +SL+S++T ++K++ + +      IP L +P      E++HG + 
Sbjct: 148 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY 206

Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPN 168
                      S  T FPQ L   A++N+ L   +  AV  E      L    +  W+P 
Sbjct: 207 G----------SGATIFPQALAMGATWNKKLTEEVAMAVGDET-----LSAGTMQAWSPV 251

Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
           +++ +D RWGR +ET GEDP++VS     ++K +Q                         
Sbjct: 252 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS------------------------ 287

Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
                   + L    KH   +     G   R S +  ++E++  +    PFR  I     
Sbjct: 288 --------MGLYTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVIRNYDC 336

Query: 289 SCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSA 347
             +M +Y+   GVP     +L     R EWGF G+I SDC A+  +   ++YT  ++  A
Sbjct: 337 QSLMMAYSDFLGVPVAKSRELLHNILREEWGFSGFIVSDCGAIGNLTARKHYTAKNKIEA 396

Query: 348 AG-VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           A   L AG+  NCG T   +    A   G++  +++D     +  +  R  LF   P K 
Sbjct: 397 ANQALAAGIATNCGDTYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKAPNKP 456

Query: 406 -KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             + K+ P    +  HK++A  AAR+ IVLL+N    LPL+K+ + ++A++GP  N++  
Sbjct: 457 LDWNKIYPG-WNSDSHKEMARQAARESIVLLENKDNILPLSKD-MRTIAVLGPGANDLQP 514

Query: 465 MGGGYTGI--PCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
             G YT    P   KS+L G++  V   +K  Y  GC D     +    +AV++A ++D 
Sbjct: 515 --GDYTPKLQPGQLKSVLTGIKQAVGKQTKVIYEQGC-DFTSLGENNIAKAVKVASQSDV 571

Query: 520 VIVVAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           V++V G   T E          E+ D  +L+LPG+Q  L+ +V  T K PVIL+L  G P
Sbjct: 572 VLLVLGDCSTSEATTDVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILILQAGRP 630

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDM 630
            ++S A    +   + W+  PG+ G  A A+++FGD+NP GRLPMT +P    ++P+   
Sbjct: 631 YNLSKASELCKAILVNWL--PGQEGGPATADVLFGDYNPAGRLPMT-FPRHVGQLPLY-Y 686

Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
           N +  S R+Y      +Y    +Y FG+GLSYT++ Y  L           ++   + NI
Sbjct: 687 NFKT-SGRRYEYSDMEYYP---LYYFGYGLSYTSFEYSGL----------KIQEKENGNI 732

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
                                    VQ +V N G   G  VV L+      S  T   +L
Sbjct: 733 T------------------------VQATVKNIGQRAGDEVVQLYVTDMYASVKTRITEL 768

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             F R+H     +K +SF + P E LS+ N H  R++  G   ++VG
Sbjct: 769 KDFTRIHLKPGEAKTVSFELTPYE-LSLLNDHMDRVVEKGAFKILVG 814


>gi|393782428|ref|ZP_10370612.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673256|gb|EIY66719.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
           CL02T12C01]
          Length = 596

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 193/650 (29%), Positives = 310/650 (47%), Gaps = 93/650 (14%)

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           +T+W+PN+NIFRDPRWGRGQET GEDP + +     +V+  QG               + 
Sbjct: 1   MTYWSPNVNIFRDPRWGRGQETYGEDPYLTAEIGKAYVRGLQG--------------NDP 46

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
             LK              +AC KH   Y +       R+ FNA  +++D  +T+ P F +
Sbjct: 47  FFLKA-------------AACAKH---YAVHSGPEALRHEFNASPSKRDLFETYLPAFEA 90

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITSDCDAVATIFEYQN 338
            +++ K   +M +YN+V G  A   G  F      R +WGFKG++ SDC AV  I+    
Sbjct: 91  LVKEAKVEAVMGAYNRVYGESAS--GSFFLLTDILRKKWGFKGHVVSDCGAVDDIYGGHK 148

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
             K   +++A  LK+G+++NCG       + A+++  + E D+D AL+ L   +L+LG  
Sbjct: 149 IAKDVAEASAIALKSGLNLNCGGS-FHALKEALERKLITEVDLDNALMPLMMTRLKLGNL 207

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
             D  +  Y  +    + +  H  +A + A++ +VLLKN+   LPL K+ V ++ + GP 
Sbjct: 208 TDD-DESPYKNISDSVIASYTHAMVAREVAQKSMVLLKNNNHTLPLKKD-VKTIFVTGPY 265

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCN---SDAGFHEAVR 512
             +   M G Y G+     + L+G+ A VS     +Y  G      N   +D    E VR
Sbjct: 266 AADTYVMMGNYYGVSPRSNTFLQGIAAKVSGGTSINYKIGILPTTPNMNPADWTVGE-VR 324

Query: 513 IAKKADFVIVVAGLDLTQE------TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
            A+ A  VI ++G+D  +E      +   D+ +L LP  Q+  +  ++R     ++ V+T
Sbjct: 325 AAEVAIVVIGLSGIDEGEEGDAIASSHRGDKQNLKLPEHQLKFLRDISRNRWNKLVTVIT 384

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
           GG P+D+      S    + W  YPG+ G  AL +++FGD +  GR+P+T+        P
Sbjct: 385 GGSPIDLEEVSELSDAVIMAW--YPGQEGGMALGDLLFGDVSFSGRMPVTF--------P 434

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
           +N   + A       GR+Y++ T   +Y FG+GL+Y + SY                  S
Sbjct: 435 INSDWLPAFEDYNMQGRTYKYMTDNIMYPFGYGLTYGDVSY------------------S 476

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
           D  IL        Y    E+        HVQ ++ N G+ +   VV L+   P     TP
Sbjct: 477 DVKILNPK-----YDGKQEI--------HVQATLRNNGNNEVEEVVQLYLSAPGAGVITP 523

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
              LIGF RV   +  S+ + F + P +QL +  + G + L  G + ++V
Sbjct: 524 ISSLIGFKRVTLESHLSQTVEFIIKP-DQLKMVMEDGSKNLLKGKYTIIV 572


>gi|383125188|ref|ZP_09945842.1| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
 gi|382983435|gb|EES66611.2| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
          Length = 954

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 225/780 (28%), Positives = 359/780 (46%), Gaps = 130/780 (16%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + + SL +  R +SL++++T ++K++ + +      IP L +P      E++HG +    
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   S  T FPQ L   A++NR L   +   +  E  A  N  QA    W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            +D RWGR +ET GEDP++VS     ++K +Q                            
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 306

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
            RG    L    KH   +     G   R S +  ++E++  +    PFR  I       +
Sbjct: 307 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREIHLVPFRHAIRNYDCQSL 358

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
           M +Y+   GVP     +L Q+  R EWGF G+I SDC A+  +   ++YT K   ++A  
Sbjct: 359 MMAYSDYMGVPVAKSKELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQ 418

Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
            L AG+  NCG T   +    A   G++  +D+D     +     R  LF  +P K   +
Sbjct: 419 ALAAGIATNCGDTYNNKEVIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNPCKPLDW 478

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            K+ P    +  HK++A  AAR+ IV+L+N    LPL+K  + ++A++GP  +++    G
Sbjct: 479 KKIYPG-WNSDSHKEMARQAARESIVMLENKDNLLPLSK-TLRTIAVLGPGADDLQP--G 534

Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
            YT   +P   KS+L G++  V   +K  Y  GC D     +    +AV+ A ++D VI+
Sbjct: 535 DYTPKLLPGQLKSVLTGIKGAVGKQTKVLYEQGC-DFTNPDETNIPKAVKAASQSDVVIM 593

Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           V G   T E          E+ D  +L+LPG+Q  L+ +V  T K PVIL+L  G P D+
Sbjct: 594 VLGDCSTSEATNDVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDI 652

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
              +A     +IL    PG+ G  A+A+++FGD+NP GRLPMT +P    ++P+   N +
Sbjct: 653 --LKASEMCKAILVNWLPGQEGGPAMADVLFGDYNPAGRLPMT-FPRHVGQLPLY-YNFK 708

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S R+Y      +Y    +Y FG GLSYT++ Y  L           ++  ++ N+   
Sbjct: 709 T-SGRRYEYVDMEYYP---LYRFGFGLSYTSFEYSNL----------KIQEKANGNV--- 751

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                                 VQ +V N G   G  V  L+      S  T   +L  F
Sbjct: 752 ---------------------EVQATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDF 790

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHSL 803
            R+H     SK +SF + P + +S+ N    R++  G   +MVG          E++HS+
Sbjct: 791 ARIHLQPGESKTVSFEMTPYD-ISLLNDRMDRVVEKGEFKIMVGGMSPDYVAKNEIKHSV 849


>gi|116312033|emb|CAJ86398.1| OSIGBa0125M19.1 [Oryza sativa Indica Group]
          Length = 247

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 139/237 (58%), Positives = 167/237 (70%), Gaps = 28/237 (11%)

Query: 43  PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSES 102
           PC  P   +YPFCN +L    RA++L+SLLTL EKI QLS+ A+  PRLG+P +EWWSES
Sbjct: 29  PCASPAASAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSES 88

Query: 103 LHGIASNGPGVNFN-GTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG 161
           LHG+  NGPGVNF+ G V S T FPQV++SAA+FNRSLW     A+AVEARAM+N GQAG
Sbjct: 89  LHGVCDNGPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAG 148

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LTFWAPNIN+FRDPRWGRGQETPGEDP VVSAY+VE+VK FQ                  
Sbjct: 149 LTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQ------------------ 190

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE--QDTEDTFQ 276
              + +GEE      +MLSACCKH IAYDLEKW  F+RY+FNA +      ++ T+Q
Sbjct: 191 ---RDYGEEG----RMMLSACCKHYIAYDLEKWRGFTRYTFNAKVMNVLVPSQSTYQ 240


>gi|86143269|ref|ZP_01061671.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
 gi|85830174|gb|EAQ48634.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
          Length = 873

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 160/457 (35%), Positives = 237/457 (51%), Gaps = 56/457 (12%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           +PF N  L + TR   L+S +TL+EKI QL  +A AI RL IP Y WW+ESLHG+A  G 
Sbjct: 24  FPFQNEQLDLETRLNDLVSRMTLEEKISQLMSDAPAIERLNIPKYNWWNESLHGVARAG- 82

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA---------MYNLGQAGL 162
                      T FPQ +  AAS++  L   + +A++ EARA          +++ Q GL
Sbjct: 83  ---------YATVFPQSISIAASWDAQLVREVATAISDEARAKHHEYLRRDQHDIYQ-GL 132

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           T W+PNINIFRDPRWGRG ET GEDP +      ++VK  QG                  
Sbjct: 133 TMWSPNINIFRDPRWGRGHETYGEDPFLTGTLGAQYVKGLQG------------------ 174

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
                    D  + L + A  KH   +   +    SR+ F+A  +E+D  +T+ P FR  
Sbjct: 175 ---------DDPEYLKVVATAKHFAVHSGPE---ESRHYFDANTSERDLWETYLPAFRML 222

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++  +   +M +YN+  G  A     LF   RN+WGF GY+ SDC A+  I+E    T  
Sbjct: 223 VKDAQVQSVMTAYNRFRGEAASSNKLLFDILRNKWGFDGYVVSDCGAINDIWEDHKITAD 282

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
              ++A  L+ G D+NCG    +  + AI  G + E+ I+ A+  LF  +L+LG+F+ + 
Sbjct: 283 AASASALALETGTDLNCG-ATYKSLKEAIANGLITEEKINIAIERLFRARLKLGMFDTE- 340

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
               Y  +       + H  LA  AA++ IVLLKN+   LPL+K+ +  +A+IGP  +N+
Sbjct: 341 ENLSYATIPFSVNTNASHTALARKAAQESIVLLKNEAHMLPLSKD-LKQIAVIGPNAHNV 399

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGC 496
             + G Y G P +P ++++G+   V+   K  Y  G 
Sbjct: 400 QSLWGNYNGTPKNPVTVVQGIRNKVAPQVKVVYEEGA 436



 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 97/302 (32%), Positives = 154/302 (50%), Gaps = 56/302 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
              AV +A+ +D  I+V GL+   E E+           DR +L LP +Q  L+ ++  T
Sbjct: 589 LERAVNLAEDSDVTILVLGLNERLEGEEMRIDVEGFSKGDRTALDLPLEQRELMRALVAT 648

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K P++LVL  G  L +++A+    + +IL  GYPG+ G  A+A+++FGD+NP GRLP+T
Sbjct: 649 GK-PIVLVLLNGSALAINYAQ--EHVPAILSAGYPGQEGGNAIADVLFGDYNPAGRLPVT 705

Query: 617 WYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
           +Y +S   +P   D +M+        GR+YR++ G  +Y FG+GLSYT +SY        
Sbjct: 706 YY-KSVDDLPDFEDYSMK--------GRTYRYFEGEALYPFGYGLSYTQFSYD------- 749

Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
             I  S +  +DK +                        +VQ++VTN+GD DG  VV L+
Sbjct: 750 -AIKTSGRLAADKVL------------------------NVQVTVTNSGDRDGDEVVQLY 784

Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLM 795
            +    S   P+ QL+GF R+H     ++ + F +D   Q S+ N   + ++  G   L 
Sbjct: 785 LKDEVASTTRPQVQLVGFKRIHLQKGETQTVEFRLD-ARQFSMINDQEQLVVEPGWFTLY 843

Query: 796 VG 797
            G
Sbjct: 844 AG 845


>gi|427384392|ref|ZP_18880897.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727653|gb|EKU90512.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
           12058]
          Length = 954

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 219/764 (28%), Positives = 354/764 (46%), Gaps = 120/764 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + + +L +  R +SL+S++T ++K++ + +      IP L +P      E++HG +    
Sbjct: 170 YMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   S  T FPQ L   A++N+ L   I  AV  E      L    +  W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNKKLTEEIAMAVGDET-----LAAGTMQAWSPVLDV 273

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            +D RWGR +ET GEDP++VS     ++K +Q +                          
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSKG------------------------- 308

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                  L    KH   +     G   R S +  ++E++  +    PFR  I       +
Sbjct: 309 -------LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVIRNYDCQSL 358

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
           M +Y+   GVP     +L     R EWGF G+I SDC A+  +   ++YT K   ++A  
Sbjct: 359 MMAYSDFLGVPVAKSKELLHNILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQ 418

Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
            L AG+  NCG T   +    A   G++  +++D     +  +  R  LF   P K   +
Sbjct: 419 ALAAGIATNCGDTYNDKEVIQAAKDGRLNMENLDNVCRTMLRMMFRNELFEKAPNKPLDW 478

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            K+ P    +  HK++A  AAR+ IV+L+N +  LPL+K  + S+A++GP  +++    G
Sbjct: 479 NKIYPG-WNSDNHKEMARQAARESIVMLENKENILPLDK-GIRSIAVLGPGADDLQP--G 534

Query: 468 GYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
            YT   +P   KS+L G++  V   +K  Y  GC D    S+    +AV+ A ++D V++
Sbjct: 535 DYTPKLLPGQLKSVLTGIKQAVGKQTKVIYEQGC-DFTNLSETNIPKAVKAASQSDVVVM 593

Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           V G   T E          E+ D  +L+LPG+Q  L+ +V  T K PVILVL  G P ++
Sbjct: 594 VLGDCSTSEATTDVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILVLQAGRPYNL 652

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
           + A    +   + W+  PG+ G  A A+++FGD+NP GRLPMT +P+   ++P+   N +
Sbjct: 653 TKASKLCKAIIVNWL--PGQEGGPATADVLFGDYNPAGRLPMT-FPQHVGQLPLY-YNFK 708

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S R+Y      +Y    +Y FG+GLSYT++ Y  L           ++   + NI   
Sbjct: 709 T-SGRRYEYSDLEYYP---LYYFGYGLSYTSFEYSGL----------KVQEKDNGNIT-- 752

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                                 VQ +V N G   G  VV L+      S  T   +L  F
Sbjct: 753 ----------------------VQATVKNVGQRAGDEVVQLYVTDMYASVKTRITELKDF 790

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            R++     SK +SF + P + LS+ N H  R++  G   ++VG
Sbjct: 791 TRINLKPGESKTVSFELTPYD-LSLLNDHMDRVVEKGEFKILVG 833


>gi|380694609|ref|ZP_09859468.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
           [Bacteroides faecis MAJ27]
          Length = 804

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 223/753 (29%), Positives = 339/753 (45%), Gaps = 137/753 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  +A+++ +L   +G A+A 
Sbjct: 142 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGMSATWSPTLIEEVGKAIAK 189

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R+     Q     + P +++ RDPRW R +ET GEDP++        V          
Sbjct: 190 EIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVT--------- 235

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
              G+G G              D   E    A  KH +AY + + G    Y   A +  +
Sbjct: 236 ---GLGSG--------------DLSREHATIATLKHFLAYAVPEGGQNGNY---ASVGAR 275

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
           D  + F PPFR  IE G A  +M SYN ++G+P      L  Q  RNEW F+G++ SD  
Sbjct: 276 DLHENFLPPFREAIEAG-ALSVMTSYNSIDGIPCTANHYLLTQLLRNEWKFRGFVVSDLY 334

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I E      T E++A   L AG+DI+ G     +   A+  GK+ E  I+ A+  +
Sbjct: 335 SIEGIHESHFVASTMEEAAVQALSAGVDIDLGGDAFMNLLQAVRSGKLDETQINAAVDRI 394

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     +P      K     V   EH KLA   A+  +VLL+N    LPL+
Sbjct: 395 LRMKFEMGLFEHPYVNP------KTTTKMVRNKEHVKLARKVAQSSVVLLENKNSILPLS 448

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK-----THYASGC--HD 498
           K  +  +A++GP  +N   M G YT  P   K +   L+  +SK       Y  GC   D
Sbjct: 449 KK-IKRVAVVGPNADNRYNMLGDYTA-PQEDKDIRTVLDGVISKLSPSRVEYVRGCAIRD 506

Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDR 535
              N  A   EAV  A +++ +I V G     D     QET                E  
Sbjct: 507 TTVNEIA---EAVEAAHRSEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGF 563

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR +L L G+Q  L+ ++ +T+ +P+I+V   G PLD  +A   +   ++L   YPG+AG
Sbjct: 564 DRATLTLLGKQQDLLNAL-KTTGKPLIVVYIEGRPLDKVWASECAD--ALLTASYPGQAG 620

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A+++FGD+NP GRLP++  P S  ++P+   N +A  +  Y   +      + +YG
Sbjct: 621 GDAIADVLFGDYNPAGRLPVS-VPRSVGQIPVY-YNKKAPRNHDYVEMA-----ASPLYG 673

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT + Y                  SD  I Q+             + C    F 
Sbjct: 674 FGYGLSYTTFEY------------------SDLQITQK-------------SPC---HFE 699

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V   V N G+ DG  V  L+ +    S   P KQL  F+R        KEI F +   + 
Sbjct: 700 VSFKVKNTGNYDGEEVAQLYLKDEYASVVQPLKQLKHFERFFLRKGEEKEILFTLTE-KD 758

Query: 776 LSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
           LSI ++  +R++  G+  +M+G     + + T+
Sbjct: 759 LSIIDRSMKRVVETGDFRIMIGASSDDIRLTTH 791


>gi|150007848|ref|YP_001302591.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
           8503]
 gi|301310124|ref|ZP_07216063.1| beta-glucosidase [Bacteroides sp. 20_3]
 gi|423336365|ref|ZP_17314112.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
           CL09T03C24]
 gi|149936272|gb|ABR42969.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Parabacteroides distasonis ATCC 8503]
 gi|300831698|gb|EFK62329.1| beta-glucosidase [Bacteroides sp. 20_3]
 gi|409240840|gb|EKN33614.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
           CL09T03C24]
          Length = 868

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 162/471 (34%), Positives = 228/471 (48%), Gaps = 61/471 (12%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N  L +  R   L+S LT +EKI Q+ +   AI RLGIP Y+WW+E+LHG+A  G 
Sbjct: 26  YPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG- 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
                      T FPQ +  AA+F+ +      + V+ EARA Y+  Q         GLT
Sbjct: 85  ---------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLT 135

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW PNINIFRDPRWGRG ET GEDP +     V   +  QG                   
Sbjct: 136 FWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG------------------- 176

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D  +     AC KH   +   +W   +R+ FNA  T +D  +T+ P F + +
Sbjct: 177 --------DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++G    +MC+YN+  G P C    L     RN WG+   I SDC A+   +     T  
Sbjct: 226 KEGDVQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPR 285

Query: 343 H------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           H      E ++A  +  G D+ CG    R    A+  GK+ EKD+D +L  L   +  LG
Sbjct: 286 HETHPDAESASADAVLNGTDLECGGS-YRALNKALADGKISEKDLDVSLRRLLKGRFELG 344

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
           +F+ D R   Y K+    V + EH   ALD AR+ IVLLKN    LPL+KN +  +A++G
Sbjct: 345 MFDPDERV-PYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVG 402

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           P   + + +   Y G P    +++ G+    +K   A   +++ CN  A F
Sbjct: 403 PNAADSTMLWANYNGFPTKTVTIVEGIR---NKVPNAEVIYELGCNHTADF 450



 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 138/298 (46%), Gaps = 55/298 (18%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           Y+ +T  A     V   +   F       K AD ++ V G+    E E+           
Sbjct: 574 YMQRTGSADLNFTVGVRTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKG 633

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR ++ +P  Q  +V ++  T K PV+ V+  G  L +++   +  +++IL   Y G+ G
Sbjct: 634 DRTNIEIPAVQKEMVKALVATGK-PVVYVVCTGSALALNWE--NDHVNAILNAWYGGQEG 690

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
             A+A+++FGD+NP GRLP+T+Y +S  ++P   D +M+        GR+YR+ T T +Y
Sbjct: 691 GTAVADVLFGDYNPAGRLPITFY-KSVDQLPDFQDYSMK--------GRTYRYMTQTPLY 741

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT + YK          +A L                     D++ S  S+  
Sbjct: 742 PFGYGLSYTTFDYK----------NAKLSK-------------------DKIASNESVTL 772

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
              I+  N G +DG  V  ++ + P    G P K +  F RV+  A   + +S  ++P
Sbjct: 773 SFDIA--NTGKMDGDEVAQIYIKNPNDPAG-PLKAMKAFKRVNVKAGSEQPVSIQLEP 827


>gi|256840106|ref|ZP_05545615.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
 gi|256739036|gb|EEU52361.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
          Length = 868

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 162/471 (34%), Positives = 228/471 (48%), Gaps = 61/471 (12%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N  L +  R   L+S LT +EKI Q+ +   AI RLGIP Y+WW+E+LHG+A  G 
Sbjct: 26  YPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG- 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
                      T FPQ +  AA+F+ +      + V+ EARA Y+  Q         GLT
Sbjct: 85  ---------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLT 135

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW PNINIFRDPRWGRG ET GEDP +     V   +  QG                   
Sbjct: 136 FWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG------------------- 176

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D  +     AC KH   +   +W   +R+ FNA  T +D  +T+ P F + +
Sbjct: 177 --------DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++G    +MC+YN+  G P C    L     RN WG+   I SDC A+   +     T  
Sbjct: 226 KEGDVQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPR 285

Query: 343 H------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           H      E ++A  +  G D+ CG    R    A+  GK+ EKD+D +L  L   +  LG
Sbjct: 286 HETHPDAESASADAVLNGTDLECGGS-YRALNKALADGKISEKDLDVSLRRLLKGRFELG 344

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
           +F+ D R   Y K+    V + EH   ALD AR+ IVLLKN    LPL+KN +  +A++G
Sbjct: 345 MFDPDERV-PYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVG 402

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           P   + + +   Y G P    +++ G+    +K   A   +++ CN  A F
Sbjct: 403 PNAADSTMLWANYNGFPSKTVTIVEGIR---NKVPNAEVIYELGCNHTADF 450



 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 138/298 (46%), Gaps = 55/298 (18%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           Y+ +T  A     V   +   F       K AD ++ V G+    E E+           
Sbjct: 574 YMQRTGSADLNFTVGVRTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKG 633

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR ++ +P  Q  +V ++  T K PV+ V+  G  L +++   +  +++IL   Y G+ G
Sbjct: 634 DRTNIEIPAVQKEMVKALVATGK-PVVYVVCTGSALALNWE--NDHVNAILNAWYGGQEG 690

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
             A+A+++FGD+NP GRLP+T+Y +S  ++P   D +M+        GR+YR+ T T +Y
Sbjct: 691 GTAVADVLFGDYNPAGRLPITFY-KSVDQLPDFQDYSMK--------GRTYRYMTQTPLY 741

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT + YK          +A L                     D++ S  S+  
Sbjct: 742 PFGYGLSYTTFDYK----------NAKLSK-------------------DKIASNESVTL 772

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
              I+  N G +DG  V  ++ + P    G P K +  F RV+  A   + +S  ++P
Sbjct: 773 SFDIA--NTGKMDGDEVAQIYIKNPNDPAG-PLKAMKAFKRVNVKAGSEQPVSIQLEP 827


>gi|298376791|ref|ZP_06986746.1| beta-glucosidase [Bacteroides sp. 3_1_19]
 gi|298266669|gb|EFI08327.1| beta-glucosidase [Bacteroides sp. 3_1_19]
          Length = 868

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 162/471 (34%), Positives = 228/471 (48%), Gaps = 61/471 (12%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N  L +  R   L+S LT +EKI Q+ +   AI RLGIP Y+WW+E+LHG+A  G 
Sbjct: 26  YPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG- 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
                      T FPQ +  AA+F+ +      + V+ EARA Y+  Q         GLT
Sbjct: 85  ---------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLT 135

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW PNINIFRDPRWGRG ET GEDP +     V   +  QG                   
Sbjct: 136 FWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG------------------- 176

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D  +     AC KH   +   +W   +R+ FNA  T +D  +T+ P F + +
Sbjct: 177 --------DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++G    +MC+YN+  G P C    L     RN WG+   I SDC A+   +     T  
Sbjct: 226 KEGDVQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPR 285

Query: 343 H------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           H      E ++A  +  G D+ CG    R    A+  GK+ EKD+D +L  L   +  LG
Sbjct: 286 HETHPDAESASADAVLNGTDLECGGS-YRALNKALADGKISEKDLDVSLRRLLKGRFELG 344

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
           +F+ D R   Y K+    V + EH   ALD AR+ IVLLKN    LPL+KN +  +A++G
Sbjct: 345 MFDPDERV-PYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVG 402

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           P   + + +   Y G P    +++ G+    +K   A   +++ CN  A F
Sbjct: 403 PNAADSTMLWANYNGFPSKTVTIVEGIR---NKVPNAEVIYELGCNHTADF 450



 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 138/298 (46%), Gaps = 55/298 (18%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           Y+ +T  A     V   +   F       K AD ++ V G+    E E+           
Sbjct: 574 YMQRTGSADLNFTVGVRTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKG 633

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR ++ +P  Q  +V ++  T K PV+ V+  G  L +++   +  +++IL   Y G+ G
Sbjct: 634 DRTNIEIPAVQKEMVKALVATGK-PVVYVVCTGSALALNWE--NDHVNAILNAWYGGQEG 690

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
             A+A+++FGD+NP GRLP+T+Y +S  ++P   D +M+        GR+YR+ T T +Y
Sbjct: 691 GTAVADVLFGDYNPAGRLPITFY-KSVDQLPDFQDYSMK--------GRTYRYMTQTPLY 741

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT + YK          +A L                     D++ S  S+  
Sbjct: 742 PFGYGLSYTTFDYK----------NAKLSK-------------------DKIASNESVTL 772

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
              I+  N G +DG  V  ++ + P    G P K +  F RV+  A   + +S  ++P
Sbjct: 773 SFDIA--NTGKMDGDEVAQIYIKNPNDPAG-PLKAMKAFKRVNVKAGSEQPVSIQLEP 827


>gi|262381651|ref|ZP_06074789.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
 gi|262296828|gb|EEY84758.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
          Length = 868

 Score =  269 bits (687), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 162/471 (34%), Positives = 228/471 (48%), Gaps = 61/471 (12%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N  L +  R   L+S LT +EKI Q+ +   AI RLGIP Y+WW+E+LHG+A  G 
Sbjct: 26  YPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG- 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
                      T FPQ +  AA+F+ +      + V+ EARA Y+  Q         GLT
Sbjct: 85  ---------RATVFPQAIAMAATFDDNAVHETFTIVSDEARAKYHQYQKDKEYDRYKGLT 135

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW PNINIFRDPRWGRG ET GEDP +     V   +  QG                   
Sbjct: 136 FWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG------------------- 176

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D  +     AC KH   +   +W   +R+ FNA  T +D  +T+ P F + +
Sbjct: 177 --------DDPNYYKTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++G    +MC+YN+  G P C    L     RN WG+   I SDC A+   +     T  
Sbjct: 226 KEGDVQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPR 285

Query: 343 H------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           H      E ++A  +  G D+ CG    R    A+  GK+ EKD+D +L  L   +  LG
Sbjct: 286 HETHPDAESASADAVLNGTDLECGGS-YRALNKALADGKISEKDLDVSLRRLLKGRFELG 344

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
           +F+ D R   Y K+    V + EH   ALD AR+ IVLLKN    LPL+KN +  +A++G
Sbjct: 345 MFDPDERV-PYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVG 402

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           P   + + +   Y G P    +++ G+    +K   A   +++ CN  A F
Sbjct: 403 PNAADSTMLWANYNGFPSKTVTIVEGIR---NKVPNAEVIYELGCNHTADF 450



 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 138/298 (46%), Gaps = 55/298 (18%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           Y+ +T  A     V   +   F       K AD ++ V G+    E E+           
Sbjct: 574 YMQRTGSADLNFTVGVRTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKG 633

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR ++ +P  Q  +V ++  T K PV+ V+  G  L +++   +  +++IL   Y G+ G
Sbjct: 634 DRTNIEIPAVQKEMVKALVATGK-PVVYVVCTGSALALNWE--NDHVNAILNAWYGGQEG 690

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
             A+A+++FGD+NP GRLP+T+Y +S  ++P   D +M+        GR+YR+ T T +Y
Sbjct: 691 GTAVADVLFGDYNPAGRLPITFY-KSVDQLPDFQDYSMK--------GRTYRYMTQTPLY 741

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT + YK          +A L                     D++ S  S+  
Sbjct: 742 PFGYGLSYTTFDYK----------NAKLSK-------------------DKIASNESVTL 772

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
              I+  N G +DG  V  ++ + P    G P K +  F RV+  A   + +S  ++P
Sbjct: 773 SFDIA--NTGKMDGDEVAQIYIKNPNDPAG-PLKAMKAFKRVNVKAGSEQPVSIQLEP 827


>gi|373956830|ref|ZP_09616790.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373893430|gb|EHQ29327.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 823

 Score =  268 bits (686), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 213/735 (28%), Positives = 342/735 (46%), Gaps = 125/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ +              T+FP  L    ++++ L + +G+   +
Sbjct: 172 RLGIPV-DFTNEGIRGVEA-----------YQATAFPTQLNMGMTWDKPLVNQMGNITGM 219

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EARA+      G T  +AP +++ RD RWGR +E  GEDP +V+   VE  K  Q  N  
Sbjct: 220 EARAL------GYTNVYAPILDVARDQRWGRLEEVYGEDPYLVARLGVEMAKGMQQNN-- 271

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH   Y   K G       +  +  
Sbjct: 272 -----------------------------QIAATAKHFAVYSANKGGREGLARTDPQVAP 302

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
           ++ E+    PF+  I++     +M SYN  +G+P  + G    L Q+ R E+GFKGY+ S
Sbjct: 303 REVENILLYPFKKVIKEAGLMGVMSSYNDYDGIP--ISGSSYWLIQRLRQEFGFKGYVVS 360

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DA+  ++   +     +D+      AGM++         ++ + +  + +GK+    I
Sbjct: 361 DSDALEYLYNKHHVAADLKDAVYQAFMAGMNVRTTFRTPDSIIIYARQLVKEGKLPIDTI 420

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+ +LGLF  D    +  +     V  + ++ +AL A+++ IVLLKN    
Sbjct: 421 NSRVRDVLRVKFKLGLF--DHPYVQDAEASAKLVNCAANQAVALQASKESIVLLKNKGAI 478

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS--KTHYASGCHDV 499
           LPL+K    +LA+IGP   N       Y  +     ++L G++A V   K  YA GC+ V
Sbjct: 479 LPLSKQ--QTLAVIGPNALNDDYAHTHYGPLASKSINILEGIQAKVGAGKVLYALGCNLV 536

Query: 500 ----------PCNSD----AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
                     P + D    A    AV IA+ AD  +VV G +     E++ R SL LPG 
Sbjct: 537 DKHWPESEILPQDPDQAEQAKIDSAVTIARHADVAVVVLGGNTQTAGENKSRTSLDLPGY 596

Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
           Q+ LV +V  T K PV++VL G  P+ +++   D  I  I++ GYPG  G  A+A+++FG
Sbjct: 597 QLRLVKAVKATGK-PVVVVLIGSQPMTINWI--DQHIDGIIYAGYPGTQGGTAVADVLFG 653

Query: 606 DFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           D+NPGG+L +T +P+S  ++P N     N   D       +         +Y FG GLSY
Sbjct: 654 DYNPGGKLTLT-FPKSVGQLPFNFPTKPNSETDEGELAKIKGL-------LYPFGFGLSY 705

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T ++Y      S+L IS +++  SD+               +   SC          VTN
Sbjct: 706 TTFAY------SDLKISPAIQ--SDQG--------------NVTVSC---------KVTN 734

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G V G  VV L+ R    +  T EK L GFDR+      +KE+ F + P + L + N+ 
Sbjct: 735 TGKVAGDEVVQLYLRDVLSTVTTYEKVLRGFDRLSLKPGETKEVMFTIVP-DDLKLYNRQ 793

Query: 783 GRRILPLGNHVLMVG 797
            + ++  G   +MVG
Sbjct: 794 MKYVVEPGEFKVMVG 808


>gi|222099590|ref|YP_002534158.1| Beta-mannanase [Thermotoga neapolitana DSM 4359]
 gi|2429092|gb|AAB70867.1| beta-xylosidase [Thermotoga neapolitana]
 gi|221571980|gb|ACM22792.1| Beta-mannanase [Thermotoga neapolitana DSM 4359]
          Length = 778

 Score =  268 bits (686), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 224/754 (29%), Positives = 341/754 (45%), Gaps = 147/754 (19%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA     E L G    G            T+FPQ +  A++++  L   + +A+  
Sbjct: 92  RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTAAIRE 139

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + R +      GL   AP +++ RDPRWGR +ET GE P +V+   V +VK  QGEN K 
Sbjct: 140 DMRKLG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGENIKE 194

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
                                        + A  KH   Y   + G N++  +    I E
Sbjct: 195 G----------------------------VVATVKHFAGYSASEGGKNWAPTN----IPE 222

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++  + F  PF + +++ +   +M SY++++GVP A  R  L    R +WGF+G + SD 
Sbjct: 223 REFREVFLFPFEAAVKEARVLSVMNSYSEIDGVPCAANRRLLTDILRKDWGFEGIVVSDY 282

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
            AV  + EY    K   +SA   L+AG+D+    T   +H +  ++KG V E  ID A+ 
Sbjct: 283 FAVNMLGEYHRIAKDKSESARLALEAGIDVELPKTDCYQHLKDLVEKGIVPESLIDEAVS 342

Query: 387 NLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            +  ++  LGLF     D  K K             H+ LAL+ AR+ I+LLKND   LP
Sbjct: 343 RVLKLKFMLGLFENPYVDVEKAK----------IESHRDLALEIARKSIILLKNDGT-LP 391

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGL 483
           L KN    +A+IGP    +  + G Y                 G P  P+     L + +
Sbjct: 392 LQKN--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSI 449

Query: 484 EAYVSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG---- 525
           E ++                  YA GC +V     +GF EA+ +AK++D  IVV G    
Sbjct: 450 EEHMKSIPSVLDAFKEEGIDFEYAKGC-EVTGEDRSGFKEAIEVAKRSDVAIVVVGDRSG 508

Query: 526 --LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
             LD T   E RD  +L LPG Q  LV  +A+T K PV+LVL  G P   S      +++
Sbjct: 509 LTLDCTT-GESRDMANLKLPGVQEELVLEIAKTGK-PVVLVLITGRPY--SLKNLVDRVN 564

Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGR 643
           +IL +  PGEAG +A+ ++I+G  NP G+LP++ +P S  ++P+    ++    R +   
Sbjct: 565 AILQVWLPGEAGGRAIVDVIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHG 622

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
            Y   +   ++ FGHGLSYT + Y  L                              +  
Sbjct: 623 DYVDESTKPLFPFGHGLSYTRFEYSNLR-----------------------------IEP 653

Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
            EV S   +   +++ V N GD+DG  VV L+      S   P K+L GF RV   AK  
Sbjct: 654 KEVPSAGEV--VIKVDVENVGDMDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEK 711

Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           K + F +   + L+  ++  + ++  G   +MVG
Sbjct: 712 KTVVFRLH-TDVLAYYDRDMKLVVEPGEFRVMVG 744


>gi|294675412|ref|YP_003576028.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
 gi|294472176|gb|ADE81565.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
          Length = 875

 Score =  268 bits (686), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 159/448 (35%), Positives = 237/448 (52%), Gaps = 51/448 (11%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N +LS + RA  L+S LTL EK+  + D + AIPRLGIP ++WW+E+LHGI  NG  
Sbjct: 24  PYQNANLSAAQRADDLLSRLTLDEKVSLMMDTSPAIPRLGIPQFQWWNEALHGIGRNG-- 81

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
                     T FP  +  AAS++ +L   + +AV+ EAR      +          L+F
Sbjct: 82  --------FATVFPITMAMAASWDDALLHQVFTAVSDEARVKAQQAKCTGDIKRYQSLSF 133

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PNINIFRDPRWGRGQET GEDP + +   +  V+  QG  +  +D G+   +R+    
Sbjct: 134 WTPNINIFRDPRWGRGQETYGEDPYLTAKMGLAVVRGLQGVGYNGEDLGVS-KYRK---- 188

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN-AMITEQDTEDTFQPPFRSCI 283
                         L AC KH   +   +W   +R+ FN   + E+D  +T+ P F++ +
Sbjct: 189 --------------LLACAKHFAVHSGPEW---NRHEFNIENLPERDLWETYLPAFKALV 231

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF-EYQNYTK 341
           ++GK + +MC+Y +++G   C +    Q+  R+EWGF G ITSDC A+      + N +K
Sbjct: 232 QEGKVAEVMCAYQRIDGQACCAQTRYEQQILRDEWGFDGLITSDCGAIRDFLPRWHNVSK 291

Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
              +++A  + AG D+ CG+   +H   A+ +G V+E DIDR+L  L   +  LG  + D
Sbjct: 292 DGAEASAKAVLAGTDVECGS-EYKHLPEAVRRGDVKEADIDRSLRRLLIARFELGDMDSD 350

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL----PLNKNAVS--SLAII 455
                + K+    V +  HK LAL  A + IVLL+N  K L    PLN  A S   + ++
Sbjct: 351 DLNA-WTKIPETVVASQAHKDLALKMALKSIVLLQNKIKVLPLGNPLNAGAGSDKDIVVM 409

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGL 483
           GP  N+   M G Y G P    + L G+
Sbjct: 410 GPNANDSVMMWGNYAGYPTHTVTALDGI 437



 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 78/324 (24%), Positives = 126/324 (38%), Gaps = 65/324 (20%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           YV +T Y +   D+    +    E +        +I V G+    E E+           
Sbjct: 589 YVQETGYGALNFDIKKRVNPTAEELLAQIGNTQTIIFVGGISPNLEGEEMRVNEPGFKGG 648

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR S+ LP  Q  L+  + +  K+ + +  +G     ++ A       +IL   Y GE G
Sbjct: 649 DRTSIELPQAQRDLLAVLHKAGKKVIFVNCSGSA---MALAPELETCDAILQWWYGGEQG 705

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             ALA  +FG   P G+LP+T+Y  +       D  M+         R+YR+Y G  ++ 
Sbjct: 706 GAALATTLFGMVAPSGKLPVTFYKSTDELPDFLDYTMK--------NRTYRYYEGEPLFP 757

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG GL YT ++                    DK I +                       
Sbjct: 758 FGFGLGYTTFNI-------------------DKPIYKNN--------------------K 778

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           VQ+ V N G   G+  V ++ R     +G P+K L  + +V   A  +K IS  + P + 
Sbjct: 779 VQVRVKNLGTTAGTETVQVYIRHLADKEG-PKKSLRAYQQVTLNAAEAKTISIEL-PRKS 836

Query: 776 LS--IANKHGRRILPLGNHVLMVG 797
                   +  R++P G + +MVG
Sbjct: 837 FEGWDVKTNTMRVVP-GKYEVMVG 859


>gi|261408260|ref|YP_003244501.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261284723|gb|ACX66694.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 763

 Score =  268 bits (686), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 226/744 (30%), Positives = 339/744 (45%), Gaps = 135/744 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  E  HG  + G            T FP  L   +++N  L+ +I  AVA 
Sbjct: 104 RLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTWNTELFRSISRAVAA 151

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E RA     Q G   ++P +++ RDPRWGR +ET GEDP +V+ +AV  V+  QGE   S
Sbjct: 152 ETRA-----QGGSATYSPVLDVVRDPRWGRTEETFGEDPHLVTEFAVAAVQGLQGERLDS 206

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                        L A  KH   Y   + G   R      +  +
Sbjct: 207 --------------------------HTSLLATLKHFAGYGASEGG---RNGAPVHMGLR 237

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           +  +    PFR  +E G  S +M +YN+++GVP    G L Q   R  WGF G++ +DC 
Sbjct: 238 ELHEVDLLPFRKAVEAGALS-VMTAYNEIDGVPCTSSGYLLQDVLREAWGFDGFVITDCG 296

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLN 387
           A+  +    N   +  ++AA  LKAG+D+     M R H   A+++G + E+D++RA   
Sbjct: 297 AIHMLACGHNTAGSGVEAAAQSLKAGVDMEMSGTMFRAHLHQALEQGLITEEDLNRAAGR 356

Query: 388 LFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
           +  ++ RLGLF+    DP   +        +   EH  LA  AA +GIVLLKN+   LPL
Sbjct: 357 VLELKFRLGLFDRPYVDPAWAE------QVIGCKEHIALAYQAAAEGIVLLKNEGNLLPL 410

Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV-----SKTHYASGCHDV 499
           + ++  ++A+IGP  +      G YT  P  P  ++  L+        S+  YA GC  +
Sbjct: 411 DSSS-GTIAVIGPNAHAPYHQLGDYTS-PQPPGQIVTVLDGIRRRLGDSRVLYAPGCR-I 467

Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG-----------LDLTQET--------------ED 534
             +S  GF  A+  A++AD +++V G           +DL                  E 
Sbjct: 468 QGDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGHAESDMECGEG 527

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR +L L G Q+ L+  + +  K PVI+V   G P+   +   D  I SI+   YPG+ 
Sbjct: 528 IDRSTLTLMGVQLELLQELHKLGK-PVIVVYINGRPITEPWI--DEHIPSIVEAWYPGQE 584

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
           G  A+A+++FGD NP GRLP++  P+   ++P N  N     +R+  G+ Y        Y
Sbjct: 585 GGSAIADMLFGDINPSGRLPLS-IPKEVGQLP-NSYN-----ARRTRGKRYLETDLAPRY 637

Query: 655 GFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
            FG GLSYT + Y  L+  P+ + I                          E T      
Sbjct: 638 PFGFGLSYTEFRYGRLTVEPAVVPIGG------------------------EAT------ 667

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
             V+I VTNAG  DG+ VV L+      S   PEK L GF +V   A  ++E++F +   
Sbjct: 668 --VRIDVTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEVTFTIG-S 724

Query: 774 EQLSIANKHGRRILPLGNHVLMVG 797
           EQL +     + ++  G   + VG
Sbjct: 725 EQLELIGLDLKPVVEPGEFRIQVG 748


>gi|423223721|ref|ZP_17210190.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638096|gb|EIY31949.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 954

 Score =  268 bits (686), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 218/767 (28%), Positives = 354/767 (46%), Gaps = 120/767 (15%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIAS 108
           S  + + +L +  R +SL+S++T ++K++ + +      IP L +P      E++HG + 
Sbjct: 167 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY 225

Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPN 168
                      S  T FPQ L   A++N+ L  ++  AV  E      L    +  W+P 
Sbjct: 226 G----------SGATIFPQALAMGATWNKKLTEDVAMAVGDE-----TLAAGTMQAWSPV 270

Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
           +++ +D RWGR +ET GEDP++VS     ++K +Q +                       
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSKG---------------------- 308

Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
                     L    KH   +     G   R S +  ++E++  +    PFR  I     
Sbjct: 309 ----------LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVIRNYDC 355

Query: 289 SCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDS 346
             +M +Y+   GVP     +L     R EWGF G+I SDC A+  +   ++YT K   ++
Sbjct: 356 QSVMMAYSDYLGVPVAKSRELLHSILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEA 415

Query: 347 AAGVLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           A   L AG+  NCG T   +    A   G++  +++D     +  +  R  LF   P K 
Sbjct: 416 ANQALAAGIATNCGDTYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKTPNKP 475

Query: 406 -KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             + K+ P    +  HK++A  AAR+ IV+L+N    LPL K+ + ++A++GP  +++  
Sbjct: 476 LDWNKIYPG-WNSDSHKEMARQAARESIVMLENKDNILPLAKD-MRTIAVVGPGADDLQP 533

Query: 465 MGGGYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
             G YT   +P   KS+L G++  V   +K  Y  GC D   ++     +AV+ A ++D 
Sbjct: 534 --GDYTPKLLPGQLKSVLTGIKQAVGKQTKVVYEQGC-DFTSSNGTDIPKAVKAASQSDV 590

Query: 520 VIVVAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           V++V G   T E+         E+ D  +L+LPG+Q  L+ +V  T K PVIL+L  G P
Sbjct: 591 VVLVLGDCSTSESTTDVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILILQAGRP 649

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDM 630
            ++S A    +   + W+  PG+ G  A A+++FGD+NP GRLPMT +P    ++P+   
Sbjct: 650 YNLSKASELCKAILVNWL--PGQEGGPATADVLFGDYNPAGRLPMT-FPRHVGQLPLY-Y 705

Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
           N +  S R+Y      FY    +Y FG+GLSYT++ Y  L           ++   + N+
Sbjct: 706 NFKT-SGRRYEYSDMEFYP---LYYFGYGLSYTSFEYSGL----------KIQEKDNGNV 751

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
                                    +Q +V N G   G  VV L+      S  T   +L
Sbjct: 752 ------------------------AIQATVKNVGQRAGDEVVQLYITDMYASVKTRITEL 787

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             F RVH     SK +SF + P E LS+ N    R++  G   ++VG
Sbjct: 788 KDFTRVHLQPDESKIVSFELTPYE-LSLLNDRMDRVVEKGEFKILVG 833


>gi|224536377|ref|ZP_03676916.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522015|gb|EEF91120.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 954

 Score =  268 bits (685), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 218/767 (28%), Positives = 354/767 (46%), Gaps = 120/767 (15%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIAS 108
           S  + + +L +  R +SL+S++T ++K++ + +      IP L +P      E++HG + 
Sbjct: 167 SLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSY 225

Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPN 168
                      S  T FPQ L   A++N+ L  ++  AV  E      L    +  W+P 
Sbjct: 226 G----------SGATIFPQALAMGATWNKKLTEDVAMAVGDE-----TLAAGTMQAWSPV 270

Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
           +++ +D RWGR +ET GEDP++VS     ++K +Q +                       
Sbjct: 271 LDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSKG---------------------- 308

Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
                     L    KH   +     G   R S +  ++E++  +    PFR  I     
Sbjct: 309 ----------LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVIRNYDC 355

Query: 289 SCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDS 346
             +M +Y+   GVP     +L     R EWGF G+I SDC A+  +   ++YT K   ++
Sbjct: 356 QSVMMAYSDYLGVPVAKSRELLHSILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEA 415

Query: 347 AAGVLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           A   L AG+  NCG T   +    A   G++  +++D     +  +  R  LF   P K 
Sbjct: 416 ANQALAAGIATNCGDTYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKTPNKP 475

Query: 406 -KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             + K+ P    +  HK++A  AAR+ IV+L+N    LPL K+ + ++A++GP  +++  
Sbjct: 476 LDWNKIYPG-WNSDSHKEMARQAARESIVMLENKDNILPLAKD-MRTIAVVGPGADDLQP 533

Query: 465 MGGGYTG--IPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADF 519
             G YT   +P   KS+L G++  V   +K  Y  GC D   ++     +AV+ A ++D 
Sbjct: 534 --GDYTPKLLPGQLKSVLTGIKQAVGKQTKVVYEQGC-DFTSSNGTNIPKAVKAASQSDV 590

Query: 520 VIVVAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           V++V G   T E+         E+ D  +L+LPG+Q  L+ +V  T K PVIL+L  G P
Sbjct: 591 VVLVLGDCSTSESTTDVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILILQAGRP 649

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDM 630
            ++S A    +   + W+  PG+ G  A A+++FGD+NP GRLPMT +P    ++P+   
Sbjct: 650 YNLSKASELCKAILVNWL--PGQEGGPATADVLFGDYNPAGRLPMT-FPRHVGQLPLY-Y 705

Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
           N +  S R+Y      FY    +Y FG+GLSYT++ Y  L           ++   + N+
Sbjct: 706 NFKT-SGRRYEYSDMEFYP---LYYFGYGLSYTSFEYSGL----------KIQEKDNGNV 751

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
                                    +Q +V N G   G  VV L+      S  T   +L
Sbjct: 752 ------------------------AIQATVKNVGQRAGDEVVQLYITDMYASVKTRITEL 787

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             F RVH     SK +SF + P E LS+ N    R++  G   ++VG
Sbjct: 788 KDFTRVHLQPGESKIVSFELTPYE-LSLLNDRMDRVVEKGEFKILVG 833


>gi|319643197|ref|ZP_07997825.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
 gi|345520511|ref|ZP_08799899.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|254835034|gb|EET15343.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|317385101|gb|EFV66052.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
          Length = 788

 Score =  268 bits (685), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 217/736 (29%), Positives = 343/736 (46%), Gaps = 129/736 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L + IG    V
Sbjct: 138 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKKLIARIGE---V 182

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA+    LG   +  ++P ++I +DPRWGR  ET GEDP +V     + + S Q  N   
Sbjct: 183 EAKEAVALGYTNI--YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKYN--- 237

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                        L A  KH   Y +   G   +   +  +  +
Sbjct: 238 -----------------------------LVATPKHFAVYSIPIGGRDGKTRTDPHVAPR 268

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
           +    +  PFR   ++  A  +M SYN  +G P  + G    L +  R EWGFKGY+ SD
Sbjct: 269 EMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 326

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
            +AV  I        T+ED  A  + AG++I          +   + A+D GK+ ++ +D
Sbjct: 327 SEAVEFISNKHKVADTYEDGIAQAVNAGLNIRTHFTPPADFILPLRKAVDDGKISQETLD 386

Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
           + +  +  ++  LGLF+ +P +G  GK     V + EH+ ++L+AARQ +VLLKN+   L
Sbjct: 387 KRVAEILRIKFWLGLFD-NPYRGN-GKQAEQIVHSKEHQAVSLEAARQSLVLLKNETHLL 444

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCH--- 497
           PL+K ++ S+A+IGP  +  +Q+   Y       K++ +G++  +  ++  Y  GC    
Sbjct: 445 PLSK-SIRSIAVIGPNADEQTQLICRYGPANAPIKTVYQGIKELLPHAEVIYKKGCDIID 503

Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
                    D P  ++      E +R AK+A+ V++V G +     EDR R SL LPG+Q
Sbjct: 504 PHFPESEILDFPKTAEEVRLMQEVIRAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQ 563

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             L+ +V  T K PVILV+  G    +++A A   + +IL   +PGE   +A+AE +FGD
Sbjct: 564 EELLKAVCATGK-PVILVMLDGRASSINYAAA--HVPAILHAWFPGEFCGQAVAEALFGD 620

Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
           +NPGGRL +T +P+S  ++P        +D S           + T VYG    FGHGLS
Sbjct: 621 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDES-----------SSTSVYGALYPFGHGLS 668

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT ++Y      S+L IS S             G + D               HV   + 
Sbjct: 669 YTTFTY------SDLHISPS-----------HQGVQGD--------------IHVSCKIK 697

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G + G  VV L+ R    S  T  K L GF+R+   A   + + F + P + L + +K
Sbjct: 698 NTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP-QDLGLWDK 756

Query: 782 HGRRILPLGNHVLMVG 797
           +    +  G+  +M+G
Sbjct: 757 NMNFRVEPGSFKVMLG 772


>gi|6006601|emb|CAB56857.1| beta-mannanase [Thermotoga neapolitana]
          Length = 821

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 224/754 (29%), Positives = 341/754 (45%), Gaps = 147/754 (19%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA     E L G    G            T+FPQ +  A++++  L   + +A+  
Sbjct: 135 RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTAAIRE 182

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + R +      GL   AP +++ RDPRWGR +ET GE P +V+   V +VK  QGEN K 
Sbjct: 183 DMRKLG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGENIKE 237

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
                                        + A  KH   Y   + G N++  +    I E
Sbjct: 238 G----------------------------VVATVKHFAGYSASEGGKNWAPTN----IPE 265

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++  + F  PF + +++ +   +M SY++++GVP A  R  L    R +WGF+G + SD 
Sbjct: 266 REFREVFLFPFEAAVKEARVLSVMNSYSEIDGVPCAANRRLLTDILRKDWGFEGIVVSDY 325

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
            AV  + EY    K   +SA   L+AG+D+    T   +H +  ++KG V E  ID A+ 
Sbjct: 326 FAVNMLGEYHRIAKDKSESARLALEAGIDVELPKTDCYQHLKDLVEKGIVPESLIDEAVS 385

Query: 387 NLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            +  ++  LGLF     D  K K             H+ LAL+ AR+ I+LLKND   LP
Sbjct: 386 RVLKLKFMLGLFENPYVDVEKAK----------IESHRDLALEIARKSIILLKNDGT-LP 434

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGL 483
           L KN    +A+IGP    +  + G Y                 G P  P+     L + +
Sbjct: 435 LQKN--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSI 492

Query: 484 EAYVSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG---- 525
           E ++                  YA GC +V     +GF EA+ +AK++D  IVV G    
Sbjct: 493 EEHMKSIPSVLDAFKEEGIDFEYAKGC-EVTGEDRSGFKEAIEVAKRSDVAIVVVGDRSG 551

Query: 526 --LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
             LD T   E RD  +L LPG Q  LV  +A+T K PV+LVL  G P   S      +++
Sbjct: 552 LTLDCTT-GESRDMANLKLPGVQEELVLEIAKTGK-PVVLVLITGRPY--SLKNLVDRVN 607

Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGR 643
           +IL +  PGEAG +A+ ++I+G  NP G+LP++ +P S  ++P+    ++    R +   
Sbjct: 608 AILQVWLPGEAGGRAIVDVIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHG 665

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
            Y   +   ++ FGHGLSYT + Y  L                              +  
Sbjct: 666 DYVDESTKPLFPFGHGLSYTRFEYSNLR-----------------------------IEP 696

Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
            EV S   +   +++ V N GD+DG  VV L+      S   P K+L GF RV   AK  
Sbjct: 697 KEVPSAGEVV--IKVDVENVGDMDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEK 754

Query: 764 KEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           K + F +   + L+  ++  + ++  G   +MVG
Sbjct: 755 KTVVFRLH-TDVLAYYDRDMKLVVEPGEFRVMVG 787


>gi|297543748|ref|YP_003676050.1| glycoside hydrolase family 3 domain-containing protein
           [Thermoanaerobacter mathranii subsp. mathranii str. A3]
 gi|296841523|gb|ADH60039.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
           mathranii subsp. mathranii str. A3]
          Length = 787

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 226/751 (30%), Positives = 344/751 (45%), Gaps = 135/751 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA     ES  G  + G            T FPQ +  A+++N  L   + S +  
Sbjct: 94  RLGIPAL-IHEESCSGYMAKG-----------ATIFPQTIGVASTWNPKLVEKMASVIRE 141

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + +A+           AP +++ RDPRWGR +ET GEDP +V    V +++  Q EN K 
Sbjct: 142 QMKAV-----GARQALAPLLDVTRDPRWGRTEETFGEDPYLVMHMGVSYIRGLQTENLK- 195

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
            +G I  G                          KH + Y   + G N++     A I  
Sbjct: 196 -EGVIATG--------------------------KHFVGYGNSEGGMNWA----PAHIPM 224

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++  + F  PF + +++ K   IM  Y++++G+P      L     R  WGF G + SD 
Sbjct: 225 RELYEIFLYPFEAAVKEAKLGSIMPGYHELDGIPCHKSKQLLTDILRKNWGFDGIVVSDY 284

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQEKDIDRAL 385
            A+  ++EY       +++A   L+AG+D+   +  C     +  I++G +    ++ A+
Sbjct: 285 FAINQLYEYHRLASNKKEAAKLALEAGVDVELPSTDCYGLPIKELIEQGDIDIDFVNDAV 344

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             +   +  LGLF  +P   +  K   +   T E ++LA   A++ IVLLKN+   LPL 
Sbjct: 345 RRILKAKFLLGLFE-NPYVDE--KRVVEIFDTQEQRQLAYKIAQESIVLLKNESNLLPLK 401

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLR--------------GLEA---YV- 487
           K+ + S+A+IGP  +NI  M G Y   PC  +SLL               GLEA   YV 
Sbjct: 402 KD-LQSIAVIGPNADNIRNMIGDY-AYPCHIESLLEMREKDNVFNTPLPEGLEAKDIYVP 459

Query: 488 ---------------SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----LD 527
                          +K  YA GC DV  +  AGF++AV IAK+AD  IVV G      D
Sbjct: 460 IVSVLQGIKEKVSPKTKVIYAKGC-DVISDDTAGFNKAVEIAKQADVAIVVVGDRAGLTD 518

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS-FAEADSQISSIL 586
                E RDR  L LPG Q  LV ++  T   PV++VL  G P+ +S  AE   +I +I+
Sbjct: 519 GCTSGESRDRADLNLPGVQEQLVKAIYETGT-PVVVVLINGRPMSISRLAE---KIPAII 574

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
               PGE G +A+A++IFGD+NPGG+LP++  P S  ++P+   +  +     + G  Y 
Sbjct: 575 EAWLPGEEGGRAIADVIFGDYNPGGKLPIS-IPCSVGQLPVYYYHKPSGGRTNWKG-DYV 632

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
             +   +Y FG+GLSYT + Y      S L IS   K  + + I++              
Sbjct: 633 ESSTKPLYPFGYGLSYTEFLY------SNLNISNP-KVSTQEGIIE-------------- 671

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
                    + + V N G V G  VV L+     +S   P K+L GF R+       K +
Sbjct: 672 ---------ISVDVKNIGKVKGDEVVQLYIHREFLSVTRPVKELKGFKRITLDVGEQKTV 722

Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            F +   EQL   N+    ++  G   +M+G
Sbjct: 723 IFQLS-SEQLGFYNEEMEYVVEPGRVEVMIG 752


>gi|289577460|ref|YP_003476087.1| glycoside hydrolase [Thermoanaerobacter italicus Ab9]
 gi|289527173|gb|ADD01525.1| glycoside hydrolase family 3 domain protein [Thermoanaerobacter
           italicus Ab9]
          Length = 787

 Score =  267 bits (683), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 224/750 (29%), Positives = 342/750 (45%), Gaps = 133/750 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA     ES  G  + G            T FPQ +  A+++N  L   + S +  
Sbjct: 94  RLGIPAL-IHEESCSGYMAKG-----------ATIFPQTIGVASTWNPKLVEKMASVIRE 141

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + +A+           AP +++ RDPRWGR +ET GEDP +V    V +++  Q EN K 
Sbjct: 142 QMKAV-----GARQALAPLLDVTRDPRWGRTEETFGEDPYLVMHMGVSYIRGLQTENLK- 195

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
            +G I  G                          KH + Y   + G N++     A I  
Sbjct: 196 -EGVIATG--------------------------KHFVGYGNSEGGMNWA----PAHIPM 224

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++  + F  PF + +++ K   IM  Y++++G+P      L     R  WGF G + SD 
Sbjct: 225 RELYEIFLYPFEAAVKEAKLGSIMPGYHELDGIPCHKSKQLLTDILRKNWGFDGIVVSDY 284

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQEKDIDRAL 385
            A+  ++EY       +++A   L+AG+D+   +  C     +  I++G +    ++ A+
Sbjct: 285 FAINQLYEYHRLASNKKEAAKLALEAGVDVELPSTDCYGLPIKELIEQGDIDIDFVNDAV 344

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             +   +  LGLF  +P   +  K   +   T E ++LA   A++ IVLLKN+   LPL 
Sbjct: 345 RRILKAKFLLGLFE-NPYVDE--KRVVEIFDTQEQRQLAYKIAQESIVLLKNESNLLPLK 401

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLR--------------GLEA---YV- 487
           K+ + S+A+IGP  +NI  M G Y   PC  +SLL               GLEA   YV 
Sbjct: 402 KD-LQSIAVIGPNADNIRNMIGDY-AYPCHIESLLEMREKDNVFNTPLPEGLEAKDIYVP 459

Query: 488 ---------------SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----LD 527
                          +K  YA GC DV  +  AGF++AV +AK+AD  IVV G      D
Sbjct: 460 IVSVLQGIKEKVSPKTKVIYAKGC-DVISDDTAGFNKAVEVAKQADVAIVVVGDRAGLTD 518

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
                E RDR  L LPG Q  L+ +V  T   PVI+VL  G P+ +S+     +I +I+ 
Sbjct: 519 GCTSGESRDRADLNLPGVQEELIKAVYETGT-PVIVVLINGRPMSISWIA--EKIPAIIE 575

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRF 647
              PGE G +A+A++IFGD+NPGG+LP++  P S  ++P+   +  +     + G  Y  
Sbjct: 576 AWLPGEEGGRAIADVIFGDYNPGGKLPIS-IPRSVGQLPVYYYHKPSGGRTNWKG-DYVE 633

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
            +   +Y FG+GLSYT + Y      S L+IS   K  +   I++               
Sbjct: 634 SSTKPLYPFGYGLSYTEFLY------SNLSISHP-KVATQGGIIE--------------- 671

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
                   +   V N G V G  VV L+     +S   P K+L GF R+       K + 
Sbjct: 672 --------ISADVKNIGKVKGDEVVQLYIHREFLSVTRPVKELKGFKRITLDVGEQKTVI 723

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           F +   EQL   N+    ++  G   +M+G
Sbjct: 724 FQLS-SEQLGFYNEEMEYVVEPGRVEVMIG 752


>gi|393786908|ref|ZP_10375040.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
           CL02T12C05]
 gi|392658143|gb|EIY51773.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
           CL02T12C05]
          Length = 854

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 176/490 (35%), Positives = 253/490 (51%), Gaps = 75/490 (15%)

Query: 11  RERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLIS 70
           + RILF V+ L+ V  V A  N K V                 + + +     R   L+S
Sbjct: 3   KRRILFFVVSLVYVFTVQA-QNGKDV-----------------YLDMNAPRHERILDLLS 44

Query: 71  LLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLV 130
            LT++EKI  L   +  IPRL I  Y   +E+LHG+   G   NF       T FPQ + 
Sbjct: 45  KLTIEEKISLLRATSPGIPRLHIDKYYHGNEALHGVVRPG---NF-------TVFPQAIG 94

Query: 131 SAASFNRSLWSNIGSAVAVEARAMYN--------LGQAG--LTFWAPNINIFRDPRWGRG 180
            AA +N  L + I + ++ EARA +N        LGQ    LTFW+P +N+ RDPRWGR 
Sbjct: 95  LAAMWNPQLLNEISTVISDEARARWNELEQGKKQLGQFSDLLTFWSPTVNMARDPRWGRT 154

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
            ET GEDP +     V FVK  QG+              + R LK             + 
Sbjct: 155 PETYGEDPFLSGKLGVSFVKGLQGD--------------DPRYLK-------------IV 187

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           +  KH  A + E     +R+  N +I+E+D  + + P F  CI +GKA+ IM +YN +N 
Sbjct: 188 STPKHFAANNEEH----NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAIND 243

Query: 301 VPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
           VP  L   L +K  R++WGF GY+ SDC   + +  +  Y KT E +AA  ++AG+D+ C
Sbjct: 244 VPCTLNNWLLKKVLRHDWGFDGYVVSDCGGPSFLVTHHKYVKTLEAAAALSIQAGLDLEC 303

Query: 360 G-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS 418
           G    +    +A  +  V E +ID A  ++   ++RLGLF+ DP    Y K+ P  V   
Sbjct: 304 GDEVYMEPLLNAYKQYMVSEAEIDSAAYHVLRARMRLGLFD-DPALNPYNKISPSIVGCE 362

Query: 419 EHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS-PK 477
           +H KLAL+AARQ IVLLKN+KKFLPL+   + S+A++G  +N  +   G Y+G P + P 
Sbjct: 363 KHSKLALEAARQSIVLLKNEKKFLPLDSKKIKSIAVVG--INAGNSEFGDYSGTPVNQPV 420

Query: 478 SLLRGLEAYV 487
           S+L G++  V
Sbjct: 421 SILEGIKKKV 430



 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 96/307 (31%), Positives = 149/307 (48%), Gaps = 57/307 (18%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA  I +K D  + V G++ + E E +DR S+ LP  Q   +    + +   V+ VL 
Sbjct: 592 YGEAGDIMRKCDLTVAVLGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVV-VLV 650

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L +++   D  I +I+   YPGEAG  A+AE++FGD+NPGG+LP+T+Y  S  ++P
Sbjct: 651 AGSSLAINWM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYY-RSLDELP 707

Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
             +D ++R        GR+Y+F+ G  +Y FGHGLSYT +SYK LS              
Sbjct: 708 AFDDYDIRK-------GRTYQFFEGDPLYAFGHGLSYTTFSYKKLS-------------- 746

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
                            ID      S+ F    ++ N G  +G  V  L+ +     QG+
Sbjct: 747 -----------------IDAAGDVVSVSF----TLKNTGKYEGDEVAQLYVKY----QGS 781

Query: 746 ------PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGEL 799
                 P KQL GF+R+H     SK+I+  V   E      + G    P G+++ MVG  
Sbjct: 782 DSQVKLPLKQLKGFERIHLKKGESKQINLTVPKSELRFWNEEKGEFYTPAGDYLFMVGTA 841

Query: 800 RHSLTIE 806
             ++ ++
Sbjct: 842 SDAIQLQ 848


>gi|148269983|ref|YP_001244443.1| glycoside hydrolase family 3 protein [Thermotoga petrophila RKU-1]
 gi|147735527|gb|ABQ46867.1| glycoside hydrolase, family 3 domain protein [Thermotoga petrophila
           RKU-1]
          Length = 778

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 223/752 (29%), Positives = 343/752 (45%), Gaps = 143/752 (19%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA     E L G    G            T+FPQ +  A++++  L   + +A+  
Sbjct: 92  RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTTAIRE 139

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + R +      GL   AP +++ RDPRWGR +ET GE P +V+   V +VK  QGE+ K 
Sbjct: 140 DMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGEDIK- 193

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
                          KG            + A  KH   Y   + G N++  +    I E
Sbjct: 194 ---------------KG------------VVATVKHFAGYSASEGGKNWAPTN----IPE 222

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ ++ F  PF + +++     +M SY++++GVP A  R  L    R +WGFKG + SD 
Sbjct: 223 REFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFKGIVVSDY 282

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
            AV  + +Y    +   ++A   L+AG+D+    T   ++ +  ++KG + E  ID A+ 
Sbjct: 283 FAVKVLEDYHRIARDKSEAARLALEAGIDVELPKTECYQYLKDLVEKGIISEALIDEAVA 342

Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
            +  ++  LGLF     + +  K+         HK +ALD AR+ I+LLKND   LPL K
Sbjct: 343 RVLRLKFMLGLFENPYVEVEKAKI-------ESHKDIALDIARKSIILLKNDG-ILPLQK 394

Query: 447 NAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGLEAY 486
           N    +A+IGP    +  + G Y                 G P  P+     L + +E +
Sbjct: 395 N--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEH 452

Query: 487 VSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG------L 526
           +                  YA GC +V     +GF EA+ IAKK+D  IVV G      L
Sbjct: 453 MKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTL 511

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
           D T   E RD  +L LPG Q  LV  VA+T K PV+LVL  G P   S      ++++IL
Sbjct: 512 DCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPY--SLKNVVDKVNAIL 567

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
            +  PGEAG +A+ +II+G  NP G+LP++ +P S  ++P+    ++    R +    Y 
Sbjct: 568 QVWLPGEAGGRAIVDIIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHGDYV 625

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
             +   ++ FGHGLSYT + Y  L   P E+  +  +                       
Sbjct: 626 DESTKPLFPFGHGLSYTKFEYSNLRIEPKEVPPAGEVV---------------------- 663

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
                     +++ V N GD DG  VV L+      S   P K+L GF RV   AK  K 
Sbjct: 664 ----------IKVDVENTGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKT 713

Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           + F +   + L+  ++  + ++  G   +MVG
Sbjct: 714 VVFRLH-MDVLAYYDRDMKLVVEPGEFKVMVG 744


>gi|423289665|ref|ZP_17268515.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
           CL02T12C04]
 gi|423298158|ref|ZP_17276217.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
           CL03T12C18]
 gi|392663699|gb|EIY57246.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
           CL03T12C18]
 gi|392667376|gb|EIY60886.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
           CL02T12C04]
          Length = 955

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 221/764 (28%), Positives = 355/764 (46%), Gaps = 120/764 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + + SL +  R +SL++++T ++K++ + +      IP L +P      E++HG +    
Sbjct: 171 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 227

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   S  T FPQ L   A++NR L   +   +  E   + N  QA    W+P +++
Sbjct: 228 --------SGATIFPQALAMGATWNRKLTEEVAMVIGDET-VVANTKQA----WSPVLDV 274

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            +D RWGR +ET GEDP++VS     ++K +Q                            
Sbjct: 275 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------------------- 307

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
            RG    L    KH   +     G   R S +  ++E++  +    PFR  +       +
Sbjct: 308 -RG----LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVVRNYDCQSL 359

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAAG 349
           M +Y+   G+P     +L Q+  R EWGF G+I SDC A+  +   ++YT K   ++A  
Sbjct: 360 MMAYSDYMGIPVAGSTELLQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQ 419

Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
            L AG+  NCG T   +    A   G++   ++D     + +   R  LF  +P K   +
Sbjct: 420 ALAAGIATNCGDTYNDKEVIQAAKDGRINMVNLDNVCRTMLATMFRNELFEKNPCKPLDW 479

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            K+ P    +  H+++A  AAR+ IV+L+N    LPL+K  + ++A++GP  +++    G
Sbjct: 480 NKIYPG-WNSDRHREMARQAARESIVMLENKDNLLPLSK-TLKTIAVLGPGADDLQP--G 535

Query: 468 GYTGI--PCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
            YT    P   KS+L G++A V   +K  Y  GC D          +AV+ A ++D V++
Sbjct: 536 DYTPKLQPGQLKSVLSGIKAAVGKQTKVLYEQGC-DFTTPDATNIPKAVKAASQSDVVVM 594

Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           V G   T E          E+ D  +L+LPG+Q  L+ +V  T K PV+L+L  G P D+
Sbjct: 595 VLGDCSTSEATNNVRKTCGENNDWATLILPGKQQELLEAVCATGK-PVVLILQAGRPYDL 653

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
              +A     +IL    PG+ G  A A+++FGD+NPGGRLPMT +P    ++P+   N +
Sbjct: 654 --LKASEMCKAILVNWLPGQEGGPATADVLFGDYNPGGRLPMT-FPRHVGQLPLY-YNFK 709

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S R+Y      FY    +Y FG+GLSYT++ Y      S+L I       S+ N++  
Sbjct: 710 T-SGRRYEYVDMEFYP---LYRFGYGLSYTSFEY------SDLKIQEK----SNGNVM-- 753

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                                 VQ +V N G   G  V  L+      S  T   +L  F
Sbjct: 754 ----------------------VQATVKNVGGCAGDEVAQLYITDMYASVKTRVMELKDF 791

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            R+H     SK +SF + P + +S+ N    R++  G   +MVG
Sbjct: 792 TRIHLQPGESKNVSFELTPYD-ISLLNDRMDRVVEKGEFKVMVG 834


>gi|373460605|ref|ZP_09552356.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
 gi|371955223|gb|EHO73027.1| hypothetical protein HMPREF9944_00620 [Prevotella maculosa OT 289]
          Length = 858

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 166/462 (35%), Positives = 245/462 (53%), Gaps = 61/462 (13%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           +YP+ N +LS  TRA+ L+S LTL+EK   + D + AIPRLGI  + WWSE+LHG A+ G
Sbjct: 30  AYPYQNPNLSALTRAQDLLSRLTLEEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMG 89

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-----NLGQ----AG 161
                     +VT FP+ +  AASFN +L   + SA + E RA Y     N G+      
Sbjct: 90  ----------NVTVFPEPIAMAASFNDALLYKVFSAASDEMRAQYHHRIRNGGEDEKFHS 139

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFRE 220
           L+ W PN+NIFRDPRWGRGQET GEDP + +      V+  QG E+ K         +R+
Sbjct: 140 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTAVMGTAVVRGLQGPEDSK---------YRK 190

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPF 279
                             L AC KH   Y +     ++R++ N   ++ +D  +T+ P F
Sbjct: 191 ------------------LWACAKH---YAVHSGPEYTRHTANLNNVSPRDLWETYLPAF 229

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           ++ +E+ K   +MC+Y  ++  P C    L Q+  R+EWGF+  + SDC AV+ I  +QN
Sbjct: 230 KTLVEEAKVREVMCAYQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGAVSDI--WQN 287

Query: 339 YTKTHEDSAAGVLK---AGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           + KT  D+     K   AG D+ CG     +    A+ +G + EK++D+ +L L   +  
Sbjct: 288 H-KTSSDAVHATAKAALAGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRLLEGRFD 346

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG  + DP    + K+    + +  H++L+LD ARQ IVLL+N +  LPL KN    +A+
Sbjct: 347 LGEMD-DPALVPWSKIPYSVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN-ERIAV 404

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
           IGP  +N+  M G Y G P    ++L G+ A      Y  GC
Sbjct: 405 IGPNADNVPMMWGNYNGTPNRTVTILDGIRAKHKNVKYIKGC 446



 Score = 83.6 bits (205), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 68/271 (25%), Positives = 118/271 (43%), Gaps = 59/271 (21%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
           + E +R  K  + V+ V G+    E E+           DR  + LP  Q   + ++   
Sbjct: 598 YQEVIRALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAA 657

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K+ V +  +G     ++     +   +I+   Y G+ G  A+A+++FGD+NP G+LP+T
Sbjct: 658 GKQLVYVNCSGSA---IALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVT 714

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y  S       + +M+        GR+YR+++   ++ FGHGLSYT ++          
Sbjct: 715 FYKNSNQLPDYENYSMK--------GRTYRYFSDP-LFAFGHGLSYTTFNMG-------- 757

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
             +A +   +D  +                         V+I V N G  DG+  V+L+ 
Sbjct: 758 --TAEIIKKADSIV-------------------------VRIPVENVGSKDGTETVLLYI 790

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           +  +   G P K L GF RV  V  G K ++
Sbjct: 791 KNHQDPNG-PIKSLRGFSRVF-VKAGHKAVA 819


>gi|299141953|ref|ZP_07035087.1| beta-glucosidase [Prevotella oris C735]
 gi|298576415|gb|EFI48287.1| beta-glucosidase [Prevotella oris C735]
          Length = 858

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 166/462 (35%), Positives = 245/462 (53%), Gaps = 61/462 (13%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           +YP+ N +LS  TRA+ L+S LTL+EK   + D + AIPRLGI  + WWSE+LHG A+ G
Sbjct: 30  AYPYQNPNLSALTRAQDLLSRLTLEEKALLMLDESPAIPRLGIKKFFWWSEALHGAANMG 89

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-----NLGQ----AG 161
                     +VT FP+ +  AASFN +L   + SA + E RA Y     N G+      
Sbjct: 90  ----------NVTVFPEPIAMAASFNDALLYKVFSAASDEMRAQYHHRIRNGGEDEKFHS 139

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFRE 220
           L+ W PN+NIFRDPRWGRGQET GEDP + +      V+  QG E+ K         +R+
Sbjct: 140 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTAVMGTAVVRGLQGPEDSK---------YRK 190

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPF 279
                             L AC KH   Y +     ++R++ N   ++ +D  +T+ P F
Sbjct: 191 ------------------LWACAKH---YAVHSGPEYTRHTANLNNVSPRDLWETYLPAF 229

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           ++ +E+ K   +MC+Y  ++  P C    L Q+  R+EWGF+  + SDC AV+ I  +QN
Sbjct: 230 KTLVEEAKVREVMCAYQALDDEPCCGNSRLLQQILRDEWGFQYLVVSDCGAVSDI--WQN 287

Query: 339 YTKTHEDSAAGVLK---AGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
           + KT  D+     K   AG D+ CG     +    A+ +G + EK++D+ +L L   +  
Sbjct: 288 H-KTSSDAVHATAKAALAGTDVECGFNYTYKCIPEAVQRGLISEKEVDKHVLRLLEGRFD 346

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG  + DP    + K+    + +  H++L+LD ARQ IVLL+N +  LPL KN    +A+
Sbjct: 347 LGEMD-DPALVPWSKIPYSVMDSKAHRQLSLDMARQSIVLLQNKQNMLPLKKNN-ERIAV 404

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
           IGP  +N+  M G Y G P    ++L G+ A      Y  GC
Sbjct: 405 IGPNADNVPMMWGNYNGTPNRTVTILDGIRAKHKNVKYIKGC 446



 Score = 83.2 bits (204), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 73/305 (23%), Positives = 132/305 (43%), Gaps = 60/305 (19%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
           + E +R  K  + V+ V G+    E E+           DR  + LP  Q   + ++   
Sbjct: 598 YQEVIRALKGIEKVVFVGGISPALEGEEMPVDIPGFKGGDRTDIELPRVQRDFIKALHAA 657

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K+ V +  +G     ++     +   +I+   Y G+ G  A+A+++FGD+NP G+LP+T
Sbjct: 658 GKQLVYVNCSGSA---IALEPETTACDAIVQAWYAGQEGGTAVADVLFGDYNPSGKLPVT 714

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y  S       + +M+        GR+YR+++   ++ FGHGLSYT ++          
Sbjct: 715 FYKNSNQLPDYENYSMK--------GRTYRYFSDP-LFAFGHGLSYTTFNMG-------- 757

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
             +A +   +D  +                         V+I V N G  DG+  V+L+ 
Sbjct: 758 --TAEIIKKADSIV-------------------------VRIPVENVGSKDGTETVLLYI 790

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRI-LPLGNHVLM 795
           +  +   G P K L GF RV  V  G + ++  V   +     +++   +    GN+ L+
Sbjct: 791 KNHQDPNG-PIKSLRGFSRVF-VKAGHQAVAELVLTRKSFEFFDENTNTVHFKEGNYDLL 848

Query: 796 VGELR 800
            G+ R
Sbjct: 849 YGDKR 853


>gi|423331656|ref|ZP_17309440.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
           CL03T12C09]
 gi|409230226|gb|EKN23094.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
           CL03T12C09]
          Length = 868

 Score =  266 bits (679), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 160/471 (33%), Positives = 227/471 (48%), Gaps = 61/471 (12%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N  L +  R   L+S LT +EKI Q+ +   AI RLGIP Y+WW+E+LHG+A  G 
Sbjct: 26  YPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG- 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
                      T FPQ +  AA+F+ +      + V+ EARA Y+  Q         GLT
Sbjct: 85  ---------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLT 135

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW PNINIFRDPRWGRG ET GEDP +     V   +  QG                   
Sbjct: 136 FWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG------------------- 176

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D  +     AC KH   +   +W   +R+ F+   T +D  +T+ P F + +
Sbjct: 177 --------DDPNYYKTHACAKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++G    +MC+YN+  G P C    L     RN WG+   I SDC A+   +     T  
Sbjct: 226 KEGDVQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPR 285

Query: 343 H------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           H      E ++A  +  G D+ CG    R    A+  GK+ EKD+D +L  L   +  LG
Sbjct: 286 HETHPDAESASADAVLNGTDLECGGS-YRALNKALADGKISEKDLDVSLRRLLKGRFELG 344

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
           +F+ D R   Y K+    V + EH   ALD AR+ IVLLKN    LPL+KN +  +A++G
Sbjct: 345 MFDPDERV-PYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVG 402

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           P   + + +   Y G P    +++ G+    +K   A   +++ CN  A F
Sbjct: 403 PNAADSTMLWANYNGFPTKTVTIVEGIR---NKVPNAEVIYELGCNHTADF 450



 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 139/298 (46%), Gaps = 55/298 (18%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           Y+ +T  A     V   +   F       K AD ++ V G+    E E+           
Sbjct: 574 YMQRTGSADLNFTVGVRTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKG 633

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR ++ +P  Q  +V ++  T K PV+ V+  G  L +++   +  +++IL   Y G+ G
Sbjct: 634 DRTNIEIPAVQKEMVKALVATGK-PVVYVVCTGSALALNWE--NDHVNAILNAWYGGQEG 690

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
             A+A+++FGD+NP GRLP+T+Y +S  ++P   D +M+        GR+YR+ T T +Y
Sbjct: 691 GTAVADVLFGDYNPAGRLPITFY-KSVDQLPDFQDYSMK--------GRTYRYMTQTPLY 741

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT + YK          +A L                     D++ S  S+  
Sbjct: 742 PFGYGLSYTTFDYK----------NAKLSK-------------------DKIASNESVTL 772

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
              I+  N G +DG  V  ++ + P    G P K +  F RV+  A  ++ +S  ++P
Sbjct: 773 SFDIA--NTGKMDGDEVAQIYIKNPNDPAG-PLKAMKAFKRVNVKAGSAQPVSIQLEP 827


>gi|371777646|ref|ZP_09483968.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 865

 Score =  266 bits (679), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 156/449 (34%), Positives = 241/449 (53%), Gaps = 55/449 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           PF N  LS   RAK LIS LT+QEK + L D + AIPRLGI  + WWSE+LHG A+N   
Sbjct: 23  PFQNPDLSSEERAKDLISRLTVQEKARLLCDQSEAIPRLGIKKFNWWSEALHGYANN--- 79

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GLT 163
                   SVT FPQ +  AASFN  L   I +A++ EARA Y+  Q           L+
Sbjct: 80  -------DSVTVFPQPIGMAASFNEELVFEIFNAISDEARAKYHQAQRRGEENRRFLSLS 132

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKR 222
            W PN+NIFRDPRWGRGQET GEDP + S   V+ VK  QG E+ K         +R+  
Sbjct: 133 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAK---------YRK-- 181

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRS 281
                           L AC KH   +   +W   SR+  N   ++ ++  +T+ P F++
Sbjct: 182 ----------------LLACAKHYTVHSGPEW---SRHELNINDVSPREFYETYMPAFKA 222

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT 340
            +++     +MC+Y++++  P C    + Q+  R+EWG++  + +DC A++  +     +
Sbjct: 223 LVQKADVRQVMCAYHRLDDEPCCSNTRILQRILRDEWGYEHMVVADCGAISDFYTTHGIS 282

Query: 341 KTHEDSAAGVLKAGMDINC--GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
            T   +AA  L AG D+ C       +    A++K  + EKDIDR+L+ +   +  LG  
Sbjct: 283 STPVHAAATGLLAGTDLECIWDNYHYKMLPEALEKDLITEKDIDRSLMRVLKGRFDLGEM 342

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           + D     + ++ P  +   +H++LA   A+Q IVLL+N  K LPL+K++++ +A++GP 
Sbjct: 343 D-DNSLVPWAQIPPSVLNCEKHRQLAYKMAQQSIVLLQNKNKVLPLDKSSINKIAVVGPN 401

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYV 487
            ++   + G Y G P    ++L G+++ V
Sbjct: 402 ADDEVVLWGNYNGTPIRTITVLDGIKSKV 430



 Score = 89.0 bits (219), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 80/306 (26%), Positives = 129/306 (42%), Gaps = 61/306 (19%)

Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTS 552
            D  F   ++  +  + VI V GL    E E+           DR ++ LP  Q + + +
Sbjct: 589 EDIDFTALIKKLEGIETVIFVGGLSGFLEGEEMPVSYPGFKGGDRTNIELPSVQRNCLKA 648

Query: 553 VARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGR 612
           +    K  + +  +G     ++         +I+   Y GE+G +A+A+++FGD+NP G+
Sbjct: 649 LKEAGKTVIFVNCSGSA---IALEPETESCDAIIQAWYGGESGGQAIADVLFGDYNPSGK 705

Query: 613 LPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
           LP+T+Y  S       D +M         GR+YR YT   ++ FG GLSYTN+       
Sbjct: 706 LPVTFYRNSDNLGDFEDYSME--------GRTYR-YTNNHLFPFGFGLSYTNFEIG---- 752

Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
                     KA   K+ ++           DE  S       ++I V N G  DG+ +V
Sbjct: 753 ----------KARLSKSTIKA----------DETIS-------IKIPVKNTGKRDGTEIV 785

Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC-------EQLSIANKHGRR 785
            ++ R      G P K L GF R+   A  +++ +  + P         Q  +A   G  
Sbjct: 786 QVYVRKVNDIDG-PLKTLKGFQRIAVPAGKTRQANISLPPSTFEFYDWSQRKMAVTPGEY 844

Query: 786 ILPLGN 791
           IL  GN
Sbjct: 845 ILYYGN 850


>gi|299149090|ref|ZP_07042152.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298513851|gb|EFI37738.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 1049

 Score =  266 bits (679), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 216/747 (28%), Positives = 340/747 (45%), Gaps = 116/747 (15%)

Query: 66   KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS----- 120
            K L+S +T++EKI QLS         G P  E+ S+SL      G  +N +G  +     
Sbjct: 342  KDLLSRMTVEEKIGQLSQYVGRTLLTG-PESEYLSDSLIARGLVGSVLNISGAKTLRDLQ 400

Query: 121  ----------------------SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
                                    T FP  L  + S++ +         A+E+ A     
Sbjct: 401  EKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAAKIAAIESSA----- 455

Query: 159  QAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
             AGL + +AP ++I RD RWGR  E  GED  + S  A   V  FQ   W+++       
Sbjct: 456  -AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNS------ 508

Query: 218  FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                                 + AC KH +AY L + G   R      ++E+   DT+ P
Sbjct: 509  ---------------------VLACAKHWVAYGLPQAG---RDYAPVDMSERTLFDTYLP 544

Query: 278  PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
            PF++CI+ G     M ++N +NG+PA     L +   R +W F G++ SD +AV  +   
Sbjct: 545  PFKACIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVA- 602

Query: 337  QNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q   +  +D+      +G+D++    +  ++ +  I+ GK+  +D+D ++  +  ++  L
Sbjct: 603  QGVAEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYAL 662

Query: 396  GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
            GLF  DP K    +     +   E    ALD A +  VLLKND   LPL KN V S+A++
Sbjct: 663  GLFV-DPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPLAKN-VRSIAVV 720

Query: 456  GPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYV----SKTHYASGCHDVPCNSDAGFHE 509
            GPL +N +++ G +   G      ++L+G++  +    +K  YA GC D      +GF E
Sbjct: 721  GPLADNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKE 779

Query: 510  AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
            AV++A K+D VI V G       E R R  L LPG Q  L+  +  T K PV++VL  G 
Sbjct: 780  AVKLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGR 838

Query: 570  PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
            PL + +   D  +S+IL   + G +   A+A+I+FGD+NP GRL ++ +P    +VP+  
Sbjct: 839  PLSIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTIS-FPRVEGQVPVYY 895

Query: 630  MNMRADSSRQYPGRSYRFYT---GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
               ++      P  S   +       +Y FG+GLSYT +SY   SAP             
Sbjct: 896  NYKKSGRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SAPQ------------ 940

Query: 687  DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                     ++ +Y   + ++        V ++VTN GD DG   V L+      S   P
Sbjct: 941  --------STQKEYTRQETIS--------VSVTVTNTGDRDGEETVQLYVNDKVASVVRP 984

Query: 747  EKQLIGFDRVHTVAKGSKEISFGVDPC 773
             K+L  F ++   A  SK + F + P 
Sbjct: 985  VKELKAFKKIFLKAGESKTVQFDISPL 1011


>gi|255013451|ref|ZP_05285577.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
 gi|410103695|ref|ZP_11298616.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
 gi|409236424|gb|EKN29231.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
          Length = 868

 Score =  265 bits (678), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 160/471 (33%), Positives = 227/471 (48%), Gaps = 61/471 (12%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N  L +  R   L+S LT +EKI Q+ +   AI RLGIP Y+WW+E+LHG+A  G 
Sbjct: 26  YPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVARAG- 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLT 163
                      T FPQ +  AA+F+ +      + V+ EARA Y+  Q         GLT
Sbjct: 85  ---------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRYKGLT 135

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW PNINIFRDPRWGRG ET GEDP +     V   +  QG                   
Sbjct: 136 FWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQG------------------- 176

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D  +     AC KH   +   +W   +R+ F+   T +D  +T+ P F + +
Sbjct: 177 --------DDPNYYKTHACAKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++G    +MC+YN+  G P C    L     RN WG+   I SDC A+   +     T  
Sbjct: 226 KEGDVQEVMCAYNRFEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPR 285

Query: 343 H------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
           H      E ++A  +  G D+ CG    R    A+  GK+ EKD+D +L  L   +  LG
Sbjct: 286 HETHPDAESASADAVLNGTDLECGGS-YRALNKALADGKISEKDLDVSLRRLLKGRFELG 344

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
           +F+ D R   Y K+    V + EH   ALD AR+ IVLLKN    LPL+KN +  +A++G
Sbjct: 345 MFDPDERV-PYSKIPYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDKN-IKKIAVVG 402

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           P   + + +   Y G P    +++ G+    +K   A   +++ CN  A F
Sbjct: 403 PNAADSTMLWANYNGFPTKTVTIVEGIR---NKVPNAEVIYELGCNHTADF 450



 Score =  110 bits (274), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 138/298 (46%), Gaps = 55/298 (18%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           Y+ +T  A     V   +   F       K AD ++ V G+    E E+           
Sbjct: 574 YMQRTGSADLNFTVGVRTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKG 633

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR ++ +P  Q  +V ++  T K PV+ V+  G  L +++   +  +++IL   Y G+ G
Sbjct: 634 DRTNIEIPAVQKEMVKALVATGK-PVVYVVCTGSALALNWE--NDHVNAILNAWYGGQEG 690

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
             A+A+++FGD+NP GRLP+T+Y +S  ++P   D +M+        GR+YR+ T T +Y
Sbjct: 691 GTAVADVLFGDYNPAGRLPITFY-KSVDQLPDFQDYSMK--------GRTYRYMTQTPLY 741

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT + YK          +A L                     D++ S  S+  
Sbjct: 742 PFGYGLSYTTFDYK----------NAKLSK-------------------DKIASNESVTL 772

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
              I+  N G +DG  V  ++ + P    G P K +  F RV+  A   + +S  ++P
Sbjct: 773 SFDIA--NTGKMDGDEVAQIYIKNPNDPAG-PLKAMKAFKRVNVKAGSEQPVSIQLEP 827


>gi|409198206|ref|ZP_11226869.1| beta-glucosidase [Marinilabilia salmonicolor JCM 21150]
          Length = 775

 Score =  265 bits (678), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 230/812 (28%), Positives = 369/812 (45%), Gaps = 129/812 (15%)

Query: 23  VVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLS 82
           +++ V+ F  +    ++     K P + SY   +    I  +  S+++L+TL+EKI Q++
Sbjct: 6   ILIVVLGFMMTLGACSEVGIAGKTPDWKSY---SGDPVIERKVDSVLNLMTLEEKIGQMN 62

Query: 83  DNASAIPRLG----IPAYEWWSESLHGIASNGPGVN--------------------FNGT 118
             +      G      + E+  + + G   N  G +                    F   
Sbjct: 63  QYSGNFATTGEVADTKSGEYLKKGMIGSTFNVFGADHLRMLQEQNLKHSRLKIPMLFAAD 122

Query: 119 V--SSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDP 175
           V     T+FP  L  A S++  L        A EA A      +G+ + +AP I+I RDP
Sbjct: 123 VIHGLETTFPIPLAEACSWDLELMEKSARIAAEEATA------SGVAWNFAPMIDIGRDP 176

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR  E  GED  + +  A   V  FQG                        + +D   
Sbjct: 177 RWGRVMEGAGEDVYLATQVARARVIGFQG----------------------IEDYTDLSQ 214

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
              + A  KH + Y     G   R   +  ++E++  +TF PPF++ +++G AS  M ++
Sbjct: 215 SNTMMATSKHFVGYGAALAG---RDYQSVDMSERELHETFLPPFKATVDEGVAS-FMTAF 270

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N +NGVP      LF++  R+ WGF G + +D  A+  +  +  + K  + +A   + AG
Sbjct: 271 NDLNGVPCTGNQYLFKEILRDRWGFGGMVVTDYTAIMEMVAH-GFAKDLKHAAELAIDAG 329

Query: 355 MDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD 413
           +D++      + H +  +++G V E+ ID A+  +  ++  LGLF+ DP +    +   +
Sbjct: 330 IDMDMISEAFVTHLKELVEEGDVSEEQIDVAVSRILEMKFLLGLFD-DPFRYFDAERQQE 388

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--G 471
            V   EH K A +AA++ IVLLKN+   LPL+KN    +A+IGP V     + G +   G
Sbjct: 389 VVMNPEHLKTAREAAQRSIVLLKNEGNVLPLDKNTSKRVALIGPFVKERESLNGEWAIKG 448

Query: 472 IPCSPKSLLRGLEAYVSKTH----YASG--------------CHDVPCNSDAGFHEAVRI 513
                 +LL GLE     +     YA G                +VP     GF EAV +
Sbjct: 449 DRNKSVTLLEGLEEKYDGSRVEFTYAQGTTLPLIDRSTQKVSVTEVPDRR--GFAEAVNV 506

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           A+ +D ++V  G +     E   R  + LPG Q  L+  + +T K P++LVL  G PLD+
Sbjct: 507 ARNSDVIMVAMGENYHWSGEAASRTDITLPGNQRELLKELKKTGK-PIVLVLFNGRPLDL 565

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMN 631
           S+ E +  + +I+   YPG     A+A+I+ GD+NP  +L MT +P +  ++P+  N  N
Sbjct: 566 SWEEEN--VDAIVEAWYPGMMSGHAVADILSGDYNPSAKLVMT-FPRNVGQIPIFYNMKN 622

Query: 632 MRADSSRQYPG---RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
                  ++P     SY     T ++ FG+GLSYT + Y            A+ K  SDK
Sbjct: 623 TGRPFDAEHPADYRSSYIDSPNTPLFPFGYGLSYTTFEY------------ANAKISSDK 670

Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
               Q+GS L                   + VTN GD+DG  VV L+ R    S   P K
Sbjct: 671 F---QSGSSLT----------------ASVEVTNTGDLDGEEVVQLYLRDRVGSVVRPVK 711

Query: 749 QLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
           +L GF+++H  A  +K + F +D  E L + N
Sbjct: 712 ELKGFEKIHLKAGETKTVEFSIDE-EMLKMYN 742


>gi|281412136|ref|YP_003346215.1| glycoside hydrolase family 3 domain protein [Thermotoga
           naphthophila RKU-10]
 gi|281373239|gb|ADA66801.1| glycoside hydrolase family 3 domain protein [Thermotoga
           naphthophila RKU-10]
          Length = 778

 Score =  265 bits (677), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 222/752 (29%), Positives = 343/752 (45%), Gaps = 143/752 (19%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA     E L G    G            T+FPQ +  A++++  L   + +A+  
Sbjct: 92  RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTTAIRE 139

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + R +      GL   AP +++ RDPRWGR +ET GE P +V+   V +VK  QGE+ K 
Sbjct: 140 DMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGEDIK- 193

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
                          KG            + A  KH   Y   + G N++  +    I E
Sbjct: 194 ---------------KG------------VVATVKHFAGYSASEGGKNWAPTN----IPE 222

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ ++ F  PF + +++     +M SY++++GVP A  R  L    R +WGFKG + SD 
Sbjct: 223 REFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFKGIVVSDY 282

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
            AV  + +Y    +   ++A   L+AG+D+    T   ++ +  ++KG + E  ID A+ 
Sbjct: 283 FAVKVLEDYHRIARDKSEAARLALEAGIDVELPKTECYQYLKDLVEKGIISEALIDEAVA 342

Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
            +  ++  LGLF     + +  K+         HK +ALD AR+ I+LLKND   LPL K
Sbjct: 343 RVLRLKFMLGLFENPYVEVEKAKI-------ESHKDIALDIARKSIILLKNDG-ILPLQK 394

Query: 447 NAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGLEAY 486
           N    +A+IGP    +  + G Y                 G P  P+     L + +E +
Sbjct: 395 N--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEH 452

Query: 487 VSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG------L 526
           +                  YA GC +V     +GF EA+ IAKK+D  IVV G      L
Sbjct: 453 MKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTL 511

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
           D T   E RD  +L LPG Q  LV  VA+T K PV+LVL  G P   S      ++++IL
Sbjct: 512 DCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPY--SLKNVVDKVNAIL 567

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
            +  PGEAG +++ +II+G  NP G+LP++ +P S  ++P+    ++    R +    Y 
Sbjct: 568 QVWLPGEAGGRSIVDIIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHGDYV 625

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
             +   ++ FGHGLSYT + Y  L   P E+  +  +                       
Sbjct: 626 DESTKPLFPFGHGLSYTKFEYSNLRIEPKEVPPAGEVV---------------------- 663

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
                     +++ V N GD DG  VV L+      S   P K+L GF RV   AK  K 
Sbjct: 664 ----------IKVDVENTGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKT 713

Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           + F +   + L+  ++  + ++  G   +MVG
Sbjct: 714 VVFRLH-MDVLAYYDRDMKLVVEPGEFKVMVG 744


>gi|237712573|ref|ZP_04543054.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|345512524|ref|ZP_08792050.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|423239901|ref|ZP_17221016.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
           CL03T12C01]
 gi|229435409|gb|EEO45486.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|229453894|gb|EEO59615.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|392644890|gb|EIY38624.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
           CL03T12C01]
          Length = 788

 Score =  265 bits (677), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 216/736 (29%), Positives = 340/736 (46%), Gaps = 129/736 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L + IG    V
Sbjct: 138 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKELIARIGE---V 182

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA+    LG   +  ++P ++I +DPRWGR  ET GEDP +V     + + S Q  N   
Sbjct: 183 EAKEAVALGYTNI--YSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKHN--- 237

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                        L A  KH   Y +   G   +   +  +  +
Sbjct: 238 -----------------------------LVATPKHFAVYSIPVGGRDGKTRTDPHVAPR 268

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
           +    +  PFR   ++  A  +M SYN  +G P  + G    L +  R EWGFKGY+ SD
Sbjct: 269 EMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 326

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
            +AV  I        T+ED  A  + AG++I          +   + A+  GK+ ++ +D
Sbjct: 327 SEAVEFISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFILPLRKAVADGKISQETLD 386

Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
           + +  +  V+  LGLF+ +P +G  GK     V + EH+ ++L+AARQ +VLLKN+   L
Sbjct: 387 KRVAEILRVKFWLGLFD-NPYRGN-GKQAEQIVHSKEHQAVSLEAARQSLVLLKNEMNLL 444

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCH--- 497
           PL+K ++ S+A+IGP  +  +Q+   Y       K++ +G++  +  T   Y  GC    
Sbjct: 445 PLSK-SLRSIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTEVIYRKGCDIID 503

Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
                    D P  ++      EA+  AK+A+ V++V G +     EDR R SL LPG+Q
Sbjct: 504 PHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQ 563

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             L+ +V  T K PV+LVL  G    +++A A   + +IL   +PGE   +A+AE +FGD
Sbjct: 564 EELLKAVCATGK-PVVLVLLDGRASSINYAAA--HVPAILHAWFPGEFCGQAVAEALFGD 620

Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
           +NPGGRL +T +P+S  ++P        +D S           + T VYG    FGHGLS
Sbjct: 621 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDES-----------SSTSVYGVLYPFGHGLS 668

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT +SY       +L IS  L+ G   +I                        ++   + 
Sbjct: 669 YTTFSY------GDLKIS-PLRQGVQGDI------------------------NISCKIK 697

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G + G  VV L+ R    S  T  K L GF+R+   A   + + F + P + L + +K
Sbjct: 698 NTGKIKGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRLRP-QDLGLWDK 756

Query: 782 HGRRILPLGNHVLMVG 797
           +    +  G   +M+G
Sbjct: 757 NMNFRVEPGKFKVMIG 772


>gi|354580734|ref|ZP_08999639.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353203165|gb|EHB68614.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 766

 Score =  265 bits (677), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 217/746 (29%), Positives = 339/746 (45%), Gaps = 136/746 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  E  HG  + G            T FP  L   +++N  L+ ++  AVA 
Sbjct: 104 RLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTWNPELFRSMCRAVAA 151

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R+     Q G   ++P +++ RDPRWGR +ET GEDP +V+ +AV  V+  QG+   +
Sbjct: 152 ETRS-----QGGAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFAVAAVQGLQGDRLDA 206

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           +D                           L A  KH   Y   + G   R      +  +
Sbjct: 207 EDS--------------------------LLATLKHFAGYGASEGG---RNGAPVHMGLR 237

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDCD 328
           +  +    PFR  +E G A  +M +YN+++GVP    R  L    R  WGF G++ +DC 
Sbjct: 238 ELHEIDLLPFRKAVEAG-AQSVMTAYNEIDGVPCTSSRYLLHDVLREAWGFDGFVITDCG 296

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLN 387
           A+  +    N   + E++AA  L AG+D+     M R + + A+++G + E D++ A+  
Sbjct: 297 AIDMLKSGHNTAASGEEAAAQALTAGVDMEMSGSMFRVYLRQALEQGHITEDDLNTAVGR 356

Query: 388 LFSVQLRLGLFN---GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
           + +++ RLGLF+    DP + +        +   EH +LA   A +GIVLLKN+   LPL
Sbjct: 357 VLAMKFRLGLFDRPYTDPERAE------KVIGCEEHIELARRVAAEGIVLLKNEGNVLPL 410

Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK---SLLRGLEAYV-----SKTHYASGC 496
           N      +A+IGP  N      G YT  P  P    ++L G+  ++     ++  YA GC
Sbjct: 411 NPK-TGKIAVIGPNANAPYNQLGDYTS-PQPPGQIITVLEGIRRHIGEDADTRVLYAPGC 468

Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAG-----------LDLTQET------------- 532
             +  +S  G   A+  A +AD +++  G           +DL                 
Sbjct: 469 R-IQGDSREGLSHALACAAEADVIVMAIGGSSARDFGEGTIDLRTGASVVTGLAQSDMEC 527

Query: 533 -EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
            E  DR +L L G Q+ L+  + +  K PV++V   G P+   +   D  I +IL   YP
Sbjct: 528 GEGIDRSTLHLMGVQLELLQEIHKLGK-PVVVVYINGRPITEPW--IDEHIPAILEAWYP 584

Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGT 651
           G+ G  A+A+I+FGD NP GRL +T  P+   ++P+N       ++++  G+ Y      
Sbjct: 585 GQEGGSAIADILFGDVNPSGRLTLT-IPKEVGQLPIN------YNAKRTRGKRYLETDLE 637

Query: 652 QVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTS 711
             Y FG+GLSYT++ Y  LS               +  ++   GS               
Sbjct: 638 PRYPFGYGLSYTDFHYGNLSV--------------EPAVIPADGSAA------------- 670

Query: 712 LRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
               V+I VTN G  DG+ VV L+      S   PEK L  F +V   A  S+E++F V 
Sbjct: 671 ----VRIVVTNTGPRDGAEVVQLYVSDLAASVTRPEKALKAFSKVFLKAGESREVTFTVG 726

Query: 772 PCEQLSIANKHGRRILPLGNHVLMVG 797
           P EQL +     + ++  G   + VG
Sbjct: 727 P-EQLELIGPDMKAVVEPGEFRIRVG 751


>gi|409197445|ref|ZP_11226108.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
           21150]
          Length = 737

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 210/747 (28%), Positives = 340/747 (45%), Gaps = 118/747 (15%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           SYPF N  L + TR   L+S +TL+EK+  LS + S +PRLGI       E  HG+A  G
Sbjct: 39  SYPFQNADLDMETRVDDLLSRMTLEEKVSALSTDPS-VPRLGIKGAPH-IEGYHGVAMGG 96

Query: 111 PG--VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---LGQAGLTFW 165
           P             T FPQ     A++N  L    G   ++EAR ++    + + GL   
Sbjct: 97  PANWAPKGDERVPTTQFPQAYGMGATWNPELIRKAGEIESIEARYIFQNPEISKGGLVVR 156

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN---WKSDDGGIGFGFREKR 222
           APN ++ RDPRWGR +E  GEDP +V   +  F K  QG++   W++             
Sbjct: 157 APNADLGRDPRWGRTEEVLGEDPFLVGTLSTAFTKGLQGDDEKYWRT------------- 203

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSC 282
                            ++  KH +A   E   + S  +F+  +  +     +   FR  
Sbjct: 204 -----------------ASLLKHFLANSNENTRDSSSSNFDTQLFYE----YYGATFRRA 242

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           I +G ++  M +YN VNGVPA +     + +   WG  G I +D      +         
Sbjct: 243 ILEGGSNAYMTAYNAVNGVPAHIHPMHKEISMARWGVNGIICTDGGGYTLLVRAHKAYDD 302

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
           +  +A GV+KAG++             A+  G + E+D+D  L  ++ V ++LG    DP
Sbjct: 303 YYRAAEGVIKAGLNQFLDNYR-EGVWGALAHGYLAEEDLDEVLKGVYRVMIKLGQL--DP 359

Query: 403 R-KGKYGKLG----PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           + K  Y  +G    P    + EH++ AL  AR+ +VLLKN+K+ LPL  + +  +A+IG 
Sbjct: 360 QDKVPYASIGRDGKPAPWTSPEHQEAALQMARESVVLLKNEKQTLPLAGDELGKVAVIGH 419

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKA 517
           L + I  +   Y+G+P    + L G++  +       G   V    D  ++ AV  A +A
Sbjct: 420 LADTI--LLDWYSGMPPFMSTPLDGIKEKM-------GADKVLFAPDNDYNAAVEAASQA 470

Query: 518 DFVIVVAG-------------LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           D  IVV G              D     E  DR +L L  + ++     A  +    ILV
Sbjct: 471 DVAIVVLGNHPYCDSERWGDCPDPGMGREAVDRKTLRLTDEWLAQRVFEANPN---TILV 527

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
           L    P  +++++ +  + +I+ I + G++   ALA+++FGD+NPGG+L  TW P+S  +
Sbjct: 528 LQSSFPYGINWSQEN--LPAIVHITHNGQSTGTALADVLFGDYNPGGKLTQTW-PKSEEQ 584

Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
           +P M + ++R        G +Y ++ G  +Y FG GLSYT++ +       ++ I+ S  
Sbjct: 585 LPDMMEYDIRK-------GHTYMYFNGEPLYPFGFGLSYTSFEW------VDMEITGSSV 631

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
             +++ ++                        V + + N G V G  V+ L+A  P+ S 
Sbjct: 632 KSNEEEVI------------------------VTVKLKNVGQVKGDEVIQLYASFPETSS 667

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGV 770
             P+K L GF RV      SK +   V
Sbjct: 668 RRPDKALKGFKRVTLEPGESKNVQIPV 694


>gi|237718444|ref|ZP_04548925.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
 gi|229452377|gb|EEO58168.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
          Length = 746

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 215/747 (28%), Positives = 339/747 (45%), Gaps = 116/747 (15%)

Query: 66  KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS----- 120
           K L+S +T++EKI QLS         G P  E+ S+SL      G  +N +G  +     
Sbjct: 39  KDLLSRMTVEEKIGQLSQYVGRTLLTG-PESEYLSDSLIARGLVGSVLNISGAKTLRDLQ 97

Query: 121 ----------------------SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
                                   T FP  L  + S++ +         A+E+ A     
Sbjct: 98  EKNMRYSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAAKIAAIESSA----- 152

Query: 159 QAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
            AGL + +AP ++I RD RWGR  E  GED  + S  A   V  FQ   W+++       
Sbjct: 153 -AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNS------ 205

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                                + AC KH +AY L + G   R      ++E+   DT+ P
Sbjct: 206 ---------------------VLACAKHWVAYGLPQAG---RDYAPVDMSERTLFDTYLP 241

Query: 278 PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
           PF++CI+ G     M ++N +NG+PA     L +   R +W F G++ SD +AV  +   
Sbjct: 242 PFKACIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVA- 299

Query: 337 QNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
           Q   +  +D+      +G+D++    +  ++ +  I+ GK+  +D+D ++  +  ++  L
Sbjct: 300 QGVAEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYAL 359

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           GLF  DP K    +     +   E    ALD A +  VLLKND   LPL KN V S+A++
Sbjct: 360 GLFV-DPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPLAKN-VRSIAVV 417

Query: 456 GPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYV----SKTHYASGCHDVPCNSDAGFHE 509
           GPL +N +++ G +   G      ++L+G++  +    +K  YA GC D      +GF E
Sbjct: 418 GPLADNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKE 476

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
           AV++A K+D VI V G       E R R  L LPG Q  L+  +  T K PV++VL  G 
Sbjct: 477 AVKLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGR 535

Query: 570 PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
           PL + +   D  +S+IL   + G +   A+A+I+FGD+NP GRL ++ +P    +VP+  
Sbjct: 536 PLSIEWV--DKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTIS-FPRVEGQVPVYY 592

Query: 630 MNMRADSSRQYPGRSYRFYT---GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
              ++      P  S   +       +Y FG+GLSYT +SY   S P             
Sbjct: 593 NYKKSGRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SVPQ------------ 637

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                    ++ +Y   + ++        V ++VTN GD DG   V L+      S   P
Sbjct: 638 --------STQKEYTRQETIS--------VSVTVTNTGDRDGEETVQLYVNDKVASVVRP 681

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPC 773
            K+L  F ++   A  SK + F + P 
Sbjct: 682 VKELKAFKKIFLKAGESKTVQFDISPL 708


>gi|15642851|ref|NP_227892.1| xylosidase [Thermotoga maritima MSB8]
 gi|418046013|ref|ZP_12684107.1| Beta-glucosidase [Thermotoga maritima MSB8]
 gi|4980564|gb|AAD35170.1|AE001694_6 xylosidase [Thermotoga maritima MSB8]
 gi|351675566|gb|EHA58726.1| Beta-glucosidase [Thermotoga maritima MSB8]
          Length = 778

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 222/752 (29%), Positives = 343/752 (45%), Gaps = 143/752 (19%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA     E L G    G            T+FPQ +  A++++  L   + +AV  
Sbjct: 92  RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTTAVRE 139

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + R +      GL   AP +++ RDPRWGR +ET GE P +V+   V +VK  QGE+ K 
Sbjct: 140 DMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGEDIK- 193

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
                          KG            + A  KH   Y   + G N++  +    I E
Sbjct: 194 ---------------KG------------VVATVKHFAGYSASEGGKNWAPTN----IPE 222

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ ++ F  PF + +++     +M SY++++GVP A  R  L    R +WGF+G + SD 
Sbjct: 223 REFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFEGIVVSDY 282

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
            AV  + +Y    +   ++A   L+AG+D+    T   ++ +  ++KG + E  ID A+ 
Sbjct: 283 FAVKVLEDYHRIARDKSEAARLALEAGIDVELPKTECYQYLKDLVEKGIISEALIDEAVT 342

Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
            +  ++  LGLF     + +  K+         H+ +AL+ AR+ I+LLKND   LPL K
Sbjct: 343 RVLRLKFMLGLFENPYVEVEKAKI-------ESHRDIALEIARKSIILLKNDG-ILPLQK 394

Query: 447 NAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGLEAY 486
           N    +A+IGP    +  + G Y                 G P  P+     L + +E +
Sbjct: 395 N--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEH 452

Query: 487 VSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG------L 526
           +                  YA GC +V     +GF EA+ IAKK+D  IVV G      L
Sbjct: 453 MKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTL 511

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
           D T   E RD  +L LPG Q  LV  VA+T K PV+LVL  G P   S      ++++IL
Sbjct: 512 DCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPY--SLKNVVDKVNAIL 567

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
            +  PGEAG +A+ +II+G  NP G+LP++ +P S  ++P+    ++    R +    Y 
Sbjct: 568 QVWLPGEAGGRAIVDIIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHGDYV 625

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
             +   ++ FGHGLSYT + Y  L   P E+  +  +                       
Sbjct: 626 DESTKPLFPFGHGLSYTKFEYSNLRIEPKEVPPAGEVV---------------------- 663

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
                     +++ V N GD DG  VV L+      S   P K+L GF RV   AK  K 
Sbjct: 664 ----------IKVDVENIGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKT 713

Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           + F +   + L+  N+  + ++  G   +MVG
Sbjct: 714 VVFRLH-MDVLAYYNRDMKLVVEPGEFKVMVG 744


>gi|316980598|dbj|BAJ51947.1| putative beta-D-xylosidase [Glycyrrhiza uralensis]
          Length = 285

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 123/285 (43%), Positives = 191/285 (67%), Gaps = 9/285 (3%)

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           GLD + E E RDRV LLLPG Q  LV+ VAR ++ PVILVL  GGP+DVSFA+ D +IS+
Sbjct: 2   GLDQSIEAEFRDRVGLLLPGHQQELVSRVARVARGPVILVLMSGGPIDVSFAKNDPKISA 61

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF-TKVPMNDMNMRADSSRQYPGR 643
           ILW+GYPG+AG  A+A++IFG  NPGGRLPMTWYP+++  KVPM +M+MR + +  YPGR
Sbjct: 62  ILWVGYPGQAGGTAIADVIFGTTNPGGRLPMTWYPQNYLAKVPMTNMDMRPNPATGYPGR 121

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVH 702
           +YRFY G  V+ FGHGLSYT +++    AP ++++  A+L+A ++  +      R+ +  
Sbjct: 122 TYRFYKGPVVFPFGHGLSYTRFTHSLAIAPKQVSVPFATLQAFTNSTVSTSKAVRVSH-- 179

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
                +C ++     + V N G +DG++ +++F++ P   + +  KQL+ F + +  A  
Sbjct: 180 ----ANCDAMEVGFHVDVKNEGSMDGTNTLLVFSK-PPPGKWSATKQLVSFHKTYVPAGS 234

Query: 763 SKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
            + +  GV  C+ LS+ ++ G R +P+G H L +G+L+HS++++T
Sbjct: 235 KQRVKVGVHVCKHLSVVDEFGIRRIPMGEHELQIGDLKHSISVQT 279


>gi|255545654|ref|XP_002513887.1| hypothetical protein RCOM_1034150 [Ricinus communis]
 gi|223546973|gb|EEF48470.1| hypothetical protein RCOM_1034150 [Ricinus communis]
          Length = 246

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 139/270 (51%), Positives = 173/270 (64%), Gaps = 33/270 (12%)

Query: 8   IQLRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPC--KPPHFDSYPFCNTSLSISTRA 65
           ++LR+  L  +  L   + +  F++S P   +P F C    P   SY FC TSL IS RA
Sbjct: 1   MKLRKISLLTLFHLTSFL-IFIFADSAP---QPPFSCDYSNPSTKSYLFCQTSLPISQRA 56

Query: 66  KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIA-----SNGPGVNFNGTVS 120
           K L+S LTL EKI QL D A +IPRLG+PAY WWSE+LHG+A     S   G++FNGT+ 
Sbjct: 57  KDLVSRLTLDEKISQLVDTAPSIPRLGVPAYRWWSEALHGVAFLPDVSERQGIHFNGTIR 116

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-GLTFWAPNINIFRDPRWGR 179
           S TSFPQV+++AASF+  LW  IG     EARA+YN GQA G+TFWAPNINIFRDPRWGR
Sbjct: 117 SATSFPQVIITAASFDTKLWYRIGQVTGKEARAIYNAGQAIGMTFWAPNINIFRDPRWGR 176

Query: 180 GQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELML 239
           GQETPGEDPMV   YAV FV+  QG+   S +GG+                   G++L  
Sbjct: 177 GQETPGEDPMVTGKYAVSFVRGVQGD---SFEGGV------------------LGEKLQA 215

Query: 240 SACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
           SACCKH  AYDLEKW   +R+ FNA ++ +
Sbjct: 216 SACCKHFTAYDLEKWNGTNRFIFNAQVSAR 245


>gi|423215778|ref|ZP_17202304.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
            CL03T12C04]
 gi|392691421|gb|EIY84666.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
            CL03T12C04]
          Length = 1049

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 215/747 (28%), Positives = 339/747 (45%), Gaps = 116/747 (15%)

Query: 66   KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS----- 120
            K L+S +T++EKI QLS         G P  E+ S+SL      G  +N +G  +     
Sbjct: 342  KDLLSRMTVEEKIGQLSQYVGRTLLTG-PESEYLSDSLIARGLVGSVLNISGAKTLRDLQ 400

Query: 121  ----------------------SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
                                    T FP  L  + S++ +         A+E+ A     
Sbjct: 401  EKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAAKIAAIESSA----- 455

Query: 159  QAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
             AGL + +AP ++I RD RWGR  E  GED  + S  A   V  FQ   W+++       
Sbjct: 456  -AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNS------ 508

Query: 218  FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                                 + AC KH +AY L + G   R      ++E+   DT+ P
Sbjct: 509  ---------------------VLACAKHWVAYGLPQAG---RDYAPVDMSERTLFDTYLP 544

Query: 278  PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
            PF++CI+ G     M ++N +NG+PA     L +   R +W F G++ SD +AV  +   
Sbjct: 545  PFKACIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVA- 602

Query: 337  QNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q   +  +D+      +G+D++    +  ++ +  I+ GK+  +D+D ++  +  ++  L
Sbjct: 603  QGVAEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYAL 662

Query: 396  GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
            GLF  DP K    +     +   E    ALD A +  VLLKND   LPL KN V S+A++
Sbjct: 663  GLFV-DPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPLAKN-VRSIAVV 720

Query: 456  GPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYV----SKTHYASGCHDVPCNSDAGFHE 509
            GPL +N +++ G +   G      ++L+G++  +    +K  YA GC D      +GF E
Sbjct: 721  GPLADNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKE 779

Query: 510  AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
            AV++A K+D VI V G       E R R  L LPG Q  L+  +  T K PV++VL  G 
Sbjct: 780  AVKLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGR 838

Query: 570  PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
            PL + +   D  +S+IL   + G +   A+A+I+FGD+NP GRL ++ +P    +VP+  
Sbjct: 839  PLSIEWV--DKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTIS-FPRVEGQVPVYY 895

Query: 630  MNMRADSSRQYPGRSYRFYT---GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
               ++      P  S   +       +Y FG+GLSYT +SY   S P             
Sbjct: 896  NYKKSGRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SVPQ------------ 940

Query: 687  DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                     ++ +Y   + ++        V ++VTN GD DG   V L+      S   P
Sbjct: 941  --------STQKEYTRQETIS--------VSVTVTNTGDRDGEETVQLYVNDKVASVVRP 984

Query: 747  EKQLIGFDRVHTVAKGSKEISFGVDPC 773
             K+L  F ++   A  SK + F + P 
Sbjct: 985  VKELKAFKKIFLKAGESKTVQFDISPL 1011


>gi|423229063|ref|ZP_17215468.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
           CL02T00C15]
 gi|423244903|ref|ZP_17225977.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
           CL02T12C06]
 gi|392634816|gb|EIY28728.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
           CL02T00C15]
 gi|392640944|gb|EIY34735.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
           CL02T12C06]
          Length = 788

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 215/736 (29%), Positives = 338/736 (45%), Gaps = 129/736 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L + IG   A 
Sbjct: 138 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKELIARIGEVEAK 185

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A+          ++P ++I +DPRWGR  ET GEDP +V     + + S Q  N   
Sbjct: 186 EAVAL-----EYTNIYSPILDIAQDPRWGRCVETYGEDPYLVGELGKQMITSLQKHN--- 237

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                        L A  KH   Y +   G   +   +  +  +
Sbjct: 238 -----------------------------LVATPKHFAVYSIPVGGRDGKTRTDPHVAPR 268

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
           +    +  PFR   ++  A  +M SYN  +G P  + G    L +  R EWGFKGY+ SD
Sbjct: 269 EMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 326

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
            +AV  I        T+ED  A  + AG++I          +   + A+  GK+ ++ +D
Sbjct: 327 SEAVEFISSKHKVANTYEDGIAQAVNAGLNIRTHFTPPADFILPLRKAVADGKISQETLD 386

Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
           + +  +  V+  LGLF+ +P +G  GK     V + EH+ ++L+AARQ +VLLKN+   L
Sbjct: 387 KRVAEILRVKFWLGLFD-NPYRGN-GKQAEQIVHSKEHQAVSLEAARQSLVLLKNEMNLL 444

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCH--- 497
           PL+K ++ S+A+IGP  +  +Q+   Y       K++ +G++  +  T   Y  GC    
Sbjct: 445 PLSK-SLRSIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTEVIYRKGCDIID 503

Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
                    D P  ++      EA+  AK+A+ V++V G +     EDR R SL LPG+Q
Sbjct: 504 PHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVREDRSRTSLNLPGRQ 563

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             L+ +V  T K PV+LVL  G    +++A A   + +IL   +PGE   +A+AE +FGD
Sbjct: 564 EELLKAVCATGK-PVVLVLLDGRASSINYAAA--HVPAILHAWFPGEFCGQAVAEALFGD 620

Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
           +NPGGRL +T +P+S  ++P        +D S           + T VYG    FGHGLS
Sbjct: 621 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDES-----------SSTSVYGVLYPFGHGLS 668

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT +SY       +L IS  L+ G   +I                        ++   + 
Sbjct: 669 YTTFSY------GDLKIS-PLRQGVQGDI------------------------NISCKIK 697

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G + G  VV L+ R    S  T  K L GF+R+   A   + + F + P + L + +K
Sbjct: 698 NTGKIKGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRLRP-QDLGLWDK 756

Query: 782 HGRRILPLGNHVLMVG 797
           +    +  G   +M+G
Sbjct: 757 NMNFRVEPGKFKVMIG 772


>gi|423342048|ref|ZP_17319763.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409219455|gb|EKN12417.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 868

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 157/473 (33%), Positives = 227/473 (47%), Gaps = 61/473 (12%)

Query: 50  DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
           + YPF N  L I  R   L+  LT +EK+ Q+ +   AI RLGIP Y+WW+E+LHG+A  
Sbjct: 24  EDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARA 83

Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------G 161
           G            T FPQ +  AA+F+        + V+ EARA Y+  Q         G
Sbjct: 84  G----------KATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKG 133

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LTFW PNINIFRDPRWGRG ET GEDP +     V  VK  QG++ K             
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKY------------ 181

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
                              AC KH   +   +W   +R+ F+  +T +D   T+ P F +
Sbjct: 182 ---------------FKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEA 223

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYT 340
            +++G    +MC+YN+  G P C    L     RN WG++  I SDC A+   +E    T
Sbjct: 224 LVKEGNVQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWERDERT 283

Query: 341 KTH------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
             H      E ++A  +  G D+ CG    R    A+  GK+ E D+D +L  L   +  
Sbjct: 284 PRHETHPDAESASADAVLNGTDLECGNS-YRALVKALKDGKISENDLDVSLRRLLKGRFE 342

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG+F+ D R   Y ++  + V + EH   AL+ A + +VLLKN    LPL+K  +  +A+
Sbjct: 343 LGMFDPDERV-PYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPLSK-TIRKIAV 400

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           +GP   + + +   Y G P    ++L G+   V  T      +++ CN  A F
Sbjct: 401 VGPNAADSTMLWANYNGFPTHTVTILEGIRNKVPDTEV---IYELGCNHAADF 450



 Score =  117 bits (294), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 84/269 (31%), Positives = 134/269 (49%), Gaps = 55/269 (20%)

Query: 515 KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           K AD ++ V G+    E E+           DR ++ LP  Q  +V ++  T K PV+ V
Sbjct: 603 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMVKALKATGK-PVVYV 661

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
           L  G  L +++ EA+  I +IL   Y G+    A+A+I+FGD+NP GRLP+T+Y +S  +
Sbjct: 662 LCTGSALALNWEEAN--IDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTFY-KSIDQ 718

Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
           +P   D +M+        GR+YR+ T T +Y FG+GLSYTN++Y+          +A L 
Sbjct: 719 LPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR----------NAKLS 760

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
           +G               +  D+  + T         + N G +DG  +  ++ + P   +
Sbjct: 761 SGK--------------IAKDQSVTLT-------FDIANTGKMDGDEIAQIYIKNPNDPE 799

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
           G P K L  F RVH  A  S+E++  + P
Sbjct: 800 G-PIKALKAFLRVHVKAGDSQEVNIELAP 827


>gi|383114360|ref|ZP_09935124.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
 gi|313693934|gb|EFS30769.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
          Length = 863

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 177/551 (32%), Positives = 264/551 (47%), Gaps = 79/551 (14%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YP+ +T L++  RA  L+  LTL+EK+  + +N+ AIPRLGI  YEWW+E+LHG+A  G 
Sbjct: 26  YPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAG- 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----AGLT 163
                      T FPQ +  AASFN  L   +  AV+ EARA        GQ     GLT
Sbjct: 85  ---------LATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PN+NIFRDPRWGRGQET GEDP +     +  V+  QG     +D            
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQG----PEDA----------- 180

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                 E D+     L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F+  
Sbjct: 181 ------EYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKEL 226

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++     +MC+YN+  G P C    L  Q  RN+WGFKG + +DC A+   F+ + + +
Sbjct: 227 VQKAGVKEVMCAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH-E 285

Query: 342 THEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH D+A   A  + +G D+ CG    +    A+ KG + E+ I+ ++  L   +  LG  
Sbjct: 286 THPDAAHASADAVLSGTDLECGG-NFKSITDAVKKGLISEEKINTSVKRLLKARFELGEM 344

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           N       +  +    +   +HK+LAL  A + +VLL+N+   LPLN+     +A+IGP 
Sbjct: 345 NS---THPWSNIPFSVIDCPKHKELALKMAHESLVLLQNNNNILPLNRQM--KVAVIGPN 399

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA--------------YVSKTHYASGCHDVPCNSD 504
            N+     G Y G P    +LL G+ A              Y + T   S  +    + +
Sbjct: 400 ANDSVMQWGNYNGFPSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLFNQCSIDGE 459

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP---- 560
           AGF+      ++    I       T      +  ++  PG  +   T++ R++ RP    
Sbjct: 460 AGFNATYWNNREYKGKIAATDRLTTPFHFSAEGSTVFAPGVGLKNFTAIYRSTFRPTDSG 519

Query: 561 --VILVLTGGG 569
                V+T GG
Sbjct: 520 AATFRVMTNGG 530



 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 141/315 (44%), Gaps = 63/315 (20%)

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
            E +   + AD VI   G+    E E            DR  + LP  Q  ++  + +  
Sbjct: 591 REILNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNG 650

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           K+ V +  +G     ++         +IL   YPG+AG  A+A+++FGD+NP GRLP+T+
Sbjct: 651 KKTVFVNFSGSA---MAIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITF 707

Query: 618 YPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           Y +S  ++P   D +M+        GR+YRF T T +Y FG+GLSYT +SY       + 
Sbjct: 708 Y-KSMQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY------GKA 752

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
           T++ S     +K IL                          I V+N G  DG  VV ++ 
Sbjct: 753 TLNQSKLTKGEKAIL-------------------------TIPVSNVGQRDGEEVVQVYI 787

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLSIANKHGRRILPL-GNH 792
             P   +G P+K L GF RV ++AKG     +I    D  E    A      I PL G +
Sbjct: 788 CRPDDKEG-PQKTLRGFQRV-SIAKGKTQNVQIELPYDSFEWFDAATN---TIRPLNGTY 842

Query: 793 VLMVGELRHSLTIET 807
            ++ G   +   ++T
Sbjct: 843 KILYGNSSNEKDLQT 857


>gi|410097652|ref|ZP_11292633.1| hypothetical protein HMPREF1076_01811 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409223742|gb|EKN16677.1| hypothetical protein HMPREF1076_01811 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 780

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 228/756 (30%), Positives = 363/756 (48%), Gaps = 144/756 (19%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  E  HG  + G            T FP  L  A+++N  L   +G A+A+
Sbjct: 125 RLGIPVL-FAEECPHGHMAIG-----------TTVFPTSLSQASTWNAELMHRMGEAIAL 172

Query: 150 EARAM-YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR+   N+G      + P ++I R+PRW R +ET GEDP++ +   V F+K  QG++  
Sbjct: 173 EARSQGANIG------YGPVLDIAREPRWSRMEETFGEDPVLTTHLGVAFMKGMQGKS-- 224

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMIT 267
                                   + D   L +  KH  AY + + G N +R    A + 
Sbjct: 225 ------------------------QNDGKHLYSTLKHFAAYGIPEAGHNGAR----ANVG 256

Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSD 326
            +     + PPF+  +E+G A+ IM SYN ++GVP      L     R++WGFKG++ SD
Sbjct: 257 MRQLFSDYLPPFKKAVEEGVAT-IMTSYNTIDGVPCTSNKYLLTDVLRDQWGFKGFVYSD 315

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRAL 385
             ++  I   +   K ++++A   LKAG+D++ G     ++ Q A+++G +   D++RA+
Sbjct: 316 LTSIEGIVGAR-VAKDNKEAAVLALKAGLDMDLGGNAYGKNLQKALEEGAITMDDLNRAV 374

Query: 386 LNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
            N+  ++ R+GLF      P + K        V +  HK+LA + AR+GIVLLKN+   L
Sbjct: 375 ANVLRLKFRMGLFENPYVSPEQAK------QVVRSKAHKELAREVAREGIVLLKNEG-VL 427

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVS---KTHYASGC- 496
           PL KN + ++A+IGP  + +    G YT      +  ++L G+   VS   K +Y  GC 
Sbjct: 428 PLKKN-IGNIAVIGPNADMMYNQLGDYTAPQEREEIVTVLDGIRKAVSPSTKVNYVKGCA 486

Query: 497 -HDVPCNSDAGFHEAVRI----------AKKADF---VIVVAGLDLTQET---------- 532
             D+  ++     EA R           +   DF    I     D++ +           
Sbjct: 487 IRDITTSNITAAVEAARAADAVVLVVGGSSARDFKTKYIGTGAADVSNDGNQLLSDMDCG 546

Query: 533 EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
           E  DR +L L G Q  L+ +VA T K P++++   G  L+++ A   +Q     W  YPG
Sbjct: 547 EGYDRSTLRLLGDQEKLLKAVAATGK-PLVVIYIQGRTLNMNLASEKAQALLTAW--YPG 603

Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ 652
           E G  A+A+++FGD+NP GRLP++  P S  ++P+           Q   R+Y    GT 
Sbjct: 604 EQGGTAIADVLFGDYNPAGRLPVS-VPRSEGQLPL--------FYSQGKQRAYVEEEGTP 654

Query: 653 VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL 712
           +Y FG+GLSYT + Y  L           ++ G+ K++LQ               SCT  
Sbjct: 655 LYAFGYGLSYTKFDYSQL----------EMQKGNGKDVLQ-------------TVSCT-- 689

Query: 713 RFHVQISVTNAGDVDGSHVVMLFA--RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
                  VTN GD DG  VV L+   +V  VSQ +P   L  F+R+      SK+++F +
Sbjct: 690 -------VTNTGDCDGEEVVQLYICDKVASVSQ-SP-ILLKAFERISLKKGESKKVTFTL 740

Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
              E+LS+ N   ++++  G+  +MVG     + +E
Sbjct: 741 GE-EELSLYNMEMKQVVEPGDFKVMVGAASDDIRLE 775


>gi|167521708|ref|XP_001745192.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776150|gb|EDQ89770.1| predicted protein [Monosiga brevicollis MX1]
          Length = 614

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 184/610 (30%), Positives = 281/610 (46%), Gaps = 76/610 (12%)

Query: 71  LLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLV 130
           LLT +E           + R+G+P Y+W   ++HG+ S+    + +GTV   TSFP  + 
Sbjct: 55  LLTAREGGGGSPGPPGNVSRIGLPEYDWGMNAIHGVQSSCIKDD-DGTVYCPTSFPNPVN 113

Query: 131 SAASFNRSLWSNIGSAVAVEARAMYNLG-----------QAGLTFWAPNINIFRDPRWGR 179
              ++N S +  +G  + VE RA++  G             GL  W+PNINI R P WGR
Sbjct: 114 YGFTWNYSAYLELGRIIGVETRALWLAGAVEASTWSGRPHIGLDTWSPNINIARSPLWGR 173

Query: 180 GQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELML 239
            QE PGEDP +   +   +    QG+                             D+  L
Sbjct: 174 NQEVPGEDPFMNGQFGKAYTLGLQGD-----------------------------DDTYL 204

Query: 240 SACC--KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQ 297
            A    KH  AY LE     +R++FNA+++     DT+ P FR  + +GKA  +MCSYN 
Sbjct: 205 QAIVTLKHWDAYSLEDSDGATRHNFNAIVSNFSLMDTYWPAFRVAVTEGKAKGVMCSYNA 264

Query: 298 VNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM-D 356
           VNG+P C    L    R+ W F GY++SD  AV  I +   YT +   +A   ++ G  D
Sbjct: 265 VNGIPTCAHPLLRTVLRDLWKFDGYVSSDTGAVEDISDNHKYTPSWATAACAAIRDGQTD 324

Query: 357 INCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK-YGKLGPDDV 415
           I+ G   ++     + +G  + +D+D AL N   ++  LGLF  DP + + Y  +    V
Sbjct: 325 IDSGAVYMKSLLQGVSEGHCRMEDVDNALRNTLRLRFELGLF--DPVENQSYWHVPLAAV 382

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC- 474
            T+  +   +    + +VLL+N    LPL  N  + +A+IGP       M G Y G  C 
Sbjct: 383 NTNASRATNMLHTLESMVLLQNKNNVLPLASN--TKVALIGPHAKAQEDMVGNYLGQLCP 440

Query: 475 --------SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
                   SP   L  +    + T YA G +   C S +   EAV +A  AD  +++ G+
Sbjct: 441 DNNFDCVVSPHDALVSILGTDAVT-YAPGTNVTTC-SQSHIDEAVSVATAADVAVLMLGI 498

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
           D + E E  DR S+ LP  Q  L +++    K  VI++L GG    ++      Q  +I+
Sbjct: 499 DESIEAESNDRKSIDLPECQHQLASAIFAVGKPTVIVLLNGGM---LAIENEKQQADAII 555

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
             GYPG  G  A+A+ + G     G        +    + M+DM M +      PGR+YR
Sbjct: 556 EAGYPGFYGGTAIAQTLTGQNEHLG--------DYINWINMSDMEMTSG-----PGRTYR 602

Query: 647 FYTGTQVYGF 656
           +Y    ++ F
Sbjct: 603 YYKNETLWAF 612


>gi|293371439|ref|ZP_06617870.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            ovatus SD CMC 3f]
 gi|292633636|gb|EFF52194.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            ovatus SD CMC 3f]
          Length = 1049

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 215/747 (28%), Positives = 339/747 (45%), Gaps = 116/747 (15%)

Query: 66   KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS----- 120
            K L+S +T++EKI QLS         G P  E+ S+SL      G  +N +G  +     
Sbjct: 342  KDLLSRMTVEEKIGQLSQYVGRTLLTG-PESEYLSDSLIARGLVGSVLNISGAKTLRDLQ 400

Query: 121  ----------------------SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
                                    T FP  L  + S++ +         A+E+ A     
Sbjct: 401  EKNMRYSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAAKIAAIESSA----- 455

Query: 159  QAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
             AGL + +AP ++I RD RWGR  E  GED  + S  A   V  FQ   W+++       
Sbjct: 456  -AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNS------ 508

Query: 218  FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                                 + AC KH +AY L + G   R      ++E+   DT+ P
Sbjct: 509  ---------------------VLACAKHWVAYGLPQAG---RDYAPVDMSERTLFDTYLP 544

Query: 278  PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
            PF++CI+ G     M ++N +NG+PA     L +   R +W F G++ SD +AV  +   
Sbjct: 545  PFKACIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVA- 602

Query: 337  QNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q   +  +D+      +G+D++    +  ++ +  I+ GK+  +D+D ++  +  ++  L
Sbjct: 603  QGVAEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYAL 662

Query: 396  GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
            GLF  DP K    +     +   E    ALD A +  VLLKND   LPL KN V S+A++
Sbjct: 663  GLFV-DPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPLAKN-VRSIAVV 720

Query: 456  GPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYV----SKTHYASGCHDVPCNSDAGFHE 509
            GPL +N +++ G +   G      ++L+G++  +    +K  YA GC D      +GF E
Sbjct: 721  GPLADNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKE 779

Query: 510  AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
            AV++A K+D VI V G       E R R  L LPG Q  L+  +  T K PV++VL  G 
Sbjct: 780  AVKLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGR 838

Query: 570  PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
            PL + +   D  +S+IL   + G +   A+A+I+FGD+NP GRL ++ +P    +VP+  
Sbjct: 839  PLSIEWV--DKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTIS-FPRVEGQVPVYY 895

Query: 630  MNMRADSSRQYPGRSYRFYT---GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
               ++      P  S   +       +Y FG+GLSYT +SY   S P             
Sbjct: 896  NYKKSGRPGDMPHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SVPQ------------ 940

Query: 687  DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                     ++ +Y   + ++        V ++VTN GD DG   V L+      S   P
Sbjct: 941  --------STQKEYTRQETIS--------VSVTVTNTGDRDGEETVQLYVNDKVASVVRP 984

Query: 747  EKQLIGFDRVHTVAKGSKEISFGVDPC 773
             K+L  F ++   A  SK + F + P 
Sbjct: 985  VKELKAFKKIFLKAGESKTVQFDISPL 1011


>gi|423302093|ref|ZP_17280116.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
           CL09T03C10]
 gi|408471184|gb|EKJ89716.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
           CL09T03C10]
          Length = 1039

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 216/762 (28%), Positives = 340/762 (44%), Gaps = 141/762 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   IG     
Sbjct: 245 RLGIPT-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRQLLRQIGLITGR 292

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E VK  Q  +  
Sbjct: 293 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVKGMQHNH-- 344

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH IAY   K         +  ++ 
Sbjct: 345 -----------------------------QVAATGKHFIAYSNNKGAREGMARVDPQMSP 375

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP---------ACLRGDLFQKARNEWGF 319
           ++ E     PF+  I +     +M SYN  +G P           LRGD+        GF
Sbjct: 376 REVEMIHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGDM--------GF 427

Query: 320 KGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGK 375
           +GY+ SD DAV  ++      K  +++    ++AG++I C        +   +  + +G+
Sbjct: 428 RGYVVSDSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNIRCTFRSPDSYVLPLRELVKEGE 487

Query: 376 VQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLL 435
           + E+ I+  + ++  V+  +GLF+   +    G    ++V  + ++++AL A+R+ IVLL
Sbjct: 488 LSEEIINDRVRDILRVKFLVGLFDHPYQTDLKG--ADEEVEKASNEEIALQASRESIVLL 545

Query: 436 KNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHY 492
           KNDK  LPLN + +  +A+ GP  +  S     Y  +     S+L+G++  +   ++  Y
Sbjct: 546 KNDKNVLPLNASTIKKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKLGGKAEVLY 605

Query: 493 ASGCHDVPC--------------NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRV 538
             GC  V                N      +AV   K+AD  +VV G       E++ R 
Sbjct: 606 TKGCELVDANWPESELMEYPLSENEQEEIEKAVSQTKQADVAVVVLGGGQRTCGENKSRS 665

Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
           SL LPG+Q+ L+ +V  T K PV+LVL  G PL +++  AD  + +IL   YPG  G KA
Sbjct: 666 SLALPGRQLDLLKAVVATGK-PVVLVLINGRPLSINW--ADKFVPAILEAWYPGSKGGKA 722

Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------ 652
           +A+++FGD+NPGG+L +T +P++  ++P    N     S Q  G       G        
Sbjct: 723 VADVLFGDYNPGGKLTVT-FPKTVGQIP---FNFPCKPSSQIDGGKNPGLNGNMSRVNGA 778

Query: 653 VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL 712
           +Y FG GLSYT + Y      S+L IS ++   + K                        
Sbjct: 779 LYPFGFGLSYTTFEY------SDLKISPAIITPNQKT----------------------- 809

Query: 713 RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
             +V   VTN G   G  VV L+ R    S  T EK L GF+RVH     +KEI+F +D 
Sbjct: 810 --YVTCKVTNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEITFPID- 866

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG------ELRHSLTIETY 808
            + L + N     ++  G   LM+G       L  +LT+  Y
Sbjct: 867 RKALELLNADMHWVVEPGEFTLMIGASSTDIRLNGTLTVSGY 908


>gi|160882671|ref|ZP_02063674.1| hypothetical protein BACOVA_00625 [Bacteroides ovatus ATCC 8483]
 gi|423289150|ref|ZP_17268000.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus CL02T12C04]
 gi|423298450|ref|ZP_17276507.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus CL03T12C18]
 gi|156111986|gb|EDO13731.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            ovatus ATCC 8483]
 gi|392662991|gb|EIY56545.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus CL03T12C18]
 gi|392667846|gb|EIY61351.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus CL02T12C04]
          Length = 1049

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 220/751 (29%), Positives = 342/751 (45%), Gaps = 124/751 (16%)

Query: 66   KSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS----- 120
            K L+S +T++EKI QLS         G P  E+ S+SL      G  +N +G  +     
Sbjct: 342  KDLLSRMTVEEKIGQLSQYVGRTLLTG-PESEYLSDSLIARGLVGSVLNISGAKTLRDLQ 400

Query: 121  ----------------------SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG 158
                                    T FP  L  + S++ +         A+E+ A     
Sbjct: 401  EKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAAKIAAIESSA----- 455

Query: 159  QAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
             AGL + +AP ++I RD RWGR  E  GED  + S  A   V  FQ   W+++       
Sbjct: 456  -AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQWNLWENNS------ 508

Query: 218  FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQP 277
                                 + AC KH +AY L + G   R      ++E+   DT+ P
Sbjct: 509  ---------------------VLACAKHWVAYGLPQAG---RDYAPVDMSERTLFDTYLP 544

Query: 278  PFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEY 336
            PF++CI+ G     M ++N +NG+PA     L +   R +W F G++ SD +AV  +   
Sbjct: 545  PFKACIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLVA- 602

Query: 337  QNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
            Q   +  +D+      +G+D++    +  ++ +  I+ GK+  +D+D ++  +  ++  L
Sbjct: 603  QGVAEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYAL 662

Query: 396  GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
            GLF  DP K    +     +   E    ALD A +  VLLKND   LPL KN V S+A++
Sbjct: 663  GLFV-DPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPLAKN-VRSIAVV 720

Query: 456  GPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYV----SKTHYASGCHDVPCNSDAGFHE 509
            GPL +N +++ G +   G      ++L+G++  +    +K  YA GC D      +GF E
Sbjct: 721  GPLADNQTELLGSWRARGEDRHVTTVLQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKE 779

Query: 510  AVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGG 569
            AV++A K+D VI V G       E R R  L LPG Q  L+  +  T K PV++VL  G 
Sbjct: 780  AVKLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGR 838

Query: 570  PLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMND 629
            PL + +   D  +S+IL   + G +   A+A+I+FGD+NP GRL ++ +P    +VP+  
Sbjct: 839  PLSIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTIS-FPRVEGQVPIY- 894

Query: 630  MNMRADSSRQYPGRSYRFYTGTQV-------YGFGHGLSYTNYSYKFLSAPSELTISASL 682
             N +  S R  PG      T   +       Y FG+GLSYT +SY   SAP         
Sbjct: 895  YNYKK-SGR--PGDMLHSSTTRHIDVPNAPLYPFGYGLSYTTFSY---SAPQ-------- 940

Query: 683  KAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVS 742
                         ++ +Y   + ++        V ++VTN GD DG   V L+      S
Sbjct: 941  ------------STQKEYTRQETIS--------VSVTVTNTGDRDGEETVQLYVNDKVAS 980

Query: 743  QGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
               P K+L  F ++   A  SK + F + P 
Sbjct: 981  VVRPVKELKAFKKIFLKAGESKTVQFDISPL 1011


>gi|255690204|ref|ZP_05413879.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
 gi|260624223|gb|EEX47094.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 954

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 222/781 (28%), Positives = 356/781 (45%), Gaps = 130/781 (16%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNG 110
           P+ + SL +  R +SL++ +T  +K++ + +      IP L +P      E++HG +   
Sbjct: 169 PYMDASLPVDERVESLLAAMTPADKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG- 226

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
                    S  T FPQ L   A++NR L   +  A+  E   + N  QA    W+P ++
Sbjct: 227 ---------SGATIFPQALAMGATWNRQLTEEVAMAIGDET-VIANTKQA----WSPVLD 272

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           + +D RWGR +ET GEDP++VS     ++K +Q +                         
Sbjct: 273 VAQDARWGRCEETFGEDPVLVSQMGGAWIKGYQSKG------------------------ 308

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                   L    KH   +     G   R S +  ++E++  +    PFR  I       
Sbjct: 309 --------LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVIRNYDCQS 357

Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT-KTHEDSAA 348
           +M +Y+   G+P     +L Q+  R EWGF G+I SDC A+  +   ++YT K   ++A 
Sbjct: 358 LMMAYSDYMGIPIAKSTELLQRILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAAN 417

Query: 349 GVLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-K 406
             L AG+  NCG T   +    A   G++  +++D     + +   R  LF  +P K   
Sbjct: 418 QALAAGIATNCGDTYNNKEVIQAAKDGRINMENLDNVCRTMLATMFRNELFEKNPCKPLD 477

Query: 407 YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG 466
           + K+ P    +  HK +A  AA + IV+L+N    LPL+K  + ++A++GP  +++    
Sbjct: 478 WNKIYPG-WNSDSHKAMAHRAACESIVMLENKDNLLPLSKE-LRTIAVLGPGADDLQP-- 533

Query: 467 GGYTGI--PCSPKSLLRGLEAYVSKTH---YASGCHDVPCNSDAGFHEAVRIAKKADFVI 521
           G YT    P   KS+L G++A VSK     Y  GC D          +AV+ A +AD V+
Sbjct: 534 GDYTPKLQPGQLKSVLTGIKAAVSKQTKVLYEKGC-DFTETGMTDIPKAVKTASQADVVV 592

Query: 522 VVAG-LDLTQETED--------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLD 572
           +V G   +++ T+D         D  +L+LPG+Q  L+ +V  T K PVIL+L  G P D
Sbjct: 593 MVLGDCSISEATKDVRKTCGENNDLATLVLPGKQQELLEAVCATGK-PVILILQAGRPYD 651

Query: 573 VSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNM 632
           +  A    +   + W+  PG+ G  A A+++FGD+NPGGRLPMT +P    ++P+   N 
Sbjct: 652 LLKASEMCKAILVNWL--PGQEGGPATADVLFGDYNPGGRLPMT-FPRHVGQLPLY-YNF 707

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
           +  S R+Y      +Y    +Y FG+GLSYT++ Y  L                      
Sbjct: 708 KT-SGRRYEYVDMEYYP---LYRFGYGLSYTSFEYSGL---------------------- 741

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
                       +V    +    V+ +V N G   G  V  L+      S  T   +L  
Sbjct: 742 ------------KVQEKPNGNVTVEATVKNVGGRAGDEVAQLYVTDMYASVKTRVMELKD 789

Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----------ELRHS 802
           F R+H     SK +SF + P + LS+ N H  R++  G   + VG          E++HS
Sbjct: 790 FARIHLNPGESKTVSFELTPYD-LSLLNDHMDRVVEKGEFKICVGGMSPDYKANNEIKHS 848

Query: 803 L 803
           +
Sbjct: 849 V 849


>gi|189463167|ref|ZP_03011952.1| hypothetical protein BACCOP_03878 [Bacteroides coprocola DSM 17136]
 gi|189430146|gb|EDU99130.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 865

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 165/461 (35%), Positives = 237/461 (51%), Gaps = 63/461 (13%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           +P+ NTSL+   RA  L+  LTL+EK+  + + + AIPRLGI AY+WW+E+LHG+     
Sbjct: 25  FPYQNTSLTPEQRASDLLERLTLEEKVSLMQNASPAIPRLGIKAYDWWNEALHGV----- 79

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-------NLGQ-AGLT 163
                G     T FPQ +  AASF+  L   + +AV+ EARA Y       NL +  GLT
Sbjct: 80  -----GRAGIATVFPQTIGMAASFDDELIYKVFTAVSDEARAKYTEFSKSGNLKRYQGLT 134

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKR 222
           FW PNINIFRDPRWGRGQET GEDP + S   V  V+  QG +N K D            
Sbjct: 135 FWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVRGLQGPDNMKYDK----------- 183

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRS 281
                           L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F++
Sbjct: 184 ----------------LHACAKHYAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKA 224

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYT 340
            +++     +MC+YN+  G P C    L  Q  R+EW +KG I SDC A++  +   ++ 
Sbjct: 225 LVQEADVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWKYKGIIVSDCGAISDFWRKGDH- 283

Query: 341 KTHED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           +TH D   ++AG + +G D+ CG    +    A+ KG + EK ID ++  L + +  LG 
Sbjct: 284 ETHPDKETASAGAVLSGTDLECGNN-YKSLPEAVQKGLIDEKQIDISVKRLLTARFELGE 342

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            +       +  +    V +  HK LAL+ AR+ IVLL+N    LPL ++    +A+IGP
Sbjct: 343 MD---EHVCWDSIPYSVVDSKAHKDLALEIARKSIVLLQNRNNILPLKEDM--KIALIGP 397

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
             N+     G Y G P    +L   L+  +      Y  GC
Sbjct: 398 NANDSVMQWGNYNGFPSHTSTLYEALKERIPANQLIYDFGC 438



 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 99/325 (30%), Positives = 156/325 (48%), Gaps = 59/325 (18%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           + SK   A    D+  N+      ++   K AD ++   G+  + E E+           
Sbjct: 570 FYSKDRAAQLNFDMGRNAPVNLQASIDKVKAADVIVFAGGISPSLEGEEMPVNAEGFKGG 629

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI-SSILWIGYPGEA 594
           DR ++ LP  Q  L++ + +  K P+I V   G  + +   E +S+I  +IL   YPG+A
Sbjct: 630 DRTTIELPAIQRRLISELKKLGK-PIIFVNYSGSAVGL---EPESKICDAILQAWYPGQA 685

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
           G  A+A+++FGD+NP G+LP+T+Y  +       D +M+        GR+YR+ T + +Y
Sbjct: 686 GGTAVADVLFGDYNPSGKLPVTFYKHTDQLPDFQDYSMK--------GRTYRYMTESPLY 737

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FGHGLSYTN++Y     P+ L+              QQT S+       EVT       
Sbjct: 738 SFGHGLSYTNFTY----GPATLS--------------QQTISQG-----KEVT------- 767

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSK-EISFGVD-P 772
            + I V N G+ DG  VV ++       +G P   L  F RVH +AKG +  +SF +D  
Sbjct: 768 -LTIPVQNTGNYDGEEVVQVYLSCSGDKEG-PSHTLRAFKRVH-IAKGQRANVSFTLDSE 824

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
             Q    N +  R++  GN+ L+ G
Sbjct: 825 TFQWFDTNTNTMRMVE-GNYELLYG 848


>gi|336404627|ref|ZP_08585320.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
 gi|335941531|gb|EGN03384.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
          Length = 861

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 169/519 (32%), Positives = 254/519 (48%), Gaps = 69/519 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   RA+ L+  LTL+EK+  + + + AIPRLGI  YEWW+E+LHG+      
Sbjct: 25  PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
               G     T FPQ +   ASFN SL   + +A + EAR    + G++G       LTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  QG     +D G           
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQG----PEDAGYD--------- 181

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
                         L AC KH   +   +W   +R+SF+A  I  +D  +T+ P F+  +
Sbjct: 182 -------------KLHACAKHFAVHSGPEW---NRHSFDAENIAPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+  G P C    L  Q  R+EWG+KG + SDC A++  +    + +T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-ET 284

Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H D   ++A  ++ G D+ CG+        A+  G + EK+ID +L  L + +  LG  +
Sbjct: 285 HPDKEHASAAAVRTGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
             P    + ++    + + EH+ LAL  AR+ +VLL+N    LPLN N    +A++GP  
Sbjct: 344 EQP---AWAEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTNL--KIAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
           N+     G Y GIP    +LL  + A + +    Y  GC  V   +     +   I  K 
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDRKTLQSLFDECSINGKP 458

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
            F        L +   +RDR   ++   Q+S     A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489



 Score = 92.0 bits (227), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 119/269 (44%), Gaps = 53/269 (19%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           AV+    AD ++   G+  + E E+           DR  + LP  Q  L+ ++ +  K+
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            V +  +G     +      +   +IL   YPG+AG  A+ + ++G++NPGGRLP+T+Y 
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
           +        D +M+        GR+YR+     ++ FGHGLSYT+++Y       E  +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLS 753

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            +  A  +  +L                          I V+N G  DG  VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
              +G P   L  F RVH  A  ++ ++ 
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816


>gi|329962030|ref|ZP_08300041.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328530678|gb|EGF57536.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 941

 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 207/753 (27%), Positives = 343/753 (45%), Gaps = 125/753 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR+L   +G     
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRALIHKVGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q     
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQQH--- 251

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH  AY   K         +   + 
Sbjct: 252 ------------------------------VAATGKHFAAYSNNKGAREGMARVDPQTSP 281

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
            + E+    PFR  I++     +M SYN  +G+P  ++G  +    + R+E GF+GY+ S
Sbjct: 282 HEVENIHIYPFRRVIKEAGLLGVMSSYNDYDGIP--IQGSYYWLTTRLRDEMGFRGYVVS 339

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DAV  ++      K  +++    ++AG+++ C        +   +  + +G + E+ +
Sbjct: 340 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLDEETV 399

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+  +GLF+   +    G     +V   E++ +AL A+R+ +VLLKN+   
Sbjct: 400 NDRVRDILRVKFLIGLFDAPYQTDLAG--ADKEVEKEENEAVALQASRESVVLLKNENST 457

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
           LPLN N V  +A+ GP  +        Y  +     ++L+G++  V   ++  Y  GC  
Sbjct: 458 LPLNINTVKKIAVCGPNADEDGYALTHYGPLAVEVTTVLKGIQDKVNGKAEVLYTKGCDL 517

Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
                      D P   D  A  ++AV  A++AD  +VV G       E++ R SL LPG
Sbjct: 518 VDANWPESEIIDYPLTPDEQAEINKAVENARRADVAVVVLGGGQRTCGENKSRSSLDLPG 577

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q+ L+ +V  T K PV+L+L  G PL V++A  D  + +IL   YPG  G  ALA+I+F
Sbjct: 578 RQLQLLQAVQATGK-PVVLILINGRPLSVNWA--DKYVPAILEAWYPGSKGGVALADILF 634

Query: 605 GDFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
           GD+NPGG+L +T +P++  ++P N       + D  +              +Y FG+GLS
Sbjct: 635 GDYNPGGKLTVT-FPKTVGQIPFNFPCKPASQIDGGKNAGPDGNMSRINGALYPFGYGLS 693

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT + Y      S L I+  +   ++K                           V++ VT
Sbjct: 694 YTTFEY------SNLEITPKVITPNEKAT-------------------------VRLKVT 722

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G   G  VV L+ R    S  T EK L GF+R+H     +KE++F +D  + L + + 
Sbjct: 723 NTGKYAGDEVVQLYTRDVLSSVTTYEKNLAGFERIHLEPGETKEVTFILD-RKHLELLDA 781

Query: 782 HGRRILPLGNHVLMVG------ELRHSLTIETY 808
             +R++  G+  +M G       L   LT+E Y
Sbjct: 782 DMKRVVEPGDFAIMAGASSEDIRLNGLLTVEDY 814


>gi|403744211|ref|ZP_10953568.1| glycoside hydrolase family 3 domain-containing protein
           [Alicyclobacillus hesperidum URH17-3-68]
 gi|403122228|gb|EJY56463.1| glycoside hydrolase family 3 domain-containing protein
           [Alicyclobacillus hesperidum URH17-3-68]
          Length = 789

 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 218/749 (29%), Positives = 340/749 (45%), Gaps = 134/749 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RL IPA     ES  G  + G            T FPQ +  A+++++ +   IG  +  
Sbjct: 92  RLAIPAL-VHEESCSGYMAKG-----------ATCFPQSIGIASTWDQDIARKIGEVIRT 139

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + RA+    Q  L   AP +++ RDPRWGR +ET GEDP +V+   + +V   QG++ + 
Sbjct: 140 QMRAVG--AQQAL---APLLDVTRDPRWGRVEETFGEDPYLVAQMGIGYVGGLQGDDLR- 193

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
            DG I  G                          KH + Y   + G N++     A I E
Sbjct: 194 -DGVIATG--------------------------KHFVGYGASEGGMNWA----PAHIPE 222

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDC 327
           ++  + +  PF + + + K   IM  Y++++GVP     DL  +  RN WGF+G + SD 
Sbjct: 223 RELREVYLYPFEAVVREAKLQSIMPGYHELDGVPCHHNRDLLVETLRNRWGFEGIVVSDY 282

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML--RHTQSAIDKGKVQEKDIDRAL 385
            AV  +FEY    +   ++A   ++AG+D+   +  +  +    A+++G+++ + +D  +
Sbjct: 283 FAVNQLFEYHQVARDKVEAAVFAVEAGVDVELPSRDVYGQPLVEAVNQGRLRIEQVDALV 342

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             + + + RLGLF    R        P+     E ++LA +AA + IVLLKN+   LPL 
Sbjct: 343 RRVLTAKFRLGLFE---RPFVDEGRAPNLFDNHEQRQLAREAAAKSIVLLKNEGNLLPLE 399

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV------------------ 487
                 +A+IGP  ++I  M G Y   PC  +SLL   E  V                  
Sbjct: 400 NRG--KIAVIGPNADSIRNMVGDYA-YPCHIESLLEQSEDNVFHSPMPKGMKSVDDFIEM 456

Query: 488 --------------SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----LDL 528
                         ++  YA GC DV  +  +G  EA  +A++AD  IVV G      D 
Sbjct: 457 KTIVQAIRDKVGDGAEVLYAKGC-DVLGDDTSGIAEAEHVARQADVAIVVVGDRAGLTDG 515

Query: 529 TQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
               E RDR +L L G Q  LV  V  T   P ++VL GG PL +++      + +IL  
Sbjct: 516 CTTGESRDRATLTLLGAQQELVERVVATGT-PTVVVLVGGRPLSITWIA--EHVPAILEA 572

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
             PGE GA A+A+++FGD NP G+LP+T  P S  +VP+     +    R +    Y   
Sbjct: 573 WLPGEEGAPAIADVVFGDMNPSGKLPIT-IPRSVGQVPIY-YGHKPSGGRSHWKGVYVDE 630

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
           +   +Y FGHGLSYT ++Y+      EL +S S           + G     +H     S
Sbjct: 631 SNKPLYAFGHGLSYTTFAYR------ELALSKS-----------EIG-----IHDTVDVS 668

Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
           C          + N GD  G  VV L+          P ++L GF RVH  AK +  ++F
Sbjct: 669 CV---------IENTGDRVGEEVVQLYVYDRAADVTRPVQELRGFARVHLEAKEAALVTF 719

Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            +    QL   N++ + ++  G   + +G
Sbjct: 720 RLS-AHQLGFYNRNLQYVVEPGAIEVQIG 747


>gi|423290405|ref|ZP_17269254.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
           CL02T12C04]
 gi|392665792|gb|EIY59315.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
           CL02T12C04]
          Length = 861

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 170/519 (32%), Positives = 256/519 (49%), Gaps = 69/519 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   RA+ L+  LTL+EK+  + + + AIPRLGI  YEWW+E+LHG+      
Sbjct: 25  PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
               G     T FPQ +   ASFN SL   + +A + EAR    + G++G       LTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  Q                     
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              G E  R D+  L AC KH   +   +W   +R+SF+A  I  +D  +T+ P F+  +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+  G P C    L  Q  R+EWG+KG + SDC A++  +    + +T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-ET 284

Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H D   ++A  ++AG D+ CG+        A+  G + EK+ID +L  L + +  LG  +
Sbjct: 285 HPDKEHASADAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
             P    + ++    + + EH+ LAL  AR+ +VLL+N    LPLN +    +A++GP  
Sbjct: 344 EQP---AWAEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
           N+     G Y GIP    +LL  + A + +    Y  GC  V   +     +   I  K 
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
            F        L +   +RDR   ++   Q+S     A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489



 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 118/269 (43%), Gaps = 53/269 (19%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           AV+    AD ++   G+  + E E+           DR  + LP  Q  L+ ++ +  K+
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            V +  +G     +      +   +IL   YPG+AG  A+ + ++G++NPGGRLP+T+Y 
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
           +        D +M+        GR+YR+     ++ FGHGLSYT ++Y       E  +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTY------GEAKLS 753

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            +  A  +  +L                          I V+N G  DG  VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
              +G P   L  F RVH  A  ++ ++ 
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816


>gi|262405256|ref|ZP_06081806.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294644754|ref|ZP_06722499.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294810589|ref|ZP_06769241.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345508031|ref|ZP_08787672.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|229444722|gb|EEO50513.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|262356131|gb|EEZ05221.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292639876|gb|EFF58149.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294442250|gb|EFG11065.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
          Length = 861

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 170/519 (32%), Positives = 256/519 (49%), Gaps = 69/519 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   RA+ L+  LTL+EK+  + + + AIPRLGI  YEWW+E+LHG+      
Sbjct: 25  PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
               G     T FPQ +   ASFN SL   + +A + EAR    + G++G       LTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  Q                     
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              G E  R D+  L AC KH   +   +W   +R+SF+A  I  +D  +T+ P F+  +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+  G P C    L  Q  R+EWG+KG + SDC A++  +    +  T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-GT 284

Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H D   ++AG ++AG D+ CG+        A+  G + EK+ID +L  L + +  LG  +
Sbjct: 285 HPDKEHASAGAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
               +  + ++    + + EH+ LAL  AR+ +VLL+N    LPLN +    +A++GP  
Sbjct: 344 ---EQSAWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
           N+     G Y GIP    +LL  + A + +    Y  GC  V   +     +   I  K 
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
            F        L +   +RDR   ++   Q+S     A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489



 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 71/271 (26%), Positives = 119/271 (43%), Gaps = 53/271 (19%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           AV+    AD ++   G+  + E E+           DR  + LP  Q  L+ ++ +  K+
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            V +  +G     +      +   +IL   YPG+AG  A+ + ++G++NPGGRLP+T+Y 
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
           +        D +M+        GR+YR+     ++ FGHGLSYT ++Y       E  +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTY------GEAKLS 753

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            +  A  +  +L                          I V+N G  DG  VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
              +G P   L  F RVH  A  ++ ++  +
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAISL 818


>gi|424661946|ref|ZP_18098983.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
           616]
 gi|404578257|gb|EKA82992.1| hypothetical protein HMPREF1205_02332 [Bacteroides fragilis HMW
           616]
          Length = 814

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 228/744 (30%), Positives = 343/744 (46%), Gaps = 136/744 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A+
Sbjct: 149 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 196

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +        V+ FQG+  + 
Sbjct: 197 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDTLRG 251

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                      K V+                A  KH  +Y    W         A + E+
Sbjct: 252 ----------RKSVI----------------ATLKHFASYG---WTEGGHNGGTAHLGER 282

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G P    R  L    ++ W FKG++ SD  
Sbjct: 283 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWLFKGFVVSDLY 341

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           A+  + E+      +E +A   + AG+D + GT +      +A+ KG V  + +D+A+  
Sbjct: 342 AIGGLREHGVAGSDYE-AAVKAVNAGVDSDLGTNVYAEQLVAAVRKGDVAMETVDKAVRR 400

Query: 388 LFSVQLRLGLFNG----DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
           + S++  +GLF+     D R  +        V + EH  LA + ARQ IVLLKN+ K LP
Sbjct: 401 ILSLKFHMGLFDAPFVDDKRPAQL-------VASPEHIGLAREVARQSIVLLKNEDKLLP 453

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSK---THYASGCHD 498
           L K+ + +LA+IGP  +N   M G YT      S  ++L G+   VSK     YA GC  
Sbjct: 454 LKKD-IRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVLYAKGCA- 511

Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR------------------- 535
           V  +S  GF +A+  A+ AD V++V G     D + E E+                    
Sbjct: 512 VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESGEGY 571

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FAEADSQISSILWIGYPGE 593
           DR +L L G+Q+ L+  V +  K P++LVL  G PL +     EAD    +IL   YPG 
Sbjct: 572 DRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQEAD----AILDAWYPGM 626

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
            G  A+A+++FGD+NP GRL ++  P S  ++P+     R  +  +Y   +     GT  
Sbjct: 627 QGGNAVADVLFGDYNPAGRLTLS-VPRSVGQLPVYYNTKRKGNRSRYIEEA-----GTPR 680

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           Y FG+GLSYT +SY                          TG ++     +E   C   R
Sbjct: 681 YPFGYGLSYTMFSY--------------------------TGMKVRVS--EESNHC---R 709

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
             V ++V N G VDG  VV L+ R    S  TP++QL  F RV   A  ++EI+F +D  
Sbjct: 710 VDVSVTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGETREITFTLDK- 768

Query: 774 EQLSIANKHGRRILPLGNHVLMVG 797
           + L++  + G   +  G   +M G
Sbjct: 769 KSLALYMRDGEWAVEPGRFTVMAG 792


>gi|403253118|ref|ZP_10919422.1| xylosidase [Thermotoga sp. EMP]
 gi|402811565|gb|EJX26050.1| xylosidase [Thermotoga sp. EMP]
          Length = 778

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 220/752 (29%), Positives = 343/752 (45%), Gaps = 143/752 (19%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA     E L G    G            T+FPQ +  A++++  L   + +A+  
Sbjct: 92  RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTTAIRE 139

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + R +      GL   AP +++ RDPRWGR +ET GE P +V+   V +VK  QGE+ K 
Sbjct: 140 DMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGEDIK- 193

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
                          KG            + A  KH   Y   + G N++  +    I E
Sbjct: 194 ---------------KG------------VVATVKHFAGYSASEGGKNWAPTN----IPE 222

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ ++ F  PF + +++     +M SY++++GVP A  R  L    R +WGF+G + SD 
Sbjct: 223 REFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFEGIVVSDY 282

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
            AV  + +Y    +   ++A   L+AG+D+    T   ++ +  ++KG + E  ID A+ 
Sbjct: 283 FAVKVLEDYHRIARNKSEAARLALEAGIDVELPKTECYQYLKDLVEKGIISEALIDEAVA 342

Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
            +  ++  LGLF     + +  K+         H+ +AL+ AR+ I+LLKND   LPL K
Sbjct: 343 RVLMLKFMLGLFENPYVEVEKAKI-------ENHRDIALEIARKSIILLKNDG-ILPLQK 394

Query: 447 NAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGLEAY 486
           N    +A+IGP    +  + G Y                 G P  P+     L + +E +
Sbjct: 395 N--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEH 452

Query: 487 VSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG------L 526
           +                  YA GC +V     +GF EA+ IAKK+D  IVV G      L
Sbjct: 453 MKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTL 511

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
           D T   E RD  +L LPG Q  LV  VA+T K PV+LVL  G P   S      ++++IL
Sbjct: 512 DCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPY--SLKNVVDKVNAIL 567

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
            +  PGEAG +A+ +II+G  NP G+LP++ +P S  ++P+    ++    R +    Y 
Sbjct: 568 QVWLPGEAGGRAIVDIIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHGDYV 625

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
             +   ++ FGHGLSYT + Y  L   P E+  +  +                       
Sbjct: 626 DESTKPLFPFGHGLSYTKFEYSNLRIEPKEVPPAGEVV---------------------- 663

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
                     +++ V N GD DG  VV L+      S   P K+L GF RV   AK  K 
Sbjct: 664 ----------IKVDVENIGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKT 713

Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           + F +   + L+  ++  + ++  G   +MVG
Sbjct: 714 VVFRLH-MDVLAYYDRDMKLVVEPGEFKVMVG 744


>gi|336415363|ref|ZP_08595703.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940959|gb|EGN02821.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
           3_8_47FAA]
          Length = 861

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 170/519 (32%), Positives = 256/519 (49%), Gaps = 69/519 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   RA+ L+  LTL+EK+  + + + AIPRLGI  YEWW+E+LHG+      
Sbjct: 25  PYQDTSLAAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
               G     T FPQ +   ASFN SL   + +A + EAR    + G++G       LTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  Q                     
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              G E  R D+  L AC KH   +   +W   +R+SF+A  I  +D  +T+ P F+  +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+  G P C    L  Q  R+EWG+KG + SDC A++  +    +  T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-GT 284

Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H D   ++AG ++AG D+ CG+        A+  G + EK+ID +L  L + +  LG  +
Sbjct: 285 HPDKEHASAGAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
               +  + ++    + + EH+ LAL  AR+ +VLL+N    LPLN +    +A++GP  
Sbjct: 344 ---EQSAWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
           N+     G Y GIP    +LL  + A + +    Y  GC  V   +     +   I  K 
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
            F        L +   +RDR   ++   Q+S     A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489



 Score = 99.4 bits (246), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 70/270 (25%), Positives = 119/270 (44%), Gaps = 53/270 (19%)

Query: 511 VRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRP 560
           V+    AD ++   G+  + E E+           DR  + LP  Q  L+ ++ +  K+ 
Sbjct: 592 VKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKVGKKV 651

Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
           V +  +G     +      +   +IL   YPG+AG  A+ + ++G++NPGGRLP+T+Y +
Sbjct: 652 VFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD 708

Query: 621 SFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
                   D +M+        GR+YR+     ++ FGHGLSYT+++Y       E  +S 
Sbjct: 709 VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLSK 754

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
           +  A  +  +L                          I V+N G  DG  VV ++ R P 
Sbjct: 755 NTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRPG 789

Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
             +G P   L  F RVH  A  ++ ++  +
Sbjct: 790 DKEG-PRYTLRAFKRVHIPAGKTESVAISL 818


>gi|224537384|ref|ZP_03677923.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521009|gb|EEF90114.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 863

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 160/459 (34%), Positives = 235/459 (51%), Gaps = 59/459 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N +LS   RA  L+  LTL+EK   + + + AIPRLGI AY+WW+E+LHG+      
Sbjct: 25  PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAGLTF 164
               G     T FPQ +   ASFN  L  ++ +AV+ EARA               GLT 
Sbjct: 79  ----GRAGLATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKRYQGLTM 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PNINIFRDPRWGRGQET GEDP +     +  V+  QG                    
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE------------------ 176

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              GE+ D+     L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F++ +
Sbjct: 177 ---GEKYDK-----LHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKNLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFE---YQNY 339
           ++     +MC+YN+  G P C    L  Q  R+EWG+K  + SDC A++  +    ++  
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISDFYNKGAHETD 285

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
                 SA  VL +G D+ CG       + A+ +G + EK ID +L  L   +  LG  +
Sbjct: 286 PDKQHASAKAVL-SGTDVECGDSYASLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            +P +  + ++    V + EH++LAL  AR+ +VLL+N++  LPLNKN    +A++GP  
Sbjct: 344 -EPSQVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQSLLPLNKNL--KVAVVGPNA 400

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
           N+     G Y G P    +LL G+  Y+ ++   Y  GC
Sbjct: 401 NDSVMQWGNYNGFPSHTITLLEGIREYLPESQIIYEPGC 439



 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 84/294 (28%), Positives = 126/294 (42%), Gaps = 60/294 (20%)

Query: 486 YVSKTHYASGCHDVPCNSDAG------FHEAVRIAKKADFVIVVAGLDLTQETEDR---- 535
           Y  K  +A    D   + D G        + V   K+AD +I   G+    E E+     
Sbjct: 563 YRIKIDFAFRNRDAALDFDMGREVPVDLKQTVDKVKEADVIIFAGGISPAVEGEEMHVNI 622

Query: 536 ------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
                 DR ++ LP  Q  L+  + +  K+ V +  +G     ++         +IL   
Sbjct: 623 PGFKGGDRETIELPSIQSRLLAELKKAGKKIVFVNFSGSA---IALTPESKTCDAILQAW 679

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
           YPG+AG  A+A ++FGD+NP GRLP+T+Y  +       D +M+        GR+YR+ T
Sbjct: 680 YPGQAGGTAIANVLFGDYNPAGRLPVTFYKSTKQLPDFEDYSMK--------GRTYRYMT 731

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
              ++ FGHGLSYT + Y           +ASL     K+  Q T               
Sbjct: 732 ENPLFPFGHGLSYTTFQYG----------NASLNTSEIKDGEQVT--------------- 766

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
                 + I V+N G  DG  VV ++ R P   +G P   L  F RV  +AKG+
Sbjct: 767 ------LTIPVSNTGKYDGEEVVQVYLRHPGDKEG-PSHALRAFKRV-AIAKGA 812


>gi|160886913|ref|ZP_02067916.1| hypothetical protein BACOVA_04927 [Bacteroides ovatus ATCC 8483]
 gi|423288977|ref|ZP_17267828.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
           CL02T12C04]
 gi|423294866|ref|ZP_17272993.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
           CL03T12C18]
 gi|156107324|gb|EDO09069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
 gi|392668741|gb|EIY62235.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
           CL02T12C04]
 gi|392676057|gb|EIY69498.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
           CL03T12C18]
          Length = 863

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 177/551 (32%), Positives = 263/551 (47%), Gaps = 79/551 (14%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YP+ +T L+   RA  L+  LTL+EK+  + +N+ AIPRLGI  YEWW+E+LHG+A  G 
Sbjct: 26  YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAG- 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----AGLT 163
                      T FPQ +  AASFN  L   +  AV+ EARA        GQ     GLT
Sbjct: 85  ---------LATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PN+NIFRDPRWGRGQET GEDP +     +  V+  QG     +D            
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQG----PEDA----------- 180

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                 E D+     L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F+  
Sbjct: 181 ------EYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKEL 226

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++     +MC+YN+  G P C    L  Q  RN+WGFKG + +DC A+   F+ + + +
Sbjct: 227 VQKAGVKEVMCAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH-E 285

Query: 342 THEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH D+A   A  + +G D+ CG    +    A+ KG + E+ I+ ++  L   +  LG  
Sbjct: 286 THPDAAHASADAVLSGTDLECGG-NFKSITDAVKKGLISEEKINTSVKRLLKARFELGEM 344

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           N       +  +    +   +HK+LAL  A + +VLL+N+   LPLN+     +A+IGP 
Sbjct: 345 NS---THPWSNIPFSVIDCPKHKELALKMAHESLVLLQNNNNILPLNRQM--KVAVIGPN 399

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA--------------YVSKTHYASGCHDVPCNSD 504
            N+     G Y G P    +LL G+ A              Y + T   S  +    + +
Sbjct: 400 ANDSVMQWGNYNGFPSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLFNQCSIDGE 459

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP---- 560
           AGF+      ++    I       T      +  ++  PG  +   T++ R++ RP    
Sbjct: 460 AGFNATYWNNREYKGKIAATDRLTTPFHFSAEGSTVFAPGVGLKNFTAIYRSTFRPTDSG 519

Query: 561 --VILVLTGGG 569
                V+T GG
Sbjct: 520 AATFRVMTNGG 530



 Score =  109 bits (273), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 141/315 (44%), Gaps = 63/315 (20%)

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
            E +   + AD VI   G+    E E            DR  + LP  Q  ++  + +  
Sbjct: 591 REILNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNG 650

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           K+ V +  +G     ++         +IL   YPG+AG  A+A+++FGD+NP GRLP+T+
Sbjct: 651 KKTVFVNFSGSA---MAIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITF 707

Query: 618 YPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           Y +S  ++P   D +M+        GR+YRF T T +Y FG+GLSYT +SY       + 
Sbjct: 708 Y-KSMQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY------GKA 752

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
           T++ S     +K IL                          I V+N G  DG  VV ++ 
Sbjct: 753 TLNQSKLTKGEKAIL-------------------------TIPVSNVGQRDGEEVVQVYI 787

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLSIANKHGRRILPL-GNH 792
             P   +G P+K L GF RV ++AKG     +I    D  E    A      I PL G +
Sbjct: 788 CRPDDKEG-PQKTLRGFQRV-SIAKGKTQNVQIELPYDSFEWFDAATN---TIRPLNGTY 842

Query: 793 VLMVGELRHSLTIET 807
            ++ G   +   ++T
Sbjct: 843 KILYGNSSNEKDLQT 857


>gi|423226625|ref|ZP_17213090.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392628884|gb|EIY22909.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 863

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 160/459 (34%), Positives = 234/459 (50%), Gaps = 59/459 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N +LS   RA  L+  LTL+EK   + + + AIPRLGI AY+WW+E+LHG+      
Sbjct: 25  PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAGLTF 164
               G     T FPQ +   ASFN  L  ++ +AV+ EARA               GLT 
Sbjct: 79  ----GRAGLATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGLKRYQGLTM 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PNINIFRDPRWGRGQET GEDP +     +  V+  QG                    
Sbjct: 135 WTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE------------------ 176

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              GE+ D+     L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F+  +
Sbjct: 177 ---GEKYDK-----LHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFE---YQNY 339
           ++     +MC+YN+  G P C    L  Q  R+EWG+K  + SDC A++  +    ++  
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISDFYNKDAHETD 285

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
                 SA  VL +G D+ CG       + A+ +G + EK ID +L  L   +  LG  +
Sbjct: 286 PDKQHASAKAVL-SGTDVECGDSYASLPE-AVKEGLIDEKQIDISLKRLMKARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            +P +  + ++    V + EH++LAL  AR+ +VLL+N++  LPLNKN    +A++GP  
Sbjct: 344 -EPSQVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQSLLPLNKNL--KVAVVGPNA 400

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
           N+     G Y G P    +LL G+  Y+ ++   Y  GC
Sbjct: 401 NDSVMQWGNYNGFPSHTITLLEGIREYLPESQIIYEPGC 439



 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 84/294 (28%), Positives = 126/294 (42%), Gaps = 60/294 (20%)

Query: 486 YVSKTHYASGCHDVPCNSDAG------FHEAVRIAKKADFVIVVAGLDLTQETEDR---- 535
           Y  K  +A    D   + D G        + V   K+AD +I   G+    E E+     
Sbjct: 563 YRIKIDFAFRNRDAALDFDMGREVPVDLKQTVDKVKEADVIIFAGGISPAVEGEEMHVNI 622

Query: 536 ------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
                 DR ++ LP  Q  L+  + +  K+ V +  +G     ++         +IL   
Sbjct: 623 PGFKGGDRETIELPSIQSRLLAELKKAGKKIVFVNFSGSA---IALTPESKTCDAILQAW 679

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYT 649
           YPG+AG  A+A ++FGD+NP GRLP+T+Y  +       D +M+        GR+YR+ T
Sbjct: 680 YPGQAGGTAIANVLFGDYNPAGRLPVTFYKSTKQLPDFEDYSMK--------GRTYRYMT 731

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
              ++ FGHGLSYT + Y           +ASL     K+  Q T               
Sbjct: 732 ENPLFPFGHGLSYTTFQYG----------NASLNTSEIKDGEQVT--------------- 766

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
                 + I V+N G  DG  VV ++ R P   +G P   L  F RV  +AKG+
Sbjct: 767 ------LTIPVSNTGKYDGEEVVQVYLRHPGDKEG-PSHALRAFKRV-AIAKGA 812


>gi|218262493|ref|ZP_03476939.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
           DSM 18315]
 gi|218223341|gb|EEC95991.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
           DSM 18315]
          Length = 868

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 156/473 (32%), Positives = 227/473 (47%), Gaps = 61/473 (12%)

Query: 50  DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
           + YPF N  L I  R   L+  LT +EK+ Q+ +   AI RLGIP Y+WW+E+LHG+A  
Sbjct: 24  EDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVARA 83

Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------G 161
           G            T FPQ +  AA+F+        + V+ EARA Y+  Q         G
Sbjct: 84  G----------KATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRYKG 133

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LTFW PNINIFRDPRWGRG ET GEDP +     V  VK  QG++ K             
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKY------------ 181

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
                              AC KH   +   +W   +R+ F+  +T +D   T+ P F +
Sbjct: 182 ---------------FKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEA 223

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYT 340
            +++G    +MC+YN+  G P C    L     RN WG++  I SDC A+   +E    T
Sbjct: 224 LVKEGNVQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWERDERT 283

Query: 341 KTH------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
             H      E ++A  +  G D+ CG    R    A+  GK+ E D+D +L  L   +  
Sbjct: 284 PRHETHPDAESASADAVLNGTDLECGNS-YRALVKALKDGKISENDLDVSLRRLLKGRFE 342

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG+F+ D  +  Y ++  + V + EH   AL+ A + +VLLKN    LPL+K  +  +A+
Sbjct: 343 LGMFDPD-EQVPYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPLSK-TIRKIAV 400

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           +GP   + + +   Y G P    ++L G+   V  T      +++ CN  A F
Sbjct: 401 VGPNAADSTMLWANYNGFPTHTVTILEGIRNKVPDTEV---IYELGCNHAADF 450



 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 85/269 (31%), Positives = 134/269 (49%), Gaps = 55/269 (20%)

Query: 515 KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           K AD ++ V G+    E E+           DR ++ LP  Q  +V ++  T K PV+ V
Sbjct: 603 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMVKALKATGK-PVVYV 661

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
           L  G  L +++ EA+  I +IL   Y G+    A+A+I+FGD+NP GRLP+T+Y +S  +
Sbjct: 662 LCTGSALALNWEEAN--IDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTFY-KSIDQ 718

Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
           +P   D +M+        GR+YR+ T T +Y FG+GLSYTN++Y+          +A L 
Sbjct: 719 LPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR----------NAKLS 760

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
           +G               +  D+  + T         + N G +DG  V  ++ + P   +
Sbjct: 761 SGK--------------IAKDQSVTLT-------FDIANTGKMDGDEVAQIYIKNPNDPE 799

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
           G P K L  F RVH  A  S+E++  + P
Sbjct: 800 G-PIKALKAFLRVHVKAGDSQEVNIELAP 827


>gi|219118959|ref|XP_002180246.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408503|gb|EEC48437.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 682

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 206/647 (31%), Positives = 301/647 (46%), Gaps = 86/647 (13%)

Query: 40  PDF-PCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASA---------IP 89
           P+F  C+     + P+C+ SLSI  R + L+S LTL EK+  +  + +          + 
Sbjct: 57  PNFWGCQNDVAKALPYCDMSLSIDERLEDLLSHLTLDEKVDMIGADPTQDVCMTHTMNVS 116

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           R+G+P Y W  E+   + S     N        T F   L  AASFNRS W   GS    
Sbjct: 117 RIGLPDYYWLVETNTAVGSACIAEN-----KCATEFSGPLSIAASFNRSSWFLKGSVFGT 171

Query: 150 EARAMYNLG----------QAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFV 199
           E RA+ N+             GLT + PNIN  RDPR+GR  E PGEDP +   YA   V
Sbjct: 172 EQRALMNVHGERFHTHSGRHIGLTAFGPNINQQRDPRFGRSSELPGEDPFLSGQYAAHMV 231

Query: 200 KSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSR 259
           +  Q                          E D      + A  KH  AY  E+      
Sbjct: 232 QGMQ--------------------------ERDANGYPKVLAYLKHFTAYSREEGRGNDD 265

Query: 260 YSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWG 318
           Y+    I+  D  DT+ P +   + QG A+ +MCSYN VNG+PAC    L  K  R  W 
Sbjct: 266 YN----ISMYDLFDTYLPQYEMGMVQGGATGVMCSYNAVNGIPACANDYLLNKILRQRWN 321

Query: 319 -FKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHT-QSAIDKGKV 376
               ++T+DC AV  +            +AA  L  G DI  G+ +  H   +AI  G  
Sbjct: 322 RSDAHVTTDCGAVNNLRGKPIQAADEAQAAAMALMNGADIEMGSTLFVHNLTTAITLGYA 381

Query: 377 QEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLK 436
            E+ +++A+   +      G F+ DP   ++  LG DD+ + +H+++ L+AA QG+VLLK
Sbjct: 382 TEEAVNQAIRRSYRPHFIAGRFD-DPTLSEWFSLGLDDIQSKKHQEIQLEAALQGLVLLK 440

Query: 437 NDKKFLPLNKNAVSSLAIIGPL-----------VNNISQMGGGYTGIPCSPKSL--LRGL 483
           ++   LP+   A + LA++GPL            ++ S  GGG+  IP   +S+  + G 
Sbjct: 441 HEDSILPI--AAGTKLAVLGPLGMTRSGLMSDYESDQSCFGGGHDCIPTLAESIGFINGK 498

Query: 484 EAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
           E  V+    A+G  DV   + +     +++A   D +++  G   TQE E  DR    LP
Sbjct: 499 EFTVA----AAGV-DVDSRNTSDVERILQLAADRDLIVLCLGNTKTQEQEGFDRKDTALP 553

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
           GQQ +L  +V  T ++PV+LVL  GG   ++        S+I+    P   G  ALA  +
Sbjct: 554 GQQYALFEAVL-TLRKPVVLVLVNGG--QIALDGMTGYPSAIIEAFNPNGIGGTALAASL 610

Query: 604 FGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
           FG  N  G+LP T YP S     M   +M+  S    PGR+YR++TG
Sbjct: 611 FGQENRWGKLPYTIYPYSV----MQSFDMKDHSMSAPPGRTYRYFTG 653


>gi|154493680|ref|ZP_02033000.1| hypothetical protein PARMER_03021 [Parabacteroides merdae ATCC
           43184]
 gi|423723902|ref|ZP_17698051.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
           CL09T00C40]
 gi|154086890|gb|EDN85935.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Parabacteroides merdae ATCC 43184]
 gi|409240709|gb|EKN33484.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
           CL09T00C40]
          Length = 868

 Score =  262 bits (669), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 156/473 (32%), Positives = 228/473 (48%), Gaps = 61/473 (12%)

Query: 50  DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
           + YPF N  L I  R   L+  LT +EKI Q+ +   AI RLGIP Y+WW+E+LHG+A  
Sbjct: 24  EDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARA 83

Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------G 161
           G            T FPQ +  AA+F+        + V+ EARA Y+  Q         G
Sbjct: 84  G----------KATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKG 133

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LTFW PNINIFRDPRWGRG ET GEDP +     V  VK  QG++ K             
Sbjct: 134 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDDPKY------------ 181

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
                              AC KH   +   +W   +R+ F+  +T +D   T+ P F +
Sbjct: 182 ---------------FKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEA 223

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYT 340
            +++G    +MC+YN+  G P C    L     RN WG++  I SDC A+   ++    T
Sbjct: 224 LVKKGNVQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRDERT 283

Query: 341 KTH------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
             H      E ++A  +  G D+ CG    +    A+ +GK+ E D+D +L  L   +  
Sbjct: 284 PRHETHPDAESASADAVLNGTDLECGNS-YKALIKALKEGKISENDLDVSLRRLLKGRFE 342

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG+F+ D R   Y ++  + V + EH   AL+ A + +VLLKN    LPL+K  +  +A+
Sbjct: 343 LGMFDPDERV-PYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPLSK-TIRKIAV 400

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           +GP   + + +   Y G P    ++L G+   V  T      +++ CN  A F
Sbjct: 401 VGPNAADSTMLWANYNGFPTHTVTILEGIRNKVPDTEV---IYELGCNHAADF 450



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 83/269 (30%), Positives = 133/269 (49%), Gaps = 55/269 (20%)

Query: 515 KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           K AD ++ V G+    E E+           DR ++ +P  Q  +V ++  T K PV+ V
Sbjct: 603 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMVKALKATGK-PVVYV 661

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
           L  G  L +++   D+ I +IL   Y G+    A+A+I+FGD+NP GRLP+T+Y +S  +
Sbjct: 662 LCTGSALALNWE--DANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTFY-KSIDQ 718

Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
           +P   D +M+        GR+YR+ T T +Y FG+GLSYTN++Y+          +A L 
Sbjct: 719 LPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR----------NAKLS 760

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
           +G               +  D+  + T         + N G +DG  V  ++ + P   +
Sbjct: 761 SGK--------------ITKDQSVTLT-------FDIANTGKMDGDEVAQIYIKNPNDPE 799

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
           G P K L  F RVH  A  S+E++  + P
Sbjct: 800 G-PIKALKAFLRVHVKAGDSQEVNIELTP 827


>gi|295086418|emb|CBK67941.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
           XB1A]
          Length = 861

 Score =  261 bits (668), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 162/483 (33%), Positives = 245/483 (50%), Gaps = 61/483 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   RA+ L+  LTL+EK+  + + + AIPRLGI  YEWW+E+LHG+      
Sbjct: 25  PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
               G     T FPQ +   ASFN SL   + +A + EAR    + G++G       LTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  Q                     
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              G E  R D+  L AC KH   +   +W   +R+SF+A  I  +D  +T+ P F+  +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+  G P C    L  Q  R+EWG+KG + SDC A++  +    + +T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-ET 284

Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H D   ++AG ++AG D+ CG+        A+  G + EK+ID +L  L + +  LG  +
Sbjct: 285 HPDKEHASAGAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
               +  + ++    + + EH+ LAL  AR+ +VLL+N    LPLN +    +A++GP  
Sbjct: 344 ---EQSAWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
           N+     G Y GIP    +LL  + A + +    Y  GC  V   +     +   I  K 
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458

Query: 518 DFV 520
            F+
Sbjct: 459 GFL 461



 Score = 92.0 bits (227), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 119/269 (44%), Gaps = 53/269 (19%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           AV+    AD ++   G+  + E E+           DR  + LP  Q  L+ ++ +  K+
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            V +  +G     +      +   +IL   YPG+AG  A+ + ++G++NPGGRLP+T+Y 
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
           +        D +M+        GR+YR+     ++ FGHGLSYT+++Y       E  +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLS 753

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            +  A  +  +L                          I V+N G  DG  VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
              +G P   L  F RVH  A  ++ ++ 
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816


>gi|299147288|ref|ZP_07040353.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298514566|gb|EFI38450.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 861

 Score =  261 bits (668), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 170/519 (32%), Positives = 255/519 (49%), Gaps = 69/519 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   RA+ L+  LTL+EK+  + + + AIPRLGI  YEWW+E+LHG+      
Sbjct: 25  PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
               G     T FPQ +   ASFN SL   + +A + EAR    + G++G       LTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  Q                     
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              G E  R D+  L AC KH   +   +W   +R+SF+A  I  +D  +T+ P F+  +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+  G P C    L  Q  R+EWG+KG + SDC A++  +    +  T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-GT 284

Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H D   ++A  ++AG D+ CG+        A+  G + EK+ID +L  L + +  LG  +
Sbjct: 285 HPDKEHASADAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
             P    + ++    + + EH+ LAL  AR+ +VLL+N    LPLN +    +A++GP  
Sbjct: 344 EQP---AWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
           N+     G Y GIP    +LL  + A + +    Y  GC  V   +     +   I  K 
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
            F        L +   +RDR   ++   Q+S     A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489



 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 72/270 (26%), Positives = 123/270 (45%), Gaps = 55/270 (20%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           AV+    AD ++   G+  + E E+           DR  + LP  Q +L+ ++ +  K+
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            V +  +G     +      +   +IL   YPG+AG  A+ + ++G++NPGGRLP+T+Y 
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFY- 706

Query: 620 ESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTI 678
           ++  ++P   D +M+        GR+YR+     ++ FGHGLSYT ++Y       E  +
Sbjct: 707 KNVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTY------GEAKL 752

Query: 679 SASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARV 738
           S +  A  +  +L                          I V+N G  DG  VV ++ R 
Sbjct: 753 SKNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRR 787

Query: 739 PKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
           P   +G P   L  F RVH  A  ++ ++ 
Sbjct: 788 PGDKEG-PRYTLRAFKRVHIPAGKTESVAI 816


>gi|189464583|ref|ZP_03013368.1| hypothetical protein BACINT_00926 [Bacteroides intestinalis DSM
           17393]
 gi|189436857|gb|EDV05842.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 879

 Score =  261 bits (668), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 160/459 (34%), Positives = 233/459 (50%), Gaps = 59/459 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N +LS   RA  L+  LTL+EK   + + + AIPRLGI AY+WW+E+LHG+      
Sbjct: 41  PYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALHGV------ 94

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAGLTF 164
               G     T FPQ +   ASFN  L  ++ +A++ EARA               GLT 
Sbjct: 95  ----GRAGLATVFPQAIGMGASFNNELLYDVFTAISDEARAKNTEFSKEGGLKRYQGLTM 150

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PNINIFRDPRWGRGQET GEDP + S   +  V+  QG                    
Sbjct: 151 WTPNINIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPE------------------ 192

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              GE+ D+     L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F+  +
Sbjct: 193 ---GEKYDK-----LHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKDLV 241

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE---YQNY 339
           ++     +MC+YN+  G P C    L     R+EWG+K  + SDC A++  +    ++  
Sbjct: 242 QKAHVKEVMCAYNRFEGEPCCGSNRLLMHILRDEWGYKEIVVSDCWAISDFYNKGAHETD 301

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
                 SA  VL +G DI CG         A+ +G + EK ID +L  L   +  LG  +
Sbjct: 302 PDKQHASAKAVL-SGTDIECGDS-YGSLPEAVKEGLIDEKQIDISLKRLMKARFELGEMD 359

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            +P +  + ++    V + EH++LAL  AR+ +VLL+N++  LPLNKN    +A++GP  
Sbjct: 360 -EPSQVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQSLLPLNKNL--KVAVVGPNA 416

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
           N+     G Y G P    +LL G+  Y+ ++   Y  GC
Sbjct: 417 NDSVMQWGNYNGFPSHTITLLEGIREYLPESQIIYEPGC 455



 Score =  102 bits (255), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 77/268 (28%), Positives = 121/268 (45%), Gaps = 56/268 (20%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
             + V   K+AD +I   G+    E E+           DR ++ LP  Q  L+  + + 
Sbjct: 606 LKQTVNKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLLAELKKA 665

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K+ V +  +G     ++         +IL   YPG+AG  A+A ++FGD+NP GRLP+T
Sbjct: 666 GKKIVFVNFSGSA---IALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 722

Query: 617 WYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
           +Y +S +++P   D +M+         R+YR+ T   ++ FGHGLSYT + Y        
Sbjct: 723 FY-KSTSQLPGFEDYSMKE--------RTYRYMTEAPLFPFGHGLSYTTFRYG------- 766

Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
               ASL     K+  Q                       + I V+N G+ DG  VV ++
Sbjct: 767 ---DASLNTQEVKDGEQTI---------------------LTIPVSNVGEYDGEEVVQVY 802

Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGS 763
            R P   +G P   L  F R + +AKG+
Sbjct: 803 LRRPGDKEG-PSHALRAFKRAN-IAKGA 828


>gi|423215029|ref|ZP_17201557.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692292|gb|EIY85530.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 861

 Score =  261 bits (668), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 170/519 (32%), Positives = 255/519 (49%), Gaps = 69/519 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   RA+ L+  LTL+EK+  + + + AIPRLGI  YEWW+E+LHG+      
Sbjct: 25  PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
               G     T FPQ +   ASFN SL   + +A + EAR    + G++G       LTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  Q                     
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              G E  R D+  L AC KH   +   +W   +R+SF+A  I  +D  +T+ P F+  +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+  G P C    L  Q  R+EWG+KG + SDC A++  +    +  T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-GT 284

Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H D   ++A  ++AG D+ CG+        A+  G + EK+ID +L  L + +  LG  +
Sbjct: 285 HPDKEHASADAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
             P    + ++    + + EH+ LAL  AR+ +VLL+N    LPLN +    +A++GP  
Sbjct: 344 EQP---AWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
           N+     G Y GIP    +LL  + A + +    Y  GC  V   +     +   I  K 
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
            F        L +   +RDR   ++   Q+S     A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489



 Score = 91.7 bits (226), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 119/269 (44%), Gaps = 53/269 (19%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           AV+    AD ++   G+  + E E+           DR  + LP  Q  L+ ++ +  K+
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            V +  +G     +      +   +IL   YPG+AG  A+ + ++G++NPGGRLP+T+Y 
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
           +        D +M+        GR+YR+     ++ FGHGLSYT+++Y       E  +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLS 753

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            +  A  +  +L                          I V+N G  DG  VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
              +G P   L  F RVH  A  ++ ++ 
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816


>gi|189464219|ref|ZP_03013004.1| hypothetical protein BACINT_00556 [Bacteroides intestinalis DSM
           17393]
 gi|189438009|gb|EDV06994.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 865

 Score =  261 bits (667), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 155/439 (35%), Positives = 230/439 (52%), Gaps = 53/439 (12%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           N S  IS R ++LIS +TL+EK+ QLS+   +IPRL +P+Y +W+E LHG+A  G     
Sbjct: 51  NLSQPISARVENLISKMTLEEKVAQLSNETDSIPRLNLPSYNYWNECLHGVARAG----- 105

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   + SA++ EAR  Y     GLT+W+P IN+ RDP
Sbjct: 106 -----EVTVFPQAINLASTWDTLLIKKVASAISTEARLKYLEIGKGLTYWSPTINMARDP 160

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V FVK  QG                           D  D
Sbjct: 161 RWGRNEETYGEDPYLTSRLGVAFVKGLQG---------------------------DHPD 193

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E      R+S ++ I  +   + + P + +C+++  A  +M +Y
Sbjct: 194 YLKTVATIKHFVANNQEN----DRFSSSSQIPTKQLYEYYFPAYEACVKEADAQSVMTAY 249

Query: 296 NQVNGVP----ACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL 351
           N  NGV       L GD+ +K   EWGF G++ SDC A+  +        + E++AA  +
Sbjct: 250 NAFNGVAPSGSTWLLGDVLRK---EWGFDGFVVSDCGAIGVMNWQHRVVNSLEEAAALGI 306

Query: 352 KAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKL 410
            +G D+ CG        +A+  G V E+ ID+AL  + + + +LG F  DP +   Y   
Sbjct: 307 NSGCDLECGGTYREKLVAAVKMGLVSEQAIDKALTRVLTARFKLGEF--DPIELVPYNHY 364

Query: 411 GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT 470
               +   +  KLA +AA + IVLLKND  FLP++K  + S+AI+GP  +N     GGY+
Sbjct: 365 DKKLLAGEKFGKLAYEAAVKSIVLLKNDNDFLPVDKKKIRSVAIVGPFADN--NYLGGYS 422

Query: 471 GIPCSPKSLLRGLEAYVSK 489
           G P    SLL+G++  V K
Sbjct: 423 GKPVHNVSLLQGVKDLVGK 441



 Score =  122 bits (307), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 104/325 (32%), Positives = 154/325 (47%), Gaps = 55/325 (16%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKK----ADFVIVVAGLDLTQETEDRDRVSLL 541
           Y++KT  A+ C  V   SD G  + +   K+    AD V+V  G D     E+RD  S+ 
Sbjct: 578 YINKTG-AAACMLV---SDFGNSDQIDKVKEFVSGADLVLVALGNDEKLARENRDLPSIY 633

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
           LP  Q  L+  + + + R   L+L  G PL   +A  +  + +IL   YPG+ G KALA 
Sbjct: 634 LPMTQELLLKEIYKVNPR-TALILHTGNPLTSKWAAEN--VPAILQAWYPGQEGGKALAG 690

Query: 602 IIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
           I+FG  NP G+LPMT Y ES  ++P + D ++       + GR+Y++ +   +YGFGHGL
Sbjct: 691 ILFGSENPSGKLPMTIY-ESEEQLPDILDYDI-------WKGRTYQYLSSKPLYGFGHGL 742

Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           SY+N+ Y  L +                          D V  D    C+       I +
Sbjct: 743 SYSNFEYTHLQSD-------------------------DVVRPDGTLQCS-------IEI 770

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGT-PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
            N  DV G  VV ++         T P K+L+ F RV      SK ++F + P  QLSI 
Sbjct: 771 KNISDVAGEEVVQVYISRENTPVYTFPLKKLVAFARVDLKPGESKTVTFTIAP-RQLSIW 829

Query: 780 NKHGRRILPLGNHVLMVGELRHSLT 804
            +   ++LP G + L VG  +  L+
Sbjct: 830 QEGIWKMLP-GKYSLFVGSGQEGLS 853


>gi|383115540|ref|ZP_09936296.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
 gi|313695055|gb|EFS31890.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
          Length = 770

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 209/708 (29%), Positives = 333/708 (47%), Gaps = 111/708 (15%)

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNIN 170
            ++ NG     T +P  +  AA+F+ +L   +    A+E RA      +G+ + + PN+ 
Sbjct: 136 AIHGNGLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA------SGMHWTFTPNVE 189

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           + RD RWGR  ET GEDP +V       V+ FQ +++  +D  I                
Sbjct: 190 VARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGNDKVI---------------- 233

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN---AMITEQDTEDTFQPPFRSCIEQGK 287
                     AC KHL+       G+      N   A ++E+  ++ F PPF+ C+E G 
Sbjct: 234 ----------ACAKHLVG------GSQPANGINGAPAELSERTLQEVFFPPFKDCLEAG- 276

Query: 288 ASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDS 346
              +M ++N++NG+P      L  +  RN+W F G++ SD   +  + +Y N  +T +D+
Sbjct: 277 VFTVMTAHNELNGIPCHGNKYLMTEVLRNQWKFDGFVVSDWMDIERMHDYHNVAETLKDA 336

Query: 347 AAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG---DP 402
               + AGM ++  G          + +G + EK ID A+  +  V+ RLGLF     D 
Sbjct: 337 YRISVDAGMGMHMHGPEFYEAIIECVKEGSIPEKQIDAAVSKILEVKFRLGLFENPFIDL 396

Query: 403 RKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           +K        D++  +E H++ AL+ AR+ IVLLKN+   LPL+ +    + + G   NN
Sbjct: 397 KK-------KDEIVFNEKHQQTALEGARKSIVLLKNEGNMLPLDASKYKKVFVTGHNANN 449

Query: 462 ISQMGGGYTGIPCSP-KSLLRGLEAYVSKTHY--ASGCHDVPCNSDAGFHEAVRIAKKAD 518
            S +G      P     ++L+GL+A   +T+Y       +V   SD    EAV+ A+ +D
Sbjct: 450 QSILGDWAMEQPEEHVTTVLKGLKAISPETNYNFLDLGWNVRLLSDNQIKEAVQQARNSD 509

Query: 519 FVIVVAGLD-----LTQET--EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
             I+V G +       ++T  E+ DR  L LPG+Q  LV +VA T   P +++L  G PL
Sbjct: 510 LAILVVGENSMRYHWNEKTCGENSDRYELSLPGRQQELVKAVAATG-VPTVVILVNGRPL 568

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
              +   D  +  I+    PG AG +ALAEI++G  NP G+LP+T        +P +   
Sbjct: 569 TTEW--IDENMPCIIEAWEPGVAGGQALAEILYGKVNPSGKLPIT--------IPRSTGQ 618

Query: 632 MRADSSRQYPGRSYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
           ++   + ++    + + TG    +Y FG+GLSYT Y Y+ L   SE TI+       DK+
Sbjct: 619 IQCMYNHKFTNHWFPYATGNSLPLYEFGYGLSYTTYKYENLKL-SEATITP------DKS 671

Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
           +                         V + VTN G +DG   V L+ R    S   P K+
Sbjct: 672 V------------------------KVTVDVTNTGKMDGEETVQLYIRDEYSSATRPVKE 707

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           L  F R+   A  +KE+SF + P E LS  + +    +  G   +MVG
Sbjct: 708 LKDFARIPLKAGETKEVSFTLTP-EMLSYYDANMHYGVEKGTFKIMVG 754


>gi|237719778|ref|ZP_04550259.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229451047|gb|EEO56838.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 861

 Score =  261 bits (666), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 161/483 (33%), Positives = 244/483 (50%), Gaps = 61/483 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   RA+ L+  LTL+EK+  + + + AIPRLGI  YEWW+E+LHG+      
Sbjct: 25  PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
               G     T FPQ +   ASFN SL   + +A + EAR    + G++G       LTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  Q                     
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              G E  R D+  L AC KH   +   +W   +R+SF+A  I  +D  +T+ P F+  +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFE---YQNY 339
           ++     +MC+YN+  G P C    L  Q  R+EWG+KG + SDC A++  +    ++ Y
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETY 285

Query: 340 TKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
               E ++AG ++AG D+ CG+        A+  G + EK+ID +L  L + +  LG  +
Sbjct: 286 PD-KEHASAGAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
               +  + ++    + + EH+ LAL  AR+ +VLL+N    LPLN +    +A++GP  
Sbjct: 344 ---EQSAWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
           N+     G Y GIP    +LL  + A + +    Y  GC  V   +     +   I  K 
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458

Query: 518 DFV 520
            F+
Sbjct: 459 GFL 461



 Score = 92.0 bits (227), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 119/269 (44%), Gaps = 53/269 (19%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           AV+    AD ++   G+  + E E+           DR  + LP  Q  L+ ++ +  K+
Sbjct: 591 AVKKVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            V +  +G     +      +   +IL   YPG+AG  A+ + ++G++NPGGRLP+T+Y 
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
           +        D +M+        GR+YR+     ++ FGHGLSYT+++Y       E  +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLS 753

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            +  A  +  +L                          I V+N G  DG  VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
              +G P   L  F RVH  A  ++ ++ 
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816


>gi|255693560|ref|ZP_05417235.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260620625|gb|EEX43496.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 770

 Score =  261 bits (666), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 209/708 (29%), Positives = 333/708 (47%), Gaps = 111/708 (15%)

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNIN 170
            ++ NG     T +P  +  AA+F+ +L   +    A+E RA      +G+ + + PN+ 
Sbjct: 136 AIHGNGLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA------SGMHWTFTPNVE 189

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           + RD RWGR  ET GEDP +V       V+ FQ +++  +D  I                
Sbjct: 190 VARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGNDKVI---------------- 233

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN---AMITEQDTEDTFQPPFRSCIEQGK 287
                     AC KHL+       G+      N   A ++E+  ++ F PPF+ C+E G 
Sbjct: 234 ----------ACAKHLVG------GSQPANGINGAPAELSERTLQEVFFPPFKDCLEAG- 276

Query: 288 ASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDS 346
              +M ++N++NG+P      L  +  RN+W F G++ SD   +  + +Y N  +T +D+
Sbjct: 277 VFTVMTAHNELNGIPCHGNKYLMTEVLRNQWKFDGFVVSDWMDIERMHDYHNVAETLKDA 336

Query: 347 AAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG---DP 402
               + AGM ++  G          + +G + EK ID A+  +  V+ RLGLF     D 
Sbjct: 337 YQISVDAGMGMHMHGPEFYEAIIECVKEGSIPEKQIDAAVSKILEVKFRLGLFENPFIDL 396

Query: 403 RKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           +K        D++  +E H++ AL+ AR+ IVLLKN+   LPL+ +    + + G   NN
Sbjct: 397 KK-------KDEIVFNEKHQQTALEGARKSIVLLKNEGNMLPLDASKYKKVFVTGHNANN 449

Query: 462 ISQMGGGYTGIPCSP-KSLLRGLEAYVSKTHY--ASGCHDVPCNSDAGFHEAVRIAKKAD 518
            S +G      P     ++L+GL+A   +T+Y       +V   SD    EAV+ A+ +D
Sbjct: 450 QSILGDWAMEQPEEHVTTVLKGLKAISPETNYNFLDLGWNVRLLSDNQIKEAVQQARSSD 509

Query: 519 FVIVVAGLD-----LTQET--EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
             I+V G +       ++T  E+ DR  L LPG+Q  LV +VA T   P +++L  G PL
Sbjct: 510 LAILVVGENSMRYHWNEKTCGENSDRYELSLPGRQQELVEAVAATG-VPTVVILVNGRPL 568

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMN 631
              +   D  +  I+    PG AG +ALAEI++G  NP G+LP+T        +P +   
Sbjct: 569 TTEW--IDENMPCIIEAWEPGVAGGQALAEILYGKVNPSGKLPIT--------IPRSTGQ 618

Query: 632 MRADSSRQYPGRSYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
           ++   + ++    + + TG    +Y FG+GLSYT Y Y+ L   SE TI+       DK+
Sbjct: 619 IQCMYNHKFTNHWFPYATGNSLPLYEFGYGLSYTTYKYENLKL-SEATITP------DKS 671

Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
           +                         V + VTN G +DG   V L+ R    S   P K+
Sbjct: 672 V------------------------KVTVDVTNTGKMDGEETVQLYIRDEYSSATRPVKE 707

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           L  F R+   A  +KE+SF + P E LS  + +    +  G   +MVG
Sbjct: 708 LKDFARIPLKAGETKEVSFTLTP-EMLSYYDANMHYGVEKGTFKIMVG 754


>gi|423344787|ref|ZP_17322476.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
           CL03T12C32]
 gi|409224378|gb|EKN17311.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
           CL03T12C32]
          Length = 866

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 155/473 (32%), Positives = 228/473 (48%), Gaps = 61/473 (12%)

Query: 50  DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
           + YPF N  L I  R   L+  LT +EKI Q+ +   AI RLGIP Y+WW+E+LHG+A  
Sbjct: 22  EDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVARA 81

Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------G 161
           G            T FPQ +  AA+F+        + V+ EARA Y+  Q         G
Sbjct: 82  G----------KATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRYKG 131

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LTFW PNINIFRDPRWGRG ET GEDP +     +  VK  QG++ K             
Sbjct: 132 LTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGLAVVKGLQGDDPKY------------ 179

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
                              AC KH   +   +W   +R+ F+  +T +D   T+ P F +
Sbjct: 180 ---------------FKTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEA 221

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYT 340
            +++G    +MC+YN+  G P C    L     RN WG++  I SDC A+   ++    T
Sbjct: 222 LVKKGNVQEVMCAYNRYQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRDERT 281

Query: 341 KTH------EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
             H      E ++A  +  G D+ CG    +    A+ +GK+ E D+D +L  L   +  
Sbjct: 282 PRHETHPDAESASADAVLNGTDLECGNS-YKALIKALKEGKISENDLDVSLRRLLKGRFE 340

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG+F+ D R   Y ++  + V + EH   AL+ A + +VLLKN    LPL+K  +  +A+
Sbjct: 341 LGMFDPDERV-PYAQIPYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPLSK-TIRKIAV 398

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           +GP   + + +   Y G P    ++L G+   V  T      +++ CN  A F
Sbjct: 399 VGPNAADSTMLWANYNGFPTHTVTILEGIRNKVPDTEV---IYELGCNHAADF 448



 Score =  116 bits (291), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 83/269 (30%), Positives = 133/269 (49%), Gaps = 55/269 (20%)

Query: 515 KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           K AD ++ V G+    E E+           DR ++ +P  Q  +V ++  T K PV+ V
Sbjct: 601 KDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMVKALKATGK-PVVYV 659

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
           L  G  L +++   D+ I +IL   Y G+    A+A+I+FGD+NP GRLP+T+Y +S  +
Sbjct: 660 LCTGSALALNWE--DANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTFY-KSIDQ 716

Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
           +P   D +M+        GR+YR+ T T +Y FG+GLSYTN++Y+          +A L 
Sbjct: 717 LPDFEDYSMK--------GRTYRYMTETPLYPFGYGLSYTNFAYR----------NAKLS 758

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
           +G               +  D+  + T         + N G +DG  V  ++ + P   +
Sbjct: 759 SGK--------------ITKDQSVTLT-------FDIANTGKMDGDEVAQIYIKNPNDPE 797

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
           G P K L  F RVH  A  S+E++  + P
Sbjct: 798 G-PIKALKAFLRVHVKAGDSQEVNIELTP 825


>gi|423294294|ref|ZP_17272421.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
           CL03T12C18]
 gi|392675485|gb|EIY68926.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
           CL03T12C18]
          Length = 861

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 169/520 (32%), Positives = 254/520 (48%), Gaps = 71/520 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   RA+ L+  LTL+EK+  + + + AIPRLGI  YEWW+E+LHG+      
Sbjct: 25  PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
               G     T FPQ +   ASFN SL   + +A + EAR    + G++G       LTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGALKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKRV 223
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  QG E+ K D             
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDTKYDK------------ 182

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          L AC KH   +   +W   +R+SF+A  I  +D  +T+ P F+  
Sbjct: 183 ---------------LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDL 224

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++     +MC+YN+  G P C    L  Q  R+EWG+KG + SDC A++  +    +  
Sbjct: 225 VQKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-G 283

Query: 342 THED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH D   ++A  ++ G D+ CG+        A+  G + EK+ID +L  L + +  LG  
Sbjct: 284 THPDKEHASAAAVRTGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEM 342

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +  P    + ++    + + EH+ LAL  AR+ +VLL+N    LPLN +    +A++GP 
Sbjct: 343 DEQP---AWSEIPASVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPN 397

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKK 516
            N+     G Y GIP    +LL  + A + +    Y  GC  V   +     +   I  K
Sbjct: 398 ANDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGK 457

Query: 517 ADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
             F        L +   +RDR   ++   Q+S     A T
Sbjct: 458 PGF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489



 Score = 91.7 bits (226), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 119/269 (44%), Gaps = 53/269 (19%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           AV+    AD ++   G+  + E E+           DR  + LP  Q +L+ ++ +  K+
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            V +  +G     +      +   +IL   YPG+AG  A+ + ++G++NPGGRLP+T+Y 
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
           +        D +M+        GR+YR+     ++ FGHGLSYT ++Y       E  +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTY------GEAKLS 753

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            +  A  +  +L                          I V+N G  DG  VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
              +G P   L  F RVH  A  ++ ++ 
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816


>gi|393781366|ref|ZP_10369565.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676859|gb|EIY70281.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
           CL02T12C01]
          Length = 854

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 164/437 (37%), Positives = 235/437 (53%), Gaps = 57/437 (13%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           R   L+S LT++EKI  L   +  IPRL I  Y   +E+LHG+   G   NF       T
Sbjct: 38  RILDLLSKLTIEEKISLLRATSPGIPRLQIDKYYHGNEALHGVVRPG---NF-------T 87

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAG--LTFWAPNINIFR 173
            FPQ +  AA +N  L + I +A++ EARA +N        LGQ    LTFW+P +N+ R
Sbjct: 88  VFPQAIGLAAMWNPQLLNEISTAISDEARARWNELEQGKKQLGQFSDLLTFWSPTVNMAR 147

Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           DPRWGR  ET GEDP +     V FVK  QG+              + R LK        
Sbjct: 148 DPRWGRTPETYGEDPFLSGKLGVSFVKGLQGD--------------DPRYLK-------- 185

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
                + +  KH  A + E     +R+  N +I+E+D  + + P F  CI +GKA+ IM 
Sbjct: 186 -----IVSTPKHFAANNEEH----NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMT 236

Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
           +YN +N VP  L   L +K  R++WGF GY+ SDC A   +  +  Y KT E +A   ++
Sbjct: 237 AYNAINDVPCTLNNWLLKKVLRHDWGFDGYVVSDCGAPDFLVTHHKYVKTLEAAATLSIQ 296

Query: 353 AGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
           AG+D+ CG    +    +A  +  V E +ID A  ++   ++RLGLF+ DP    Y K+ 
Sbjct: 297 AGLDLECGDNVYMEPLLNAYKQYMVTEAEIDSAAYHILRARMRLGLFD-DPNLNPYNKIS 355

Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
           P  V   +H +LAL+AARQ IVLLKN+KKFLPL+   + S+A++G  +N  +   G Y+G
Sbjct: 356 PSVVGCEKHSQLALEAARQSIVLLKNEKKFLPLDLKKIKSIAVVG--INAGNCEFGDYSG 413

Query: 472 IPCS-PKSLLRGLEAYV 487
            P + P S+L G++  V
Sbjct: 414 TPVNQPVSILEGIKKRV 430



 Score =  140 bits (353), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 93/299 (31%), Positives = 142/299 (47%), Gaps = 57/299 (19%)

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           +K D  I V G++ + E E +DR S+ LP  Q   +    + +   V+ VL  G  L ++
Sbjct: 600 RKCDLTIAVVGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVV-VLVAGSSLAIN 658

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMR 633
           +   D  I +I+   YPGEAG  A+AE++FGD+NPGG+LP+T+Y  S  ++P  +D ++R
Sbjct: 659 WM--DEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYY-RSLDELPAFDDYDIR 715

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
                   GR+Y+F+ G  +Y FGHGLSYT +SYK L                       
Sbjct: 716 K-------GRTYQFFEGNPLYAFGHGLSYTTFSYKKL----------------------- 745

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT------PE 747
                   +ID       + F    ++ N G  DG  V  L+ +     QG       P 
Sbjct: 746 --------NIDSTGDAVKVSF----ALKNTGKYDGDEVAQLYVKY----QGNDSLVKLPL 789

Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           KQL GF+RVH     SK ++  V   E      + G    P G+++ MVG    ++ ++
Sbjct: 790 KQLKGFERVHLKKGESKRVTLTVPKSELRFWDEEKGEFYTPAGDYLFMVGTASDAIQLQ 848


>gi|427383551|ref|ZP_18880271.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728735|gb|EKU91590.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
           12058]
          Length = 939

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 206/751 (27%), Positives = 343/751 (45%), Gaps = 123/751 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 150 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRKLIHQVGLITGR 197

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 198 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 249

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH +AY   K         +  ++ 
Sbjct: 250 -----------------------------QVAATGKHFVAYSNNKGAREGMARVDPQMSP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
           ++ E     PF+  I++     +M SYN  +G+P  ++G  +   ++ R E GF+GY+ S
Sbjct: 281 REVEMIHVYPFKRVIKEAGMLGVMSSYNDYDGIP--IQGSYYWLTKRLRGEMGFRGYVVS 338

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DAV  ++   +  K  +++    ++AG+++ C        +   +  + +G + E  I
Sbjct: 339 DSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEDII 398

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+  +GLF+   +    G     +V  +E++ +AL A+R+ ++LLKN+   
Sbjct: 399 NDRVRDILRVKFLIGLFDAPYQTDLAG--ADKEVEKAENEAVALQASRESLILLKNENNV 456

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHD 498
           LPL+ N + ++A+ GP  N        Y  +     ++L G+       ++  YA GC  
Sbjct: 457 LPLDINNIKTIAVCGPNANEEGYALTHYGPLAVEVITVLEGIRQKAEGKAEVLYAKGCDL 516

Query: 499 VPCN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
           V  N                A  ++AV  A+KAD  +VV G       E++ R SL LPG
Sbjct: 517 VDANWPESELIEYPMTNEEQAEINKAVENARKADVAVVVLGGGQRTCGENKSRSSLDLPG 576

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q+ L+ +V  T K PV+LVL  G PL +++A  D  + +IL   YPG  G  A+A+++F
Sbjct: 577 RQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILETWYPGSKGGTAVADVLF 633

Query: 605 GDFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
           GD+NPGG+L +T +P+S  ++P N     + + D  +              +Y FG+GLS
Sbjct: 634 GDYNPGGKLTVT-FPKSVGQIPFNFPCKPSSQIDGGKNPGPDGNMSRVNGSLYPFGYGLS 692

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT + Y      S + IS  +   + K                           V+  VT
Sbjct: 693 YTTFEY------SNIEISPKMMTANQKAT-------------------------VRCKVT 721

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G   G  VV L+ R    S  T EK L GF+RVH     +KE++F +D  + L + +K
Sbjct: 722 NTGKRAGDEVVQLYIRDMLSSVTTYEKNLAGFERVHLQPGETKEVTFILD-RKHLELLDK 780

Query: 782 HGRRILPLGNHVLMVG------ELRHSLTIE 806
           H   ++  G+  +MVG       L   LT+E
Sbjct: 781 HMEWVVEPGDFSIMVGASSEDIRLNGMLTVE 811


>gi|313145353|ref|ZP_07807546.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
 gi|313134120|gb|EFR51480.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
          Length = 802

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 227/744 (30%), Positives = 341/744 (45%), Gaps = 136/744 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A 
Sbjct: 137 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAT 184

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +        V+ FQG+  + 
Sbjct: 185 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDTLRG 239

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                      K V+                A  KH  +Y    W         A + E+
Sbjct: 240 ----------RKSVI----------------ATLKHFASYG---WTEGGHNGGTAHLGER 270

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCD 328
           + E+   PPFR  +  G A  +M SYN+++G P    R  L    ++ W FKG++ SD  
Sbjct: 271 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLY 329

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           A+  + E+      +E +A   + AG+D + GT +      +A+ KG V  + +D+A+  
Sbjct: 330 AIGGLREHGVAGSDYE-AAVKAVNAGVDSDLGTNVYAEQLVAAVRKGDVAMETVDKAVRR 388

Query: 388 LFSVQLRLGLFNG----DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
           +  ++  +GLF+     D R  +        V + EH  LA + ARQ IVLLKN+ K LP
Sbjct: 389 ILFLKFHMGLFDAPFVDDKRPAQL-------VASPEHIGLAREVARQSIVLLKNEDKLLP 441

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSK---THYASGCHD 498
           L K+ + +LA+IGP  +N   M G YT      S  ++L G+   VSK     YA GC  
Sbjct: 442 LKKD-IRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVLYAKGCA- 499

Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR------------------- 535
           V  +S  GF +A+  A+ AD V++V G     D + E E+                    
Sbjct: 500 VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESGEGY 559

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FAEADSQISSILWIGYPGE 593
           DR +L L G+Q+ L+  V +  K P++LVL  G PL +     EAD    +IL   YPG 
Sbjct: 560 DRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQEAD----AILDAWYPGM 614

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
            G  A+A+++FGD+NP GRL ++  P S  ++P+     R  +  +Y   +     GT  
Sbjct: 615 QGGNAVADVLFGDYNPAGRLTLS-VPRSVGQLPVYYNTKRKGNRSRYIEEA-----GTPR 668

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           Y FG+GLSYT +SY                          TG ++     +E   C   R
Sbjct: 669 YPFGYGLSYTTFSY--------------------------TGMKVRVS--EESNHC---R 697

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
             V ++V N G VDG  VV L+ R    S  TP++QL  F RV   A  ++EI+F +D  
Sbjct: 698 VDVSVTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGETREITFTLDK- 756

Query: 774 EQLSIANKHGRRILPLGNHVLMVG 797
           + L++  + G   +  G   +M G
Sbjct: 757 KSLALYMRDGEWAVEPGRFTVMAG 780


>gi|218132025|ref|ZP_03460829.1| hypothetical protein BACEGG_03650 [Bacteroides eggerthii DSM 20697]
 gi|217985785|gb|EEC52125.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 888

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 166/468 (35%), Positives = 232/468 (49%), Gaps = 63/468 (13%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C       YPF N  L +  R  +L+SLLTLQEK+  + + + AI RLGIPAY WW E+ 
Sbjct: 20  CSEAEKYQYPFQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWGEAC 79

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL----GQ 159
           HG+ + G           VT FPQ +  AA+F+ S      + V+ EARA YN     G 
Sbjct: 80  HGLIAGG-----------VTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPLDGD 128

Query: 160 AG--------LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
            G        LTFWAPNINIFRDPRWGRGQET GEDP ++S   +  V   QG       
Sbjct: 129 IGPYVSAIPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQG------- 181

Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
                               D        AC KH   Y +       R+ FNA+++ +D 
Sbjct: 182 --------------------DDEHYYKTHACAKH---YGVHSGPEPLRHEFNAVVSMRDL 218

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAV 330
            +T+ P F + + +G    +MC+Y+   G P C    L     RN WGF G + SDCDA+
Sbjct: 219 WETYLPAFETLVVKGNVREVMCAYSAYEGEPCCASNRLLVDILRNRWGFDGMVVSDCDAI 278

Query: 331 ATIF---EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLN 387
              +    ++ +      SA  VL  G D+ CG         A++KG ++E+D+D +L  
Sbjct: 279 NDFYVKGRHETHPDAAAASADAVL-TGTDLECGRS-YNALIEAVEKGIIKEQDLDVSLRR 336

Query: 388 LFSVQLRLGLFNGDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
           + + + RLGL   DP K   Y  +    +   EH+  AL  A +  VLLKN+   LPL+K
Sbjct: 337 ILTERFRLGLL--DPAKYVPYSTIPGSVIDCQEHRDHALKMAHESQVLLKNEGNILPLDK 394

Query: 447 NAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
           N + S+AI+GP +N+   M G Y+G P    ++L+GL+  +  T   S
Sbjct: 395 N-IKSIAIVGPNINDSIMMRGNYSGSPTHCITILQGLKNKLPNTRIIS 441



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 83/295 (28%), Positives = 137/295 (46%), Gaps = 47/295 (15%)

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           A F E     K  + +I V G+  TQE E  +R  + LP  Q   + ++  T K PVI V
Sbjct: 619 AVFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKRFLKAMHETGK-PVIYV 677

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
              G    ++ A+ D    ++L   YPG+ G  A+A+++FGD+NP G+LP+T+Y  +   
Sbjct: 678 NCSGSA--IALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYNPSGKLPVTFYKSTEQL 735

Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
               D +M          R+YR++ G   Y FG+GLSYT++ +        L  S+S+KA
Sbjct: 736 PEFTDYSME--------NRTYRYFKGEPQYAFGYGLSYTDFEFG-----EALLSSSSIKA 782

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
           G                           +  + I +TN G +DG+ VV ++ +    +  
Sbjct: 783 GE--------------------------KVEITIPLTNVGKMDGAEVVQVYVK-SLTNPD 815

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK--HGRRILPLGNHVLMVG 797
            P K L G+ R    A  S+++   ++P E  +  N    G ++ P G + ++ G
Sbjct: 816 APIKSLKGYVRQEIKAGKSEKVRITLEP-ESFAYYNADVDGLKVFP-GKYQILYG 868


>gi|336399370|ref|ZP_08580170.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
 gi|336069106|gb|EGN57740.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
          Length = 862

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 159/463 (34%), Positives = 235/463 (50%), Gaps = 55/463 (11%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +  LS   RAK L S LTL+EK   + D + AIPRLGI  + WWSE+LHG A+NG  
Sbjct: 24  PYQDPGLSFEARAKDLCSRLTLEEKASLMCDVSPAIPRLGIKPFNWWSEALHGYANNG-- 81

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GLT 163
                    VT FP+ +  AASFN ++   + +A + EAR  YN   A          L+
Sbjct: 82  --------DVTVFPEPIGMAASFNPTMVYQVFTATSDEARGKYNQSMAEGKEDTRFHSLS 133

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PN+NIFRDPRWGRGQET GEDP + S   VE VK  QG                   
Sbjct: 134 VWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVEVVKGLQGP------------------ 175

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN-AMITEQDTEDTFQPPFRSC 282
                 ES +  +L   AC KH   +   +   ++R++ N A I+ +D  +T+ P F++ 
Sbjct: 176 ------ESTKYRKLY--ACAKHFAVHSGPE---YTRHTANLADISPRDLWETYLPAFKAT 224

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           ++Q     +MC+Y +++  P C    L Q+  R+EWGF+  + SDC A+A  +   + + 
Sbjct: 225 VQQAGVREVMCAYQRLDDEPCCGNSRLLQQILRDEWGFRHMVVSDCGAIADFYTNHHVSS 284

Query: 342 THEDSAAGVLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
               +AA    AG D+ CG          A+ +G V E ++D+ ++ L   +  LG+ + 
Sbjct: 285 DAVHAAAKGTLAGTDVECGFGYAYMKLPEAVRRGLVSEAEVDKHVIRLLKGRFELGVMD- 343

Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
           DP+   + K+ P  V +  H++LAL+ ARQ + LL+N    LPL K     +A++GP   
Sbjct: 344 DPKLVSWTKISPKVVDSDAHRQLALNMARQTMTLLQNRNNVLPLAKG--EKIAVVGPNAA 401

Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVSK-THYASGCHDVPCN 502
           +   + G Y G P    ++L G+ A   K   Y  GC  V  N
Sbjct: 402 DGPMLWGNYNGTPSRTTTILEGIRAKAGKDIPYLQGCDLVNKN 444



 Score = 86.3 bits (212), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 128/285 (44%), Gaps = 64/285 (22%)

Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQ 546
           H+ P N    +  A+R  +    V+ V G+    E E+           DR S+ LP  Q
Sbjct: 584 HESPIN----YATAIRQLRGVRTVVFVGGISSKLEGEEMPVHVEGFKGGDRTSIELPAVQ 639

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
              + ++    K  V +  +G     ++         +IL   Y GE G +A+A++++GD
Sbjct: 640 RDFLKALKAAGKTVVFVNCSGSA---IALTPEVESCDAILQAWYAGEEGGRAVADVLYGD 696

Query: 607 FNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNY 665
           +NPGG+LP+T+Y  S T++P  +D +M+        GR+YR+++   ++ FG+GLSYT +
Sbjct: 697 YNPGGKLPVTFY-RSTTQLPAFDDYSMK--------GRTYRYFSDA-LFPFGYGLSYTRF 746

Query: 666 SY--KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
           +     LSAP       ++KA                            +  + + V+N 
Sbjct: 747 AIGKGSLSAP-------AMKADG--------------------------KVTLTVPVSNV 773

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
           G   G  VV ++ R    + G P K L  F RV   A  S++++ 
Sbjct: 774 GKRTGDEVVQVYVRDVNDADG-PLKSLKAFRRVSLKAGESRKVTI 817


>gi|325103214|ref|YP_004272868.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
 gi|324972062|gb|ADY51046.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
           12145]
          Length = 866

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 162/462 (35%), Positives = 239/462 (51%), Gaps = 61/462 (13%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF +  L    R   L+  LT++EK+  + D +  I RLGI  Y WW+E+LHG+A  G 
Sbjct: 24  YPFQDNRLPFDKRVDDLLQRLTVEEKVLLMQDVSRPIERLGIKQYNWWNEALHGVARAG- 82

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--LGQA------GLT 163
                      T FPQ +  AASF+R    N+ +AV+ EARA +N  L Q       GLT
Sbjct: 83  ---------LATVFPQPIGMAASFDRDALFNVFNAVSDEARAKHNYHLSQGSYGRYEGLT 133

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKR 222
            W P INIFRDPRWGRG ET GEDP + +   V+ VK  QG  N K D            
Sbjct: 134 MWTPTINIFRDPRWGRGIETYGEDPYLTAVMGVQAVKGLQGPSNGKYDK----------- 182

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRS 281
                           L AC KH   +   +W   +R+SF+A  I ++D  +T+ P F +
Sbjct: 183 ----------------LHACAKHFAVHSGPEW---NRHSFDAANIKQRDLYETYLPAFEA 223

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT 340
            +++ K   +MC+YN+  G P C    L Q+  R +WGF+G + +DC A+A  F+ +N  
Sbjct: 224 LVKEAKVQEVMCAYNRFEGDPCCGSDRLLQQILRKKWGFEGIVVADCGAIADFFK-ENAH 282

Query: 341 KTHED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           KTH D   ++A  + +G D++CG+     T+ A+ KG ++EKDID ++  L   + RLG 
Sbjct: 283 KTHPDAASASAAAVYSGTDLDCGSSYKALTE-AVKKGLIEEKDIDVSVRRLLMARFRLGE 341

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + D     + K+  + V +  H ++ALD AR+ I LL+N    LPL    +  +A++GP
Sbjct: 342 MD-DQSLVPWSKISYNVVASKAHNQIALDMARKSITLLQNKNNILPLKSGGL-KIAVMGP 399

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
              +     G Y G P +  ++L G++A +    K  Y   C
Sbjct: 400 NAQDSVMQWGNYNGTPANTITILEGIKAALGPKDKLIYEQAC 441



 Score =  109 bits (273), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 153/331 (46%), Gaps = 63/331 (19%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKK------ADFVIVVAGLDLTQETEDR---- 535
           Y  K ++A    D   N D G+ E   I K       AD V+ V G+  + E E+     
Sbjct: 566 YDIKFYFAHNNGDAQLNFDIGYKEEANINKSIKNIAGADLVVFVGGISPSLEGEEMGVKL 625

Query: 536 ------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
                 DR  + LP  Q   V ++    KR VI +   G P+ ++   A+S+     W  
Sbjct: 626 PGFRGGDRTDIQLPTIQRQFVKALKEAGKR-VIFINCSGSPIGLADEMANSEAIVQAW-- 682

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFY 648
           YPG+AG +A+A+++FG +NP GRLP+T+Y ++ T++P   + +M         GR+YR+ 
Sbjct: 683 YPGQAGGQAVADVLFGKYNPSGRLPITFYRDT-TQLPDFENYDM--------AGRTYRYM 733

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
               ++ FG+GLSYT + Y                      IL Q            +T+
Sbjct: 734 QDKPLFPFGYGLSYTQFQY-------------------GNPILNQ----------QVITN 764

Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
             +++  V   VTN G   G  VV ++ R    + G P K L  F R+   A  ++++ F
Sbjct: 765 GQTIQLTV--PVTNTGKRSGDEVVQVYLRKKGDATG-PVKTLRDFRRLSFNAGQTQQVVF 821

Query: 769 GVDPCEQLSIANKHGRRI-LPLGNHVLMVGE 798
            + P +QL   N+  + + +  G++ L+VG+
Sbjct: 822 KITP-KQLEWWNEQSKAMQVQSGDYELLVGK 851


>gi|29350122|ref|NP_813625.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
           [Bacteroides thetaiotaomicron VPI-5482]
 gi|29342034|gb|AAO79819.1| periplasmic beta-glucosidase precursor, xylosidase/arabinosidase
           [Bacteroides thetaiotaomicron VPI-5482]
          Length = 769

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 218/742 (29%), Positives = 334/742 (45%), Gaps = 137/742 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++ +L   +G+ +A 
Sbjct: 114 RLGIPLF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPTLIEEVGNVIAK 161

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R+     Q     + P +++ RDPRW R +ET GEDP++        +          
Sbjct: 162 EIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMIL--------- 207

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
              G+G G              D   E    A  KH +AY + + G    Y   A +  +
Sbjct: 208 ---GLGSG--------------DLSCEYATIATLKHFLAYAVPEGGQNGNY---ASVGTR 247

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
           D  + F PPFR  I+ G  S +M SYN ++GVP      L  Q  RNEW F+G++ SD  
Sbjct: 248 DLHENFLPPFREAIDAGALS-VMTSYNSIDGVPCTANHYLLTQLLRNEWRFRGFVVSDLY 306

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  + E      T E++A   + AG DI+ G     +   A+  GK+ E  ID A+  +
Sbjct: 307 SIEGVHESHFVAPTIEEAAMQAVSAGADIDLGGDAFMNLTHAVQFGKISEAVIDTAVCRV 366

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     +P      K     V + +H KLA   A+  IVLLKN+   LPLN
Sbjct: 367 LRMKFEIGLFEHPYVNP------KTATKIVRSKDHIKLARKVAQSSIVLLKNENSILPLN 420

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV-----SKTHYASGC--HD 498
           K  +  +A++GP  +N   M G YT  P   +++   L+  +     SK  Y  GC   D
Sbjct: 421 KK-IKKVAVVGPNADNRYNMLGDYTA-PQEDENIKTVLDGVISKLSPSKVEYVRGCAIRD 478

Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDR 535
              N  A   EAV  A +++ +I V G     D     QET                E  
Sbjct: 479 TTVNEIA---EAVEAASRSEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGF 535

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR +L L G+Q  L+ ++  T K P+I+V   G PLD  +  A     ++L   YPG+ G
Sbjct: 536 DRATLTLLGKQQDLLIALKATGK-PLIVVYIEGRPLDKVW--ASEYADALLTASYPGQEG 592

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A+++FGD+NP GRLP++  P S  ++P+   N +A  +  Y  ++      + +Y 
Sbjct: 593 GYAIADVLFGDYNPAGRLPVS-IPRSVGQIPVY-YNKKAPRNHDYVEQA-----ASPLYT 645

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT + Y                  SD  ++++             + C    F 
Sbjct: 646 FGYGLSYTTFEY------------------SDLQVIRK-------------SPC---HFE 671

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V   V N G  DG  V  L+ R    S   P +QL  F+R        KEI F +   + 
Sbjct: 672 VSFKVKNTGSYDGEEVAQLYLRDEYASVVQPLRQLKCFERFFLKRGEEKEIFFTLTE-KD 730

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LSI +++ +R++  G+  +M+G
Sbjct: 731 LSIIDRNMKRVVETGDFRIMIG 752


>gi|261405721|ref|YP_003241962.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261282184|gb|ACX64155.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 765

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 216/762 (28%), Positives = 353/762 (46%), Gaps = 135/762 (17%)

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           E +  +   A    RLGIP      E  HG  + G      GTV     FP  L   +++
Sbjct: 89  EAVNHIQRYAVEQSRLGIPIL-IGEECSHGHMAIG------GTV-----FPVPLSIGSTW 136

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
           N  L+ ++  AVA+E R+     Q G   ++P +++ RDPRWGR +E  GEDP ++S YA
Sbjct: 137 NVDLYRDMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEYA 191

Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
           V  V+  QGE+  S                             ++A  KH + Y   + G
Sbjct: 192 VASVEGLQGESLDSPSS--------------------------VAATLKHFVGYGSSEGG 225

Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
              R +    +  ++  +    PF+  +E G AS IM +YN+++GVP  +  +L     R
Sbjct: 226 ---RNAGPVHMGTRELMEVDMLPFKKAVEAGAAS-IMPAYNEIDGVPCTVNTELLDGILR 281

Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDK 373
            EWGF G + +DC A+  +    +  +   D+A   ++AG+D+     M  +H Q A++ 
Sbjct: 282 KEWGFDGMVITDCGAIDMLASGHDTAEDGMDAAVQAIRAGIDMEMSGEMFGKHLQKAVES 341

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
            K++   +D A+  + +++ +LGLF     DP+  +      + + + +H  LA   A +
Sbjct: 342 NKLEVSVLDEAVRRVLTLKFKLGLFENPYVDPQTAE------NVIGSEQHVGLARQLAAE 395

Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG--IPCSPKSLLRGLEAYV- 487
           GIVLLKN+ K LPL+K     +A+IGP  +      G YT    P +  ++L G+ A + 
Sbjct: 396 GIVLLKNEAKALPLSKEG-GVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLG 454

Query: 488 ---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----------LDL----T 529
               +  YA GC  +  +S  GF  A+  A++AD V++V G           +DL    +
Sbjct: 455 EEAQRVLYAPGCR-IKDDSREGFEFALTCAEQADTVVMVLGGSSARDFGEGTIDLRTGAS 513

Query: 530 QETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           + T+D           DR++L L G Q+ LV  + +  KR +I+V   G P+   +   D
Sbjct: 514 KVTDDALSDMDCGEGIDRMTLQLSGVQLELVQEIHKLGKR-MIVVYINGRPIAEPW--ID 570

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQ 639
               +IL   YPG+ G  A+A+I+FGD NP G+L M+  P+   ++P+     R+     
Sbjct: 571 EHADAILEAWYPGQEGGHAVADILFGDVNPSGKLTMS-IPKHVGQLPVYYNGKRSR---- 625

Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLD 699
             G+ Y        Y FG+GLSYT +SY  +    E+        G+D   +        
Sbjct: 626 --GKRYLEEDSQPRYPFGYGLSYTEFSYSDIQMTPEVI-------GTDGTAV-------- 668

Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
                           V ++VTN+GD +GS VV L+          P ++L GF ++   
Sbjct: 669 ----------------VSVNVTNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKIFLQ 712

Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
               +++ F + P EQL    +  R+++  G   +M+G  RH
Sbjct: 713 PGERRKVEFTIGP-EQLQYIGQDYRQVVEPGLFRVMLG--RH 751


>gi|423279982|ref|ZP_17258895.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
           610]
 gi|404584318|gb|EKA88983.1| hypothetical protein HMPREF1203_03112 [Bacteroides fragilis HMW
           610]
          Length = 812

 Score =  260 bits (664), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 228/747 (30%), Positives = 343/747 (45%), Gaps = 142/747 (19%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E  HG  + G            T FP  +  A+++N  L   +G  +A+
Sbjct: 147 RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRQMGRVIAI 194

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA A     Q     + P +++ RDPRW R +ET GEDP +        V+ FQG+  + 
Sbjct: 195 EASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGAALVRGFQGDTLRG 249

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                      K V+                A  KH  +Y    W         A + E+
Sbjct: 250 ----------RKSVI----------------ATLKHFASYG---WTEGGHNGGTAHLGER 280

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITS 325
           + E+   PPFR  +  G A  +M SYN+++G P      L  D+ +   + W FKG++ S
Sbjct: 281 ELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILE---DRWLFKGFVVS 336

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRA 384
           D  A+  + E+      +E +A   + AG+D + GT +      +A+ KG V  + +D+A
Sbjct: 337 DLYAIGGLREHGVAGSDYE-AAVKAVNAGVDSDLGTNVYAEQLVAAVRKGDVAMETVDKA 395

Query: 385 LLNLFSVQLRLGLFNG----DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
           +  + S++  +GLF+     D R  +        V + EH  LA + ARQ IVLLKN+ K
Sbjct: 396 VRRILSLKFHMGLFDAPFVDDKRPAQL-------VASPEHIGLAREVARQSIVLLKNEDK 448

Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC--SPKSLLRGLEAYVSK---THYASG 495
            LPL K+ + +LA+IGP  +N   M G YT      S  ++L G+   VSK     YA G
Sbjct: 449 LLPLKKD-IRTLAVIGPNADNGYNMLGDYTAPQADGSVVTVLEGIRQKVSKDTRVFYAKG 507

Query: 496 CHDVPCNSDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR---------------- 535
           C  V  +S  GF +A+  A+ AD V++V G     D + E E+                 
Sbjct: 508 CA-VRDSSRTGFADAIESARSADVVVMVVGGSSARDFSSEYEETGAAKVSANRVSDMESG 566

Query: 536 ---DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FAEADSQISSILWIGY 590
              DR +L L G+Q+ L+  V +  K P++LVL  G PL +     EAD    +IL   Y
Sbjct: 567 EGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIKGRPLLMEGVIQEAD----AILDAWY 621

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
           PG  G  A+A+++FGD+NP GRL ++  P S  ++P+     R  +  +Y   +     G
Sbjct: 622 PGMQGGNAVADVLFGDYNPAGRLTLS-VPRSVGQLPVYYNTKRKGNRSRYIEEA-----G 675

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
           T  Y FG+GLSYT +SY                          TG ++     +E   C 
Sbjct: 676 TPRYPFGYGLSYTTFSY--------------------------TGMKVRVS--EESNHC- 706

Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
             R  V ++V N G VDG  VV L+ R    S  TP++QL  F RV   A  + EI+F +
Sbjct: 707 --RVDVSVTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFRRVRLKAGETWEITFTL 764

Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVG 797
           D  + L++  + G   +  G   +M G
Sbjct: 765 DK-KSLALYMRDGEWAVEPGRFTVMAG 790


>gi|293370402|ref|ZP_06616956.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292634550|gb|EFF53085.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 863

 Score =  260 bits (664), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 175/551 (31%), Positives = 263/551 (47%), Gaps = 79/551 (14%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YP+ +T L++  RA  L+  LTL+EK+  + +N+ AIPRLGI  YEWW+E+LHG+A  G 
Sbjct: 26  YPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAG- 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----AGLT 163
                      T FPQ +  AASFN  L   +  AV+ EARA        GQ     GLT
Sbjct: 85  ---------LATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRYQGLT 135

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PN+NIFRDPRWGRGQET GEDP +     +  V+  QG     +D            
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQG----PEDA----------- 180

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                 E D+     L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F+  
Sbjct: 181 ------EYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKEL 226

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++     +MC+YN+  G P C    L  Q  RN+WGFKG + +DC A+   F+ + + +
Sbjct: 227 VQKAGVKEVMCAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH-E 285

Query: 342 THEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH D+A   A  + +G D+ CG    +    A+ K  + E+ I+ ++  +   +  LG  
Sbjct: 286 THPDAAHASADAVLSGTDLECGG-NFKSITDAVKKDLISEEKINTSVKRVLKARFELGEM 344

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           N       +  +    +   +HK+LAL  A + +VLL+N+   LPLN+     +A+IGP 
Sbjct: 345 NS---THPWSNIPFSVIDCPKHKELALKMAHESLVLLQNNNNILPLNRQM--KVAVIGPN 399

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA--------------YVSKTHYASGCHDVPCNSD 504
            N+     G Y G P    +LL G+ A              Y + T   S  +    + +
Sbjct: 400 ANDSVMQWGNYNGFPSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLFNQCSIDGE 459

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP---- 560
           AGF+      ++    I       T      +  ++  PG  +   T++ R++ RP    
Sbjct: 460 AGFNATYWNNREYKGKIAATDRLTTPFHFSAEGSTVFAPGVGLKNFTAIYRSTFRPTDSG 519

Query: 561 --VILVLTGGG 569
                V+T GG
Sbjct: 520 AATFRVMTNGG 530



 Score =  110 bits (274), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 141/315 (44%), Gaps = 63/315 (20%)

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
            E +   + AD VI   G+    E E            DR  + LP  Q  ++  + +  
Sbjct: 591 REILNKLQSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNG 650

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           K+ V +  +G     ++         +IL   YPG+AG  A+A+++FGD+NP GRLP+T+
Sbjct: 651 KKTVFVNFSGSA---MAIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITF 707

Query: 618 YPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           Y +S  ++P   D +M+        GR+YRF T T +Y FG+GLSYT +SY       + 
Sbjct: 708 Y-KSMQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY------GKA 752

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
           T++ S     +K IL                          I V+N G  DG  VV ++ 
Sbjct: 753 TLNQSKLTKGEKAIL-------------------------TIPVSNVGQRDGEEVVQVYI 787

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLSIANKHGRRILPL-GNH 792
             P   +G P+K L GF RV ++AKG     +I    D  E    A      I PL G +
Sbjct: 788 CRPDDKEG-PQKTLRGFQRV-SIAKGKTQNVQIELPYDSFEWFDAATN---TIRPLNGTY 842

Query: 793 VLMVGELRHSLTIET 807
            ++ G   +   ++T
Sbjct: 843 KILYGNSSNEKDLQT 857


>gi|225872720|ref|YP_002754177.1| xylan 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
 gi|225793233|gb|ACO33323.1| xylann 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
          Length = 721

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 228/782 (29%), Positives = 334/782 (42%), Gaps = 144/782 (18%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N +LS   R   L+S +TLQEKIQ L D+   +PRLGIP      E LHG A  GP
Sbjct: 24  YPFQNPALSPDQRIDDLLSRMTLQEKIQALGDD-PGVPRLGIPG-ALTEEGLHGAAIGGP 81

Query: 112 GVNFNG---TVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEAR-AMYNLGQAGLTFWAP 167
             ++ G    V   T FPQ      +++ +L     +  A E R A+      GL   AP
Sbjct: 82  A-HWEGRGRAVVPTTQFPQNHGLGQTWDPALLQKAANVEAYETRWAVNKYHDGGLIVRAP 140

Query: 168 NINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN---WKSDDGGIGFGFREKRVL 224
           N N+ RDPRWGR +E+ GEDP +V   AV ++K  QG N   W++               
Sbjct: 141 NANLSRDPRWGRTEESYGEDPYLVGTLAVAWIKGLQGNNPRYWET--------------- 185

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                          +A  KH  AY  E   + S  +F   +  +     +  PFR  IE
Sbjct: 186 ---------------AALMKHFDAYSNEANRDGSSSNFGKRLFYE----YYSVPFRMGIE 226

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           QG +   M SYN  NG+P      L      +WGF G I +D  A++ +  + +Y KT  
Sbjct: 227 QGHSDAFMTSYNAWNGIPMTANPVLKSVVMKKWGFNGIICTDAGALSNMVTHFHYYKTMP 286

Query: 345 DSAAGVLKAGMDINCGTCMLRHTQ---SAIDKGKVQEKDIDRALLNLFSVQLRLGLF--- 398
           ++AAG + AG++      + R+ Q    A+ +  + E+ ID+ L  ++ V LRLGL    
Sbjct: 287 EAAAGAVHAGIN----QFLDRYQQPVEEALQQKLLTEQQIDQDLKGVYRVVLRLGLMDPS 342

Query: 399 -----------NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
                      N +P KG       D      H  L      + IVLLKN    LPL+  
Sbjct: 343 SMSPYSMIGLTNDNPAKG-------DPWDWPSHIALDRKVTDESIVLLKNQNHALPLDAK 395

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
            + S+A+IGP  N ++     Y+G P    + + G+   V           V  N  +  
Sbjct: 396 KLHSIAVIGPWANIVAL--DWYSGTPPFGVTPVEGIRQRVGPDV------KVTFNDGSNL 447

Query: 508 HEAVRIAKKADFVIVVAGLDLT------------QETEDRDRVSLLLPGQQMSLVTSVAR 555
             A  +AK++D  IV+ G   T            +  E  DR +L LP + ++     A 
Sbjct: 448 QAAAALAKQSDEAIVIIGNHPTCDAGWGKCALPSEGKEAFDRTALNLPDESIAKAVYAAN 507

Query: 556 TSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPM 615
                 ++VL    P    + +A   I +IL + +  E    ALA+++FGD++P GRL  
Sbjct: 508 PH---TVVVLQTSFPYTTDWTQA--HIPAILEMAHNSEEQGTALADVLFGDYDPAGRLAQ 562

Query: 616 TWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSE 675
           TW        PM D N+R        GR+Y +     +Y FG GLSYT + Y  L   S 
Sbjct: 563 TWVASIGQLPPMMDYNIRD-------GRTYMYLKSKPLYPFGFGLSYTTFKYSNLRLSSH 615

Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
                +L AG                           +  V + VTN G  +G  VV ++
Sbjct: 616 -----TLPAGG--------------------------QLTVSVDVTNTGKYNGDEVVQMY 644

Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA----NKHGRRILPLGN 791
            +        P + L GFDRV      ++ ++    P +  ++A    N H  R+ P  +
Sbjct: 645 VKHLDSKVSRPLEALKGFDRVSIPVGQTRTVTL---PLKASALAYWDKNTHAFRVEP--D 699

Query: 792 HV 793
           HV
Sbjct: 700 HV 701


>gi|298481648|ref|ZP_06999839.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298272189|gb|EFI13759.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 861

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 162/483 (33%), Positives = 243/483 (50%), Gaps = 61/483 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   RA+ L+  LTL+EK+  + + + AIPRLGI  YEWW+E+LHG+      
Sbjct: 25  PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
               G     T FPQ +   ASFN SL   + +A + EAR    + G++G       LTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  Q                     
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              G E  R D+  L AC KH   +   +W   +R+SF+A  I  +D  +T+ P F+  +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+  G P C    L  Q  R+EWG+KG + SDC A++  +    +  T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-GT 284

Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H D   ++A  ++AG D+ CG+        A+  G + EK+ID +L  L + +  LG  +
Sbjct: 285 HPDKEHASADAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
             P    + ++    + + EH+ LAL  AR+ +VLL+N    LPLN +    +A++GP  
Sbjct: 344 EQP---AWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
           N+     G Y GIP    +LL  + A + +    Y  GC  V   +     +   I  K 
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458

Query: 518 DFV 520
            F+
Sbjct: 459 GFL 461



 Score = 92.8 bits (229), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 120/269 (44%), Gaps = 53/269 (19%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           AV+    AD ++   G+  + E E+           DR  + LP  Q +L+ ++ +  K+
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRNLLKALKKAGKK 650

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            V +  +G     +      +   +IL   YPG+AG  A+ + ++G++NPGGRLP+T+Y 
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
           +        D +M+        GR+YR+     ++ FGHGLSYT+++Y       E  +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLS 753

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            +  A  +  +L                          I V+N G  DG  VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQCDGEEVVQVYLRRP 788

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
              +G P   L  F RVH  A  ++ ++ 
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816


>gi|317474349|ref|ZP_07933623.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909030|gb|EFV30710.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 877

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 165/468 (35%), Positives = 232/468 (49%), Gaps = 63/468 (13%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C       YPF N  L +  R  +L+SLLTLQEK+  + + + AI RLGIPAY WW E+ 
Sbjct: 9   CSEAEKYQYPFQNPKLKVEERVDNLMSLLTLQEKVSMMINRSLAIERLGIPAYNWWGEAC 68

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL----GQ 159
           HG+ + G           VT FPQ +  AA+F+ S      + V+ EARA YN     G 
Sbjct: 69  HGLIAGG-----------VTVFPQSIALAATFDDSSQLTTYTMVSDEARARYNTLPLDGD 117

Query: 160 AG--------LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDD 211
            G        LTFWAPNINIFRDPRWGRGQET GEDP ++S   +  V   QG       
Sbjct: 118 IGPYVSAIPNLTFWAPNINIFRDPRWGRGQETYGEDPFLMSRMGLNVVLGMQG------- 170

Query: 212 GGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDT 271
                               D        AC KH   Y +       R+ FNA+++ +D 
Sbjct: 171 --------------------DDEHYYKTHACAKH---YGVHSGPEPLRHEFNAVVSMRDL 207

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAV 330
            +T+ P F + + +G    +MC+Y+   G P C    L     RN WGF G + SDCDA+
Sbjct: 208 WETYLPAFETLVVKGNVREVMCAYSAYEGEPCCASNRLLVDILRNRWGFDGMVVSDCDAI 267

Query: 331 ATIF---EYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLN 387
              +    ++ +      SA  VL  G D+ CG         A++KG ++E+D+D +L  
Sbjct: 268 NDFYVKGRHETHPDAAAASADAVL-TGTDLECGRS-YNALIEAVEKGIIKEQDLDVSLRR 325

Query: 388 LFSVQLRLGLFNGDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
           + + + RLGL   DP K   Y  +    +   EH+  AL  A +  VLLKN+   LPL+K
Sbjct: 326 ILTERFRLGLL--DPAKYVPYSTIPGSVIDCQEHRDHALKMAHESQVLLKNEGNILPLDK 383

Query: 447 NAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS 494
           N + S+A++GP +N+   M G Y+G P    ++L+GL+  +  T   S
Sbjct: 384 N-IKSIAVVGPNINDSIMMRGNYSGSPTHCITILQGLKNKLPNTRIIS 430



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 83/295 (28%), Positives = 137/295 (46%), Gaps = 47/295 (15%)

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           A F E     K  + +I V G+  TQE E  +R  + LP  Q   + ++  T K PVI V
Sbjct: 608 AVFEELKEKIKDVEAIIYVGGITPTQEGEGHERAKIELPDVQKRFLKAMHETGK-PVIYV 666

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
              G    ++ A+ D    ++L   YPG+ G  A+A+++FGD+NP G+LP+T+Y  +   
Sbjct: 667 NCSGSA--IALADIDYAYDALLQAWYPGQEGGTAVADVLFGDYNPSGKLPVTFYKSTEQL 724

Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
               D +M          R+YR++ G   Y FG+GLSYT++ +        L  S+S+KA
Sbjct: 725 PEFTDYSME--------NRTYRYFKGEPQYAFGYGLSYTDFEFG-----EALLSSSSIKA 771

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
           G                           +  + I +TN G +DG+ VV ++ +    +  
Sbjct: 772 GE--------------------------KVEITIPLTNVGKMDGAEVVQVYVK-SLTNPD 804

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK--HGRRILPLGNHVLMVG 797
            P K L G+ R    A  S+++   ++P E  +  N    G ++ P G + ++ G
Sbjct: 805 APIKSLKGYVRQEIKAGKSEKVRITLEP-ESFAYYNADVDGLKVFP-GKYQILYG 857


>gi|261880507|ref|ZP_06006934.1| xylosidase [Prevotella bergensis DSM 17361]
 gi|270332847|gb|EFA43633.1| xylosidase [Prevotella bergensis DSM 17361]
          Length = 948

 Score =  259 bits (663), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 206/749 (27%), Positives = 342/749 (45%), Gaps = 125/749 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + GI S              T+FP  L    ++NR L   +G     
Sbjct: 161 RLGIPV-DFTNEGIRGIES-----------YKATNFPTQLGLGTTWNRQLIRQVGYITGR 208

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   ++  +  Q     
Sbjct: 209 EARLL------GYTNVYAPILDVGRDQRWGRYEEIYGESPFLVAELGIQMTRGLQ----- 257

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                      +  +++  KH  AY   K G       +  +  
Sbjct: 258 --------------------------TDFQVASTAKHFAAYSNNKGGREGMSRVDPQMPP 291

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
           ++ E+    P+   +++      M SYN  +G+P  ++G    L +  R+ +GF+GYI S
Sbjct: 292 REVENIHLYPWERVVQEAGLLGAMSSYNDYDGIP--IQGSYHWLTEVLRHRFGFRGYIVS 349

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DA+  +F   +     +++    + AG+++ C        +   +  I +G++    I
Sbjct: 350 DSDALEYLFSKHHTAADMKEAVYQAVMAGLNVRCTFRSPDSFVLPLRELIREGRIPMSVI 409

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           DR + ++  V+   G+F+ +P +    K    +V +  ++ +AL A+RQ IVLLKN  + 
Sbjct: 410 DRLVGDILRVKFITGIFD-NPYQMNL-KAADQEVNSERNQAVALQASRQSIVLLKNQDRL 467

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHD 498
           LPL+++ +  + + GP  ++ S     Y  +     ++L G+   V    +  YA GC  
Sbjct: 468 LPLDRSKLRRILVCGPNADDASYALTHYGPLAVDVTTVLEGIRDKVENNIEVSYAKGCDV 527

Query: 499 V----PCNSDAGF----------HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
           V    P +   G+            AV +AK++D  IVV G +     E++ R SL LPG
Sbjct: 528 VDPHWPESEIIGYPMTSQEQQDIDHAVALAKESDVAIVVLGGNSRTCGENKSRSSLDLPG 587

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q+ L+ +V  T K PV+LVL  G PL V++A  D  I +I+   YPG  G  A+A+++F
Sbjct: 588 RQLDLLKAVQATGK-PVVLVLINGRPLSVNWA--DRFIPAIVEAWYPGSQGGTAVADVLF 644

Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGT------QVYGFGH 658
           GD+NPGG+L +T +P+S  ++P    N  +  + Q  G +     G        +Y FGH
Sbjct: 645 GDYNPGGKLTVT-FPKSVGQIP---FNFPSKPASQVDGGNKLGLQGNASRINGALYSFGH 700

Query: 659 GLSYTNYSYKFLSAPSE-LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQ 717
           GLSYT + Y  L    E +T++ S+    D                              
Sbjct: 701 GLSYTTFKYSNLRLSKETMTLNDSINISCD------------------------------ 730

Query: 718 ISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLS 777
             V+N GD +G  VV L+ R    S  T EK L GFDR+H     +K ++F + P E L 
Sbjct: 731 --VSNTGDREGDEVVQLYIRDVISSVTTYEKNLRGFDRIHLKPGETKTLTFTIKP-EHLK 787

Query: 778 IANKHGRRILPLGNHVLMVGELRHSLTIE 806
           + NK   +++  G   +M+G     + +E
Sbjct: 788 LVNKDFEKVVEPGEFKIMIGASSEDIRLE 816


>gi|393773722|ref|ZP_10362116.1| glycosyl hydrolase family 3 N terminal domain protein
           [Novosphingobium sp. Rr 2-17]
 gi|392720897|gb|EIZ78368.1| glycosyl hydrolase family 3 N terminal domain protein
           [Novosphingobium sp. Rr 2-17]
          Length = 846

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 153/425 (36%), Positives = 221/425 (52%), Gaps = 54/425 (12%)

Query: 69  ISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQV 128
           I+   L EK+ QL   A A   +G+PAY+WW+E LHG+A NG            T FPQ 
Sbjct: 17  IAKADLAEKVGQLQSTAPAAKAMGLPAYDWWNEGLHGLARNG----------EATVFPQA 66

Query: 129 LVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFWAPNINIFRDPRWGRG 180
           +  AA+F+  L   +G+ ++ EARA +N            GLT W+PNINIFRDPRWGRG
Sbjct: 67  IGLAATFDPVLMEKVGTVISTEARAKFNTRPVNADRRIYEGLTIWSPNINIFRDPRWGRG 126

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
           QET GEDP +    AV F+K  QG +               RV+                
Sbjct: 127 QETYGEDPFLSGTLAVGFIKGLQGTD-----------PAHPRVI---------------- 159

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           A  KH   +   + G   R SF+  ++ QD E T+ P FR  + +G+   +MC+YN  +G
Sbjct: 160 ATPKHFAVHSGPEAG---RDSFDVDVSPQDREATYLPAFRKAVTEGRPLSVMCAYNSTHG 216

Query: 301 VPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC 359
            P C    L  Q+ R +WGFKG++ SDCDA+A ++ + +Y     +++A  +KAG D +C
Sbjct: 217 TPVCADDALINQRLRKDWGFKGFVVSDCDAIANVWMFHHYAADAAEASAASIKAGTDFDC 276

Query: 360 GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE 419
           G       Q A+++G V E  IDRAL      +  LG+  G      +G++ P  V T  
Sbjct: 277 GNTYAALPQ-AVERGLVDEATIDRALARSLQARHSLGIAFG--AANPWGRIKPSQVHTLA 333

Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSL 479
            + LAL+AAR+ IVLL+ND   LPL     + LA++G   +++S +   Y G    P + 
Sbjct: 334 SRALALEAARKSIVLLQNDNARLPLKPG--TKLAVVGANADDLSVIEASYHGTAADPITP 391

Query: 480 LRGLE 484
           L+G+ 
Sbjct: 392 LQGIR 396



 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 94/311 (30%), Positives = 141/311 (45%), Gaps = 63/311 (20%)

Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETE----------DRDRVSLLLPGQQMSLVTS 552
           +D    EA+   + AD ++ V GL    E E            DR  + LP  Q  L+ +
Sbjct: 558 ADVQRAEAIAATRNADAIVAVLGLSPDLEGEALQVQVPGFVGGDRSDIALPRPQAQLLEA 617

Query: 553 VARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGR 612
           +  T K PVI+VLT G  + +    AD+ +++  W  YPGE G  ALAE + G  NP GR
Sbjct: 618 LHATGK-PVIVVLTSGSAIAIDPKLADAVLAA--W--YPGEEGGTALAETLGGINNPSGR 672

Query: 613 LPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA 672
           LP+T+Y  +       D +M+         RSYR++TGT ++GFGHGLSYTN+ Y    A
Sbjct: 673 LPVTFYRSTSDLPAYVDYSMKE--------RSYRYFTGTPLWGFGHGLSYTNFGY---DA 721

Query: 673 PSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
           P  +                           D V+        V ++V N G   G  VV
Sbjct: 722 PKAVATGIG----------------------DPVS--------VTVTVHNTGVRAGEDVV 751

Query: 733 MLFARVPKV------SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRI 786
             +   P +      +Q   ++QL  F R+      ++ I+  +DP E +S+ ++ G R 
Sbjct: 752 QAYVVPPTIEPEPIMTQAVLQRQLAAFQRIALAPGQARAITLNLDPRE-ISVVDRRGIRR 810

Query: 787 LPLGNHVLMVG 797
           L  G + + VG
Sbjct: 811 LVPGEYRVWVG 821


>gi|319787180|ref|YP_004146655.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
 gi|317465692|gb|ADV27424.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
           suwonensis 11-1]
          Length = 903

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 159/435 (36%), Positives = 220/435 (50%), Gaps = 64/435 (14%)

Query: 68  LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQ 127
           L++ LT +EK+ QL + A AIPRLG+P Y WW+ESLHG     P           T+F +
Sbjct: 54  LVAKLTTEEKLDQLLNTAPAIPRLGVPHYNWWTESLHGAMGTLP----------TTNFAE 103

Query: 128 VLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---------AGLTFWAPNINIFRDPRWG 178
            +  AA+F+  L   +G  +A E R ++ L +          GL  W+PNINIFRDPRWG
Sbjct: 104 PIGLAATFDDGLVKRVGQVIATEVRGLHALARQTGRTGRMGTGLNTWSPNINIFRDPRWG 163

Query: 179 RGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELM 238
           RGQET GEDP + +   V +VK  QGE+ +  D                           
Sbjct: 164 RGQETYGEDPFLAARMGVAYVKGVQGEDPRWYD--------------------------- 196

Query: 239 LSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQV 298
           + A  KH   Y +      +R+S N  ++  D EDT+ P FR+ I +G A  +MC+YN+V
Sbjct: 197 IIATPKH---YAVHSGPESTRHSANVYVSRHDLEDTYLPAFRAAIVEGGAGSVMCAYNRV 253

Query: 299 NGVPACLRGDLFQK--ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMD 356
           +G PAC   DL  K   R  W FKGY+ SDCDAV  I  + +Y      + A  +++G+D
Sbjct: 254 DGQPACAN-DLLLKDYLRGAWDFKGYVVSDCDAVTDINRHHHYAPDAASAVAAAMRSGVD 312

Query: 357 INCGTCML-------RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
             C    L       R  Q A+++G +   D+DRAL+ LF+ + R G   G  R      
Sbjct: 313 NECNGATLVDMAGLARPYQEALERGLISMADVDRALVRLFAARYRNGDLPGL-RPLSTET 371

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
             P DV   EH  LAL+ A + +VLLKN    LPL   A   LA++GPL +    + G Y
Sbjct: 372 ASPADVGKPEHAALALEVAEKSLVLLKN-SGVLPLRPQA--KLAVVGPLADATRVLRGNY 428

Query: 470 TGIPCSPK-SLLRGL 483
           +     P  S+L GL
Sbjct: 429 SSPQSGPPVSVLEGL 443



 Score =  117 bits (294), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 91/294 (30%), Positives = 138/294 (46%), Gaps = 54/294 (18%)

Query: 514 AKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVIL 563
           A ++D V+ V GL    E E+           D+ SL L   Q +L+ + AR + +P+++
Sbjct: 629 AAQSDVVVAVVGLTSDLEAEEAPVEVPGFKGGDKTSLDLLADQQALLEA-ARATGKPLVV 687

Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
           V+  G P+++++A      ++I+   YPG++G  A+  ++ G  NP GRLP+T+Y     
Sbjct: 688 VVMNGSPVNLAWAR--QHAAAIIEAWYPGQSGGLAIGNVLAGHANPAGRLPLTFYRSVDD 745

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
             P +D +MR        GR+YR++ G  VY FGHGLSYT++ Y     P +L  +    
Sbjct: 746 LPPFDDYDMR--------GRTYRYFEGKPVYPFGHGLSYTSFGY----GPLKLEPA---- 789

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
           AG     L+ T               T LR        N G   G  V  L+   P V  
Sbjct: 790 AGGAHEGLKVT---------------TELR--------NTGKRAGDEVAQLYLDFPDVD- 825

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           G P   L GF RVH      + + F +DP   LS     G+R +  GN+ + VG
Sbjct: 826 GLPRIALRGFQRVHLAPGERRVLEFQLDP-RDLSAVTPEGQRQVFAGNYRVFVG 878


>gi|383115617|ref|ZP_09936373.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
 gi|313694979|gb|EFS31814.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
          Length = 946

 Score =  259 bits (662), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 209/735 (28%), Positives = 339/735 (46%), Gaps = 115/735 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 153 RLGIPT-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRELIRQVGVITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q     
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQ----- 249

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                      +  ++A  KH IAY   K G       +  ++ 
Sbjct: 250 --------------------------QDYQVAATGKHFIAYSNNKGGREGMSRVDPQMSP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ E     PF+  I +     +M SYN  +G P       L  + R E GF+GY+ SD 
Sbjct: 284 REVEMVHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDS 343

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  ++   N  K  +++    ++AG+++ C        +   +  + +G + E+ I+ 
Sbjct: 344 DAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVIND 403

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+  +GLF+   +    G    ++V  +E++++AL A+R+ IVLLKND+  LP
Sbjct: 404 RVRDILRVKFLVGLFDHPYQTDLKG--ADEEVEKAENEEVALQASRESIVLLKNDQDVLP 461

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVP 500
           L+ + +  +A+ GP  +  S   G Y  +     S+L+G++       +  Y+ GC  V 
Sbjct: 462 LDISGIKKIAVCGPNADECSYALGHYGPLAVEVTSVLKGIQEKTDGKVEVLYSKGCELVD 521

Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
            N                     AV  AK+AD  +VV G       E++ R SL LPG+Q
Sbjct: 522 ANWPESELIDFPLTEEEQKEIDRAVSQAKEADVAVVVLGGGQRTCGENKSRSSLDLPGRQ 581

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
           + L+ +V  T K PV+LVL  G PL +++  AD  + +IL   YPG  G KA+A+++FGD
Sbjct: 582 LDLLKAVVATGK-PVVLVLINGRPLSINW--ADKFVPAILEAWYPGAKGGKAVADVLFGD 638

Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGT-QVYGFGHGLSY 662
           +NPGG+L +T +P++  ++P N     + + D  +  PG           +Y FGHGLSY
Sbjct: 639 YNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKN-PGMDGNMSRANGALYAFGHGLSY 696

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T++ Y      S+L I+ ++   + K                          +V   VTN
Sbjct: 697 TSFEY------SDLKITPAVITPNQKT-------------------------YVTCKVTN 725

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T EK L GF+R+H     +KE+ F +D  + L + N  
Sbjct: 726 TGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERIHLKPGETKEVFFPID-RKALELLNAD 784

Query: 783 GRRILPLGNHVLMVG 797
              ++  G+  LMVG
Sbjct: 785 MHWVVEPGDFTLMVG 799


>gi|336415490|ref|ZP_08595829.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940369|gb|EGN02236.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
           3_8_47FAA]
          Length = 863

 Score =  259 bits (662), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 177/551 (32%), Positives = 259/551 (47%), Gaps = 79/551 (14%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YP+ +T L+   RA  L+  LTL+EK+  + +N+ AIPRLGI  YEWW+E+LHG+A  G 
Sbjct: 26  YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAG- 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----AGLT 163
                      T FPQ +  AASFN  L   +  AV+ EARA        GQ     GLT
Sbjct: 85  ---------LATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLT 135

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PN+NIFRDPRWGRGQET GEDP +     +  V+  QG     +D            
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQG----PEDA----------- 180

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                 E D+     L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F+  
Sbjct: 181 ------EYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKEL 226

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++     +MC+YN+  G P C    L  Q  RN+WGFKG + +DC A+   F+ + + +
Sbjct: 227 VQKAGVKEVMCAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH-E 285

Query: 342 THEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH D+A   A  +  G D+ CG    +    A+ KG + E+ I+ ++  L   +  LG  
Sbjct: 286 THPDAAHASADAVLNGTDLECGG-NFKSITDAVKKGLISEEKINTSVKRLLKARFELGEM 344

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           N       +  +    +   +HK+LAL  A + +VLL+N    LPLN+     +A+IGP 
Sbjct: 345 N---PTHPWSNIPYSVINCPKHKELALKMAHESLVLLQNKNNILPLNRQM--KVAVIGPN 399

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA--------------YVSKTHYASGCHDVPCNSD 504
            N+     G Y G P    +LL G+ A              Y + T   S  +    +  
Sbjct: 400 ANDSVMQWGNYNGFPSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLFNQCSIDGK 459

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP---- 560
           AGF       ++    I       T      +  ++  PG  +   T++ R++ RP    
Sbjct: 460 AGFSATYWNNREYKGKIAATDRLTTPFHFSAEGSTVFAPGVGLKNFTAIYRSTFRPTDSG 519

Query: 561 --VILVLTGGG 569
                V+T GG
Sbjct: 520 AATFRVMTNGG 530



 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 93/314 (29%), Positives = 142/314 (45%), Gaps = 63/314 (20%)

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
           E +   K AD VI   G+    E E            DR  + LP  Q  ++  + +  K
Sbjct: 592 EVLNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK 651

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
           + V +  +G     ++         +IL   YPG+AG  A+A+++FG++NP GRLP+T+Y
Sbjct: 652 KTVFVNFSGSA---MAIVPETQSCDAILQAWYPGQAGGTAVADVLFGNYNPAGRLPITFY 708

Query: 619 PESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
            +S  ++P   D +M+        GR+YRF T T +Y FG+GLSYT +SY       + T
Sbjct: 709 -KSIQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY------GKAT 753

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
           ++ S  A  +K IL                          I V+N G  DG  VV ++  
Sbjct: 754 LNQSKLAKGEKAIL-------------------------TIPVSNVGQRDGEEVVQVYIC 788

Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKE---ISFGVDPCEQLSIANKHGRRILPL-GNHV 793
            P   +G P+K L GF RV+ +AKG  +   I    D  E    A      I PL G + 
Sbjct: 789 RPD-DKGGPQKTLRGFQRVN-IAKGKTQNVNIELPYDSFEWFDTATN---TIRPLSGTYK 843

Query: 794 LMVGELRHSLTIET 807
           ++ G   +   ++T
Sbjct: 844 ILYGNSSNENDLQT 857


>gi|170288668|ref|YP_001738906.1| glycoside hydrolase family 3 protein [Thermotoga sp. RQ2]
 gi|170176171|gb|ACB09223.1| glycoside hydrolase family 3 domain protein [Thermotoga sp. RQ2]
          Length = 778

 Score =  259 bits (661), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 218/752 (28%), Positives = 343/752 (45%), Gaps = 143/752 (19%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA     E L G    G            T+FPQ +  A++++  L   + +A+  
Sbjct: 92  RLGIPAM-IHEECLTGYMGLGG-----------TNFPQAIAMASTWDPDLIEKMTTAIRE 139

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + R +      GL   AP +++ RDPRWGR +ET GE P +V+   V +VK  QGE+ K 
Sbjct: 140 DMRKIG--AHQGL---APVLDVARDPRWGRTEETFGESPYLVARMGVSYVKGLQGEDIK- 193

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
                          KG            + A  KH   Y   + G N++  +    I E
Sbjct: 194 ---------------KG------------VVATVKHFAGYSASEGGKNWAPTN----IPE 222

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ ++ F  PF + +++     +M SY++++GVP A  R  L    R +WGF+G + SD 
Sbjct: 223 REFKEVFLFPFEAAVKEANVLSVMNSYSEIDGVPCAANRKLLTDILRKDWGFEGIVVSDY 282

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALL 386
            AV  + +Y    +   ++A   L+AG+D+    T   ++ +  ++KG + E  ID A+ 
Sbjct: 283 FAVKVLEDYHRIARDKSEAARLALEAGIDVELPKTECYQYLKDLVEKGIISEALIDEAVA 342

Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
            +  ++  LGLF     + +  K+         H+ +AL+ AR+ I+LLKND   LPL+K
Sbjct: 343 RVLRLKFMLGLFENPYVEVEKAKI-------ESHRDIALEIARKSIILLKNDG-ILPLSK 394

Query: 447 NAVSSLAIIGPLVNNISQMGGGYT----------------GIPCSPKS----LLRGLEAY 486
                +A+IGP    +  + G Y                 G P  P+     L + +E +
Sbjct: 395 E--KKVALIGPNAGEVRNLLGDYMYLAHIRALLDNIDDVFGNPQIPRENYERLKKSIEEH 452

Query: 487 VSKT--------------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG------L 526
           +                  YA GC +V     +GF EA+ IAKK+D  IVV G      L
Sbjct: 453 MKSIPSVLDAFKEEGIEFEYAKGC-EVTGEDRSGFEEAIEIAKKSDVAIVVVGDKSGLTL 511

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
           D T   E RD  +L LPG Q  LV  VA+T K PV+LVL  G P   S      ++++IL
Sbjct: 512 DCTT-GESRDMANLKLPGVQEELVLEVAKTGK-PVVLVLITGRPY--SLKNVVDKVNAIL 567

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYR 646
            +  PGEAG +++ +II+G  NP G+LP++ +P S  ++P+    ++    R +    Y 
Sbjct: 568 QVWLPGEAGGRSIVDIIYGKVNPSGKLPIS-FPRSAGQIPVFHY-VKPSGGRSHWHGDYV 625

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDE 705
             +   ++ FGHGLSYT + Y  L   P E+  +  +                       
Sbjct: 626 DESTKPLFPFGHGLSYTKFEYSNLRIEPKEVPPAGEVV---------------------- 663

Query: 706 VTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
                     +++ V N GD DG  VV L+      S   P K+L GF RV   AK  K 
Sbjct: 664 ----------IKVDVENTGDRDGDEVVQLYIGREFASVTRPVKELKGFKRVSLKAKEKKT 713

Query: 766 ISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           + F +   + L+  ++  + ++  G   +MVG
Sbjct: 714 VVFRLH-MDVLAYYDRDMKLVVEPGEFRVMVG 744


>gi|299148437|ref|ZP_07041499.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298513198|gb|EFI37085.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 863

 Score =  259 bits (661), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 177/551 (32%), Positives = 259/551 (47%), Gaps = 79/551 (14%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YP+ +T L+   RA  L+  LTL+EK+  + +N+ AIPRLGI  YEWW+E+LHG+A  G 
Sbjct: 26  YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAG- 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----AGLT 163
                      T FPQ +  AASFN  L   +  AV+ EARA        GQ     GLT
Sbjct: 85  ---------LATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLT 135

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PN+NIFRDPRWGRGQET GEDP +     +  V+  QG     +D            
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQG----PEDA----------- 180

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                 E D+     L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F+  
Sbjct: 181 ------EYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKEL 226

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++     +MC+YN+  G P C    L  Q  RN+WGFKG + +DC A+   F+ + + +
Sbjct: 227 VQKAGVKEVMCAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH-E 285

Query: 342 THEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH D+A   A  +  G D+ CG    +    A+ KG + E+ I+ ++  L   +  LG  
Sbjct: 286 THPDAAHASADAVLNGTDLECGG-NFKSITDAVKKGLISEEKINTSVKRLLKARFELGEM 344

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           N       +  +    +   +HK+LAL  A + +VLL+N    LPLN+     +A+IGP 
Sbjct: 345 N---PTHPWSNIPYSVINCPKHKELALKMAHESLVLLQNKNNILPLNRQM--KVAVIGPN 399

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA--------------YVSKTHYASGCHDVPCNSD 504
            N+     G Y G P    +LL G+ A              Y + T   S  +    +  
Sbjct: 400 ANDSVMQWGNYNGFPSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLFNQCSIDGK 459

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP---- 560
           AGF       ++    I       T      +  ++  PG  +   T++ R++ RP    
Sbjct: 460 AGFSATYWNNREYKGKIAATDRLTTPFHFSAEGSTVFAPGVGLKNFTAIYRSTFRPTDSG 519

Query: 561 --VILVLTGGG 569
                V+T GG
Sbjct: 520 AATFRVMTNGG 530



 Score =  113 bits (282), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 94/314 (29%), Positives = 142/314 (45%), Gaps = 63/314 (20%)

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
           E +   K AD VI   G+    E E            DR  + LP  Q  ++  + +  K
Sbjct: 592 EVLNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK 651

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
           + V +  +G     ++         +IL   YPG+AG  A+A+++FGD+NP GRLP+T+Y
Sbjct: 652 KTVFVNFSGSA---MAIVPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY 708

Query: 619 PESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
            +S  ++P   D +M+        GR+YRF T T +Y FG+GLSYT +SY       + T
Sbjct: 709 -KSIQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY------GKAT 753

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
           ++ S  A  +K IL                          I V+N G  DG  VV ++  
Sbjct: 754 LNQSKLAKGEKAIL-------------------------TIPVSNVGQRDGEEVVQVYIC 788

Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKE---ISFGVDPCEQLSIANKHGRRILPL-GNHV 793
            P   +G P+K L GF RV+ +AKG  +   I    D  E    A      I PL G + 
Sbjct: 789 RPD-DKGGPQKTLRGFQRVN-IAKGKTQNVNIELPYDSFEWFDTATN---TIRPLSGTYK 843

Query: 794 LMVGELRHSLTIET 807
           ++ G   +   ++T
Sbjct: 844 ILYGNSSNENDLQT 857


>gi|295135996|ref|YP_003586672.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294984011|gb|ADF54476.1| putative beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 796

 Score =  259 bits (661), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 216/754 (28%), Positives = 340/754 (45%), Gaps = 131/754 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP      E++HG  + G            T FP  +  A+++N  L   +   +A 
Sbjct: 126 RLGIPLL-LEEEAMHGHMAVG-----------TTVFPTAIGQASTWNPDLIKKMAHVIAK 173

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E RA     Q   T + P I+I R+PRW R +ET GEDP +++      V  FQG +   
Sbjct: 174 EIRA-----QGSNTAYGPIIDIAREPRWSRVEETFGEDPYLIAEMGKSMVTGFQGSH--- 225

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                               ESD      ++A  KH  AY + + G+         I ++
Sbjct: 226 --------------------ESDLKSNEHVAATLKHFAAYGVSEGGH---NGAAVHIGQR 262

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNE-WGFKGYITSDCD 328
           D    +  P +  ++ G  S +M +Y+ ++GVP+    +L      E WGFKG++ SD  
Sbjct: 263 DLFQNYMYPVKEAVDNGVMS-VMTAYSSIDGVPSTAHKNLLTNILKEKWGFKGFVISDLA 321

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQ-SAIDKGKVQEKDIDRALLN 387
           ++  +    +   T ED+AA  + AG+D++ G          A++ GKV E+ ID A+  
Sbjct: 322 SIEGLLGDHHIVDTEEDAAAMAMNAGVDVDLGGNGYDDALIDAVNAGKVAEERIDEAVRR 381

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
           + +V+ +LGLF  +P   +  K     V  SEH +LA + ARQ I +LKN+   LPLNK 
Sbjct: 382 ILTVKFKLGLFE-NPYANE--KQAEKIVRNSEHIELAREVARQSITMLKNEDNILPLNKE 438

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKT-----HYASGCHDVPCN 502
            + ++A+IG   +      G YT  P S ++++  LE    K       Y  G   V   
Sbjct: 439 -LQNIAVIGSNADMQYNQLGDYTA-PQSEENIITVLEGIQHKMPNANIEYVKGTA-VRDT 495

Query: 503 SDAGFHEAVRIAKKADFVIVVAG--------------------------LDLTQETEDRD 536
           +      AV  AK A+  IVV G                          L   +  E  D
Sbjct: 496 TQTNIPAAVEAAKNAEVAIVVLGGSSARDFKTEYLETGAATISSKEDQVLSDMESGEGYD 555

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
           R +L L G+Q+ L+ +V  T   P +LVL  G PL +++   +  +    W  YPG+ G 
Sbjct: 556 RSTLNLMGKQLELLQAVVATGT-PTVLVLIKGRPLLLNWPAENVPVILDAW--YPGQEGG 612

Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ---V 653
            A+A++IFGDFNP GRLP++  P+S  ++P+         +  +P R  R Y  T    +
Sbjct: 613 SAIADVIFGDFNPAGRLPVS-VPKSLGQIPVY-------YNYWFPNR--RDYVETDAKPL 662

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           Y FG+GLSY+ + Y      S+L ++ S   G  +N                       +
Sbjct: 663 YPFGYGLSYSEFKY------SDLKVATS---GKGRNT----------------------K 691

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
             + + ++N   VDG  V+ L+ R    +  +P KQL  F+RV   A  +K + F + P 
Sbjct: 692 IEISLKISNTSKVDGDEVIQLYIRDMVSTVLSPVKQLRAFERVSIKAGETKTVQFELLPK 751

Query: 774 EQLSIANKHGRRILPLGNHVLMVGELRHSLTIET 807
           E LS+ +   ++ +  G   LM+G     + +ET
Sbjct: 752 E-LSLFDTEMKQKVQAGEFKLMIGASSEDIRLET 784


>gi|270296173|ref|ZP_06202373.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270273577|gb|EFA19439.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 942

 Score =  259 bits (661), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 212/756 (28%), Positives = 343/756 (45%), Gaps = 129/756 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRELIRQVGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH  AY   K         +  ++ 
Sbjct: 253 -----------------------------QVAATGKHFAAYSNNKGAREGMARVDPQMSP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
           ++ E+    PF+  I +     +M SYN  +G+P  ++G  +    + R E GF+GY+ S
Sbjct: 284 REVENIHIYPFKRVIREAGMLGVMSSYNDYDGIP--VQGSYYWLTTRLRGEMGFRGYVVS 341

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DAV  ++      K  +++    ++AG+++ C        +   +  + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVI 401

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+  +GLF+   +    G     +V   E++ +AL A+R+ IVLLKN  + 
Sbjct: 402 NDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKEENEAIALQASRESIVLLKNAGEL 459

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
           LPL+ N+   +A+ GP  N        Y  +     ++L G++      ++  Y  GC  
Sbjct: 460 LPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCDL 519

Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
                      D P   D  A   +AV  A++AD  IVV G       E++ R SL LPG
Sbjct: 520 VDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLPG 579

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q+ L+ ++  T K PV+L+L  G PL +++A  D  + +IL   YPG  G  ALA+I+F
Sbjct: 580 RQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADILF 636

Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
           GD+NPGG+L +T +P++  ++P    N     S Q  G      TG        +Y FG+
Sbjct: 637 GDYNPGGKLTVT-FPKTVGQIP---FNFPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 692

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT + Y      S+L I+  +                    I    S T     V++
Sbjct: 693 GLSYTTFEY------SDLDITPRV--------------------ITPNESAT-----VRL 721

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            VTN G   G  VV L+ R    S  T EK L GF R+H     ++E+SF +D  + L +
Sbjct: 722 KVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFTID-RKHLEL 780

Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIETY 808
            +   + ++  G+ VLM G       L  +LT+E Y
Sbjct: 781 LDADMKWVVEPGDFVLMAGASSEDIRLNGTLTVEDY 816


>gi|404405497|ref|ZP_10997081.1| glycoside hydrolase family protein [Alistipes sp. JC136]
          Length = 804

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 206/711 (28%), Positives = 313/711 (44%), Gaps = 118/711 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  E+ +E +HG+             S  T  P  +   +++NR+L    G     
Sbjct: 145 RLGIPV-EFTNEGIHGLNH-----------SRATPLPAPIAIGSTWNRALVHRAGEIAGH 192

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EAR    LG   +  +AP +++ RDPRWGR  E  GEDP +++   VE V+  Q +    
Sbjct: 193 EARV---LGYKNV--YAPILDVARDPRWGRVVECYGEDPFLIAELGVEMVRGIQSQG--- 244

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                        +++  KH  AY + K G       +  I  +
Sbjct: 245 -----------------------------VASTLKHYAAYSVPKGGRDGNCRTDPHIAPR 275

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCD 328
           +    +  PFR  I +     +M SYN  +GVP    R  L    R+E+GF GY+ SD +
Sbjct: 276 ELHQMYLYPFRRVIRESGPMGVMSSYNDWDGVPVTASRYFLTDLLRHEYGFDGYVVSDSE 335

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDRA 384
           AV  +       +T+ED+   VL+AG+++          +   +  + +G++  + +D+ 
Sbjct: 336 AVEYVHTKHAVAETYEDAVRQVLEAGLNVRTNFSPPARFILPVRKLVREGRLSMEVVDQR 395

Query: 385 LLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  +  V+ RLGLF+    DPR+    + G D     +H+   LD  RQ +VLLKN+ K 
Sbjct: 396 VREVLRVKFRLGLFDNPYNDPREA-VAEAGAD-----KHRDFVLDIQRQSLVLLKNEDKT 449

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHD 498
           LPL+K   + + + GPL +  + M   Y        ++L G+  Y+   ++  YA GC  
Sbjct: 450 LPLDKKKTARVLVAGPLADEDNFMISRYGPNDLPTVTVLDGIRNYLGDGAEVRYAKGCDV 509

Query: 499 VPCN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
           V                   AG +EAV+ A   D ++ V G D  +  E   R SL LPG
Sbjct: 510 VDAGFPDSELTATPLTAAERAGINEAVKQAAGCDVIVAVLGEDDERVGESHSRTSLELPG 569

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q  L+ ++  T   PV+LVL  G PL V++A  +  + +IL   +P   G  A+AE +F
Sbjct: 570 RQQQLLEALHATGV-PVVLVLINGQPLTVNWAAQN--VPAILEGWFPSVEGGTAIAETLF 626

Query: 605 GDFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
           GD+NPGG+L +T +P S  ++ +N            R+ P           +Y FG+GLS
Sbjct: 627 GDYNPGGKLTIT-FPRSTGQIELNFPYKKGSHGAQPRKGPNGGGVTRVLGSIYPFGYGLS 685

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT ++YK L    E                                S T   F V   VT
Sbjct: 686 YTTFAYKNLRIAPE-------------------------------PSRTQGSFRVSCEVT 714

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
           N GD  G  VV L+      S  T E  L GF+RV      +K +SF V P
Sbjct: 715 NTGDRRGDEVVQLYISDKFSSVVTYESVLRGFERVTLEPGETKTVSFEVTP 765


>gi|254786805|ref|YP_003074234.1| glycoside hydrolase family 3 domain-containing protein
           [Teredinibacter turnerae T7901]
 gi|237686035|gb|ACR13299.1| glycoside hydrolase family 3 domain protein [Teredinibacter
           turnerae T7901]
          Length = 888

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 161/460 (35%), Positives = 238/460 (51%), Gaps = 62/460 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + +T+L I TR   L+S + L EKI Q+ + + AI  LGI  Y+WW+E+LHG+A  G   
Sbjct: 47  YMDTTLDIDTRVDDLVSRMDLAEKISQMYNESPAIEHLGIAEYDWWNEALHGVARAG--- 103

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAGLTFW 165
                    T FPQ +  AA ++R    +I  AV+ EARA ++            GLTFW
Sbjct: 104 -------KATVFPQAIGMAAMWDRETMFDIAEAVSDEARAKHHYFVENGVHFRYTGLTFW 156

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
           +PNINIFRDPRWGRGQET GEDP +    A+ ++   QGEN K                 
Sbjct: 157 SPNINIFRDPRWGRGQETYGEDPYLTGELALPYISGLQGENPKY---------------- 200

Query: 226 GFGEESDRGDELMLSACCKHLIAYD-LEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                      L  +A  KH   +   EK    SR+S N + + +D  +T+ P F   + 
Sbjct: 201 -----------LKTAAMAKHFAVHSGPEK----SRHSDNYIASPKDLNETYLPAFEKAVV 245

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIF--EYQNYTK 341
           +G    +MC+YN+VN  PAC    L ++  R +WGFKG++ SDC A+A  +  E  +   
Sbjct: 246 EGDVESVMCAYNRVNDEPACGNDMLLKETLRGKWGFKGHVVSDCGAIADFYAPEAHHVVM 305

Query: 342 THEDSAAGVLKAGMDINCGTCMLR---HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
               +AA  +++G D+NCGT  L    +   A+ +  + + +ID+++  L   + +LG+F
Sbjct: 306 APAAAAAWAVRSGTDLNCGTDRLSTFANLHFALQREMITQDEIDQSVKRLMKTRFKLGMF 365

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           + D +   Y K+  D V +  H  L   AA +  VLLKN    LPL K+  S +AIIGP 
Sbjct: 366 DPDDQV-PYSKIPMDVVGSQAHLALTQKAAEKSFVLLKN-SGILPLKKS--SKVAIIGPN 421

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
             N + + G Y G P  P + L G++ Y+ + +  YA G 
Sbjct: 422 ATNPTVLVGNYFGDPIKPVTPLDGIQQYLGEENVFYAPGS 461



 Score = 98.6 bits (244), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 79/265 (29%), Positives = 128/265 (48%), Gaps = 50/265 (18%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q  L+ ++ + +K P++LV   G  + +++A  ++ + +IL   YPGEA 
Sbjct: 650 DRTDIRLPEPQRKLLATLKKLNK-PIVLVNFSGSAIALNWA--NNNVDAILQGFYPGEAT 706

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVY 654
             ALA I++G+ +P GRLP+T+Y  S   +P   D  M          R+Y++Y G  +Y
Sbjct: 707 GTALARILWGEVSPSGRLPITFY-RSLDDLPGFKDYAMT--------NRTYKYYQGDVLY 757

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT ++Y  LSAP      A++ +G    I  Q                     
Sbjct: 758 PFGYGLSYTQFAYSELSAP------ATMASGEPLAITAQ--------------------- 790

Query: 715 HVQISVTNAGDVDGSHVVMLFA--RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
                V+N+G V    VV ++   +VP +S   P+++L  F R++     S+ + F +  
Sbjct: 791 -----VSNSGKVASDEVVQVYVSMKVPGLS--LPQRELKEFKRIYLEPGASQTVEFSI-A 842

Query: 773 CEQLSIANKHGRRILPLGNHVLMVG 797
            + LS  +  G R    G   L VG
Sbjct: 843 GKDLSYVDDQGVRHPYHGPLTLSVG 867


>gi|160885419|ref|ZP_02066422.1| hypothetical protein BACOVA_03419 [Bacteroides ovatus ATCC 8483]
 gi|156109041|gb|EDO10786.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 861

 Score =  258 bits (660), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 167/519 (32%), Positives = 252/519 (48%), Gaps = 69/519 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   R + L+  LTL+EK+  + + + AIPRLGI  YEWW+E+LHG+      
Sbjct: 25  PYQDTSLAAEQRTEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL--------GQAGLTF 164
               G     T FPQ +   ASFN SL   + +A + EAR    +           GLTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGDSGVLKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  Q                     
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQ--------------------- 173

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              G E  R D+  L AC KH   +   +W   +R+SF+A  I  +D  +T+ P F+  +
Sbjct: 174 ---GPEDARYDK--LHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+  G P C    L  Q  R+EWG++G + SDC A++  +    +  T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYEGIVVSDCGAISDFYRPGTH-GT 284

Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H D   ++AG ++AG D+ CG+        A+  G + EK+ID +L  L + +  LG  +
Sbjct: 285 HPDKEHASAGAVRAGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
               +  + ++    + + EH+ LAL  AR+ +VLL+N    LPLN +    +A++GP  
Sbjct: 344 ---EQSAWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
           N+     G Y GIP    +LL  + A + +    Y  GC  V   +     +   I  K 
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
            F        L +   +RDR   ++   Q+S     A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489



 Score = 90.1 bits (222), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 70/268 (26%), Positives = 118/268 (44%), Gaps = 53/268 (19%)

Query: 511 VRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRP 560
           V+    AD ++   G+  + E E+           DR  + LP  Q  L+ ++ +  K+ 
Sbjct: 592 VKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKKV 651

Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
           V +  +G     +      +   +IL   YPG+AG  A+ + ++G++NPGGRLP+T+Y +
Sbjct: 652 VFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD 708

Query: 621 SFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
                   D +M+        GR+YR+     ++ FGHGLSYT+++Y       E  +S 
Sbjct: 709 VNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLSK 754

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
           +  A  +  +L                          I V+N G  DG  VV ++ R P 
Sbjct: 755 NTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRPG 789

Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISF 768
             +G P   L  F RVH  A  ++ ++ 
Sbjct: 790 DKEG-PRYTLRAFKRVHIPAGKTESVAI 816


>gi|329922637|ref|ZP_08278189.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328941979|gb|EGG38262.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 765

 Score =  258 bits (659), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 215/762 (28%), Positives = 352/762 (46%), Gaps = 135/762 (17%)

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           E +  +   A    RLGIP      E  HG  + G      GTV     FP  L   +++
Sbjct: 89  EAVNHIQRYAIEQSRLGIPIL-IGEECSHGHMAIG------GTV-----FPVPLSIGSTW 136

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
           N  L+ ++  AVA+E R+     Q G   ++P +++ RDPRWGR +E  GEDP ++S YA
Sbjct: 137 NLDLYRDMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEYA 191

Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
           V  V+  QGE+  S                             ++A  KH + Y   + G
Sbjct: 192 VASVEGLQGESLDSPSS--------------------------VAATLKHFVGYGSSEGG 225

Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
              R +    +  ++  +    PF+  +E G AS IM +YN+++GVP  +  +L     R
Sbjct: 226 ---RNAGPVHMGTRELMEVDMLPFKKAVEAGAAS-IMPAYNEIDGVPCTVNTELLDGILR 281

Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDK 373
            EWGF G + +DC A+  +    +  +   D+A   ++AG+D+     M  +H Q A++ 
Sbjct: 282 KEWGFDGMVITDCGAIDMLASGHDTAEDGMDAAVQAIRAGIDLEMSGEMFGKHLQKAVES 341

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
            K++   +D A+  + +++ +LGLF     DP+  +      + + + +H  LA   A +
Sbjct: 342 NKLEVSVLDEAVRRVLTLKFKLGLFENPYVDPQTAE------NVIGSGQHIGLARQLAAE 395

Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG--IPCSPKSLLRGLEAYV- 487
           GIVLLKN+ K LPL+K     +A+IGP  +      G YT    P +  ++L G+ A + 
Sbjct: 396 GIVLLKNEAKALPLSKEG-GVIAVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLG 454

Query: 488 ---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----------LDL----T 529
               +  YA GC  +  +S  GF  A+  A++AD V++V G           +DL    +
Sbjct: 455 EEAQRVLYAPGCR-IKDDSREGFEFALSCAEQADTVVMVLGGSSARDFGEGTIDLRTGAS 513

Query: 530 QETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           + T+D           DR++L L G Q+ L   + +  KR +I+V   G P+   +   D
Sbjct: 514 KVTDDALSDMDCGEGIDRMTLQLSGVQLDLAQEIHKLGKR-MIVVYINGRPIAEPW--ID 570

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQ 639
               +IL   YPG+ G  A+A+I+FGD NP G+L M+  P+   ++P+     R+     
Sbjct: 571 EHADAILEAWYPGQEGGHAIADILFGDVNPSGKLTMS-IPKHVGQLPVYYNGKRSR---- 625

Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLD 699
             G+ Y        Y FG+GLSYT +SY  +    E+        G+D   +        
Sbjct: 626 --GKRYLEEDSQPRYPFGYGLSYTEFSYSDIQMTPEVI-------GTDGTAV-------- 668

Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
                           V ++VTN+GD +GS VV L+          P ++L GF ++   
Sbjct: 669 ----------------VSVNVTNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKISLQ 712

Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRH 801
               +++ F + P EQL    +  R+++  G   +M+G  RH
Sbjct: 713 PGERRKVEFTIGP-EQLQYIGQDYRQVVEPGLFRVMLG--RH 751


>gi|298482082|ref|ZP_07000270.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298271639|gb|EFI13212.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 863

 Score =  258 bits (659), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 176/551 (31%), Positives = 259/551 (47%), Gaps = 79/551 (14%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YP+ +T L+   RA  L+  LTL+EK+  + +N+ AIPRLGI  YEWW+E+LHG+A  G 
Sbjct: 26  YPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVARAG- 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----AGLT 163
                      T FPQ +  AASFN  L   +  AV+ EARA        GQ     GLT
Sbjct: 85  ---------LATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRYQGLT 135

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PN+NIFRDPRWGRGQET GEDP +     +  V+  QG     +D            
Sbjct: 136 MWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAVVRGLQG----PEDA----------- 180

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                 E D+     L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F+  
Sbjct: 181 ------EYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKEL 226

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++     +MC+YN+  G P C    L  Q  RN+WGFKG + +DC A+   F+ + + +
Sbjct: 227 VQKAGVKEVMCAYNRFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH-E 285

Query: 342 THED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH D   ++A  +  G D+ CG    +    A+ KG + E+ I+ ++  L   +  LG  
Sbjct: 286 THPDAVHASADAVLNGTDLECGG-NFKSITDAVKKGLISEEKINTSVKRLLKARFELGEM 344

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           N       +  +    +   +HK+LAL  A + +VLL+N    LPLN+     +A+IGP 
Sbjct: 345 N---PTHPWSNIPYSVIDCPKHKELALKMAHESLVLLQNKNNILPLNRQM--KVAVIGPN 399

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA--------------YVSKTHYASGCHDVPCNSD 504
            N+     G Y G P    +LL G+ A              Y + T   S  +    +  
Sbjct: 400 ANDSVMQWGNYNGFPSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLFNQCSIDGK 459

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP---- 560
           AGF       ++    I       T      +  ++  PG  +   T++ R++ RP    
Sbjct: 460 AGFSATYWNNREYKGKIAATDRLTTPFHFSAEGSTVFAPGVGLKNFTAIYRSTFRPTDSG 519

Query: 561 --VILVLTGGG 569
                V+T GG
Sbjct: 520 AATFRVMTNGG 530



 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 88/311 (28%), Positives = 140/311 (45%), Gaps = 57/311 (18%)

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSK 558
           E +   K AD VI   G+    E E            DR  + LP  Q  ++  + +  K
Sbjct: 592 EVLNKLKNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVLALLKKNGK 651

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
           + V +  +G     ++         +IL   YPG+AG  A+A+++FGD+NP GRLP+T+Y
Sbjct: 652 KTVFVNFSGSA---MAIVPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFY 708

Query: 619 PESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
            +S  ++P   D +M+        GR+YRF T T +Y FG+GLSYT +SY       + T
Sbjct: 709 -KSIQQLPDYEDYSMK--------GRTYRFMTETPLYPFGYGLSYTRFSY------GKAT 753

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
           ++ S     +K IL                          I V+N G  DG  VV ++  
Sbjct: 754 LNQSKLNKGEKAIL-------------------------TIPVSNVGQRDGEEVVQVYIC 788

Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL-GNHVLMV 796
            P   +G P+K L GF RV+     ++ +S  + P +     +     I PL G + ++ 
Sbjct: 789 RPDDKEG-PQKTLRGFQRVNIAKGKTQNVSIEL-PYDSFEWFDTATNTIRPLSGTYKILY 846

Query: 797 GELRHSLTIET 807
           G   +   ++T
Sbjct: 847 GNSSNENDLQT 857


>gi|346226088|ref|ZP_08847230.1| glycoside hydrolase family 3 domain protein [Anaerophaga
           thermohalophila DSM 12881]
          Length = 749

 Score =  258 bits (659), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 210/748 (28%), Positives = 336/748 (44%), Gaps = 128/748 (17%)

Query: 50  DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
           +SYPF N  L    R   L+S +TL EK+  LS + S +PRLG+       E  HG+A  
Sbjct: 50  ESYPFQNPELDSEARIDDLLSRMTLDEKVSALSTDPS-VPRLGVKGAPH-IEGYHGVAMG 107

Query: 110 GPG--VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---LGQAGLTF 164
           GP             T+FPQ     A++N  L    G   ++EAR ++    + + GL  
Sbjct: 108 GPANWAPKGDEAVPTTTFPQAYGMGATWNPELIRLAGEIESIEARYIFQNPEIAKGGLVV 167

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN---WKSDDGGIGFGFREK 221
            APN ++ RDPRWGR +E  GEDP +V   A  F K  QG++   W++            
Sbjct: 168 RAPNADLGRDPRWGRTEECFGEDPFLVGTSATAFTKGLQGDDDQYWRT------------ 215

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRS 281
                             ++  KH +A   E     S   F+  +  +     +   FR 
Sbjct: 216 ------------------ASLLKHFLANSNENGRESSSSDFDMQLYHE----YYGASFRR 253

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
              +G ++  M +YN +NGVPA +     +     WG  G   +D      +     Y  
Sbjct: 254 AFIEGGSNAYMAAYNAINGVPAHVHDMHKEITERMWGVDGIKCTDGGGYQLLVYGHKYYD 313

Query: 342 THEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
               +A GV+KAG++             A+  G + E DID  L  ++ V ++LG    D
Sbjct: 314 DLYLAAEGVIKAGLNQFLDNYR-EGVYGALAHGYITEADIDEVLRGVYRVMIKLGQL--D 370

Query: 402 PRK-------GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           P++       G+ GK  P    T +HK  AL  AR+ IVLLKN+ K LPLN + ++ +A+
Sbjct: 371 PQEKVPYSAIGRDGKPAP--WTTQKHKDAALRMARESIVLLKNNNKTLPLNADKLNKVAV 428

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAV 511
           IG L + +  +   Y+G+P    + L G+   +   SK  YA          D  ++ AV
Sbjct: 429 IGYLADTV--LLDWYSGLPPYRITPLEGIREKLGNDSKVLYA---------PDNDYNAAV 477

Query: 512 RIAKKADFVIVVAG-------------LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSK 558
             A +AD  IV+ G              D     E  DR +L L  + +  +   A  + 
Sbjct: 478 EAASEADVAIVILGNYPTCNSEIWADCPDPGMGREAIDRKTLRLTDEYLVKLVMEANPN- 536

Query: 559 RPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWY 618
              I VL    P  +++++ +  + +IL + + G+    ALA+++FGD+NPGG+L  TW 
Sbjct: 537 --TIFVLQSSFPYAINWSQQN--VPAILHLTHNGQETGSALADVLFGDYNPGGKLTQTW- 591

Query: 619 PESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
           P+S  ++P M + ++R        G +Y ++    +Y FGHGLSYT ++++      +++
Sbjct: 592 PKSEDQLPDMMEYDIRK-------GHTYMYFEDKPLYPFGHGLSYTTFAWE------DIS 638

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
           I+  + +  D+ ++                        + + + N GDV G  VV L+A 
Sbjct: 639 INKPVVSADDEEVI------------------------ITVKLKNTGDVKGDEVVQLYAS 674

Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKE 765
            P+ +   P K L GF RV T+  G K+
Sbjct: 675 FPESTVRRPAKALKGFKRV-TLEPGEKK 701


>gi|293370605|ref|ZP_06617157.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292634339|gb|EFF52876.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 861

 Score =  258 bits (659), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 167/519 (32%), Positives = 253/519 (48%), Gaps = 69/519 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   RA+ L+  LTL+EK+  + + + AIPRLGI  YEWW+E+LHG+      
Sbjct: 25  PYQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-GQAG-------LTF 164
               G     T FPQ +   ASFN SL   + +A + EAR    + G++G       LTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  QG     +D G           
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQG----PEDAGYD--------- 181

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
                         L AC KH   +   +W   +R+SF+A  I  +D  +T+ P F+  +
Sbjct: 182 -------------KLHACAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+  G P C    L  Q  R+EWG+KG + SDC A++  +    +  T
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-GT 284

Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H D   ++A  ++ G D+ CG+        A+  G + EK+ID +L  L + +  LG  +
Sbjct: 285 HPDKEHASAAAVRTGTDLECGS-EYASLADAVKAGLIDEKEIDISLKRLLTARFELGEMD 343

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
               +  + ++    + + EH+ LAL  AR+ +VLL+N    LPLN +    +A++GP  
Sbjct: 344 ---EQSAWSEIPTSVLNSKEHQALALRMARESLVLLQNKNNILPLNTHL--KVAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAKKA 517
           N+     G Y GIP    +LL  + A + +    Y  GC  V   +     +   I  K 
Sbjct: 399 NDSVMQWGNYNGIPAHTVTLLEAVRAKLPEGQIIYEPGCDRVDGKTLQSLFDECSINGKP 458

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
            F        L +   +RDR   ++   Q+S     A T
Sbjct: 459 GF--------LAEYWNNRDREGEVVATDQISTPFHFATT 489



 Score = 91.7 bits (226), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 119/269 (44%), Gaps = 53/269 (19%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           AV+    AD ++   G+  + E E+           DR  + LP  Q  L+ ++ +  K+
Sbjct: 591 AVKRVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKAGKK 650

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            V +  +G     +      +   +IL   YPG+AG  A+ + ++G++NPGGRLP+T+Y 
Sbjct: 651 VVFINYSGSA---IGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYK 707

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
           +        D +M+        GR+YR+     ++ FGHGLSYT+++Y       E  +S
Sbjct: 708 DVNQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTDFTY------GEAKLS 753

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            +  A  +  +L                          I V+N G  DG  VV ++ R P
Sbjct: 754 KNTIAKGENVVL-------------------------TIPVSNVGQRDGEEVVQVYLRRP 788

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
              +G P   L  F RVH  A  ++ ++ 
Sbjct: 789 GDKEG-PRYTLRAFKRVHIPAGKTESVAI 816


>gi|224538725|ref|ZP_03679264.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224519667|gb|EEF88772.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 942

 Score =  258 bits (659), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 210/754 (27%), Positives = 341/754 (45%), Gaps = 129/754 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   IG     
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRELIRQIGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH +AY   K         +  ++ 
Sbjct: 253 -----------------------------QVAATGKHFVAYSNNKGAREGMARVDPQMSP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
           ++ E     PF+  I++     +M SYN  +GVP  ++G  +    + R E GF+GY+ S
Sbjct: 284 REVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVP--IQGSYYWLTTRLRGEMGFRGYVVS 341

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DAV  ++   +  K  +++    ++AG+++ C        +   +  + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVI 401

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+  +GLF+   +    G     +V  +E++ LAL A+R+ +VLLKN+   
Sbjct: 402 NDRVRDILRVKFLVGLFDTPYQTDLAG--ADKEVEKAENESLALQASRESLVLLKNENNV 459

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHD 498
           LPL+ N V  +A+ GP  +        Y  +     ++L G+       ++  Y  GC  
Sbjct: 460 LPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEVLYTKGCDL 519

Query: 499 VPCN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
           V  N                A   +AV  A++AD  +VV G       E++ R SL LPG
Sbjct: 520 VDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLELPG 579

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q+ L+ +V  T K PV+LVL  G PL +++A  D  + +IL   YPG  G  A+A+++F
Sbjct: 580 RQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGTAVADVLF 636

Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
           GD+NPGG+L +T +P+S  ++P    N     S Q  G       G        +Y FG+
Sbjct: 637 GDYNPGGKLTVT-FPKSVGQIP---FNFPCKPSSQIDGGKNPGLDGNMSRVNGALYSFGY 692

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT + Y      S++ IS  +   + K                           V+ 
Sbjct: 693 GLSYTTFEY------SDIEISPKVITPNQKAT-------------------------VRC 721

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            VTN G   G  VV L+ R    S  T EK L GF+R+H     +KE+ F +D  +QL +
Sbjct: 722 KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFTLD-RKQLEL 780

Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIE 806
            +KH   ++  G+  +M+G       L   LT+E
Sbjct: 781 LDKHMEWVVEPGDFSIMIGASSEDIRLSGKLTVE 814


>gi|294675359|ref|YP_003575975.1| 1,4-beta-xylosidase [Prevotella ruminicola 23]
 gi|225016052|gb|ACN78955.1| xylosidase/arabinofuranosidase [Prevotella ruminicola]
 gi|294472720|gb|ADE82109.1| putative 1,4-beta-xylosidase [Prevotella ruminicola 23]
          Length = 861

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 163/462 (35%), Positives = 234/462 (50%), Gaps = 60/462 (12%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           + P+ N +LS   RA  L S LTL+EK   + D + AIPRLGI  + WWSE+LHG A+ G
Sbjct: 21  TLPYQNPNLSAKERAVDLCSRLTLEEKAMLMLDESPAIPRLGIKKFFWWSEALHGAANMG 80

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN----------LGQA 160
                     +VT+FP+ +  AASFN  L   +    + E RA YN          +   
Sbjct: 81  ----------NVTNFPEPVGMAASFNPHLLFKVFDIASTEFRAQYNHRMYDLNGEDMKMR 130

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
            L+ W PN+NIFRDPRWGRGQET GEDP + S   V+ VK  QG               +
Sbjct: 131 SLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVQVVKGLQGP-------------ED 177

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPF 279
            R  K             L AC KH   Y +     ++R++ N   ++ +D  +T+ P F
Sbjct: 178 ARYRK-------------LWACAKH---YAVHSGPEYTRHTANLTDVSARDFWETYMPAF 221

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           ++ ++  K   +MC+Y +++  P C    L Q+  R+EWGF+  + SDC AV+  +E   
Sbjct: 222 KTLVKDAKVREVMCAYQRLDDDPCCGSTRLLQQILRDEWGFEYLVVSDCGAVSDFYENH- 280

Query: 339 YTKTHEDSAAGVLK---AGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLR 394
             K+  D+  G  K   AG D+ CG     +    A+ KG + EK++D+ ++ L   +  
Sbjct: 281 --KSSSDAVHGTSKAVLAGTDVECGFNYAYKSLPEAVRKGLLSEKEVDKHVIRLLEGRFD 338

Query: 395 LGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           LG  + DP   ++ K+    + T     +ALD ARQ IVLL+N    LPL KNA   +AI
Sbjct: 339 LGEMD-DPSLVEWSKIPYSAMSTKASANVALDMARQTIVLLQNKNNILPLKKNA-EKIAI 396

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
           IGP  +N   M G Y G P    ++L G++A   K  Y  GC
Sbjct: 397 IGPNAHNEPMMWGNYNGTPNHTVTILDGVKAKQKKLVYIPGC 438



 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 73/282 (25%), Positives = 121/282 (42%), Gaps = 56/282 (19%)

Query: 489 KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRV 538
           KT  AS   DV    +  + E +   K  + VI   G+  + E E+           DR 
Sbjct: 574 KTWGASMKIDVARELNIDYQETIAQLKGINKVIFCGGIAPSLEGEEMPVNIEGFKGGDRT 633

Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
           S+ LP  Q   + ++    K+ + +  +G     ++         +I+   YPG+ G  A
Sbjct: 634 SIELPKVQREFLKALKAAGKQVIYVNCSGSA---IALQPETESCDAIVQAWYPGQEGGTA 690

Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
           +A+++FGD+NPGG+L +T+Y         ND  +         GR+YR++    ++ FG+
Sbjct: 691 VADVLFGDYNPGGKLSVTFYK--------NDQQLPDYEDYSMKGRTYRYFDDA-LFPFGY 741

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT +              A ++A +D  +                       ++VQI
Sbjct: 742 GLSYTTFEVG----------EAKVEAATDGAL-----------------------YNVQI 768

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
            VTN G  +GS  + L+ R  +   G P K L GF+R+   A
Sbjct: 769 PVTNTGTKNGSETIQLYIRNLQDPDG-PLKSLRGFERLDIKA 809


>gi|160892207|ref|ZP_02073210.1| hypothetical protein BACUNI_04671 [Bacteroides uniformis ATCC 8492]
 gi|156858685|gb|EDO52116.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           uniformis ATCC 8492]
          Length = 990

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 212/756 (28%), Positives = 343/756 (45%), Gaps = 129/756 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 201 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRELIRQVGLITGR 248

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 249 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH-- 300

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH  AY   K         +  ++ 
Sbjct: 301 -----------------------------QVAATGKHFAAYSNNKGAREGMARVDPQMSP 331

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
           ++ E+    PF+  I +     +M SYN  +G+P  ++G  +    + R E GF+GY+ S
Sbjct: 332 REVENIHIYPFKRVIREAGMLGVMSSYNDYDGIP--VQGSYYWLTTRLRGEMGFRGYVVS 389

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DAV  ++      K  +++    ++AG+++ C        +   +  + +G + E+ I
Sbjct: 390 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVI 449

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+  +GLF+  P +         +V   E++ +AL A+R+ IVLLKN  + 
Sbjct: 450 NDRVRDILRVKFLIGLFDA-PYQTDLAD-ADREVEKEENEAIALQASRESIVLLKNAGEL 507

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
           LPL+ N+   +A+ GP  N        Y  +     ++L G++      ++  Y  GC  
Sbjct: 508 LPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCDL 567

Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
                      D P   D  A   +AV  A++AD  IVV G       E++ R SL LPG
Sbjct: 568 VDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLPG 627

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q+ L+ ++  T K PV+L+L  G PL +++A  D  + +IL   YPG  G  ALA+I+F
Sbjct: 628 RQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADILF 684

Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
           GD+NPGG+L +T +P++  ++P    N     S Q  G      TG        +Y FG+
Sbjct: 685 GDYNPGGKLTVT-FPKTVGQIP---FNFPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 740

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT + Y      S+L I+  +                    I    S T     V++
Sbjct: 741 GLSYTTFEY------SDLDITPRV--------------------ITPNESAT-----VRL 769

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            VTN G   G  VV L+ R    S  T EK L GF R+H     ++E+SF +D  + L +
Sbjct: 770 KVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFTID-RKHLEL 828

Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIETY 808
            +   + ++  G+ VLM G       L  +LT+E Y
Sbjct: 829 LDADMKWVVEPGDFVLMAGASSEDIRLNGTLTVEDY 864


>gi|317480750|ref|ZP_07939836.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
 gi|316903091|gb|EFV24959.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
          Length = 942

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 212/756 (28%), Positives = 343/756 (45%), Gaps = 129/756 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRELIRQVGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH  AY   K         +  ++ 
Sbjct: 253 -----------------------------QVAATGKHFAAYSNNKGAREGMARVDPQMSP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
           ++ E+    PF+  I +     +M SYN  +G+P  ++G  +    + R E GF+GY+ S
Sbjct: 284 REVENIHIYPFKRVIREAGMLGVMSSYNDYDGIP--VQGSYYWLTTRLRGEMGFRGYVVS 341

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DAV  ++      K  +++    ++AG+++ C        +   +  + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVI 401

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+  +GLF+  P +         +V   E++ +AL A+R+ IVLLKN  + 
Sbjct: 402 NDRVRDILRVKFLIGLFDA-PYQTDLAD-ADREVEKEENEAIALQASRESIVLLKNAGEL 459

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
           LPL+ N+   +A+ GP  N        Y  +     ++L G++      ++  Y  GC  
Sbjct: 460 LPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAEVLYTKGCDL 519

Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
                      D P   D  A   +AV  A++AD  IVV G       E++ R SL LPG
Sbjct: 520 VDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENKSRTSLDLPG 579

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q+ L+ ++  T K PV+L+L  G PL +++A  D  + +IL   YPG  G  ALA+I+F
Sbjct: 580 RQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADILF 636

Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
           GD+NPGG+L +T +P++  ++P    N     S Q  G      TG        +Y FG+
Sbjct: 637 GDYNPGGKLTVT-FPKTVGQIP---FNFPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 692

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT + Y      S+L I+  +                    I    S T     V++
Sbjct: 693 GLSYTTFEY------SDLDITPRV--------------------ITPNESAT-----VRL 721

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            VTN G   G  VV L+ R    S  T EK L GF R+H     ++E+SF +D  + L +
Sbjct: 722 KVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFTID-RKHLEL 780

Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIETY 808
            +   + ++  G+ VLM G       L  +LT+E Y
Sbjct: 781 LDADMKWVVEPGDFVLMAGASSEDIRLNGTLTVEDY 816


>gi|383123909|ref|ZP_09944579.1| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
 gi|382983834|gb|EES66944.2| hypothetical protein BSIG_4072 [Bacteroides sp. 1_1_6]
          Length = 815

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 216/742 (29%), Positives = 331/742 (44%), Gaps = 137/742 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L   +G+ +A 
Sbjct: 160 RLGIPLF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPVLIEEVGNVIAK 207

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R+     Q     + P +++ RDPRW R +ET GEDP++        V          
Sbjct: 208 EIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMV---------- 252

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
               IG G              D   E    A  KH +AY + + G    Y   A +  +
Sbjct: 253 ----IGLG------------SGDLSREYATIATLKHFLAYAVPEGGQNGNY---ASVGTR 293

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
           D  + F PPF+  I+ G  S +M SYN ++G+P      L  Q  RNEW F+G++ SD  
Sbjct: 294 DLHENFLPPFQEAIDAGALS-VMTSYNSIDGIPCTANYYLLTQLLRNEWRFRGFVVSDLY 352

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  + E      T E++A  V+ AG+DI+ G     +   A+  GK+ E  ID A+  +
Sbjct: 353 SIEGVHESHFVAPTIEEAAMQVVSAGVDIDLGGNAFMNLTHAVQSGKISEAVIDTAVCRV 412

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     +P+           V + EH +LA   A+  IVLLKN    LPLN
Sbjct: 413 LRMKFEMGLFEHPYVNPKSAT------KVVRSEEHIRLAHKVAQSSIVLLKNKNSILPLN 466

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV-----SKTHYASGC--HD 498
           K  +  +A++GP  +N   M G YT  P   +++   L+  +     SK  Y  GC   D
Sbjct: 467 KK-IKKVAVVGPNADNRYNMLGDYTA-PQEDENIKTVLDGVISKLSPSKVEYVRGCAIRD 524

Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDR 535
              N  A   E V  A +++ +I V G     D     QET                E  
Sbjct: 525 TTVNEIA---EVVEAASRSEVIIAVVGGSSARDFKTSYQETGAAIADEKSISDMECGEGF 581

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR +L L G+Q  L+ ++  T K P+I+V   G PLD  +  A     ++L   YPG+ G
Sbjct: 582 DRATLTLLGKQQDLLNALKATGK-PLIVVYIEGRPLDKVW--ASEYADALLTASYPGQEG 638

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A+++FGD+NP GRLP++  P S  ++P+   N +A  +  Y  ++      + +Y 
Sbjct: 639 GYAIADVLFGDYNPAGRLPVS-VPRSVGQIPVY-YNKKAPCNHDYVEQA-----ASPLYT 691

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT + Y                  SD  ++++             + C    F 
Sbjct: 692 FGYGLSYTTFEY------------------SDLQVIRK-------------SPC---YFE 717

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V   V N G  DG  V  L+ R    S   P +QL  F+R        KEI F +   + 
Sbjct: 718 VSFKVKNTGSYDGEEVAQLYLRDEYASVVQPLRQLKCFERFFLKRGEEKEIFFTLTE-KD 776

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LSI +++  R++  G+  +M+G
Sbjct: 777 LSIIDRNMARVVETGDFRIMIG 798


>gi|294647557|ref|ZP_06725134.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294807095|ref|ZP_06765914.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345508184|ref|ZP_08787819.1| periplasmic beta-glucosidase [Bacteroides sp. D1]
 gi|292637099|gb|EFF55540.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294445794|gb|EFG14442.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345455214|gb|EEO50370.2| periplasmic beta-glucosidase [Bacteroides sp. D1]
          Length = 783

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 215/749 (28%), Positives = 339/749 (45%), Gaps = 132/749 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L   +G A+  
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R      Q G   + P +++ RDPRW R +ET GEDP++             GE  K+
Sbjct: 179 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLT------------GEIGKA 221

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
              G+G G              D        A  KH +AY + + G     SF  +   +
Sbjct: 222 MVEGLGGG--------------DLSHPYSTLATLKHFLAYGISESGQNGNPSFAGI---R 264

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           +  + F PPFR  I+ G A  +M SYN ++GVP      L  +  RNEW F+G + SD  
Sbjct: 265 ELHENFLPPFRQAIDAG-ALSVMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLY 323

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I +      T E++A   L AG+D++ G     +  +A++ G++ +  +D ++  +
Sbjct: 324 SIEGIHQSHFVAPTMEEAAILALSAGVDVDLGGDAYMNLMNAVNTGRISKTALDASVARV 383

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP K K       +V + E   LA   A+  I LLKN+   LPLN
Sbjct: 384 LRLKFEMGLFENPYVDPEKAK------KEVRSEESVTLARRVAQASITLLKNEHSLLPLN 437

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSP--KSLLRGLEAYV--SKTHYASGCHDVPC 501
           KN    +A+IGP  +N   M G YT        K++L G+ A +  S+  Y  GC  +  
Sbjct: 438 KN--RKVALIGPNADNRYNMLGDYTAPQEEENIKTVLDGIRAKLSSSQVEYVKGC-SIRD 494

Query: 502 NSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRDRV 538
                  +AV  A++++ +I V G     D     +ET                E  DR 
Sbjct: 495 TVTTDIEQAVAAAQRSEVIIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRA 554

Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
           +L L G+Q  L+ ++  T K P+I+V   G PLD ++A  ++   ++L   YPG+ G  A
Sbjct: 555 TLSLLGKQQELLKALKATGK-PLIVVYIEGRPLDKNWASENA--DAVLTAYYPGQEGGIA 611

Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
           +A+++FGDFNP GRLP +  P S  ++P+   N +A  S  Y   S      + +Y FG+
Sbjct: 612 IADVLFGDFNPAGRLPFS-VPRSVGQIPLY-YNKKAPQSHDYVEMS-----ASPLYPFGY 664

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT++ Y      S+L +SA +                               F +  
Sbjct: 665 GLSYTSFDY------SDLHLSALMPRS----------------------------FEISF 690

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            V N G  DG  V  L+ R    S   P KQL  F R +      +E+ F +   E  S+
Sbjct: 691 KVRNTGKYDGEEVAQLYLRDEYASVVQPLKQLKHFARFYLKRGEEREVKF-ILSEEDFSL 749

Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
            +++ ++I+  G   +M+G   + + ++T
Sbjct: 750 VDRNLKKIVEPGTFQIMIGAASNDIRLQT 778


>gi|262405113|ref|ZP_06081663.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
 gi|262355988|gb|EEZ05078.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
          Length = 769

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 215/749 (28%), Positives = 339/749 (45%), Gaps = 132/749 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L   +G A+  
Sbjct: 117 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 164

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R      Q G   + P +++ RDPRW R +ET GEDP++             GE  K+
Sbjct: 165 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLT------------GEIGKA 207

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
              G+G G              D        A  KH +AY + + G     SF  +   +
Sbjct: 208 MVEGLGGG--------------DLSHPYSTLATLKHFLAYGISESGQNGNPSFAGI---R 250

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           +  + F PPFR  I+ G A  +M SYN ++GVP      L  +  RNEW F+G + SD  
Sbjct: 251 ELHENFLPPFRQAIDAG-ALSVMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLY 309

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I +      T E++A   L AG+D++ G     +  +A++ G++ +  +D ++  +
Sbjct: 310 SIEGIHQSHFVAPTMEEAAILALSAGVDVDLGGDAYMNLMNAVNTGRISKTALDASVARV 369

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP K K       +V + E   LA   A+  I LLKN+   LPLN
Sbjct: 370 LRLKFEMGLFENPYVDPEKAK------KEVRSEESVTLARRVAQASITLLKNEHSLLPLN 423

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSP--KSLLRGLEAYV--SKTHYASGCHDVPC 501
           KN    +A+IGP  +N   M G YT        K++L G+ A +  S+  Y  GC  +  
Sbjct: 424 KN--RKVALIGPNADNRYNMLGDYTAPQEEENIKTVLDGIRAKLSSSQVEYVKGC-SIRD 480

Query: 502 NSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRDRV 538
                  +AV  A++++ +I V G     D     +ET                E  DR 
Sbjct: 481 TVTTDIEQAVAAAQRSEVIIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRA 540

Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
           +L L G+Q  L+ ++  T K P+I+V   G PLD ++A  ++   ++L   YPG+ G  A
Sbjct: 541 TLSLLGKQQELLKALKATGK-PLIVVYIEGRPLDKNWASENA--DAVLTAYYPGQEGGIA 597

Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
           +A+++FGDFNP GRLP +  P S  ++P+   N +A  S  Y   S      + +Y FG+
Sbjct: 598 IADVLFGDFNPAGRLPFS-VPRSVGQIPLY-YNKKAPQSHDYVEMS-----ASPLYPFGY 650

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT++ Y      S+L +SA +                               F +  
Sbjct: 651 GLSYTSFDY------SDLHLSALMPRS----------------------------FEISF 676

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            V N G  DG  V  L+ R    S   P KQL  F R +      +E+ F +   E  S+
Sbjct: 677 KVRNTGKYDGEEVAQLYLRDEYASVVQPLKQLKHFARFYLKRGEEREVKF-ILSEEDFSL 735

Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
            +++ ++I+  G   +M+G   + + ++T
Sbjct: 736 VDRNLKKIVEPGTFQIMIGAASNDIRLQT 764


>gi|383110854|ref|ZP_09931672.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
 gi|313694427|gb|EFS31262.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
          Length = 861

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 158/460 (34%), Positives = 232/460 (50%), Gaps = 63/460 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ +TSL+   RA+ L+  LTL+EK+  + + + AIPRLGI  Y+WW+E+LHG+      
Sbjct: 25  PYQDTSLTAEQRAEDLLPRLTLEEKVALMQNASPAIPRLGIKEYDWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
               G     T FPQ +   ASFN SL   +  AV+ EAR    +           GLTF
Sbjct: 79  ----GRAGLATVFPQSIGMGASFNDSLLYEVFDAVSDEARVKSRIFSENGVLKRYQGLTF 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKRV 223
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  QG EN K D             
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQLGMAVVRGLQGPENGKYDK------------ 182

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          L AC KH   +   +W   +R+SF+A  IT +D  +T+ P F+  
Sbjct: 183 ---------------LHACAKHFAVHSGPEW---NRHSFDAENITPRDLWETYLPAFKDL 224

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++     +MC+YN+  G P C    L  Q  R+EWG+KG + SDC A++  +    +  
Sbjct: 225 VQKADVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTH-G 283

Query: 342 THED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH D   ++AG + +G D+ CG         A+  G + EK ID +L  L + +  LG  
Sbjct: 284 THPDKEHASAGAVLSGTDLECGG-EYGSLADAVKAGLIDEKQIDVSLKRLLTARFELGEM 342

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +  P    + ++    + + EH+ LAL  AR+ +VLL+N    LPLN +    +A++GP 
Sbjct: 343 DEQP---AWAEIPASTLNSKEHQDLALRMARESLVLLQNKNDILPLNTDL--KVAVMGPN 397

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
            N+     G Y GIP    +LL  + + +   +  Y  GC
Sbjct: 398 ANDSVMQWGNYNGIPGHTVTLLEAVRSKLPEGQVMYEPGC 437



 Score = 98.2 bits (243), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 77/269 (28%), Positives = 122/269 (45%), Gaps = 53/269 (19%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           AV   K AD V+   G+  + E E+           DR  + LP  Q  L+ ++ +  K+
Sbjct: 591 AVEKVKDADVVLFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPAVQRDLLKALKKAGKK 650

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
            V +  +G     +      +   +IL   YPG+AG  A+ +++FGD+NP GRLP+T+Y 
Sbjct: 651 VVFINYSGSA---IGLVPESNTCEAILQGWYPGQAGGTAIVDVLFGDYNPAGRLPVTFYK 707

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
           ++       D +M+        GR+YR+     ++ FGHGLSYT ++Y            
Sbjct: 708 DAGQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYG----------- 748

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
              +A   KN +   G+         VT        + I V+NAG  DG  VV ++ R  
Sbjct: 749 ---EADLSKNTIGDGGT---------VT--------LTIPVSNAGQRDGDEVVQVYLRCM 788

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
              +G P   L  F RVH  A  +K+++ 
Sbjct: 789 ADKEG-PHYTLRAFKRVHIPAGETKQVTI 816


>gi|317477153|ref|ZP_07936394.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316906696|gb|EFV28409.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 863

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 149/436 (34%), Positives = 230/436 (52%), Gaps = 47/436 (10%)

Query: 56  NTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNF 115
           + S  IS R +++I  +TL+EK+ QLS+ + +IPRL +P+Y +W+E LHG+A  G     
Sbjct: 51  DLSQPISVRIENIIRQMTLEEKVAQLSNESDSIPRLNLPSYNYWNECLHGVARAG----- 105

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                 VT FPQ +  A++++  L   I SA++ EAR  Y     GLT+WAP IN+ RDP
Sbjct: 106 -----EVTVFPQAINLASTWDTLLVKRIASAISTEARLKYLDIGKGLTYWAPTINMARDP 160

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V FVK  QG                           D  +
Sbjct: 161 RWGRNEETYGEDPYLTSRLGVAFVKGLQG---------------------------DHPN 193

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L   A  KH +A + E      R+S ++ I  +   + + P + +C+++     IM +Y
Sbjct: 194 YLKTVATVKHFVANNQEN----DRFSSSSQIPTKQLYEYYFPAYEACVKEANVQSIMTAY 249

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NG+P      L +   R EWGF G++ SDC A+  +        + E++AA  + +G
Sbjct: 250 NAFNGIPPSGSTWLLEDVLRKEWGFDGFVVSDCGAIGVMNWQHRIVNSLEEAAALGINSG 309

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK-GKYGKLGPD 413
            D+ CG     +  +A+ +G V E  IDRAL  + +++ +LG F  DP +   Y      
Sbjct: 310 CDLECGGTYRENLVAAVQRGLVSEYAIDRALTRVLTMRFKLGEF--DPIELVPYNHYDKK 367

Query: 414 DVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP 473
            +   + ++LA +AA + I+LLKN+  FLP++K  V S+AI+GP  +N     GGY+G P
Sbjct: 368 LLAGEQFRRLAYEAAVKSIILLKNEDNFLPIDKKDVRSIAIVGPFADN--NYLGGYSGKP 425

Query: 474 CSPKSLLRGLEAYVSK 489
               SLL+G++  V +
Sbjct: 426 VHNISLLQGVKKMVGE 441



 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 96/288 (33%), Positives = 140/288 (48%), Gaps = 47/288 (16%)

Query: 512 RIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPL 571
           +I  +AD V+V  G D     E+RD  S+ LP  Q  L+  + + + R + L+L  G PL
Sbjct: 604 KIVSRADLVLVALGNDGKLARENRDLPSIYLPMTQELLLKEIYKVNPR-IALILQTGNPL 662

Query: 572 DVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDM 630
              +A     + SIL   YPG+ G  ALA I+FG  NP G+LPMT Y ES  ++P + D 
Sbjct: 663 TSQWAA--EHVPSILQAWYPGQEGGAALAGILFGLENPSGKLPMTIY-ESEQQLPNILDY 719

Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
           ++       + GR+Y++ +   +YGFGHGLSY+N+ Y            A L+       
Sbjct: 720 DI-------WKGRTYQYLSSKPLYGFGHGLSYSNFEY------------ADLQCN----- 755

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT-PEKQ 749
                   D VH+D    C+       I V N  DV G  V+ ++    K    T P K+
Sbjct: 756 --------DVVHVDGTLQCS-------IKVKNISDVVGEEVIQVYVSREKTPVYTFPLKK 800

Query: 750 LIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           LI F RV+     SK ++F + P  QLS+      ++L  G + L VG
Sbjct: 801 LIAFARVNLKPNESKTVTFTITP-RQLSVWQDGEWKMLS-GKYSLFVG 846


>gi|237721943|ref|ZP_04552424.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229448812|gb|EEO54603.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 792

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 210/735 (28%), Positives = 339/735 (46%), Gaps = 127/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 143 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAN 190

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E+ GEDP +      + +   Q E   
Sbjct: 191 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLAGELGKQMILGLQNEG-- 242

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 243 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 272

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGY+ SD 
Sbjct: 273 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 332

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
           +AV  +      T T E+ AA V+ AG++I          +   + AID+GKV    +D+
Sbjct: 333 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAIDEGKVSLHTLDQ 392

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
            +  +  V+  +GLF+    GD R+       P+ V  ++ HK +++ AA + +VLLKN+
Sbjct: 393 RVSEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESVVLLKNE 445

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
            + LPL+KN    +A+IGP    + ++   Y     S K++ +G++ Y+  S+  YA GC
Sbjct: 446 NQMLPLSKN-FKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAKGC 504

Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       ++VP ++   A  HEAV +AK +D  I+V G +     E+  R +L L
Sbjct: 505 DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 564

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PV+LV+  G    +++  A+  I +I+   +PGE    A+A++
Sbjct: 565 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYIPAIIHAWFPGEFMGDAIAKV 621

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NPGGRL +T +P+S  ++P          S+       +      +Y FG+GLSY
Sbjct: 622 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGALYPFGYGLSY 674

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L IS  +  G  +NI                 SCT         V N
Sbjct: 675 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 703

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T +K L GF+R+H      + +SF + P + L + +K+
Sbjct: 704 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLGLWDKN 762

Query: 783 GRRILPLGNHVLMVG 797
            R  +  G+  +MVG
Sbjct: 763 NRFTVEPGSFSVMVG 777


>gi|329923020|ref|ZP_08278536.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328941793|gb|EGG38078.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 763

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 225/758 (29%), Positives = 342/758 (45%), Gaps = 135/758 (17%)

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           E +  +   A    RLGIP   +  E  HG  + G            T FP  L   +++
Sbjct: 90  EAVNAIQRYAMEHSRLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTW 137

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
           N  L+ +I  AVA E RA     Q G   ++P +++ RDPRWGR +ET GEDP +V+ +A
Sbjct: 138 NTELFRSISRAVAAETRA-----QGGAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFA 192

Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
           V  V+  QGE   S                             L A  KH   Y   + G
Sbjct: 193 VAAVQGLQGERLDS--------------------------HTSLLATLKHFAGYGASEGG 226

Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
              R      +  ++  +    PFR  +E G  S IM +YN+++GVP      L Q   R
Sbjct: 227 ---RNGAPVHMGLRELHEVDLLPFRKAVESGALS-IMTAYNEIDGVPCTSSRYLLQNVLR 282

Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLR-HTQSAIDK 373
             WGF G++ +DC A+  +    N   +  ++A   LKAG+D+     M R H Q A+++
Sbjct: 283 EAWGFDGFVITDCGAIHMLACGHNTAGSGVEAATQSLKAGVDMEMSGTMFRAHLQQALEQ 342

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
           G + E D++RA   +  ++ RLGLF+    DP   +        +   EH  LA  AA +
Sbjct: 343 GLITEDDLNRAAGRVLELKFRLGLFDRPYVDPAWAE------QVIGCKEHIALAYQAAAE 396

Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--- 487
           GIVLLKN+   LPL+ ++  ++A+IGP  +      G YT  P  P  ++  L+      
Sbjct: 397 GIVLLKNEGNLLPLDSSS-GTIAVIGPNAHTPYHQLGDYTS-PQPPGQIVTVLDGIRRRL 454

Query: 488 --SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----------LDLTQET-- 532
             S+  YA GC  +  +S  GF  A+  A++AD +++V G           +DL      
Sbjct: 455 GDSRVLYAPGCR-IQGDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASV 513

Query: 533 ------------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADS 580
                       E  DR +L L G Q+ L+  + +  K PVI+V   G P+   +   D 
Sbjct: 514 VTGDAKSDMECGEGIDRSTLTLMGVQLELLQELQKLGK-PVIVVYINGRPITEPWI--DE 570

Query: 581 QISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQY 640
            I +I+   YPG+ G  A+A+++FGD NP GRLP++  P+   ++P++       ++R+ 
Sbjct: 571 FIPAIIEAWYPGQEGGGAIADMLFGDINPSGRLPLS-IPKEVGQLPIS------YNARRT 623

Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSA-PSELTISASLKAGSDKNILQQTGSRLD 699
            G+ Y        Y FG GLSYT + Y  L+  P+ + I                     
Sbjct: 624 RGKRYLETDLAPRYPFGFGLSYTEFRYGRLTVEPAVVPIGG------------------- 664

Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
                E T        V+I VTNAG  DG+ VV L+      S   PEK L GF +V   
Sbjct: 665 -----EAT--------VRIDVTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLK 711

Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           A  ++E++F +   EQL +     + ++  G   + VG
Sbjct: 712 AGETQEVTFTIG-SEQLELIGLDLKPVVEPGEFRIQVG 748


>gi|380694149|ref|ZP_09859008.1| glycoside hydrolase 3 [Bacteroides faecis MAJ27]
          Length = 946

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 208/735 (28%), Positives = 339/735 (46%), Gaps = 115/735 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 153 RLGIPT-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRRLIHQVGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH IAY   K         +  ++ 
Sbjct: 253 -----------------------------QIAATGKHFIAYSNNKGAREGMARVDPQMSP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ E T   PF+  I +     +M SYN  +G P       L  + R E GF+GY+ SD 
Sbjct: 284 REVEMTHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDS 343

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  ++      K  +++    ++AG+++ C        +   +  + +G + E+ I+ 
Sbjct: 344 DAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVIND 403

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+  +GLF+   +    G    ++V  + ++++AL A+R+ IVLLKNDK  LP
Sbjct: 404 RVRDILRVKFLVGLFDHPYQIDLKG--ADEEVEKAANEEIALQASRESIVLLKNDKNILP 461

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
           L+ + +  +A+ GP  +  S     Y  +     S+L+G++  +   ++  Y  GC  V 
Sbjct: 462 LDASGIQKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKGKAEVLYTKGCDLVD 521

Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
            N                    +AV   K+AD  +VV G       E++ R SL LPG+Q
Sbjct: 522 ANWPESELIDYPLTDEEQKEIEKAVDQTKQADVAVVVLGGGQRTCGENKSRSSLDLPGRQ 581

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
           + L+ +VA T K PV+LVL  G PL +++  AD  + +I+   YPG  G KA+A+++FG+
Sbjct: 582 LDLLKAVAATGK-PVVLVLINGRPLSINW--ADKFVPAIVEAWYPGSKGGKAVADVLFGE 638

Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGT-QVYGFGHGLSY 662
           +NPGG+L +T +P++  ++P N     + + D  +  PG           +Y FG+GLSY
Sbjct: 639 YNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKN-PGMEGNMSRANGALYPFGYGLSY 696

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L IS ++   +     QQT                     V   VTN
Sbjct: 697 TTFEY------SDLKISPAIITPN-----QQT--------------------FVTCKVTN 725

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T EK L GF+RVH     +KE++F +D  + L + N  
Sbjct: 726 TGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLQPGETKEVTFPID-RKALELLNAD 784

Query: 783 GRRILPLGNHVLMVG 797
              ++  G+  LMVG
Sbjct: 785 MHWVVEPGDFTLMVG 799


>gi|373952814|ref|ZP_09612774.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373889414|gb|EHQ25311.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 862

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 152/456 (33%), Positives = 234/456 (51%), Gaps = 53/456 (11%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N +LS   RAK L++ LTL+EK+  + D + A+PRLGI  + WWSE+LHG A+ GP 
Sbjct: 23  PYQNPALSSEARAKDLVTRLTLKEKVGLMKDVSEAVPRLGIKKFNWWSEALHGYANQGP- 81

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
                    VT FP+ +  AASF+     ++  AV+ EARA  N  +          L+ 
Sbjct: 82  ---------VTVFPEPVGMAASFDDQKLFHVFDAVSDEARAKNNEYRKQVESQRFHDLSV 132

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP + S   V  VK  QG             +R+    
Sbjct: 133 WTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGP--------ADAKYRK---- 180

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
                         L AC KH   +   +W   SR+  N   +T +D  +T+ P F+S +
Sbjct: 181 --------------LLACAKHYAVHSGPEW---SRHEMNVTDVTPRDLWETYLPAFKSLV 223

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           +      +MC+Y +++  P C    L  Q  R +WGFK  + SDC A+   +   + +  
Sbjct: 224 QDADVREVMCAYQRLDDEPCCGNSRLLGQILREDWGFKYLVVSDCGAITDFYNSHHSSSD 283

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
              ++A  + +G D+ C          A+ +G ++EKDI+ +++ L + +  LG  + D 
Sbjct: 284 ATHASAKAVLSGTDVECVGYAFDKIPDAVYRGLIKEKDINTSVVRLMTQRFELGEMDKD- 342

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
               + K+    V + +H+KLALD AR+ + LL+N+   LPL+K ++  LA+IGP  N+ 
Sbjct: 343 ELVPWTKIPLSVVNSEDHQKLALDMARETMTLLQNNNNILPLSK-SIGKLAVIGPNANDS 401

Query: 463 SQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
             + G Y G P    ++L G++  +   H  Y +GC
Sbjct: 402 QMLSGNYNGTPLRTINILEGIKTKLGADHVIYDAGC 437



 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 77/312 (24%), Positives = 134/312 (42%), Gaps = 59/312 (18%)

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
           ++P   D G  E V   K AD V+ V G+    E E+           DR  + LP  Q 
Sbjct: 584 EIPLTYD-GIIEKV---KDADIVVFVGGISPKLEGEEMPVQLPGFKGGDRTDIELPAVQR 639

Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
           + + ++ +  K+ V +  +G     ++         +IL   Y GE+G +A+A+++FGD+
Sbjct: 640 NCIEALRKAGKKIVFVNCSGSA---IAMVPETQNCDAILQAWYAGESGGQAVADVLFGDY 696

Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
           NP G LP+T+Y         +D +M+        GR+YR+     ++ FG GLSYT ++ 
Sbjct: 697 NPSGHLPVTFYRNVQQLPDFSDYSMK--------GRTYRYLKSAPLFPFGFGLSYTTFNI 748

Query: 668 KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
                 ++LT +   K  +                             +++ V NAG  D
Sbjct: 749 ----GEAKLTKNNITKGEA---------------------------IQLRVPVANAGKTD 777

Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSIANKHGRRI 786
           G+ ++ ++ R      G   K L GF R+   A  ++ ++  + P   +         R+
Sbjct: 778 GTELLQVYIRKVDDPDGA-SKTLRGFKRIPVSAGKTEMVTLDLPPKTFEFFDPTDAVVRV 836

Query: 787 LPLGNHVLMVGE 798
            P GN+ L+ GE
Sbjct: 837 SP-GNYQLLYGE 847


>gi|167646366|ref|YP_001684029.1| beta-glucosidase [Caulobacter sp. K31]
 gi|167348796|gb|ABZ71531.1| Beta-glucosidase [Caulobacter sp. K31]
          Length = 898

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 164/442 (37%), Positives = 229/442 (51%), Gaps = 70/442 (15%)

Query: 65  AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
           A  L+  +TL EK++QL + A AIPRLG+PAY WW+ESLHG     P           T+
Sbjct: 46  AAELVGKMTLDEKLEQLLNTAPAIPRLGVPAYNWWTESLHGALGALP----------TTN 95

Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GLTFWAPNINIFRDP 175
           FP+ +  AA+F+  L  ++  A+  E + ++ L +A         GL  W+PNINIFRDP
Sbjct: 96  FPEPIGLAATFDAPLVHDVAGAIGAEMQDLHALARATGRMGRIGTGLNTWSPNINIFRDP 155

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRGQET GEDP + +   V FV+  QG                           D  D
Sbjct: 156 RWGRGQETYGEDPFLTARMGVAFVEGIQG---------------------------DDPD 188

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
              + A  KH   + +      +R+  N  ++ +D EDT+ P FR+ + +G+A  IMC+Y
Sbjct: 189 HPRIIATPKH---FAVHSGPESTRHGANVFVSRRDLEDTYLPAFRAAVVEGRAGSIMCAY 245

Query: 296 NQVNGVPACLRGDLFQKA--RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           N+++G PAC   DL  K   R  W F GY+ SDCDAV  I ++  Y      + A  L+A
Sbjct: 246 NRIDGQPACA-SDLLLKEHLRGAWKFDGYVVSDCDAVKDISDHHKYAPEAASAVAAALRA 304

Query: 354 GMDINCGTCMLRHT-------QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKG 405
           G+D  C    L  T       + A+D+G +    ID AL+ LFS + R    NGD P KG
Sbjct: 305 GVDNECNGATLTDTDGLAGRYREALDRGLISTAQIDTALVRLFSARFR----NGDLPAKG 360

Query: 406 KY-GKL-GPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
              G+L GP  V T EH+ LAL A+ + +VLLKND   LPL       +A+IGPL +   
Sbjct: 361 GSDGRLAGPSVVTTHEHEALALAASEKSLVLLKND-GVLPLKPGL--RIAVIGPLGDATR 417

Query: 464 QMGGGY-TGIPCSPKSLLRGLE 484
            + G Y + +   P S++ GL 
Sbjct: 418 VLRGNYSSALSAPPISVVDGLR 439



 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 83/290 (28%), Positives = 126/290 (43%), Gaps = 54/290 (18%)

Query: 518 DFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
           D ++ V GL    E E+           D+ +L LP  Q +++   A+   +P+I+V   
Sbjct: 628 DVLVAVVGLTSDMEAEEAPIEIPGFKGGDKTTLDLPADQRAMLEQ-AKALGKPLIVVAMN 686

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PL+  FA A    S++L   YPG++G  A+A ++ G  NP GRLP+T+Y       P 
Sbjct: 687 GSPLN--FAWAKDNASALLEAWYPGQSGGLAIANVLTGKTNPAGRLPLTFYRSVDDLPPF 744

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
           +D  M         GR+YR++ GT VY FG+GLSYT + Y  L                 
Sbjct: 745 DDYAMA--------GRTYRYFEGTPVYPFGYGLSYTRFDYGPLK---------------- 780

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
                          I+  T        V  ++ N G   G  V  L+   PK + G P 
Sbjct: 781 ---------------IEPATKGAGQGLRVTTTIKNVGTRPGEEVAQLYLDFPK-TPGAPR 824

Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             L GF R+      +++I+F + P   LS  +  G   +  G + + VG
Sbjct: 825 LALRGFQRIALKPGETRDITFALSP-RDLSSVDLDGEHRVSAGLYRVSVG 873


>gi|218130693|ref|ZP_03459497.1| hypothetical protein BACEGG_02282 [Bacteroides eggerthii DSM 20697]
 gi|217987037|gb|EEC53368.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 954

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 213/764 (27%), Positives = 353/764 (46%), Gaps = 120/764 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + +  L ++ R +SL+S++T ++K++ + +      IP L +P      E++HG +    
Sbjct: 170 YMDPRLPVNERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   S  T FPQ L   A++N++L   +  A+  E      L    +  W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNKNLTEKVAMAIGEET-----LAAGTMQAWSPVLDV 273

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            +D RWGR +ET GEDP++VS     ++K +Q +                          
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSKG------------------------- 308

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                  L    KH   +     G   R S +  ++E++  +    PFR  I       +
Sbjct: 309 -------LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREVHLVPFRHVIRNYSCQSL 358

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG- 349
           M +Y+   GVP     +L +   R EWGF G++ SDC A+  +   ++YT  ++  AA  
Sbjct: 359 MMAYSDFLGVPVAKSKELLRNILREEWGFDGFVVSDCGAIGNLTSRKHYTAKNKIEAANQ 418

Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
            L AG+  NCG T   +    A   G++   ++D     +  +  R  LF  + ++   +
Sbjct: 419 ALAAGIATNCGDTYNDKEVIQAAKDGRLDMANLDNVCRTMLRMMFRNELFEKEHKEPLDW 478

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            K+ P    +  HK++A  +AR+ IV+L+N    LPL+K+ V ++A++GP  +N+    G
Sbjct: 479 NKIYPG-WNSDSHKEIARQSARESIVMLENKDDVLPLSKH-VRTIAVLGPGADNLQP--G 534

Query: 468 GYTGI--PCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
            YT    P   KS+L G++  V   +K  Y  GC     N +     AV++A ++D V++
Sbjct: 535 DYTPKLRPGQLKSVLTGIKQAVGNQTKILYEQGCEFTGSNGE-NIPNAVKVAAQSDVVVL 593

Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           V G   T E          E+ D  +L+LPG+Q  L+ +V  T K PV+LVL  G P ++
Sbjct: 594 VLGDCSTSEATTDVYKTSGENHDYATLILPGRQQELLEAVCATGK-PVVLVLQIGRPYNL 652

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
           +      +   + W+  PG+ G  A A+++FGD+NP GRLPMT +P    ++P+   N +
Sbjct: 653 TKESELCKAIIVNWL--PGQEGGLATADVLFGDYNPAGRLPMT-FPRHVGQLPLY-YNFK 708

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S R+Y      +Y    +Y FG+GLSYT++ Y      S+L +        + NI+  
Sbjct: 709 T-SGRRYEYSDLEYYP---LYYFGYGLSYTSFEY------SDLKVEER----ENGNII-- 752

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                                   ++V N G   G  VV L+      S  T   +L  F
Sbjct: 753 ----------------------AHVNVKNIGHRAGDEVVQLYVTDMYASVKTRITELKDF 790

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            RVH     SK ISF + P E LS+ N +  R++  G   ++VG
Sbjct: 791 TRVHLRPGESKSISFELTPYE-LSLLNDNMDRVVEKGTFKILVG 833


>gi|365122063|ref|ZP_09338970.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643257|gb|EHL82578.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 819

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 208/720 (28%), Positives = 316/720 (43%), Gaps = 124/720 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E++HG+             +  T  P  +   +++N  L    GS    
Sbjct: 151 RLGIPV-DFSNETIHGLNH-----------TKATPLPAPIGIGSTWNAPLVYKAGSIAGK 198

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  +AP +++ RDPRWGR  E  GEDP +V+    + VK  Q     
Sbjct: 199 EAKAL------GYTNIYAPILDLARDPRWGRVLECYGEDPFLVATLGTQMVKGIQ----- 247

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                      E  ++A  KH   Y + K G       +  +  
Sbjct: 248 ---------------------------EQGVAATLKHFAVYSVPKGGRDGSVRTDPHVAP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDC 327
           ++       PF+  I+      +M SYN  +GVP       L Q  R E+GF GY+ SD 
Sbjct: 281 REMHQMHLYPFKKVIQDAHPMGVMSSYNDWDGVPVTASYYFLTQLLRQEFGFDGYVVSDS 340

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  ++   +  +T+E++   VL+AG+++          +   +  + +G++  K ID 
Sbjct: 341 DAVEYVYNKHHVAETYEEAVRMVLEAGLNVRTTFAAPDIFILPARKLVKEGRLSMKVIDE 400

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDK 439
            + ++  V+ RLGLF+     DP      K     V   ++K   LD  RQ +VLLKN+ 
Sbjct: 401 RVADVLRVKFRLGLFDQPFVADP------KAADKIVGADKNKDFVLDIQRQSLVLLKNEN 454

Query: 440 KFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGC 496
             LPL+KN +S + I GPL    + M   Y        ++  G++ Y+       YA GC
Sbjct: 455 NLLPLDKNKLSRILITGPLAKEENYMVSRYGPQELENITVYEGIKNYLGNKVAVDYALGC 514

Query: 497 H--------------DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                           +          AV  AK +D VI V G D     E + R  L L
Sbjct: 515 KVKDAKWPESEIIHSPLTTEEQQEIQNAVEKAKLSDIVIAVLGEDEESTGESKSRSGLDL 574

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
           PG+Q  L+ ++  T K PV+LVL  G PL +++A  D  I +IL   +PG+ G  A+AE 
Sbjct: 575 PGRQQQLLEALYATGK-PVVLVLINGQPLTINWA--DRYIPAILEAWFPGQMGGTAIAET 631

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV----YGFGH 658
           +FGD+NPGG+LP+T +P++  ++ +N     A  S+Q P      Y  T+V    Y FG 
Sbjct: 632 LFGDYNPGGKLPVT-FPKTLGQIELNFPFKPASQSKQ-PEAGPNGYGKTRVNGALYPFGF 689

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT + Y      S L +S            ++ G + D                V  
Sbjct: 690 GLSYTTFEY------SNLKVSP-----------ERQGPKGD--------------IQVSF 718

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            +TN G   G  +V L+ +    S  + E  L GF+RV      +K I F + P E L I
Sbjct: 719 DITNTGKRAGDEIVQLYVKDKVSSVISYESLLRGFERVSLQPGETKNIQFTLHP-EDLEI 777


>gi|423222018|ref|ZP_17208488.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392644204|gb|EIY37946.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 942

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 210/754 (27%), Positives = 340/754 (45%), Gaps = 129/754 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRELIRQVGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHSH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH +AY   K         +  ++ 
Sbjct: 253 -----------------------------QVAATGKHFVAYSNNKGAREGMARVDPQMSP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
           ++ E     PF+  I++     +M SYN  +GVP  ++G  +    + R E GF+GY+ S
Sbjct: 284 REVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVP--IQGSYYWLTTRLRGEMGFRGYVVS 341

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DAV  ++   +  K  +++    ++AG+++ C        +   +  + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVI 401

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+  +GLF+   +    G     +V  +E++ LAL A+R+ +VLLKN+   
Sbjct: 402 NDRVRDILRVKFLVGLFDTPYQTDLAG--ADKEVEKAENESLALQASRESLVLLKNENNV 459

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHD 498
           LPL+ N V  +A+ GP  +        Y  +     ++L G+       ++  Y  GC  
Sbjct: 460 LPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKAEVLYTKGCDL 519

Query: 499 VPCN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
           V  N                A   +AV  A++AD  +VV G       E++ R SL LPG
Sbjct: 520 VDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLDLPG 579

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q+ L+ +V  T K PV+LVL  G PL +++A  D  +  IL   YPG  G  A+A+++F
Sbjct: 580 RQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPVILEAWYPGSKGGTAVADVLF 636

Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
           GD+NPGG+L +T +P+S  ++P    N     S Q  G       G        +Y FG+
Sbjct: 637 GDYNPGGKLTVT-FPKSVGQIP---FNFPCKPSSQIDGGKNPGLDGNMSRVNGALYSFGY 692

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT + Y      S++ IS  +   + K                           V+ 
Sbjct: 693 GLSYTTFEY------SDIEISPKVITPNQKAT-------------------------VRC 721

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            VTN G   G  VV L+ R    S  T EK L GF+R+H     +KE+ F +D  +QL +
Sbjct: 722 KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFTLD-RKQLEL 780

Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIE 806
            +KH   ++  G+  +MVG       L   LT+E
Sbjct: 781 LDKHMEWVVEPGDFSIMVGASSEDIRLSGKLTVE 814


>gi|189467715|ref|ZP_03016500.1| hypothetical protein BACINT_04107 [Bacteroides intestinalis DSM
           17393]
 gi|189435979|gb|EDV04964.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 943

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 211/754 (27%), Positives = 341/754 (45%), Gaps = 129/754 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + GI S              T+FP  L    ++NR L   +G     
Sbjct: 153 RLGIPV-DFTNEGIRGIES-----------YRATNFPTQLGLGHTWNRELIRQVGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARIL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH +AY   K         +  ++ 
Sbjct: 253 -----------------------------QVAATGKHFVAYSNNKGAREGMARVDPQMSP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
           ++ E     PF+  I++     +M SYN  +GVP  ++G  +    + R E GF+GY+ S
Sbjct: 284 REVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVP--IQGSYYWLTTRLRGEMGFRGYVVS 341

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DAV  ++   +  K  +++    ++AG+++ C        +   +  + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHSTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVI 401

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+  +GLF+   +    G     +V  +E++ LAL A+R+ +VLLKN+   
Sbjct: 402 NDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKAENESLALQASRESLVLLKNENNV 459

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
           LPL+ N V  +A+ GP  +        Y  +     ++L G+       ++  Y  GC  
Sbjct: 460 LPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKSEGKAEVLYTKGCDL 519

Query: 498 -----------DVPC--NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
                      D P   N  A   +AV  A++AD  +VV G       E++ R SL LPG
Sbjct: 520 VDANWPESELIDYPMTDNEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRSSLDLPG 579

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q+ L+ +V  T K PV+LVL  G PL +++A  D  + +IL   YPG  G  A+A+++F
Sbjct: 580 RQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGTAVADVLF 636

Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
           GD+NPGG++ +T +P+S  ++P    N     S Q  G       G        +Y FG+
Sbjct: 637 GDYNPGGKMTVT-FPKSVGQIP---FNFPCKPSSQIDGGKNPGLDGNMSRVNGALYSFGY 692

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT + Y  +   S   I+ + KA                               V+ 
Sbjct: 693 GLSYTTFEYSGIEI-SPKVITPNQKA------------------------------TVRC 721

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            VTN G   G  VV L+ R    S  T EK L GF+R+H     +KE+ F +D  +QL +
Sbjct: 722 KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVFTLD-RKQLEL 780

Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIE 806
            +KH   ++  G+  +MVG       L   LT+E
Sbjct: 781 LDKHMEWVVEPGDFSIMVGASSEDIRLSGKLTVE 814


>gi|212692496|ref|ZP_03300624.1| hypothetical protein BACDOR_01992 [Bacteroides dorei DSM 17855]
 gi|212664971|gb|EEB25543.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           dorei DSM 17855]
          Length = 864

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 164/462 (35%), Positives = 234/462 (50%), Gaps = 64/462 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N++LS   RA+ L+  LTL+EK+  + DN+  + RLGI  Y WW+E+LHG+A +G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AASF       I +AV+ EARA      A        GLT W
Sbjct: 81  -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
            P +NI+RDPRWGRG ET GEDP + S   V  VK  Q    N K D             
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK------------ 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          + AC KH   +   +W   +R+ FNA  I  +D  +T+  PF + 
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++GK   +MC+YN++ G P C    L  Q  R EWG++G + SDC A+   +  + + K
Sbjct: 224 VKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGH-K 282

Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           TH D    SAA VL +G D+ CG+      +SA  KG + EKDID ++  L   +  LG 
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + DP K ++ K+    VC++EH  L+LD AR+ + LL N    LPL +    ++A++GP
Sbjct: 341 MD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
             N+     G Y G P    +LL G+ + +    K  Y  GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/325 (29%), Positives = 147/325 (45%), Gaps = 66/325 (20%)

Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
            Y SG  D   N D GF E V I       K AD VI   G+  + E E+          
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629

Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR  + LP  Q  L+ ++    K+ VI V   G P  ++         +IL   YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
           G KA AE++FGD+NP GRLP+T+Y  +  ++P   D NM         GR+YR++ G  +
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFY-RNIAQLPDFEDYNMT--------GRTYRYFKGDPL 737

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           + FG+GLSYT ++Y        + +  ++K G    ++                      
Sbjct: 738 FPFGYGLSYTTFNY------DNIKLDQTIKVGETAKMV---------------------- 769

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
               I VTNAG+ DG  VV ++ +  + ++G P K L  F RV   A  +  +   + P 
Sbjct: 770 ----IPVTNAGNRDGEEVVQVYLKKQEDAEG-PAKTLRAFKRVQIPAGKTVNVELELTP- 823

Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
           +QL   +     +  + GN  +MVG
Sbjct: 824 KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|265752711|ref|ZP_06088280.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           3_1_33FAA]
 gi|263235897|gb|EEZ21392.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           3_1_33FAA]
          Length = 864

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 164/462 (35%), Positives = 234/462 (50%), Gaps = 64/462 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N++LS   RA+ L+  LTL+EK+  + DN+  + RLGI  Y WW+E+LHG+A +G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AASF       I +AV+ EARA      A        GLT W
Sbjct: 81  -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
            P +NI+RDPRWGRG ET GEDP + S   V  VK  Q    N K D             
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK------------ 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          + AC KH   +   +W   +R+ FNA  I  +D  +T+  PF + 
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++GK   +MC+YN++ G P C    L  Q  R EWG++G + SDC A+   +  + + K
Sbjct: 224 VKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGH-K 282

Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           TH D    SAA VL +G D+ CG+      +SA  KG + EKDID ++  L   +  LG 
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + DP K ++ K+    VC++EH  L+LD AR+ + LL N    LPL +    ++A++GP
Sbjct: 341 MD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
             N+     G Y G P    +LL G+ + +    K  Y  GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 98/326 (30%), Positives = 150/326 (46%), Gaps = 68/326 (20%)

Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
            Y SG  D   N D GF E V I       K AD VI   G+  + E E+          
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629

Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ-ISSILWIGYPGE 593
            DR  + LP  Q  L+ ++    K+ VI V   G P+ +   E ++Q   +IL   YPG+
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSPIAM---EPETQYCQAILQAWYPGQ 685

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQ 652
           +G KA AE++FGD+NP GRLP+T+Y  +  ++P   D NM         GR+YR++ G  
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFY-RNIAQLPDFEDYNMT--------GRTYRYFKGDP 736

Query: 653 VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL 712
           ++ FG+GLSYT ++Y        + +  ++K G    ++                     
Sbjct: 737 LFPFGYGLSYTTFNY------GNIKLEQTIKVGETAKMV--------------------- 769

Query: 713 RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
                I VTN G+ DG  VV ++ +  + ++G P K L  F RV   A  +  +   + P
Sbjct: 770 -----IPVTNTGNRDGEEVVQVYLKKQEDTEG-PAKTLRAFKRVQIPAGKTVNVELELTP 823

Query: 773 CEQLSIANKHGRRILPL-GNHVLMVG 797
            +QL   +     +  + GN  +MVG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|237709184|ref|ZP_04539665.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|229456880|gb|EEO62601.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
          Length = 864

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 164/462 (35%), Positives = 234/462 (50%), Gaps = 64/462 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N++LS   RA+ L+  LTL+EK+  + DN+  + RLGI  Y WW+E+LHG+A +G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AASF       I +AV+ EARA      A        GLT W
Sbjct: 81  -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
            P +NI+RDPRWGRG ET GEDP + S   V  VK  Q    N K D             
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK------------ 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          + AC KH   +   +W   +R+ FNA  I  +D  +T+  PF + 
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++GK   +MC+YN++ G P C    L  Q  R EWG++G + SDC A+   +  + + K
Sbjct: 224 VKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGH-K 282

Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           TH D    SAA VL +G D+ CG+      +SA  KG + EKDID ++  L   +  LG 
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + DP K ++ K+    VC++EH  L+LD AR+ + LL N    LPL +    ++A++GP
Sbjct: 341 MD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
             N+     G Y G P    +LL G+ + +    K  Y  GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/325 (29%), Positives = 147/325 (45%), Gaps = 66/325 (20%)

Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
            Y SG  D   N D GF E V I       K AD VI   G+  + E E+          
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629

Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR  + LP  Q  L+ ++    K+ VI V   G P  ++         +IL   YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
           G KA AE++FGD+NP GRLP+T+Y  +  ++P   D NM         GR+YR++ G  +
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFY-RNIAQLPDFEDYNMT--------GRTYRYFKGDPL 737

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           + FG+GLSYT ++Y        + +  ++K G    ++                      
Sbjct: 738 FPFGYGLSYTTFNY------DNIKLDQTIKVGETAKMV---------------------- 769

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
               I VTNAG+ DG  VV ++ +  + ++G P K L  F RV   A  +  +   + P 
Sbjct: 770 ----IPVTNAGNRDGEEVVQVYLKKQEDAEG-PAKTLRAFKRVQIPAGKTVNVELELTP- 823

Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
           +QL   +     +  + GN  +MVG
Sbjct: 824 KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|345514226|ref|ZP_08793739.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|229437207|gb|EEO47284.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
          Length = 864

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 164/462 (35%), Positives = 234/462 (50%), Gaps = 64/462 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N++LS   RA+ L+  LTL+EK+  + DN+  + RLGI  Y WW+E+LHG+A +G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AASF       I +AV+ EARA      A        GLT W
Sbjct: 81  -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
            P +NI+RDPRWGRG ET GEDP + S   V  VK  Q    N K D             
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK------------ 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          + AC KH   +   +W   +R+ FNA  I  +D  +T+  PF + 
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++GK   +MC+YN++ G P C    L  Q  R EWG++G + SDC A+   +  + + K
Sbjct: 224 VKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGH-K 282

Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           TH D    SAA VL +G D+ CG+      +SA  KG + EKDID ++  L   +  LG 
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + DP K ++ K+    VC++EH  L+LD AR+ + LL N    LPL +    ++A++GP
Sbjct: 341 MD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
             N+     G Y G P    +LL G+ + +    K  Y  GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/325 (29%), Positives = 147/325 (45%), Gaps = 66/325 (20%)

Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
            Y SG  D   N D GF E V I       K AD VI   G+  + E E+          
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629

Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR  + LP  Q  L+ ++    K+ VI V   G P  ++         +IL   YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
           G KA AE++FGD+NP GRLP+T+Y  +  ++P   D NM         GR+YR++ G  +
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFY-RNIAQLPDFEDYNMT--------GRTYRYFKGDPL 737

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           + FG+GLSYT ++Y        + +  ++K G    ++                      
Sbjct: 738 FPFGYGLSYTTFNY------DNIKLDQTIKVGETAKMV---------------------- 769

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
               I VTNAG+ DG  VV ++ +  + ++G P K L  F RV   A  +  +   + P 
Sbjct: 770 ----IPVTNAGNRDGEEVVQVYLKKQEDAEG-PAKTLRAFKRVQIPAGKTVNVELELTP- 823

Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
           +QL   +     +  + GN  +MVG
Sbjct: 824 KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|293371677|ref|ZP_06618088.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292633374|gb|EFF51944.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 783

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 214/749 (28%), Positives = 339/749 (45%), Gaps = 132/749 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L   +G A+  
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R      Q G   + P +++ RDPRW R +ET GEDP++                   
Sbjct: 179 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLT------------------ 215

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
             G IG     K +++G G   D        A  KH +AY + + G     SF  +   +
Sbjct: 216 --GEIG-----KAIVEGLGG-GDLSRPYSTLATLKHFLAYGISESGQNGNPSFAGI---R 264

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           +  + F PPFR  I+ G A  +M SYN ++GVP      L  +  RNEW F G + SD  
Sbjct: 265 ELHENFLPPFRQAIDAG-ALSVMTSYNSMDGVPCTANHSLLTELLRNEWKFSGIVVSDLY 323

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I +      T E++A   L AG+D++ G     +  +A++ G++ +  +D ++  +
Sbjct: 324 SIEGIHQSHFVAPTMEEAAVLALSAGVDVDLGGDAYMNLMNAVNTGRIGKTALDASVARV 383

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP K K       +V + E   LA   A+  I LLKN+   LPLN
Sbjct: 384 LRLKFEMGLFENPYVDPEKAK------KEVRSEESVTLARRVAQASITLLKNEHSLLPLN 437

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGI--PCSPKSLLRGLEAYV--SKTHYASGCHDVPC 501
           KN    +A+IGP  +N   M G YT      + K++L G+   +  S+  Y  GC  +  
Sbjct: 438 KN--RKVALIGPNADNRYNMLGDYTAPQEEANIKTVLDGIRTKLSSSQVEYVKGC-SIRD 494

Query: 502 NSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRDRV 538
                  +AV  A++++ +I V G     D     +ET                E  DR 
Sbjct: 495 TVTTDIEQAVAAAQRSEIIIAVVGGSSARDFKTSYKETGAAIANEKTISDMECGEGFDRA 554

Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
           +L L G+Q  L+ ++ +T+ +P+++V   G PLD ++A  ++   ++L   YPG+ G  A
Sbjct: 555 TLSLLGKQQELLKAL-KTTGKPLVVVYIEGRPLDKNWASENAD--AVLTAYYPGQEGGIA 611

Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
           +A+++FGDFNP GRLP +  P S  ++P+   N +A  S  Y   S      + +Y FG+
Sbjct: 612 IADVLFGDFNPAGRLPFS-VPRSVGQIPLY-YNKKAPQSHDYVEMS-----ASPLYPFGY 664

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT++ Y      S+L +SA                             T   F V  
Sbjct: 665 GLSYTSFDY------SDLHLSA----------------------------LTPRSFEVSF 690

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            V N G  DG  V  L+ R    S   P KQL  F R +      +E+ F +   E  S+
Sbjct: 691 KVRNTGKYDGEEVAQLYLRDEYASVVQPLKQLKHFARFYLKRGEEREVKF-ILSEEDFSL 749

Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
            +++ + I+  G   +M+G     + ++T
Sbjct: 750 VDRNLKSIVEPGTFQIMIGAASDDIRLQT 778


>gi|423230604|ref|ZP_17217008.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
           CL02T00C15]
 gi|423244313|ref|ZP_17225388.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
           CL02T12C06]
 gi|392630748|gb|EIY24734.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
           CL02T00C15]
 gi|392642494|gb|EIY36260.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
           CL02T12C06]
          Length = 864

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 164/462 (35%), Positives = 234/462 (50%), Gaps = 64/462 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N++LS   RA+ L+  LTL+EK+  + DN+  + RLGI  Y WW+E+LHG+A +G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AASF       I +AV+ EARA      A        GLT W
Sbjct: 81  -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
            P +NI+RDPRWGRG ET GEDP + S   V  VK  Q    N K D             
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK------------ 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          + AC KH   +   +W   +R+ FNA  I  +D  +T+  PF + 
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++GK   +MC+YN++ G P C    L  Q  R EWG++G + SDC A+   +  + + K
Sbjct: 224 VKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGH-K 282

Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           TH D    SAA VL +G D+ CG+      +SA  KG + EKDID ++  L   +  LG 
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + DP K ++ K+    VC++EH  L+LD AR+ + LL N    LPL +    ++A++GP
Sbjct: 341 MD-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
             N+     G Y G P    +LL G+ + +    K  Y  GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  120 bits (300), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 96/325 (29%), Positives = 146/325 (44%), Gaps = 66/325 (20%)

Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
            Y SG  D   N D GF E V I       K AD VI   G+  + E E+          
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629

Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR  + LP  Q  L+ ++    K+ VI V   G P  ++         +IL   YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
           G KA AE++FGD+NP GRLP+T+Y  +  ++P   D NM         GR+YR++ G  +
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFY-RNIAQLPDFEDYNMT--------GRTYRYFKGDPL 737

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           + FG+GLSYT ++Y        + +  ++K G    ++                      
Sbjct: 738 FPFGYGLSYTTFNY------DNIKLEQTIKVGETAKMV---------------------- 769

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
               I VTN G+ DG  VV ++ +  + ++G P K L  F RV   A  +  +   + P 
Sbjct: 770 ----IPVTNTGNRDGEEVVQVYLKKQEDTEG-PTKTLRAFKRVQIPAGKTVNVELELTP- 823

Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
           +QL   +     +  + GN  +MVG
Sbjct: 824 KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|62321294|dbj|BAD94522.1| beta-xylosidase - like protein [Arabidopsis thaliana]
          Length = 287

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 130/291 (44%), Positives = 183/291 (62%), Gaps = 9/291 (3%)

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D V++V G D + E E  DRV L LPG+Q  LVT VA  ++ PV+LV+  GG  D++FA+
Sbjct: 1   DSVVLVVGADQSIEREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVIMSGGGFDITFAK 60

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADS 636
            D +I+SI+W+GYPGEAG  A+ ++IFG  NP G LPMTWYP+S+  KVPM++MNMR D 
Sbjct: 61  NDKKITSIMWVGYPGEAGGLAIPDVIFGRHNPSGNLPMTWYPQSYVEKVPMSNMNMRPDK 120

Query: 637 SRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGS 696
           S+ YPGRSYRFYTG  VY F   L+YT + ++ + AP  +++S         +  Q    
Sbjct: 121 SKGYPGRSYRFYTGETVYAFADALTYTKFDHQLIKAPRLVSLSLDENHPCRSSECQS--- 177

Query: 697 RLDYV--HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
            LD +  H +      S  F V ++V N GD  GSH V LF   P+V  G+P KQL+GF+
Sbjct: 178 -LDAIGPHCENAVEGGS-DFEVHLNVKNTGDRAGSHTVFLFTTSPQV-HGSPIKQLLGFE 234

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
           ++         + F V+ C+ LS+ ++ G+R + LG+H+L VG L+HSL I
Sbjct: 235 KIRLGKSEEAVVRFNVNVCKDLSVVDETGKRKIALGHHLLHVGSLKHSLNI 285


>gi|393784569|ref|ZP_10372732.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665550|gb|EIY59074.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
           CL02T12C01]
          Length = 929

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 147/436 (33%), Positives = 230/436 (52%), Gaps = 49/436 (11%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           ++PF + SLS   RAK+L+SLLTL+EKI Q+     AIPRL I  Y +W+E++HG+A +G
Sbjct: 39  THPFQDESLSFHERAKNLVSLLTLEEKINQVGHQTLAIPRLNIKGYNYWNEAIHGVARSG 98

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
                       TSFP     +++++  L  +   A + EAR   N    GL +W P IN
Sbjct: 99  LA----------TSFPVSKAMSSTWDLPLIFDCAVATSDEARVYSNTKDKGLIYWCPTIN 148

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           + RDPRWGR +E  GEDP +    AVE++K  QG++ K                      
Sbjct: 149 MSRDPRWGRDEENYGEDPFLTGKIAVEYIKGMQGDDPKY--------------------- 187

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                     A  KH  A + EK     R+S ++ +  ++  + + P F   +++G    
Sbjct: 188 ------YKTIATAKHFAANNYEK----GRHSTSSDMDARNLREYYLPAFEMAVKEGNVRS 237

Query: 291 IMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFE--YQNYTKTHEDSA 347
           +M +YN +NG+P     +L     R EWGF G++TSDC AV  +++    ++  T  +++
Sbjct: 238 VMSAYNALNGIPCGANHELLIDILRTEWGFNGFVTSDCGAVDDVYQSNRHHFVNTAAEAS 297

Query: 348 AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           A  +  G D+NCG     + + AI+KG +QE D+D AL+ +F  +  +G F+ +     +
Sbjct: 298 AVSIVNGEDLNCGNTFQDYCKEAIEKGYMQEADLDTALVRVFEARFSVGEFD-NASNVPW 356

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
             +  D +   EH++LA  AA++ IVLLKND   LPL+K    S+A+IGP  N I+   G
Sbjct: 357 RSISDDVLDCEEHRQLAYKAAQEAIVLLKNDNNILPLDK--TKSVAVIGPFGNTITL--G 412

Query: 468 GYTGIPCSPKSLLRGL 483
           GY+G P +  +   G+
Sbjct: 413 GYSGSPTALTTPFGGI 428



 Score =  119 bits (299), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 88/288 (30%), Positives = 136/288 (47%), Gaps = 42/288 (14%)

Query: 495 GCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVA 554
           GC  V   ++     A  IA KAD VI  AG DLT   E  DR +L LPG Q  L+ +V 
Sbjct: 592 GCA-VTGTAETNLERAKEIAAKADVVIFAAGTDLTVSDESHDRTNLNLPGDQQKLLEAVY 650

Query: 555 RTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLP 614
             +   ++L+ T      V+   A   + +I+   Y G+A  KA+A++++GD+NP G+L 
Sbjct: 651 SANPNVILLLQTCSS---VTINWAKEHVPAIIEAWYGGQAQGKAIADVLYGDYNPSGKLT 707

Query: 615 MTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPS 674
            TWY  + + +P   +N     ++     +Y ++  T +Y FG+G+SYT + Y+      
Sbjct: 708 STWY-NALSDLPNGMLNYDIRDAK----YTYMYHDKTPLYPFGYGMSYTTFEYQ------ 756

Query: 675 ELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVML 734
           +L IS S  A  ++ I                         V   +TN G   G+ +V L
Sbjct: 757 KLNISKSRLAAGEELI-------------------------VSADITNTGKYAGAEIVQL 791

Query: 735 FARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +A V   S   P KQL+GF RV      +K ++  +   EQLS  N+ 
Sbjct: 792 YAHV-NSSIERPLKQLVGFARVELEPGETKTVTMPLKH-EQLSYFNEQ 837


>gi|313205375|ref|YP_004044032.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312444691|gb|ADQ81047.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 858

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 161/459 (35%), Positives = 227/459 (49%), Gaps = 62/459 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N  LS   RA  L++ LTL EK   + +N+ AIPRLGI AYEWW+E+LHG+  +G  
Sbjct: 23  PYQNPKLSAEVRATDLLARLTLAEKAALMQNNSPAIPRLGIKAYEWWNEALHGVGRSG-- 80

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
                     T FPQ +  AASFN  L  +  +AV+ EARA  N            GLT+
Sbjct: 81  --------VATVFPQAIGMAASFNNGLLFDAFTAVSDEARAKSNKFSEQGGLKRYQGLTY 132

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP + S   V  VK  QG                    
Sbjct: 133 WTPNVNIFRDPRWGRGQETYGEDPYLTSLMGVAVVKGLQG-------------------- 172

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
                  D  +   L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F++ +
Sbjct: 173 ------PDNAEYDKLHACAKHFAVHSGPEW---NRHSFNAENINPRDLWETYLPAFKALV 223

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+    P C    L  Q  RN+W F G + SDC A++  ++  N   T
Sbjct: 224 QKADVKEVMCAYNRFEDEPCCGSNRLLTQILRNDWKFDGLVVSDCWAISDFYK-PNAHAT 282

Query: 343 HED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
             D   +AA  +  G D+ CG+   R+   A+  G ++EK ID +L  L   +  LG  N
Sbjct: 283 QPDATHAAANAVLNGTDLECGSD-FRNLPEAVKAGLIEEKRIDVSLKRLLKARFELGEMN 341

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            D    +   +    V + +H+ LAL  A + IVLL+N+   LPL+K     +A++GP  
Sbjct: 342 SD----QVWPISYSVVNSEKHQNLALRMAEESIVLLQNNNNILPLSKKL--KIAVMGPNA 395

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
           N+     G Y G P    +LL  +      ++  Y  GC
Sbjct: 396 NDSVMQWGNYNGFPAHTVTLLEAMRKSFPGAQLIYEPGC 434



 Score = 98.2 bits (243), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 90/336 (26%), Positives = 143/336 (42%), Gaps = 67/336 (19%)

Query: 450 SSLAIIGPLVN-NISQMGGGYTGIPCSPKSL-LRGLEAYVSKTHYASGCHDVPCNSDAG- 506
           +SL I G +VN N+S         P +   L     ++Y  +  ++ G  D   N D G 
Sbjct: 526 TSLTINGEIVNRNVS------ANTPTNVYQLKAEAGKSYEIEIAFSQGNTDANLNFDFGT 579

Query: 507 -----FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVT 551
                   ++   K AD V+   G+  + E E+           DR  + LP  Q  L+ 
Sbjct: 580 LVPLDLSASIAKVKDADVVVFAGGIAPSLEGEEMRVTVPGFKGGDRTDIELPAIQRRLLQ 639

Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
           ++    K+ V +  +G     +          +IL   YPG+AG  A+A ++ G++NP G
Sbjct: 640 ALKDAGKKVVFVNFSGSA---MGLVPETQSCEAILQAWYPGQAGGTAVANVLLGNYNPSG 696

Query: 612 RLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS 671
           RLP+T+Y          D +M+        GR+YR+ T   ++ FG+GLSYT    KF+ 
Sbjct: 697 RLPVTFYKNVAQLPDFEDYSMK--------GRTYRYMTEKPLFSFGYGLSYT----KFVL 744

Query: 672 APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
             ++L  S S+KA                               + + VTNAG V G+ V
Sbjct: 745 GTAKLNKS-SIKANET--------------------------LKITVPVTNAGKVAGTEV 777

Query: 732 VMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           + ++ R  K   G P K L GF +V+     + +IS
Sbjct: 778 LQVYVRKVKDVDG-PAKTLRGFKKVNIEPGKTSQIS 812


>gi|427387354|ref|ZP_18883410.1| hypothetical protein HMPREF9447_04443 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725515|gb|EKU88386.1| hypothetical protein HMPREF9447_04443 [Bacteroides oleiciplenus YIT
           12058]
          Length = 786

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 222/743 (29%), Positives = 349/743 (46%), Gaps = 125/743 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L S I    A 
Sbjct: 137 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKELISEIAKVTAE 184

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E  GEDP +V                 
Sbjct: 185 EAKAL------GYTNIYSPILDIAQDPRWGRVVECYGEDPFLV----------------- 221

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
              G +G     KR++KG   E        L +  KH   Y +   G  +    +  +  
Sbjct: 222 ---GELG-----KRMIKGLQAEG-------LVSTPKHFAVYSIPVGGRDAGTRTDPHVAP 266

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
           ++    +  PFR    +  A  +M SYN  +G P  + G    L +  R+EWGFKGY+ S
Sbjct: 267 REMRTLYIEPFRKAFCEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRHEWGFKGYVVS 324

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDI 381
           D +AV  ++   N      D AA V+ AG+++     +    +R  + AI +GKV E+ I
Sbjct: 325 DSEAVEFLYSKHNVAANAVDGAAQVINAGLNVRTNFTLPENFIRPLRQAISEGKVSEQTI 384

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS-EHKKLALDAARQGIVLLKNDKK 440
           D  + ++  V+  +GLF+ +P KG   K  P+ V  S EH+ +++ AA + IVLLKN+  
Sbjct: 385 DSRVADVLRVKFMMGLFD-NPYKGDAKK--PEKVVHSKEHQAVSMRAALESIVLLKNENN 441

Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC-- 496
            LPL+K +   +A+IGP    +  +   Y       K++ +G++ Y+  +   YA G   
Sbjct: 442 ILPLSK-STKKVAVIGPNAAEVDNLICRYGPANAPIKTVYQGIKDYLPDADVRYAKGADI 500

Query: 497 ----------HDVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
                     +DVP + D  A   EAV +AK++D  I+V G +     E+  R +L L G
Sbjct: 501 IDKYFPESELYDVPLDKDEQAMIDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDLCG 560

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q  L+ +V  T K PV+L+L  G    +++AE    I  I+   +PGE    A+A+++F
Sbjct: 561 RQEKLLQAVYATGK-PVVLLLVDGRAATINWAE--HYIPGIVHAWFPGEFMGDAVAKVLF 617

Query: 605 GDFNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           GD+NPGG+L +T +P S  ++P        +DS      + +   TGT +Y FG+GLSYT
Sbjct: 618 GDYNPGGKLAVT-FPRSVGQIPFAFPFKPGSDS------KGFVRVTGT-LYPFGYGLSYT 669

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            ++Y      S+L I   +       I  Q   +L         SC          V N 
Sbjct: 670 TFAY------SDLKIENPV-------IGVQGSVKL---------SC---------KVKNT 698

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G V G  VV L+      S  T  K L GF+RVH      K ++F + P ++L + NK  
Sbjct: 699 GKVAGDEVVQLYLHDEMSSVTTYVKVLRGFERVHLEPGEEKTVNFVLTP-QELGLWNKDN 757

Query: 784 RRILPLGNHVLMVGELRHSLTIE 806
             ++  G   +MVG     + ++
Sbjct: 758 HFVVEPGTFAVMVGSSSQDIRLQ 780


>gi|358342292|dbj|GAA27551.2| probable beta-D-xylosidase 7 [Clonorchis sinensis]
          Length = 826

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 226/835 (27%), Positives = 363/835 (43%), Gaps = 150/835 (17%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDN--------ASAIPRLGIPAYEWWSESL 103
           +PF N SL  + R   L++ LT +E IQQ+S+         A  I RL I AY+W +   
Sbjct: 27  HPFRNPSLPANFRVDDLLARLTNEELIQQVSNGGAGPQHGPAPGIARLNISAYQWRTNPG 86

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--- 160
            G                +T FPQ +   A+F+      +  A  +E RA +N  +A   
Sbjct: 87  DG---------------RITPFPQPVNLGATFDVHTVYRVARATGLEMRARWNRAKAKKT 131

Query: 161 -----GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
                G+  +AP +N+ R P WGR QET GEDP ++   A  FV+   G  WK+ +    
Sbjct: 132 YRDGNGIHLFAPVVNLLRHPLWGRNQETFGEDPFMIGKLARTFVRGLGG--WKNAEPQ-- 187

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTF 275
               E+ +       S + D L++ A CKH   +   +    SR SF A +T+ D   T+
Sbjct: 188 -SLDEQNL-------SSQPDVLLVGANCKHFAVHTGPEDFPVSRLSFEANVTDVDLWQTY 239

Query: 276 QPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVA-TI 333
            P FR+C+E G  S +MC+Y+ +NG P C+   L  +  R +W FKG++ +DC A+   I
Sbjct: 240 LPAFRACLEAGAVS-VMCAYSGINGTPDCINHWLLTELLRQKWKFKGFVVTDCGALQFVI 298

Query: 334 FEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAID----KGKVQEKDIDRALLNLF 389
           +++Q +   +E + A V +AG+++           S +      G +    +      LF
Sbjct: 299 WKHQIFNHYNETAMAAV-RAGVNLENSVVYATEVFSTLPHLLASGSLSRDQLIEMARPLF 357

Query: 390 SVQLRLGLFNG---DPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKNDKKFLPLN 445
             +L  G FN    DP    Y  L P++   +E H+++AL    + IVLL+N  +FLPL 
Sbjct: 358 LTRLMQGEFNPVEMDP----YRLLAPEEAILNEDHRRVALATTARSIVLLQNRDRFLPLK 413

Query: 446 KNAVSS------LAIIGPLVNNISQMGGGYTGIPCSPK---SLLRGLEAYVSKTHYASGC 496
            N   S      +AI+GP   +++++ G Y   P  P+    L +GL     + H +  C
Sbjct: 414 NNMSDSGGPLRHIAIVGPFATSVTELYGHYRTAP-EPEIEVPLSKGLSQLSRRMHASDIC 472

Query: 497 HD-VPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSV 553
            D   C+S  D   H  +      D +++  G     E E+ DR ++ LPG+Q  L+   
Sbjct: 473 TDGGRCSSLNDDALHSTLGY-DDLDLIVLSLGTGSEVEGENVDRQNITLPGKQPELLEET 531

Query: 554 AR---------TSKR--PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            +          SKR  P+IL++   GP+++S A  +  + +I W G+PG     A+  +
Sbjct: 532 LKLSSGLGNSGLSKRTVPIILLVFSAGPINISRAVENENVKAIFWCGFPGPLVGDAMRHL 591

Query: 603 IFGD---------------------------------FNPGGRLPMTWYPESFTKVPMND 629
           + G                                  + P  RLP TWY        +  
Sbjct: 592 LLGSSGELFGPSKPISVGFHSFQEAYRWDVTPDDGYWWIPAARLPFTWYESIDQLANITV 651

Query: 630 MNMRADSSRQYPGRSYRFYTGTQV---YGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
             M   + R  P + +      ++   Y FG+GLSY N++   LS  S    S  +   S
Sbjct: 652 YEMTNQTYRYLPTQCHMSSEDCKIPVLYPFGYGLSY-NFN---LSGASGFVYSDLIAPSS 707

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV----- 741
                              V+S   + F+V  +V N G +    VV ++ +         
Sbjct: 708 ------------------AVSSNQRIVFYV--TVQNEGPIACEEVVQVYTKWLNRTENDN 747

Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMV 796
           S+  P  QL GF+RV       K++ F + P E L++ +     ++P G  VL +
Sbjct: 748 SRNGPLIQLAGFERVRLDVGEYKQLKFTLIPSEHLAVWSLSENTMIP-GRGVLQI 801


>gi|189468358|ref|ZP_03017143.1| hypothetical protein BACINT_04755 [Bacteroides intestinalis DSM
           17393]
 gi|189436622|gb|EDV05607.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 865

 Score =  256 bits (653), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 155/459 (33%), Positives = 237/459 (51%), Gaps = 58/459 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N +LS S RA  L+  +TL+EKI Q+ + + AI RLGIPAY+WW+E+LHG+A  G  
Sbjct: 24  PYRNPNLSPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYDWWNEALHGVARAG-- 81

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL--------GQAGLTF 164
                     T FPQ +  AA+F+          V+ EARA Y+         G  GLTF
Sbjct: 82  --------KATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERGGYKGLTF 133

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PNINI+RDPRWGRG ET GEDP + S   +  VK  QG          G G  +K   
Sbjct: 134 WTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGN---------GAGKYDK--- 181

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
                           AC KH   +   +W   +R+SF++  I+++D  +T+ P F++ +
Sbjct: 182 --------------AHACAKHYAVHSGPEW---NRHSFDSKNISQRDLWETYLPAFKTLV 224

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
            +GK   +MC+YN+  G P C    L  +  R +WG+   + SDC A+   F Y N+ +T
Sbjct: 225 TEGKVKEVMCAYNRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGD-FYYPNHHET 283

Query: 343 H---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H   E ++A  + +G D+ CG         A+ KG + E  I+ ++  L   + +LG+F+
Sbjct: 284 HPSAEAASADAVVSGTDLECGGS-YSSLNEAVKKGLITEDKINESVFRLLRARFQLGMFD 342

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            D     + ++    V + EH   AL+ AR+ +VLL N    LPL+K ++  +A++GP  
Sbjct: 343 DDTLVS-WSEIPYSVVESKEHVDKALEMARKSMVLLTNKNNSLPLSK-SIRKVAVLGPNA 400

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSK--THYASGC 496
           N+   +   Y G P    ++L G+ + + +   +Y  GC
Sbjct: 401 NDSVMLWANYNGFPTKSVTILEGIRSKLPEGAVYYEKGC 439



 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 94/312 (30%), Positives = 146/312 (46%), Gaps = 57/312 (18%)

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
           D+    +  +++    A +AD +I V GL    E E+           DR ++ LP  Q 
Sbjct: 582 DIGTKKEIDYNKVAAKAAEADAIIFVGGLSSALEGEEMPVDLPGFKKGDRTNIDLPRVQE 641

Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
            ++ ++ +T K PVI V+  G  L + + EA++ + ++L   YPG+ G  A+A+++FGD+
Sbjct: 642 EMLKALKKTGK-PVIFVVCSGSTLALPW-EAEN-LDAMLEAWYPGQQGGTAVADVLFGDY 698

Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
           NP GRLP+T+Y          D NM          R+YR++ G  ++ FG+GLSYT + Y
Sbjct: 699 NPAGRLPLTFYASDSDLPDFEDYNMS--------NRTYRYFKGKPLFPFGYGLSYTTFDY 750

Query: 668 KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
                          KA  DK  + +TG        D +T        + I + N G +D
Sbjct: 751 G--------------KAKVDKKSI-KTG--------DSMT--------LTIPLKNTGKMD 779

Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR-- 785
           G  VV ++ R P   +G P K L  F RV   A  ++ I   + P       N    R  
Sbjct: 780 GDEVVQVYLRNPADKEG-PIKMLRAFRRVSLKAGQAENIQIEL-PASTFECFNPATNRME 837

Query: 786 ILPLGNHVLMVG 797
           ILP GN+ L+ G
Sbjct: 838 ILP-GNYELLYG 848


>gi|423303577|ref|ZP_17281576.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
           CL03T00C23]
 gi|423307700|ref|ZP_17285690.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
           CL03T12C37]
 gi|392687941|gb|EIY81232.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
           CL03T00C23]
 gi|392689569|gb|EIY82846.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
           CL03T12C37]
          Length = 942

 Score =  255 bits (652), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 209/756 (27%), Positives = 342/756 (45%), Gaps = 129/756 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YRATNFPTQLGLGHTWNRELIRQVGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH  AY   K         +  ++ 
Sbjct: 253 -----------------------------QVAATGKHFAAYSNNKGAREGMARVDPQMSP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
           ++ E+    PF+  I +     +M SYN  +G+P  ++G  +    + R E GF+GY+ S
Sbjct: 284 REVENIHIYPFKRVIREAGMLGVMSSYNDYDGIP--VQGSYYWLTTRLRGEMGFRGYVVS 341

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DAV  ++      K  +++    ++AG+++ C        +   +  + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVI 401

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+  +GLF+   +    G     +V   E++ +AL A+ + +VLLKN  + 
Sbjct: 402 NDRVRDILRVKFLIGLFDAPYQTDLAG--ADREVEKEENEAIALQASHESVVLLKNADEL 459

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
           LPL+ N+   +A+ GP  N        Y  +     ++L G++      ++  Y  GC  
Sbjct: 460 LPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKSKAEVLYTKGCDL 519

Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
                      D P   D  A   +AV  A++AD  +VV G       E++ R SL LPG
Sbjct: 520 VDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAVVVLGGGQRTCGENKSRTSLDLPG 579

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
           +Q+ L+ ++  T K PV+L+L  G PL +++A  D  + +IL   YPG  G  ALA+I+F
Sbjct: 580 RQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKGGTALADILF 636

Query: 605 GDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGH 658
           GD+NPGG+L +T +P++  ++P    N     S Q  G      TG        +Y FG+
Sbjct: 637 GDYNPGGKLTVT-FPKTVGQIP---FNFPCKPSSQIDGGKNPGPTGNMSRINGALYPFGY 692

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT + Y      S+L I+  +                    I    S T     V++
Sbjct: 693 GLSYTTFEY------SDLDITPRV--------------------ITPNESAT-----VRL 721

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            VTN G   G  VV L+ R    S  T EK L GF R+H     ++E+SF +D  + L +
Sbjct: 722 KVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFTID-RKHLEL 780

Query: 779 ANKHGRRILPLGNHVLMVG------ELRHSLTIETY 808
            +   + ++  G+ VLM G       L  +LT+E Y
Sbjct: 781 LDADMKWVVEPGDFVLMAGASSEDIRLNGTLTVEDY 816


>gi|423240769|ref|ZP_17221883.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
           CL03T12C01]
 gi|392643731|gb|EIY37480.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
           CL03T12C01]
          Length = 864

 Score =  255 bits (652), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 160/461 (34%), Positives = 233/461 (50%), Gaps = 62/461 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N++LS   RA+ L+  LTL+EK+  + DN+  + RLGI  Y WW+E+LHG+A +G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AASF       I +AV+ EARA      A        GLT W
Sbjct: 81  -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
            P +NI+RDPRWGRG ET GEDP + S   V  VK  Q    N K D             
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCTDANQKYDK------------ 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          + AC KH   +   +W   +R+ FNA  I  +D  +T+  PF + 
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++GK   +MC+YN++ G P C    L  Q  R EWG++G + SDC A+   +  + + K
Sbjct: 224 VKEGKVKEVMCAYNRLEGDPCCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGH-K 282

Query: 342 TH---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH   E ++A  + +G D+ CG+      +SA  KG + EKDID ++  L   +  LG  
Sbjct: 283 THPNAESASAAAVLSGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEM 341

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           + DP K ++ K+    VC++EH  L+LD AR+ + LL N    LPL +    ++A++GP 
Sbjct: 342 D-DPSKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPN 399

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
            N+     G Y G P    +LL G+ + +    K  Y  GC
Sbjct: 400 ANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/325 (29%), Positives = 147/325 (45%), Gaps = 66/325 (20%)

Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
            Y SG  D   N D GF E V I       K AD VI   G+  + E E+          
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629

Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR  + LP  Q  L+ ++    K+ VI V   G P  ++         +IL   YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
           G KA AE++FGD+NP GRLP+T+Y  +  ++P   D NM         GR+YR++ G  +
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFY-RNIAQLPDFEDYNMT--------GRTYRYFKGDPL 737

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           + FG+GLSYT ++Y        + +  ++K G    ++                      
Sbjct: 738 FPFGYGLSYTTFNY------DNIKLDQTIKVGETAKMV---------------------- 769

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
               I VTNAG+ DG  VV ++ +  + ++G P K L  F RV   A  +  +   + P 
Sbjct: 770 ----IPVTNAGNRDGEEVVQVYLKKQEDAEG-PAKTLRAFKRVQIPAGKTVNVELELTP- 823

Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
           +QL   +     +  + GN  +MVG
Sbjct: 824 KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|325105782|ref|YP_004275436.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
 gi|324974630|gb|ADY53614.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
           12145]
          Length = 748

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 229/786 (29%), Positives = 369/786 (46%), Gaps = 135/786 (17%)

Query: 61  ISTRAKSLISLLTLQEKIQQL----SDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN 116
           I  R   L+S +TLQEKI QL    SD  S  P   I A    S  L  I +   G   N
Sbjct: 33  IEQRVNDLMSKMTLQEKIGQLNQFTSDRTSTGP---ISAN---SNKLKDIEAGKVGSMLN 86

Query: 117 GTVSSVT----------------------------SFPQVLVSAASFNRSLWSNIGSAVA 148
              S  T                            +FP  L  AAS++            
Sbjct: 87  VRGSKDTRMVQEAAMRSRLKIPLLFGLDVIHGYRVTFPIPLAEAASWDLDAIEFASRVAG 146

Query: 149 VEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENW 207
            EA A      AG+ + +AP ++I RD RWGR  E  GED  + S  A   V+ FQG   
Sbjct: 147 REAAA------AGIHWTFAPMVDISRDARWGRVMEGAGEDTYLGSMIAKARVRGFQG--- 197

Query: 208 KSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-I 266
                            KG GE     D +M  AC KH  AY     G      +N + +
Sbjct: 198 -----------------KGLGEL----DAVM--ACVKHFAAYGAAVAGR----DYNTVDM 230

Query: 267 TEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGY 322
           +E+   + + PPF++ ++ G A+  M S+N +NG+PA     L+ D+    + +WGFKG+
Sbjct: 231 SERHLWEVYLPPFKAALDAG-AATFMNSFNDLNGIPATANTYLQRDIL---KGKWGFKGF 286

Query: 323 ITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDI 381
           + SD  +V  +  +  + K +E++A   +K G D++  +   +   +  +   +V E  I
Sbjct: 287 VVSDWGSVREMVPH-GFAKDNEEAAKYAIKGGSDMDMESYAYVNFLEKLVKNKEVSEDLI 345

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           + A+  + + +  +GLF+ DP +    K    ++ ++ +KK+AL+ A++ IVLLKN  + 
Sbjct: 346 NDAVRRILTKKFEMGLFD-DPYRFSDEKREKKELNSAANKKVALEVAQKSIVLLKNQNQV 404

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVSKT---HYASGC 496
           LPL+K+ V S+A++GPL  +   M G ++ +  +    SL  GLE    K    +YA GC
Sbjct: 405 LPLSKD-VKSIALVGPLAKSAEDMKGFWSVVWDNDHLVSLHEGLETKQGKKTQINYAKGC 463

Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
           + +  +  +GF +AV +A+K+D V+V  G       E + R ++ LPG Q  L+  + ++
Sbjct: 464 N-INDDDKSGFADAVEVARKSDVVVVAVGEAFDMSGEAKSRANIHLPGVQEELIKELKKS 522

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K PV++++  G PL  ++   ++   +I++  + G     A+A ++FGD+NP G+LPMT
Sbjct: 523 GK-PVVVLIMAGRPLIFNWTADNAD--AIMYTWWLGSEAGHAMANVLFGDYNPSGKLPMT 579

Query: 617 WYPESFTKVPM----NDMNMRADSSRQYPGRS-YRFYTGTQVYGFGHGLSYTNYSYKFLS 671
            +P S  ++P+    N+      +      RS Y     +  Y FG+GLSYTN++Y    
Sbjct: 580 -FPRSEGQLPIYYNYNNTGRPVINENNIHYRSAYIDSPNSPKYAFGYGLSYTNFAY---- 634

Query: 672 APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
             S LT+S+          L Q GS                   V+ ++ N G   G  V
Sbjct: 635 --SNLTLSSK--------SLNQDGS-----------------ITVKFTLKNDGKYAGEEV 667

Query: 732 VMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGN 791
           V L+ R    S   P K+L  F ++      SKE++F ++  E+LS  NK+   +   G+
Sbjct: 668 VQLYLRDKVASLVRPVKELKDFTKILLKPGESKEVTFVINK-EKLSFYNKNLDWVAEPGD 726

Query: 792 HVLMVG 797
             LM+G
Sbjct: 727 FDLMIG 732


>gi|375254464|ref|YP_005013631.1| glycosyl hydrolase family 3, C-terminal domain-containing protein
           [Tannerella forsythia ATCC 43037]
 gi|363407375|gb|AEW21061.1| glycosyl hydrolase family 3, C-terminal domain protein [Tannerella
           forsythia ATCC 43037]
          Length = 775

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 219/760 (28%), Positives = 335/760 (44%), Gaps = 143/760 (18%)

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           E +  L   A    RLGIP + +  E +HG  + G            T FP  +  A+++
Sbjct: 107 EALNALQKYAMENTRLGIPIF-FAEECMHGHMAIG-----------TTVFPTSIGQASTW 154

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
           NR+L   +G+A+A E R+     Q     + P +++ R+PRW R +ET GEDP++     
Sbjct: 155 NRTLIEKMGAAIAHETRS-----QGAHIAYGPVLDLAREPRWSRVEETFGEDPVLSGILG 209

Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
             FV+  QG+                          D  D     +  KHL AY +   G
Sbjct: 210 SAFVRGLQGK--------------------------DFADGRHTYSTLKHLAAYGIPVGG 243

Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
           +  R    A I  ++       PF   ++ G A  +M SYN V+GVP      + +K  R
Sbjct: 244 HNGR---QAQIGARELIAEHLLPFEMAVKAG-AQSVMTSYNAVDGVPCTSNTYILKKILR 299

Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG-TCMLRHTQSAIDK 373
            EW F G++ SD  ++  I          + +AA  L AG++++ G     R+ + A   
Sbjct: 300 GEWDFNGFVVSDLGSIEGIATTHRVAPDIKHAAAMALNAGVEMDLGGVAYTRNMEQAHTD 359

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
             +   +ID A+  +  ++  +GLF      P +        + + + EH +LA   A +
Sbjct: 360 SLISMSEIDDAVSRILRLKFEMGLFESPYVQPSRTT------EIIRSKEHNRLARKVAEE 413

Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVS 488
            IVLLKN+   LPL+KN + S+A+IGP  +N+    G YT         ++L G+   VS
Sbjct: 414 SIVLLKNNANLLPLSKN-IGSIAVIGPNADNLYNQLGDYTAPQPEEHIVTILEGIRNAVS 472

Query: 489 KT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------------------- 525
            T    Y  GC  V   + +   EAVR A  ++ V++V G                    
Sbjct: 473 PTTVIRYVKGCA-VRDTTQSNIDEAVRAANASNAVVLVVGGSSARDFHTKYIETGAATVS 531

Query: 526 ------LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
                 +   +  E  DR SL L G Q  L+ S+A T K P+I+V   G PL+++   AD
Sbjct: 532 SRENELIPDMESGEGYDRKSLTLLGHQEKLIESIAATGK-PLIMVYIQGRPLNMNL--AD 588

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQ 639
            + S++L   YPGE G  A+A +IFGD NP GRLP++  P S  ++P+            
Sbjct: 589 KKASALLTAWYPGEEGGNAVANVIFGDVNPSGRLPIS-VPRSTGQLPV----------YY 637

Query: 640 YPGRSYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
             G+S  +  GT   +Y FG+GLSYT + Y        LTIS   + G +  +       
Sbjct: 638 SLGKSNDYVEGTSTPLYAFGYGLSYTAFEY------GNLTIS---REGGNITV------- 681

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
                     SCT         VTN G+ DG  VV L+ R    S   P   L  F ++ 
Sbjct: 682 ----------SCT---------VTNTGNTDGDEVVQLYLRDHVASVSVPPVLLKDFAKIS 722

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
                S  ++F + P EQL+  N   +R++  G   +M+G
Sbjct: 723 LKKGESARVNFVLTP-EQLAFFNTDLKRVVEPGEFTVMIG 761


>gi|393781488|ref|ZP_10369683.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676551|gb|EIY69983.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
           CL02T12C01]
          Length = 850

 Score =  255 bits (651), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 162/447 (36%), Positives = 228/447 (51%), Gaps = 62/447 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N  L+   RA  L+  LT++EKI  + +N+  IPRLGI  YEWW+E+LHG+A  G  
Sbjct: 15  PYQNPDLTPEQRATDLLQRLTVEEKISLMQNNSPGIPRLGIRPYEWWNEALHGVARAG-- 72

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEA----RAMYNLGQ----AGLTF 164
                     T FPQ +  AASFN SL   + +AV+ EA    RA  + GQ     GLT 
Sbjct: 73  --------LATVFPQTIGMAASFNDSLVQKVFTAVSDEARAKNRAFNDQGQYKRYQGLTM 124

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP + S   V  VK  Q                     
Sbjct: 125 WTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQ--------------------- 163

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              G +S R D+  L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F++ +
Sbjct: 164 ---GPDSARYDK--LHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKTLV 215

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+  G P C    L  Q  R+EWGF G + SDC A++  +  + +  T
Sbjct: 216 QEADVKEVMCAYNRFEGDPCCGSNRLLTQILRDEWGFNGIVVSDCGAISDFWGAKKHN-T 274

Query: 343 HEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG-LF 398
           H D+A   A  + +G D+ CG+   R    A+  G + E+ ID ++  L   +  LG + 
Sbjct: 275 HPDAAHASADAVLSGTDLECGSN-YRKLTDAVKAGIISEEQIDISVKRLLKARFELGEME 333

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
              P    Y       V   EH+ LAL  A + + LL+N +  LPL+K+A   +A+IGP 
Sbjct: 334 ESHPWALPYSI-----VDCPEHRHLALQIAHETMTLLQNKENILPLDKHA--KVAVIGPN 386

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEA 485
            N+     G Y G P    +LL  L +
Sbjct: 387 ANDSVMQWGNYNGTPSHTSTLLSALRS 413



 Score = 97.1 bits (240), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 75/278 (26%), Positives = 126/278 (45%), Gaps = 60/278 (21%)

Query: 496 CHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQ 545
             D+P N +A   +     K  + VI   G+    E E+           DR  + LP  
Sbjct: 570 VEDIPLNMNATLEKL----KDTEIVIFAGGISPLLEGEEMKVSAAGFKGGDRTDIELPAV 625

Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
           Q +++ ++ +  K+ + +  +G     ++         +IL   YPG+ G  A+A+++FG
Sbjct: 626 QRNVLAALKKAGKKVIFVNFSGSA---MALTPETENCDAILQAWYPGQEGGTAVADVLFG 682

Query: 606 DFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTN 664
           D+NP GRLP+T+Y ++  ++P   D +M+        GR+YR+     ++ FG+GLSYT 
Sbjct: 683 DYNPAGRLPVTFY-KNMEQLPDFEDYSMQ--------GRTYRYMKEAPLFPFGYGLSYTT 733

Query: 665 YSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG 724
           ++Y               KA +DK  +  TG ++                 + I V+N G
Sbjct: 734 FTYG--------------KARADKKRI-STGEKMT----------------LTIPVSNIG 762

Query: 725 DVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
             DG  VV ++ R     +G P K L  F RV  + KG
Sbjct: 763 SRDGEEVVQVYLRREDDPEG-PTKTLRAFKRVE-ITKG 798


>gi|393781363|ref|ZP_10369562.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676856|gb|EIY70278.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
           CL02T12C01]
          Length = 863

 Score =  255 bits (651), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 159/459 (34%), Positives = 233/459 (50%), Gaps = 58/459 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           PF N+ L +  RA+ L+  LTLQEK+  + D +S IPRLGI  Y WW+E+LHG+      
Sbjct: 23  PFNNSDLPVEERAQDLLQRLTLQEKVLLMCDYSSPIPRLGIKRYNWWNEALHGV------ 76

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTF 164
               G     T FPQ +  AA+F+          V+ EARA Y+  +         GLTF
Sbjct: 77  ----GRAGLATVFPQAIGMAATFDDCAVRQAFECVSDEARAKYHHSENKEGSERYQGLTF 132

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP + S   +  V+  QG +    D             
Sbjct: 133 WTPNVNIFRDPRWGRGQETYGEDPYLTSQMGLAVVRGLQGPSESKYD------------- 179

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSCI 283
                         L AC KH   +   +W   +R+SF+   I+ +D  +T+ P F++ +
Sbjct: 180 -------------KLHACAKHYALHSGPEW---NRHSFDVDSISPRDLWETYLPAFKALV 223

Query: 284 EQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           +QG    +MC+YN+  G P C     L+   R EWGF G + SDC A++  F  + + +T
Sbjct: 224 QQGGVKEVMCAYNRFEGEPCCGSNRLLYNILREEWGFDGLVVSDCGAISD-FYLKGHHET 282

Query: 343 H---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H   E + A  +KAG D++CG       Q A+++G + EK ID +L  L   +  LGL +
Sbjct: 283 HPTKEAAVAAAVKAGTDLDCGVDYYA-LQKAVEEGIITEKQIDVSLFRLLKARFELGLMD 341

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            +     +  +    V + +H++ AL+ AR+ + LLKND   LPL+K+    +A+IGP  
Sbjct: 342 -EEHLVSWSDIPYTVVDSEKHREKALEMARKSMTLLKNDHGTLPLSKHC-GKIAVIGPNA 399

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
           N+   M G Y G P    ++L G+   +      Y  GC
Sbjct: 400 NDSVMMWGNYNGFPSHTVTILEGITHKLGAEQIIYDKGC 438



 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 79/269 (29%), Positives = 129/269 (47%), Gaps = 54/269 (20%)

Query: 510 AVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKR 559
           A R+   A+ ++ V G+    E E+           DR  + LP  Q  L+  + +T K 
Sbjct: 594 AARVGD-AEVIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELHKTGK- 651

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
           PVIL+L  G  + +S AE D    +I+   Y G+AG  A+A+++FGD+NP GRLP+T+Y 
Sbjct: 652 PVILILCSGSAIGLS-AEVDLA-DAIIQAWYLGQAGGTAVADVLFGDYNPAGRLPVTFYK 709

Query: 620 ESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
            +       D +M+        GR+YR++ G  ++ FG+GLSYT++              
Sbjct: 710 ATEQLPDFEDYSMQ--------GRTYRYFEGEALFPFGYGLSYTSF-------------- 747

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
                       +   +RL    I E  S +     ++++V N G +DG  V+ ++ R  
Sbjct: 748 ------------EIGKARLSKKRIRENESVS-----LKLTVENTGKLDGDEVIQIYIRKL 790

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
           +  +G P K L  F R H  A   K+++F
Sbjct: 791 QDKEG-PLKTLRAFKRFHLRAGEKKDVTF 818


>gi|253574420|ref|ZP_04851761.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251846125|gb|EES74132.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 782

 Score =  255 bits (651), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 216/759 (28%), Positives = 345/759 (45%), Gaps = 133/759 (17%)

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           E +  +   A    RLGIP      E  HG  + G            T FP  L   +++
Sbjct: 106 EAVNAIQRYAIENSRLGIPIL-IGEECSHGHMAIG-----------ATVFPVPLSLGSTW 153

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
           N  L+  +  AVA E RA     Q G   ++P +++ RDPRWGR +E  GED  ++S  A
Sbjct: 154 NVELYREMCRAVARETRA-----QGGAVTYSPVLDVVRDPRWGRTEECFGEDAYLISEMA 208

Query: 196 VEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
           V  V+  QGE+   +D                           ++A  KH + Y   + G
Sbjct: 209 VASVEGLQGESLDGEDS--------------------------VAATLKHFVGYGSSEGG 242

Query: 256 NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-R 314
              R +    +  ++  +    PFR  +E G AS IM +YN+++GVP     +L     R
Sbjct: 243 ---RNAGPVHMGRRELLEVDLLPFRKAVEAGAAS-IMPAYNEIDGVPCTTNEELLDGVLR 298

Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDK 373
            EWGF G + +DC A+  +    +  +   D+A   ++AG+D+     M  +H   A+  
Sbjct: 299 GEWGFDGMVITDCGAIDMLASGHDVAEDGRDAAIQAIRAGIDMEMSGVMFGKHLVEAVRS 358

Query: 374 GKVQEKDIDRALLNLFSVQLRLGLFN---GDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
           G+++E+ +DRA+  + +++ RLGLF     DP + +        + ++EH +LA   A +
Sbjct: 359 GQLEEEVLDRAVRRVLTLKFRLGLFERPYADPERAER------VIGSAEHVELARQLASE 412

Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVS 488
           G+VLLKN    LPL+ +A  ++A+IGP  +      G YT      K  ++L G+ + ++
Sbjct: 413 GVVLLKNKDGVLPLSADA-GTIAVIGPNADAGYNQLGDYTSPQPRSKVTTVLGGIRSKLA 471

Query: 489 KTH----YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----------LDLTQET- 532
           +T     YA GC  +  NS  GF  A+  A+KAD V++V G           +DL     
Sbjct: 472 ETPERVLYAPGCR-INGNSREGFDVALSCAEKADTVVMVVGGSSARDFGEGTIDLRTGAS 530

Query: 533 -------------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
                        E  DR++L L G Q+ L+  + +  K P+++V   G P+   +   D
Sbjct: 531 KVTDNAESDMDCGEGIDRMNLSLSGVQLELIQEIHKLGK-PLVVVYINGRPIAEPWI--D 587

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQ 639
               +IL   YPG+ G  A+A+I+FGD NP GRL ++  P+   +VP+     R+   R 
Sbjct: 588 EHADAILEAWYPGQEGGHAIADILFGDVNPSGRLTIS-IPKHVGQVPVYYHGKRSRGKRY 646

Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLD 699
             G S   Y       FG+GLSYT ++Y  L   S+ TI+   K GS K           
Sbjct: 647 LEGDSQPRYP------FGYGLSYTEFTYNNLKLESD-TIN---KDGSTK----------- 685

Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
                           V + VTN G+  G+ V+ L+          P K+L GF ++   
Sbjct: 686 ----------------VTVEVTNVGERAGAEVIQLYITDVASKVTRPAKELKGFRKIFLQ 729

Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
              ++ + F V P EQL    ++ + ++  G   + VG+
Sbjct: 730 PGETQTVEFTVGP-EQLQYIGQNYKPVVEPGEFRVHVGK 767


>gi|336412865|ref|ZP_08593218.1| hypothetical protein HMPREF1017_00326 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942911|gb|EGN04753.1| hypothetical protein HMPREF1017_00326 [Bacteroides ovatus
           3_8_47FAA]
          Length = 800

 Score =  254 bits (650), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 211/735 (28%), Positives = 342/735 (46%), Gaps = 127/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAN 198

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E+ GEDP +V     + +   Q E   
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQNEG-- 250

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGY+ SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
           +AV  +      T T E+ AA V+ AG++I          +   + AID+GKV    +++
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAIDEGKVSLHTLNQ 400

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
            +  +  V+  +GLF+    GD R+       P+ V  ++ HK +++ AA + IVLLKN+
Sbjct: 401 RVSEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESIVLLKNE 453

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
            + LPL+KN    +A+IGP    + ++   Y     S K++ +G++ Y+  S+  YA GC
Sbjct: 454 NQMLPLSKN-FKKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAKGC 512

Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       ++VP ++   A  HEAV +AK +D  I+V G +     E+  R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 572

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PV+LV+  G    +++  A+  + +I+   +PGE    A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NPGGRL +T +P+S  ++P      + DS  +   R         +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFA-FPFKPDSDSKGKVR-----VDGVLYPFGYGLSY 682

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L IS  +  G  +NI                 SCT         V N
Sbjct: 683 TIFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T +K L GF+R+H      + +SF + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLGLWDKN 770

Query: 783 GRRILPLGNHVLMVG 797
            +  +  G+  +MVG
Sbjct: 771 NQFTVEPGSFSVMVG 785


>gi|365122193|ref|ZP_09339098.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363642907|gb|EHL82241.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 853

 Score =  254 bits (650), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 161/437 (36%), Positives = 226/437 (51%), Gaps = 57/437 (13%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           R   L+S LT++EKI  L   +  IPRLGI  Y   +E+LHG+   G   NF       T
Sbjct: 40  RIMDLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG---NF-------T 89

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNINIFR 173
            FPQ +  A+ +N  L   I +A++ EAR  +N    G          LTFW+P +N+ R
Sbjct: 90  VFPQAIGLASMWNPELLYEISTAISDEARGRWNELNRGKDQKGFFSDLLTFWSPTVNMAR 149

Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           DPRWGR  ET GEDP +     V FVK  QG               + R LK        
Sbjct: 150 DPRWGRTPETYGEDPFLSGKLGVAFVKGLQG--------------NDPRYLK-------- 187

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
                + +  KH  A + E     +R+  N  I+E++  + + P F SCI++GKA  IM 
Sbjct: 188 -----IVSTPKHFAANNEEH----NRFECNPHISERNLREYYLPAFESCIKEGKAQSIMS 238

Query: 294 SYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
           +YN +N VP  L   L  Q  R EWGF GY+ SDC     +  +  Y KT E +A   +K
Sbjct: 239 AYNAINDVPCTLNPWLLTQVLRKEWGFNGYVVSDCGGPGFLVTHHKYVKTPEAAATLSIK 298

Query: 353 AGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
           AG+D+ CG    +    +A  +  V + DID A   +   ++ LGLF+ DP K  Y  + 
Sbjct: 299 AGLDLECGDNVYIEPLMNAYKQCMVTDADIDTAAYRILRARMMLGLFD-DPEKNPYNAIS 357

Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
           P  V   +H++LAL+AARQ +VLLKN+K FLPLN   V S+A++G  +N  +   G Y+G
Sbjct: 358 PSIVGCEKHRQLALEAARQSLVLLKNEKNFLPLNPKKVKSIAVVG--INAGNCEFGDYSG 415

Query: 472 IPC-SPKSLLRGLEAYV 487
            P  +P S+L G++  V
Sbjct: 416 TPVNAPVSVLEGIKNRV 432



 Score =  137 bits (344), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 89/292 (30%), Positives = 140/292 (47%), Gaps = 47/292 (16%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA R  ++ D  + V G++ + E E +DR ++ LP  Q   +    + +   V+ VL 
Sbjct: 594 YGEAGRAIRECDVTVAVLGINKSIEREGQDRYTIELPADQQLFIKEAYKANPNTVV-VLV 652

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L +++   D  I +IL   YPGE G  A+AE +FGD+NPGGRLP+T+Y  S  ++P
Sbjct: 653 AGSSLAINWI--DENIPAILNAWYPGEQGGTAVAEALFGDYNPGGRLPLTYY-RSLDELP 709

Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
             +D +++        GR+Y ++    +Y FG+GLSYT + YK L   SE++  A     
Sbjct: 710 AFDDYDIQK-------GRTYMYFENKPLYPFGYGLSYTRFDYKNLK--SEVSDDA----- 755

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
                                        +++ +V N G   G  V  ++ R P+     
Sbjct: 756 ----------------------------VNLKFTVKNTGKYAGDEVAQVYVRFPESGIKV 787

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           P KQL GF+RVH     S ++S  +   E      K G+   P GN++ MVG
Sbjct: 788 PLKQLKGFERVHIGKGKSAQVSVSIPKKELRLWDEKDGKFYTPSGNYIFMVG 839


>gi|218258058|ref|ZP_03474485.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
           DSM 18315]
 gi|218225777|gb|EEC98427.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
           DSM 18315]
          Length = 955

 Score =  254 bits (650), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 204/734 (27%), Positives = 335/734 (45%), Gaps = 113/734 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 161 RLGIPT-DFTNEGIRGVES-----------YIATNFPTQLGLGHTWNRDLVHKVGYITGR 208

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           E R +      G T  +AP +++ RD RWGR +E  GE P +V+   VE  K  Q     
Sbjct: 209 EGRLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGVEMAKGMQ----- 257

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                      +  ++A  KH IAY   K G       +  ++ 
Sbjct: 258 --------------------------TDYQVAATSKHYIAYSNNKGGREGMARVDPQMSP 291

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ E     P++  I++     +M SYN  +G P       L  + R E+GF+GY+ SD 
Sbjct: 292 REVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDS 351

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  +F         ++S    + AG++I C        +   +  I +G +    ID 
Sbjct: 352 DAVEYLFSKHGTAADMKESVLQSVLAGLNIRCTFRSPDSYVLPLRELIAEGALPMSTIDD 411

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+  +GLF+  P +    K    +V ++E++++AL A+++ +VLLKN    LP
Sbjct: 412 RVRDILRVKFLVGLFD-QPYQIDL-KQADKEVNSAENQQVALQASKESLVLLKNQDAVLP 469

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
           L+ N +S +A+ GP  +  +     Y  +     ++L G++  V   ++  +  GC  V 
Sbjct: 470 LDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEVLFTKGCDLVD 529

Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
            N                +  ++AV  AKK+D  +VV G       E++ R SL LPG+Q
Sbjct: 530 ANWPESELIRYPLTSEEQSEINKAVENAKKSDVAVVVLGGSNRTCGENKSRSSLELPGRQ 589

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
           + L+ +V  T K PV+LVL  G P+ +++A  D  + +IL   YPG  G  A+A+ +FGD
Sbjct: 590 LDLLQAVVATGK-PVVLVLINGRPISINWA--DKYVPAILEAWYPGSQGGTAIADALFGD 646

Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           +NPGG+L +T +P++  ++P N     N + D  R              +Y FG+GLSYT
Sbjct: 647 YNPGGKLTVT-FPKTVGQIPFNFPTKPNAQVDGGRNKGLDGNMSRVNGPLYPFGYGLSYT 705

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            + Y      S+++I  ++                    + +V   T     V+  VTN 
Sbjct: 706 TFEY------SDISIQPAI--------------------VTQVQPVT-----VRCKVTNT 734

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G   G  VV L+ R    S  T EK L+GFDR+H     +KE++F ++P   L + N   
Sbjct: 735 GKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQLLNSDN 793

Query: 784 RRILPLGNHVLMVG 797
             ++  G+  +MVG
Sbjct: 794 HWVVEPGDFKVMVG 807


>gi|317474222|ref|ZP_07933498.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909532|gb|EFV31210.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 954

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 212/764 (27%), Positives = 353/764 (46%), Gaps = 120/764 (15%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDN--ASAIPRLGIPAYEWWSESLHGIASNGP 111
           + +  L ++ R +SL+S++T ++K++ + +      IP L +P      E++HG +    
Sbjct: 170 YMDPRLPVNGRVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYG-- 226

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINI 171
                   S  T FPQ L   A++N++L   +  A+  E      L    +  W+P +++
Sbjct: 227 --------SGATIFPQALAMGATWNKNLTEKVAMAIGEET-----LAAGTMQAWSPVLDV 273

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            +D RWGR +ET GEDP++VS     ++K +Q +                          
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQSKG------------------------- 308

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCI 291
                  L    KH   +     G   R S +  ++E++  +    PFR  I       +
Sbjct: 309 -------LFTTPKHFGGHGAPLGG---RDSHDIGLSEREMREIHLVPFRHVIRNYACQSL 358

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG- 349
           M +Y+   GVP     +L +   R EWGF G++ SDC A+  +   ++YT  ++  AA  
Sbjct: 359 MMAYSDFLGVPVAKSKELLRNILREEWGFDGFVVSDCGAIGNLTSRKHYTAKNKIEAANQ 418

Query: 350 VLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG-KY 407
            L AG+  NCG T   +    A   G++   ++D     +  +  R  LF  + ++   +
Sbjct: 419 ALAAGIATNCGDTYNDKEVIQAAKDGRLDMANLDNVCRTMLRMMFRNELFEKEHKEPLDW 478

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
            K+ P    +  HK++A  +AR+ IV+L+N    LPL+K+ V ++A++GP  +N+    G
Sbjct: 479 NKIYPG-WNSDSHKEIARQSARESIVMLENKDDLLPLSKH-VRTIAVLGPGADNLQP--G 534

Query: 468 GYTGI--PCSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
            YT    P   KS+L G++  V   +K  Y  GC     N +     AV++A ++D V++
Sbjct: 535 DYTPKLRPGQLKSVLTGIKQAVGNQTKILYEQGCEFTGSNGE-NIPNAVKVAAQSDVVVL 593

Query: 523 VAGLDLTQET---------EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           V G   T E          E+ D  +L+LPG+Q  L+ +V  T K PV+L+L  G P ++
Sbjct: 594 VLGDCSTSEATTDVYKTSGENHDYATLILPGRQQELLEAVCATGK-PVVLLLQIGRPYNL 652

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
           +      +   + W+  PG+ G  A A+++FGD+NP GRLPMT +P    ++P+   N +
Sbjct: 653 TKESELCKAIIVNWL--PGQEGGLATADVLFGDYNPAGRLPMT-FPRHVGQLPLY-YNFK 708

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
             S R+Y      +Y    +Y FG+GLSYT++ Y      S+L +        + NI+  
Sbjct: 709 T-SGRRYEYLDLEYYP---LYYFGYGLSYTSFEY------SDLKVEER----ENGNII-- 752

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                                   ++V N G   G  VV L+      S  T   +L  F
Sbjct: 753 ----------------------AHVNVKNIGYRAGDEVVQLYVTDMYASVKTRITELKDF 790

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            RVH     SK ISF + P E LS+ N +  R++  G   ++VG
Sbjct: 791 TRVHLRPGESKSISFELTPYE-LSLLNDNMDRVVEKGTFKILVG 833


>gi|380696428|ref|ZP_09861287.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 851

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 158/447 (35%), Positives = 228/447 (51%), Gaps = 57/447 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N +  +  R   LIS LT++EKI  L   +  IPRLGI  Y   +E+LHG+   G   
Sbjct: 28  YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 84

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  AA++N  L   + + ++ EARA +N    G          LT
Sbjct: 85  -------RFTVFPQAIGLAATWNPELQRRVATVISDEARARWNELDQGRAQKEQFSDVLT 137

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FVK  QG                   
Sbjct: 138 FWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG------------------- 178

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D    L + +  KH  A + E     +R+  N  I+E+   + + P F  C+
Sbjct: 179 --------DDPHYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 226

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++GKA+ IM +YN +N VP  L   L QK  R +WGF+GY+ SDC   A +     Y KT
Sbjct: 227 KEGKAASIMSAYNALNDVPCTLNAWLLQKVLRKDWGFQGYVVSDCGGPALLVNAHKYLKT 286

Query: 343 HEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   LKAG+D+ CG  +      +A  +  V + DID A  ++ + +++LGLF+G 
Sbjct: 287 KEAAATLSLKAGLDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDGV 346

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
            R   Y K+ P  + + EH+++ALDAARQ IVLLKN K  LPLN + + S+A++G  +N 
Sbjct: 347 ER-NPYTKISPSVIGSKEHQQIALDAARQCIVLLKNQKNMLPLNASKLKSIAVVG--INA 403

Query: 462 ISQMGGGYTGIP-CSPKSLLRGLEAYV 487
                G Y+G P   P S+L+G+   V
Sbjct: 404 GKCEFGDYSGAPVVEPVSILQGIRNRV 430



 Score =  125 bits (315), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 82/302 (27%), Positives = 143/302 (47%), Gaps = 47/302 (15%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +  ++ + VI V G++ + E E +DR  + LP  Q   +  + + +   +I++L 
Sbjct: 592 YGEAGKAVRECETVIAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNSN-MIVILV 650

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L +++   D  + +I+   YPGE G  A+AE++FGD+NP GRLP+T+Y       P
Sbjct: 651 AGSSLAINWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKSLDELPP 708

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
            +D ++         GR+Y+++ G  +Y FG+GLSY+++ Y  L    E           
Sbjct: 709 FDDYDITK-------GRTYKYFKGEVLYPFGYGLSYSSFKYSDLRVKDE----------- 750

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                            DEV         V   + N G  +G  V  ++ R+P+     P
Sbjct: 751 ----------------ADEVA--------VSFRLKNTGKRNGDEVTQVYVRIPETGGIVP 786

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVGELRHSLTI 805
            K+L GF RV   +  S+ +   ++  EQL   +   G+ ++P G   +MVG     + +
Sbjct: 787 VKELKGFRRVPLKSGESRRVEIRLNK-EQLRYWDVGKGQFVVPKGTFDIMVGASSKDIRL 845

Query: 806 ET 807
           +T
Sbjct: 846 QT 847


>gi|393786911|ref|ZP_10375043.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
           CL02T12C05]
 gi|392658146|gb|EIY51776.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
           CL02T12C05]
          Length = 863

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 153/459 (33%), Positives = 230/459 (50%), Gaps = 58/459 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           PF N  L +  R + L+  LTL EK+  + D +S++PRLGI  Y WW+E+LHG+      
Sbjct: 23  PFNNPDLPVEERVEDLVRRLTLHEKVLLMCDYSSSVPRLGIKQYNWWNEALHGV------ 76

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ--------AGLTF 164
               G     T FPQ +  AA+F+      +   V+ EARA Y+  +         GLTF
Sbjct: 77  ----GRAGLATVFPQAIGMAATFDDCAVKQVFECVSDEARAKYHHSENKDGSERYRGLTF 132

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP + S   +  V+  QG +    D             
Sbjct: 133 WTPNVNIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPSESKYD------------- 179

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
                         L AC KH   +   +W   +R+ F+   I+ +D  +T+ P F++ +
Sbjct: 180 -------------KLHACAKHYALHSGPEW---NRHRFDVENISPRDLWETYLPAFKALV 223

Query: 284 EQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           +QG    +MC+YN+  G P C     L+   R EWGF G + SDC A++  F  + + +T
Sbjct: 224 QQGGVKEVMCAYNRFEGEPCCGSNRLLYNILREEWGFDGLVVSDCGAISD-FYLKGHHET 282

Query: 343 H---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H   E + A  +KAG D++CG    +  + A++KG + EK ID +L  L   +  LGL +
Sbjct: 283 HSTKESAVAAAVKAGTDLDCGV-DYQSLEKAVEKGIITEKQIDVSLSRLLKARFELGLMD 341

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            +     +  +    V + +H+  AL+ AR+ + LLKN    LPL+K+    + +IGP  
Sbjct: 342 -EEHLVSWSDIPYTVVDSEKHRAKALEVARKSMTLLKNKNGTLPLSKHC-GKIVVIGPNA 399

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVS--KTHYASGC 496
           N+   M G Y G P    ++L G+   +   +  Y  GC
Sbjct: 400 NDSIMMWGNYNGFPSHTVTILEGITHKLDAGQVIYDKGC 438



 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 82/270 (30%), Positives = 126/270 (46%), Gaps = 53/270 (19%)

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
           +E       A+ ++ V G+    E E+           DR  + LP  Q  L+  + +T 
Sbjct: 591 NEIAATVSDAEAIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELYKTG 650

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           K P+IL+L  G  + +S AE D    +I+   YPG+AG  A+A+++FGD+NP GRLP+T+
Sbjct: 651 K-PIILILCSGSAIGLS-AEVDLA-DAIIQAWYPGQAGGTAVADVLFGDYNPAGRLPVTF 707

Query: 618 YPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
           Y  +       D NM+        GR+YR++ G  ++ FG+GLSYT+    F    ++L 
Sbjct: 708 YKTTEQLPDFEDYNMQ--------GRTYRYFKGEALFPFGYGLSYTS----FEIGKAQL- 754

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
                   S K I       LD                  + + N G+ DG  V+ ++ R
Sbjct: 755 --------SKKRIHANESVNLD------------------LWIKNTGERDGEEVIQVYIR 788

Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
             K  +G P K L  F RVH  +   K+IS
Sbjct: 789 KLKDKEG-PLKTLRAFKRVHVKSGEKKQIS 817


>gi|423293673|ref|ZP_17271800.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
           CL03T12C18]
 gi|392677631|gb|EIY71047.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
           CL03T12C18]
          Length = 800

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 212/735 (28%), Positives = 340/735 (46%), Gaps = 127/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIGEIAKVTAD 198

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E+ GEDP +V     + +   Q E   
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQNEG-- 250

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGYI SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYIVSDS 340

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
           +AV  +      T T E+ AA V+ AG++I          +   + AI++GKV    +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAINEGKVSLHTLDQ 400

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
            +  +  V+  +GLF+    GD R+       P+ V  ++ HK +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESIVLLKNE 453

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
            + LPL+KN  S +A+IGP    + ++   Y     S K++ +G++ Y+  S+  YA GC
Sbjct: 454 NQMLPLSKN-FSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAKGC 512

Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       ++VP ++   A   EAV +AK +D  I+V G +     E+  R +L L
Sbjct: 513 DIIDKYFPESELNNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 572

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PVILV+  G    +++  A+  I +I+   +PGE    A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVILVMVDGRAATINW--ANKYIPAIIHAWFPGEFMGDAIAKV 629

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NPGGRL +T +P+S  ++P          S+       +      +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYGLSY 682

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L IS  +  G  +NI                 SCT         V N
Sbjct: 683 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T +K L GF+R+H      + ++F + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKN 770

Query: 783 GRRILPLGNHVLMVG 797
            R  +  G+  +MVG
Sbjct: 771 NRFTVEPGSFSVMVG 785


>gi|423303655|ref|ZP_17281654.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
           CL03T00C23]
 gi|423307623|ref|ZP_17285613.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
           CL03T12C37]
 gi|392688019|gb|EIY81310.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
           CL03T00C23]
 gi|392689492|gb|EIY82769.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
           CL03T12C37]
          Length = 801

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 213/736 (28%), Positives = 335/736 (45%), Gaps = 129/736 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L + IG A   
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKELIAQIGEA--- 195

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EAR    LG   +  ++P ++I +DPRWGR  ET GEDP        + + S Q      
Sbjct: 196 EAREASVLGYTNI--YSPILDIAQDPRWGRCVETYGEDPYHAGQMGKQMILSLQKNK--- 250

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                        L +  KH   Y +   G   +   +  +  +
Sbjct: 251 -----------------------------LVSTPKHFAVYSIPVGGRDGKTRTDPHVAPR 281

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
           +    +  PFR    +  A  +M SYN  +G P  + G    L +  R EWGFKGY+ SD
Sbjct: 282 EMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 339

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
            +AV  I         +ED+ A  + AG++I          +   +SA+ KGK+ ++ ++
Sbjct: 340 SEAVEFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFILPLRSAVKKGKISQETLN 399

Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
           + +  +  V+  LGLF+ +P +G   + G   V + EH++LAL+AARQ +VLLKN+ + L
Sbjct: 400 QRVAEILRVKFWLGLFD-NPYRGDEKRAG-QIVHSPEHQQLALEAARQSLVLLKNEHQTL 457

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCH--- 497
           PL+K ++ S+A+IGP  +   Q+   Y        ++  G++  + +    Y  GC    
Sbjct: 458 PLSK-SIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVVYKKGCDIID 516

Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
                    + P  +       EA+  AK A+  ++V G +     EDR R SL LPG+Q
Sbjct: 517 PHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSRTSLDLPGRQ 576

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             L+  + +  K PV+LV+  G    ++FA   + + +I+   +PGE G +A+AE +FGD
Sbjct: 577 EELLKKICQLGK-PVVLVMIDGRASSINFAA--THVPAIIHAWFPGEFGGQAIAEALFGD 633

Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
           +NPGGRL +T +P+S  ++P        +D S +           T VYG    FGHGLS
Sbjct: 634 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDESSE-----------TSVYGALYPFGHGLS 681

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT + Y      S+L IS S K G   NI                 SCT         + 
Sbjct: 682 YTTFQY------SDLVISPS-KQGVQGNI---------------SISCT---------IK 710

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G  +G  VV L+ R    S  T  + L GF+R+    + S  + F + P ++L I +K
Sbjct: 711 NIGQREGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTP-QELGIWDK 769

Query: 782 HGRRILPLGNHVLMVG 797
                +  G   +M+G
Sbjct: 770 QMNFTVEPGMFKVMIG 785


>gi|293373755|ref|ZP_06620101.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292631245|gb|EFF49877.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 800

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 209/735 (28%), Positives = 340/735 (46%), Gaps = 127/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAD 198

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E+ GEDP +V     + +   Q E   
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQNEG-- 250

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGY+ SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
           +AV  +      T T E+ AA V+ AG++I          +   + AI++GKV    +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAINEGKVSLHTLDQ 400

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
            +  +  V+  +GLF+    GD R+       P+ V  ++ HK +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESIVLLKNE 453

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
            + LPL+KN  S +A+IGP    + ++   Y     S K++ +G++ Y+  S+  Y  GC
Sbjct: 454 NQMLPLSKN-FSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYVKGC 512

Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       ++VP ++   A  HEAV +AK +D  I+V G +     E+  R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFSRTNLDL 572

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PV+LV+  G    +++  A+  + +I+   +PGE    A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NPGGRL +T +P+S  ++P          S+       +      +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYGLSY 682

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L IS  +  G  +NI                 SCT         V N
Sbjct: 683 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T +K L GF+R+H      + ++F + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKN 770

Query: 783 GRRILPLGNHVLMVG 797
            R  +  G+  +MVG
Sbjct: 771 NRFTVEPGSFSVMVG 785


>gi|423342899|ref|ZP_17320613.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409217154|gb|EKN10133.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 955

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 204/734 (27%), Positives = 334/734 (45%), Gaps = 113/734 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 161 RLGIPT-DFTNEGIRGVES-----------YIATNFPTQLGLGHTWNRDLVHKVGYITGR 208

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           E R +      G T  +AP +++ RD RWGR +E  GE P +V+   VE  K  Q     
Sbjct: 209 EGRLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGVEMAKGMQ----- 257

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                      +  ++A  KH IAY   K G       +  ++ 
Sbjct: 258 --------------------------TDYQVAATSKHYIAYSNNKGGREGMARVDPQMSP 291

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ E     P++  I++     +M SYN  +G P       L  + R E+GF+GY+ SD 
Sbjct: 292 REVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDS 351

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  +F         ++S    + AG++I C        +   +  I +G +    ID 
Sbjct: 352 DAVEYLFSKHGTAADMKESVLQSVLAGLNIRCTFRSPDSYVLPLRELIAEGALPMSTIDD 411

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+  +GLF+  P +    K    +V ++E++++AL A+++ +VLLKN    LP
Sbjct: 412 RVRDILRVKFLVGLFD-QPYQIDL-KQADKEVNSAENQQVALQASKESLVLLKNQDAVLP 469

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
           L+ N +S +A+ GP  +  +     Y  +     ++L G++  V   ++  +  GC  V 
Sbjct: 470 LDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGTEVLFTKGCDLVD 529

Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
            N                +   +AV  AKK+D  +VV G       E++ R SL LPG+Q
Sbjct: 530 ANWPESELIRYPLTSEEQSEIDKAVENAKKSDVAVVVLGGSNRTCGENKSRSSLELPGRQ 589

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
           + L+ +V  T K PV+LVL  G P+ +++A  D  + +IL   YPG  G  A+A+ +FGD
Sbjct: 590 LDLLQAVVATGK-PVVLVLINGRPISINWA--DKYVPAILEAWYPGSQGGTAIADALFGD 646

Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           +NPGG+L +T +P++  ++P N     N + D  R              +Y FG+GLSYT
Sbjct: 647 YNPGGKLTVT-FPKTVGQIPFNFPTKPNAQVDGGRNKGLDGNMSRVNGPLYPFGYGLSYT 705

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            + Y      S+++I  ++                    + +V   T     V+  VTN 
Sbjct: 706 TFEY------SDISIQPAI--------------------VTQVQPVT-----VRCKVTNT 734

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G   G  VV L+ R    S  T EK L+GFDR+H     +KE++F ++P   L + N   
Sbjct: 735 GKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQLLNSDN 793

Query: 784 RRILPLGNHVLMVG 797
             ++  G+  +MVG
Sbjct: 794 HWVVEPGDFKVMVG 807


>gi|160890986|ref|ZP_02071989.1| hypothetical protein BACUNI_03433 [Bacteroides uniformis ATCC 8492]
 gi|156859207|gb|EDO52638.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           uniformis ATCC 8492]
          Length = 869

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 243/858 (28%), Positives = 367/858 (42%), Gaps = 178/858 (20%)

Query: 10  LRERILFRVLQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLI 69
           L +RIL + L   ++  +IA  +           C P + DS      S+S+  R   L+
Sbjct: 3   LEDRILKKWLLPFILSVLIAVDSMAQ--------CIPVYKDS------SMSVERRVTDLL 48

Query: 70  SLLTLQEKIQQLS-------------------DNASAIP--------------------- 89
             +TL+EKI QLS                   D A  I                      
Sbjct: 49  GRMTLREKIAQLSHLHGYQLYNGQEVDYQKLRDAAGDISYGCIEGFNLTGENVRKAFHAI 108

Query: 90  --------RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWS 141
                   RLGIP +   +ESLHG   +G            T FPQ +   ++FN  L  
Sbjct: 109 QKYMVEETRLGIPVFTV-TESLHGSVHDGS-----------TIFPQSVAVGSTFNLDLAY 156

Query: 142 NIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKS 201
            +  A+A E R+     Q  +   +P +++ RD RWGR +E+ GEDP +V    +  VK 
Sbjct: 157 QMTKAIATELRS-----QGVIQTLSPGLDVVRDLRWGRVEESFGEDPWLVGQMGIAQVKG 211

Query: 202 FQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYS 261
           +        DGGI        +LK FG        L L++                    
Sbjct: 212 YI-------DGGIS------PMLKPFGPGGAPLGGLNLASVES----------------- 241

Query: 262 FNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFK 320
                 E+D  +    P+   +   +   +M SYN  NG+P      L     RNEWGFK
Sbjct: 242 -----GERDIRNIHIKPYEMAVRNTEVKAVMTSYNSWNGIPNSASSYLLTNILRNEWGFK 296

Query: 321 GYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKD 380
           GY+ SD  AVA + ++Q+  K   ++A   L AG+D+   +      +  I++G+  EK 
Sbjct: 297 GYVYSDWGAVAMLKDFQHTAKDDSEAAIQALTAGVDLEASSNCYWALEQLIEQGRFDEKY 356

Query: 381 IDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKK 440
           +D A+  +  V+  LGLF  +P +G    +    + T E  +L+   A + IVLLKN+  
Sbjct: 357 VDLAVGRILRVKFELGLFE-NPYQG--ADMPGVAMRTKEAVELSRRVADESIVLLKNENT 413

Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSL--LRGLEAYVS---KTHYASG 495
            LPLN N + SLA+IGP  N +    G YT    +   +  L GL+  V    K +YA+G
Sbjct: 414 LLPLNLNKIKSLAVIGPNANQVQ--FGDYTWSRSNKDGVTPLEGLKKRVGNKIKINYAAG 471

Query: 496 CHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--EDRDRVSLLLPGQQ 546
           C D+  ++ +GF EAV   K +D  +V  G        D +  T  E  D  SL L G Q
Sbjct: 472 C-DLITDNKSGFDEAVAAVKASDMAVVFVGSSSASLARDYSDATCGEGFDLSSLDLTGVQ 530

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             LV  +    K PVI+VL  G P  +S+ +    I +I+   Y GE    A+A+++ G+
Sbjct: 531 EELVEEIYAIGK-PVIVVLVTGKPFSISWIK--EHIPAIVVQWYGGEKAGDAIADMLLGN 587

Query: 607 FNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
            NP  +LP + +P+S       +  +P +    R       PGR Y F +   ++ FGHG
Sbjct: 588 INPSAKLPFS-FPQSVGHLPVFYNHLPTDKGFYRRPGRPNEPGRDYVFSSPAPLWSFGHG 646

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT + Y  L+A      SA L   SD  I                         V +S
Sbjct: 647 LSYTTFEY--LNA----HYSAELLHPSDTLI-------------------------VSVS 675

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           + N G V G  VV L+ R    S  TP KQL  F +   +  G  +      P ++L++ 
Sbjct: 676 LKNTGSVAGKEVVQLYVRDVVSSVVTPVKQLKAFSKPF-LQPGEMQTVVLKLPIQELALY 734

Query: 780 NKHGRRILPLGNHVLMVG 797
           +   ++++  G + + +G
Sbjct: 735 DLSMKKVVEEGEYEIQIG 752


>gi|373955060|ref|ZP_09615020.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373891660|gb|EHQ27557.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 754

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 205/672 (30%), Positives = 320/672 (47%), Gaps = 105/672 (15%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFW--APNINIFRDPRWGRG 180
           T+FP  L  AAS++         A+ + AR       A    W  AP ++I RDPRWGR 
Sbjct: 124 TTFPIPLAEAASWDMD-------AIRLSARIAATEAAASGIHWTFAPMVDISRDPRWGRV 176

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
            E  GEDP + S  A+  VK FQG+   S D                           + 
Sbjct: 177 MEGAGEDPYLGSQIAIARVKGFQGDGLGSLDA--------------------------VM 210

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVN 299
           AC KH  AY     G      +NA+ ++++   + + PPF++ ++ G A+  M S+N++N
Sbjct: 211 ACAKHFAAYGAAIAGR----DYNAVDMSDRMLWEVYLPPFKAALDAG-AATFMNSFNELN 265

Query: 300 GVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN 358
           G+PA     L +K  + +WGFKG++ SD  ++A +  +  + K   ++A   L AG D++
Sbjct: 266 GIPATGNSYLQRKVLKGKWGFKGFVVSDWGSIAEMVNH-GFVKDKNEAARVALNAGSDMD 324

Query: 359 C-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
             G   + +    +  GKV    +D A+  + + +  LGLF+ DP +    K   + V +
Sbjct: 325 MEGRSYIEYLPQLVRSGKVSITLVDDAVRRVLTKKFELGLFD-DPYRFSDEKREKEIVNS 383

Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT---GIP- 473
             + ++A   A++ IVLLKN+ + LPL++NA  S+A+IGPL      M G +    GI  
Sbjct: 384 PRNHEVAKMMAQKSIVLLKNEGELLPLSRNA-KSIAVIGPLAKAKKDMNGFWALPWGIAD 442

Query: 474 -CSPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLT 529
              P SL  G++A V       YA GC +V   S AGF EA R+A+ AD V++  G    
Sbjct: 443 ESEPVSLFEGIQAKVGDKKNVWYAKGC-NVNDTSRAGFAEAARVAQNADVVVMAIGEAYN 501

Query: 530 QETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIG 589
              E + R ++ LPG Q  LV ++  T K  V+LV+ GG PL   F      + +IL+  
Sbjct: 502 MSGEAKSRANIHLPGVQEELVKAIQATGKPTVVLVM-GGRPL--IFNWTADHVPAILFTW 558

Query: 590 YPGEAGAKALAEIIFGDFNPGGRLPMT----------WYPESFTKVPMNDMNMRADSSRQ 639
           + G  G  A+A+++FGD+NP  +LP+T          +Y    T  P+ D+N  A  S  
Sbjct: 559 WLGVEGGNAMADVLFGDYNPSAKLPITFPRSEGQIPVYYASKNTGRPLVDVNDVAYKS-- 616

Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLD 699
               +Y     T  + FG+GLSYT++ Y            + LK  S K           
Sbjct: 617 ----AYIDELNTPRFAFGYGLSYTHFKY------------SDLKLSSKK----------- 649

Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
            ++  +  +C+        ++TN+G   G  VV L+ R    S   P K+L  F ++  +
Sbjct: 650 -INNTQTITCS-------FTITNSGKYTGEEVVQLYLRNRVASVTRPVKELKDFKKIKLM 701

Query: 760 AKGSKEISFGVD 771
              SK +SF +D
Sbjct: 702 PSESKTVSFVID 713


>gi|380692929|ref|ZP_09857788.1| glycoside hydrolase family protein [Bacteroides faecis MAJ27]
          Length = 777

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 216/746 (28%), Positives = 331/746 (44%), Gaps = 131/746 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+                T FP      +++N+ L + IG    V
Sbjct: 127 RLGIPV-DFTNEGIRGVCHE-----------KATFFPAQCGQGSTWNKELIARIGE---V 171

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA+    LG   +  ++P ++I +DPRWGR  E  GEDP +V     + ++S Q      
Sbjct: 172 EAKEAVALGYTNI--YSPILDIAQDPRWGRAVECYGEDPYLVGQLGKQMIQSLQKHK--- 226

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                        L A  KH   Y +   G       +  +  +
Sbjct: 227 -----------------------------LVATPKHFAVYSIPVGGRDGGTRTDPHVAPR 257

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
           +    +  PFR   ++  A  +M SYN  +G P  + G    L Q  R EWGFKGY+ SD
Sbjct: 258 EMRTLYLEPFRVAFQEAGALGVMSSYNDYDGEP--ITGSYRFLTQILRQEWGFKGYVVSD 315

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDID 382
            DAV  I         +E++    + AG+++          ++  +SAI KGKV +  ID
Sbjct: 316 SDAVEFISSKHKVADNNEEAVVQSVNAGLNVRTNFSSPAGFIKPLRSAIAKGKVSQATID 375

Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
           + +  +  V+  LGLF+ +P +G  GKL    V   EH+ +AL+AARQ IVLLKN    L
Sbjct: 376 QRVSEILYVKFWLGLFD-NPYRGD-GKLADKIVHCKEHQAVALEAARQSIVLLKNQDNLL 433

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCH--- 497
           PL K  + S+A+IGP  +   ++   Y       K++ +G++  +  +K  Y  GC    
Sbjct: 434 PLQKT-LKSVAVIGPNADEQKELICRYGPSNAPIKTVYKGIKEALPGAKVVYKKGCEIVD 492

Query: 498 -----------DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
                      D+         EA+  AK A+ VI+V G       E+R R SL LPG+Q
Sbjct: 493 PHFPESEVLPFDITPKEQQIMDEAIEAAKSAEVVIMVLGGSEVTVREERSRTSLDLPGRQ 552

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             L+ +V +  K P ILV+  G    +++A+    + +IL   +PGE   +A+AE IFGD
Sbjct: 553 EELLKAVCKLGK-PTILVMIDGRASSINYAK--KYVPAILHAWFPGEFCGQAVAETIFGD 609

Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
            NPGG+L +T +P+S  ++P        +DS             GT V G    FGHGLS
Sbjct: 610 NNPGGKLAVT-FPKSVGQIPFAFPFKPGSDSG-----------CGTSVTGALFPFGHGLS 657

Query: 662 YTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           YT + Y  L  +P +  +   +K                        SCT         V
Sbjct: 658 YTTFEYNNLKISPEQQGVLGEVK-----------------------VSCT---------V 685

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
            N G   G  VV L+ R    S  T  K L GF+R+       K+++F + P + L+I +
Sbjct: 686 KNTGKRPGDEVVQLYLRDEISSVTTYVKILRGFERITLQPNEEKKVTFTLSP-QDLAIWD 744

Query: 781 KHGRRILPLGNHVLMVGELRHSLTIE 806
           K+ +  +  G   +M+G     + +E
Sbjct: 745 KNMKFQVEPGTFKVMIGASSKDIRLE 770


>gi|423227459|ref|ZP_17213920.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392623089|gb|EIY17195.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 864

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 156/462 (33%), Positives = 241/462 (52%), Gaps = 58/462 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N  LS S RA  L+  +TL+EK+ Q+ + + AI RLGIPAY+WW+E+LHG+A  G  
Sbjct: 23  PYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHGVARAG-- 80

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAGLTF 164
                     T FPQ +  AA+F+          V+ EARA Y+         G  GLTF
Sbjct: 81  --------KATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERDGYKGLTF 132

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PNINI+RDPRWGRG ET GEDP + S   +  VK  QG       GG G         
Sbjct: 133 WTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG-------GGTG--------- 176

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
                + D+       AC KH   +   +W   +R+SF+A  I+++D  +T+ P F++ +
Sbjct: 177 -----KYDKA-----HACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLV 223

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++GK   +MC+YN+  G P C    L  +  R +WG+   + SDC A+   F Y N+ +T
Sbjct: 224 KEGKVKEVMCAYNRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGD-FYYPNHHET 282

Query: 343 H---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H     ++A  + +G D+ CG         A+ KG + E+ I+ ++  L   + +LG+F+
Sbjct: 283 HPTAAAASADAVVSGTDLECGGSY-SSLNEAVRKGLISEEKINESVFRLLRARFQLGMFD 341

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            D     + ++    V + EH   AL+ AR+ +VLL N    LPL+K ++  +A++GP  
Sbjct: 342 DDALVS-WSEIPYSVVESKEHVAKALEMARKSMVLLTNKNHTLPLSK-SIRKVAVLGPNA 399

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSK--THYASGCHDV 499
           N+   +   Y G P    ++L G+++ + +   +Y  GC  V
Sbjct: 400 NDSVMLWANYNGFPTKSVTILEGIKSKLPEGTVYYEKGCDYV 441



 Score =  117 bits (292), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 144/318 (45%), Gaps = 59/318 (18%)

Query: 493 ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLL 542
           AS   D+    +  + E    A +AD +I V GL  T E E+           DR ++ L
Sbjct: 576 ASLKFDIGIKKEINYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDL 635

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
           P  Q  ++ ++ +T K PVI VL  G  L + + EA++ + +IL   YPG+ G  A+A++
Sbjct: 636 PHVQAEMLKALKKTGK-PVIFVLCSGSTLALPW-EAEN-LDAILEAWYPGQQGGTAVADV 692

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NP GRLP+T+Y  S       D +M          R+YR++ G  ++ FGHGLSY
Sbjct: 693 LFGDYNPAGRLPLTFYASSDDLPDFEDYDMS--------NRTYRYFKGKALFPFGHGLSY 744

Query: 663 TNYSYKFLSAPSELTISASLKAGSDK-NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           T + Y               KA  DK N+    G  L                   I + 
Sbjct: 745 TIFDYG--------------KAKVDKQNVRAGEGMTL------------------TIPLK 772

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G +DG  V+ ++ R P   +G P K L  F RV   A  ++ I   + P       N 
Sbjct: 773 NTGKLDGDEVIQVYLRNPADKEG-PIKTLRAFRRVSLPAGQTENIRIEL-PASTFECFNP 830

Query: 782 HGRR--ILPLGNHVLMVG 797
              R  ILP G + L+ G
Sbjct: 831 STNRMEILP-GKYELLYG 847


>gi|270294308|ref|ZP_06200510.1| periplasmic beta-glucosidase [Bacteroides sp. D20]
 gi|270275775|gb|EFA21635.1| periplasmic beta-glucosidase [Bacteroides sp. D20]
          Length = 862

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 236/824 (28%), Positives = 354/824 (42%), Gaps = 170/824 (20%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLS-------------------DN 84
           C P + DS      S+S+  R   L+  +TL+EKI QLS                   D 
Sbjct: 22  CIPVYKDS------SMSVERRVTDLLGRMTLREKIAQLSHLHGYQLYNGQEVDYQKLRDA 75

Query: 85  ASAIP-----------------------------RLGIPAYEWWSESLHGIASNGPGVNF 115
           A  I                              RLGIP +   +ESLHG   +G     
Sbjct: 76  AGDISYGCIEGFNLTGENVRKAFHAIQKYMVEETRLGIPVFTV-TESLHGSVHDGS---- 130

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                  T FPQ +   ++FN  L   +  A+A E R+     Q  +   +P +++ RD 
Sbjct: 131 -------TIFPQSVAVGSTFNLDLAYQMTKAIATELRS-----QGVIQTLSPGLDVVRDL 178

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +E+ GEDP +V    +  VK +        DGGI        +LK FG       
Sbjct: 179 RWGRVEESFGEDPWLVGQMGIAQVKGYI-------DGGI------SPMLKPFGPGGAPLG 225

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
            L L++                          E+D  +    P+   +   +   +M SY
Sbjct: 226 GLNLASVES----------------------GERDIRNIHIKPYEMAVRNTEVKAVMTSY 263

Query: 296 NQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N  NG+P      L     RNEWGFKGY+ SD  AVA + ++Q+  K   ++A   L AG
Sbjct: 264 NSWNGIPNSASSYLLTNILRNEWGFKGYVYSDWGAVAMLKDFQHTAKDDSEAAIQALTAG 323

Query: 355 MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
           +D+   +      +  I++G+  EK +D A+  +  V+  LGLF  +P +G    +    
Sbjct: 324 VDLEASSNCYWALEQLIEQGRFDEKYVDLAVGRILRVKFELGLFE-NPYQG--ADMPGVA 380

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
           + T E  +L+   A + IVLLKN+   LPLN N + SLA+IGP  N +    G YT    
Sbjct: 381 MRTKEAVELSRRVADESIVLLKNENTLLPLNLNKIKSLAVIGPNANQVQ--FGDYTWSRS 438

Query: 475 SPKSL--LRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG---- 525
           +   +  L GL+  V    K +YA+GC D+  ++ +GF EAV   K +D  +V  G    
Sbjct: 439 NKDGVTPLEGLKKRVGNKIKINYAAGC-DLITDNKSGFDEAVAAVKASDMAVVFVGSSSA 497

Query: 526 ---LDLTQET--EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADS 580
               D +  T  E  D  SL L G Q  LV  +    K PVI+VL  G P  +S+ +   
Sbjct: 498 SLARDYSDATCGEGFDLSSLDLTGVQEELVEEIYAIGK-PVIVVLVTGKPFSISWIK--E 554

Query: 581 QISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMR 633
            I +I+   Y GE    A+A+++ G+ NP  +LP + +P+S       +  +P +    R
Sbjct: 555 HIPAIVVQWYGGEKAGDAIADMLLGNINPSAKLPFS-FPQSVGHLPVFYNHLPTDKGFYR 613

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
                  PGR Y F +   ++ FGHGLSYT + Y  L+A      SA L   SD  I   
Sbjct: 614 RPGRPNEPGRDYVFSSPAPLWSFGHGLSYTTFEY--LNA----HYSAELLHPSDTLI--- 664

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                                 V +S+ N G V G  VV L+ R    S  TP KQL  F
Sbjct: 665 ----------------------VSVSLKNTGSVAGKEVVQLYVRDVVSSVVTPVKQLKAF 702

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            +   +  G  +      P ++L++ +   ++++  G + + +G
Sbjct: 703 SKPF-LQPGEMQTVVLKLPIQELALYDLSMKKVVEEGEYEIQIG 745


>gi|299144988|ref|ZP_07038056.1| xylosidase [Bacteroides sp. 3_1_23]
 gi|298515479|gb|EFI39360.1| xylosidase [Bacteroides sp. 3_1_23]
          Length = 800

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 209/735 (28%), Positives = 340/735 (46%), Gaps = 127/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAD 198

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E+ GEDP +V     + +   Q E   
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQNEG-- 250

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGY+ SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
           +AV  +      T T E+ AA V+ AG++I          +   + AI++GKV    +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAINEGKVSLHTLDQ 400

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
            +  +  V+  +GLF+    GD R+       P+ V  ++ HK +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PEAVVHNDAHKAVSMKAALESIVLLKNE 453

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
            + LPL+KN  S +A+IGP    + ++   Y     S K++ +G++ Y+  S+  Y  GC
Sbjct: 454 NQMLPLSKN-FSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYVKGC 512

Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       ++VP ++   A  HEAV +AK +D  I+V G +     E+  R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFSRTNLDL 572

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PV+LV+  G    +++  A+  + +I+   +PGE    A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NPGGRL +T +P+S  ++P          S+       +      +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYGLSY 682

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L IS  +  G  +NI                 SCT         V N
Sbjct: 683 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T +K L GF+R+H      + ++F + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKN 770

Query: 783 GRRILPLGNHVLMVG 797
            R  +  G+  +MVG
Sbjct: 771 NRFTVEPGSFSVMVG 785


>gi|270296098|ref|ZP_06202298.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
 gi|270273502|gb|EFA19364.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
          Length = 798

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 213/736 (28%), Positives = 335/736 (45%), Gaps = 129/736 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L + IG A   
Sbjct: 148 RLGIPV-DFTNEGIRGLCHD-----------RATYFPAQCGQGATWNKELIAQIGEA--- 192

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EAR    LG   +  ++P ++I +DPRWGR  ET GEDP        + + S Q      
Sbjct: 193 EAREASVLGYTNI--YSPILDIAQDPRWGRCVETYGEDPYHAGQMGKQMILSLQKNK--- 247

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                        L +  KH   Y +   G   +   +  +  +
Sbjct: 248 -----------------------------LVSTPKHFAVYSIPVGGRDGKTRTDPHVAPR 278

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITSD 326
           +    +  PFR    +  A  +M SYN  +G P  + G    L +  R EWGFKGY+ SD
Sbjct: 279 EMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRQEWGFKGYVVSD 336

Query: 327 CDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDID 382
            +AV  I         +ED+ A  + AG++I          +   +SA+ KGK+ ++ ++
Sbjct: 337 SEAVEFISTKHQVANGYEDAVAQAVNAGLNIRTHFTPPADFILPLRSAVKKGKISQETLN 396

Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
           + +  +  V+  LGLF+ +P +G   + G   V + EH++LAL+AARQ +VLLKN+ + L
Sbjct: 397 QRVAEILRVKFWLGLFD-NPYRGDEKRAG-QIVHSPEHQQLALEAARQSLVLLKNEHQTL 454

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCH--- 497
           PL+K ++ S+A+IGP  +   Q+   Y        ++  G++  + +    Y  GC    
Sbjct: 455 PLSK-SIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVVYKKGCDIID 513

Query: 498 ---------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
                    + P  +       EA+  AK A+  ++V G +     EDR R SL LPG+Q
Sbjct: 514 PHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVREDRSRTSLDLPGRQ 573

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             L+  + +  K PV+LV+  G    ++FA   + + +I+   +PGE G +A+AE +FGD
Sbjct: 574 KELLKKICQLGK-PVVLVMIDGRASSINFAA--THVPAIIHAWFPGEFGGQAIAEALFGD 630

Query: 607 FNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYG----FGHGLS 661
           +NPGGRL +T +P+S  ++P        +D S +           T VYG    FGHGLS
Sbjct: 631 YNPGGRLAVT-FPKSVGQIPFAFPFKPGSDESSE-----------TSVYGALYPFGHGLS 678

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT + Y      S+L IS S K G   NI                 SCT         + 
Sbjct: 679 YTTFQY------SDLAISPS-KQGVQGNI---------------SISCT---------IK 707

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G  +G  VV L+ R    S  T  + L GF+R+    + S  + F + P ++L I +K
Sbjct: 708 NIGQREGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTP-QELGIWDK 766

Query: 782 HGRRILPLGNHVLMVG 797
                +  G   +M+G
Sbjct: 767 QMNFTVEPGMFKVMIG 782


>gi|298482587|ref|ZP_07000772.1| xylosidase [Bacteroides sp. D22]
 gi|336405443|ref|ZP_08586122.1| hypothetical protein HMPREF0127_03435 [Bacteroides sp. 1_1_30]
 gi|295085727|emb|CBK67250.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
           XB1A]
 gi|298271294|gb|EFI12870.1| xylosidase [Bacteroides sp. D22]
 gi|335938024|gb|EGM99918.1| hypothetical protein HMPREF0127_03435 [Bacteroides sp. 1_1_30]
          Length = 800

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 208/735 (28%), Positives = 338/735 (45%), Gaps = 127/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAN 198

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E+ GEDP +      + +   Q E   
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLAGELGKQMILGLQSEG-- 250

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGY+ SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
           +AV  +      T T E+ AA V+ AG++I          +   + AID+GKV    +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAIDEGKVSLHTLDQ 400

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
            +  +  V+  +GLF+    GD R+       P+ V  ++ HK +++ AA + +VLLKN 
Sbjct: 401 RVSEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESVVLLKNK 453

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
            + LPL+KN    +A+IGP    + ++   Y     S K++ +G++ Y+  S+  YA GC
Sbjct: 454 NQMLPLSKN-FKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAKGC 512

Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       ++VP ++   A  HEAV +AK +D  I+V G +     E+  R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 572

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PV+LV+  G    +++  A+  + +I+   +PGE    A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NPGGRL +T +P+S  ++P          S+       +      +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYGLSY 682

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L IS  +  G  +NI                 SCT         V N
Sbjct: 683 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T +K L GF+R+H      + +SF + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QDLGLWDKN 770

Query: 783 GRRILPLGNHVLMVG 797
            +  +  G+  +MVG
Sbjct: 771 NQFTVEPGSFSVMVG 785


>gi|313204104|ref|YP_004042761.1| beta-glucosidase [Paludibacter propionicigenes WB4]
 gi|312443420|gb|ADQ79776.1| Beta-glucosidase [Paludibacter propionicigenes WB4]
          Length = 871

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 158/459 (34%), Positives = 233/459 (50%), Gaps = 57/459 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N +LS   RAK LIS LTL EK   + D + A+PRLGI  + WWSE+LHG A+NG  
Sbjct: 31  PYKNPNLSSEERAKDLISRLTLSEKATLMCDISDAVPRLGIKKFNWWSEALHGFANNG-- 88

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GLT 163
                   +VT FP+ +  AASF+ +L   I  AV+ EARA YN              L+
Sbjct: 89  --------NVTVFPEPVGMAASFDDNLVYKIFDAVSDEARAKYNEATKKGLENARFLSLS 140

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PN+NIFRDPRWGRGQET GEDP + S   V  VK  QG             +R+   
Sbjct: 141 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGP--------ADAKYRK--- 189

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          L AC KH   +   +W   SR+S N   +  +D  +T+ P F+S 
Sbjct: 190 ---------------LLACAKHFAVHSGPEW---SRHSLNINNLDPRDLWETYLPAFKSL 231

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++     +MC+Y +++  P C    L Q+  R+EWG+K  + +DC AV+  +     + 
Sbjct: 232 VQKSDVRQVMCAYQRLDDEPCCGNTQLLQRILRDEWGYKYMVVADCGAVSDFYTSHKVSS 291

Query: 342 THEDSAAGVLKAGMDINC--GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
               +A+  + AG D+ C     + +    A+ KG + E +I++ LLN+   +  LG  +
Sbjct: 292 DAVHAASKGVWAGTDVECQWDNHIYKQLPDAVAKGLITEAEINKHLLNVLIGRFDLGEMD 351

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            D     + K+    V   EH+KLALD A + + LL+N    LPL+K+    +A++GP  
Sbjct: 352 DDALV-PWSKIPMSVVNNEEHRKLALDMALKSMTLLQNKNNILPLSKS--KKIAVVGPNA 408

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVS--KTHYASGC 496
           N+   + G Y G P    ++L G+ + +S  K  Y  GC
Sbjct: 409 NDKPMLWGNYNGTPVRTITILDGITSKLSANKVLYEKGC 447



 Score = 79.3 bits (194), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 61/227 (26%), Positives = 100/227 (44%), Gaps = 48/227 (21%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q + + ++    K+ + +  +G     ++         +IL   Y GE+G
Sbjct: 638 DRTDIELPAVQRNCLKALKEAGKQVIFVNCSGSA---IALIPETESCDAILQAWYGGESG 694

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
            +A+A+++FGD+NP G+LP+T+Y          D +M+        GR+YR Y    ++ 
Sbjct: 695 GQAVADVLFGDYNPSGKLPITFYKSVKQLSDFEDYSMK--------GRTYR-YMSDPLFP 745

Query: 656 FGHGLSYTNYSY--KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           FG GLSYT ++     LS P                               E+ S  S+ 
Sbjct: 746 FGFGLSYTTFAIGNAKLSNP-------------------------------EIKSNQSV- 773

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
             + I V+N G   G+ +V ++ R+   + G P K L GF RV   A
Sbjct: 774 -ELTIPVSNKGKRSGTEIVQVYVRMVNDTDG-PLKTLKGFQRVEVAA 818


>gi|296081550|emb|CBI20073.3| unnamed protein product [Vitis vinifera]
          Length = 309

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 124/270 (45%), Positives = 168/270 (62%), Gaps = 38/270 (14%)

Query: 146 AVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGE 205
            V+ EARAMYN+G AGLTFW+PN+NIF+DPRWGRGQETPGEDP++ S YA  +V+  Q  
Sbjct: 78  VVSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASGYVRGLQ-- 135

Query: 206 NWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM 265
             +SDDG                      D L ++ACCKH  AYDL+ W     + FNA+
Sbjct: 136 --QSDDG--------------------SPDRLKVAACCKHYTAYDLDNWKGVDCFHFNAV 173

Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITS 325
           +T QD +DTFQPPF+SC+  G  + +MCSYNQ           L  K   +     Y+ S
Sbjct: 174 VTNQDMDDTFQPPFKSCVIDGNVASVMCSYNQFY---------LIYKYHLQ-----YLVS 219

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRAL 385
           DCD+V   +  Q+YTKT E++ A  + AG+D+NCG+ + +HT++A+  G V E  +D+A+
Sbjct: 220 DCDSVDVFYNSQHYTKTPEEAVAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAV 279

Query: 386 LNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
            N F+  +RLG F+G+P K  YGKLGP DV
Sbjct: 280 SNNFATLMRLGFFDGNPSKAIYGKLGPKDV 309


>gi|110740481|dbj|BAF02134.1| xylosidase [Arabidopsis thaliana]
          Length = 284

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 127/287 (44%), Positives = 184/287 (64%), Gaps = 12/287 (4%)

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           GLD + E E RDR  LLLPG Q  LVT VA+ S+ PVILVL  GGP+DV+FA+ D ++++
Sbjct: 2   GLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLMSGGPIDVTFAKNDPRVAA 61

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPMNDMNMRADSSRQYPGR 643
           I+W GYPG+AG  A+A IIFG  NPGG+LPMTWYP+ +  KVPM  M MRA  S  YPGR
Sbjct: 62  IIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAKVPMTVMAMRA--SGNYPGR 119

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAP-SELTISASLKAGSDKNILQQTGSRLDYVH 702
           +YRFY G  V+ FG GLSYT +++    +P ++L++S S    S   IL  +   +   H
Sbjct: 120 TYRFYKGPVVFPFGFGLSYTTFTHSLAKSPLAQLSVSLS-NLNSANTILNSSSHSIKVSH 178

Query: 703 IDEVTSCTSL-RFHVQISVTNAGDVDGSHVVMLFARVP--KVSQGTPEKQLIGFDRVHTV 759
               T+C S  +  + + V+N G+ DG+H V +FA  P   +      KQLI F++VH +
Sbjct: 179 ----TNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPINGIKGLGVNKQLIAFEKVHVM 234

Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
           A   + +   VD C+ L + +++G+R +P+G H L +G+L+H++ ++
Sbjct: 235 AGAKQTVQVDVDACKHLGVVDEYGKRRIPMGEHKLHIGDLKHTILVQ 281


>gi|410634080|ref|ZP_11344720.1| beta-glucosidase [Glaciecola arctica BSs20135]
 gi|410146740|dbj|GAC21587.1| beta-glucosidase [Glaciecola arctica BSs20135]
          Length = 772

 Score =  253 bits (645), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 181/622 (29%), Positives = 310/622 (49%), Gaps = 85/622 (13%)

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           +AP +++ RDPRWGR  E  GED  + +A A   V+ FQG+                   
Sbjct: 176 FAPMVDVARDPRWGRISEGSGEDVYLTTAIARARVQGFQGD------------------- 216

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                  D      + A  KH  AY     G   R      +++++  DT+ PPF++ ++
Sbjct: 217 -------DLSQPHTILATAKHFAAY---GQGQAGRDYHTTDMSDRELRDTYLPPFKAAVD 266

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
            G  +  M S+N++NGVPA     L     R+EW F+G++ +D  ++  + ++  + + +
Sbjct: 267 AG-VTSFMTSFNELNGVPASANKYLLTDILRDEWSFEGFVVTDYTSINEMVKH-GFARDN 324

Query: 344 EDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
           + +    +KAG+D++  G+    +  + + +GKV  + ID A   +  ++ RLGLF  DP
Sbjct: 325 DHAGELAVKAGVDMDMQGSVYFDYLANQVTQGKVSPQQIDNAARRILEMKYRLGLFE-DP 383

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
            +    +    ++    + + A D AR+ +VLLKN+ + LPL+K+ ++ +A+IGPL ++ 
Sbjct: 384 YRYSNEEREAQEIYKEYNLQAAQDVARKSMVLLKNENQQLPLSKSDLT-IAVIGPLADSK 442

Query: 463 SQMGGGYTGIPC---SPKSLLRGLEAYV---SKTHYASGC-HDVPCNSDAGFHEAVRIAK 515
             + G ++        P +LL G++A V   SK  YA G  ++     ++GF  A+ IAK
Sbjct: 443 EDLIGSWSAAGDRYEKPITLLTGIKAKVADPSKVLYAKGASYEFSHQDNSGFEAAIAIAK 502

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           KAD +++  G       E   R SL  PG Q++L+  + + +K P++LVL  G P+ + +
Sbjct: 503 KADVIVLAMGEKWDMTGEATSRTSLDFPGNQLALMQQLKKLAK-PMVLVLMNGRPMTIEW 561

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR-- 633
           A  D  + +IL   YPG  G  A+A+++FGD+NP G+LP+T +P +  ++P+   NM+  
Sbjct: 562 A--DQNVDAILEAWYPGTMGGPAIADVLFGDYNPSGKLPVT-FPRNVGQIPLY-YNMKNT 617

Query: 634 -----ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDK 688
                 D++ Q     Y     T +Y FGHGLSYT + Y      S+++++ ++    +K
Sbjct: 618 GRPYSKDNAEQKYVSRYIDSLNTPLYHFGHGLSYTTFDY------SKISLNKAVITAKEK 671

Query: 689 NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEK 748
                                        I VTN+G+ DG  VV L+ R    S   P K
Sbjct: 672 -------------------------LTASIDVTNSGNYDGEEVVQLYIRDRIGSVTRPVK 706

Query: 749 QLIGFDRVHTVAKGSKEISFGV 770
           QL GF ++      +K +SF +
Sbjct: 707 QLKGFKKIFLHKGETKTVSFSI 728


>gi|160884749|ref|ZP_02065752.1| hypothetical protein BACOVA_02738 [Bacteroides ovatus ATCC 8483]
 gi|156109784|gb|EDO11529.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 800

 Score =  253 bits (645), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 211/735 (28%), Positives = 340/735 (46%), Gaps = 127/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIGEIAKVTAD 198

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E+ GEDP +V     + +   Q E   
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQNEG-- 250

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGYI SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYIVSDS 340

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
           +AV  +      T T E+ AA V+ AG++I          +   + AI++GKV    +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAINEGKVSLHTLDQ 400

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
            +  +  V+  +GLF+    GD R+       P+ V  ++ HK +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESIVLLKNE 453

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
            + LPL+KN  S +A+IGP    + ++   Y     S K++ +G++ Y+  S+  YA GC
Sbjct: 454 NQMLPLSKN-FSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAKGC 512

Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       ++VP ++   A   EAV +AK +D  I+V G +     E+  R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNLDL 572

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PVILV+  G    +++  A+  I +I+   +PGE    A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVILVMVDGRAATINW--ANKYIPAIIHAWFPGEFMGDAIAKV 629

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NPGGRL +T +P+S  ++P          S+       +      +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYGLSY 682

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L IS  +  G  +NI                 SCT         V N
Sbjct: 683 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T +K L GF+R+H      + ++F + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKN 770

Query: 783 GRRILPLGNHVLMVG 797
            +  +  G+  +MVG
Sbjct: 771 NQFTVEPGSFSVMVG 785


>gi|255532174|ref|YP_003092546.1| glycoside hydrolase family protein [Pedobacter heparinus DSM 2366]
 gi|255345158|gb|ACU04484.1| glycoside hydrolase family 3 domain protein [Pedobacter heparinus
           DSM 2366]
          Length = 799

 Score =  253 bits (645), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 195/735 (26%), Positives = 337/735 (45%), Gaps = 124/735 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ +              T FP  L    ++NR+L   +G     
Sbjct: 147 RLGIPV-DFTNEGIRGVEA-----------YEATGFPTQLNMGMTWNRNLIRKMGRITGQ 194

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EARA+      G T  +AP +++ RD RWGR +E  GEDP +V+   VE     Q  N  
Sbjct: 195 EARAL------GYTNVYAPILDVARDQRWGRLEEVYGEDPYLVARLGVEMTLGMQENN-- 246

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         +++  KH   Y   K         +  ++ 
Sbjct: 247 -----------------------------QIASTAKHFAVYSANKGAREGLARTDPQVSP 277

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
           ++ ED    PF+  I++     +M SYN  NG+P  + G    L Q+ R ++GF GY+ S
Sbjct: 278 REVEDIMLYPFKKVIQEAGIMGVMSSYNDYNGIP--ITGSEYWLTQRLRKDFGFGGYVVS 335

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DA+  ++   +     +++      AG+++         ++ + +  +++G++  + I
Sbjct: 336 DSDALEYLYNKHHVAANLKEAVFQAFMAGLNVRTTFRPPDSIIIYARQLVNEGRIPIETI 395

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+ +LGLF  D    K        V +  H+ +AL A+++ IVLLKN+ + 
Sbjct: 396 NSRVKDVLRVKFKLGLF--DQPYVKDAAASEKLVNSIAHQAVALQASKESIVLLKNNNQI 453

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGCHDV 499
           LPL++ ++  +A+IGP   +       Y  +     ++L G+   +   K  YA GC  V
Sbjct: 454 LPLSR-SLKKIAVIGPNAADNDYAHTHYGPLQSKSTNILEGIRNKIGADKVWYAKGCELV 512

Query: 500 ----------PCNSDAG----FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
                     P + DA       +AV  A KAD  IVV G +     E++ R +L LPG 
Sbjct: 513 DKNWPESEIFPEDPDATAIALIEDAVNTAMKADVAIVVLGGNTKTAGENKSRTTLELPGF 572

Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
           Q++L+ ++ +T K PV+ V+ G  P+ +++   D  I  I++ GYPG  G  A+A+++FG
Sbjct: 573 QLNLIKAIQKTGK-PVVAVMIGTQPMGINWI--DKYIDGIVYAGYPGVKGGIAVADVLFG 629

Query: 606 DFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           D+NPGG+L +T +P+S  ++P+N     N + D       +         +Y FG GLSY
Sbjct: 630 DYNPGGKLTLT-FPKSVGQLPLNFPSKPNAQTDEGELAKIKGL-------LYPFGFGLSY 681

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T ++Y      S L IS  ++   D NI                         + + +TN
Sbjct: 682 TTFAY------SNLKISP-IEQEKDGNI------------------------SISVDITN 710

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
              ++G  +V L+ R    +  T EK L GF+R+      +K + F + P + L + N+ 
Sbjct: 711 TAKLEGDEIVQLYIRDVLSTVTTYEKILRGFERISLKPNETKTLKFTLFP-DDLKLWNRE 769

Query: 783 GRRILPLGNHVLMVG 797
            + ++  G   +M+G
Sbjct: 770 MQHVIEPGTFKVMIG 784


>gi|423212159|ref|ZP_17198688.1| hypothetical protein HMPREF1074_00220 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392695047|gb|EIY88272.1| hypothetical protein HMPREF1074_00220 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 768

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 240/840 (28%), Positives = 376/840 (44%), Gaps = 150/840 (17%)

Query: 12  ERILFRVLQLIVVVNVIAFSNSKPVLNK--PDFPCKP-----------PHFD----SYPF 54
           +R L  +L +I V  ++A S+ +  +NK   +   K            P+ D    + P 
Sbjct: 11  KRFLVLILAMIPVCGMMAQSSQESKMNKFVDNLMSKMTIEEKIGQLNMPNIDGAVITGPV 70

Query: 55  CNTSLSISTRAKSLISLLTLQE--KIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
            NT+     R   + +LL LQ+  KI+++   A    R  IP   +  + +HG       
Sbjct: 71  KNTNTGERLRKNEVGALLNLQDVKKIREVQQLALEHSRFKIPLL-FGMDVIHGYK----- 124

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINI 171
                     T FP  L  AA++N  +        AVEA A       G+ + ++P +++
Sbjct: 125 ----------TIFPIPLAQAATWNMDMVEEAARISAVEASA------NGICWTFSPMVDV 168

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            RD RWGR  E  GEDP +    A  +V+ +QG                           
Sbjct: 169 TRDARWGRIAEGAGEDPWLGGEIAKAYVRGYQG--------------------------- 201

Query: 232 DRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASC 290
           D      + AC KH   Y     G      +N + ++ Q   + +  P+ + +E G  S 
Sbjct: 202 DLSANTNILACVKHFALYGAPDAGK----DYNTVDMSRQRMFNEYMLPYLAAVEAGVGSA 257

Query: 291 IMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
            M ++N+V G+PA   R  L    R +WGF G++ SD DAV  + E+       ++ +A 
Sbjct: 258 -MSAFNEVEGIPAAANRWLLTDLLRGQWGFDGFVVSDWDAVRELTEHG--IGNMQEVSAR 314

Query: 350 VLKAGMDINCGTCMLRHT-QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
            L AG+D++  +  L  T + + D+GKV E+ I+ A   +   + +LGLF  DP K    
Sbjct: 315 ALIAGLDMDMASEGLVSTLKKSFDEGKVTEEQINTACRRILVAKYKLGLFK-DPFKYCNE 373

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
           +    D+ T EH++ A + A +  VLLKN ++ LPL K+    +A+IGPL +N   M G 
Sbjct: 374 ERAKKDLFTPEHRRAAREMAAESFVLLKNKERLLPLQKSG--KIAVIGPLADNRRNMLGT 431

Query: 469 YT--GIPCSPKSLLRGLEAYV---SKTHYASG---CHDVPCNSDA-----GF-------- 507
           +T      +P S+L G+   V   ++ +YA G    +D      A     GF        
Sbjct: 432 WTVSADLDTPVSVLEGIREAVGDKAEINYARGSNLTYDEELEKRASHWGKGFPRDGRTDS 491

Query: 508 ---HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
               EA+ IA ++D +I V G       E   RV L +P  Q  L+  +  T K PV+LV
Sbjct: 492 QLQREALDIANQSDIIIAVMGEAAEMSGESTSRVDLNIPDAQKDLLKKLVDTGK-PVVLV 550

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
           L  G PL + + E +  + +IL + +PG     A+A+++FGD NP G+L  T +P S  +
Sbjct: 551 LFAGRPLTLVWEEEN--VPAILNVWFPGTEAGNAVADVLFGDVNPSGKLTAT-FPRSVGQ 607

Query: 625 VPMN---DMNMRADSSRQYPGRSYRF----YTGTQVYGFGHGLSYTNYSYKFLSAPSELT 677
           VP++       RA  S++ P   YR      T   +Y FG+GLSYT + Y  LS      
Sbjct: 608 VPISYSYKHTGRA-PSKEKPSEKYRTGYIDETYEPLYPFGYGLSYTQFEYGELSL----- 661

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
                    DK ++  T          E  + +       I+VTN G VDG  +V L+ R
Sbjct: 662 ---------DKEVINNT----------EFLTAS-------ITVTNKGTVDGKEIVQLYLR 695

Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
               S   P K+L G+++++  A  SK + F + P E L   N +   +   G   +MVG
Sbjct: 696 DVVGSVTRPLKELKGYEKIYLKAGESKTVRFRITP-EMLKFYNYNIEYVYEPGEFDVMVG 754


>gi|224535195|ref|ZP_03675734.1| hypothetical protein BACCELL_00056 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224523186|gb|EEF92291.1| hypothetical protein BACCELL_00056 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 733

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 210/742 (28%), Positives = 351/742 (47%), Gaps = 108/742 (14%)

Query: 67  SLISLLT---LQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           SLI L T   L+ +IQ+ +   S   RLGIP   +  + +HG+                T
Sbjct: 79  SLIYLHTDPKLRNRIQRKAMEES---RLGIPIL-FGFDVIHGLR---------------T 119

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQET 183
            +P  L  A SFN  L +      A E+     L     TF +P I++ RDPRWGR  E 
Sbjct: 120 VYPISLAQACSFNPDLVTQACGMAAKESV----LSGIDWTF-SPMIDVARDPRWGRISEC 174

Query: 184 PGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACC 243
            GEDP + + + V  VK +QGE                             D   ++AC 
Sbjct: 175 YGEDPYLNTVFGVASVKGYQGEKL--------------------------SDPYSIAACL 208

Query: 244 KHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA 303
           KH + Y + + G   RY+    I+ Q   +T+ PP+ +C++ G A+ +M S+N ++GVPA
Sbjct: 209 KHYVGYGVSEGGRDYRYT---DISPQALWETYLPPYEACVKAG-AATLMSSFNDISGVPA 264

Query: 304 CLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GT 361
                +  +  +N+W   G++ SD +A+  +  YQ   K  +++A     AG++++    
Sbjct: 265 TSNHYILTEILKNKWRHDGFVVSDWNAIEQLI-YQGVAKNRKEAAYKAFHAGVEMDMRDN 323

Query: 362 CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHK 421
               + +  + + K++   ID A+  +  V+ RLGLF+ +P   +  +   +     E  
Sbjct: 324 VYYEYLEQLVAEKKIEISQIDDAVARILRVKFRLGLFD-EPYTKELTE--QERYLQKEDI 380

Query: 422 KLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG--YTGIPCSPKSL 479
            LA   A + +VLLKN+K  LPL+ + V  +A+IGP+V + S + G   + G     +++
Sbjct: 381 ALAARLAEESMVLLKNEKNLLPLS-STVKRVALIGPMVKDRSDLLGAWAFKGQAEDVETI 439

Query: 480 LRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRD 536
             G++       +  Y  GC  +  N ++GF  A++ A+ +D V+V  G       E+  
Sbjct: 440 YEGMQKEFGDKVRLDYEQGCA-LDGNDESGFSAALKTAEASDVVVVCLGESKQWSGENAS 498

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
           R ++ LP  Q  L+  + + +K P++LVL+ G PL++     + Q+ +I+ +  PG AG 
Sbjct: 499 RSTIALPDIQEKLLLHLKQANK-PIVLVLSSGRPLEL--IRLEPQVEAIIEMWQPGVAGG 555

Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGR-SYRFYTGTQVYG 655
             LA I+ G  NP G+L +T +P S  ++P+   NMR  S+R +     Y+      +Y 
Sbjct: 556 TPLAGILSGRVNPSGKLSVT-FPLSTGQIPVY-YNMR-QSARPFDAMGDYQDIPTEPLYP 612

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++Y      S+  +S SLK   ++ I                         
Sbjct: 613 FGYGLSYTTFTY------SDAKLS-SLKIKKNQKI------------------------T 641

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
            +++VTNAG V+G   V+ +   P  S   P K+L  F++       S+   F +DP   
Sbjct: 642 AEVTVTNAGKVEGKETVLWYVSDPFCSISRPMKELKFFEKQSLKVGESRVFRFEIDPMRD 701

Query: 776 LSIANKHGRRILPLGNHVLMVG 797
           LS  +  G+R L  G  ++ VG
Sbjct: 702 LSYTDATGKRFLEAGEFIVSVG 723


>gi|224025503|ref|ZP_03643869.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
           18228]
 gi|224018739|gb|EEF76737.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
           18228]
          Length = 787

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 217/747 (29%), Positives = 334/747 (44%), Gaps = 138/747 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  E  HG  + G            T FP  +  A+++N SL   +G  + +
Sbjct: 125 RLGIPVL-FAEECPHGHMAIG-----------ATVFPTSMGQASTWNESLIRQMGEVIGL 172

Query: 150 EARAM-YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR    N+G      + P ++I R+PRW R +ET GEDP +       FV+  QG+++K
Sbjct: 173 EARLQGANIG------YGPVLDIAREPRWSRVEETFGEDPYLTGILGTAFVQGMQGKDFK 226

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNF---SRYSFNAM 265
                                     D   + +  KHL AY + + G+    +     A+
Sbjct: 227 --------------------------DGRHVYSTLKHLAAYGVPRGGHNGGPADMGLRAL 260

Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYIT 324
           +      D + P F+  +E GKA+ +M SYN ++GVP      L     R  WGF G++ 
Sbjct: 261 L------DEYLPGFQRAVEVGKAATVMTSYNSIDGVPCTSNKFLIDSLLRKRWGFDGFVY 314

Query: 325 SDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRA 384
           SD  ++  I    +     ED+A   ++AG D++ G    R    A+  GKV+E  I+RA
Sbjct: 315 SDLASIDGIAG-AHVAANLEDAAIQAVEAGTDMDLGANAYRRLVKAVQTGKVKESAINRA 373

Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDD----VCTSEHKKLALDAARQGIVLLKNDKK 440
           + N+  ++ R+GLF       +   + P++    V   +H+ LA   AR+G VLLKN+  
Sbjct: 374 VSNVLRLKFRMGLF-------EQPYVSPEEAARLVNCEDHRMLARKIAREGTVLLKNN-G 425

Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVS--KTHYASGC 496
            LPL K  V  +A+IGP  + +    G YT      K  +LL  L   +   +  Y  GC
Sbjct: 426 ILPLGK--VKRIAVIGPNADVMYNYLGDYTAPQERSKVVTLLDALRNRMPDVRIDYVKGC 483

Query: 497 HDVPCNSDAGFHEAVRIAKKADFVIVVAG--------------------------LDLTQ 530
             +   + +   EAV  A+KAD VI+  G                          L   +
Sbjct: 484 A-IRDTTQSNIKEAVEAARKADLVILAVGGSSARDFKTKYINTGAATVDSENSGILSDME 542

Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
             E  DR +L L G Q  L+ ++A T K P++ V   G PL+++ A   S      W  Y
Sbjct: 543 CGEGFDRATLDLLGDQEKLIRAIAATEK-PLVTVYIAGRPLNMNLASEVSDALLTAW--Y 599

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG 650
           PGE G   + +++ G++NP GRLPM+  P    ++P++          Q   R Y    G
Sbjct: 600 PGEQGGNGIVDVLTGEYNPSGRLPMS-VPRHVGQIPVH--------YSQGTLRDYMDCPG 650

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
             +Y FG+GLSYT ++Y      S L +SA+ KA S     Q  G       + +  +CT
Sbjct: 651 KPLYTFGYGLSYTTFAY------SNLKLSATAKAAS-----QPAGDN----EVMQTITCT 695

Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
                    VTN GD DG  VV L+      S   P  +L GF ++      S+E++F +
Sbjct: 696 ---------VTNTGDRDGDEVVQLYLNDEVSSVAVPPIRLKGFQKIFLKKGESREVTFQL 746

Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVG 797
              + LSI +++       G   +M+G
Sbjct: 747 TR-QDLSIYDRNMNFTAEPGRFNVMIG 772


>gi|198274480|ref|ZP_03207012.1| hypothetical protein BACPLE_00628 [Bacteroides plebeius DSM 17135]
 gi|198272682|gb|EDY96951.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           plebeius DSM 17135]
          Length = 912

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 209/754 (27%), Positives = 333/754 (44%), Gaps = 125/754 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA ++ +E + G+ +              T+FP  L    ++NR L   IG     
Sbjct: 118 RLGIPA-DFTNEGIRGVEN-----------YIATNFPTQLALGHTWNRELIRQIGYITGR 165

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +   K  Q     
Sbjct: 166 EARLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIAMGKGLQ----- 214

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                      ++ +++  KH IAY   K         +  ++ 
Sbjct: 215 --------------------------TDMQVASTAKHFIAYSNNKGAREGFARVDPQMSW 248

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ E+    PF   I++     +M SYN  +G P       L Q+ R   GF+GY+ SD 
Sbjct: 249 REVENIHAYPFTRVIQEAGILGVMSSYNDYDGFPIQSSYYWLTQRLRGTMGFRGYVVSDS 308

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  ++      K  +++    ++AG+++ C        +   +  I +G +  + ID 
Sbjct: 309 DAVEYLYSKHKTAKDMKEAVRQSVEAGLNVRCTFRSPESYVLPLRELIQEGGLSMETIDN 368

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+   GLF+  P +     L   +V +  H+++AL A+R+G+VLLKN    LP
Sbjct: 369 RVRDILRVKFLTGLFD-TPYQTDLA-LADKEVNSEAHQQVALQASREGLVLLKNANNLLP 426

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
           L+K+ +  +A+ GP  +  S     Y  +     ++L G++  V   +K  Y  GC  V 
Sbjct: 427 LDKSQIKRIAVCGPNADEASFALTHYGPVAVEVTTVLEGIKQQVKEGTKVTYTKGCDLVD 486

Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
            N                    +AV   K++D  +VV G  +    E++ R SL LPG Q
Sbjct: 487 ANWPESEIISYPLTAEEKTEIQKAVDNVKESDVAVVVLGGGIRTCGENKSRTSLDLPGHQ 546

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             L+ ++  T K PV+LVL  G PL +++  AD  + +IL   YPG  G  A+AE +FGD
Sbjct: 547 QQLLEAIVATGK-PVVLVLINGRPLSINW--ADKFVPAILEAWYPGSQGGTAIAEALFGD 603

Query: 607 FNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ------VYGFGHGL 660
           +NPGG+L +T +P++  ++P    N  A  + Q  G       G Q      +Y FG+GL
Sbjct: 604 YNPGGKLTVT-FPKTVGQIP---FNFPAKPASQVDGGQTPGMKGNQSRINGPLYPFGYGL 659

Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           SYT + Y  L   S +         +DK               + VT        V   +
Sbjct: 660 SYTTFEYSNLQLSSPVI--------TDK---------------EPVT--------VTCKI 688

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
            N G   G  VV L+ R    S  T EK L GF+RVH     +K++SF + P +   + N
Sbjct: 689 KNTGTRSGDEVVQLYTRDVISSVTTYEKNLRGFERVHLEPGETKKVSFQLLPRD-FQLLN 747

Query: 781 KHGRRILPLGNHVLMVG------ELRHSLTIETY 808
           K    ++  G   +M+G       L+  L I  Y
Sbjct: 748 KDNHWVVEPGMFQIMIGASSEDIRLKKGLEIRAY 781


>gi|298386950|ref|ZP_06996504.1| beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298260100|gb|EFI02970.1| beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 846

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 161/447 (36%), Positives = 225/447 (50%), Gaps = 57/447 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N +  I  R + L+S LT++EKI  L   +  I R+GI  Y   +E+LHGI   G   
Sbjct: 23  YKNMNAPIHERIQDLLSKLTIEEKISLLRATSPGIERMGIDKYYMGNEALHGIIRPG--- 79

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  A+ +N  L   I S ++ EARA +N  + G          LT
Sbjct: 80  -------KFTVFPQAIGLASMWNPELHHIIASVISDEARARWNELERGKKQKDQFSDLLT 132

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FVK  QG++               R 
Sbjct: 133 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDH--------------PRY 178

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
           LK               +  KH  A + E     +R+  +A ITE D  + + P F  CI
Sbjct: 179 LKSV-------------STPKHFAANNEEH----NRFYCDAAITETDMREYYLPAFEKCI 221

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
            +GKA  IM +YN +NGVP      L  K  + +WGF GYI SDC A   +     Y KT
Sbjct: 222 REGKAESIMTAYNAINGVPCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKT 281

Query: 343 HEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   +KAG+D+ CG  +      +A  +  V   +ID A  ++   ++RLG+F+ D
Sbjct: 282 PEAAAMIAIKAGLDLECGDYVFGAPLLNAYKQYMVSTAEIDSAAYHVLRARMRLGMFD-D 340

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           P K  Y  L P+ V   +HK+LAL+AARQ IVLLKN K  LPLN   + S+A++G  +N 
Sbjct: 341 PEKNPYNHLSPEIVGCEKHKELALEAARQSIVLLKNQKNTLPLNAKKIKSIAVVG--INA 398

Query: 462 ISQMGGGYTGIPC-SPKSLLRGLEAYV 487
            +   G Y+G P  +P S+L G+   V
Sbjct: 399 ANCEFGDYSGTPVNAPVSVLDGIRNRV 425



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 89/302 (29%), Positives = 151/302 (50%), Gaps = 49/302 (16%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + +A ++ +++D VI V G++ + E E +DR S+ LP  Q   +    + +    I+VL 
Sbjct: 587 YGDASKVIRESDVVIAVMGINQSIEREGQDRSSIELPKDQQIFIREAYKANPN-TIVVLV 645

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  + V +   D  I +I+   YPGE G  A+AE++FGD+NP GRLP+T+Y  S   +P
Sbjct: 646 AGSSMAVGWM--DQNIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFY-NSIEDLP 702

Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
             ND N++ +       R+Y ++ G  +Y FG+GLSYT + Y+ L               
Sbjct: 703 AFNDYNVKNN-------RTYMYFEGKPLYAFGYGLSYTKFDYRNL--------------- 740

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
                           +I + +   +L F    SV N+G  +G  V  ++ + P +   T
Sbjct: 741 ----------------NIKQDSQNITLNF----SVKNSGKYNGDEVAQVYVQFPDLGIKT 780

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRIL-PLGNHVLMVGELRHSLT 804
           P KQL GF RVH     +++IS  + P E+L + +   ++   P G +  MVG+   ++ 
Sbjct: 781 PLKQLKGFKRVHIKKGATEQISIEI-PKEELRLWDDQKKQFYTPSGTYNFMVGKSSDNIC 839

Query: 805 IE 806
           ++
Sbjct: 840 LQ 841


>gi|294674604|ref|YP_003575220.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
 gi|294474051|gb|ADE83440.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
          Length = 869

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 155/474 (32%), Positives = 239/474 (50%), Gaps = 74/474 (15%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YP+ +T LS   R ++L+SLLT +EK+  + + + +I RLGIP+Y WWSE+ HG+  +G 
Sbjct: 23  YPYLDTKLSNKERVENLLSLLTPEEKVGLMMNKSISIDRLGIPSYNWWSEACHGVRQDG- 81

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL-------------- 157
                      T +PQ +  AA+FN  L+ ++ S V+ EARA +N               
Sbjct: 82  ----------YTVYPQPIGMAAAFNAQLFYDVFSQVSDEARANWNRTDHNDPKLFNVPMG 131

Query: 158 -----GQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDG 212
                G   LTFW PN+NIFRDPRWGRGQET GEDP + +   V+ V   QG N K    
Sbjct: 132 VTYYPGNPELTFWCPNVNIFRDPRWGRGQETCGEDPYLNAVLGVQTVLGMQGNNDKY--- 188

Query: 213 GIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTE 272
                                       AC KH   Y +       R+S N   T +D  
Sbjct: 189 ------------------------FKTHACAKH---YAVHSGPEPLRHSMNVEPTNRDLW 221

Query: 273 DTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVA 331
           +T+ P F++ +++G    +MC+Y +  G P C    L     RN+WG+   + +DCDA+ 
Sbjct: 222 ETYLPAFKALVKKGNVREVMCAYQRFEGKPCCTSDRLLIDILRNKWGYDAIVLTDCDAIN 281

Query: 332 TIFEYQNYTKTHEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
             F    + +TH+D  +  + A   G D+ CG   +   +  + KG ++E D+D  L   
Sbjct: 282 NFFNRGQH-ETHKDGLSASVDAVLNGTDLECGKVFMSLVE-GLKKGLIKESDLDNHLRKT 339

Query: 389 FSVQLRLGLFNGDPRKG-KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKN 447
              +  LG+F  DP     + KLG D + + ++  +A+ AAR+ +VLL+N    LPL+K 
Sbjct: 340 LMGRFELGMF--DPADMLPWAKLGADVISSEKNDAMAVQAARESMVLLENKAGILPLSK- 396

Query: 448 AVSSLAIIGPLVNNISQMGGGYTGIPCSP--KSLLRGLEAYV--SKTHYASGCH 497
           ++ +LA++GP  ++++ + G Y G P +    SLL G++A V  +K  Y   C 
Sbjct: 397 SIKTLAVLGPNADDVNMLNGNYGGTPTAAHQHSLLSGIKAAVPGAKIIYNKACE 450



 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 78/265 (29%), Positives = 131/265 (49%), Gaps = 43/265 (16%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           F +     K+AD ++++ G+    E E  D+  + LP  Q  LV ++ +T K PVI V  
Sbjct: 606 FTDLANQVKEADAIVIIGGISAQMEGEGGDKQDIELPKVQQMLVKAMHKTGK-PVIFVNC 664

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G    ++F   + +  ++L   Y G+ GAKALAE++FGD+NPGG+LP+T+Y  +     
Sbjct: 665 SGSA--IAFGSVEGEYDALLQAWYAGQGGAKALAEVLFGDYNPGGKLPVTFYRSN----- 717

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
            ND+    D S +   R+YR++TG   Y FG+GLSYT +      A  +  ISA      
Sbjct: 718 -NDLPDFLDYSMK--NRTYRYFTGVPQYAFGYGLSYTTF------ALGDAKISAKQ---- 764

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
               +++ G                 +  + + VTN G  +G+  V ++ +      G P
Sbjct: 765 ----MKKNG-----------------KVTLTVPVTNTGKREGTETVQVYVKRLD-DAGAP 802

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVD 771
            K L GF +++  A  +++ +  +D
Sbjct: 803 IKALKGFQKLNLKAGETQKATITLD 827


>gi|255690202|ref|ZP_05413877.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260624221|gb|EEX47092.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 853

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 159/460 (34%), Positives = 231/460 (50%), Gaps = 60/460 (13%)

Query: 44  CKPPHFDSYP---FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWS 100
           C  P   SY    + N +  +  R   LIS LT++EKI  L   +  IPRLGI  Y   +
Sbjct: 16  CGAPCLFSYAQELYKNANAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGN 75

Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           E+LHG+   G            T FPQ +  AA++N  L   I + ++ EARA +N    
Sbjct: 76  EALHGVVRPG----------RFTVFPQAIGLAATWNPELQKRIATVISDEARARWNELDQ 125

Query: 161 G----------LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSD 210
           G          LTFW+P +N+ RDPRWGR  ET GEDP +       FVK  QG      
Sbjct: 126 GRNQKEQFSDVLTFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQG------ 179

Query: 211 DGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQD 270
                                D    L + +  KH  A + E     +R+  N  I+E+ 
Sbjct: 180 ---------------------DDPHYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQ 214

Query: 271 TEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDA 329
             + + P F  C+++GKA+ IM +YN +N VP  L   L QK  R +WGF+GY+ SDC  
Sbjct: 215 LREYYFPAFEMCVKEGKAASIMTAYNALNNVPCTLNSWLLQKVLRRDWGFQGYVVSDCGG 274

Query: 330 VATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNL 388
            + +     Y KT E +A   +KAG+D+ CG  +   +  +A  +    E DID A  ++
Sbjct: 275 PSLLVNAHKYVKTKEAAATLSIKAGLDLECGDDVYDEYLLNAYKQYMASEADIDSAAYHV 334

Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
            + +++LGLF+G  R   Y K+ P  + + EH+ +AL+AAR+ IVLLKN K  LPLN   
Sbjct: 335 LTARMKLGLFDGVER-NPYAKISPSVIGSKEHQTVALNAARECIVLLKNQKNMLPLNVKK 393

Query: 449 VSSLAIIGPLVNNISQMGGGYTGIP-CSPKSLLRGLEAYV 487
           + S+A++G  +N      G Y+G P   P S+L+G++  V
Sbjct: 394 LKSIAVVG--INAGKCEFGDYSGAPVVEPVSILQGIKNRV 431



 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 81/301 (26%), Positives = 144/301 (47%), Gaps = 46/301 (15%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +  ++ + V+ V G++ + E E +DR  + LP  Q   +  + + +   +ILVL 
Sbjct: 593 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IILVLV 651

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L V++   +  + +I+   YPGE G  A+AE++FGD+NP GRLP+T+Y +S  ++P
Sbjct: 652 AGSSLAVNWE--NEHLPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYY-KSLEQLP 708

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
             D     D ++   GR+Y+++    +Y FG+GLSYT + Y  L                
Sbjct: 709 AFD---DYDITK---GRTYQYFKKDVLYPFGYGLSYTTFKYSNLK--------------- 747

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                           +D+     ++ F    ++ N G   G  V  ++ R+P+++  T 
Sbjct: 748 ----------------VDDAGKTVNVSF----TLKNTGKRAGDEVAQVYVRLPEIAGSTQ 787

Query: 747 E-KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTI 805
             +QL GF RV   A  S+++   +D  +      K    ++P G+   MVG     + +
Sbjct: 788 AIRQLKGFRRVALKAGESRKVEITLDKEQLRYWDEKQACFVVPQGSFTFMVGASSGDIRL 847

Query: 806 E 806
           E
Sbjct: 848 E 848


>gi|423346097|ref|ZP_17323785.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
           CL03T12C32]
 gi|409220895|gb|EKN13848.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
           CL03T12C32]
          Length = 955

 Score =  252 bits (644), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 202/734 (27%), Positives = 333/734 (45%), Gaps = 113/734 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR+L   +G     
Sbjct: 161 RLGIPT-DFTNEGIRGVES-----------YIATNFPTQLGLGHTWNRNLVHKVGYITGR 208

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           E R +      G T  +AP +++ RD RWGR +E  GE P +V+   +E  K  Q ++  
Sbjct: 209 EGRLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMAKGMQTDH-- 260

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH IAY   K G       +  ++ 
Sbjct: 261 -----------------------------QVAATSKHYIAYSNNKGGREGMARVDPQMSP 291

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ E     P++  I++     +M SYN  +G P       L  + R E+GF+GY+ SD 
Sbjct: 292 REVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDS 351

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  +F         ++S    + AG++I C        +   +  I +G +    ID 
Sbjct: 352 DAVEYLFSKHGTAADMKESVLQSVLAGLNIRCTFRSPDSYVLPLRELIAEGAIPMSTIDD 411

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+  +GLF+  P +    +   +  C +E++++AL A+++ +VLLKN    LP
Sbjct: 412 RVRDILRVKFLVGLFD-HPYQIDLKETDKEVNC-AENQQVALQASKESLVLLKNQDAVLP 469

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
           L+ N +S +A+ GP  +  +     Y  +     ++L G+   V   +   +  GC  V 
Sbjct: 470 LDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTNVLFTKGCDLVD 529

Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
            N                +   +AV  AKK+D  +VV G       E++ R SL LPG+Q
Sbjct: 530 ANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSDRTCGENKSRSSLDLPGRQ 589

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
           + L+ +V  T K PV+L+L  G PL +++A  D  + +IL   YPG  G  A+A+ +FGD
Sbjct: 590 LDLLQAVVATGK-PVVLILINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIADALFGD 646

Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           +NPGG+L +T +P++  ++P N     N + D  R              +Y FG+GLSYT
Sbjct: 647 YNPGGKLTVT-FPKTVGQIPFNFPTKPNAQVDGGRNKGLDGNMSRVNGPLYPFGYGLSYT 705

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            + Y      S+++I  ++                    + +V   T     V+  VTN 
Sbjct: 706 TFEY------SDISIQPAI--------------------VTQVQPVT-----VRCKVTNT 734

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G   G  VV L+ R    S  T EK L+GFDR+H     +KE++F ++P   L + N   
Sbjct: 735 GKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQLLNSDN 793

Query: 784 RRILPLGNHVLMVG 797
             ++  G+  +MVG
Sbjct: 794 HWVVEPGDFKVMVG 807


>gi|423289663|ref|ZP_17268513.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
           CL02T12C04]
 gi|423298156|ref|ZP_17276215.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
           CL03T12C18]
 gi|392663697|gb|EIY57244.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
           CL03T12C18]
 gi|392667374|gb|EIY60884.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
           CL02T12C04]
          Length = 850

 Score =  252 bits (643), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 152/449 (33%), Positives = 235/449 (52%), Gaps = 57/449 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N +  +  R   L+S LT++EKI  L   +  IPRLGI  Y   +E+LHG+   G   
Sbjct: 27  YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 83

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  AA++N  L   + + ++ EARA +N    G          LT
Sbjct: 84  -------RFTVFPQAIGLAATWNPVLQQKVATVISDEARARWNELDQGRNQKEQFSDVLT 136

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FVK  QGE+               R 
Sbjct: 137 FWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGED--------------PRY 182

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
           LK             + +  KH +A + E     +R+  N  I+E+   + + P F  C+
Sbjct: 183 LK-------------IVSTPKHFVANNEEH----NRFICNPQISEKQLREYYFPAFEMCV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++GKA+ IM +YN +N VP  L   L QK  R +WGF+GY+ SDC   + +     Y KT
Sbjct: 226 KKGKAASIMTAYNALNDVPCTLNAWLLQKVLRQDWGFRGYVVSDCGGPSLLVNAHKYVKT 285

Query: 343 HEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   +KAG+D+ CG  +   +  +A  +  V + DID A  ++ + +++LG+F+  
Sbjct: 286 KETAATLSIKAGLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLAARMKLGMFDSK 345

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
            R   Y ++ P  + + +H+++ALDAAR+ IVLLKN K  LPLN + + S+A++G  +N 
Sbjct: 346 ER-NPYARISPSVIGSKDHQQVALDAARECIVLLKNQKNMLPLNVDKLKSIAVVG--INA 402

Query: 462 ISQMGGGYTGIPC-SPKSLLRGLEAYVSK 489
            +   G Y+G P   P S+L+G++  V +
Sbjct: 403 GTCEFGDYSGAPVIEPVSVLQGIKNRVGE 431



 Score =  125 bits (314), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 87/302 (28%), Positives = 143/302 (47%), Gaps = 47/302 (15%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +   + + V+ V G++ + E E +DR  + LP  Q   +  + + +   +I+VL 
Sbjct: 591 YGEAGKAVSECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 649

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L V++   D  I +I+   YPGE G  A+A+++FGD+NP GRLP+T+Y       P
Sbjct: 650 AGSSLAVNWM--DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKSLDELPP 707

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
            +D ++         GR+Y+++ G  +Y FG+GLSY+++ Y      S+L +  S     
Sbjct: 708 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYSSFKY------SDLKVKDS----- 749

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                            D+VT        V   + N G   G  V  ++ R+P+     P
Sbjct: 750 ----------------TDKVT--------VSFRLKNTGRRKGDEVAQVYVRIPETGGIVP 785

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR-ILPLGNHVLMVGELRHSLTI 805
            K+L GF RV      S+ I   +D  EQL   +    + ILP G   +MVG     + +
Sbjct: 786 IKELKGFRRVPLEPGESRAIDIELDK-EQLRYWDTTKEQFILPAGTFDVMVGASSKDIRL 844

Query: 806 ET 807
           +T
Sbjct: 845 QT 846


>gi|336415919|ref|ZP_08596257.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
           3_8_47FAA]
 gi|335939822|gb|EGN01694.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
           3_8_47FAA]
          Length = 782

 Score =  252 bits (643), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 216/750 (28%), Positives = 336/750 (44%), Gaps = 135/750 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L   +G  +A 
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R+     Q G   + P +++ RDPRW R +ET GEDP++        V    G N   
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLGGGNLSQ 231

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
               I                          A  KH +AY + + G    Y   A +  +
Sbjct: 232 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 262

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
           D    F PPFR  I+ G  S +M SYN ++G+P     +L  Q  RNEW F+G++ SD  
Sbjct: 263 DLHQNFLPPFRKAIDAGALS-VMTSYNSIDGIPCTSNHNLLTQLLRNEWKFRGFVVSDLY 321

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I E      T E++A   + AG+D++ G     +   A+  G++ +  ID A+  +
Sbjct: 322 SIEGIHESHFVAPTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRV 381

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP      K+    V   EH +LA   A+  I LLKN+   LPL+
Sbjct: 382 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
           K  ++ +A+IGP  +N   M G YT      + K++L G+   +S  +  Y  GC   D 
Sbjct: 436 KT-INKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 494

Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
             N      +A+  A++++ VIVV G     D     +ET                E  D
Sbjct: 495 TVNE---IEQAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
           R SL L G+Q  L+ S+ +T K P+I+V   G PL+ ++A       ++L   YPG+ G 
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 608

Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
            A+A+++FGD+NP GRLP++  P S  ++P+   N +A  +  Y   S      + +Y F
Sbjct: 609 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNQKAPRNHDYVEVS-----SSPLYSF 661

Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
           G+G+SYT + Y                  SD  ++Q++              C    F V
Sbjct: 662 GYGMSYTTFEY------------------SDLQVVQKSA------------RC----FEV 687

Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
              V N G  DG  V  L+ R    S   P KQL  F+R H      K+++F V   E  
Sbjct: 688 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 746

Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            + N   ++++  GN  LM+G   + + ++
Sbjct: 747 FLVNYTLKKVVESGNFHLMIGAASNDIRLQ 776


>gi|375143423|ref|YP_005005864.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361057469|gb|AEV96460.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 793

 Score =  252 bits (643), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 207/737 (28%), Positives = 333/737 (45%), Gaps = 129/737 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA ++ +E + G+ +              T FP  L    ++N+ L    G     
Sbjct: 142 RLGIPA-DFTNEGIRGVEA-----------YEATGFPTELNMGMTWNKELVHQEGIITGR 189

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EARA+      G T  +AP +++ RD RWGR +E+ GEDP +V++  +   K  Q     
Sbjct: 190 EARAL------GYTNVYAPIMDVARDQRWGRLEESYGEDPYLVASMGIALAKGIQ----- 238

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
             DG +                         ++  KH   Y   K     +   +  +  
Sbjct: 239 -QDGKV-------------------------ASTAKHFAVYSANKGAREGQARTDPQVAP 272

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDC 327
           ++ E+    PF+  I++     +M SYN  +G+P       L Q+ R E GF GY+ SD 
Sbjct: 273 REVENLLLYPFKKVIKEAGIMGVMSSYNDYDGIPVSGSNYWLIQRLRVEMGFTGYVVSDS 332

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DA+  +    +     +++      AGM++         ++ + +  + +G++    I+ 
Sbjct: 333 DALEYLATKHHVAANLKEAVFQAFMAGMNVRTTFKAPDSIIIYLRQLVKEGRIPMDTINH 392

Query: 384 ALLNLFSVQLRLGLFN-------GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLK 436
            + ++  V+ RLGLF+        + RK          V +   +++AL A+R+ +VLLK
Sbjct: 393 RVADVLRVKFRLGLFDHPYVESAAETRK---------VVNSDASQQIALQASRESVVLLK 443

Query: 437 NDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS--KTHYAS 494
           N+   LPL K ++  +A++GP   +       Y  +     ++L+G++A +   K  YA 
Sbjct: 444 NNNNILPLVK-SLDKIAVVGPNATDDDYAHTHYGPLGSPSVNVLQGIQAKLGAGKVLYAK 502

Query: 495 GCHDV----------PCNSDAG----FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSL 540
           G   V          P   DAG       AV I K+A   IVV G +     E + R  L
Sbjct: 503 GVDLVDKNWPESEILPEPMDAGEQAMLDSAVNITKQAQMAIVVLGGNTRTAGESKSRTDL 562

Query: 541 LLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALA 600
            LPG Q+ LV ++  T K PV++VL G  P+ +++   D  I  I++ GYPG  G  A+A
Sbjct: 563 DLPGHQLELVKAIKATGK-PVVVVLLGTQPMTINWI--DKYIDGIVYAGYPGVKGGIAVA 619

Query: 601 EIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
           +++FGD+NPGG+L +TW P+S  ++P+N  +     S +  G   +      +Y FG GL
Sbjct: 620 DVLFGDYNPGGKLTLTW-PKSVGQIPLNFPSKPGAQSDE--GEHAKI--KGLLYPFGFGL 674

Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           SYT++ Y      + L IS    A                   D V         V + V
Sbjct: 675 SYTSFGY------TNLKISTGKTAA------------------DPVA--------VTVDV 702

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
           TN G + G  VV  + R    S  T EK L GF+RVH  A  +K ISF + P E+L + N
Sbjct: 703 TNTGKLAGDEVVQCYIRDVLSSVTTYEKLLKGFERVHLQAGETKTISFTI-PREELKLYN 761

Query: 781 KHGRRILPLGNHVLMVG 797
           +  + +L  G   +M+G
Sbjct: 762 REMKFVLEPGEFSVMIG 778


>gi|116621797|ref|YP_823953.1| glycoside hydrolase family protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116224959|gb|ABJ83668.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 765

 Score =  252 bits (643), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 220/746 (29%), Positives = 350/746 (46%), Gaps = 138/746 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  E LHG A+ G            TSFPQ +   A+F+  L  ++ +  A 
Sbjct: 104 RLGIPVI-FHEECLHGHAAIGG-----------TSFPQPIGLGATFDPELVESLFAMTAA 151

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EARA        LT   P +++ R+PRWGR +ET GEDP +VS   +  V+ FQG+    
Sbjct: 152 EARARGT--HQALT---PVVDVAREPRWGRVEETYGEDPFLVSRMGIAAVRGFQGDA--- 203

Query: 210 DDGGIGFGFREK-RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                   FR+K RV+                A  KH  A+   + G          ++ 
Sbjct: 204 -------TFRDKTRVI----------------ATLKHFAAHGQPESGT---NCAPVNVSM 237

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-CLRGDLFQKARNEWGFKGYITSDC 327
           +   +TF  PF+  +++G A  +M SYN+++GVP+   R  L    R EWGFKG++ SD 
Sbjct: 238 RVLRETFLFPFKEALDKGCAISVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSDY 297

Query: 328 DAVATIFEYQNYTKTH--------EDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQ 377
            A   I+E     ++H         ++ A  ++AG++I      C L H    + KG +Q
Sbjct: 298 YA---IYELSYRPESHGHFVAKDKREACALAVQAGVNIELPEPDCYL-HLVDLVHKGVLQ 353

Query: 378 EKDIDRALLNLFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVL 434
           E  +D  +  +   + ++GLF+    DP + +  ++   D     H++LA+ AAR+ I L
Sbjct: 354 ESQLDELVEPMLRWKFQMGLFDDPYVDPAEAE--RIAGCDA----HRELAMQAARETITL 407

Query: 435 LKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTH 491
           LKND   +PL+ +A+ ++A+IGP  N    + GGY+G+P    ++L G+   V   +K  
Sbjct: 408 LKNDGPVVPLDLSAIKTIAVIGPNANR--SLLGGYSGVPKHDVTVLDGIRERVGSRAKVV 465

Query: 492 YASGCHDV--------------PCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED--- 534
           YA GC                 P        EAV++AK+AD +++  G +     E    
Sbjct: 466 YAEGCKITIGGSWVQDEVTPSDPAEDRRQIAEAVKVAKRADVIVLAIGGNEQTSREAWSP 525

Query: 535 ---RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
               DR SL L G+Q  LV ++  T K PVI  L  G P+ +++      + +I    Y 
Sbjct: 526 KHLGDRPSLDLVGRQEELVRAMVATGK-PVIAFLFNGRPISINYLA--QSVPAIFECWYL 582

Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGT 651
           G+   +A+AE++FGD NPGG+LP+T  P S   +P         S+R    R Y F    
Sbjct: 583 GQETGRAVAEVLFGDTNPGGKLPIT-IPRSAGHLPA--FYNHKPSAR----RGYLFDEVG 635

Query: 652 QVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTS 711
            +Y FG+GLSYT ++++ L               + K + +++ +R              
Sbjct: 636 PLYAFGYGLSYTTFAFQNLRL-------------AKKKMHRESTAR-------------- 668

Query: 712 LRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
               V + VTN G  +G  VV L+ R    S   P K+L GF ++      ++ + F + 
Sbjct: 669 ----VLVDVTNTGAREGREVVQLYIRDLVSSVTRPIKELKGFRKITLQPGQTQTVEFEIT 724

Query: 772 PCEQLSIANKHGRRILPLGNHVLMVG 797
           P + L+  N   + ++  G+  +MVG
Sbjct: 725 P-DLLAFYNVDMKFVVEPGDFEIMVG 749


>gi|242206820|ref|XP_002469265.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
 gi|220731725|gb|EED85567.1| hypothetical protein POSPLDRAFT_51213 [Postia placenta Mad-698-R]
          Length = 312

 Score =  252 bits (643), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 130/306 (42%), Positives = 175/306 (57%), Gaps = 33/306 (10%)

Query: 55  CNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVN 114
           C+TS +   RA +LISL TL+EKI    + A  +PRLG+PAY+WW E+LHG+A + PGV 
Sbjct: 34  CDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEALHGVAES-PGVI 92

Query: 115 F--NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIF 172
           F  +G  S  TSFPQ ++  A+F+ +L +++ + V+ EARA  N  ++G+ FW PNIN F
Sbjct: 93  FAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRSGIDFWTPNINPF 152

Query: 173 RDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESD 232
           +DPRWGRGQETPGEDP  + +Y    +   QG          G     KR++        
Sbjct: 153 KDPRWGRGQETPGEDPFHLQSYVYNLITGLQG----------GLDPEYKRIV-------- 194

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIM 292
                   A CKH  AYDLE W    RY F+A+++ QD  + +   FR+C         M
Sbjct: 195 --------ATCKHFAAYDLENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFM 246

Query: 293 CSYNQVNGVPACLRGDLFQKA-RNEWGFKG---YITSDCDAVATIFEYQNYTKTHEDSAA 348
           CSYN VNGVP+C    L Q   R+ WG+     YITSDCDA+  I+E   YT T  ++ A
Sbjct: 247 CSYNAVNGVPSCANSYLLQDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTATRAETVA 306

Query: 349 GVLKAG 354
             L AG
Sbjct: 307 DALNAG 312


>gi|427384989|ref|ZP_18881494.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728250|gb|EKU91109.1| hypothetical protein HMPREF9447_02527 [Bacteroides oleiciplenus YIT
           12058]
          Length = 862

 Score =  252 bits (643), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 154/460 (33%), Positives = 236/460 (51%), Gaps = 55/460 (11%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           + P+ N  LS + RAK L+S LTL+EK   + D++ AIPRLGI  + WWSE+LHG+A+ G
Sbjct: 20  TLPYQNPELSPAERAKDLVSRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG 79

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---------LGQAG 161
                     +VT FP+ +  AASFN  L   I +A + E RA +N              
Sbjct: 80  ----------NVTVFPEPVGMAASFNEKLVFEIFNATSDEMRAKHNERVRNGLEDTRFHS 129

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           L+ W PN+NIFRDPRWGRGQET GEDP + S   +  VK  Q                  
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQ------------------ 171

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFR 280
                 G E ++  +L+  AC KH   +   +W   SR+S N   I+ +D  +T+ P F+
Sbjct: 172 ------GPEDEKYRKLL--ACAKHYAVHSGPEW---SRHSANLNNISPRDLWETYLPAFK 220

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
           + +++     +MC+Y +++  P C    L Q+  R+EWGFK  + SDC A+A  +     
Sbjct: 221 ALVQKADVREVMCAYQRLDDDPCCGSTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHKS 280

Query: 340 TKTHEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           +     +A     AG D+ CG     +    A+ +G + E++ID+ +L L   +  LG  
Sbjct: 281 SSDAVHAAVKGTMAGTDVECGYGYAYQKLPEAVSRGLITEEEIDKHVLRLLEGRFELGEM 340

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           + DP   K+ ++    V +  HK L+L+ +RQ + LL+N    LPL+K ++  +A+IGP 
Sbjct: 341 D-DPSLVKWSQIPMSVVNSKAHKDLSLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGPN 398

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
            ++   + G Y G P    ++L G +  + K    Y  GC
Sbjct: 399 ADDKPMLWGNYNGTPNQTITILDGFKTKLKKNQIIYMKGC 438



 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 75/312 (24%), Positives = 132/312 (42%), Gaps = 62/312 (19%)

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
           ++P N D    E++   K  D V+ V G+    E E+           DR  + LP  Q 
Sbjct: 585 ELPINYD----ESISKLKNVDMVVFVGGISPQLEGEEMPLNLPGFKNGDRTDIELPAVQR 640

Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
           + + ++   SK+ V +  +G     ++         +IL   Y GE G +A+A+++FGD+
Sbjct: 641 NFLKALKEASKQVVFVNCSGSS---MALLPETESCDAILQAWYGGELGGQAVADVLFGDY 697

Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
           NP G+LP+T+Y  +       D +M+        GR+YR Y    ++ FG GLSYT+++ 
Sbjct: 698 NPSGKLPVTFYKSTKQLPDYEDYSMK--------GRTYR-YMSDPLFPFGFGLSYTDFTV 748

Query: 668 KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
                                   Q + ++L     +E  + T       + ++N G   
Sbjct: 749 ---------------------GTAQCSKTQL---RTEEALTLT-------VPISNTGKRS 777

Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI--ANKHGRR 785
           G+ V+ ++ R      G P K L  + R    A  +++I   + P E       + +  R
Sbjct: 778 GTEVIQVYIRKTD-DTGGPLKSLKAYARAELAAGATQDIEIQL-PAESFECFDPSTNTMR 835

Query: 786 ILPLGNHVLMVG 797
           + P G + L  G
Sbjct: 836 VAP-GEYELFYG 846


>gi|295135338|ref|YP_003586014.1| glycoside hydrolase [Zunongwangia profunda SM-A87]
 gi|294983353|gb|ADF53818.1| glycoside hydrolase family protein [Zunongwangia profunda SM-A87]
          Length = 764

 Score =  252 bits (643), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 215/742 (28%), Positives = 337/742 (45%), Gaps = 136/742 (18%)

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           EKI+   D A    R+GIP     S+ +HG                 T+FP  L +AAS+
Sbjct: 90  EKIRVAQDYAVNDTRMGIPLL-IGSDVIHGYK---------------TTFPIPLGTAASW 133

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
           +  +        A EA A       G+ + ++P ++I RDPRWGR  E  GEDP + S  
Sbjct: 134 DMEMIKKTAEIAAQEATA------DGINWNFSPMVDIARDPRWGRIAEGAGEDPYLGSQI 187

Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
           A   V+ +QG+                          D   E  + A  KH   Y   + 
Sbjct: 188 AKAMVEGYQGD--------------------------DLAKENTMIATVKHFALYGASEA 221

Query: 255 G---NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRG 307
           G   N +  S   M  E      + PP+++ I+ G A  +M S+N V+GVPA     L  
Sbjct: 222 GRDYNTTDMSRVKMFNE------YLPPYKAAIDAG-AESVMSSFNDVDGVPATGNKWLLT 274

Query: 308 DLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN-CGTCMLRH 366
           DL    R+ WGF+G++TSD  ++  +  +       +  +A  LKAG+D++  G   L+ 
Sbjct: 275 DLL---RDRWGFEGFVTSDYTSLNEMIAHG--MGDLQAVSALALKAGLDMDMVGEGYLKT 329

Query: 367 TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALD 426
            + ++D+GKV E +I  A   +   + +LGLF+ DP K         D+ + E++  +  
Sbjct: 330 LKKSLDEGKVTEAEITTAARRILEAKYKLGLFD-DPYKYLDESRPEKDILSEENRTFSRK 388

Query: 427 AARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY--TGIPCSPKSLLRGLE 484
            A    VLLK D    PL KNA   +A+IGPL NN + M G +  TG P     +L+G++
Sbjct: 389 VAAHSFVLLKKDAGVFPLKKNA--KIALIGPLANNKNNMLGTWAPTGNPQLSVPVLQGVK 446

Query: 485 --AYVSKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADFVIVV 523
             A  +K  YA G +                       + +    EA+++AKK+D ++ V
Sbjct: 447 NVAPKAKVTYAQGANITDDAQLAENINVFGPRAEISETSPEKMLEEALKVAKKSDVIVAV 506

Query: 524 AGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQIS 583
            G       E   R +LL+P  Q  L+  +A+T K P+ LVL  G PL++S  E++  I 
Sbjct: 507 VGEATEMSGEAASRTNLLIPESQKKLIRELAKTGK-PMALVLMSGRPLNIS-EESEMNID 564

Query: 584 SILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPG- 642
            IL + +PG     A+A++IFGD+NP G++  +W P +  +VP+     R     +  G 
Sbjct: 565 -ILQVWHPGVEAGNAIADVIFGDYNPSGKITASW-PRNVGQVPVYYAMKRTGRPGEVEGF 622

Query: 643 ----RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRL 698
                 +     + +Y FG+GLSYT + Y            + +KA +D+          
Sbjct: 623 QKFKSEFLDTPNSPLYPFGYGLSYTEFEY------------SDVKASADE---------- 660

Query: 699 DYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHT 758
             + +D   + +++       +TN GD DG  VV L+      S   P KQLIGF+++  
Sbjct: 661 --LKMDGTLTLSAI-------ITNTGDYDGEEVVQLYIHDKVRSITPPMKQLIGFEKIML 711

Query: 759 VAKGSKEISFGVDPCEQLSIAN 780
               SK ++F +   E L   N
Sbjct: 712 KKGESKTVTFEIS-AEDLKFYN 732


>gi|340616359|ref|YP_004734812.1| xylosidase/arabinosidase [Zobellia galactanivorans]
 gi|339731156|emb|CAZ94420.1| Xylosidase/arabinosidase, family GH3 [Zobellia galactanivorans]
          Length = 801

 Score =  251 bits (642), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 205/734 (27%), Positives = 323/734 (44%), Gaps = 114/734 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+                TSFP  L   A++N++L   IG     
Sbjct: 142 RLGIPV-DFTNEGIRGLCH-----------EKATSFPSQLGVGATWNKNLVGKIGHITGK 189

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  ++P ++I RDPRWGR  E  GEDP +V     + VK  Q E   
Sbjct: 190 EARLL------GYTNVYSPILDIARDPRWGRVVECYGEDPYLVGELGYQMVKGIQQEK-- 241

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + +  KH   Y   K G       +A ITE
Sbjct: 242 ------------------------------VVSTPKHFAIYSAPKGGRDGDARTDAHITE 271

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++    +  PF+  I+   A  +M SYN  NGVP            R +WGFKGY+ SD 
Sbjct: 272 RELFSLYLHPFKRAIKDAGAMGVMSSYNDYNGVPVSSSKYFLNDILREDWGFKGYVVSDS 331

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDIDR 383
            AV  I +  +  K  +D+    + AG+++     M    +   +  + +G +    ID 
Sbjct: 332 RAVEFIADKHHVAKDRKDAVRQAVLAGLNVRTDFTMPEDFILPVRELVKEGGLDMATIDD 391

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+   GLF  D   GK  K     V   E++++A  A+ + IVLLKN++  LP
Sbjct: 392 RVRDILRVKFWQGLF--DAPYGKQMKEADKTVGKPEYQEVAYQASLESIVLLKNEENILP 449

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGC---- 496
           L+ +   S+ + GP    I+     Y        S+  G++    K     Y  GC    
Sbjct: 450 LDFSKYKSVLVTGPNAKAINHSVSRYGPSHIDVVSVFDGIKEKFPKDVEIKYTKGCVFFD 509

Query: 497 ---------HDVPCNSD-AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
                    +  P  ++ +   +AV +AK     IVV G D     E R R SL LPG Q
Sbjct: 510 ENWPDSELMNTPPTEAEQSEIDKAVAMAKTVGLAIVVLGDDEETVGESRSRTSLDLPGNQ 569

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             LV  + +T   PVI+VL  G P+ +++   D  +  I+   + G+ G  A+A+++ G 
Sbjct: 570 QKLVEEIYKTGT-PVIVVLINGRPMTINW--VDKYVPGIVEGWFQGKFGGSAIADVLVGS 626

Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           +NPGG+LP++ +P++  ++PMN       +AD   + P  S +   G  +Y FG+GLSYT
Sbjct: 627 YNPGGKLPVS-FPKTVGQLPMNFPSKPGAQADQPAKGPNGSGKTRVGGFLYPFGYGLSYT 685

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            + Y      + L I +++K G    +                         V + +TN+
Sbjct: 686 TFEY------TNLKIRSNIKNGLGDVV-------------------------VSVDITNS 714

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G   G  +V L+      S    EKQL GF+R+   A  +K ++F + P E LS+ N+  
Sbjct: 715 GKRKGDEIVQLYFSDETSSVTVYEKQLRGFERISLEAGETKTVNFTLSP-EDLSLYNRQM 773

Query: 784 RRILPLGNHVLMVG 797
             +L  G+  +M+G
Sbjct: 774 EFVLEPGSFTIMIG 787


>gi|206901921|ref|YP_002251428.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
 gi|206741024|gb|ACI20082.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
          Length = 756

 Score =  251 bits (642), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 199/681 (29%), Positives = 321/681 (47%), Gaps = 112/681 (16%)

Query: 101 ESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA 160
           E+LHG  + G            T FPQ +  A+++N  L   + +A+  E R+     + 
Sbjct: 138 EALHGCMAKGS-----------TIFPQAIGMASTWNPELIYQVATAIGKETRS-----RG 181

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
                +P INI RDPR GR +ET GEDP + S  AV ++K                G +E
Sbjct: 182 IHQVLSPTINIARDPRCGRTEETYGEDPYLASRMAVAYIK----------------GVQE 225

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFR 280
           + V+                A  KH +A  +   G+  R S+    +E+   + + P FR
Sbjct: 226 QGVI----------------ATPKHFVANFV---GDGGRDSYPIHFSERLLREIYFPAFR 266

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
           + IE+  A  +M +YN ++G+P      L  +  R EWGFKGY+ SD  +V  +      
Sbjct: 267 ASIEEAGALSLMAAYNSLDGIPCSSNKWLLTRILRKEWGFKGYVVSDYFSVLHLMTKHKV 326

Query: 340 TKTHEDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
            ++  ++A   L+AG+D+    +         I + K+ +  +D A+  +  V+  +GLF
Sbjct: 327 AESKAEAAKLSLEAGLDMELPDSDCFEEIPGLIRESKLSQDTLDEAVRRVLRVKFWIGLF 386

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           +       Y +   D    SEH++LAL  AR+ IVLLKN+   LPLNK+ + S+A+IGP 
Sbjct: 387 DNPFVDPDYAERIND---CSEHRELALRVARESIVLLKNEG-ILPLNKD-IRSIAVIGP- 440

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAK 515
            N      GGY+G      + L G++  +    K ++A GC  +   S +GF EA++IA+
Sbjct: 441 -NAAVPRLGGYSGYGVKVVTPLEGIKNKLGDKVKVYFAEGC-GLNDTSKSGFDEAIKIAQ 498

Query: 516 KADFVIVVAGLDLTQ-ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           K+D  I+  G  + + E E RDR +L LPG Q  L+  +  T+  PVI+VL  G    ++
Sbjct: 499 KSDVAILFMGNSVPETEGEQRDRHNLNLPGVQEDLIKEICNTNT-PVIVVLINGSA--IT 555

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM---NDMN 631
                 ++ +++   YPGE G  A+A+++FGD+NPGG+LP++ +P+  +++P+   +  +
Sbjct: 556 MMNWIDKVQAVIEAWYPGEEGGNAIADVLFGDYNPGGKLPIS-FPKYSSQLPLYYNHKPS 614

Query: 632 MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAGSDKNI 690
            R D      G  Y F        FG+GLSYT++ Y  L   P E+          D  +
Sbjct: 615 GRVDDYVDLRGNQYLF-------PFGYGLSYTDFKYSNLRITPEEIP--------RDGEV 659

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
           +                        +   + N G   G  VV L+      S   P K+L
Sbjct: 660 V------------------------ITFDIENIGKYKGDEVVQLYLHDEFASVARPIKEL 695

Query: 751 IGFDRVHTVAKGSKEISFGVD 771
             F+RV       K +SF ++
Sbjct: 696 KRFERVTLDVGERKTVSFKLN 716


>gi|423722678|ref|ZP_17696831.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
           CL09T00C40]
 gi|409241951|gb|EKN34716.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
           CL09T00C40]
          Length = 955

 Score =  251 bits (642), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 204/734 (27%), Positives = 331/734 (45%), Gaps = 113/734 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR+L   +G     
Sbjct: 161 RLGIPT-DFTNEGIRGVES-----------YIATNFPTQLGLGHTWNRNLVHKVGYITGR 208

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           E R +      G T  +AP +++ RD RWGR +E  GE P +V+   VE  K  Q     
Sbjct: 209 EGRLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGVEMAKGMQ----- 257

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                      +  ++A  KH IAY   K G       +  ++ 
Sbjct: 258 --------------------------TDYQVAATSKHYIAYSNNKGGREGMARVDPQMSP 291

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ E     P++  I++     +M SYN  +G P       L  + R E+GF+GY+ SD 
Sbjct: 292 REVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDS 351

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  +F         ++S    + AG++I C        +   +  I +G +    ID 
Sbjct: 352 DAVEYLFSKHGTAADMKESVLQSVLAGLNIRCTFRSPDSYVLPLRELIAEGAIPMSTIDD 411

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+  +GLF+  P +    +   +  C +E++ +AL A+++ +VLLKN    LP
Sbjct: 412 RVRDILRVKFLVGLFD-HPYQIDLKETDKEVNC-AENQLVALQASKESLVLLKNQDAVLP 469

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
           L+ N +S +A+ GP  +  +     Y  +     ++L G+   V   +   +  GC  V 
Sbjct: 470 LDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTDVLFTKGCDLVD 529

Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
            N                +   +AV  AKK+D  +VV G       E++ R SL LPG+Q
Sbjct: 530 ANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENKSRSSLDLPGRQ 589

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
           + L+ +V  T K PV+LVL  G PL +++A  D  + +IL   YPG  G  A+A+ +FGD
Sbjct: 590 LDLLQAVVATGK-PVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIADALFGD 646

Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           +NPGG+L +T +P++  ++P N     N + D  R              +Y FG+GLSYT
Sbjct: 647 YNPGGKLTVT-FPKTVGQIPFNFPTKPNAQVDGGRNKGLDGNMSRVNGPLYPFGYGLSYT 705

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            + Y      S+++I  ++                    + +V   T     V+  VTN 
Sbjct: 706 TFEY------SDISIQPAI--------------------VTQVQPVT-----VRCKVTNT 734

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G   G  VV L+ R    S  T EK L+GFDR+H     +KE++F ++P   L + N   
Sbjct: 735 GKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQLLNSDN 793

Query: 784 RRILPLGNHVLMVG 797
             ++  G+  +MVG
Sbjct: 794 HWVVEPGDFKVMVG 807


>gi|427387362|ref|ZP_18883418.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725523|gb|EKU88394.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
           12058]
          Length = 865

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 156/459 (33%), Positives = 235/459 (51%), Gaps = 58/459 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N  L+ S RA  L+  +TL+EKI Q+ + + AI RLGIPAY WW+E+LHG+A  G  
Sbjct: 24  PYKNPDLTPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYNWWNEALHGVARAG-- 81

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNL--------GQAGLTF 164
                     T FPQ +  AA+F+        S V+ EARA Y+         G  GLTF
Sbjct: 82  --------KATVFPQAIGLAATFDNQAVHETFSIVSDEARAKYHDFQRKGERDGYKGLTF 133

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PNINI+RDPRWGRG ET GEDP + S   +  VK  QG+         G G  +K   
Sbjct: 134 WTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQGD---------GTGKYDK--- 181

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
                           AC KH   +   +W   +R+SF+A  I+++D  +T+ P F++ +
Sbjct: 182 --------------THACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLV 224

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
            +GK   +MC+YN+  G P C    L  +  R +WG+   + SDC A+   F Y N+ +T
Sbjct: 225 TEGKVKEVMCAYNRYEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGD-FYYPNHHET 283

Query: 343 H---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H     ++A  + +G D+ CG         A+ KG + E  I+ ++  L   + +LG+F+
Sbjct: 284 HPTAAAASADAVVSGTDLECGGSY-SSLNEAVRKGLISEDKINESVFRLLRARFQLGMFD 342

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            D     + ++    V + EH   AL+ AR+ +VLL N    LPL+K +V  +A++GP  
Sbjct: 343 -DNTLVSWSEIPYSVVESKEHVAKALEMARKSMVLLTNKNNILPLSK-SVRKVAVLGPNA 400

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSK--THYASGC 496
           N+   +   Y G P    ++L G+   + +   +Y  GC
Sbjct: 401 NDSVMLWANYNGFPTKSVTILEGIRNKLPEGAVYYEKGC 439



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 94/317 (29%), Positives = 148/317 (46%), Gaps = 57/317 (17%)

Query: 493 ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLL 542
           AS   D+    +  + E    A +AD +I V GL  + E E+           DR ++ L
Sbjct: 577 ASLKFDIGIKKEINYKEMADKAAEADVIIFVGGLSSSLEGEEMPVDLPGFRKGDRTNIDL 636

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
           P  Q  ++ ++ +T K PV+ VL  G  L + + EA++ + +I+   YPG+ G  A+A++
Sbjct: 637 PQVQEEMLKALKKTGK-PVVFVLCSGSTLALPW-EAEN-LDAIIEAWYPGQQGGTAVADV 693

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NP GRLP+T+Y  S       D +M          R+YR++ G  ++ FGHGLSY
Sbjct: 694 LFGDYNPAGRLPLTFYASSSDLPDFEDYDMS--------NRTYRYFKGRPLFPFGHGLSY 745

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y               KA +DK IL + G  L                 + I + N
Sbjct: 746 TTFDYG--------------KAKADKKIL-RAGEGLT----------------LTIPLKN 774

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G + G  VV ++ R P   +G P K L  F R+   A  ++++ F + P       N  
Sbjct: 775 IGKLSGDEVVQVYLRNPGDKEG-PIKTLRAFRRISLEAGQAEDVLFEL-PVSTFEWFNPA 832

Query: 783 GRR--ILPLGNHVLMVG 797
             R  +LP G + L+ G
Sbjct: 833 TNRMEVLP-GKYELLYG 848


>gi|154493932|ref|ZP_02033252.1| hypothetical protein PARMER_03276 [Parabacteroides merdae ATCC
           43184]
 gi|154086192|gb|EDN85237.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Parabacteroides merdae ATCC 43184]
          Length = 955

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 203/734 (27%), Positives = 332/734 (45%), Gaps = 113/734 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR+L   +G     
Sbjct: 161 RLGIPT-DFTNEGIRGVES-----------YIATNFPTQLGLGHTWNRNLVHKVGYITGR 208

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           E R +      G T  +AP +++ RD RWGR +E  GE P +V+   +E  K  Q ++  
Sbjct: 209 EGRLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMAKGMQTDH-- 260

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH IAY   K G       +  ++ 
Sbjct: 261 -----------------------------QVAATSKHYIAYSNNKGGREGMARVDPQMSP 291

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ E     P++  I++     +M SYN  +G P       L  + R E+GF+GY+ SD 
Sbjct: 292 REVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGEFGFRGYVVSDS 351

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  +F         ++S    + AG++I C        +   +  I +G +    ID 
Sbjct: 352 DAVEYLFSKHGTAADMKESVLQSVLAGLNIRCTFRSPDSYVLPLRELIAEGAIPMSTIDD 411

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+  +GLF+  P +    +   +  C +E++ +AL A+++ +VLLKN    LP
Sbjct: 412 RVRDILRVKFLVGLFD-HPYQIDLKETDKEVNC-AENQLVALQASKESLVLLKNQDAVLP 469

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
           L+ N +S +A+ GP  +  +     Y  +     ++L G+   V   +   +  GC  V 
Sbjct: 470 LDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTDVLFTKGCDLVD 529

Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
            N                +   +AV  AKK+D  +VV G       E++ R SL LPG+Q
Sbjct: 530 ANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENKSRSSLDLPGRQ 589

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
           + L+ +V  T K PV+LVL  G PL +++A  D  + +IL   YPG  G  A+A+ +FGD
Sbjct: 590 LDLLQAVVATGK-PVVLVLINGRPLSINWA--DKYVPAILEAWYPGSQGGTAIADALFGD 646

Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           +NPGG+L +T +P++  ++P N     N + D  R              +Y FG+GLSYT
Sbjct: 647 YNPGGKLTVT-FPKTVGQIPFNFPTKPNAQVDGGRNKGLDGNMSRVNGPLYPFGYGLSYT 705

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            + Y      S+++I  ++                    + +V   T     V+  VTN 
Sbjct: 706 TFEY------SDISIQPAI--------------------VTQVQPVT-----VRCKVTNT 734

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G   G  VV L+ R    S  T EK L+GFDR+H     +KE++F ++P   L + N   
Sbjct: 735 GKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTFTIEP-RDLQLLNSDN 793

Query: 784 RRILPLGNHVLMVG 797
             ++  G+  +MVG
Sbjct: 794 HWVVEPGDFKVMVG 807


>gi|423313129|ref|ZP_17291065.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
           CL09T03C04]
 gi|392686343|gb|EIY79649.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
           CL09T03C04]
          Length = 864

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 162/462 (35%), Positives = 232/462 (50%), Gaps = 64/462 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + ++SLS   RA+ L+  LTL+EK+  + DN+  + RLGI  Y WW+E+LHG+A +G   
Sbjct: 24  YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AASF       I +AV+ EARA      A        GLT W
Sbjct: 81  -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
            P +NI+RDPRWGRG ET GEDP + S   V  VK  Q    N K D             
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCMDANQKYDK------------ 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          + AC KH   +   +W   +R+ FNA  I  +D  +T+  PF + 
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++ K   +MC+YN++ G P C    L  Q  R +WG+ G + SDC A+   +  + + K
Sbjct: 224 VKEAKVKEVMCAYNRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGH-K 282

Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           TH D    SAA VL +G D+ CG+      +SA  KG + EKDID ++  L   +  LG 
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + DP K ++ K+    VC++EH  L+LD AR+ + LL N    LPL +    ++A++GP
Sbjct: 341 MD-DPDKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
             N+     G Y G P    +LL G+ + +    K  Y  GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 98/325 (30%), Positives = 148/325 (45%), Gaps = 66/325 (20%)

Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
            Y SG  D   N D GF E V I       K AD VI   G+  + E E+          
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFRK 629

Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR  + LP  Q  L+ ++    K+ VI V   G P  ++         +IL   YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
           G KA+AE++FGD+NP GRLP+T+Y  + T++P   D NM         GR+YR++ G  +
Sbjct: 687 GGKAVAEVLFGDYNPAGRLPVTFY-RNITQLPNFEDYNMT--------GRTYRYFKGDPL 737

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           + FG+GLSYT ++Y        + +  ++K G    I+                      
Sbjct: 738 FPFGYGLSYTTFNY------GNIKLEQTIKVGETAKII---------------------- 769

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
               + VTN G+ DG  VV ++ +  + ++G P K L  F RV   A  +  +   + P 
Sbjct: 770 ----VPVTNTGNRDGEEVVQVYLKKQEDAEG-PVKTLRAFKRVQIPAGKTVNVELELTP- 823

Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
           +QL   +     +  L GN  +MVG
Sbjct: 824 KQLEWWDTQTNTMRTLAGNFDIMVG 848


>gi|383113364|ref|ZP_09934136.1| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
 gi|382948729|gb|EFS32368.2| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
          Length = 850

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 153/447 (34%), Positives = 232/447 (51%), Gaps = 57/447 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N +  +  R   L+S LT++EKI  L   +  IPRLGI  Y   +E+LHG+   G   
Sbjct: 27  YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 83

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  AA++N  L   + + ++ EARA +N    G          LT
Sbjct: 84  -------RFTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 136

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FVK  QG+              + R 
Sbjct: 137 FWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGD--------------DPRY 182

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
           LK             + +  KH  A + E     +R+  N  I+E+   + + P F  C+
Sbjct: 183 LK-------------IVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++GKA+ IM +YN +N VP  L   L +K  R +WGF+GY+ SDC   + +     Y KT
Sbjct: 226 KEGKAASIMTAYNALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKT 285

Query: 343 HEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   ++AG+D+ CG  +   +  +A  +  V + DID A  ++ + +++LGLF+G 
Sbjct: 286 KEAAATLSIQAGLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGT 345

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
            R   Y ++ P  + + EH+++ALDAAR+ IVLLKN    LPLN N V S+A++G  +N 
Sbjct: 346 ER-NPYTRISPSVIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INA 402

Query: 462 ISQMGGGYTGIP-CSPKSLLRGLEAYV 487
                G Y+G P   P S+L+G++  V
Sbjct: 403 GKCEFGDYSGAPVVDPVSILQGIKDRV 429



 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 84/303 (27%), Positives = 148/303 (48%), Gaps = 49/303 (16%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +  ++ + V+ V G++ + E E +DR  + LP  Q   +  + + +   +I+VL 
Sbjct: 591 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 649

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L V++   D  I +I+   YPGE G  A+A+++FGD+NP GRLP+T+Y +S  ++P
Sbjct: 650 AGSSLAVNWM--DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY-KSLDELP 706

Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
             +D ++         GR+Y+++ G  +Y FG+GLSY+++ Y  L           +K G
Sbjct: 707 AFDDYDITQ-------GRTYKYFKGDVLYPFGYGLSYSSFKYSDL----------KVKDG 749

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
           ++                   T   S R        N G   G  V  ++ R+P+     
Sbjct: 750 AN-------------------TVSVSFRLK------NTGKRKGDEVAQVYVRIPETGGVV 784

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH-GRRILPLGNHVLMVGELRHSLT 804
           P K+L GF R+   +  S+ +   +D  EQL   +   G+ I+P G   +M+G     + 
Sbjct: 785 PIKELKGFRRIPLKSGESRVVEIELDK-EQLRYWDAGLGQFIVPQGAFDIMIGASSKDIR 843

Query: 805 IET 807
           ++T
Sbjct: 844 LQT 846


>gi|150003731|ref|YP_001298475.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|319640047|ref|ZP_07994774.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
 gi|345517061|ref|ZP_08796539.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|149932155|gb|ABR38853.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
 gi|254833833|gb|EET14142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|317388325|gb|EFV69177.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
          Length = 864

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 162/462 (35%), Positives = 232/462 (50%), Gaps = 64/462 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + ++SLS   RA+ L+  LTL+EK+  + DN+  + RLGI  Y WW+E+LHG+A +G   
Sbjct: 24  YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AASF       I +AV+ EARA      A        GLT W
Sbjct: 81  -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
            P +NI+RDPRWGRG ET GEDP + S   V  VK  Q    N K D             
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCMDANQKYDK------------ 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          + AC KH   +   +W   +R+ FNA  I  +D  +T+  PF + 
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++ K   +MC+YN++ G P C    L  Q  R +WG+ G + SDC A+   +  + + K
Sbjct: 224 VKEAKVKEVMCAYNRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGH-K 282

Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           TH D    SAA VL +G D+ CG+      +SA  KG + EKDID ++  L   +  LG 
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + DP K ++ K+    VC++EH  L+LD AR+ + LL N    LPL +    ++A++GP
Sbjct: 341 MD-DPDKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
             N+     G Y G P    +LL G+ + +    K  Y  GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/325 (29%), Positives = 147/325 (45%), Gaps = 66/325 (20%)

Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
            Y SG  D   N D GF E V I       K AD VI   G+  + E E+          
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFRK 629

Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR  + LP  Q  L+ ++    K+ VI V   G P  ++         +IL   YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQV 653
           G KA AE++FGD+NP GRLP+T+Y  + T++P   D NM         GR+YR++ G  +
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFY-RNITQLPDFEDYNMT--------GRTYRYFKGDPL 737

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           + FG+GLSYT ++Y        + +  ++K G    I+                      
Sbjct: 738 FPFGYGLSYTTFNY------GNIKLEQTIKVGETAKII---------------------- 769

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
               + VTN G+ DG  VV ++ +  + ++G P K L  F RV   A  +  +   + P 
Sbjct: 770 ----VPVTNTGNRDGEEVVQVYLKKQEDAEG-PVKTLRAFKRVQIPAGKTVNVELELTP- 823

Query: 774 EQLSIANKHGRRILPL-GNHVLMVG 797
           +QL   +     +  + GN  +MVG
Sbjct: 824 KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|261880245|ref|ZP_06006672.1| beta-glucosidase [Prevotella bergensis DSM 17361]
 gi|270333079|gb|EFA43865.1| beta-glucosidase [Prevotella bergensis DSM 17361]
          Length = 854

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 151/463 (32%), Positives = 238/463 (51%), Gaps = 55/463 (11%)

Query: 46  PPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHG 105
           P     +P+ NT LS   RA  L S LTL+EK + + + + AIPRLGIP +EWWSE+LHG
Sbjct: 17  PMKAQQFPYQNTDLSPKERAADLCSRLTLEEKSKIMQNGSPAIPRLGIPQFEWWSEALHG 76

Query: 106 IASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ------ 159
           I  NG            T FP  +  A+S++ +L   +  AV+ E R      +      
Sbjct: 77  IGRNG----------FATVFPITMGMASSWDDALLQKVFDAVSDEGRVKAQQAKRSGTIK 126

Query: 160 --AGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFG 217
              GL+FW PNINIFRDPRWGRGQET GEDP + S   +  V+  QG    SD       
Sbjct: 127 RYQGLSFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQG---PSDSK----- 178

Query: 218 FREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQ 276
           +R+                  L AC KH   +   +W   +R++FN   + E+D  +T+ 
Sbjct: 179 YRK------------------LLACAKHFAVHSGPEW---NRHTFNVEDLPERDLWETYL 217

Query: 277 PPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE 335
           P F++ ++QG  + +MC+Y +++G P C      +   RNEW ++G + SDC AV   ++
Sbjct: 218 PAFKALVQQGDVAEVMCAYQRIDGQPCCGNNRFLKSILRNEWNYQGMVVSDCWAVPDFWK 277

Query: 336 --YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
             +   +     ++A  + +G D+ CG+    +   A+  G ++E D+D ++  L   + 
Sbjct: 278 KGHHEVSPDATHASAKAVLSGTDVECGSD-YSNLPEAVRAGIIKEADVDVSVRRLLEARF 336

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
            LG F+ D     + K+    V +  HK+LALD AR+ +VLL+N+   LPL ++    + 
Sbjct: 337 ALGDFDPD-ELVPWTKISESVVASKAHKQLALDMARKSMVLLQNN-DILPLKRSG-QKIV 393

Query: 454 IIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
           ++G    + + M G Y+G P    ++L+GL+    +  +  GC
Sbjct: 394 VVGANAIDSTMMWGNYSGYPTQTVTILQGLQTKSDQVTFIPGC 436



 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 91/334 (27%), Positives = 148/334 (44%), Gaps = 65/334 (19%)

Query: 486 YVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR---------- 535
           YV  T  A    D+     +   + VR    AD VI V G+    E E+           
Sbjct: 568 YVQVTSLAMIKFDITHTGLSTPQDIVRKTAGADVVIFVGGISPRLEGEEMEVSDPGFKGG 627

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR ++ LP  Q  ++ +++   +R V +  +G     ++      ++ +IL   YPGE G
Sbjct: 628 DRTTIELPQAQREVIKALSEAGRRIVFVNCSGSA---IALTPESQRVDAILQAWYPGEQG 684

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A+++FGD+NP G+LP+T+Y         ND  +      +  GR+YR++  T ++ 
Sbjct: 685 GTAVADVLFGDYNPSGKLPVTFY--------KNDAQLPDFLDYRMAGRTYRYFKETPLFP 736

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSYT ++                  G  + I  Q                      
Sbjct: 737 FGYGLSYTQFTI-----------------GQPRYINNQ---------------------- 757

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           VQ+SV+N G  DG  VV ++ R    + G P K L GF RV      +K++S  + P E 
Sbjct: 758 VQVSVSNTGKRDGDEVVQVYIRRTDDAAG-PIKTLRGFQRVSLKVGETKQVSVSL-PRES 815

Query: 776 LSI--ANKHGRRILPLGNHVLMVGELRHSLTIET 807
                A+ +  R++P GN+ +MVG    +  ++T
Sbjct: 816 FEWWDASSNTMRVIP-GNYEVMVGSSSMAKNLKT 848


>gi|255693561|ref|ZP_05417236.1| periplasmic beta-glucosidase(Cellobiase) [Bacteroides finegoldii
           DSM 17565]
 gi|260620626|gb|EEX43497.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 800

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 209/735 (28%), Positives = 341/735 (46%), Gaps = 127/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAD 198

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  +AP ++I +DPRWGR  E+ GEDP +V     + +   Q E   
Sbjct: 199 EAKAL------GYTNIYAPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQSEG-- 250

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGY+ SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
           +AV  +      T T E+ AA V+ AG++I          +   + AI +GKV    +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAISEGKVSLHTLDQ 400

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVC-TSEHKKLALDAARQGIVLLKND 438
            +  +  V+  +GLF+    GD R+       P+ V   + H+ +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PEVVVHNAAHQDVSMRAALESIVLLKNE 453

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
           K+ LPL+K + S +A+IGP    + ++   Y     S K++ +G++ Y+  ++  YA GC
Sbjct: 454 KEMLPLSK-SFSKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNAEVRYAKGC 512

Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       ++VP ++   A  +EAV +AK +D  I+V G +     E+  R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMINEAVELAKASDVAILVLGGNEKTVREEFSRTNLDL 572

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PV+LV+  G    +++  A+  + +I+   +PGE    A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NPGGRL +T +P+S  ++P          S+       +      +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVAGVLYPFGYGLSY 682

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L IS  +  G+ +NI                 SCT         V N
Sbjct: 683 TTFGY------SDLKISKPV-IGAQENI---------------TLSCT---------VKN 711

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T +K L GF+R+H      + ISF + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTISFTLTP-QDLGLWDKN 770

Query: 783 GRRILPLGNHVLMVG 797
            +  +  G+  +MVG
Sbjct: 771 NQFTVEPGSFSVMVG 785


>gi|294777452|ref|ZP_06742903.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
 gi|294448520|gb|EFG17069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
          Length = 864

 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 162/462 (35%), Positives = 232/462 (50%), Gaps = 64/462 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + ++SLS   RA+ L+  LTL+EK+  + DN+  + RLGI  Y WW+E+LHG+A +G   
Sbjct: 24  YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AASF       I +AV+ EARA      A        GLT W
Sbjct: 81  -------LATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
            P +NI+RDPRWGRG ET GEDP + S   V  VK  Q    N K D             
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQCMDANQKYDK------------ 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          + AC KH   +   +W   +R+ FNA  I  +D  +T+  PF + 
Sbjct: 182 ---------------IHACAKHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEAL 223

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++ K   +MC+YN++ G P C    L  Q  R +WG+ G + SDC A+   +  + + K
Sbjct: 224 VKEAKVKEVMCAYNRLEGDPCCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGH-K 282

Query: 342 THED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           TH D    SAA VL +G D+ CG+      +SA  KG + EKDID ++  L   +  LG 
Sbjct: 283 THPDAESASAAAVL-SGTDLECGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGE 340

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + DP K ++ K+    VC++EH  L+LD AR+ + LL N    LPL +    ++A++GP
Sbjct: 341 MD-DPDKVEWTKIPYSVVCSAEHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGP 398

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
             N+     G Y G P    +LL G+ + +    K  Y  GC
Sbjct: 399 NANDSVMQWGNYNGTPKHTITLLEGIRSAMGENDKLIYEQGC 440



 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 95/324 (29%), Positives = 143/324 (44%), Gaps = 64/324 (19%)

Query: 491 HYASGCHDVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR--------- 535
            Y SG  D   N D GF E V I       K AD VI   G+  + E E+          
Sbjct: 572 QYFSG--DAQLNFDLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFRK 629

Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR  + LP  Q  L+ ++    K+ VI V   G P  ++         +IL   YPG++
Sbjct: 630 GDRTDIELPAVQRELIKALCDAGKK-VIFVNFSGSP--IAMEPETKYCQAILQAWYPGQS 686

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
           G KA AE++FGD+NP GRLP+T+Y  +       D NM         GR+YR++ G  ++
Sbjct: 687 GGKAAAEVLFGDYNPAGRLPVTFYRNTAQLPDFEDYNMT--------GRTYRYFKGDPLF 738

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT ++Y        + +  ++K G    I+                       
Sbjct: 739 PFGYGLSYTTFNY------GNIKLEQTIKVGETAKII----------------------- 769

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
              + VTN G+ DG  VV ++ +  + ++G P K L  F RV   A  +  +   + P +
Sbjct: 770 ---VPVTNTGNRDGEEVVQVYLKKQEDAEG-PVKTLRAFKRVQIPAGKTVNVELELTP-K 824

Query: 775 QLSIANKHGRRILPL-GNHVLMVG 797
           QL   +     +  + GN  +MVG
Sbjct: 825 QLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|299149391|ref|ZP_07042448.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298512578|gb|EFI36470.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 853

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 152/444 (34%), Positives = 231/444 (52%), Gaps = 57/444 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N +  +  R   L+S LT++EKI  L   +  IPRLGI  Y   +E+LHG+   G   
Sbjct: 30  YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 86

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  AA++N  L   + + ++ EARA +N    G          LT
Sbjct: 87  -------RFTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 139

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FVK  QG+              + R 
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGD--------------DPRY 185

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
           LK             + +  KH  A + E     +R+  N  I+E+   + + P F  C+
Sbjct: 186 LK-------------IVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 228

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++GKA+ IM +YN +N VP  L   L +K  R +WGF+GY+ SDC   + +     Y KT
Sbjct: 229 KEGKAASIMTAYNALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKT 288

Query: 343 HEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   ++AG+D+ CG  +   +  +A  +  V + DID A  ++ + +++LGLF+G 
Sbjct: 289 KEAAATLSIQAGLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGT 348

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
            R   Y ++ P  + + EH+++ALDAAR+ IVLLKN    LPLN N V S+A++G  +N 
Sbjct: 349 ER-NPYTRISPSVIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INA 405

Query: 462 ISQMGGGYTGIP-CSPKSLLRGLE 484
                G Y+G P   P S+L+G++
Sbjct: 406 GKCEFGDYSGAPVVDPVSILQGIK 429



 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 83/303 (27%), Positives = 146/303 (48%), Gaps = 49/303 (16%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +  ++ + V+ V G++ + E E +DR  + LP  Q   +  + + +   +I+VL 
Sbjct: 594 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 652

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L V++   D  I +I+   YPGE G  A+A+++FGD+NP GRLP+T+Y +S  ++P
Sbjct: 653 AGSSLAVNWM--DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY-KSLDELP 709

Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
             +D ++         GR+Y+++ G  +Y FG+GLSY+++ Y            + LK  
Sbjct: 710 AFDDYDITK-------GRTYKYFKGDVLYPFGYGLSYSSFKY------------SDLKVK 750

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
              N +                        V   + N G   G  V  ++ R+P+     
Sbjct: 751 DGANTIS-----------------------VSFRLKNTGKRKGDEVAQVYVRIPETGGVV 787

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH-GRRILPLGNHVLMVGELRHSLT 804
           P K+L GF R+   +  S+ +   +D  EQL   +   G+ I+P G   +MVG     + 
Sbjct: 788 PIKELKGFRRIPLKSGESRVVDIELDK-EQLRYWDAGLGQFIVPQGAFDIMVGASSKDIR 846

Query: 805 IET 807
           ++T
Sbjct: 847 LQT 849


>gi|262405837|ref|ZP_06082387.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294647798|ref|ZP_06725350.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294806192|ref|ZP_06765039.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345510348|ref|ZP_08789916.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|262356712|gb|EEZ05802.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292636706|gb|EFF55172.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294446448|gb|EFG15068.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345454537|gb|EEO48843.2| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
          Length = 800

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 209/738 (28%), Positives = 338/738 (45%), Gaps = 133/738 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  +  +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 151 RLGIPV-DLTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAN 198

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E+ GEDP +      + +   Q E   
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLAGELGKQMILGLQSEG-- 250

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGY+ SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG-------TCMLRHTQSAIDKGKVQEKD 380
           +AV  +      T T E+ AA V+ AG++I             LRH   AI++GKV    
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRH---AINEGKVSLHT 397

Query: 381 IDRALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLL 435
           +D+ +  +  V+  +GLF+    GD R+       P+ V  ++ HK +++ AA + +VLL
Sbjct: 398 LDQRVSEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESVVLL 450

Query: 436 KNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYA 493
           KN  + LPL+KN    +A+IGP    + ++   Y     S K++ +G++ Y+  S+  YA
Sbjct: 451 KNKNQMLPLSKN-FKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYA 509

Query: 494 SGC------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVS 539
            GC            ++VP ++   A  HEAV +AK +D  I+V G +     E+  R +
Sbjct: 510 KGCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTN 569

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L L G+Q  L+ +V  T K PV+LV+  G    +++  A+  + +I+   +PGE    A+
Sbjct: 570 LDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAI 626

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           A+++FGD+NPGGRL +T +P+S  ++P          S+       +      +Y FG+G
Sbjct: 627 AKVLFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYG 679

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT + Y      S+L IS  +  G  +NI                 SCT         
Sbjct: 680 LSYTTFGY------SDLKISKPV-IGPQENI---------------TLSCT--------- 708

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           V N G   G  VV L+ R    S  T +K L GF+R+H      + ++F + P + L + 
Sbjct: 709 VKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLW 767

Query: 780 NKHGRRILPLGNHVLMVG 797
           +K+ +  +  G+  +MVG
Sbjct: 768 DKNNQFTVEPGSFSVMVG 785


>gi|398386387|ref|ZP_10544389.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
 gi|397718418|gb|EJK79007.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
          Length = 791

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 226/736 (30%), Positives = 332/736 (45%), Gaps = 124/736 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  E LHG A+ G            TSFPQ +  A+S++ ++   +   +  
Sbjct: 138 RLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPTMLRQVNQVIGR 185

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E RA     +      +P ++I RDPRWGR +ET GEDP +V    V  V+  QGE    
Sbjct: 186 EIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQGEG--- 237

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                    R + +  G            + A  KHL  +   + G        A ++E+
Sbjct: 238 ---------RSRLLRPGH-----------VFATLKHLTGHGQPESGT---NVGPAPVSER 274

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-CLRGDLFQKARNEWGFKGYITSDCD 328
           +  + F PPF   +++     +M SYN+++GVP+   R  L    R EWGF+G + SD  
Sbjct: 275 ELRENFFPPFEQVVKRTGIEAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYS 334

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSA-IDKGKVQEKDIDRALLN 387
           AV  +    +     E++A   L AG+D +    +   T    + +GKV E  +D A+  
Sbjct: 335 AVDQLMSIHHIAANLEEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRR 394

Query: 388 LFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS--EHKKLALDAARQGIVLLKNDKKFLPLN 445
           +  ++ R GLF     +  Y         T+  E + LA  AA++ I LLKND   LPL 
Sbjct: 395 MLELKFRAGLF-----ENPYADANAAAAITNNDEARALARTAAQRSITLLKNDG-MLPLK 448

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC------ 496
                ++A+IGP    ++++GG Y G P    S+L G++A V   +   +A G       
Sbjct: 449 PEG--TIAVIGPSAA-VARLGG-YYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEND 504

Query: 497 ---HDVPCNSDAG-----FHEAVRIAKKADFVIVVAGLDLTQETEDR-------DRVSLL 541
               D    SD         +AV  A+  D +I+  G D  Q + +        DR SL 
Sbjct: 505 DWWEDKVVKSDPAENRKLIAQAVEAARNVDRIILTLG-DTEQSSREGWADNHLGDRPSLD 563

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
           L G+Q  L  ++    K P+ +VL  G P   S  +   Q ++IL   Y GE G  A+A+
Sbjct: 564 LVGEQQELFDALKALGK-PITVVLINGRP--ASTVKVSEQANAILEGWYLGEQGGNAVAD 620

Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
           I+FGD NPGG+LP+T  P S  ++PM   NM+  +      R Y F T   +Y FG GLS
Sbjct: 621 ILFGDVNPGGKLPVT-VPRSVGQLPMF-YNMKPSAR-----RGYLFDTTDPLYPFGFGLS 673

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YTN+S   LSAP                       RL    I      T  +  V + V 
Sbjct: 674 YTNFS---LSAP-----------------------RLSATKI-----GTGGKTSVSVDVR 702

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G  +G  VV L+ R    S   P K+L GF RV      S+ ++F V P E L + N 
Sbjct: 703 NTGAREGDEVVQLYIRDKVSSVTRPVKELKGFQRVTLKPGESRTVTFTVGP-EALQMWND 761

Query: 782 HGRRILPLGNHVLMVG 797
             RR++  G+  +M G
Sbjct: 762 QMRRVVEPGDFEIMTG 777


>gi|336417083|ref|ZP_08597412.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
           3_8_47FAA]
 gi|335936708|gb|EGM98626.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
           3_8_47FAA]
          Length = 850

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 152/444 (34%), Positives = 231/444 (52%), Gaps = 57/444 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N +  +  R   L+S LT++EKI  L   +  IPRLGI  Y   +E+LHG+   G   
Sbjct: 27  YKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 83

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  AA++N  L   + + ++ EARA +N    G          LT
Sbjct: 84  -------RFTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 136

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FVK  QG+              + R 
Sbjct: 137 FWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGD--------------DPRY 182

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
           LK             + +  KH  A + E     +R+  N  I+E+   + + P F  C+
Sbjct: 183 LK-------------IVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++GKA+ IM +YN +N VP  L   L +K  R +WGF+GY+ SDC   + +     Y KT
Sbjct: 226 KEGKAASIMTAYNALNDVPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVKT 285

Query: 343 HEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   ++AG+D+ CG  +   +  +A  +  V + DID A  ++ + +++LGLF+G 
Sbjct: 286 KEAAATLSIQAGLDLECGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGT 345

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
            R   Y ++ P  + + EH+++ALDAAR+ IVLLKN    LPLN N V S+A++G  +N 
Sbjct: 346 ER-NPYTRISPSVIGSKEHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INA 402

Query: 462 ISQMGGGYTGIP-CSPKSLLRGLE 484
                G Y+G P   P S+L+G++
Sbjct: 403 GKCEFGDYSGAPVVDPVSILQGIK 426



 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 86/303 (28%), Positives = 148/303 (48%), Gaps = 49/303 (16%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +  ++ + V+ V G++ + E E +DR  + LP  Q   +  + + +   +I+VL 
Sbjct: 591 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 649

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L V++   D  I +I+   YPGE G  A+A+++FGD+NP GRLP+T+Y +S  ++P
Sbjct: 650 AGSSLAVNWM--DEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYY-KSLDELP 706

Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
             +D ++         GR+Y+++ G  +Y FG+GLSY+++ Y  L           +K G
Sbjct: 707 AFDDYDITK-------GRTYKYFKGDVLYPFGYGLSYSSFKYSDL----------KVKDG 749

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
           ++                   T   S R        N G   G  V  ++ R+P+     
Sbjct: 750 AN-------------------TVSVSFRLK------NTGKRKGDEVAQVYVRIPETGGVV 784

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH-GRRILPLGNHVLMVGELRHSLT 804
           P K+L GF R+   +  S+ +   +D  EQL   +   GR I+P G   +MVG     + 
Sbjct: 785 PIKELKGFRRIPLKSGESRVVEIELDK-EQLRYWDAGLGRFIVPQGAFDIMVGASSKDIR 843

Query: 805 IET 807
           ++T
Sbjct: 844 LQT 846


>gi|300773468|ref|ZP_07083337.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300759639|gb|EFK56466.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 777

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 199/730 (27%), Positives = 330/730 (45%), Gaps = 127/730 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  A+++N +L   + + VA 
Sbjct: 126 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGQASTWNPALLQKMSATVAK 173

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R      Q     + P +++ RDPRW R +E+ GEDP++    A   V+         
Sbjct: 174 EVRQ-----QGAHISYGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVR--------- 219

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
              G+G G              +  D        KH +AY + + G+       A + E+
Sbjct: 220 ---GLGSG--------------NLSDPFATIPTLKHFVAYGIPEGGHNGS---AASVGER 259

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           +  + F PPF+S +  G A  +M +YN V+G+P      L     R EW F G+  SD  
Sbjct: 260 ELREYFLPPFQSAVAAG-AKSVMAAYNSVDGIPCSSNKFLLTDILRKEWSFNGFTVSDLG 318

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I       K H+ +A   ++AG+D + G         A+ +G+VQE  ID+A+  +
Sbjct: 319 SIEGIKGSHRVAKDHKQAAILAIEAGLDADLGGNAYVRLIEAVKQGEVQENSIDQAVSRI 378

Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
            +++  +GLF       K  K    +V T  +  L+   AR+ IVLL+N    LPL K+ 
Sbjct: 379 LALKFEMGLFEKPFVDVKTAK---KEVKTESNIALSRQVARESIVLLENKNNILPLRKDV 435

Query: 449 VSSLAIIGPLVNNISQMGGGYTG-IPCSPKSLLR-GLEAYVSKTH--YASGCHDVPCNSD 504
              +AI+GP  +N+  M G YT   P    + +R  + A + K    Y  GC  +   ++
Sbjct: 436 --KIAIVGPNADNVYNMLGDYTAPQPDGAVTTVRQAISARLPKAQVSYVKGCA-IRDTTN 492

Query: 505 AGFHEAVRIAKKADFVIVVAG-----------------------LDLTQETEDRDRVSLL 541
           +    AV  A+++D ++ V G                       +   +  E  DR +L 
Sbjct: 493 SDIPAAVTAARQSDIIVAVVGGSSARDFKTEYISTGAAVASDKSVSDMESGEGFDRSTLD 552

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
           L G+QM L+ ++ +T K P++++   G PL++++A   +Q  ++L   YPG+ G  A+A+
Sbjct: 553 LLGRQMELLKALKQTGK-PLVVIYIQGRPLNMNWAA--TQADALLCAWYPGQEGGHAIAD 609

Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
           ++FGD+NP G++P++  P S  ++P++  N ++    +Y   +      T +Y FG+G S
Sbjct: 610 VLFGDYNPAGKMPLS-VPRSVGQIPVH-YNRKSSLDHRYVEEA-----ATPLYAFGYGKS 662

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           Y+++ YK L    E T                     DY              HV  ++T
Sbjct: 663 YSDFEYKDLKIQKENT---------------------DY--------------HVSFTLT 687

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G  DG  V  L+ R    S   P +QL  F+R+H     SK +SF V      S+ N 
Sbjct: 688 NTGKYDGDEVPQLYIRNQYASVSQPVQQLKHFERIHLKTGESKTVSF-VLTAGDFSVINT 746

Query: 782 HGRRILPLGN 791
             +++L  G+
Sbjct: 747 QMKKVLEPGS 756


>gi|431797765|ref|YP_007224669.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
 gi|430788530|gb|AGA78659.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
          Length = 799

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 210/754 (27%), Positives = 348/754 (46%), Gaps = 134/754 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP      E  HG  + G            T FP  +  A+++N +L   + +A+A+
Sbjct: 137 RLGIPMM-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPALIQEMAAAIAL 184

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EAR      Q G   + P +++ R+PRW R +ET GEDP + S      V  FQGE+  S
Sbjct: 185 EARL-----QGGHIGYGPVLDLAREPRWSRVEETYGEDPYINSQMGRAMVSGFQGESIAS 239

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
               I                          +  KH  AY + + G+      +  + ++
Sbjct: 240 GKNVI--------------------------STLKHFTAYGVPEGGHNGT---SVSVGQR 270

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           +  +++ PPF++ + +G A  +M +YN ++GVP    G L     R++WGF G++ SD  
Sbjct: 271 ELHESYLPPFKAAVAEG-ALSVMTAYNSIDGVPCTSNGHLLNDVLRDDWGFNGFVVSDLG 329

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLN 387
           +++ +    + T+T E +A   + AG+D + G     ++  +A+  G V ++ +D A+  
Sbjct: 330 SISGLRGSHHVTETAEGAAQLAINAGVDSDLGGYGFGKNLLAAVQAGGVSQEVLDEAVRR 389

Query: 388 LFSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
           +  V+  +GLF     DP K +        V +++H  LA   AR+ +VLLKN+   LPL
Sbjct: 390 VLKVKFDMGLFENPYVDPSKAE------SLVRSAKHIALARKVARESVVLLKNENDLLPL 443

Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVSK---THYASGCHDV 499
            K  V+S+A+IGP  +N     G YT    +    ++L G++  V K    +Y  GC  +
Sbjct: 444 RKK-VNSIAVIGPNADNTYNQLGDYTAPQPNENVVTVLEGIKNKVGKDVRVNYVKGCA-I 501

Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLTQETEDR-------------------- 535
              + +   +A  +A ++D  +VV G     D   E E+                     
Sbjct: 502 RDTTQSEIGKAASLAARSDVAVVVLGGSSARDFDTEYEETAAAKVSEAEEGQVISDMESG 561

Query: 536 ---DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPG 592
              DR++L L G Q+ LV +V  T   PV++VL  G PL++++   D  + +I+   YPG
Sbjct: 562 EGFDRMTLDLLGDQLKLVQAVQATGT-PVVVVLIKGRPLNLNWI--DEHVPAIVDAWYPG 618

Query: 593 EAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ 652
           + G  A+A+++FGD+NP GRL ++  P S  ++P+   N R      Y   S        
Sbjct: 619 QEGGNAIADVLFGDYNPSGRLTIS-VPRSVGQLPVF-YNYRNPKRHDYVEGS-----AEP 671

Query: 653 VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSL 712
           +Y FGHGLSY ++ Y        L ++AS  AGS                        ++
Sbjct: 672 LYAFGHGLSYADFEY------DNLEVTASGMAGS-----------------------PTV 702

Query: 713 RFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
           R H Q+S  N  +VDG  VV L+ R    S   P  +L  F++V   A  S +I+F +  
Sbjct: 703 RVHFQVS--NISNVDGEEVVQLYVRDEAGSTVRPLLELKRFEKVMVPAGESSKITFMLT- 759

Query: 773 CEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            E L +  +    ++  G+  ++VG     + +E
Sbjct: 760 AEDLQVLGQDMNWLVEPGSFQVLVGRSSRDIRLE 793


>gi|383115541|ref|ZP_09936297.1| hypothetical protein BSGG_2589 [Bacteroides sp. D2]
 gi|313695054|gb|EFS31889.1| hypothetical protein BSGG_2589 [Bacteroides sp. D2]
          Length = 800

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 209/735 (28%), Positives = 341/735 (46%), Gaps = 127/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAD 198

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  +AP ++I +DPRWGR  E+ GEDP +V     + +   Q E   
Sbjct: 199 EAKAL------GYTNIYAPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQSEG-- 250

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGY+ SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
           +AV  +      T T E+ AA V+ AG++I          +   + AI +GKV    +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAISEGKVSLHTLDQ 400

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVC-TSEHKKLALDAARQGIVLLKND 438
            +  +  V+  +GLF+    GD R+       P+ V   + H+ +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PEVVVHNAAHQDVSMRAALESIVLLKNE 453

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
           K+ LPL+K + S +A+IGP    + ++   Y     S K++ +G++ Y+  ++  YA GC
Sbjct: 454 KEMLPLSK-SFSKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNAEVRYAKGC 512

Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       ++VP ++   A  +EAV +AK +D  I+V G +     E+  R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMINEAVELAKASDVAILVLGGNEKTVREEFSRTNLDL 572

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PV+LV+  G    +++  A+  + +I+   +PGE    A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NPGGRL +T +P+S  ++P          S+       +      +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVAGVLYPFGYGLSY 682

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T ++Y      S L IS  +  G+ +NI                 SCT         V N
Sbjct: 683 TTFNY------SNLKISKPV-IGAQENI---------------TLSCT---------VKN 711

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T +K L GF+R+H      + ISF + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTISFTLTP-QDLGLWDKN 770

Query: 783 GRRILPLGNHVLMVG 797
            +  +  G+  +MVG
Sbjct: 771 NQFTVEPGSFSVMVG 785


>gi|393786770|ref|ZP_10374902.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
           CL02T12C05]
 gi|392658005|gb|EIY51635.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
           CL02T12C05]
          Length = 864

 Score =  251 bits (640), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 161/457 (35%), Positives = 229/457 (50%), Gaps = 62/457 (13%)

Query: 49  FDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIAS 108
           +   P+ + +L+   RA  L+  LT++EK+  + +N+  I RLGI  YEWW+E+LHG+A 
Sbjct: 25  YSQLPYQDPNLTPEQRATDLLQRLTIEEKVSLMQNNSPGILRLGIKPYEWWNEALHGVAR 84

Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA----MYNLGQ----A 160
            G            T FPQ +  AASF+ +L   + +A++ EARA       LGQ     
Sbjct: 85  AG----------LATVFPQTIGMAASFDDTLIYEVFNAISDEARAKNRHFNTLGQYKRYQ 134

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GLT W PNINIFRDPRWGRGQET GEDP + S   V  VK  QG                
Sbjct: 135 GLTMWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQG---------------- 178

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPF 279
                      D      L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F
Sbjct: 179 ----------PDSARYNKLHACAKHFAVHSGPEW---NRHSFNAENIIPRDLWETYLPAF 225

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQN 338
           ++ +++     +MC+YN+  G P C    L  Q  RNEWGFKG + SDC A++  +  + 
Sbjct: 226 KTLVQEADVKEVMCAYNRFEGDPCCGSNRLLTQILRNEWGFKGIVVSDCGAISDFWGTKK 285

Query: 339 YTKTHEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
           +  TH D+A    +A   G D+ CG+   + T+ AI  G + EK I+ ++  L   +  L
Sbjct: 286 HN-THPDAAHASAEAVLNGTDLECGSNYRKLTE-AIKAGIISEKQINVSVKRLLKARFEL 343

Query: 396 G-LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAI 454
           G + N  P    Y       V + +H+ LAL  A + + LL+N  K LPL+K A   +AI
Sbjct: 344 GEMENIHPWTLPYSI-----VDSPKHRCLALKMAHETMTLLQNKGKVLPLDKQA--RIAI 396

Query: 455 IGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH 491
           IGP  N+     G Y G P    +LL      +  +H
Sbjct: 397 IGPNANDSVMQWGNYNGTPSHTSTLLSAFRKRLPISH 433



 Score = 90.1 bits (222), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 63/263 (23%), Positives = 114/263 (43%), Gaps = 55/263 (20%)

Query: 515 KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           K  D +I   G+  + E E+           DR  +  P  Q  ++ ++    K+ +++ 
Sbjct: 599 KDIDIIIFAGGISPSLEGEEMNVSATGFKGGDRTDIEFPAVQRKVLAALKEAGKKVILVN 658

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
            +G     ++         +IL   YPGE G  A+  ++FGD+NP GRLP+T+Y +S  +
Sbjct: 659 FSGSA---MALTPETKSCDAILQAWYPGEEGGMAIVNVLFGDYNPAGRLPITFY-KSIDQ 714

Query: 625 VP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
           +P   + +M+        GR+YR+     ++ FG+GLSYT +++                
Sbjct: 715 LPDFENYSMK--------GRTYRYMQEEPLFPFGYGLSYTTFAF---------------- 750

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
                            +HI++ +     +  + I + N GD DG  VV ++ +     +
Sbjct: 751 ---------------GKIHINKNSLSAGEKVTLHIPIKNIGDRDGVEVVQIYIQRQADKE 795

Query: 744 GTPEKQLIGFDRVHTVAKGSKEI 766
           G P K L  F RV      ++E+
Sbjct: 796 G-PVKTLRAFKRVEIPKGKTQEV 817


>gi|427385138|ref|ZP_18881643.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727306|gb|EKU90166.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
           12058]
          Length = 863

 Score =  250 bits (639), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 155/458 (33%), Positives = 229/458 (50%), Gaps = 57/458 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N +L+   RA  L+  LTL+EK   + + + AIPRLGI AY+WW+E+LHG+      
Sbjct: 25  PYKNPALTPEERAADLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGV------ 78

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-------NLGQ-AGLTF 164
               G     T FPQ +   ASFN  L  ++ +AV+ EARA          L +  GLT 
Sbjct: 79  ----GRAGLATVFPQAIGMGASFNNDLLYDVFTAVSDEARAKTAEFSKEGGLKRYQGLTM 134

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP +     +  V+  QG                    
Sbjct: 135 WTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE------------------ 176

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
              G + D+     L AC KH   +   +W   +R+SF+A  +  +D  +T+ P F+  +
Sbjct: 177 ---GGKYDK-----LHACAKHFAVHSGPEW---NRHSFDAENVDPRDLWETYLPAFKDLV 225

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN+  G P C    L  Q  R+EW + G I SDC A+   F    +   
Sbjct: 226 QKAHVKEVMCAYNRFEGEPCCGSNRLLVQILRDEWAYDGIIVSDCWAINDFFNKGAHETE 285

Query: 343 --HEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
              E ++A  +  G D+ CG       Q A+  G + EK ID +L  L   +  LG  + 
Sbjct: 286 PDKEHASAKAVLTGTDVECGESYASLPQ-AVKAGLIDEKKIDISLKRLMKARFELGEMD- 343

Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
           +P    + ++    V + EH++LAL  AR+ +VLL+N++  LPLNK+    +A++GP  N
Sbjct: 344 NPELVSWAQIPYSVVDSKEHRELALRMARESLVLLQNNQNVLPLNKSL--KVAVVGPNAN 401

Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
           +     G Y G P    +LL G+  Y+ +    Y  GC
Sbjct: 402 DSVMQWGNYNGFPGHTVTLLEGIRQYLPEAQLIYEPGC 439



 Score =  103 bits (256), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 72/267 (26%), Positives = 119/267 (44%), Gaps = 54/267 (20%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
            ++ ++  K AD ++   G+    E E+           DR ++ LP  Q  L+  + + 
Sbjct: 590 LNKTIQRVKDADIIVFAGGISPAVEGEEMRVTIPGFKGGDRETIELPSIQSRLLAELKKA 649

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K+ V +  +G     ++         +IL   YPG+AG  A+A ++FGD+NP GRLP+T
Sbjct: 650 GKKVVFVNFSGSA---IALTPETKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVT 706

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y  +       D +M+        GR+YR+     ++ FGHGLSYT + Y       + 
Sbjct: 707 FYKSTSQLPDFEDYSMK--------GRTYRYMAEAPLFPFGHGLSYTTFRY------GDA 752

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
           ++S       ++ IL                          I V+N G+ DG  VV ++ 
Sbjct: 753 SLSTQEVKEGEQAIL-------------------------TIPVSNTGERDGEEVVQVYL 787

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGS 763
           R P   +G P   L  F RV+ +AKG+
Sbjct: 788 RRPGDKEG-PSHALRAFKRVN-IAKGT 812


>gi|224537265|ref|ZP_03677804.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521119|gb|EEF90224.1| hypothetical protein BACCELL_02142 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 885

 Score =  250 bits (639), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 155/461 (33%), Positives = 238/461 (51%), Gaps = 57/461 (12%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           + P+ N  LS + RAK L+  LTL+EK   + D++ AIPRLGI  + WWSE+LHG+A+ G
Sbjct: 20  TLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG 79

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---------LGQAG 161
                     +VT FP+ +  AASFN  L  +I +AV+ E RA +N         +    
Sbjct: 80  ----------NVTVFPEPVGMAASFNDKLVFDIFNAVSDEMRAKHNERVRNGLEDVRFHS 129

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFRE 220
           L+ W PN+NIFRDPRWGRGQET GEDP + S   +  VK  QG EN K         +R+
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEK---------YRK 180

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPF 279
                             L AC KH   +   +W   SR++ N   ++ +D  +T+ P F
Sbjct: 181 ------------------LLACAKHYAVHSGPEW---SRHTANLNNVSPRDLWETYLPAF 219

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           ++ +++     +MC+Y +++  P C    L Q+  R+EWGFK  + SDC A+A  +    
Sbjct: 220 KALVQKADVREVMCAYQRLDDDPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHK 279

Query: 339 YTKTHEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
            +     +A     AG D+ CG     +    A+ KG + E+++D+ +L L   +  LG 
Sbjct: 280 SSSDAVHAAVKGTMAGTDVECGYGYAYQKLPEAVSKGLITEEEVDKHVLRLMEGRFELGE 339

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + DP    + K+    V    HK L+L+ +RQ + LL+N    LPL+K ++  +A+IGP
Sbjct: 340 MD-DPSLVNWTKIPMSVVNCKAHKDLSLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGP 397

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
             ++   + G Y G P    ++L G ++ + K    Y  GC
Sbjct: 398 NADDKPMLWGNYNGTPNQTITILDGFKSKLKKNQIVYMKGC 438



 Score = 82.0 bits (201), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/270 (25%), Positives = 115/270 (42%), Gaps = 54/270 (20%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
           + E++   K  D V+ V G+    E E+           DR  + LP  Q + + ++   
Sbjct: 590 YSESISKLKGIDVVVFVGGISPQLEGEEMPVNIPGFKGGDRTDIELPAVQRNFLKALKDA 649

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K+ V +  +G     ++         +IL   Y GE G  A+A+++FGD+NP G+LP+T
Sbjct: 650 GKQVVFVNCSGSS---MALLPETESCDAILQAWYGGELGGYAVADVLFGDYNPSGKLPVT 706

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y  +       D +M+        GR+YR Y    ++ FG GLSYT++           
Sbjct: 707 FYKSTKQLPDYEDYSMK--------GRTYR-YMSDPLFPFGFGLSYTDF----------- 746

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
              A   A  +K  L          H DE  + T       + V+N G   G+ VV ++ 
Sbjct: 747 ---AVGTASCNKTQL----------HTDESLTLT-------VPVSNTGKRSGTEVVQVYI 786

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
           R    + G P K L  + RV   A   +++
Sbjct: 787 RKTDDADG-PLKSLKAYARVELAAGAKQDV 815


>gi|224535242|ref|ZP_03675781.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224523140|gb|EEF92245.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 864

 Score =  250 bits (639), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 155/462 (33%), Positives = 240/462 (51%), Gaps = 58/462 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N  LS S RA  L+  +TL+EK+ Q+ + + AI RLGIPAY+WW+E+LHG+A  G  
Sbjct: 23  PYKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHGVARAG-- 80

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN--------LGQAGLTF 164
                     T FPQ +  AA+F+          V+ EARA Y+         G  GLTF
Sbjct: 81  --------KATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERDGYKGLTF 132

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PNINI+RDPRWGRG ET GEDP + S   +  VK  QG       GG G         
Sbjct: 133 WTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG-------GGTG--------- 176

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
                + D+       AC KH   +   +W   +R+SF+A  I+++D  +T+   F++ +
Sbjct: 177 -----KYDKA-----HACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLSAFKTLV 223

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++GK   +MC+YN+  G P C    L  +  R +WG+   + SDC A+   F Y N+ +T
Sbjct: 224 KEGKVKEVMCAYNRFEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGD-FYYPNHHET 282

Query: 343 H---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H     ++A  + +G D+ CG         A+ KG + E+ I+ ++  L   + +LG+F+
Sbjct: 283 HPTAAAASADAVVSGTDLECGGSY-SSLNEAVRKGLISEEKINESVFRLLRARFQLGMFD 341

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            D     + ++    V + EH   AL+ AR+ +VLL N    LPL+K ++  +A++GP  
Sbjct: 342 DDALVS-WSEIPYSVVESKEHVTKALEMARKSMVLLTNKNHTLPLSK-SIRKVAVLGPNA 399

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSK--THYASGCHDV 499
           N+   +   Y G P    ++L G+++ + +   +Y  GC  V
Sbjct: 400 NDSVMLWANYNGFPTKSVTILEGIKSKLPEGTVYYEKGCDYV 441



 Score =  117 bits (292), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 144/318 (45%), Gaps = 59/318 (18%)

Query: 493 ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLL 542
           AS   D+    +  + E    A +AD +I V GL  T E E+           DR ++ L
Sbjct: 576 ASLKFDIGIKKEINYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDL 635

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
           P  Q  ++ ++ +T K PVI VL  G  L + + EA++ + +IL   YPG+ G  A+A++
Sbjct: 636 PHVQAEMLKALKKTGK-PVIFVLCSGSTLALPW-EAEN-LDAILEAWYPGQQGGTAVADV 692

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NP GRLP+T+Y  S       D +M          R+YR++ G  ++ FGHGLSY
Sbjct: 693 LFGDYNPAGRLPLTFYASSNDLPDFEDYDMS--------NRTYRYFKGKALFPFGHGLSY 744

Query: 663 TNYSYKFLSAPSELTISASLKAGSDK-NILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           T + Y               KA  DK N+    G  L                   I + 
Sbjct: 745 TIFDYG--------------KAKVDKQNVRAGEGMTL------------------TIPLK 772

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G +DG  V+ ++ R P   +G P K L  F RV   A  ++ I   + P       N 
Sbjct: 773 NTGKLDGDEVIQVYLRNPADKEG-PIKTLRAFRRVSLPAGQTENIRIEL-PASTFECFNP 830

Query: 782 HGRR--ILPLGNHVLMVG 797
              R  ILP G + L+ G
Sbjct: 831 STNRMEILP-GKYELLYG 847


>gi|383119099|ref|ZP_09939838.1| hypothetical protein BSHG_1822 [Bacteroides sp. 3_2_5]
 gi|251946311|gb|EES86688.1| hypothetical protein BSHG_1822 [Bacteroides sp. 3_2_5]
          Length = 859

 Score =  250 bits (639), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 227/828 (27%), Positives = 357/828 (43%), Gaps = 168/828 (20%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSD-NASAI--------------------- 88
           ++ + N SL +  R + L+S +TL+EKI Q+   +A +I                     
Sbjct: 23  NFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNYG 82

Query: 89  --------------------------PRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSV 122
                                     PRLGIP +   +ESLHG   +G            
Sbjct: 83  FIEGITLPGKECLTLMNEVQKYMREKPRLGIPVFTL-TESLHGSVHDGS----------- 130

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQE 182
           T FPQ +   ++FN  L   + SA+A E  A     Q       P I++ RD RWGR +E
Sbjct: 131 TIFPQAIALGSTFNPILAYEMTSAIAKELSA-----QGITQSLTPVIDVCRDLRWGRVEE 185

Query: 183 TPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSAC 242
             GEDP +VS   V  V+                G+ + +V                S  
Sbjct: 186 CFGEDPFLVSRMGVSQVR----------------GYLDNQV----------------SPM 213

Query: 243 CKHLIAYDLEKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
            KH  A+   + G N +  S      +++    +   F + +++ K   +M SYN  N  
Sbjct: 214 IKHFGAHGTPQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNE 269

Query: 302 PACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG 360
           P      L  +  R+ W F+GY+ SD  A+  +  +    +   ++A   L AG+D    
Sbjct: 270 PNSSSHYLMTELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEAS 329

Query: 361 TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSE 419
                  Q  ++ G +  K ID+A+  + + +  +GLF    P +  Y K+    V    
Sbjct: 330 DNSYAELQQLVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPA 385

Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK-- 477
           H  LA   A + IVLL+N+   LPL  N + S+A+IGP  + +    G YT    +    
Sbjct: 386 HVSLARKIAEESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGV 443

Query: 478 SLLRGLEAYVSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LD 527
           +LL  L+  VS     +YA GC D+  +  +GF EAV +AKK+D  IVV G        D
Sbjct: 444 TLLEALKERVSNQLTLNYAKGC-DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARD 502

Query: 528 LTQET--EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
            +  T  E  D   L L G Q  LV ++  T K PVI+VL  G P  +S+ + +  I  I
Sbjct: 503 YSNATCGEGFDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGI 559

Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSR 638
           +   YPGE G  ALA+++ G  NP G+L  + +P+S       +  +P +    R+  S+
Sbjct: 560 VVQWYPGEQGGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSK 618

Query: 639 QYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRL 698
             PG+ Y F +   ++ FGHGLSYT++ Y                       L  T S+ 
Sbjct: 619 NKPGKDYVFSSPKALWAFGHGLSYTDFEY-----------------------LSATTSKE 655

Query: 699 DYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHT 758
           DY       +C  +   V I++ N GD DG  V  ++ R    S   P ++L GF++V  
Sbjct: 656 DY-------ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLI 707

Query: 759 VAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
               +K++   + P  +L++ NK  ++++  G   L +G     + I+
Sbjct: 708 KKGETKQVIIKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754


>gi|224538282|ref|ZP_03678821.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520107|gb|EEF89212.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 864

 Score =  250 bits (639), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 158/455 (34%), Positives = 234/455 (51%), Gaps = 60/455 (13%)

Query: 50  DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
           + +P+ +TSL+   RA  L+  LTL+EK   + + + AIPRL I AY WW+E+LHG+A  
Sbjct: 25  EKFPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLART 84

Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-------NLGQ-AG 161
           G            T FPQ +   ASF+ SL   + +AV+ EARA         NL +   
Sbjct: 85  G----------LATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQA 134

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LT W PN+NIFRDPRWGRGQET GEDP + S   V  V   QG                 
Sbjct: 135 LTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQG----------------- 177

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFR 280
                     D      L AC KH   +   +W   +R+SFNA  I+ +D  +T+ P F+
Sbjct: 178 ---------PDTARYNKLHACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFK 225

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNY 339
           + +++ K   +MC+YN+  G P C    L  Q  R+EWGF G + SDC AV+  ++ + +
Sbjct: 226 TLVQEAKVKEVMCAYNRFEGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKH 285

Query: 340 TKTHEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
            +TH D+A+    A   G D+ CG    +    A+  G + E  ID ++  L   +  LG
Sbjct: 286 -ETHPDAASASADAVLNGTDVECGNS-YKSLPDAVKAGLITENQIDISVKRLLKARFELG 343

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
             + +   G    +  D V + +H++LAL  AR+ + LL+N+   LPL+K A   +A+IG
Sbjct: 344 EMDENVWTG----ISSDVVDSPKHRQLALQMARETMTLLQNNNNILPLSKQA--KIALIG 397

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH 491
           P  N+     G Y G+P    +LL G++ Y+  ++
Sbjct: 398 PNANDSVMQWGNYNGLPSHTITLLEGMQRYLPTSN 432



 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 154/363 (42%), Gaps = 64/363 (17%)

Query: 452 LAIIGPLVNN----ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           L I G LV      + QM   YT    S KS    L  Y      A+   D+  N+    
Sbjct: 533 LYINGKLVKEDEGAVKQMTSIYTLKAESGKSY-DILLKYRHFDKTANLSFDMGVNAQIDV 591

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
              +   K  D VI   G+    E E+           DR  + LP  Q  +V ++    
Sbjct: 592 KGLLERIKDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAG 651

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           KR V +  +G     ++         +IL   YPG+AG +A+AE++FGD+NP G+LP+T+
Sbjct: 652 KRIVFVNFSGAA---IALEPESQNCEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLTF 708

Query: 618 YPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           Y  +  ++P   D NM         GR+YR+   T ++ FGHGLSYT + Y         
Sbjct: 709 Y-RNLAQIPDFEDYNM--------TGRTYRYMKETPLFPFGHGLSYTTFKY--------- 750

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                LK   DK    Q                     ++ I VTN G  DG  VV ++ 
Sbjct: 751 ---GKLKMNDDKIAAGQN-------------------LNLAIPVTNTGSRDGDEVVQVYL 788

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR--RILPLGNHVL 794
           +    ++G P K L  F RV   A  + E+ F +D   QL   ++     R+ P GN+ +
Sbjct: 789 KKMDDTEG-PVKTLRAFKRVRIPAGKTVEVKFSLDDT-QLEWWDEQSNTMRVCP-GNYTV 845

Query: 795 MVG 797
           M+G
Sbjct: 846 MIG 848


>gi|299146513|ref|ZP_07039581.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298517004|gb|EFI40885.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 736

 Score =  250 bits (639), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 216/750 (28%), Positives = 335/750 (44%), Gaps = 135/750 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L   +G  +A 
Sbjct: 83  RLGIPMF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPELVKEVGQVIAK 130

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R+     Q G   + P +++ RDPRW R +ET GEDP++        V    G N   
Sbjct: 131 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLGGGNLSQ 185

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
               I                          A  KH +AY + + G    Y   A +  +
Sbjct: 186 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 216

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           D    F PPFR  I+ G A  +M SYN ++G+P      L  K  RNEW F+G++ SD  
Sbjct: 217 DLHQNFLPPFRKAIDAG-ALSVMTSYNSIDGIPCTSNHYLLTKLLRNEWKFRGFVVSDLY 275

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I E      T E++A   + AG+D++ G     +   A+  G++ +  ID A+  +
Sbjct: 276 SIEGIHESHFVAPTKENAAIQSVMAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRV 335

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP      K+    V   EH +LA   A+  I LLKN+   LPL+
Sbjct: 336 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 389

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
           K  ++ +A+IGP  +N   M G YT      + K++L G+   +S  +  Y  GC   D 
Sbjct: 390 K-MINKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 448

Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
             N      +A+  A++++ VIVV G     D     +ET                E  D
Sbjct: 449 TVNE---IEQAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 505

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
           R SL L G+Q  L+ S+ +T K P+I+V   G PL+ ++A       ++L   YPG+ G 
Sbjct: 506 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 562

Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
            A+A+++FGD+NP GRLP++  P S  ++P+   N +A  +  Y   S      + +Y F
Sbjct: 563 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNQKAPRNHDYVEVS-----SSPLYSF 615

Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
           G+G+SYT + Y                  SD  ++Q++              C    F V
Sbjct: 616 GYGMSYTTFEY------------------SDLQVVQKSA------------RC----FEV 641

Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
              V N G  DG  V  L+ R    S   P KQL  F+R H      K+++F V   E  
Sbjct: 642 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 700

Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            + N   ++++  GN  LM+G   + + ++
Sbjct: 701 FLVNYTLKKVVESGNFHLMIGAASNDIRLQ 730


>gi|374374543|ref|ZP_09632202.1| Beta-glucosidase [Niabella soli DSM 19437]
 gi|373233985|gb|EHP53779.1| Beta-glucosidase [Niabella soli DSM 19437]
          Length = 799

 Score =  250 bits (639), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 218/780 (27%), Positives = 346/780 (44%), Gaps = 145/780 (18%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           SYPF N + +I+   K  I                    RLGIP  ++ +E +HG+  + 
Sbjct: 116 SYPFSNHAEAINKIQKWFIEET-----------------RLGIPV-DFTNEGIHGLNQD- 156

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT-FWAPNI 169
                       T+FP  +   +++N+ L   +G  +  EA+A+      G T  +AP +
Sbjct: 157 ----------HATAFPAPIGIGSTWNKELVHQMGQIIGREAKAL------GYTNVYAPIL 200

Query: 170 NIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGE 229
           ++ RD RWGR  ET GEDP +V+                   GGI               
Sbjct: 201 DVARDQRWGRVVETYGEDPFLVAGLGTALA------------GGI--------------- 233

Query: 230 ESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKAS 289
                 E  +++  KH   Y + K G       +  +  ++ +  F  PFR  I+     
Sbjct: 234 -----QENGVASTLKHFAVYSVPKGGRDGNARTDPHVAPREMQQLFLYPFRKVIQNVHPL 288

Query: 290 CIMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAA 348
            +M SYN  +G+P       L Q  R ++GF GY+ SD  AV  ++E  +  K ++++  
Sbjct: 289 GVMSSYNDWDGMPVTASNYFLTQLLRQQFGFDGYVVSDSRAVEFVYEKHHVAKDYKEAVK 348

Query: 349 GVLKAGMDI----NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRK 404
            V++AG+++    N  +  +   +  I +G +  + +++ +  + SV+ RLGLF  D   
Sbjct: 349 MVMEAGLNVRTEFNAPSNFILPLRQLIKEGGLSMETLNQRVGEVLSVKFRLGLF--DAPY 406

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN--- 461
            K  K     V T   + +AL   R+ +VLLKNDK  LPL+     ++ + GPL +    
Sbjct: 407 VKDPKAADKIVATEASEAVALQMNRESLVLLKNDKNILPLSLGQYRNILVTGPLADEKEH 466

Query: 462 -ISQMGGGYTGIPCSPK--SLLRGLEAYVSK---THYASGCH------------DVPCNS 503
            IS+ G      P + K  S+L G+  + +K    +Y  GC             D P   
Sbjct: 467 AISRYG------PSNKKVISVLEGIRHFAAKKATINYIKGCEAADATWPESEIIDTPPTP 520

Query: 504 D--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
              A  ++AV  AK+ D +I V G +  Q  E   R  L LPG+Q+ L+  + +T K P+
Sbjct: 521 QEIAEMNKAVEAAKQNDIIIAVMGENDKQVGESLSRTGLNLPGRQLRLLEELKKTGK-PM 579

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           +L+L  G PL +++   +  + +IL   +PG AG  A+AE IFG +NPGG+L  T +P++
Sbjct: 580 VLILINGQPLTINWE--NRYLDAILETWFPGPAGGTAVAEAIFGAYNPGGKLTTT-FPKT 636

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV----YGFGHGLSYTNYSYKFLSAPSELT 677
             ++ MN    +  S    PG     Y  T V    Y FG+GLSYT + Y          
Sbjct: 637 TGQIEMN-FPFKPASHAGQPGDGPNGYGKTAVVGPLYPFGYGLSYTTFEY---------- 685

Query: 678 ISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFAR 737
             A+LK                   +D   + T     V + V N G V G  VV L+ +
Sbjct: 686 --ANLK-------------------VDPEKARTQADISVAVDVKNTGKVKGDEVVQLYVK 724

Query: 738 VPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
               S  T E  L GF+RV      +K + F + P + LSI +K+   ++  G   +MVG
Sbjct: 725 QLVSSVTTYESILRGFERVSLSPGETKTVHFKLTP-DDLSILDKNMNFVVEPGAFDIMVG 783


>gi|423221630|ref|ZP_17208100.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392645869|gb|EIY39591.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 864

 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 158/455 (34%), Positives = 234/455 (51%), Gaps = 60/455 (13%)

Query: 50  DSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASN 109
           + +P+ +TSL+   RA  L+  LTL+EK   + + + AIPRL I AY WW+E+LHG+A  
Sbjct: 25  EKFPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLART 84

Query: 110 GPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-------NLGQ-AG 161
           G            T FPQ +   ASF+ SL   + +AV+ EARA         NL +   
Sbjct: 85  G----------LATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQA 134

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LT W PN+NIFRDPRWGRGQET GEDP + S   V  V   QG                 
Sbjct: 135 LTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQG----------------- 177

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFR 280
                     D      L AC KH   +   +W   +R+SFNA  I+ +D  +T+ P F+
Sbjct: 178 ---------PDTARYNKLHACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFK 225

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNY 339
           + +++ K   +MC+YN+  G P C    L  Q  R+EWGF G + SDC AV+  ++ + +
Sbjct: 226 TLVQEAKVKEVMCAYNRFEGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKH 285

Query: 340 TKTHEDSAAGVLKA---GMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
            +TH D+A+    A   G D+ CG    +    A+  G + E  ID ++  L   +  LG
Sbjct: 286 -ETHPDAASASADAVLNGTDVECGNS-YKSLPDAVKAGLITENQIDISVKRLLKARFELG 343

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
             + +   G    +  D V + +H++LAL  AR+ + LL+N+   LPL+K A   +A+IG
Sbjct: 344 EMDENVWTG----ISSDVVDSPKHRQLALQMARETMTLLQNNNNILPLSKQA--KIALIG 397

Query: 457 PLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH 491
           P  N+     G Y G+P    +LL G++ Y+  ++
Sbjct: 398 PNANDSVMQWGNYNGLPSHTITLLEGMQRYLPTSN 432



 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 107/363 (29%), Positives = 155/363 (42%), Gaps = 64/363 (17%)

Query: 452 LAIIGPLVNN----ISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGF 507
           L I G LV      + QM   YT    S KS    L  Y      A+   D+  N+    
Sbjct: 533 LYINGKLVKEDEGAVKQMTSIYTLKAESGKSY-DILLKYRHFDKTANLSFDMGVNAQIDV 591

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTS 557
              +   K  D VI   G+    E E+           DR  + LP  Q  +V ++    
Sbjct: 592 KGLLERIKDVDVVIFAGGISPALEGEEMPVDAAGFRGGDRTEIELPAVQRRVVEALKTAG 651

Query: 558 KRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTW 617
           KR V +  +G     ++         +IL   YPG+AG +A+AE++FGD+NP G+LP+T+
Sbjct: 652 KRIVFVNFSGAA---IALEPESLNCEAILQAWYPGQAGGQAVAEVLFGDYNPAGKLPLTF 708

Query: 618 YPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           Y  +  ++P   D NM         GR+YR+   T ++ FGHGLSYT + Y         
Sbjct: 709 Y-RNLAQIPDFEDYNM--------TGRTYRYMKETPLFPFGHGLSYTTFKY--------- 750

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                LK   DK      G  L+ V                I VTN G  DG  VV ++ 
Sbjct: 751 ---GKLKMNDDK---IAAGQNLNLV----------------IPVTNTGSRDGDEVVQVYL 788

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR--RILPLGNHVL 794
           +    ++G P K L  F RV   A  + E+ F +D   QL   ++     R+ P GN+ +
Sbjct: 789 KKMDDTEG-PVKTLRAFKRVRIPAGKTVEVKFSLDDT-QLEWWDEQSNTMRVCP-GNYTV 845

Query: 795 MVG 797
           M+G
Sbjct: 846 MIG 848


>gi|333380551|ref|ZP_08472242.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826546|gb|EGJ99375.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 854

 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 155/437 (35%), Positives = 227/437 (51%), Gaps = 57/437 (13%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           R   L+S LT++EKI  L   +  IPRL IP Y   +ESLHG+   G            T
Sbjct: 40  RIMDLLSRLTIEEKISLLRATSPGIPRLQIPKYYHGNESLHGVVRPG----------RFT 89

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNINIFR 173
            FPQ +  A+ +N  L   I +A++ EAR  +N  + G          LTFW+P +N+ R
Sbjct: 90  VFPQAIGLASMWNPELHHKIATAISDEARGRWNELEQGKLQTQRFTDLLTFWSPTVNMAR 149

Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           DPRWGR  ET GEDP +       FV+  QG+              + R LK        
Sbjct: 150 DPRWGRTPETYGEDPYLSGILGTAFVRGLQGD--------------DPRYLK-------- 187

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
                + +  KH  A + E     +R+  N  I+E+   + + P F  C++ GK++ IM 
Sbjct: 188 -----IVSTPKHFAANNEEH----NRFVCNPQISERQLREYYFPAFEMCVKDGKSASIMS 238

Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
           +YN +N VP      L  K  R++WGF GY+ SDC   + +     Y KT E +A   +K
Sbjct: 239 AYNAINDVPCTANPWLLTKVLRHDWGFNGYVVSDCGGPSLLVSAMKYVKTKEAAATLSIK 298

Query: 353 AGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
           AG+D+ CG    ++   +A ++  V   DID A   +   ++ LGLF+ DP    Y K+ 
Sbjct: 299 AGLDLECGDDVYMQPLLNAYNQYMVSRADIDTAAYRVLRARMHLGLFD-DPDLNPYNKIS 357

Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
           P  V ++EHK+LAL+AARQ IVLLKN+ + LPLN   V S+A++G  +N  +   G Y+G
Sbjct: 358 PSVVGSAEHKQLALEAARQSIVLLKNNNRTLPLNPKKVKSIAVVG--INAGNSEFGDYSG 415

Query: 472 IPC-SPKSLLRGLEAYV 487
           IP  +P S+L+G++  V
Sbjct: 416 IPANAPVSILQGIKDKV 432



 Score =  119 bits (299), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 83/295 (28%), Positives = 141/295 (47%), Gaps = 53/295 (17%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +  ++ + VI V G++ T E E +DR  + LP  Q   +  + + +   +++VL 
Sbjct: 594 YGEAGKAVRECEQVIAVLGINKTIEREGQDRYDIHLPADQEEFIREIYKVNPN-IVVVLV 652

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L +++   D  + +I+   YPGE G  A+AE++FG++NPGGRLP+T+Y  S  ++P
Sbjct: 653 AGSSLAINWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGEYNPGGRLPVTYY-NSLEEIP 709

Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
             +D ++         GR+Y+++ G  +Y FG+GLSYT ++YK L               
Sbjct: 710 SFDDYDITK-------GRTYQYFKGKPLYPFGYGLSYTTFAYKNLQ-------------- 748

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
                +   G+ +                 V   + N G +DG  V  ++ ++P      
Sbjct: 749 -----INDNGNNI----------------KVSFELKNTGRMDGDEVSQVYVKIPSSGIFM 787

Query: 746 PEKQLIGFDRVHTVAKGSK---EISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           P K+L GF R  T+ KG+    EI+   D       A +    I P G +  M+G
Sbjct: 788 PIKELKGFQR-STLKKGATKNVEINIRKDLLRYWDDATE--TFITPKGEYEFMIG 839


>gi|389794138|ref|ZP_10197297.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
 gi|388432924|gb|EIL89908.1| beta-glucosidase [Rhodanobacter fulvus Jip2]
          Length = 869

 Score =  250 bits (638), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 156/436 (35%), Positives = 228/436 (52%), Gaps = 65/436 (14%)

Query: 68  LISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQ 127
           L++ +TL EK+ QL + A AIPRLGIP Y WW+ESLHG     P           T+FP+
Sbjct: 25  LVARMTLAEKLPQLLNVAPAIPRLGIPRYNWWTESLHGALGTLP----------TTNFPE 74

Query: 128 VLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---------AGLTFWAPNINIFRDPRWG 178
            +   ASF+  L   + +A++ E R ++ L +          GL  W+PN+NIFRDPRWG
Sbjct: 75  PIGLGASFDAPLVKRVAAAISTEVRGLHTLARQTGRLGRIGTGLDTWSPNLNIFRDPRWG 134

Query: 179 RGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELM 238
           RGQET GEDP + +   V FV+  QG N                        +DR + + 
Sbjct: 135 RGQETYGEDPFLTARMGVAFVEGMQGPN------------------------ADRPEVI- 169

Query: 239 LSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQV 298
             A  KH   ++  +    +R+  N  ++  D  DT+ P FR+ I + KA  IMC+YN++
Sbjct: 170 --ATPKHFAVHNGPE---STRHEANVFVSRHDLVDTYLPAFRAAITEAKAGSIMCAYNRI 224

Query: 299 NGVPACLRGDLFQK--ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMD 356
           +G PAC   DL  K   R  WGF GY+ SDCDAV  I ++ +Y      + A  ++AG+D
Sbjct: 225 DGQPACAN-DLLLKHYLRGAWGFTGYVVSDCDAVKDISDHHHYAPDAATAVAAAVRAGVD 283

Query: 357 INCGTCMLRHT-------QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGK 409
             C    L  T         A+ +  +   DIDRAL+ LF+ + R+G   G P  G    
Sbjct: 284 NECNGATLSDTAGLEDRYSEALQRELISVADIDRALVRLFAARYRVGDLAGLP--GTAPA 341

Query: 410 LGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY 469
           + P  + ++ + +LALDAA++ +VLLKND   LPL       +A+IGPL +    + G Y
Sbjct: 342 VPPSAIGSAANSQLALDAAQKTLVLLKND-GVLPLKAGV--KVAVIGPLGDATRVLRGNY 398

Query: 470 TG-IPCSPKSLLRGLE 484
           +  +  +P S+L GL+
Sbjct: 399 SSPLSGTPISVLDGLQ 414



 Score =  129 bits (325), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 89/296 (30%), Positives = 143/296 (48%), Gaps = 54/296 (18%)

Query: 512 RIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPV 561
           R A +AD ++ V GL    E E+           D+ +L LP  Q++++   AR + +P+
Sbjct: 593 RAAAQADVLVAVVGLTSDLEAEETSVTVPGFAGGDKTTLDLPADQIAMLKQ-ARATGKPL 651

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           I+V   G P+D+++++ ++  ++IL   YPG++G  A+A ++ G  +PGGRLP+T+Y   
Sbjct: 652 IVVAMNGSPIDLAWSKQNA--AAILEAWYPGQSGGLAIANVLSGKTDPGGRLPLTFYRSV 709

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
               P  D  M         GR+YR++ G  VY FG GLSYT ++Y  L           
Sbjct: 710 ADLPPFGDYRME--------GRTYRYFKGEPVYPFGFGLSYTEFTYGPL----------- 750

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
                  N+    G   + +H+  V             VTNAG  +G  V  L+ R P +
Sbjct: 751 -------NVTPAAGGAANGLHVSTV-------------VTNAGQREGGDVAQLYLRFPAI 790

Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             G P+  L GF RV      ++ +SF + P   LS     G+R +  G++ + VG
Sbjct: 791 P-GAPQVALRGFQRVALKPGETRTVSFDLSP-RDLSAVTADGQREVMAGDYTVSVG 844


>gi|423295566|ref|ZP_17273693.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
           CL03T12C18]
 gi|392672275|gb|EIY65744.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
           CL03T12C18]
          Length = 782

 Score =  250 bits (638), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 216/750 (28%), Positives = 335/750 (44%), Gaps = 135/750 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L   +G  +A 
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R+     Q G   + P +++ RDPRW R +ET GEDP++        V    G N   
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGASMVDGLGGGNLSQ 231

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
               I                          A  KH +AY + + G    Y   A +  +
Sbjct: 232 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 262

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
           D    F PPFR  I+ G  S +M SYN ++G+P      L  Q  RNEW F+G++ SD  
Sbjct: 263 DLHQNFLPPFRKAIDAGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLY 321

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I E      T E++A   + AG+D++ G     +   A+  G++ +  ID A+  +
Sbjct: 322 SIEGIHESHFVAPTKENAAIQSVMAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRV 381

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP      K+    V   EH +LA   A+  I LLKN+   LPL+
Sbjct: 382 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
           K  ++ +A+IGP  +N   M G YT      + K++L G+   +S  +  Y  GC   D 
Sbjct: 436 K-MINKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 494

Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
             N      +A+  A++++ VIVV G     D     +ET                E  D
Sbjct: 495 TVNE---IEQAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
           R SL L G+Q  L+ S+ +T K P+I+V   G PL+ ++A       ++L   YPG+ G 
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 608

Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
            A+A+++FGD+NP GRLP++  P S  ++P+   N +A  +  Y   S      + +Y F
Sbjct: 609 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNQKAPRNHDYVEVS-----SSPLYSF 661

Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
           G+G+SYT + Y                  SD  ++Q++              C    F V
Sbjct: 662 GYGMSYTTFEY------------------SDLQVVQKSA------------RC----FEV 687

Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
              V N G  DG  V  L+ R    S   P KQL  F+R H      K+++F V   E  
Sbjct: 688 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 746

Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            + N   ++++  GN  LM+G   + + ++
Sbjct: 747 FLVNYTLKKVVESGNFHLMIGAASNDIRLQ 776


>gi|182413194|ref|YP_001818260.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
 gi|177840408|gb|ACB74660.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
           PB90-1]
          Length = 859

 Score =  250 bits (638), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 217/754 (28%), Positives = 326/754 (43%), Gaps = 114/754 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+             S  TSFP  L  A++++ +L   IG     
Sbjct: 161 RLGIPV-DFTNEGIRGLLH-----------SKATSFPAELAVASTWDPALVREIGRITGR 208

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EARA+      G T  ++P +++ RDPRWGR  ET GEDP +V    VE V+  Q E+  
Sbjct: 209 EARAL------GYTNIYSPVLDLARDPRWGRTIETYGEDPFLVGTLGVEQVRGLQAEH-- 260

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + +  KH   Y + K G       +   T 
Sbjct: 261 ------------------------------VVSTLKHFAVYSIPKGGRDGEARTDPQATW 290

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
           ++ +  F  PFR  I +  A  +M SYN  +GVP  + G    L +  R +WGF+GY+ S
Sbjct: 291 REVQTIFLEPFRRAIREAGALGVMASYNDYDGVP--VEGSALFLSEILRGQWGFRGYVVS 348

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D  AV  I        T  D+    ++AG++I              +  +  GK+    I
Sbjct: 349 DSAAVEFIHSKHRVAPTPADAIRQAVEAGLNIRTNFTPPAAYAEPLRQLVRDGKLAMATI 408

Query: 382 DRALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKN 437
           D  + ++  V+ +LGLF+     DP         P      EH  +A  A R+ IVLLKN
Sbjct: 409 DARVRDVLRVKFQLGLFDRPYVADPAAADRVVRAP------EHLVVAQRAGREAIVLLKN 462

Query: 438 DKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYAS 494
           +   LPL++  +  + + GPL ++       Y        + L GL A +    +  YA 
Sbjct: 463 EPALLPLDRAKLQRVLVAGPLADDAHAWWSRYGAQRLDFVTPLPGLRAKLGAAVEVRYAK 522

Query: 495 GCH-------------DVP-CNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSL 540
           G               D P     AG   AV  A+  D +I V G       E   R+SL
Sbjct: 523 GVEAKDAAWPASDVLKDPPSAEVRAGIEAAVAAAQNVDVIIAVLGETDELCRESSSRISL 582

Query: 541 LLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALA 600
            LPG Q  L+ ++  T K P++LVL+ G PL V +A     + +I+ + +PGE G  ALA
Sbjct: 583 ALPGYQQELLEALHATGK-PLVLVLSNGRPLSVVWAA--RHVPAIVELWFPGEDGGAALA 639

Query: 601 EIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQ--VYGFGH 658
            ++ GD NP GRLP+T +P+S  ++P N           +PG   R +   +  ++ FGH
Sbjct: 640 AVLLGDANPSGRLPIT-FPQSVGQLPYN--------FPAHPGSQARDFGQVEGSLFPFGH 690

Query: 659 GLSYTNYSYKFLS-APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQ 717
           GLSYT + Y  L   P  + +     AG     L+ + SR     +  V       F + 
Sbjct: 691 GLSYTTFRYSDLRITPERIPVDGFGAAGGGDPGLRGSASRATPYSVSTVPE-----FTIT 745

Query: 718 ISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLS 777
             VTN G   G  VV L+ R    S  T +  L GF RV      +K ++F +     L 
Sbjct: 746 CDVTNTGTRAGDEVVQLYLRDDYSSVTTYDIALRGFARVTLAPGETKPVTFTLHRA-HLE 804

Query: 778 IANKHGRRILPLGNHVLMVG------ELRHSLTI 805
           + N+ G  ++  G   +M+G       LR + T+
Sbjct: 805 LYNRDGDWVVEPGRFTVMLGASSADIRLRGTFTV 838


>gi|167765093|ref|ZP_02437206.1| hypothetical protein BACSTE_03479 [Bacteroides stercoris ATCC
           43183]
 gi|167696721|gb|EDS13300.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           stercoris ATCC 43183]
          Length = 944

 Score =  249 bits (637), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 205/753 (27%), Positives = 331/753 (43%), Gaps = 123/753 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRELIRQVGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH  AY   K         +  +  
Sbjct: 253 -----------------------------QVAATAKHFAAYSNNKGAREGMARVDPQMPP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
           ++ E+    PF+  I +     +M SYN  +G+P  ++G  +    + R E GF+GY+ S
Sbjct: 284 REVENIHIYPFKRVIREAGLLGVMSSYNDYDGIP--IQGSYYWLTTRLRKEMGFRGYVVS 341

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DAV  ++   N  K  +++    ++AG+++ C        +   +  + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVI 401

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+  +GLF+   +    G    D+V    ++ +AL A+R+ IVLLKN    
Sbjct: 402 NDRVRDILRVKFLIGLFDAPYQTDLAG--ADDEVEKEANEAVALQASRESIVLLKNTDNT 459

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
           LPLN + +  +A+ GP  +        Y  +     ++L G+       ++  Y  GC  
Sbjct: 460 LPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIREKAQGKAEVLYTKGCDL 519

Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
                      + P   D  A    AV  A++AD  +VV G       E++ R SL LPG
Sbjct: 520 VDAHWPESEIMEYPLTPDEQAEIDRAVANARQADVAVVVLGGGQRTCGENKSRTSLELPG 579

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
            Q+ L+ +V  T K PVIL+L  G PL V++A  D  + +IL   YPG  G   +A+I+F
Sbjct: 580 HQLKLLQAVQATGK-PVILILINGRPLSVNWA--DKFVPAILEAWYPGSKGGTVVADILF 636

Query: 605 GDFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
           GD+NPGG+L +T +P++  ++P N       + D  +              +Y FG+GLS
Sbjct: 637 GDYNPGGKLTVT-FPKTVGQIPFNFPYKPASQIDGGKNPGPDGNMSRINGALYPFGYGLS 695

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT + Y      S+L I+  +   + K                           +++ VT
Sbjct: 696 YTTFEY------SDLEITPKVITPNQKAT-------------------------IRLKVT 724

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G   G  VV L+ R    S  T EK L GF+R+H     SKEI F +D  + L + N 
Sbjct: 725 NTGKRAGDEVVQLYTRDILSSVTTYEKNLAGFERIHLKPGESKEIVFTLD-RKHLELLNA 783

Query: 782 HGRRILPLGNHVLMVG------ELRHSLTIETY 808
             +  +  G   +M G       L   LT+E Y
Sbjct: 784 DMKWTVEPGEFAIMAGASSEDIRLNGILTVEDY 816


>gi|393787054|ref|ZP_10375186.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
           CL02T12C05]
 gi|392658289|gb|EIY51919.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
           CL02T12C05]
          Length = 958

 Score =  249 bits (637), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 208/751 (27%), Positives = 344/751 (45%), Gaps = 119/751 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 165 RLGIPV-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRKLIHQVGLITGR 212

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E VK  Q  N++
Sbjct: 213 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVKGMQ-HNYQ 265

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH IAY   K         +  ++ 
Sbjct: 266 ------------------------------VAATGKHFIAYSNNKGAREGMARVDPQMSP 295

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-CLRGDLFQKARNEWGFKGYITSDC 327
           ++ E     PF+  I++     +M SYN  +G+P       L  + R + GF+GY+ SD 
Sbjct: 296 REVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGLPVQSSYYWLMTRLRGQMGFRGYVVSDS 355

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  ++      K  +++    ++AG+++ C        +   +  + +G + E+ I+ 
Sbjct: 356 DAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVQEGGLSEEIIND 415

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+  +GLF+   +    G    ++V   E++ +AL A+R+ IVLLKNDK  LP
Sbjct: 416 RVRDILRVKFLVGLFDTPYQTDLKG--ADEEVEKEENEIVALQASRESIVLLKNDKNALP 473

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH--- 497
           L+  ++  +A+ GP  +  +     Y  +     ++L G+   V   ++  Y  GC    
Sbjct: 474 LDVASIRKIAVCGPNADETAYALTHYGPLAVDVTTVLSGIRQKVDGKAEVLYTKGCELVD 533

Query: 498 ---------DVPCNSDA--GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
                    D P  +D      +AV  AK+AD  +VV G       E++ R SL LPG+Q
Sbjct: 534 ANWPESEIIDYPLTNDEQNKIDKAVAQAKEADVAVVVLGGGQRTCGENKSRSSLDLPGRQ 593

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
           + L+ +V  T K PV+LVL  G PL V++A  D  + +I+   YPG  G  A+A+++FGD
Sbjct: 594 LDLLKAVQATGK-PVVLVLINGRPLSVNWA--DKFVPAIIEAWYPGSKGGTAVADVLFGD 650

Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           +NPGG+L +T +P+S  ++P N     + + D  +    +         +Y FGHGLSYT
Sbjct: 651 YNPGGKLTVT-FPKSVGQIPFNFPCKPSSQIDGGKNPGPKGNMSRVNGALYPFGHGLSYT 709

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            + Y      S+++IS  +   + K                           V+  +TN 
Sbjct: 710 TFEY------SDISISPKVITPNQK-------------------------VQVRCKITNT 738

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G   G  VV L+ R    S  T EK L GF+R+H     +KE+SF +D  + L + N   
Sbjct: 739 GKRAGDEVVQLYVRDILSSVTTYEKNLEGFERIHLQPGETKEVSFTLD-RKALELLNAKN 797

Query: 784 RRILPLGNHVLMVG------ELRHSLTIETY 808
             ++  G+  +M+G       L   LT++ Y
Sbjct: 798 DWVVEPGDFSIMLGASSEDIRLSDILTVKEY 828


>gi|150003144|ref|YP_001297888.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|149931568|gb|ABR38266.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
          Length = 785

 Score =  249 bits (637), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 221/754 (29%), Positives = 349/754 (46%), Gaps = 141/754 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  E  HG  + G            T FP  L +A+++N  L   +G A+A+
Sbjct: 131 RLGIPVL-FAEECPHGHMAIG-----------TTVFPTALSAASTWNEGLMLKMGEAIAL 178

Query: 150 EARAM-YNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR    N+G      + P +++ R+PRW R +ET GEDP++ +   V  +K  QG+   
Sbjct: 179 EARLQGANIG------YGPVLDVAREPRWSRMEETFGEDPVLTTIMGVAMMKGMQGKV-- 230

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYS--FNAM 265
                                   + D   L A  KH  AY + + G N SR +     +
Sbjct: 231 ------------------------QNDGKHLYATLKHFAAYGVPESGHNGSRANCGMRQL 266

Query: 266 ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYIT 324
           ++E      + PPFR  +++G A  +M SYN ++GVP     +L     RN+WGFKG++ 
Sbjct: 267 LSE------YLPPFRKAVKEG-AGTLMTSYNAIDGVPCTANKELLTDVLRNQWGFKGFVY 319

Query: 325 SDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG-TCMLRHTQSAIDKGKVQEKDIDR 383
           SD  ++  I   +   K ++++A   LKAG+D++ G     ++ + A ++G +   D+DR
Sbjct: 320 SDLISIEGIVGMRA-AKDNKEAAVKALKAGLDMDLGGNAFGKNLKKAYEEGLITMADLDR 378

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
           A+ N+  ++ ++GLF  +P      +L    V + EHK+LA   AR+G+VLLKN+   LP
Sbjct: 379 AVGNVLRLKFQMGLFE-NPYVSP--ELAKKLVHSKEHKELARQVAREGVVLLKNEG-VLP 434

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTG--IPCSPKSLLRGLEAYVS---KTHYASGCH- 497
           L+K+ +  LA+IGP  + +    G YT   +     ++L G+ A VS   +  Y  GC  
Sbjct: 435 LSKH-IGHLAVIGPNADEMYNQLGDYTAPQVREEVATVLDGIRAAVSESTRVTYVKGCAV 493

Query: 498 ------DVPCNSDAGFHE-----AVRIAKKADFVIVVAGLDLTQETEDR----------- 535
                 D+P    A          V  +   DF            +ED            
Sbjct: 494 RDTTATDIPAAVAAAQKADAVVLVVGGSSARDFKTKYISTGAATVSEDAKTLPDMDCGEG 553

Query: 536 -DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            DR SL L G Q  L+++VA T K P+++V   G  ++++ A   +Q     W  YPGE 
Sbjct: 554 FDRSSLRLLGDQEKLISAVASTGK-PLVVVYIQGRTMNMNLAAEKAQALLTAW--YPGEQ 610

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVY 654
           G   +A+I+FGD++P GRLP++  P S  ++P+           Q   R Y    GT +Y
Sbjct: 611 GGMGIADILFGDYSPAGRLPVS-VPRSEGQLPV--------FYSQGTQRDYVESKGTPLY 661

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT ++Y  L           L+ G++   LQ               +CT    
Sbjct: 662 AFGYGLSYTRFTYSGL----------ELQKGTEMETLQ-------------TVACT---- 694

Query: 715 HVQISVTNAGDVDGSHVVMLFA--RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
                VTN G+ DG  VV L+   +V  VSQ  P   L  F R+      S+++ F +  
Sbjct: 695 -----VTNTGNRDGEEVVQLYIGDKVASVSQ--PPLLLKAFQRIFLKKGESRQVIFHLKK 747

Query: 773 CEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            + L I +     ++  G   +MVG   + + +E
Sbjct: 748 -DDLGIYDSEMNYVVEPGEFKVMVGAASNDIRLE 780


>gi|423300893|ref|ZP_17278917.1| hypothetical protein HMPREF1057_02058 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472228|gb|EKJ90756.1| hypothetical protein HMPREF1057_02058 [Bacteroides finegoldii
           CL09T03C10]
          Length = 798

 Score =  249 bits (637), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 207/735 (28%), Positives = 340/735 (46%), Gaps = 127/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 149 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAD 196

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  +AP ++I +DPRWGR  E+ GEDP +V     + +   Q E   
Sbjct: 197 EAKAL------GYTNIYAPILDIAQDPRWGRVVESYGEDPYLVGELGKQMILGLQNEG-- 248

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 249 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 278

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGY+ SD 
Sbjct: 279 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 338

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
           +AV  +      T T E+ AA V+ AG++I          +   + AI +GKV    +D+
Sbjct: 339 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAISEGKVSLHTLDQ 398

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVC-TSEHKKLALDAARQGIVLLKND 438
            +  +  V+  +GLF+    GD R+       P+ V   + H+ +++ AA + IVLLKN+
Sbjct: 399 RVGEILRVKFMMGLFDNPYPGDDRR-------PEVVVHNAAHQDVSMRAALESIVLLKNE 451

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
           K+ LPL+K + + +A+IGP    + ++   Y     S K++ +G++ Y+  ++  YA GC
Sbjct: 452 KEMLPLSK-SFNKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNAEVRYAKGC 510

Query: 497 ------------HDVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       ++VP ++   A  +EAV +AK +D  I+V G +     E+  R +L L
Sbjct: 511 DIIDKYFPESELYNVPLDTQEKAMINEAVELAKASDVAILVLGGNEKTVREEFSRTNLDL 570

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PV+LV+  G    +++  A+  + +I+   +PGE    A+A++
Sbjct: 571 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 627

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NPGGRL +T +P+S  ++P          S+       +      +Y FG+GLSY
Sbjct: 628 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVAGVLYPFGYGLSY 680

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L +S  +  G+ +NI                 SCT         V N
Sbjct: 681 TTFGY------SDLKVSKPV-IGAQENI---------------TLSCT---------VKN 709

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T +K L GF+R+H      + ISF + P + L + +K+
Sbjct: 710 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEERTISFTLTP-QDLGLWDKN 768

Query: 783 GRRILPLGNHVLMVG 797
               +  G+  +MVG
Sbjct: 769 NHFTVEPGSFSVMVG 783


>gi|313203744|ref|YP_004042401.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443060|gb|ADQ79416.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 1286

 Score =  249 bits (636), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 159/443 (35%), Positives = 228/443 (51%), Gaps = 46/443 (10%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + NTS S   RA  LIS LTL+EK   L ++ +AIPRLGI +   WSE+LHGI     G 
Sbjct: 33  YLNTSYSFEERAADLISRLTLEEKESLLGNSMAAIPRLGIKSMNVWSEALHGILG---GA 89

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFR 173
           N +  +S  TSFP  +   ++++ +L      A+A EARA+   G  GLT+W+P +   R
Sbjct: 90  NQSVGISGPTSFPNSVALGSAWDPALMQREAMAIADEARAINQTGTKGLTYWSPVVEPIR 149

Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           DPRWGR  E+ GEDP + +  A  FV+   G               +   LK        
Sbjct: 150 DPRWGRTGESYGEDPFLAAEIAGGFVRGMVG--------------NDPTYLKSV------ 189

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
                   C KH  A + E    F R+  ++ +  +D  + +  P++  IEQ     IM 
Sbjct: 190 -------PCAKHYFANNSE----FDRHVSSSNMDSRDMREFYLAPYKKLIEQDNLPSIMS 238

Query: 294 SYNQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
           SYN VNGVP           AR  +G KGYIT DC A+  I+    Y KT E++ A  LK
Sbjct: 239 SYNAVNGVPTSASQLYLDTIARRTYGLKGYITGDCAAIEDIYTGHYYVKTAEEATAKGLK 298

Query: 353 AGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP 412
           AG+D +CG+   R+  +A+ KG +   DIDRALLN+F V++R G F+  P K  Y +  P
Sbjct: 299 AGVDSDCGSIYQRYAIAALKKGLITMADIDRALLNIFIVRMRTGEFD-PPAKVLYAQFQP 357

Query: 413 DDVCTSEHKKLALDAARQGIVLLKN------DKKFLPLNKNAVSSLAIIGPLVNNISQMG 466
           + V +  +K LA + A +  VLLKN      ++K LPLN   +  +A+IGP  + +    
Sbjct: 358 NIVNSPANKALAKEIATKTPVLLKNNISLKTNRKALPLNPADLKKIALIGPHADKVEL-- 415

Query: 467 GGYTGIPCSPKSL--LRGLEAYV 487
           G Y+G P     +    G++ Y+
Sbjct: 416 GPYSGRPAQENMITPFAGIKKYI 438



 Score =  110 bits (275), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 79/255 (30%), Positives = 127/255 (49%), Gaps = 39/255 (15%)

Query: 525 GLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISS 584
           G D    TE+ DR++LLLPG Q+ L+ +VA  +    I+V+   G ++V   +    I  
Sbjct: 619 GTDEKTATEEADRLTLLLPGNQVELIKAVAAVNPN-TIVVMQTLGCVEVEEFKNLQNIPG 677

Query: 585 ILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGR 643
           I+W+GY G+A   A+A ++FG+ NPGG+L  TWY +S   +P + D  +R  + +   GR
Sbjct: 678 IIWVGYNGQAQGDAIASVLFGEVNPGGKLNGTWY-KSVKDLPEITDYTLRGGNGKN--GR 734

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHI 703
           ++ ++     Y FG G+SYT + Y      S   IS        KN +         +  
Sbjct: 735 TFWYFDKDVSYEFGFGMSYTTFEY------SNFRIS--------KNSI---------IPH 771

Query: 704 DEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK--VSQGTPEKQLIGFDRVHTVAK 761
           D++T        V + V N G V+G  V+ ++ + P    S   P K+L GF RV   A 
Sbjct: 772 DKIT--------VSVDVKNTGKVEGDEVIQVYMKTPDSPASLQRPIKRLKGFKRVTLPAG 823

Query: 762 GSKEISFGVDPCEQL 776
            +K ++  ++ C  L
Sbjct: 824 QTKTVNIDIN-CADL 837


>gi|383110724|ref|ZP_09931543.1| hypothetical protein BSGG_1833 [Bacteroides sp. D2]
 gi|382949470|gb|EFS31133.2| hypothetical protein BSGG_1833 [Bacteroides sp. D2]
          Length = 783

 Score =  249 bits (636), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 212/739 (28%), Positives = 331/739 (44%), Gaps = 132/739 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L   +G A+  
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R      Q G   + P +++ RDPRW R +ET GEDP++                   
Sbjct: 179 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLT------------------ 215

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
             G IG     K +++G G   D        A  KH +AY + + G     SF  +   +
Sbjct: 216 --GEIG-----KAMVEGLGS-GDLSHPYSTLATLKHFLAYGISESGQNGNPSFAGI---R 264

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           +  + F PPFR  I+ G A  +M SYN ++G+P      L  +  RNEW F G + SD  
Sbjct: 265 ELHENFLPPFRQAIDAG-ALSVMTSYNSMDGIPCTANHSLLTELLRNEWKFSGIVVSDLY 323

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I +      T E +A   L AG+D++ G     +  +A++ G++ +  +D ++  +
Sbjct: 324 SIEGIHQSHFVAPTMEAAAILALSAGVDVDLGGDAYMNLMNAVNTGRISKTALDASVARV 383

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP K K       +V + E   LA   A+  I LLKN+   LPLN
Sbjct: 384 LRLKFEMGLFENPYVDPEKAK------KEVRSEESVTLARRVAQASITLLKNEHSLLPLN 437

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSP--KSLLRGLEAYV--SKTHYASGCHDVPC 501
           KN    +A+IGP  +N   M G YT        K++L G+   +  S+  Y  GC  +  
Sbjct: 438 KN--RKVALIGPNADNRYNMLGDYTAPQEEENIKTVLDGIRTKLSSSQVEYVKGC-SIRD 494

Query: 502 NSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRDRV 538
                  +AV  A++++ +I V G     D     +ET                E  DR 
Sbjct: 495 TVTTDIEQAVAAAQRSEVIIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRA 554

Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
           +L L G+Q  L+ ++  T K P+I+V   G PLD ++A  ++   ++L   YPG+ G  A
Sbjct: 555 TLSLLGKQQELLKALKATGK-PLIVVYIEGRPLDKTWASENAD--AVLTAYYPGQEGGNA 611

Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
           +A+++FGD+NP GRLP+T  P S  ++P+   N +A  +  Y   S      + +Y FG+
Sbjct: 612 IADVLFGDYNPAGRLPLT-VPRSVGQIPIY-YNKKAPQNHDYVELS-----ASPLYAFGY 664

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT + Y      S+L +SA                             +   F V  
Sbjct: 665 GLSYTTFEY------SDLRVSA----------------------------ISPHSFEVSF 690

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            V N G  DG  V  L+ R    S   P KQL  F+R        KE+ F +   +  +I
Sbjct: 691 KVKNTGRYDGEEVSQLYLRDEYASVVQPLKQLKHFERFCLKRGEVKEVKFVLSESD-FTI 749

Query: 779 ANKHGRRILPLGNHVLMVG 797
            +++ + ++  G   +MVG
Sbjct: 750 IDRNLKTVVESGTFQVMVG 768


>gi|295689400|ref|YP_003593093.1| glycosyl hydrolase family protein [Caulobacter segnis ATCC 21756]
 gi|295431303|gb|ADG10475.1| glycoside hydrolase family 3 domain protein [Caulobacter segnis
           ATCC 21756]
          Length = 895

 Score =  249 bits (636), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 158/449 (35%), Positives = 228/449 (50%), Gaps = 64/449 (14%)

Query: 65  AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
           A  L++ +T +EK+ QL + A AIPRLG+PAY WW+ESLHG   + P           T+
Sbjct: 43  AADLVAKMTTEEKLDQLLNTAPAIPRLGVPAYNWWTESLHGALGSLP----------TTN 92

Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GLTFWAPNINIFRDP 175
           FP+ +  AA+F+  L  ++G A+  E R ++ L +A          L  W+PNINIFRDP
Sbjct: 93  FPEPVGLAATFDTDLVKDVGGAIGAEVRGLHTLARATGRMGRIGTALNTWSPNINIFRDP 152

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRGQET GEDP + +   V FV+  QG +                            D
Sbjct: 153 RWGRGQETYGEDPHLAARMGVAFVEGVQGPD---------------------------PD 185

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
              + A  KH   Y +      +R+  N  ++ +D EDT+ P FR+ I + KA  IMC+Y
Sbjct: 186 LPQVIATPKH---YAVHSGPESTRHHANVYVSRRDLEDTYLPAFRAAIVEAKAGSIMCAY 242

Query: 296 NQVNGVPACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG 354
           N+++G PAC    L +   R  W F GY+ SDCDAV  I +   Y      + A  ++AG
Sbjct: 243 NRIDGQPACASDMLLKDYLRTAWKFDGYVVSDCDAVKDINDNHKYAPDAATAVAAAVRAG 302

Query: 355 MDINCGTCMLRHT-------QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKY 407
           +D  C    L  T       + A+ +G +   D+D+AL+ L+S +LR+G   G  R    
Sbjct: 303 VDNECNGATLTDTAGLSNRYRDALQQGLITIGDVDQALVRLYSARLRVGDLPGV-RPVDT 361

Query: 408 GKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG 467
               P  V T EH+ LAL A+ + +VLLKND   LPL       +A+IGPL +    + G
Sbjct: 362 APKAPSAVMTPEHEALALAASEKSLVLLKND-GLLPLKPGL--RVAVIGPLGDATRVLRG 418

Query: 468 GY-TGIPCSPKSLLRGLEAYV--SKTHYA 493
            Y + +   P S++ GL   +  +K  YA
Sbjct: 419 NYSSALSAPPISVVEGLRRALPGAKVAYA 447



 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 83/290 (28%), Positives = 129/290 (44%), Gaps = 54/290 (18%)

Query: 518 DFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
           D ++ V GL    E E+           D+ +L +P  Q +L+   A+   +P+++V   
Sbjct: 625 DVLVAVVGLTSDLEAEESPVEIPGFKGGDKTTLDIPADQQALLEQ-AKALGKPLVVVAMN 683

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PL++S+A+ ++  ++IL   YPG++G  A+A ++ G  NP GRLP+T+Y       P 
Sbjct: 684 GSPLNLSWAKDNA--AAILEAWYPGQSGGLAIANVLTGKTNPSGRLPLTFYKSVEDLPPF 741

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
            D  M         GR+YR++TG  VY FG+GLSYT + Y            A LK    
Sbjct: 742 GDYRMD--------GRTYRYFTGQPVYPFGYGLSYTRFDY------------APLK---- 777

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
                          ++ +         V  +V N G   G  V  L+   P  S G P 
Sbjct: 778 ---------------VEPIKGDAGQGLRVTTTVRNVGQRAGDEVAQLYLNFPN-SPGAPR 821

Query: 748 KQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             L GF RV      +K ++F +     LS  +  G R +  G + + VG
Sbjct: 822 VALRGFQRVSLKPGEAKSVTFSLS-SRDLSSVDPDGVRQVMTGRYKVSVG 870


>gi|189468349|ref|ZP_03017134.1| hypothetical protein BACINT_04746 [Bacteroides intestinalis DSM
           17393]
 gi|189436613|gb|EDV05598.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 786

 Score =  249 bits (636), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 218/742 (29%), Positives = 348/742 (46%), Gaps = 123/742 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L S I    A 
Sbjct: 137 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKELISEIAQVTAE 184

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E  GEDP +V                 
Sbjct: 185 EAKAL------GYTNIYSPILDIAQDPRWGRVVECYGEDPFLV----------------- 221

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
              G +G     KR++KG  +E        L A  KH   Y +   G  +    +  +  
Sbjct: 222 ---GELG-----KRMIKGLQQEG-------LVATPKHFAVYSIPVGGRDAGTRTDPHVAP 266

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
           ++    +  PFR    +  A  +M SYN  +G P  + G    L +  R+EWGFKGY+ S
Sbjct: 267 REMRTLYIEPFRKAFCEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRHEWGFKGYVVS 324

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDI 381
           D +AV  ++          D AA V+ AG+++     +    +R  + AI +GKV  + I
Sbjct: 325 DSEAVEFLYSKHQVAVDAVDGAAQVVNAGLNVRTNFTLPENFIRPLRQAISEGKVSMQTI 384

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           D  + ++  V+  +GLF+ +P KG   K     V + EH+ +++ AA + IVLLKN+   
Sbjct: 385 DSRVADVLRVKFGMGLFD-NPYKGD-AKHPEKVVHSKEHQAVSMRAALESIVLLKNENNI 442

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC--- 496
           LPL+K+ +  +A+IGP  N +  +   Y       K++ +G++ Y+  ++  YA G    
Sbjct: 443 LPLSKD-LKKIAVIGPNANEVQNLICRYGPANAPIKTVYQGIKEYLPDAEVRYAKGTDII 501

Query: 497 ---------HDVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
                    ++VP + +  A   EAV +A+++D  I+V G +     E+  R +L L G+
Sbjct: 502 DKYFPESELYEVPLDQEEQAMMDEAVTLAEESDVAIMVLGGNEKTVREEYSRTNLDLCGR 561

Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
           Q  L+ +V  T K PVIL+L  G    +++AE    I  I+   +PGE    A+A+++FG
Sbjct: 562 QEKLLQAVYATGK-PVILLLVDGRVATINWAE--RYIPGIVHAWFPGEFMGDAVAQVLFG 618

Query: 606 DFNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTN 664
           D+NPGG+L +T +P S  ++P        +DS      + +   TGT +Y FG+GLSYT 
Sbjct: 619 DYNPGGKLAVT-FPRSVGQIPFAFPFKPGSDS------KGFVRVTGT-LYPFGYGLSYTT 670

Query: 665 YSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG 724
           ++Y      S+L I   +       I  Q   +L         SC          V N G
Sbjct: 671 FAY------SDLKIENPV-------IGVQGSVKL---------SC---------KVKNTG 699

Query: 725 DVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR 784
            V G  VV L+      S  T  K L GF+R+H      K I F + P ++L + NK   
Sbjct: 700 KVAGDEVVQLYLHDEMSSVTTYVKVLRGFERIHLEPGEEKVIDFVLTP-QELGLWNKDNH 758

Query: 785 RILPLGNHVLMVGELRHSLTIE 806
            ++  G   +MVG     + ++
Sbjct: 759 FVVEPGTFAVMVGSSSQDIRLQ 780


>gi|393781221|ref|ZP_10369422.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
           CL02T12C01]
 gi|392677556|gb|EIY70973.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
           CL02T12C01]
          Length = 946

 Score =  249 bits (636), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 203/734 (27%), Positives = 333/734 (45%), Gaps = 113/734 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   IG     
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRKLIHQIGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH IAY   K         +  ++ 
Sbjct: 253 -----------------------------QVAATGKHFIAYSNNKGAREGMARVDPQMSP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ E     PF+  I++     +M SYN  +G P       L  + R + GF+GY+ SD 
Sbjct: 284 REVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGQMGFRGYVVSDS 343

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  ++      K  +++    ++AG+++ C        +   +  + +G + E+ I+ 
Sbjct: 344 DAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVQEGGLSEEVIND 403

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+  +GLF+   +    G    D+V   E++ +AL A+R+ IVLLKN+   LP
Sbjct: 404 RVRDILRVKFLVGLFDAPYQTDLKG--ADDEVEKEENEAVALQASRESIVLLKNENNTLP 461

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH--- 497
           L+  +V  +A+ GP     +     Y  +     +++ GL   +   ++  Y  GC    
Sbjct: 462 LDITSVKKIAVCGPNAAEKAYALTHYGPLAVEVTTVVDGLREKLNGKAEVLYTKGCDLVD 521

Query: 498 ---------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
                    D P + D  +   +AV  A++AD  +VV G       E++ R SL LPG+Q
Sbjct: 522 AHWPESEIIDYPLSKDEQSEIDKAVAQAQEADVAVVVLGGGQRTCGENKSRSSLDLPGRQ 581

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
           + L+ +V  T K PVILVL  G PL V++A  D  + +IL   YPG  G  A+A+++FGD
Sbjct: 582 LDLLKAVQATGK-PVILVLINGRPLSVNWA--DKFVPAILEAWYPGSKGGTAIADVLFGD 638

Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           +NPGG+L +T +P+S  ++P N     + + D  +    +         +Y FG+GLSYT
Sbjct: 639 YNPGGKLTVT-FPKSVGQIPFNFPHKPSSQIDGGKNPGTKGDMSRVNGALYPFGYGLSYT 697

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            + Y      S++ IS  +   + K                           V+  VTN 
Sbjct: 698 TFEY------SDINISPKVITPNQK-------------------------VQVRCKVTNT 726

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G   G  VV L+ R    S  T EK L GF+R+H     +KE+SF +D  + L + N   
Sbjct: 727 GKHAGDEVVQLYVRDLISSVTTYEKNLEGFERIHLQPGETKEVSFTLD-RKALELLNAKN 785

Query: 784 RRILPLGNHVLMVG 797
             ++  G+  +M+G
Sbjct: 786 DWVVEPGDFSIMLG 799


>gi|423222970|ref|ZP_17209439.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392640546|gb|EIY34345.1| hypothetical protein HMPREF1062_01625 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 862

 Score =  249 bits (635), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 154/461 (33%), Positives = 237/461 (51%), Gaps = 57/461 (12%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           + P+ N  LS + RAK L+  LTL+EK   + D++ AIPRLGI  + WWSE+LHG+A+ G
Sbjct: 20  TLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG 79

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---------LGQAG 161
                     +VT FP+ +  AASFN  L   I +AV+ E RA +N         +    
Sbjct: 80  ----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFHS 129

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFRE 220
           L+ W PN+NIFRDPRWGRGQET GEDP + S   +  VK  QG EN K         +R+
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEK---------YRK 180

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPF 279
                             L AC KH   +   +W   SR++ N   ++ +D  +T+ P F
Sbjct: 181 ------------------LLACAKHYAVHSGPEW---SRHTANLNNVSPRDLWETYLPAF 219

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           ++ +++     +MC+Y +++  P C    L Q+  R+EWGFK  + SDC A+A  +    
Sbjct: 220 KALVQKADVREVMCAYQRLDDDPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHK 279

Query: 339 YTKTHEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
            +     +A     AG D+ CG     +    A+ +G + E+++D+ +L L   +  LG 
Sbjct: 280 SSSDAVHAAVKGTMAGTDVECGYGYAYQKLPEAVSRGLITEEEVDKHVLRLMEGRFELGE 339

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + DP    + K+    V    HK L+L+ +RQ + LL+N    LPL+K ++  +A+IGP
Sbjct: 340 MD-DPSLVNWTKIPMSVVNCKAHKDLSLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGP 397

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
             ++   + G Y G P    ++L G ++ + K    Y  GC
Sbjct: 398 NADDKPMLWGNYNGTPNQTITILDGFKSKLKKNQIVYMKGC 438



 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 61/233 (26%), Positives = 101/233 (43%), Gaps = 44/233 (18%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q + + ++    K+ V +  +G     ++         +IL   Y GE G
Sbjct: 629 DRTDIELPAVQRNFLKALKDAGKQVVFVNCSGSS---MALLPETESCDAILQAWYGGELG 685

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A+++FGD+NP G+LP+T+Y  +       D +M+        GR+YR Y    ++ 
Sbjct: 686 GYAVADVLFGDYNPSGKLPVTFYKSTKQLPDYEDYSMK--------GRTYR-YMSDPLFP 736

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG GLSYT++              A   A  +K  L+           DE  + T     
Sbjct: 737 FGFGLSYTDF--------------AVGTASCNKTQLR----------TDESLTLT----- 767

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
             + V+N G   G+ VV ++ R    + G P K L  + RV   A   +++  
Sbjct: 768 --VPVSNTGKRSGTEVVQVYIRKTDDADG-PLKSLKAYARVELAAGAKQDVKI 817


>gi|365121873|ref|ZP_09338785.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363644185|gb|EHL83481.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 850

 Score =  249 bits (635), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 159/437 (36%), Positives = 221/437 (50%), Gaps = 57/437 (13%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           R   L+S LT++EKI  L   +  IPRL I  Y   +E+LHGI   G   NF       T
Sbjct: 37  RIMDLLSRLTIEEKISLLRATSPGIPRLEIEKYYHGNEALHGIVRPG---NF-------T 86

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNINIFR 173
            FPQ +  A+ +N      I + ++ EARA +N    G          LTFW+P +N+ R
Sbjct: 87  VFPQAIGLASMWNPDFLYEISTVISDEARARWNELNRGKDQKRLFSDLLTFWSPTVNMAR 146

Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           DPRWGR  ET GEDP +     V FVK  QG               + R LK        
Sbjct: 147 DPRWGRTPETYGEDPFLSGKLGVAFVKGLQGN--------------DPRYLK-------- 184

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
                + +  KH  A + E     +R+  N  I+E+D  + + P F  CI  GKA  IM 
Sbjct: 185 -----VVSTPKHFAANNEEH----NRFECNPQISERDLREYYLPAFERCIIDGKAQSIMT 235

Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
           +YN +N VP  L   L +K  R +WGF GY+ SDC A + +  +  Y KT E +A   LK
Sbjct: 236 AYNAINDVPCTLNTWLLKKVLRTDWGFNGYVVSDCGAPSLLVTHHKYVKTPEAAATLALK 295

Query: 353 AGMDINCG-TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
           AG+D+ CG    +    +A  +  V E +ID A   +   ++ LGLF+ DP K  Y  L 
Sbjct: 296 AGLDLECGDNVYIEPLMNAYKQYMVSEAEIDTAAYRILRARMMLGLFD-DPAKNPYNALS 354

Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
           P  V   +HK +AL+AARQ +VLLKN+  FLP+N   + S+A++G  +N  +   G Y+G
Sbjct: 355 PSIVGCEKHKNMALEAARQSLVLLKNENNFLPINPKKIKSIAVVG--INAGNCEFGDYSG 412

Query: 472 IPCS-PKSLLRGLEAYV 487
            P + P S+L G+   V
Sbjct: 413 KPVNVPVSVLDGIRNRV 429



 Score =  136 bits (343), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 87/292 (29%), Positives = 141/292 (48%), Gaps = 47/292 (16%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + +A +  ++ D  I V G++ + E E RDR  + LP  Q   +    + + + + +VL 
Sbjct: 591 YGDAKKAIQECDMTIAVMGINKSIEREGRDRDHIELPKDQELFIEEAYKLNPK-MAVVLV 649

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L V++   D  + +IL   YPGE G  A+AE +FGD+NP GRLP+T+Y       P
Sbjct: 650 AGSSLAVNWM--DEHVPAILNAWYPGEQGGTAVAEALFGDYNPAGRLPLTYYRSLDDLPP 707

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
            +D  ++ +       R+Y ++TG  +Y FG+GLSYT + Y+ LS               
Sbjct: 708 FDDYAVQKN-------RTYMYFTGKPLYAFGYGLSYTKFDYRKLS--------------- 745

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                           +D+      L F    ++ N+G  +G  V  ++ + P++    P
Sbjct: 746 ----------------VDQDAENVRLSF----TIKNSGKYNGDEVAQVYVQFPEIGVKVP 785

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVG 797
            KQL GF+RVH     +  ++  V P ++L I N + G    P GN+V MVG
Sbjct: 786 IKQLKGFERVHIAKGKTLPVTITV-PKKELRIWNERKGEFFTPSGNYVFMVG 836


>gi|423214394|ref|ZP_17200922.1| hypothetical protein HMPREF1074_02454 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692809|gb|EIY86045.1| hypothetical protein HMPREF1074_02454 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 800

 Score =  249 bits (635), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 208/738 (28%), Positives = 338/738 (45%), Gaps = 133/738 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAN 198

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E+ GEDP +      + +   Q E   
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLAGELGKQMILGLQSEG-- 250

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGY+ SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG-------TCMLRHTQSAIDKGKVQEKD 380
           +AV  +      T T E+ AA V+ AG++I             LRH   AI++GKV    
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRH---AINEGKVSLHT 397

Query: 381 IDRALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLL 435
           +D+ +  +  V+  +GLF+    GD R+       P+ V  ++ HK +++ AA + +VLL
Sbjct: 398 LDQRVSEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESVVLL 450

Query: 436 KNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYA 493
           KN  + LPL+KN    +A+IGP    + ++   Y     S K++ +G++ Y+  S+  YA
Sbjct: 451 KNKNQMLPLSKN-FKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYA 509

Query: 494 SGC------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVS 539
            GC            ++VP ++   A   EAV +AK +D  I+V G +     E+  R +
Sbjct: 510 KGCDIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTN 569

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           L L G+Q  L+ +V  T K PV+LV+  G    +++  A+  + +I+   +PGE    A+
Sbjct: 570 LDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAI 626

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           A+++FGD+NPGGRL +T +P+S  ++P          S+       +      +Y FG+G
Sbjct: 627 AKVLFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYG 679

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT + Y      S+L IS  +  G  +NI                 SCT         
Sbjct: 680 LSYTTFGY------SDLKISKPV-IGPQENI---------------TLSCT--------- 708

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIA 779
           V N G   G  VV L+ R    S  T +K L GF+R+H      + ++F + P + L + 
Sbjct: 709 VKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLW 767

Query: 780 NKHGRRILPLGNHVLMVG 797
           +K+ +  +  G+  +MVG
Sbjct: 768 DKNNQFTVEPGSFSVMVG 785


>gi|160887545|ref|ZP_02068548.1| hypothetical protein BACOVA_05565 [Bacteroides ovatus ATCC 8483]
 gi|156107956|gb|EDO09701.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 736

 Score =  249 bits (635), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 216/750 (28%), Positives = 334/750 (44%), Gaps = 135/750 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L   +G  +A 
Sbjct: 83  RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 130

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R+     Q G   + P +++ RDPRW R +ET GEDP++        V    G N   
Sbjct: 131 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLGGGNLSQ 185

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
               I                          A  KH +AY + + G    Y   A +  +
Sbjct: 186 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 216

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
           D    F PPFR  I+ G A  +M SYN ++G P      L  Q  RNEW F+G++ SD  
Sbjct: 217 DLHQNFLPPFRKAIDAG-ALSVMTSYNSIDGTPCTSNHYLLTQLLRNEWKFRGFVVSDLY 275

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I E      T E++A   + AG+D++ G     +   A+  G++ +  ID A+  +
Sbjct: 276 SIEGIHESHFVAPTKENAAIQSVMAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRV 335

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP      K+    V   EH +LA   A+  I LLKN+   LPL+
Sbjct: 336 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 389

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
           K  ++ +A+IGP  +N   M G YT      + K++L G+   +S  +  Y  GC   D 
Sbjct: 390 KT-INKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDT 448

Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
             N      +A++ A++++ VIVV G     D     +ET                E  D
Sbjct: 449 TVNE---IEQAIKAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 505

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
           R SL L G+Q  L+ S+ +T K P+I+V   G PL+ ++A       ++L   YPG+ G 
Sbjct: 506 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 562

Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
            A+A+++FGD+NP GRLP++  P S  ++P+   N +A  +  Y   S        +Y F
Sbjct: 563 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNKKAPRNHDYVEMS-----SFPLYSF 615

Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
           G+G+SYT + Y                  SD  ++Q++              C    F V
Sbjct: 616 GYGMSYTTFEY------------------SDLQVVQKSA------------RC----FEV 641

Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
              V N G  DG  V  L+ R    S   P KQL  F+R H      K+++F V   E  
Sbjct: 642 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 700

Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            + N   ++++  GN  LM+G   + + ++
Sbjct: 701 FLVNYTLKKVVESGNFHLMIGAASNDIRLQ 730


>gi|423291211|ref|ZP_17270059.1| hypothetical protein HMPREF1069_05102 [Bacteroides ovatus
           CL02T12C04]
 gi|392663822|gb|EIY57367.1| hypothetical protein HMPREF1069_05102 [Bacteroides ovatus
           CL02T12C04]
          Length = 800

 Score =  249 bits (635), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 207/735 (28%), Positives = 339/735 (46%), Gaps = 127/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L   I    A 
Sbjct: 151 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKKLIREIAKVTAN 198

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E+ GEDP +      + +   Q E   
Sbjct: 199 EAKAL------GYTNIYSPILDIAQDPRWGRVVESYGEDPYLAGELGKQMILGLQSEG-- 250

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         + A  KH   Y +   G       +  +  
Sbjct: 251 ------------------------------IVATPKHFAVYSIPVGGRDGGTRTDPHVAP 280

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGY+ SD 
Sbjct: 281 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 340

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
           +AV  +      T T E+ AA V+ AG++I          +   + AI++GKV    +D+
Sbjct: 341 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAINEGKVSLHTLDQ 400

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVCTSE-HKKLALDAARQGIVLLKND 438
            +  +  V+  +GLF+    GD R+       P+ V  ++ HK +++ AA + IVLLKN+
Sbjct: 401 RVGEILRVKFMMGLFDNPYPGDDRR-------PETVVHNDAHKAVSMKAALESIVLLKNE 453

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
            + LPL+KN  S +A+IGP    + ++   Y     S K++ +G++ Y+  S+  YA GC
Sbjct: 454 NQMLPLSKN-FSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAKGC 512

Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       ++VP ++   A   EAV +AK +D  I+V G +     E+  R +L L
Sbjct: 513 DIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDVAILVLGGNEKTVREEFSRTNLDL 572

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PV+LV+  G    +++  A+  + +I+   +PGE    A+A++
Sbjct: 573 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIIHAWFPGEFMGDAIAKV 629

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NPGGRL +T +P+S  ++P          S+       +      +Y FG+GLSY
Sbjct: 630 LFGDYNPGGRLAVT-FPKSVGQIPFAFPFKPGSDSKG------KVRVDGVLYPFGYGLSY 682

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L IS  +  G  +NI                 SCT         V N
Sbjct: 683 TTFGY------SDLKISKPV-IGPQENI---------------TLSCT---------VKN 711

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T +K L GF+R+H      + ++F + P + L + +K+
Sbjct: 712 TGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QDLGLWDKN 770

Query: 783 GRRILPLGNHVLMVG 797
            +  +  G+  +MVG
Sbjct: 771 NQFTVEPGSFSVMVG 785


>gi|427384377|ref|ZP_18880882.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727638|gb|EKU90497.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
           12058]
          Length = 1050

 Score =  248 bits (634), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 155/449 (34%), Positives = 230/449 (51%), Gaps = 57/449 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N +     R   L+S LT++EKI  L   +  I RL IP Y   +E+LHG+   G   
Sbjct: 29  YKNENAPTHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPG--- 85

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  AA++N  L   + + ++ EARA +N    G          LT
Sbjct: 86  -------RFTVFPQAIGLAATWNPVLQEQVATVISDEARARWNELDQGREQKSQFSDLLT 138

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FVK  QG +               R 
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGND--------------SRY 184

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
           LK             + +  KH  A + E     +R+  N  I+E+   + + P F +C+
Sbjct: 185 LK-------------IVSTPKHFAANNEE----HNRFVCNPQISEKQLREYYLPAFEACV 227

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           + GK++ IM +YN +N VP  L   L  K  RN+WGFKGY+ SDC   + +     Y KT
Sbjct: 228 KDGKSASIMSAYNALNDVPCTLNAWLLTKVLRNDWGFKGYVVSDCGGPSLLVNAHKYVKT 287

Query: 343 HEDSAAGVLKAGMDINCGTCMLRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   +KAG+D+ CG  +      SA  +  V + DID A   +   +++LGLF+  
Sbjct: 288 KEAAATLSIKAGLDLECGDDVYDEPLLSAYRQYMVTDADIDSAAYRVLRARMQLGLFDSG 347

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
             K  Y K+ P  + + EH+++AL+AAR+ IVLLKN KK LPLN   + S+A++G  +N 
Sbjct: 348 -EKNPYTKISPAVIGSKEHQEVALNAARECIVLLKNQKKMLPLNAKKIKSIAVVG--INA 404

Query: 462 ISQMGGGYTGIPC-SPKSLLRGLEAYVSK 489
            S   G Y+G+P  +P S+L+G++  V +
Sbjct: 405 GSSEFGDYSGLPVIAPVSVLQGIKDRVGE 433



 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 82/292 (28%), Positives = 142/292 (48%), Gaps = 47/292 (16%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +  ++ + V+ V G++ + E E +DR  + LP  Q   +  + + +   +++VL 
Sbjct: 593 YGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IVVVLV 651

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L V++   D  + +I+   YPGE+G KA+AE++FGD+NPGGRLP+T+Y       P
Sbjct: 652 AGSSLAVNWM--DEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPP 709

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS-APSELTISASLKAG 685
            +D ++         GR+Y+++ G  +Y FG+GLSYT++ Y  L  A  E  +S S +  
Sbjct: 710 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYTSFKYSNLQVADGEEEVSVSFQ-- 760

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
                                             + N G   G  V  ++ ++P+  +  
Sbjct: 761 ----------------------------------LKNTGRYAGDEVAQVYVKLPEREEVM 786

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           P K+L GF+RV   +  SK+++  +            G+ I P GN+ +MVG
Sbjct: 787 PVKELKGFERVSLKSGESKKVTIKLRKDLLRYWDEAKGKFIYPSGNYNIMVG 838


>gi|325299987|ref|YP_004259904.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
 gi|324319540|gb|ADY37431.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
          Length = 864

 Score =  248 bits (634), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 154/459 (33%), Positives = 226/459 (49%), Gaps = 61/459 (13%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N  L+   RA  L+  LTL+EK   + + + AIPRLGI AY+WW+E+LHG+      
Sbjct: 26  PYQNPKLTPEERANDLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGV------ 79

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTF 164
               G     T FPQ +  AASF+  L   + +AV+ EARA Y   +         GLTF
Sbjct: 80  ----GRAGIATVFPQTIGMAASFDDELLYQVFTAVSDEARAKYTQFRKEGDLKRYQGLTF 135

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           W PN+NIFRDPRWGRGQET GEDP + S   +  V+  QG      D             
Sbjct: 136 WTPNVNIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEDAPYD------------- 182

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
                         L AC KH   +   +W   +R+ FNA  I  +D  +T+ P F+  +
Sbjct: 183 -------------KLHACAKHFAVHSGPEW---NRHEFNAENIAPRDLWETYMPAFKDLV 226

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++     +MC+YN++ G P C    L     R+EWG++G + SDC A++  +   ++ +T
Sbjct: 227 QKAHVKEVMCAYNRLEGEPCCGNNRLLTHILRDEWGYQGIVVSDCGAISDFWRKGDH-ET 285

Query: 343 HEDSA---AGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
           H D A   AG + +G D+ CG+   +    A+  G + E  +D ++  L   +  LG  +
Sbjct: 286 HPDKAHASAGAVLSGTDLECGSN-YKSLPEAVKAGLIAESQLDISVKRLLKARFELGEMD 344

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            D     +  +    V    HK LAL  AR+ IVLL+N    LPL K+    +A++GP  
Sbjct: 345 KDV---CWDTIPYSVVDCQAHKDLALRMARESIVLLQNRNNILPLRKDM--KIALVGPNA 399

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
           N+     G Y G P   ++L   L+  +  S+  Y  GC
Sbjct: 400 NDSIMHWGNYNGFPSHTETLYEALKKRLPASQLIYEFGC 438



 Score =  106 bits (265), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 82/282 (29%), Positives = 125/282 (44%), Gaps = 59/282 (20%)

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
           D+P N  A         K AD ++   G+  T E E+           DR S+ LP  Q 
Sbjct: 586 DIPVNLQA----TADKVKDADVILFAGGISPTLEGEEMPVDAEGFRGGDRTSIELPAIQR 641

Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQI-SSILWIGYPGEAGAKALAEIIFGD 606
            LV  + +  K P++ +   G  + ++    +S+I   ++   YPG+AG  A+A+++FGD
Sbjct: 642 QLVGELKKLGK-PIVFINYSGSAMGLA---PESEICDGMIQAWYPGQAGGTAIADVLFGD 697

Query: 607 FNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYS 666
           +NP G+LP+T+Y  +       D  M+        GR+YR+ T T ++ FGHGLSYT + 
Sbjct: 698 YNPAGKLPVTFYRNTEQLPDFEDYAMK--------GRTYRYMTETPLFRFGHGLSYTTFD 749

Query: 667 YKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDV 726
           Y               KA   +N   +           E  + T       I V+N G  
Sbjct: 750 YG--------------KARLSQNTFSK----------GETLTLT-------IPVSNTGTR 778

Query: 727 DGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
           DG   V ++ R P  +   P   L  F RV+    G+KEI F
Sbjct: 779 DGEETVQVYLRRPGDADA-PSHTLRAFKRVYVPKGGTKEIKF 819


>gi|387789568|ref|YP_006254633.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
           3403]
 gi|379652401|gb|AFD05457.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
           3403]
          Length = 742

 Score =  248 bits (634), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 230/809 (28%), Positives = 366/809 (45%), Gaps = 136/809 (16%)

Query: 47  PHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGI 106
           PHF          +I  R  SL+ L+TL+EK+ Q++  +      G    +     L  +
Sbjct: 17  PHF----VAAQKKTIDQRVDSLLQLMTLEEKVGQMNQYSGPWAHTGPITED--GNILQQV 70

Query: 107 ASN--GPGVNFNGTVSSV--------------------------TSFPQVLVSAASFNRS 138
                G  +N NG   +                           T+FP  L  AAS++  
Sbjct: 71  QEGKLGSMLNINGVAHTKELQTLALKSRLKIPLLFGQDVIHGYRTTFPIPLGEAASWD-- 128

Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
               + +       A      +G+ + +AP ++I RDPRWGR  E  GEDP + S  A  
Sbjct: 129 ----LEAMEQSARVAATEAAASGIHWTFAPMVDISRDPRWGRVMEGAGEDPYLGSLIAKA 184

Query: 198 FVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNF 257
            VK FQG                          +  GD   + AC KH  AY     G  
Sbjct: 185 RVKGFQG--------------------------NKLGDINSVMACAKHFAAYGAAIGGR- 217

Query: 258 SRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQK 312
               +N++ ++++   + + PPF++  E G A+  M S+N +NGVPA     L+ D+   
Sbjct: 218 ---DYNSVDMSDRTLWEIYLPPFKAAAEAGVAT-FMNSFNDLNGVPASASSYLQRDIL-- 271

Query: 313 ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT-CMLRHTQSAI 371
            + +W F G++ SD  ++  + ++      +E S A V+ AG D++  + C  ++    +
Sbjct: 272 -KGKWNFTGFVVSDWGSIGEMIKHGYVKDCYEASQAAVM-AGSDMDMESRCYTQNLVHLV 329

Query: 372 DKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQG 431
            +GKV E  ID A+  +   +  LGLF  DP +    K     +   +HK +A D A++ 
Sbjct: 330 KEGKVPETVIDDAVRRILRKKFELGLFE-DPFRFCDVKREQQALNNPKHKAIARDVAKKS 388

Query: 432 IVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY-TGIPCSPK--SLLRGLEAYV- 487
           +VLLKN+   LPL+K A  ++AIIGPL  +   M G +    P S    S   G++  + 
Sbjct: 389 VVLLKNNNNVLPLSKQA-KTIAIIGPLAKSERDMLGFWAVDWPDSSYIVSQFEGIQQKLG 447

Query: 488 --SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
             SK  YA GC ++  NS +GF EA+ +A+KAD V++  G       E + R ++ LPG 
Sbjct: 448 NNSKLLYAMGC-NIEDNSTSGFAEAIEVAQKADVVLLSVGERRDMSGEAKSRSTIHLPGV 506

Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSF-AEADSQISSILWIGYPGEAGAKALAEIIF 604
           Q  L+ ++  T K PV++++  G PL  ++ A+    I    W+G   EAG  A+A+++F
Sbjct: 507 QEELIKAIKATGK-PVVVLINAGRPLIFNWTADNADAIVYTWWLG--SEAG-NAIADVLF 562

Query: 605 GDFNPGGRLPMTWYPESFTKVPM--NDMNM----RADSSRQYPGRSYRFYTGTQVYGFGH 658
           GD+NP G+LPMT +P S  ++P+  N +N       D  R Y   +Y   +    Y FG+
Sbjct: 563 GDYNPAGKLPMT-FPRSEGQLPIYYNFLNTGRPAENDKDRFYRS-AYNDLSIYPKYAFGY 620

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYTN+ Y  L   +E                               T     +  +  
Sbjct: 621 GLSYTNFKYDNLKLSAE-------------------------------TMNMDGKITLSF 649

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
           SVTN G+  G  VV L+ R    S   P K+L  F ++      +KEI+F +D  E+LS 
Sbjct: 650 SVTNTGNYTGEEVVQLYLRDEFASLVRPVKELKDFKKILLKPGETKEINFVIDK-EKLSF 708

Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
            N+    +   G   +M+G     + +++
Sbjct: 709 YNQKLEWVAEPGTFQVMIGSASDDIRLKS 737


>gi|409196594|ref|ZP_11225257.1| beta-glucosidase [Marinilabilia salmonicolor JCM 21150]
          Length = 876

 Score =  248 bits (634), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 149/448 (33%), Positives = 228/448 (50%), Gaps = 55/448 (12%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N SLS   R K LIS LT++EK   L D + AIPRLGI  + WWSE+LHG A+N  
Sbjct: 33  YPFQNPSLSFEERVKDLISRLTVEEKAALLCDQSDAIPRLGIEKFNWWSEALHGYANN-- 90

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA---------GL 162
                    SVT FPQ +  AASFN  L  +I +A + E RA Y+  Q           L
Sbjct: 91  --------DSVTVFPQPVGMAASFNDELVFDIFNATSDEGRAKYHQAQRRGEENRRFLSL 142

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREK 221
           + W PN+NIFRDPRWGRGQET GEDP + S   V+ VK  QG E+ K         +R+ 
Sbjct: 143 SVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQGPEDAK---------YRK- 192

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFR 280
                            L AC KH   +   +W   SR+  N   ++ ++  +T+ P F+
Sbjct: 193 -----------------LLACAKHYTVHSGPEW---SRHELNVNDVSPREFYETYMPAFK 232

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
           + ++      +MC+Y++++  P C    + Q+  R+EWG++  + +DC A+         
Sbjct: 233 ALVQDADVRQVMCAYHRLDDEPCCSNTRILQRILRDEWGYEHLVAADCGAITDFHTTHGI 292

Query: 340 TKTHEDSAAGVLKAGMDINC--GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           + T   +AA  L AG D+ C       +    A++K  ++E+ ID +L  +   +  LG 
Sbjct: 293 SSTPVHAAATGLLAGTDLECIWDNYHYKMLPEALEKDLIKEEHIDSSLFRVLMGRFELGE 352

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + D     + +     +   EH++LA   A+Q + LL+N    LPL+K++   +A+IGP
Sbjct: 353 MDDDAIV-PWAQTPASVLNNKEHRELAYKMAQQSMTLLQNKNNVLPLDKSSTQKIAVIGP 411

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEA 485
             ++   + G Y G P    ++L G+++
Sbjct: 412 NADDEEVLWGNYNGTPIRTITILDGMKS 439



 Score = 87.0 bits (214), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 68/263 (25%), Positives = 110/263 (41%), Gaps = 51/263 (19%)

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR  + LP  Q + + ++    K  + +  +G     ++         +IL   Y GE+G
Sbjct: 643 DRTHIELPSVQRNALKALKEAGKTVIFVNFSGSA---IALEPETESCDAILQAWYAGESG 699

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
            +A+A+++FGD+NP G+LP+T+Y  S       D +M         GR+YR YT   ++ 
Sbjct: 700 GQAIADVLFGDYNPSGKLPLTFYRNSDNLGDFEDYSME--------GRTYR-YTNDHLFP 750

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG GLSYTN+                           Q G+      + + T        
Sbjct: 751 FGFGLSYTNF---------------------------QIGN----AKLSKATITPDETIS 779

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC-- 773
           + I V N G  DG+ ++ ++ R      G P K L GF+RV   A  + + +  + P   
Sbjct: 780 INIPVKNTGKRDGTEIIQVYVRKVNDIDG-PIKTLKGFERVDVPAGKTTQANISLPPSSF 838

Query: 774 -----EQLSIANKHGRRILPLGN 791
                 Q  I    G+ I+  GN
Sbjct: 839 EFYDWSQRKITVTPGQYIVYYGN 861


>gi|329957143|ref|ZP_08297710.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523411|gb|EGF50510.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 803

 Score =  248 bits (634), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 211/773 (27%), Positives = 341/773 (44%), Gaps = 133/773 (17%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N + +++T  +  +                    RLGIP  ++ +E +HG+     
Sbjct: 121 YPFSNHAEAVNTVQRWFVEET-----------------RLGIPV-DFTNEGIHGLNH--- 159

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT-FWAPNIN 170
                   +  T  P  +   +++N+ L    G     EA+A+      G T  +AP ++
Sbjct: 160 --------TKATPLPAPIAIGSTWNKELVRRAGVIAGQEAKAL------GYTNVYAPILD 205

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           + RDPRWGR  E  GE+P +++A   E V   Q +                         
Sbjct: 206 VVRDPRWGRTLECYGEEPFLIAALGTEMVNGIQSQG------------------------ 241

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                   ++A  KH   Y + K G       +  +  ++  + F  PF+  I+      
Sbjct: 242 --------VAATLKHYAVYSVPKGGRDGHCRTDPHVAPRELHELFLYPFKKVIQNSHPMG 293

Query: 291 IMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +M SYN  +GVP       L +  R E+GF GY+ SD  AV  +    +   T++++   
Sbjct: 294 VMSSYNDWDGVPVSASYYFLTELLREEYGFDGYVVSDSQAVEFVESKHHVADTYDEAVRQ 353

Query: 350 VLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           VL+AG+++       +  +   +  +++ K+    ID+ +  +  V+ RLGLF+  P   
Sbjct: 354 VLEAGLNVRTHFTPPSDFILPIRRLLEEKKISMATIDKRVSEVLRVKFRLGLFD-RPYVT 412

Query: 406 KYGKLGPDDVCTSEHK-KLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
             G    D+V  ++       +  +Q +VLLKN+   LPL+K  +  + + GPL +  + 
Sbjct: 413 DTG--AADNVGGADRNMDFVKEMQQQALVLLKNENNILPLDKQRIKKVLVTGPLADEDNF 470

Query: 465 MGGGYTGIPCSPKSLLRGLEAY---VSKTHYASGCH------------DVPCNS--DAGF 507
           M   Y        ++L GL AY   V++  YA GC              VP N     G 
Sbjct: 471 MTSRYGPNGLETVTVLAGLRAYLQGVAEVDYAKGCDIVDAGWPATEILPVPMNEREKRGI 530

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
            EAV  A ++D VI V G D  +  E R R SL LPG+Q  L+ ++  T K PVILVL  
Sbjct: 531 AEAVAKAGESDVVIAVLGEDEYRTGESRSRTSLDLPGRQQQLLEALHATGK-PVILVLIN 589

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PL V++A A   I +IL   +PG  G   +AE +FG+ NPGG+L +T +P+S  ++ +
Sbjct: 590 GQPLTVNWANA--YIPAILESWFPGCQGGTVIAETLFGEHNPGGKLTVT-FPKSVGQIEL 646

Query: 628 N---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
           N            +  P  S       ++Y FG GLSYT ++Y      S+L +S     
Sbjct: 647 NFPFKPGSHGSQPKSGPNGSGATRVIGELYPFGFGLSYTTFAY------SDLEVSP---- 696

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
                 L+Q                T   + V+++VTN G   G  VV L+ R    S  
Sbjct: 697 ------LRQR---------------TQGEYTVKVNVTNTGKRAGDEVVQLYVRDKVSSVI 735

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           T + QL GF+RV      +++++F + P E L I +++    +  G   +M+G
Sbjct: 736 TYDSQLRGFERVSLKPGETRQVTFSLKP-EDLQILDRNMNWTVEPGEFEVMIG 787


>gi|418855797|ref|ZP_13410448.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19593]
 gi|392820838|gb|EJA76679.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19593]
          Length = 765

 Score =  248 bits (634), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 211/749 (28%), Positives = 347/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYSFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++      SK +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGESKTVSFPID-IEALKFWNQQ 736


>gi|298387490|ref|ZP_06997042.1| beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298259697|gb|EFI02569.1| beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 853

 Score =  248 bits (634), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 155/447 (34%), Positives = 226/447 (50%), Gaps = 57/447 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N +  +  R   LIS LT++EKI  L   +  IPRLGI  Y   +E+LHG+   G   
Sbjct: 30  YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 86

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  AA++N  L   + + ++ EARA +N    G          LT
Sbjct: 87  -------RFTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 139

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FV   QG                   
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQG------------------- 180

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D    L + +  KH  A + E     +R+  N  I+E+   + + P F  C+
Sbjct: 181 --------DDPHYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 228

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++GKA+ IM +YN +N VP  L   L QK  R +WGF+GY+ SDC   A +     Y KT
Sbjct: 229 KEGKAASIMSAYNALNDVPCTLNSWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKT 288

Query: 343 HEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   +KAG+D+ CG  +      +A  +  V + DID A  ++ + +++LGLF+  
Sbjct: 289 KEAAATLSIKAGLDLECGDDVYDGPLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDSG 348

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
            R   Y K+ P  + + EH+++ALDAARQ IVLLKN K  LPLN + + S+A++G  +N 
Sbjct: 349 ER-NPYTKISPSVIGSKEHQQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVG--INA 405

Query: 462 ISQMGGGYTGIP-CSPKSLLRGLEAYV 487
                G Y+G P   P S+L+G+   V
Sbjct: 406 GKCEFGDYSGAPVVEPVSILQGIRNRV 432



 Score =  130 bits (326), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 86/302 (28%), Positives = 148/302 (49%), Gaps = 47/302 (15%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +  ++ + V+ V G++ + E E +DR  + LP  Q   +  + + +   +I+VL 
Sbjct: 594 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 652

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L V++   D  + +I+   YPGE G  A+AE++FGD+NP GRLP+T+Y       P
Sbjct: 653 AGSSLAVNWM--DEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKSLDELPP 710

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
            +D ++         GR+Y+++ G  +Y FG+GLSY++++Y  L           +K G 
Sbjct: 711 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYSSFTYSDL----------QVKDGG 753

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                            DEVT        V   + N G  +G  V  ++ R+P+     P
Sbjct: 754 -----------------DEVT--------VSFRLKNTGKRNGDEVAQVYVRIPETGGIVP 788

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVGELRHSLTI 805
            K+L GF RV   +  S+ +   +D  EQL   + + G+ ++P G   +MVG     + +
Sbjct: 789 LKELKGFRRVPLKSGESRRVEIKLDK-EQLRYWDVEKGQFVVPKGAFDVMVGASSKDIRL 847

Query: 806 ET 807
           +T
Sbjct: 848 QT 849


>gi|194446334|ref|YP_002041437.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Newport str. SL254]
 gi|194469038|ref|ZP_03075022.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Kentucky str. CVM29188]
 gi|195874002|ref|ZP_02700028.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. SL317]
 gi|205358223|ref|ZP_02654575.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Kentucky str. CDC 191]
 gi|205359498|ref|ZP_02830128.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Weltevreden str. HI_N05-537]
 gi|417374418|ref|ZP_12144175.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Inverness str. R8-3668]
 gi|418765082|ref|ZP_13321175.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35199]
 gi|418772136|ref|ZP_13328140.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21539]
 gi|418825678|ref|ZP_13380951.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22462]
 gi|194404997|gb|ACF65219.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. SL254]
 gi|194455402|gb|EDX44241.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Kentucky str. CVM29188]
 gi|195631424|gb|EDX49984.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. SL317]
 gi|205335699|gb|EDZ22463.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Kentucky str. CDC 191]
 gi|205344851|gb|EDZ31615.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Weltevreden str. HI_N05-537]
 gi|353599534|gb|EHC55680.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Inverness str. R8-3668]
 gi|392732250|gb|EIZ89461.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21539]
 gi|392742031|gb|EIZ99126.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35199]
 gi|392812566|gb|EJA68550.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22462]
          Length = 755

 Score =  248 bits (634), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 211/749 (28%), Positives = 347/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 76  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++      SK +SF +D  E L   N+ 
Sbjct: 699 EKITLKPGESKTVSFPID-IEALKFWNQQ 726


>gi|410096880|ref|ZP_11291865.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225497|gb|EKN18416.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 799

 Score =  248 bits (634), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 213/772 (27%), Positives = 328/772 (42%), Gaps = 131/772 (16%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N + +I+   +  I                    RLGIP  ++ +E +HG+     
Sbjct: 118 YPFSNHAEAINKTQRWFIEET-----------------RLGIPV-DFSNEGIHGLNH--- 156

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM-YNLGQAGLTFWAPNIN 170
                   +  T  P  +   +++NR L    G     EA+A+ YN        +AP ++
Sbjct: 157 --------TKATPLPAPINIGSTWNRDLVHQAGDIAGKEAKALGYN------NVYAPILD 202

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           + RDPRWGR  ET GEDP +V    ++ VK  Q                           
Sbjct: 203 VARDPRWGRVLETYGEDPYLVGELGIQMVKGIQQNG------------------------ 238

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                   +++  KH   Y + K G  +    +  +  ++  +    PF+  +++     
Sbjct: 239 --------VASTLKHFAVYSIPKGGRDAAVRTDPHVAPRELHEIHLYPFKRVVQKAHPKG 290

Query: 291 IMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +M SYN  +GVP       L Q  R E+GFKGYI SD +AV  +    +   ++E++   
Sbjct: 291 VMSSYNDWDGVPVTASYYFLTQLLRQEYGFKGYIVSDSEAVEFVQTKHHVADSYEEAVRQ 350

Query: 350 VLKAGMDINCGTCMLRH----TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKG 405
           V++AG+++       +      +  + +GK+  K +DR + ++  V+  LGLF  D    
Sbjct: 351 VVEAGLNVRTNFTHPKDYILPVRKLVKEGKLSMKSVDRMVADVLRVKFELGLF--DSPYV 408

Query: 406 KYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQM 465
           K  K     V   +H+   LD  +Q +VLLKN+   LPL+KN    + I GPL    + M
Sbjct: 409 KDPKAADKIVGADKHRDFVLDMQKQSLVLLKNENNLLPLDKNQTKKVLIAGPLAKETNYM 468

Query: 466 GGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVPCN--------------SDAGFH 508
              Y        ++  G++ Y+   ++  YA GC     N                 G  
Sbjct: 469 ISRYGPQGLDNITVYDGIKDYLGNQTEVVYAKGCEVKDANWPDSEIVPTPLTDEEKKGIA 528

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
           EA   A   D +I V G D +   E + R  L LPG+Q  L+ ++  T K PV+LVL  G
Sbjct: 529 EAATAAADCDVIIAVLGEDESCTGESKSRTGLDLPGRQQQLLEALHATGK-PVVLVLING 587

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMN 628
            PL +++A  D  I SIL   +PG+ G +A+A+ +FGD+NPGGRL +T +P S  ++  N
Sbjct: 588 QPLTINWA--DRNIPSILEAWFPGQLGGEAIAQTLFGDYNPGGRLSVT-FPRSIGQIEFN 644

Query: 629 DMNMRADSSRQY---PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
                     QY   P  S R      +Y FG+GLSYT ++Y  LS   E   S S    
Sbjct: 645 FPFKPGSQDGQYFEGPNGSGRTRVNGALYPFGYGLSYTTFAYSNLSVKQETPYSQSPVT- 703

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
                                         V + VTN G   G  VV L+ R    S   
Sbjct: 704 ------------------------------VTVDVTNTGKRAGDEVVQLYIRDKVSSVIA 733

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            E  L GF+R+      +K +SF + P E L I ++H    +  G   + +G
Sbjct: 734 YESVLRGFERISLQPGETKTVSFVLLP-EDLQILDRHMEWTVEPGEFEVRIG 784


>gi|153807033|ref|ZP_01959701.1| hypothetical protein BACCAC_01310 [Bacteroides caccae ATCC 43185]
 gi|423219984|ref|ZP_17206480.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
           CL03T12C61]
 gi|149130153|gb|EDM21363.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           caccae ATCC 43185]
 gi|392624247|gb|EIY18340.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
           CL03T12C61]
          Length = 786

 Score =  248 bits (633), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 207/735 (28%), Positives = 337/735 (45%), Gaps = 127/735 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  N             T FP      A++N+ L   I    A 
Sbjct: 137 RLGIPV-DFTNEGIRGLCHN-----------RATMFPAQCGQGATWNKKLIREIAKVTAD 184

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  +AP ++I +DPRWGR  E+ GEDP +      + +   Q E   
Sbjct: 185 EAKAL------GYTNIYAPILDIAQDPRWGRVVESYGEDPYLAGELGKQMILGLQAEG-- 236

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         L+A  KH   Y +   G       +  +  
Sbjct: 237 ------------------------------LAATPKHFAVYSIPVGGRDGGTRTDPHVAP 266

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           ++ +  +  PFR  I++  A  +M SYN  +G P         +  R +WGFKGY+ SD 
Sbjct: 267 REMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDS 326

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDR 383
           +AV  +      T T E+ AA V+ AG++I          +   + AI +GK+    +D+
Sbjct: 327 EAVEFLHTKHRITPTEEEMAAQVVNAGLNIRTNFTPPQDFILPLRRAISEGKISLHTLDQ 386

Query: 384 ALLNLFSVQLRLGLFN----GDPRKGKYGKLGPDDVC-TSEHKKLALDAARQGIVLLKND 438
            +  +  V+  LGLF+    GD R        P+ V   + H+++++ AA + IVLLKN+
Sbjct: 387 RVGEILRVKFMLGLFDNPYPGDDRH-------PETVVHNAAHQEVSMKAALESIVLLKNE 439

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
            + LPL+K +++ +A+IGP    + ++   Y       K++ +G++ Y+  ++  YA GC
Sbjct: 440 NQMLPLSK-SLNKIAVIGPNAEEVKELTCRYGPAHAPIKTVYQGIKEYLPNAEVSYAKGC 498

Query: 497 ------------HDVPCNS--DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       ++VP ++   A  +EAV +AK +D  I+V G +     E+  R SL L
Sbjct: 499 NIIDKYFPESELYNVPLDTQEQAMINEAVELAKVSDIAILVLGGNEKTVREEFSRTSLDL 558

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PV+LV+  G    +++  A+  + +I+   +PGE    A+A++
Sbjct: 559 CGRQQQLLEAVYATGK-PVVLVMVDGRAATINW--ANKYVPAIVHAWFPGEFMGNAIAKV 615

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSY 662
           +FGD+NPGGRL +T +P+S  +VP          S+       R      +Y FG+GLSY
Sbjct: 616 LFGDYNPGGRLAVT-FPKSVGQVPFAFPFKPGSDSKG------RVRVDGVLYPFGYGLSY 668

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S L IS  +  G  +N+                 SC          V N
Sbjct: 669 TTFEY------SALKISKPV-IGPQENM---------------TLSCI---------VKN 697

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  VV L+ R    S  T +K L GF+R+H      + ISF + P + L + +K+
Sbjct: 698 TGKRAGDEVVQLYIRDDFSSVTTYDKMLRGFERIHLQPGEEQTISFTLTP-QDLGLWDKN 756

Query: 783 GRRILPLGNHVLMVG 797
            +  +  G+  +M+G
Sbjct: 757 NQFTVEPGSFSIMIG 771


>gi|293371041|ref|ZP_06617583.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292633971|gb|EFF52518.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 791

 Score =  248 bits (633), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 210/754 (27%), Positives = 331/754 (43%), Gaps = 141/754 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G           +T FP  +  AA+++  L   +G  +A 
Sbjct: 138 RLGIPMF-LAEEAPHGHMAIG-----------ITVFPTGIGMAATWSPELVKEVGQVIAK 185

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R+     Q G   + P +++ RDPRW R +ET GEDP++        V      N   
Sbjct: 186 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGAAMVDGLINGNISR 240

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
            +  I                          A  KH +AY + + G        A++  +
Sbjct: 241 KNSTI--------------------------ATLKHFLAYAVPEGGQNGN---QALVGMR 271

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
           +  + F PPF+  I+ G  S +M SYN ++G+P      L  Q  RNEW F+G++ SD  
Sbjct: 272 ELHENFLPPFKKAIDAGALS-VMTSYNSIDGIPCTANSYLLNQLLRNEWKFRGFVVSDLY 330

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I+E      + ED+A   + AG+D++ G     +   A+ + ++ E  ID  +  +
Sbjct: 331 SIEGIYESHYTASSIEDAAIQAVSAGVDVDLGGEAYTNIYRAVKEKRLSEAIIDEVVCRV 390

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP      ++  + V  + H   A   A+  + LLKN    LPL+
Sbjct: 391 LRLKFEMGLFENPYVDP------QIAIERVRNANHIANARRMAQASVTLLKNRHDILPLS 444

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK-----THYASGC--HD 498
           KN +  +A+IGP  +N   M G YT  P   +++   L+  +SK       Y  GC   D
Sbjct: 445 KN-IRKVAVIGPNADNCYNMLGDYTA-PQKDENIKTVLDGIISKLSLSRVEYVRGCAIRD 502

Query: 499 VPCNSDAGFHEAVRIAKKADFVIVVAG------------------LDLTQET-----EDR 535
              N  A   +AV  A +AD VI V G                   D +Q +     E  
Sbjct: 503 TTNNEIA---KAVEAANRADVVIAVVGGSSARDFKTTYKETGAAIADKSQISDMECGEGF 559

Query: 536 DRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAG 595
           DR +L L G+Q+ L+ S+ +++++P+I+V   G PL+ ++A   +   ++L   YPG+ G
Sbjct: 560 DRATLSLLGKQLELLESL-KSTRKPLIVVYIEGRPLNKNWAAEHA--DALLTAYYPGQEG 616

Query: 596 AKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYG 655
             A+A+++FGD+NP GRLP++  P S  ++P+   N +      Y   S      + +Y 
Sbjct: 617 GDAIADVLFGDYNPAGRLPVS-VPRSEGQIPVY-YNKKTPKCHDYVEMS-----ASPLYS 669

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSY+ + Y  L                                  +VT    L F 
Sbjct: 670 FGYGLSYSTFEYSNL----------------------------------KVTQQAPLHFE 695

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           +   V N G  DG  V  L+ R    S     +QL  F R        K I F +   E 
Sbjct: 696 ISFDVENTGKYDGEEVAQLYIRDEYASVVRALRQLKHFKRFFLKQGEKKTIVFTLVE-ED 754

Query: 776 LSIANKHGRRILPLGNHVLMVG----ELRHSLTI 805
           LSI N+   RI+  G+  LM+G    ++R S TI
Sbjct: 755 LSIINQKMERIVEPGSFQLMIGAASDDIRLSKTI 788


>gi|427387416|ref|ZP_18883472.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725577|gb|EKU88448.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
           12058]
          Length = 733

 Score =  248 bits (633), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 206/739 (27%), Positives = 344/739 (46%), Gaps = 102/739 (13%)

Query: 67  SLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFP 126
           SLI L T  +   Q+   A    RLGIP   +  + +HG+                T +P
Sbjct: 79  SLIYLHTDPKLRNQIQRKAMEESRLGIPIL-FGFDVIHGLR---------------TVYP 122

Query: 127 QVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGE 186
             L  A SFN  L +      A E+     L     TF +P I++ RDPRWGR  E  GE
Sbjct: 123 ISLAQACSFNPDLVTQACGMAAKESV----LSGIDWTF-SPMIDVARDPRWGRISECYGE 177

Query: 187 DPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHL 246
           DP + + + V  V+ +QGE                             D   ++AC KH 
Sbjct: 178 DPYLNTVFGVASVQGYQGEKL--------------------------SDPYSIAACLKHY 211

Query: 247 IAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLR 306
           + Y   + G   RY+    I+ Q   +T+ PP+ +C++ G A+ +M S+N ++GVPA   
Sbjct: 212 VGYGASEGGRDYRYT---DISPQALWETYLPPYEACVKAG-AATLMSSFNDISGVPATSN 267

Query: 307 GDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC-GTCML 364
             +  +  +N+W   G++ SD +A+  +  YQ   K  +++A     AG++++       
Sbjct: 268 HYILTEILKNKWRHDGFVVSDWNAIEQLI-YQGVAKDRKEAAYKAFHAGVEMDMRDNIYY 326

Query: 365 RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLA 424
            + +  + + K+Q   ID A+  +  V+ RLGLF+ +P   +  +   +     E   LA
Sbjct: 327 EYLEQLVAEKKIQMSQIDDAVARILRVKFRLGLFD-EPYTKELTE--QERYLQKEDIALA 383

Query: 425 LDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG--YTGIPCSPKSLLRG 482
              A + +VLLKN+   LPL+ + V  +A+IGP+  + + + G   + G     +++  G
Sbjct: 384 ARLAEESMVLLKNENNLLPLS-STVKRVALIGPMAKDSANLLGAWAFKGHAEDVETIYEG 442

Query: 483 LEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVS 539
           ++       +  Y  GC  +  N ++GF  A++ A+ +D V+V  G       E+  R +
Sbjct: 443 MQKEFGDKVQLDYEQGCA-LDGNDESGFSAALKTAEASDVVVVCLGESKQWSGENASRST 501

Query: 540 LLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKAL 599
           + LP  Q  L+  + + +K P++LVL+ G PL++     + Q+ +I+ +  PG AG   L
Sbjct: 502 IALPDIQEKLLLHLKQANK-PIVLVLSSGRPLEL--IRLEPQVEAIIEMWQPGVAGGTPL 558

Query: 600 AEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGR-SYRFYTGTQVYGFGH 658
           A I+ G  NP G+L +T +P S  ++P+   NMR  S+R +     Y+      +Y FGH
Sbjct: 559 AGILSGRVNPSGKLSVT-FPLSTGQIPVY-YNMR-QSARPFDAMGDYQDIPTKPLYPFGH 615

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT + Y      S+  +S SLK   ++ I                          ++
Sbjct: 616 GLSYTTFVY------SDAKLS-SLKIRKNQKIT------------------------AEV 644

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
           +VTNAG ++G   V+ +   P  S   P K+L  F++    A  S+   F +DP   LS 
Sbjct: 645 TVTNAGKMEGKETVLWYVSDPFCSISRPMKELKFFEKHSLNAGESRVFRFEIDPMRDLSY 704

Query: 779 ANKHGRRILPLGNHVLMVG 797
            +  G+R L  G  ++ VG
Sbjct: 705 TDATGKRFLEAGEFIVSVG 723


>gi|380692997|ref|ZP_09857856.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 837

 Score =  248 bits (633), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 158/447 (35%), Positives = 222/447 (49%), Gaps = 57/447 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N +  I  R + L+S LT++EK+  L   +  I R+GI  Y   +E+LHGI   G   
Sbjct: 14  YKNMNAPIHERVQDLLSKLTIEEKVSLLRATSPGIERMGIDKYYMGNEALHGIIRPG--- 70

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  A+ +N  L   I   ++ EARA +N  + G          LT
Sbjct: 71  -------KFTVFPQAIGLASMWNPELHHIIAGVISDEARARWNELERGKKQKDQFSDLLT 123

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FVK  QG++               R 
Sbjct: 124 FWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGDH--------------PRY 169

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
           LK               A  KH  A + E     +R+  +A ITE D  + + P F  CI
Sbjct: 170 LKAV-------------ATPKHFAANNEEH----NRFYCDAAITETDLREYYFPAFEKCI 212

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
            +GKA  IM +YN +NGVP      L  K  + +WGF GYI SDC A   +     Y KT
Sbjct: 213 REGKAESIMTAYNAINGVPCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVKT 272

Query: 343 HEDSAAGVLKAGMDINCGTCMLRHTQ-SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   +KAG+D+ CG  +  +   +A  +  V   +ID A   +   ++RLG+F+ D
Sbjct: 273 PEAAAMIAIKAGLDVECGDYVFANPLLNAYKQYMVSAAEIDSAAYRVLRARMRLGMFD-D 331

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           P K  Y  L P+ V   +H  LAL+AARQ IVLLKN +  LPLN   + S+A++G  +N 
Sbjct: 332 PEKNPYNHLSPEIVGCKKHHDLALEAARQSIVLLKNQQNTLPLNAQKIKSIAVVG--INA 389

Query: 462 ISQMGGGYTGIPC-SPKSLLRGLEAYV 487
            +   G Y+G P  +P S+L G+   V
Sbjct: 390 ANCEFGDYSGTPVNAPVSVLDGIRNRV 416



 Score =  137 bits (345), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 85/302 (28%), Positives = 150/302 (49%), Gaps = 49/302 (16%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + +A +I +++D VI V G++ + E E +DR S+ LP  Q   +    + +   +++++ 
Sbjct: 578 YGDASKIIRESDVVIAVMGINQSIEREGQDRNSIELPKDQQIFIREAYKANPNTIVVLVA 637

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
           G     ++    D  I +I+   YPGE G  A+AE++FGD+NP GRLP+T+Y  S   +P
Sbjct: 638 GSS---MAIGWMDQHIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFY-NSIEDLP 693

Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
             +D N++ +       R+Y ++ G  +Y FG+GLSYT + Y+ L+              
Sbjct: 694 AFDDYNVKNN-------RTYMYFEGKPLYAFGYGLSYTKFDYRNLN-------------- 732

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
                            I + T   +L F    S+ N+G  +G  V  ++ + P     T
Sbjct: 733 -----------------IKQDTQNVTLNF----SIKNSGKYNGDEVAQVYVKFPDQGIKT 771

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRIL-PLGNHVLMVGELRHSLT 804
           P KQL GF RVH     +++IS  + P E+L + +   ++   P G +  MVG+   ++ 
Sbjct: 772 PLKQLKGFKRVHIKKGATEQISIEI-PKEELRLWDDQKKQFYTPSGTYHFMVGKSSDNIC 830

Query: 805 IE 806
           ++
Sbjct: 831 LQ 832


>gi|423287910|ref|ZP_17266761.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
           CL02T12C04]
 gi|392671925|gb|EIY65396.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
           CL02T12C04]
          Length = 782

 Score =  248 bits (633), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 216/750 (28%), Positives = 334/750 (44%), Gaps = 135/750 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L   +G  +A 
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSLELVKEVGQVIAK 176

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R+     Q G   + P +++ RDPRW R +ET GEDP++        V    G N   
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLGGGNLSQ 231

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
               I                          A  KH +AY + + G    Y   A +  +
Sbjct: 232 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 262

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
           D    F PPFR  I+ G  S +M SYN ++G+P      L  Q  RNEW F G++ SD  
Sbjct: 263 DLHQNFLPPFRKAIDSGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFCGFVVSDLY 321

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I E      T E++A   + AG+D++ G     +   A+  G++ +  ID A+  +
Sbjct: 322 SIEGIHESHFVALTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRV 381

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP      K+    V   EH +LA   A+  I LLKN+   LPL+
Sbjct: 382 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
           K  ++ +A+IGP  +N   M G YT      + K++L G+   +S  +  Y  GC   D 
Sbjct: 436 KT-INKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDT 494

Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
             N      +A++ A++++ VIVV G     D     +ET                E  D
Sbjct: 495 TVNE---IEQAIKAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
           R SL L G+Q  L+ S+ +T K P+I+V   G PL+ ++A       ++L   YPG+ G 
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 608

Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
            A+A+++FGD+NP GRLP++  P S  ++P+   N +A  +  Y   S        +Y F
Sbjct: 609 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNKKAPRNHDYVEMS-----SFPLYSF 661

Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
           G+G+SYT + Y                  SD  ++Q++              C    F V
Sbjct: 662 GYGMSYTTFEY------------------SDLQVVQKSA------------RC----FEV 687

Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
              V N G  DG  V  L+ R    S   P KQL  F+R H      K+++F V   E  
Sbjct: 688 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 746

Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            + N   ++++  GN  LM+G   + + ++
Sbjct: 747 FLVNYTLKKVVESGNFHLMIGAASNDIRLQ 776


>gi|383125190|ref|ZP_09945844.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
 gi|251838523|gb|EES66609.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
          Length = 853

 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 155/447 (34%), Positives = 226/447 (50%), Gaps = 57/447 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N +  +  R   LIS LT++EKI  L   +  IPRLGI  Y   +E+LHG+   G   
Sbjct: 30  YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 86

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  AA++N  L   + + ++ EARA +N    G          LT
Sbjct: 87  -------RFTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 139

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FV   QG                   
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQG------------------- 180

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D    L + +  KH  A + E     +R+  N  I+E+   + + P F  C+
Sbjct: 181 --------DDPHYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 228

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++GKA+ IM +YN +N VP  L   L QK  R +WGF+GY+ SDC   A +     Y KT
Sbjct: 229 KEGKAASIMSAYNALNDVPCTLNPWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKT 288

Query: 343 HEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   +KAG+D+ CG  +      +A  +  V + DID A  ++ + +++LGLF+  
Sbjct: 289 KEAAATLSIKAGLDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDSG 348

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
            R   Y K+ P  + + EH+++ALDAARQ IVLLKN K  LPLN + + S+A++G  +N 
Sbjct: 349 ER-NPYTKISPSVIGSKEHQQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVG--INA 405

Query: 462 ISQMGGGYTGIP-CSPKSLLRGLEAYV 487
                G Y+G P   P S+L+G+   V
Sbjct: 406 GKCEFGDYSGAPVVEPVSILQGIRNRV 432



 Score =  126 bits (316), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 84/302 (27%), Positives = 148/302 (49%), Gaps = 47/302 (15%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +  ++ + V+ V G++ + E E +DR  + LP  Q   +  + + +   +I+VL 
Sbjct: 594 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 652

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L +++   D  I +I+   YPGE G  A+AE++FGD+NP GRLP+T+Y       P
Sbjct: 653 AGSSLAINWM--DEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKSLDELPP 710

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
            +D ++         GR+Y+++ G  +Y FG+GLSY++++Y  L           +K G 
Sbjct: 711 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYSSFTYSDL----------QVKDG- 752

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                           + EVT        V   + N G  +G  V  ++ R+P+     P
Sbjct: 753 ----------------VGEVT--------VSFRLKNTGKRNGDEVAQVYVRIPETGGIVP 788

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVGELRHSLTI 805
            K+L GF RV   +  S+ +   ++  EQL   + + G+ ++P G   +MVG     + +
Sbjct: 789 LKELKGFRRVPLKSGESRRVEIKLNK-EQLRYWDVEKGQFVVPKGAFDVMVGASSKDIRL 847

Query: 806 ET 807
           +T
Sbjct: 848 QT 849


>gi|409250776|ref|YP_006886584.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Weltevreden str. 2007-60-3289-1]
 gi|418760521|ref|ZP_13316675.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35185]
 gi|418775352|ref|ZP_13331310.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 33953]
 gi|418781475|ref|ZP_13337358.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35188]
 gi|418783741|ref|ZP_13339586.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21559]
 gi|418789285|ref|ZP_13345072.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19447]
 gi|418794086|ref|ZP_13349808.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19449]
 gi|418796683|ref|ZP_13352374.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19567]
 gi|418803316|ref|ZP_13358937.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35202]
 gi|418809719|ref|ZP_13365271.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21550]
 gi|418813873|ref|ZP_13369394.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22513]
 gi|418815139|ref|ZP_13370647.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21538]
 gi|418822928|ref|ZP_13378339.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22425]
 gi|418830026|ref|ZP_13384989.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N18486]
 gi|418836408|ref|ZP_13391292.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N1543]
 gi|418840973|ref|ZP_13395796.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21554]
 gi|418844732|ref|ZP_13399518.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19443]
 gi|418850848|ref|ZP_13405564.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 37978]
 gi|418857905|ref|ZP_13412528.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19470]
 gi|418862200|ref|ZP_13416744.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19536]
 gi|419789701|ref|ZP_14315381.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 1]
 gi|419792237|ref|ZP_14317879.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 15]
 gi|320086604|emb|CBY96375.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Weltevreden str. 2007-60-3289-1]
 gi|392615486|gb|EIW97925.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 1]
 gi|392618961|gb|EIX01347.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 15]
 gi|392742601|gb|EIZ99688.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35185]
 gi|392747243|gb|EJA04244.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35188]
 gi|392748425|gb|EJA05411.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 33953]
 gi|392756753|gb|EJA13648.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21559]
 gi|392760526|gb|EJA17361.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19447]
 gi|392762081|gb|EJA18897.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19449]
 gi|392770056|gb|EJA26784.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19567]
 gi|392772845|gb|EJA29542.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 35202]
 gi|392773804|gb|EJA30500.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21550]
 gi|392775099|gb|EJA31794.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 22513]
 gi|392787185|gb|EJA43733.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22425]
 gi|392793518|gb|EJA49962.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21538]
 gi|392800769|gb|EJA56999.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N1543]
 gi|392802259|gb|EJA58473.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM N18486]
 gi|392809639|gb|EJA65673.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 21554]
 gi|392813541|gb|EJA69505.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19443]
 gi|392818698|gb|EJA74582.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 37978]
 gi|392834583|gb|EJA90187.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19470]
 gi|392836652|gb|EJA92232.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Newport str. CVM 19536]
          Length = 765

 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 211/749 (28%), Positives = 347/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++      SK +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGESKTVSFPID-IEALKFWNQQ 736


>gi|423217451|ref|ZP_17203947.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
           CL03T12C61]
 gi|392628610|gb|EIY22636.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
           CL03T12C61]
          Length = 946

 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 207/735 (28%), Positives = 336/735 (45%), Gaps = 115/735 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 153 RLGIPT-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRQLIHQVGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH IAY   K         +  ++ 
Sbjct: 253 -----------------------------QVAATGKHFIAYSNNKGAREGMARVDPQMSP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ E     PF+  I +     +M SYN  +G P       L  + R E GF+GY+ SD 
Sbjct: 284 REVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDS 343

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  ++      K  +++    ++AG+++ C        +   +  + +G + E+ I+ 
Sbjct: 344 DAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVIND 403

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+  +GLF+   +    G    ++V   E++++AL A+R+ IVLLKN+K  LP
Sbjct: 404 RVRDILRVKFLVGLFDTPYQTDLKG--ADEEVEKKENEEVALQASRESIVLLKNEKNVLP 461

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
           L+ + +  +A+ GP  +  S     Y  +     S+L+G++  +   +   Y  GC  V 
Sbjct: 462 LDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADVLYTKGCDLVD 521

Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
            N                    +AV  AK+AD  IVV G       E++ R SL LPG+Q
Sbjct: 522 ANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENKSRSSLDLPGRQ 581

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
           + L+ +V  T K PV+LVL  G PL +++A  D  + +IL   YPG  G  A+A+I+FGD
Sbjct: 582 LDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGIAVADILFGD 638

Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGT-QVYGFGHGLSY 662
           +NPGG+L +T +P++  ++P N     + + D  +  PG           +Y FG+GLSY
Sbjct: 639 YNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKN-PGPDGNMSRANGALYPFGYGLSY 696

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L IS ++   + K                          +V   VTN
Sbjct: 697 TTFEY------SDLKISPAIITPNQKA-------------------------YVTCKVTN 725

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  V+ L+ R    S  T EK L+GF+RVH     +KEI+F +D  + L + N  
Sbjct: 726 TGKRSGDEVIQLYVRDVLSSVTTYEKNLVGFERVHLKPGETKEITFPID-RKALELLNAD 784

Query: 783 GRRILPLGNHVLMVG 797
              ++  G+  LM+G
Sbjct: 785 MHWVVEPGDFTLMLG 799


>gi|161502686|ref|YP_001569798.1| hypothetical protein SARI_00733 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|160864033|gb|ABX20656.1| hypothetical protein SARI_00733 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 771

 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 212/763 (27%), Positives = 352/763 (46%), Gaps = 137/763 (17%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 92  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 135

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 136 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 189

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG+N                            D   +    KH  AY   +
Sbjct: 190 MGETMVKAMQGKN--------------------------PADRYSVMTSVKHFAAYGAVE 223

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N++ ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L ++
Sbjct: 224 GGK----EYNSVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKE 278

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LK+G+D++       ++    
Sbjct: 279 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKSGVDMSMADEYYSKYLPGL 337

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 338 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 392

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 393 AREVARESLVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVADQSVTVLA 450

Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +                     P +  A   EAV+ AK+AD 
Sbjct: 451 GIQNAVGDGAKILYAKGANITNNKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 510

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R  + +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 511 VVAVVGESQGMAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 567

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSS 637
            Q  +IL   + G  G  A+A+++FGD+NP G+LPM+ +P S  ++P+  + +N      
Sbjct: 568 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPMS-FPRSVGQIPVYYSHLNTGRPYD 626

Query: 638 RQYPGR-SYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
            + P + + R++      +Y FG+GLSYT ++       S++T+SA          +++ 
Sbjct: 627 TEKPNKYTSRYFDEANGPLYPFGYGLSYTTFTV------SDVTLSAP--------TMKRD 672

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
           G+         VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF+
Sbjct: 673 GN---------VTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFE 715

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           ++       K +SF +D  E L   N+  +     G   + +G
Sbjct: 716 KITLKPGERKTVSFPID-IEALKFWNQQMKYDAEPGKFNIFIG 757


>gi|146301613|ref|YP_001196204.1| beta-glucosidase [Flavobacterium johnsoniae UW101]
 gi|146156031|gb|ABQ06885.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 864

 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 152/460 (33%), Positives = 236/460 (51%), Gaps = 56/460 (12%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YP+ + SLS   RA+ LIS LTL+EK   + D + AIPRLGI  + WWSE+LHG A+N  
Sbjct: 22  YPYKDPSLSSKKRAEDLISRLTLEEKAALMCDQSDAIPRLGIKKFNWWSEALHGYANN-- 79

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLG-QAG--------L 162
                    +VT FP+ +  AASF+  L   +  AV+ E RA YN   Q G        L
Sbjct: 80  --------DNVTVFPEPIGMAASFDDQLVFRVFDAVSDEGRAKYNQWIQNGNENKRFLSL 131

Query: 163 TFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKR 222
           + W PN+NIFRDPRWGRGQET GEDP + S   V  VK  QG             +R+  
Sbjct: 132 SVWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQGP--------ADAKYRK-- 181

Query: 223 VLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRS 281
                           L AC KH   +   +W   SR+  N   +  ++  +T+ P F++
Sbjct: 182 ----------------LLACAKHFAVHSGPEW---SRHELNLNNVKPRELYETYLPAFKA 222

Query: 282 CIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYT 340
            +++     +MC+Y +++  P C    L Q+  R+EWGF+  + SDC AV   +     +
Sbjct: 223 LVQEADVRQVMCAYQRLDDEPCCSNTRLLQRILRDEWGFQYLVVSDCGAVTDFYTTHKVS 282

Query: 341 KTHEDSAAGVLKAGMDINC--GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
                +A+  + AG D+ C       +    A++K  ++E++I+++LL +   +  LG  
Sbjct: 283 SDEVHAASKAVLAGTDVECVWDKYPFKKLPEAVEKDLIKEEEINKSLLRVLIGRFDLGEM 342

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           + D     + ++    + + EH++LAL+ A++ + LL+N    LPLNKN V+ +A+IGP 
Sbjct: 343 DDDSIV-PWAQIPASVLNSKEHQQLALEMAQKSMTLLQNKNNILPLNKN-VNKVAVIGPN 400

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVS--KTHYASGC 496
            +N   + G Y G P    ++ +G+E  ++  K  Y   C
Sbjct: 401 ADNEPMLWGNYNGTPNKTITIKKGIEGKLAANKVLYDKSC 440



 Score = 82.8 bits (203), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 68/278 (24%), Positives = 120/278 (43%), Gaps = 56/278 (20%)

Query: 504 DAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSV 553
           D  F   ++  K  D VI   GL    E E+           DR ++ LP  Q   +  +
Sbjct: 589 DIDFKALIQKLKGIDTVIFAGGLSTLLEGEEMPVSFPGFKGGDRTNIELPAVQRKCLKEL 648

Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
               K+ + +  +G     ++         +IL   YPGE+G +A+A+++FGD+NP G+L
Sbjct: 649 KAAGKKVIFVNCSGSA---IALTPETESCDAILQAWYPGESGGQAVADVLFGDYNPAGKL 705

Query: 614 PMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
           P+++Y  S       D +++        GR+YR YT   ++ FG GLSY+ +        
Sbjct: 706 PISFYKNSDKLGDFEDYSLK--------GRTYR-YTTDVLFPFGFGLSYSKF-------- 748

Query: 674 SELTISASLKAGS-DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVV 732
                   ++AG+  KN ++                 T L+F    S+ N+    G+ +V
Sbjct: 749 -------DIQAGNLSKNKIKPNED-------------TQLKF----SIKNSSKRSGTEIV 784

Query: 733 MLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
            ++ R    ++G P K L  F R+   A   + ++  +
Sbjct: 785 QVYVRKLNDTEG-PLKTLKAFKRIELKAGEKQNVTIDL 821


>gi|29347188|ref|NP_810691.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339087|gb|AAO76885.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 853

 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 154/447 (34%), Positives = 226/447 (50%), Gaps = 57/447 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N +  +  R   LIS LT++EKI  L   +  IPRLGI  Y   +E+LHG+   G   
Sbjct: 30  YKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGVVRPG--- 86

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  AA++N  L   + + ++ EARA +N    G          LT
Sbjct: 87  -------RFTVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVLT 139

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FV   QG                   
Sbjct: 140 FWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQG------------------- 180

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
                   D    L + +  KH  A + E     +R+  N  I+E+   + + P F  C+
Sbjct: 181 --------DDPHYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCV 228

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++GKA+ IM +YN +N VP  L   L QK  R +WGF+GY+ SDC   A +     Y KT
Sbjct: 229 KEGKAASIMSAYNALNDVPCTLNPWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVKT 288

Query: 343 HEDSAAGVLKAGMDINCGTCMLR-HTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   +KAG+D+ CG  +      +A  +  V + DID A  ++ + +++LGLF+  
Sbjct: 289 KEAAATLSIKAGLDLECGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDSG 348

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
            R   Y K+ P  + + EH+++ALDAARQ +VLLKN K  LPLN + + S+A++G  +N 
Sbjct: 349 ER-NPYTKISPSVIGSKEHQQIALDAARQCVVLLKNQKNRLPLNADKLKSIAVVG--INA 405

Query: 462 ISQMGGGYTGIP-CSPKSLLRGLEAYV 487
                G Y+G P   P S+L+G+   V
Sbjct: 406 GKCEFGDYSGAPVVEPVSILQGIRNRV 432



 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 84/305 (27%), Positives = 146/305 (47%), Gaps = 53/305 (17%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +  ++ + V+ V G++ + E E +DR  + LP  Q   +  + + +   +I+VL 
Sbjct: 594 YGEAGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNPN-IIVVLV 652

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L +++   D  I +I+   YPGE G  A+AE++FGD+NP GRLP+T+Y       P
Sbjct: 653 AGSSLAINWM--DEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKSLDELPP 710

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPS---ELTISASLK 683
            +D ++         GR+Y+++ G  +Y FG+GLSY++++Y  L       E+T+S  LK
Sbjct: 711 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYSSFTYSDLQVKDGGGEVTVSFRLK 763

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
                                                 N G  +G  V  ++ R+P+   
Sbjct: 764 --------------------------------------NTGKRNGDEVAQVYVRIPETGG 785

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN-KHGRRILPLGNHVLMVGELRHS 802
             P K+L GF RV   +  S+ +   +D  EQL   + + G+ ++P G   +MVG     
Sbjct: 786 IVPLKELKGFRRVPLKSGESRRVEIKLDK-EQLRYWDVEKGQFVVPKGAFDVMVGASSKD 844

Query: 803 LTIET 807
           + ++T
Sbjct: 845 IRLQT 849


>gi|53714352|ref|YP_100344.1| beta-glucosidase [Bacteroides fragilis YCH46]
 gi|52217217|dbj|BAD49810.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
          Length = 859

 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 216/759 (28%), Positives = 339/759 (44%), Gaps = 123/759 (16%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           LTL  ++Q+     +   RLGIP +   +ESLHG   +G            T FPQ +  
Sbjct: 95  LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
            ++FN  L   + SA+A E  A     Q       P I++ RD RWGR +E  GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELTA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPYLV 194

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S   V  V+                G+ + +V                S   KH  A+  
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222

Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
            + G N +  S      +++    +   F + +++ K   +M SYN  N  P      L 
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278

Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
            +  R+ W F+GY+ SD  A+  +  +    +   ++A   L AG+D           Q 
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
            ++ G +  K ID+A+  + + +  +GLF    P +  Y K+    V    H  LA   A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394

Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
            + IVLL+N+   LPL  N + S+A+IGP  + +    G YT    +    +LL  L+  
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452

Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
           V      +YA GC D+  +  +GF EAV +AKK+D  IVV G        D +  T  E 
Sbjct: 453 VGNQLTLNYAKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            D   L L G Q  LV ++  T K PVI+VL  G PL +S+ + +  I  I+   YPGE 
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPLAMSWIKEN--IPGIVVQWYPGEQ 568

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
           G  ALA+++ G  NP G+L  + +P+S       +  +P +    R+  S+  PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
            +   ++ FGHGLSYT++ Y                       L  T S+ DY       
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------- 657

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           +C  +   V I++ N GD DG  V  ++ R    S   P ++L GF++V      +K++ 
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVI 716

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             + P  +L++ NK  ++++  G   L +G     + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754


>gi|197248682|ref|YP_002147137.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Agona
           str. SL483]
 gi|440763077|ref|ZP_20942125.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH11G1113]
 gi|440769230|ref|ZP_20948190.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH08SF124]
 gi|440771374|ref|ZP_20950293.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH10GFN094]
 gi|197212385|gb|ACH49782.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Agona str. SL483]
 gi|436415218|gb|ELP13139.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH08SF124]
 gi|436421310|gb|ELP19157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH11G1113]
 gi|436422409|gb|ELP20247.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Agona str. SH10GFN094]
          Length = 765

 Score =  247 bits (631), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 208/749 (27%), Positives = 346/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP +   ++       H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++      SK +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGESKTVSFPID-IEALKFWNQQ 736


>gi|383114908|ref|ZP_09935668.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
 gi|382948422|gb|EIC71783.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
          Length = 782

 Score =  247 bits (631), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 214/750 (28%), Positives = 332/750 (44%), Gaps = 135/750 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L   +G  +A 
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R+     Q G   + P +++ RDPRW R +ET GEDP++        V    G N   
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLGGGNLSQ 231

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
               I                          A  KH +AY + + G    Y   A +  +
Sbjct: 232 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 262

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
           D    F PPFR  I+ G  S +M SYN ++G+P      L  Q  RNEW F+G++ SD  
Sbjct: 263 DLHQNFLPPFRKAIDSGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLY 321

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I E      T E++A   + AG+D++ G     +   A+  G++ +  ID A+  +
Sbjct: 322 SIEGIHESHFVALTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKAVIDTAVCRV 381

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP      K+    V   EH +LA   A+  I LLKN+   LPL+
Sbjct: 382 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
           K  ++ +A+IGP  +N   M G YT      + K++L G+   +S  +  Y  GC   D 
Sbjct: 436 KT-INKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGIITKLSPSRVEYVRGCAIRDT 494

Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
             N      +A+  A++++ VIVV G     D     +ET                E  D
Sbjct: 495 TVNE---IEQAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
           R SL L G+Q  L+ S+ +T K P+I+V   G PL+ ++A       ++L   YPG+ G 
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 608

Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
            A+A+++FGD+NP GRLP++  P S  ++P+   N +A  +  Y   S      + +Y F
Sbjct: 609 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNKKAPRNHDYVEVS-----SSPLYSF 661

Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
           G+G+SYT + Y  L                                  +V   ++  F V
Sbjct: 662 GYGMSYTTFEYSAL----------------------------------QVVQKSARCFEV 687

Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
              V N G  DG  V  L+ R    S   P KQL  F+R H      K+++F V   E  
Sbjct: 688 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 746

Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            + N   ++++  GN  LM+G   + + ++
Sbjct: 747 FLVNYTLKKVVESGNFHLMIGAASNDIRLQ 776


>gi|227536644|ref|ZP_03966693.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227243445|gb|EEI93460.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 777

 Score =  247 bits (631), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 202/731 (27%), Positives = 325/731 (44%), Gaps = 129/731 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  A+++N +L   + + VA 
Sbjct: 126 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGQASTWNPALLQKMSATVAK 173

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R      Q     + P +++ RDPRW R +E+ GEDP++    A   V          
Sbjct: 174 EVRQ-----QGAHISYGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVT--------- 219

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
              G+G G              +  D        KH +AY + + G+       A I E+
Sbjct: 220 ---GLGSG--------------NLSDPFATIPTLKHFVAYGIPEGGHNGS---AASIGER 259

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           +  + F PPF+S +  G A  +M +YN V+G+P      L     R EW F G+  SD  
Sbjct: 260 ELREYFLPPFQSAVAAG-AKSVMAAYNSVDGIPCSSNKFLLTDILRKEWNFNGFTVSDLG 318

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I       K H+ +A   ++AG+D + G         A+ +G+VQE  ID+A+  +
Sbjct: 319 SIEGIKGSHRVAKDHKQAAILAIEAGLDADLGGNAYVRLIEAVKQGEVQENSIDQAVSRV 378

Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
            +++  +GLF       K  K    +V T  +  L+   AR+ IVLL+N    LPL K+ 
Sbjct: 379 LALKFEMGLFEKPFVDAKTAK---KEVKTEANIALSRQVARESIVLLENKNNILPLRKDV 435

Query: 449 VSSLAIIGPLVNNISQMGGGYTG-IPCSPKSLLR-GLEAYVSKTH--YASGCHDVPCNSD 504
              +AIIGP  +NI  M G YT   P    + +R  + A + K    Y  GC  +   ++
Sbjct: 436 --KIAIIGPNADNIYNMLGDYTAPQPDGAVTTVRQAISARLPKAQVSYVKGC-SIRDTTN 492

Query: 505 AGFHEAVRIAKKADFVIVVAG-----------------------LDLTQETEDRDRVSLL 541
           +    AV  A+++D ++ V G                       +   +  E  DR +L 
Sbjct: 493 SDIPAAVTAAQQSDIIVAVVGGSSARDFKTEYISTGAAVASDKSVSDMESGEGFDRSTLD 552

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
           L G+QM L+ ++ +T K P++++   G PL++++A   +      W  YPG+ G  A+A+
Sbjct: 553 LLGRQMELLKALKQTGK-PLVVIYIQGRPLNMNWAATHADALLCAW--YPGQEGGHAIAD 609

Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYP-GRSYRFYTGTQVYGFGHGL 660
           ++FGD+NP G++P++  P S  ++P++        +R+ P    Y     T +Y FG+G 
Sbjct: 610 VLFGDYNPAGKMPLS-VPRSVGQIPVH-------YNRKSPLDHRYVEEAATPLYAFGYGK 661

Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           SY+++ YK             LK   D           DY               V  ++
Sbjct: 662 SYSDFEYK------------DLKIQKDNK---------DY--------------RVSFTL 686

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
           TN G  DG  V  L+ R    S   P +QL  F+R+H     SK +SF V     LS+ N
Sbjct: 687 TNTGKYDGDEVAQLYIRNQYASVSQPVQQLKHFERIHLKTGESKTVSF-VLTAGDLSVIN 745

Query: 781 KHGRRILPLGN 791
              +++L  G+
Sbjct: 746 TQMKKVLEPGS 756


>gi|329956938|ref|ZP_08297506.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523695|gb|EGF50787.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 944

 Score =  247 bits (631), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 204/753 (27%), Positives = 333/753 (44%), Gaps = 123/753 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRELIRQVGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQHNH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH  AY   K         +  ++ 
Sbjct: 253 -----------------------------QVAATAKHFAAYSNNKGAREGMSRVDPQMSP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF---QKARNEWGFKGYITS 325
           ++ E+    PF+  I +     IM SYN  +G+P  ++G  +    + R E GF+GY+ S
Sbjct: 284 REVENIHIYPFKRVIRETGLLGIMSSYNDYDGIP--VQGSYYWLTTRLRQEMGFRGYVVS 341

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DAV  ++   N  K  +++    ++AG+++ C        +   +  + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHNTAKDMKEAVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVI 401

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           +  + ++  V+  +GLF+   +    G    ++V  + ++ +AL A+R+ +VLLKN    
Sbjct: 402 NDRVRDILRVKFLIGLFDSPYQTDLAG--ADNEVEKAANEAVALQASRESVVLLKNADNT 459

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH- 497
           LPLN + +  +A+ GP  +        Y  +     ++L G+       ++  Y  GC  
Sbjct: 460 LPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIREKAQGKAEVLYTKGCDL 519

Query: 498 -----------DVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
                      + P   D  A    A   A++AD  +VV G       E++ R SL LPG
Sbjct: 520 VDAHWPESEIIEYPLTPDEQAEIDRAAANARQADVAVVVLGGGQRTCGENKSRTSLDLPG 579

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
            Q+ L+ +V  T K PV+LVL  G PL V++A  D  + +IL   YPG  G  A+A+I+F
Sbjct: 580 HQLKLLQAVQATGK-PVVLVLINGRPLSVNWA--DKFVPAILEAWYPGSKGGTAVADILF 636

Query: 605 GDFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
           GD+NPGG+L +T +P++  ++P N       + D  +              +Y FG+GLS
Sbjct: 637 GDYNPGGKLTVT-FPKTVGQIPFNFPCKPASQIDGGKNPGADGNMSRINGALYPFGYGLS 695

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT + Y      S+L IS  +     K                           V++ VT
Sbjct: 696 YTTFEY------SDLEISPKVITPDQKAT-------------------------VRLKVT 724

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G   G  VV L+ R    S  T EK L GF+R+      +KE++F +D  + L + N 
Sbjct: 725 NTGKRAGDEVVQLYTRDILSSITTYEKNLAGFERIRLKPGETKEVTFTLD-RKHLELLNA 783

Query: 782 HGRRILPLGNHVLMVG------ELRHSLTIETY 808
             + I+  G   +M G       L   LT+E Y
Sbjct: 784 DMKWIVEPGEFAIMAGASSEDIRLNGILTVEDY 816


>gi|397691065|ref|YP_006528319.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
 gi|395812557|gb|AFN75306.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
          Length = 769

 Score =  247 bits (631), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 218/741 (29%), Positives = 341/741 (46%), Gaps = 130/741 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  E LHG+A+              TS+P  +  AA+FN  L   I SA+A 
Sbjct: 110 RLGIPVI-FHEECLHGLAA-----------KDATSYPVPIGLAATFNPELIEKIFSAIAE 157

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           +AR+     +       P +++ RDPRWGR +ET GED  +VS   +  VK  QG     
Sbjct: 158 DARS-----RGAHQALTPVVDVVRDPRWGRVEETFGEDTYLVSQMGIASVKGLQG----- 207

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
            DG +       +V+                A  KH  A+   + G N +  +F    +E
Sbjct: 208 -DGSLN---NNNKVI----------------ATLKHFAAHGQPESGTNCAPANF----SE 243

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDC 327
           +   DTF  PF+  I++     +M SYN+++G+P+     L +K  R+EW FKG++ SD 
Sbjct: 244 RFLRDTFLMPFKEAIDKAGVISVMASYNEIDGIPSHANKWLLRKVLRDEWNFKGFVVSDY 303

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--------CMLRHTQSAIDKGKVQEK 379
            A+  +F ++  T +H  +A  V  A + +  G         C    T+  +  G   E 
Sbjct: 304 YAITELF-HKEETVSHGVAANKVEAAKLALEAGVNIEFPNPDCYPNLTE-MVKGGLADES 361

Query: 380 DIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDK 439
           DID  +L +   +  LGLF+    + + G+    +    + ++LAL AAR+ I LLKN+ 
Sbjct: 362 DIDALVLPMLKYKFELGLFDNPYVEAEPGQF---ENKLEQDRELALQAARETITLLKNEG 418

Query: 440 KFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK---THYASGC 496
             LPL       +A+IGP  N    + GGY G P    S+ +G++  V K     Y+ GC
Sbjct: 419 NLLPLKD--FKKIAVIGP--NADRTLLGGYHGTPKYYTSVYQGIKDKVGKNGEVFYSEGC 474

Query: 497 HDV--------------PCNSDAGFHEAVRIAKKADFVIVVAGLD--LTQETEDR----D 536
                            P   +   +EAV +A+K+D  ++V G +   ++E  ++    D
Sbjct: 475 KITVGGSWNDDEVILPDPAEDEKLINEAVAVAQKSDVAVLVLGGNEQTSREAWNKKHLGD 534

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
           R SL L G+Q  LV  + +T K PV+++L  G P  + F +    + +IL   Y G+   
Sbjct: 535 RPSLELVGRQNKLVEEILKTGK-PVVVLLFNGRPNSIGFIK--DNVPAILECWYLGQETG 591

Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
           +A+A+++FGD+NP G+LP++  P S   +P +       S +    R Y F   + ++ F
Sbjct: 592 RAVADVLFGDYNPSGKLPVS-IPRSAGHIPAH------YSHKPSARRGYLFDDVSPLFAF 644

Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
           G+GLSYT +S+  L   S+ TISA                       DE  S       V
Sbjct: 645 GYGLSYTKFSFDNLRL-SKDTISA-----------------------DEKVS-------V 673

Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
            I V N G + G  VV L+ R    S   P K+L GF ++      +  + F + P E L
Sbjct: 674 SIEVKNEGAIAGEEVVQLYIRDKVSSVTRPVKELKGFRKITLAPGQTSTVVFELLP-EHL 732

Query: 777 SIANKHGRRILPLGNHVLMVG 797
           +  N   +  +  G   +MVG
Sbjct: 733 AFTNVDMKFTVEPGEFEIMVG 753


>gi|423212854|ref|ZP_17199383.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392694712|gb|EIY87939.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 782

 Score =  247 bits (631), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 215/750 (28%), Positives = 333/750 (44%), Gaps = 135/750 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L   +G  +A 
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R+     Q G   + P +++ RDPRW R +ET GEDP++        V    G N   
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGLGGGNLSQ 231

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
               I                          A  KH +AY + + G    Y   A +  +
Sbjct: 232 KYATI--------------------------ATLKHFLAYAVPEGGQNGNY---ASVGIR 262

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCD 328
           D    F PPFR  I+ G  S +M SYN ++G+P      L  Q  RNEW F+G++ SD  
Sbjct: 263 DLHQNFLPPFRKAIDAGALS-VMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLY 321

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I E      T E++A   + AG+D++ G     +   A+  G++ +  ID A+  +
Sbjct: 322 SIEGIHESHFVAPTKENAAIQSVTAGVDVDLGGDAYTNLCHAVQSGQMDKTVIDTAVCRV 381

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP      K+    V   EH +LA   A+  I LLKN+   LPL+
Sbjct: 382 LRMKFEMGLFEHPYVDP------KIAAKTVRRKEHIELARKIAQSSITLLKNENSILPLS 435

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIP--CSPKSLLRGLEAYVS--KTHYASGC--HDV 499
           K  ++ +A+IGP  +N   M G YT      + K++L G+   +S  +  Y  GC   D 
Sbjct: 436 KT-INKVAVIGPNADNRYNMLGDYTAPQEDSNVKTVLDGILTKLSPFRVEYVRGCAIRDT 494

Query: 500 PCNSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRD 536
             N      +A+  A++++ VIVV G     D     +ET                E  D
Sbjct: 495 TVNE---IEQAIEAARRSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFD 551

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGA 596
           R SL L G+Q  L+ S+ +T K P+I+V   G PL+ ++A       ++L   YPG+ G 
Sbjct: 552 RASLSLLGRQQELLESLQKTGK-PLIVVYIEGRPLEKNWA--SEYADALLTAYYPGQEGG 608

Query: 597 KALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGF 656
            A+A+++FGD+NP GRLP++  P S  ++P+   N +A  +  Y   S      + +Y F
Sbjct: 609 NAIADVLFGDYNPSGRLPIS-VPRSVGQIPVY-YNKKAPRNHDYVEVS-----SSPLYSF 661

Query: 657 GHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHV 716
           G+G+SYT + Y                  SD  ++Q++              C    F V
Sbjct: 662 GYGMSYTTFEY------------------SDLQVVQKSA------------RC----FEV 687

Query: 717 QISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQL 776
              V N G  DG  V  L+ R    S   P KQL  F+R H      K+++F V   E  
Sbjct: 688 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTF-VLTEEDF 746

Query: 777 SIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            + N   ++++  G   +M+G     + +E
Sbjct: 747 FLVNYTLKKVVESGTFQVMIGSSSDDIRLE 776


>gi|238912609|ref|ZP_04656446.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Tennessee str. CDC07-0191]
          Length = 765

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 208/749 (27%), Positives = 346/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP +   ++       H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++      SK +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGESKTVSFPID-IEALKFWNQQ 736


>gi|417366389|ref|ZP_12138693.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Hvittingfoss str. A4-620]
 gi|353592073|gb|EHC50188.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Hvittingfoss str. A4-620]
          Length = 755

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 344/749 (45%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 76  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434

Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +                     P +  A   EAV+ AK+AD 
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R  + +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 495 VVAVVGESQGMAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++      SK +SF +D  E L   N+ 
Sbjct: 699 EKITLKPGESKTVSFPID-IEALKFWNQQ 726


>gi|223936933|ref|ZP_03628842.1| Beta-glucosidase [bacterium Ellin514]
 gi|223894502|gb|EEF60954.1| Beta-glucosidase [bacterium Ellin514]
          Length = 774

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 215/749 (28%), Positives = 343/749 (45%), Gaps = 144/749 (19%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  E LHG A+              TSFPQ +   A+FN +L   + +  A 
Sbjct: 112 RLGIPVM-FHEECLHGHAAR-----------DGTSFPQPIGLGATFNPALVEKLYAMTAH 159

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R      + G     P +++ RD RWGR +ET GEDP + +   +  V+ FQG+    
Sbjct: 160 ETRV-----RGGHQALTPVVDVARDARWGRVEETYGEDPFLNTQLGIAAVRGFQGDA--- 211

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                   F++K+                + A  KH  A+   + G   +      ++E+
Sbjct: 212 -------SFKDKK---------------HVIATLKHFAAHGQPESG---QNCAPVNVSER 246

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-CLRGDLFQKARNEWGFKGYITSDCD 328
              +TF  PFR C+++G A  +M SYN+++GVP+   R  L    R EWGFKG++ SD  
Sbjct: 247 LLRETFLHPFRDCLKKGGAISVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSDYY 306

Query: 329 AVATIFEYQNYTKTHEDSAAG--------VLKAGMDI-----NCGTCMLRHTQSAIDKGK 375
           A   I+E  +   +H    A          +KAG++I     +C     RH    + K  
Sbjct: 307 A---IWELSHRPDSHGHHVAADKKEACVLAVKAGVNIEFPEPDC----YRHLVELVRKKV 359

Query: 376 VQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD----VCTSEHKKLALDAARQG 431
           + E ++D  +  +   + ++GLF+ DP       + P++    V    H++LA +AAR+ 
Sbjct: 360 LHETELDELIAPMLLWKFKMGLFD-DPY------VDPEEAARVVGCEVHRELASEAARET 412

Query: 432 IVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS--- 488
           I LLKN+   LPLN   + ++A+IGP  N    + GGY+G+P    ++L G++A +    
Sbjct: 413 ITLLKNENDLLPLNPAKLKTVAVIGPNANR--SLLGGYSGVPAHNVTVLDGIKARLGGAV 470

Query: 489 KTHYASGCHDV--------------PCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETED 534
           K  +A GC                 P        EAV++A  AD VIV  G +     E 
Sbjct: 471 KVVHAEGCKITVGGSWQQDEVLASDPAEDRKQIDEAVKVAWSADVVIVAIGGNEQTSREA 530

Query: 535 ------RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWI 588
                  DR SL L G Q  L+ ++  T K PV+ ++  G PL ++    +  + +IL  
Sbjct: 531 WSLKHMGDRTSLDLIGHQDELIRALLATGK-PVVALVFNGRPLAINHVAQN--VPAILEC 587

Query: 589 GYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFY 648
            Y G+    A+A ++FGD NPGG+LP++  P S  ++P+        S+R    R + + 
Sbjct: 588 WYLGQECGSAVAAVLFGDHNPGGKLPIS-IPRSVGQLPV--FYNHKPSAR----RGFLWD 640

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
             T ++ FG GLSYT +++K +                 K I+ +TGS            
Sbjct: 641 EATPLFPFGFGLSYTKFTFKNVRLA--------------KKIISRTGST----------- 675

Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
                 HV + VTNAG   G+ VV ++ R    S   P K+L  F ++      +K +S 
Sbjct: 676 ------HVSVDVTNAGKRAGTEVVQVYVRDLISSVTRPVKELKVFQKITLAPGETKTVSL 729

Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            + P E L+  + + + ++  G   +MVG
Sbjct: 730 DLTP-ESLAFYDVNMKYVVEPGEFEIMVG 757


>gi|224536538|ref|ZP_03677077.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521794|gb|EEF90899.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 863

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 161/460 (35%), Positives = 235/460 (51%), Gaps = 61/460 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + SLS   RA+ L+  LTL+EK   + D + ++ RLGI  Y WW+E+LHG+A  G   
Sbjct: 23  YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAG--- 79

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA--MYNLGQ------AGLTFW 165
                    T FPQ +  AASFN  +   + +AV+ EARA   Y   Q       GLT W
Sbjct: 80  -------LATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
            P +NI+RDPRWGRG ET GEDP + S   V  VK  QG      DG             
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQG----PADG------------- 175

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIE 284
               + D+     L AC KH   +   +W   +R+SFNA  I  +D  +T+ PPF + ++
Sbjct: 176 ----KYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVK 223

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
           +GK   +MC+YN+  G P C    L  Q  R EWGF G + SDC A+A  +  + +  TH
Sbjct: 224 EGKVEEVMCAYNRFEGDPCCGSDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGH-HTH 282

Query: 344 ED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
            D    SAA V+ +G D+ CG+      +S + KG + E+ +D ++  L   +  LG  +
Sbjct: 283 PDAESASAAAVI-SGTDLECGSSYKALIES-VKKGLISEETVDTSVKRLMKARFALGEMD 340

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            +P K  + K+    V ++ H  LAL+ AR+ + LL N   FLPL +  + ++A++GP  
Sbjct: 341 -EPEKVSWTKIPFSVVASAAHDSLALNMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
           N+     G Y G+P    ++L G+   +    K  Y  GC
Sbjct: 399 NDSVMQWGNYNGMPAHTVTILDGVRNLLGTDDKLIYEQGC 438



 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 100/319 (31%), Positives = 150/319 (47%), Gaps = 65/319 (20%)

Query: 498 DVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR----------DRVSLL 541
           D   N D GF + V I       K AD VI  +G+  + E E+           DR  + 
Sbjct: 575 DAQLNFDLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIE 634

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ-ISSILWIGYPGEAGAKALA 600
           LP  Q  L+ ++ R  K+ +ILV   G P+ +   E ++Q   +IL   YPG+ G KA+A
Sbjct: 635 LPAVQRELIDALHRAGKK-IILVNCSGSPIGL---EPETQKCEAILQAWYPGQQGGKAVA 690

Query: 601 EIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           E++FGD+NP G+LP+T+Y  + +++P   D NM         GR+YR+     ++ FG+G
Sbjct: 691 EVLFGDYNPAGKLPVTFY-RNVSQLPDFEDYNMT--------GRTYRYMQDVPLFPFGYG 741

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT + Y               K   DKN               E+T+  SL+  V   
Sbjct: 742 LSYTTFGYG--------------KTVLDKN---------------ELTAGQSLKLTV--P 770

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSI 778
           VTN G  +G  VV ++ R    ++G P K L  F RV   A  +  + F +   E +   
Sbjct: 771 VTNTGKRNGEEVVQVYLRKQGDAEG-PIKTLRAFKRVSIPAGKTVNVEFDLKDKELEWWD 829

Query: 779 ANKHGRRILPLGNHVLMVG 797
              +  R+ P GN+ +MVG
Sbjct: 830 DQSNTVRVCP-GNYDIMVG 847


>gi|365877135|ref|ZP_09416640.1| glycoside hydrolase family protein [Elizabethkingia anophelis Ag1]
 gi|442587941|ref|ZP_21006755.1| glycoside hydrolase family protein [Elizabethkingia anophelis R26]
 gi|365754995|gb|EHM96929.1| glycoside hydrolase family protein [Elizabethkingia anophelis Ag1]
 gi|442562440|gb|ELR79661.1| glycoside hydrolase family protein [Elizabethkingia anophelis R26]
          Length = 827

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 207/778 (26%), Positives = 346/778 (44%), Gaps = 140/778 (17%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           SYP+ N + +++   K  I                    RLGIP  ++ +E +HG+    
Sbjct: 143 SYPYSNHTEALNNIQKWFIEET-----------------RLGIPV-DFTNEGIHGL---- 180

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNIN 170
                  T    T FP  +   +++++ L   IG+ +  EA   Y LG   +  +AP ++
Sbjct: 181 -------THDRATPFPAPINIGSTWDKDLVGKIGNTIGKEA---YYLGYTNV--YAPILD 228

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           + RDPRWGR  ET GEDP ++  Y                          KR++KG  + 
Sbjct: 229 VSRDPRWGRVVETYGEDPFMIGEYG-------------------------KRMVKGIQQN 263

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                   +++  KH   Y + K G       +  +  ++    +  PF+  I +     
Sbjct: 264 G-------VASTLKHYAVYSVPKGGRDGLARTDPHVAPKEMHTMYLYPFKEVIRKEHPLG 316

Query: 291 IMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDS 346
           +M SYN  +GVP         DL    R E+GF GY+ SD DA+  +    +  K +E+ 
Sbjct: 317 VMASYNDYDGVPVISSKYFLTDLL---RKEYGFDGYVVSDSDALEFLHGKHHVAKDYEEG 373

Query: 347 AAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
               L+AG+D+          L     A+  GK++E+ ++  + ++   + RLGLF+   
Sbjct: 374 IQKALEAGLDVRTNFTQPKEYLTALMDALKSGKIKEEVLNERVRSVLKTKFRLGLFDEPI 433

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNI 462
           R   + K     V T E + L++D  R+ +VLLKN+K+ LPL+   + ++ I GPL + +
Sbjct: 434 RN--FIKEADRKVHTKEDEALSVDVNRRSVVLLKNEKQTLPLDTGKLKNILITGPLADAV 491

Query: 463 SQMGGGYTGIPCSPKSLLR-GLEAYVSKTH----YASGCHDV-------------PCNSD 504
           +     Y G   +P + +R G+E Y S  H    Y  G   +             P   +
Sbjct: 492 NYTTSRY-GPSNNPVTTIRKGIEDYASLHHINTSYTKGVDVIDEGWPETEIIPVEPTEKE 550

Query: 505 -AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVIL 563
            +   + + +A+K+D +I V G    +  E R R SL LPG+Q   +  + +T ++P++L
Sbjct: 551 KSEISKTISMAEKSDVIIAVMGESEKEVGESRSRSSLNLPGKQTYFLQQLYKT-RKPIVL 609

Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
           VL  G PL +++   +  + +IL   + G      +AE +FG+ NPGG+LP++ +P+S  
Sbjct: 610 VLVNGRPLTINW--ENKYLPAILETWFLGPQSGNIVAETLFGENNPGGKLPIS-FPKSIG 666

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQV----YGFGHGLSYTNYSYKFLSAPSELTIS 679
           ++ MN     A  + Q PG        ++V    Y FG+GLSYTN+ +   S  S+    
Sbjct: 667 QLEMNFPTKPAAQAGQ-PGTGPNGSGSSRVTGFLYPFGYGLSYTNFEFTDFSLSSK---- 721

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
             +KAG++                           H ++ VTN G V G  VV L+    
Sbjct: 722 -KIKAGNE--------------------------LHAKLKVTNTGKVKGDEVVQLYLSDL 754

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             S  T E  L GF+RV      +KE+ F ++  E + + N     ++  G   + VG
Sbjct: 755 VSSVTTYEMDLRGFERVTLEPGEAKEVQFTLN-KEHMQLLNDKMEWVVEPGEFRVSVG 811


>gi|265766195|ref|ZP_06094236.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
 gi|263253863|gb|EEZ25328.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
          Length = 859

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 216/759 (28%), Positives = 339/759 (44%), Gaps = 123/759 (16%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           LTL  ++Q+     +   RLGIP +   +ESLHG   +G            T FPQ +  
Sbjct: 95  LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
            ++FN  L   + SA+A E  A     Q       P I++ RD RWGR +E  GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELTA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPYLV 194

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S   V  V+                G+ + +V                S   KH  A+  
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222

Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
            + G N +  S      +++    +   F + +++ K   +M SYN  N  P      L 
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278

Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
            +  R+ W F+GY+ SD  A+  +  +    +   ++A   L AG+D           Q 
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
            ++ G +  K ID+A+  + + +  +GLF    P +  Y K+    V    H  LA   A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394

Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
            + IVLL+N+   LPL  N + S+A+IGP  + +    G YT    +    +LL  L+  
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452

Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
           VS     +YA GC D+  +  +GF EAV +AKK+D  IVV G        D +  T  E 
Sbjct: 453 VSNQLTLNYAKGC-DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            D   L L G Q  LV ++  T K PVI+VL  G P  +S+ + +  I  I+   YPGE 
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGIVVQWYPGEQ 568

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
           G  ALA+++ G  NP G+L  + +P+S       +  +P +    R+  S+  PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
            +   ++ FGHGLSYT++ Y                       L  T S+ DY       
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------- 657

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           +C  +   V I++ N GD DG  V  ++ R    S   P ++L GF++V      +K++ 
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVI 716

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             + P  +L++ NK  ++++  G   L +G     + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754


>gi|189464698|ref|ZP_03013483.1| hypothetical protein BACINT_01042 [Bacteroides intestinalis DSM
           17393]
 gi|189436972|gb|EDV05957.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 862

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 153/461 (33%), Positives = 237/461 (51%), Gaps = 57/461 (12%)

Query: 51  SYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNG 110
           + P+ N  LS + RAK L+  LTL+EK   + D++ AIPRLGI  + WWSE+LHG+A+ G
Sbjct: 20  TLPYQNPELSPAERAKDLVKRLTLEEKALLMCDDSEAIPRLGIKKFNWWSEALHGVANQG 79

Query: 111 PGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---------LGQAG 161
                     +VT FP+ +  AASFN  L   I +AV+ E RA +N         +    
Sbjct: 80  ----------NVTVFPEPVGMAASFNDKLVFEIFNAVSDEMRAKHNERVRNGLEDVRFHS 129

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFRE 220
           L+ W PN+NIFRDPRWGRGQET GEDP + S   +  VK  QG EN K         +R+
Sbjct: 130 LSVWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGIAVVKGLQGPENEK---------YRK 180

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPF 279
                             L AC KH   +   +W   SR++ N   ++ +D  +T+ P F
Sbjct: 181 ------------------LLACAKHYAVHSGPEW---SRHTANLNNVSPRDLWETYLPAF 219

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQN 338
           ++ +++     +MC+Y +++  P C    L Q+  R+EWGFK  + SDC A+A  +    
Sbjct: 220 KALVQKADVREVMCAYQRLDDDPCCGNTRLLQQILRDEWGFKYLVVSDCGAIADFWTSHK 279

Query: 339 YTKTHEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
            +     +A     AG D+ CG     +    A+ +G + E+++++ +L L   +  LG 
Sbjct: 280 SSSDAVHAAVKGTMAGTDVECGYGYAYQKLPEAVSRGLITEEEVNKHVLRLMEGRFELGE 339

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
            + DP    + K+    V    HK L+L+ +RQ + LL+N    LPL+K ++  +A+IGP
Sbjct: 340 MD-DPSLVNWTKIPMSVVNCKAHKDLSLNMSRQTMTLLQNKNNVLPLSK-SIRKIAVIGP 397

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGC 496
             ++   + G Y G P    ++L G ++ + K    Y  GC
Sbjct: 398 NADDKPMLWGNYNGTPNQTITILDGFKSKLKKNQIVYMKGC 438



 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 67/272 (24%), Positives = 114/272 (41%), Gaps = 54/272 (19%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVART 556
           + E++   K  D V+ V G+    E E+           DR  + LP  Q + + ++   
Sbjct: 590 YSESISKLKGIDVVVFVGGISPQLEGEEMPVNIPGFKGGDRTDIELPAVQRNFLKALKDA 649

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
            K+ V +  +G     ++         +IL   Y GE G  A+A+++FGD+NP G+LP+T
Sbjct: 650 GKQVVFVNCSGSS---MALLPETESCDAILQAWYGGELGGYAVADVLFGDYNPSGKLPVT 706

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           +Y  +       D +M+        GR+YR Y    ++ FG GLSYT++           
Sbjct: 707 FYKNTKQLPDYEDYSMK--------GRTYR-YMSDPLFPFGFGLSYTDF----------- 746

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
              A   A  +K  L+           DE  + T       + V+N G   G+ +V ++ 
Sbjct: 747 ---AIGTASCNKTQLR----------TDESLTLT-------VPVSNTGKRSGTEIVQVYI 786

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
           R    + G P K L  + RV       +++  
Sbjct: 787 RKTDDADG-PLKSLKAYARVELATGAQQDVKI 817


>gi|417519247|ref|ZP_12181440.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Uganda str. R8-3404]
 gi|353647113|gb|EHC90324.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Uganda str. R8-3404]
          Length = 755

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 207/749 (27%), Positives = 344/749 (45%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 76  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP +   ++       H+K 
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 376

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434

Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +                     P +  A   EAV+ AK+AD 
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 611 PEKPNKYTSRYFDEVNGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++      SK +SF +D  E L   N+ 
Sbjct: 699 EKITLKPGESKTVSFPID-IEALKFWNQQ 726


>gi|198245676|ref|YP_002216250.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|375119733|ref|ZP_09764900.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Dublin str. SD3246]
 gi|197940192|gb|ACH77525.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|326624000|gb|EGE30345.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Dublin str. SD3246]
          Length = 755

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 76  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADM 494

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 699 EKITLKPGERKTVSFPID-IEALKFWNQQ 726


>gi|319901412|ref|YP_004161140.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
 gi|319416443|gb|ADV43554.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
           P 36-108]
          Length = 944

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 209/754 (27%), Positives = 342/754 (45%), Gaps = 125/754 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   IG     
Sbjct: 153 RLGIPV-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRELIHKIGFITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQYNH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH  AY   K         +  I+ 
Sbjct: 253 -----------------------------QVAATGKHFAAYSNNKGAREGMSRVDPQISP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
           ++ E+    PFR  I +     +M SYN  +G+P  ++G    L  + R E GF+GY+ S
Sbjct: 284 REVENIHIYPFRRVIREAGLLGVMSSYNDYDGIP--IQGSHYWLTTRLRGEIGFRGYVVS 341

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DAV  ++      K  +++    ++AG++I C        +   +  + +G + E+ I
Sbjct: 342 DSDAVEYLYTKHGTAKDMKEAIRQSVEAGLNIRCTFRSPDSFVLPLRELVKEGGLSEEII 401

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD-DVCTSEHKKLALDAARQGIVLLKNDKK 440
           +  + ++  V+   GLF+  P +      G D +V   E+  +AL A+R+ IVLLKN+  
Sbjct: 402 NDRVRDILRVKFLTGLFD-TPYQSDLA--GADREVEKEENGSIALQASRESIVLLKNENN 458

Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGC- 496
            LPL+ + V  +A+ GP  +  +     Y  +     ++L+G++  VS   +  Y  GC 
Sbjct: 459 MLPLDLSTVKRIAVCGPNADEKNYALTHYGPLAVEVITVLKGIQDKVSGKAEVLYTKGCD 518

Query: 497 -------------HDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
                        H +  +  A  ++A   A+++D  +VV G       E++ R SL LP
Sbjct: 519 LVDANWPESEIINHPLTADEQAEINKAAENARQSDVAVVVLGGGQRTCGENKSRSSLDLP 578

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
           G+Q+ L+ ++  T K PVILVL  G PL V++A  D  + +IL   YPG  G  ALA+++
Sbjct: 579 GRQLQLLQAIQATGK-PVILVLINGRPLSVNWA--DKYVPAILEAWYPGAKGGIALADVL 635

Query: 604 FGDFNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
           FGD+NPGG+L +T +P++  ++P N       + D  +              +Y FG+GL
Sbjct: 636 FGDYNPGGKLTVT-FPKTVGQIPFNFPYKPASQIDGGKNPGPEGNMSRINGALYPFGYGL 694

Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           SYT + Y      S+L I+  +   +                 +E T        V++ V
Sbjct: 695 SYTTFEY------SDLEITPKVITPN-----------------EEAT--------VRLKV 723

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
           TN G   G  VV L+ R    S  T EK L GF+RVH     +KE+ F +   + L + +
Sbjct: 724 TNTGKRAGDEVVQLYIRDVVSSVITYEKNLAGFERVHLEPGETKEVVFTLG-RKHLELLD 782

Query: 781 KHGRRILPLGNHVLMVG------ELRHSLTIETY 808
            + + ++  G+  +M         L   LT+E Y
Sbjct: 783 ANMQWVVEPGDFTIMAAASSEDIRLNGLLTVEDY 816


>gi|423223593|ref|ZP_17210062.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638218|gb|EIY32065.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 863

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 161/460 (35%), Positives = 235/460 (51%), Gaps = 61/460 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + SLS   RA+ L+  LTL+EK   + D + ++ RLGI  Y WW+E+LHG+A  G   
Sbjct: 23  YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAG--- 79

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA--MYNLGQ------AGLTFW 165
                    T FPQ +  AASFN  +   + +AV+ EARA   Y   Q       GLT W
Sbjct: 80  -------LATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLK 225
            P +NI+RDPRWGRG ET GEDP + S   V  VK  QG      DG             
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQG----PADG------------- 175

Query: 226 GFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIE 284
               + D+     L AC KH   +   +W   +R+SFNA  I  +D  +T+ PPF + ++
Sbjct: 176 ----KYDK-----LHACAKHFAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVK 223

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
           +GK   +MC+YN+  G P C    L  Q  R EWGF G + SDC A+A  +  + +  TH
Sbjct: 224 EGKVEEVMCAYNRFEGDPCCGSDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGH-HTH 282

Query: 344 ED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
            D    SAA V+ +G D+ CG+      +S + KG + E+ +D ++  L   +  LG  +
Sbjct: 283 PDAESASAAAVI-SGTDLECGSSYKALIES-VKKGLISEETVDTSVKRLMKARFALGEMD 340

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            +P K  + K+    V ++ H  LAL+ AR+ + LL N   FLPL +  + ++A++GP  
Sbjct: 341 -EPEKVSWTKIPFSVVASAAHDSLALNMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNA 398

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
           N+     G Y G+P    ++L G+   +    K  Y  GC
Sbjct: 399 NDSVMQWGNYNGMPAHTVTILDGVRNLLGTDDKLIYEQGC 438



 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 99/319 (31%), Positives = 151/319 (47%), Gaps = 65/319 (20%)

Query: 498 DVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR----------DRVSLL 541
           D   N D GF + V I       K AD VI  +G+  + E E+           DR  + 
Sbjct: 575 DAQLNFDLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIE 634

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQ-ISSILWIGYPGEAGAKALA 600
           LP  Q  L+ ++ R  K+ +ILV   G P+ +   E ++Q   +IL   YPG+ G KA+A
Sbjct: 635 LPAVQRELIDALHRAGKK-IILVNCSGSPIGL---EPETQKCEAILQAWYPGQQGGKAVA 690

Query: 601 EIIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHG 659
           E++FGD+NP G+LP+T+Y  + +++P   D NM         GR+YR+     ++ FG+G
Sbjct: 691 EVLFGDYNPAGKLPVTFY-RNVSQLPDFEDYNMT--------GRTYRYMQDVPLFPFGYG 741

Query: 660 LSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQIS 719
           LSYT + Y               K   DKN               E+T+  SL+  + + 
Sbjct: 742 LSYTTFGYG--------------KTVLDKN---------------ELTAGQSLK--LTVP 770

Query: 720 VTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSI 778
           VTN G  +G  VV ++ R    ++G P K L  F RV   A  +  + F +   E +   
Sbjct: 771 VTNTGKRNGEEVVQVYLRKQGDAEG-PIKTLRAFKRVSIPAGKTVNVEFDLKDKELEWWD 829

Query: 779 ANKHGRRILPLGNHVLMVG 797
              +  R+ P GN+ +MVG
Sbjct: 830 DQSNTVRVCP-GNYDIMVG 847


>gi|194736410|ref|YP_002115255.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Schwarzengrund str. CVM19633]
 gi|197300833|ref|ZP_02661390.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Schwarzengrund str. SL480]
 gi|204929066|ref|ZP_03220209.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Javiana str. GA_MM04042433]
 gi|205360240|ref|ZP_02681470.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Hadar str. RI_05P066]
 gi|416491454|ref|ZP_11727088.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. OH_2009072675]
 gi|416496191|ref|ZP_11729048.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. CASC_09SCPH15965]
 gi|416586610|ref|ZP_11775622.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. MD_MDA09249507]
 gi|416674556|ref|ZP_11821234.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB102109-0047]
 gi|417359373|ref|ZP_12133763.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Give
           str. S5-487]
 gi|417391848|ref|ZP_12154881.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Minnesota str. A4-603]
 gi|417476332|ref|ZP_12170882.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Rubislaw str. A4-653]
 gi|417511967|ref|ZP_12176427.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Senftenberg str. A4-543]
 gi|194711912|gb|ACF91133.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Schwarzengrund str. CVM19633]
 gi|197290594|gb|EDY29949.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Schwarzengrund str. SL480]
 gi|204321610|gb|EDZ06809.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Javiana str. GA_MM04042433]
 gi|205350836|gb|EDZ37467.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Hadar str. RI_05P066]
 gi|322649375|gb|EFY45811.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. OH_2009072675]
 gi|322656495|gb|EFY52784.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. CASC_09SCPH15965]
 gi|322667165|gb|EFY63332.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. MD_MDA09249507]
 gi|323222876|gb|EGA07232.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. MB102109-0047]
 gi|353589745|gb|EHC48461.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Give
           str. S5-487]
 gi|353614040|gb|EHC65987.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Minnesota str. A4-603]
 gi|353641236|gb|EHC86018.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Rubislaw str. A4-653]
 gi|353641325|gb|EHC86084.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Senftenberg str. A4-543]
          Length = 755

 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 76  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 699 EKITLKPGERKTVSFPID-IEALKFWNQQ 726


>gi|62180752|ref|YP_217169.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SC-B67]
 gi|375115081|ref|ZP_09760251.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SCSA50]
 gi|62128385|gb|AAX66088.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SC-B67]
 gi|322715227|gb|EFZ06798.1| beta-D-glucoside glucohydrolase, periplasmic [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SCSA50]
          Length = 765

 Score =  246 bits (629), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 348/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN+     +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNQDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--AIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  +A G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILHAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++      SK +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGESKTVSFPID-IEALKFWNQQ 736


>gi|423271149|ref|ZP_17250120.1| hypothetical protein HMPREF1079_03202 [Bacteroides fragilis
           CL05T00C42]
 gi|423274973|ref|ZP_17253919.1| hypothetical protein HMPREF1080_02572 [Bacteroides fragilis
           CL05T12C13]
 gi|392699073|gb|EIY92255.1| hypothetical protein HMPREF1079_03202 [Bacteroides fragilis
           CL05T00C42]
 gi|392704252|gb|EIY97391.1| hypothetical protein HMPREF1080_02572 [Bacteroides fragilis
           CL05T12C13]
          Length = 859

 Score =  246 bits (629), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 215/759 (28%), Positives = 338/759 (44%), Gaps = 123/759 (16%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           LTL  ++Q+     +   RLGIP +   +ESLHG   +G            T FPQ +  
Sbjct: 95  LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
            ++FN  L   + SA+A E  A     Q       P I++ RD RWGR +E  GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELSA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPFLV 194

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S   V  V+                G+ + +V                S   KH  A+  
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222

Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
            + G N +  S      +++    +   F + +++ K   +M SYN  N  P      L 
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278

Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
            +  R+ W F+GY+ SD  A+  +  +    +   ++A   L AG+D           Q 
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
            ++ G +  K ID+A+  + + +  +GLF    P +  Y K+    V    H  LA   A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394

Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
            + IVLL+N+   LPL  N + S+A+IGP  + +    G YT    +    +LL  L+  
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452

Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
           V      +YA GC D+  +  +GF EAV +AKK+D  IVV G        D +  T  E 
Sbjct: 453 VGNQLTLNYAKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            D   L L G Q  LV ++  T K PVI+VL  G P  +S+ + +  I  I+   YPGE 
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGIVVQWYPGEQ 568

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
           G  ALA+++ G  NP G+L  + +P+S       +  +P +    R+  S+  PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
            +   ++ FGHGLSYT++ Y                       L  T S+ DY       
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------- 657

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           +C  +   V I++ N GD DG  V  ++ R    S   P ++L GF++V      +K++ 
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVI 716

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             + P  +L++ NK  ++++  G   L +G     + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754


>gi|336411808|ref|ZP_08592268.1| hypothetical protein HMPREF1018_04286 [Bacteroides sp. 2_1_56FAA]
 gi|335940152|gb|EGN02020.1| hypothetical protein HMPREF1018_04286 [Bacteroides sp. 2_1_56FAA]
          Length = 859

 Score =  246 bits (629), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 216/760 (28%), Positives = 337/760 (44%), Gaps = 125/760 (16%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           LTL  ++Q+     +   RLGIP +   +ESLHG   +G            T FPQ +  
Sbjct: 95  LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
            ++FN  L   + SA+A E  A     Q       P I++ RD RWGR +E  GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELSA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPFLV 194

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S   V  V+                G+ + +V                S   KH  A+  
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222

Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
            + G N +  S      +++    +   F + +++ K   +M SYN  N  P      L 
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278

Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
            +  R+ W F+GY+ SD  A+  +  +    +   ++A   L AG+D           Q 
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
            ++ G +  K ID+A+  + + +  +GLF    P +  Y K+    V    H  LA   A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394

Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS 488
            + IVLL+N+   LPL  N + S+A+IGP   N  Q+  G        K  +  LEA   
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGP---NADQVQFGDYTWSRDNKDGVTLLEALKE 451

Query: 489 KT------HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--E 533
           +       +YA GC D+  +  +GF EAV +AKK+D  IVV G        D +  T  E
Sbjct: 452 RAGNQLTLNYAKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGE 510

Query: 534 DRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGE 593
             D   L L G Q  LV ++  T K PVI+VL  G PL +S+ + +  I  I+   YPGE
Sbjct: 511 GFDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPLAMSWIKEN--IPGIVVQWYPGE 567

Query: 594 AGAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYR 646
            G  ALA+++ G  NP G+L  + +P+S       +  +P +    R+  S+  PG+ Y 
Sbjct: 568 QGGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYV 626

Query: 647 FYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEV 706
           F +   ++ FGHGLSYT++ Y                       L  T S+ DY      
Sbjct: 627 FSSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------ 657

Query: 707 TSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEI 766
            +C  +   V I++ N GD DG  V  ++ R    S   P ++L GF++V      +K++
Sbjct: 658 -ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715

Query: 767 SFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
              + P  +L++ NK  ++++  G   L +G     + I+
Sbjct: 716 IIKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754


>gi|445146604|ref|ZP_21387818.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Dublin str. SL1438]
 gi|445154484|ref|ZP_21391811.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Dublin str. HWS51]
 gi|444845501|gb|ELX70711.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Dublin str. SL1438]
 gi|444850729|gb|ELX75826.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Dublin str. HWS51]
          Length = 765

 Score =  246 bits (629), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADM 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736


>gi|423227452|ref|ZP_17213913.1| hypothetical protein HMPREF1062_06099 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392623082|gb|EIY17188.1| hypothetical protein HMPREF1062_06099 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 786

 Score =  246 bits (629), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 219/742 (29%), Positives = 349/742 (47%), Gaps = 123/742 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L S I    A 
Sbjct: 137 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKELISEIAQVTAE 184

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E  GEDP +V                 
Sbjct: 185 EAKAL------GYTNIYSPILDIAQDPRWGRVVECYGEDPFLV----------------- 221

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
              G +G     KR++KG  +E        L A  KH   Y +   G  +    +  +  
Sbjct: 222 ---GELG-----KRMIKGLQQEG-------LVATPKHFAVYSIPVGGRDAGTRTDPHVAP 266

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
           ++    +  PFR    +  A  +M SYN  +G P  + G    L +  R+EWGFKGY+ S
Sbjct: 267 REMRTLYIEPFRKAFCEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRHEWGFKGYVVS 324

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDI 381
           D +AV  ++          D AA V+ AG+++     +    +R  + AI +GKV  + I
Sbjct: 325 DSEAVEFLYSKHQVAADAVDGAAQVVNAGLNVRTNFTLPENFIRPLRQAISEGKVSMQTI 384

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           D  + ++  V+  +GLF+ +P KG   K     V + EH+ +++ AA + IVLLKN+   
Sbjct: 385 DSRVADVLRVKFGMGLFD-NPYKGD-AKHPEKVVHSKEHQAVSMRAALESIVLLKNENNI 442

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC--- 496
           LPL+K+ +  +A+IGP  N +  +   Y       K++ +G++ Y+  ++  YA G    
Sbjct: 443 LPLSKD-LKKVAVIGPNANEVQNLICRYGPANAPIKTVYQGIKEYLPDAEVRYAKGTDII 501

Query: 497 ---------HDVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
                    ++VP + +  A   EAV +AK++D  I+V G +     E+  R +L L G+
Sbjct: 502 DKYFPESELYEVPLDQEEQAMMDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDLCGR 561

Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
           Q  L+ +V  T K PVIL+L  G    +++AE    I  I+   +PGE    A+A+++FG
Sbjct: 562 QEKLLQAVYATGK-PVILLLVDGRAATINWAE--RYIPGIVHAWFPGEFMGDAVAQVLFG 618

Query: 606 DFNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTN 664
           D+NPGG+L +T +P S  ++P        +DS      + +   TGT +Y FG+GLSYT 
Sbjct: 619 DYNPGGKLAVT-FPRSVGQIPFAFPFKPGSDS------KGFVRVTGT-LYPFGYGLSYTT 670

Query: 665 YSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG 724
           ++Y      S+L I   +       I  Q   +L         SC          V N G
Sbjct: 671 FAY------SDLKIENPV-------IGVQGSVKL---------SC---------KVKNTG 699

Query: 725 DVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR 784
            V G  VV L+      S  T  K L GF+R+H  +   K I F + P ++L + NK   
Sbjct: 700 KVAGDEVVQLYLHDEMSSVTTYVKVLRGFERIHLESGEEKVIDFVLTP-QELGLWNKDNH 758

Query: 785 RILPLGNHVLMVGELRHSLTIE 806
            ++  G   +MVG     + ++
Sbjct: 759 FVVEPGTFAVMVGSSSQDIKLQ 780


>gi|205357092|ref|ZP_02344874.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA29]
 gi|213650030|ref|ZP_03380083.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. J185]
 gi|289824263|ref|ZP_06543858.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. E98-3139]
 gi|378958841|ref|YP_005216327.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar Typhi
           str. P-stx-12]
 gi|205324021|gb|EDZ11860.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA29]
 gi|374352713|gb|AEZ44474.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhi str. P-stx-12]
          Length = 755

 Score =  246 bits (628), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 210/748 (28%), Positives = 348/748 (46%), Gaps = 137/748 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 76  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSS 637
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+  + +N     +
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610

Query: 638 RQYPGR-SYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
            + P + + R++      +Y FG+GLSYT ++       S++T+S+          +Q+ 
Sbjct: 611 PEKPNKYTSRYFDEANGPLYPFGYGLSYTTFTV------SDVTLSSP--------TMQRD 656

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
           G         +VT+         + VTN G  +G  V+ ++ +    S   P KQL GF+
Sbjct: 657 G---------KVTAS--------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGFE 699

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKH 782
           ++       K +SF +D  E L   N+ 
Sbjct: 700 KITLKPGERKTVSFPID-IEALKFWNQQ 726


>gi|423140736|ref|ZP_17128374.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
           houtenae str. ATCC BAA-1581]
 gi|379053290|gb|EHY71181.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
           houtenae str. ATCC BAA-1581]
          Length = 765

 Score =  246 bits (628), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 208/738 (28%), Positives = 341/738 (46%), Gaps = 138/738 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTWLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LK+G+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKSGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESLVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVADQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +                     P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R  + LP  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTDITLPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR------ 633
            Q  +IL   + G  G  A+A+++FGD+NP G+LPM+ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPMS-FPRSVGQIPVYYSHLNTGRPYV 620

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
           ++   +Y  R +    G  +Y FG+GLSYT ++       S++T+SA          +++
Sbjct: 621 SEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSAP--------TMKR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVD 771
           +++      SK +SF +D
Sbjct: 709 EKIALKPGESKTVSFPID 726


>gi|404404031|ref|ZP_10995615.1| glycoside hydrolase family protein [Alistipes sp. JC136]
          Length = 740

 Score =  246 bits (628), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 201/666 (30%), Positives = 316/666 (47%), Gaps = 93/666 (13%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
           T  P  L  + S++           AVEA A      AGL + +AP ++I RDPRWGR  
Sbjct: 118 TISPVPLAESCSWDMETIEASARMAAVEASA------AGLQWTFAPMVDIARDPRWGRVM 171

Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
           E  GEDP + S  A   V+ FQG++  + +                           + A
Sbjct: 172 EGAGEDPYLGSHIARARVRGFQGDDLSAPN--------------------------TILA 205

Query: 242 CCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           C KH   Y   + G      +N + I++Q   + + PPF++  + G A+  M S+N+++G
Sbjct: 206 CAKHFAGYGASEGGR----DYNTVDISDQRLRELYLPPFKAAADAGAAT-FMNSFNELSG 260

Query: 301 VPACLRGDLF---QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
           VPA   G+ F   Q  RNEWG+ G I SD  +VA +  +    +  + +A   +K   DI
Sbjct: 261 VPAT--GNRFLVKQILRNEWGWDGVIVSDWGSVAEMIPH-GIAEDKKQAALLAVKNECDI 317

Query: 358 NC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVC 416
           +  G C     +  + +GKV EK+IDR++  +  ++  LGLF+ DP +    +   +   
Sbjct: 318 DMEGNCYPSSLEELVKEGKVSEKEIDRSVRRILRLKYELGLFD-DPYRYCDEQREKEVTL 376

Query: 417 TSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY--TGIPC 474
           ++ H++ A D AR+ IVLL+N K  LPL K    S+A++GPL ++   M G +   G P 
Sbjct: 377 SAAHREAARDMARKSIVLLENRKSVLPLGKP--RSIAVVGPLADSPVDMLGEWRAKGDPK 434

Query: 475 SPKSLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQE 531
              ++LRG+E      ++  +A GC DV  +  +GF EAVR A+ AD VI   G      
Sbjct: 435 EVVTILRGIEKTAGAGTRVTHAKGC-DVTGSDRSGFAEAVRAARSADVVIACLGESADMS 493

Query: 532 TEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYP 591
            E   R  L LPG Q  L+  + +T K P++L+L+ G PL +++ + +  I +I+   + 
Sbjct: 494 GEGYCRSELGLPGVQQELLKELKKTGK-PIVLLLSNGRPLTLAWEKEN--IETIVETWFL 550

Query: 592 GEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGT 651
           G     A+A+++FG +NP G+L M+ +P +  ++P+   +       +   R    Y   
Sbjct: 551 GTEAGNAVADVLFGKYNPSGKLVMS-FPYNVGQIPVYYNHKHTGRPFEPNQRYVMHYIDA 609

Query: 652 QV---YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
            V   Y FG+GLSYT + Y       E T+S+   A  D                     
Sbjct: 610 PVDALYPFGYGLSYTRFEY------GEPTLSSDRMAAGDT-------------------- 643

Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
                    + VTNAGD DG  VV L+ R  K     P K+L GF ++      S +++F
Sbjct: 644 -----ITATVKVTNAGDYDGEEVVQLYIRDLKAQITRPVKELKGFRKIFLKKGESADVTF 698

Query: 769 GVDPCE 774
            +   E
Sbjct: 699 DITRAE 704


>gi|346226406|ref|ZP_08847548.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
          Length = 775

 Score =  246 bits (628), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 218/774 (28%), Positives = 347/774 (44%), Gaps = 126/774 (16%)

Query: 61  ISTRAKSLISLLTLQEKIQQLSDNASAIPRLG----IPAYEWWSESLHGIASNGPGVNF- 115
           I  +  S+++L+TL+EKI Q++  +      G      A E+  + + G   N  G +  
Sbjct: 41  IERKVDSVLNLMTLEEKIGQMNQYSGNFAVTGEVAETKAGEYLKKGMIGSTFNTFGADHI 100

Query: 116 ------NGTVSSV---------------TSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
                 N   S +               T+FP  L  A S++  L        A EA A 
Sbjct: 101 RMLQEQNLKYSRLKIPMLFAADVIHGLETTFPIPLAEACSWDLELMEQSARIAAEEATA- 159

Query: 155 YNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGG 213
                +G+ + +AP I+I RDPRWGR  E  GEDP + S  A   V+ FQG     D   
Sbjct: 160 -----SGIAWNFAPMIDIARDPRWGRVMEGAGEDPYLGSLVARARVRGFQGIETYKDFSK 214

Query: 214 IGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTED 273
           I                        + A  KH + Y   + G   R   +  ++ +   +
Sbjct: 215 IN----------------------TMMATSKHFVGYGAVQAG---RDYHSVDMSVRTLHE 249

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVAT 332
           T+ PPF++ +++G  +  M ++N +NGVP      LF++  R+ WGF G + +D  A+  
Sbjct: 250 TYLPPFKAAVDEG-VTAFMTAFNDLNGVPCTGNKYLFKEILRDRWGFGGMVVTDYTAIQE 308

Query: 333 IFEYQNYTKTHEDSAAGVLKAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSV 391
           +  +  + +  + +    + AG+D++      + + +  +++GKV EK ID A+  +  +
Sbjct: 309 MVAH-GFARDLKHATELAIDAGIDMDMISEGFVTYLKELVEEGKVSEKQIDVAVSRILEM 367

Query: 392 QLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSS 451
           +  LGLF+ DP K    +   + V   EH K A + A++ IVLL+N    LPL KN    
Sbjct: 368 KFLLGLFD-DPFKYCNAERQKEVVMNPEHLKAAREVAQRSIVLLENKNNVLPLKKNEPKR 426

Query: 452 LAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYVS----KTHYASGCH-------- 497
           +A+IGP V     + G +   G P    +L+ GLE        K  YA G          
Sbjct: 427 VALIGPFVKERESLTGEWAIKGDPDKSVTLMEGLEEKYKDSQVKFSYAKGTSLPVIDRTT 486

Query: 498 ------DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVT 551
                  VP  S  GF EA+ +A+ +D ++V  G       E   R  + LPG Q  L+ 
Sbjct: 487 QKVSTTRVPDRS--GFSEAINLARTSDVILVAMGEKFHWSGEAASRTDITLPGNQRELLK 544

Query: 552 SVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGG 611
            + +T K P+ILVL  G PLD+S+ EA++ + +I+   YPG     A+A+++ GD+NP  
Sbjct: 545 ELKKTGK-PIILVLFNGRPLDLSW-EAEN-VDAIVEAWYPGIMAGHAVADVLSGDYNPSA 601

Query: 612 RLPMTWYPESFTKVPM--NDMNMRADSSRQYPG---RSYRFYTGTQVYGFGHGLSYTNYS 666
           +L MT +P +  ++P+  N  N         P     SY     + +Y FG+GLSYT++ 
Sbjct: 602 KLVMT-FPRNVGQIPIFYNVKNTGRPFDEDNPADYRSSYIDCPNSPLYPFGYGLSYTSFE 660

Query: 667 YKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDV 726
           Y             + K  S K  L++ G                    V + VTN G +
Sbjct: 661 YD------------NAKISSKK--LERGGI-----------------LTVSVDVTNTGTM 689

Query: 727 DGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
           DG  VV L+      S   P K+L GF ++H     +K + F +D  E+L + N
Sbjct: 690 DGEEVVQLYIHDKVGSVVRPVKELKGFKKIHLKKGETKTVEFTIDE-ERLKMYN 742


>gi|423250669|ref|ZP_17231684.1| hypothetical protein HMPREF1066_02694 [Bacteroides fragilis
           CL03T00C08]
 gi|423253995|ref|ZP_17234925.1| hypothetical protein HMPREF1067_01569 [Bacteroides fragilis
           CL03T12C07]
 gi|392651626|gb|EIY45288.1| hypothetical protein HMPREF1066_02694 [Bacteroides fragilis
           CL03T00C08]
 gi|392654553|gb|EIY48200.1| hypothetical protein HMPREF1067_01569 [Bacteroides fragilis
           CL03T12C07]
          Length = 859

 Score =  246 bits (628), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 215/759 (28%), Positives = 338/759 (44%), Gaps = 123/759 (16%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           LTL  ++Q+     +   RLGIP +   +ESLHG   +G            T FPQ +  
Sbjct: 95  LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
            ++FN  L   + SA+A E  A     Q       P I++ RD RWGR +E  GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELSA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPYLV 194

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S   V  V+                G+ + +V                S   KH  A+  
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222

Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
            + G N +  S      +++    +   F + +++ K   +M SYN  N  P      L 
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278

Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
            +  R+ W F+GY+ SD  A+  +  +    +   ++A   L AG+D           Q 
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
            ++ G +  K ID+A+  + + +  +GLF    P +  Y K+    V    H  LA   A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394

Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
            + IVLL+N+   LPL  N + S+A+IGP  + +    G YT    +    +LL  L+  
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452

Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
           V      +YA GC D+  +  +GF EAV +AKK+D  IVV G        D +  T  E 
Sbjct: 453 VGNQLTLNYAKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            D   L L G Q  LV ++  T K PVI+VL  G P  +S+ + +  I  I+   YPGE 
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGIVVQWYPGEQ 568

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
           G  ALA+++ G  NP G+L  + +P+S       +  +P +    R+  S+  PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
            +   ++ FGHGLSYT++ Y                       L  T S+ DY       
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------- 657

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           +C  +   V I++ N GD DG  V  ++ R    S   P ++L GF++V      +K++ 
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVIPVQELKGFEKVLIKKGETKQVI 716

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             + P  +L++ NK  ++++  G   L +G     + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754


>gi|416424053|ref|ZP_11691321.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315996572]
 gi|416434178|ref|ZP_11697512.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-1]
 gi|416438422|ref|ZP_11699509.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-3]
 gi|416447728|ref|ZP_11705981.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-4]
 gi|416450525|ref|ZP_11707600.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-1]
 gi|416460485|ref|ZP_11714793.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-2]
 gi|416470222|ref|ZP_11718747.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 531954]
 gi|416476726|ref|ZP_11721214.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. NC_MB110209-0054]
 gi|416508101|ref|ZP_11735884.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB31]
 gi|416513698|ref|ZP_11738019.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. ATCC BAA710]
 gi|416540687|ref|ZP_11750492.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 19N]
 gi|416561671|ref|ZP_11761668.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 42N]
 gi|416575337|ref|ZP_11768369.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 81038-01]
 gi|416594582|ref|ZP_11780414.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 414877]
 gi|416602160|ref|ZP_11785217.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 366867]
 gi|416607231|ref|ZP_11788413.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 413180]
 gi|416611818|ref|ZP_11791047.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 446600]
 gi|416623896|ref|ZP_11797678.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609458-1]
 gi|416630098|ref|ZP_11800505.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556150-1]
 gi|416636573|ref|ZP_11803136.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609460]
 gi|416652084|ref|ZP_11811486.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. 507440-20]
 gi|416655624|ref|ZP_11812629.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556152]
 gi|416669115|ref|ZP_11819140.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB101509-0077]
 gi|416700997|ref|ZP_11829262.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB110209-0055]
 gi|416705382|ref|ZP_11830863.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB111609-0052]
 gi|416713638|ref|ZP_11837193.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009083312]
 gi|416720123|ref|ZP_11841889.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009085258]
 gi|416721559|ref|ZP_11842724.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315731156]
 gi|416729877|ref|ZP_11848338.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2009159199]
 gi|416738151|ref|ZP_11853179.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008282]
 gi|416744242|ref|ZP_11856524.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008283]
 gi|416759732|ref|ZP_11864557.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008284]
 gi|416760387|ref|ZP_11864780.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008285]
 gi|416767537|ref|ZP_11869997.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008287]
 gi|418484908|ref|ZP_13053898.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 80959-06]
 gi|418492113|ref|ZP_13058613.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035278]
 gi|418496941|ref|ZP_13063366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035318]
 gi|418500305|ref|ZP_13066703.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035320]
 gi|418503319|ref|ZP_13069684.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035321]
 gi|418509304|ref|ZP_13075600.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035327]
 gi|418512924|ref|ZP_13079159.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Pomona str. ATCC 10729]
 gi|418527981|ref|ZP_13093934.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008286]
 gi|452119629|ref|YP_007469877.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Javiana str. CFSAN001992]
 gi|322615194|gb|EFY12116.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315996572]
 gi|322617785|gb|EFY14681.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-1]
 gi|322624621|gb|EFY21452.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-3]
 gi|322626928|gb|EFY23724.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-4]
 gi|322634114|gb|EFY30850.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-1]
 gi|322635695|gb|EFY32405.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-2]
 gi|322640170|gb|EFY36834.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 531954]
 gi|322646407|gb|EFY42919.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. NC_MB110209-0054]
 gi|322661471|gb|EFY57695.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 19N]
 gi|322665641|gb|EFY61825.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 81038-01]
 gi|322671171|gb|EFY67299.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 414877]
 gi|322675188|gb|EFY71265.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 366867]
 gi|322680797|gb|EFY76832.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 413180]
 gi|322686972|gb|EFY82949.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 446600]
 gi|323192736|gb|EFZ77963.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609458-1]
 gi|323198832|gb|EFZ83931.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556150-1]
 gi|323205158|gb|EFZ90136.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609460]
 gi|323213417|gb|EFZ98214.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556152]
 gi|323215774|gb|EGA00517.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB101509-0077]
 gi|323223860|gb|EGA08163.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB110209-0055]
 gi|323231220|gb|EGA15335.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB111609-0052]
 gi|323233654|gb|EGA17746.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009083312]
 gi|323237724|gb|EGA21784.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009085258]
 gi|323245707|gb|EGA29701.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315731156]
 gi|323248929|gb|EGA32853.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2009159199]
 gi|323253000|gb|EGA36833.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008282]
 gi|323258676|gb|EGA42338.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008283]
 gi|323259508|gb|EGA43143.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Montevideo str. IA_2010008284]
 gi|323268291|gb|EGA51766.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008285]
 gi|323271920|gb|EGA55336.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008287]
 gi|363552716|gb|EHL36999.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB31]
 gi|363567278|gb|EHL51278.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. ATCC BAA710]
 gi|363573864|gb|EHL57738.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 42N]
 gi|366055944|gb|EHN20277.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035318]
 gi|366057212|gb|EHN21516.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Montevideo str. 80959-06]
 gi|366058491|gb|EHN22779.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035278]
 gi|366069158|gb|EHN33284.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035320]
 gi|366073161|gb|EHN37236.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035321]
 gi|366078494|gb|EHN42495.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035327]
 gi|366082861|gb|EHN46791.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Pomona str. ATCC 10729]
 gi|366827184|gb|EHN54093.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. 507440-20]
 gi|372204105|gb|EHP17636.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008286]
 gi|451908633|gb|AGF80439.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Javiana str. CFSAN001992]
          Length = 765

 Score =  246 bits (628), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736


>gi|437730260|ref|ZP_20831186.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 648901 16-16]
 gi|435289521|gb|ELO66481.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 648901 16-16]
          Length = 764

 Score =  246 bits (627), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736


>gi|437849166|ref|ZP_20847216.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 6.0562-1]
 gi|435338617|gb|ELP07830.1| beta-D-glucoside glucohydrolase, partial [Salmonella enterica
           subsp. enterica serovar Enteritidis str. 6.0562-1]
          Length = 792

 Score =  246 bits (627), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 113 RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 156

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 157 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 210

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 211 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 244

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 245 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 299

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 300 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 358

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 359 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 413

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 414 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 471

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 472 GIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 531

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 532 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 588

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 589 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 647

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 648 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 692

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 693 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 735

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 736 EKITLKPGERKTVSFPID-IEALKFWNQQ 763


>gi|60682370|ref|YP_212514.1| hydrolase [Bacteroides fragilis NCTC 9343]
 gi|60493804|emb|CAH08594.1| putative exported hydrolase [Bacteroides fragilis NCTC 9343]
          Length = 859

 Score =  246 bits (627), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 214/759 (28%), Positives = 338/759 (44%), Gaps = 123/759 (16%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           LTL  ++Q+     +   RLGIP +   +ESLHG   +G            T FPQ +  
Sbjct: 95  LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
            ++FN  L   + SA+A E  A     Q       P I++ RD RWGR +E  GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELSA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPFLV 194

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S   V  V+                G+ + +V                S   KH  A+  
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222

Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
            + G N +    + +  +++    +   F + +++ K   +M SYN  N  P      L 
Sbjct: 223 PQGGLNLA----SVLCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278

Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
            +  R+ W F+GY+ SD  A+  +  +    +   ++A   L AG+D           Q 
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
            ++ G +  K ID+A+  + + +  +GLF    P +  Y K+    V    H  LA   A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394

Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
            + IVLL+N    LPL  N + S+A+IGP  + +    G YT    +    +LL  L+  
Sbjct: 395 EESIVLLQNKNNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452

Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
           V      +YA GC D+  +  +GF EAV +AKK+D  IVV G        D +  T  E 
Sbjct: 453 VGNQLTLNYAKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            D   L L G Q  LV ++  T K PVI+VL  G P  +S+ + +  I  I+   YPGE 
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGIVVQWYPGEQ 568

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
           G  ALA+++ G  NP G+L  + +P+S       +  +P +    R+  S+  PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
            +   ++ FGHGLSYT++ Y                       L  T S+ DY       
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------- 657

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           +C  +   V I++ N GD DG  V  ++ R    S   P ++L GF++V      +K++ 
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVI 716

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             + P  +L++ NK  ++++  G   L +G     + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754


>gi|378984729|ref|YP_005247884.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. T000240]
 gi|312913157|dbj|BAJ37131.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. T000240]
          Length = 765

 Score =  246 bits (627), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 209/749 (27%), Positives = 344/749 (45%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +                     P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQEMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736


>gi|153809292|ref|ZP_01961960.1| hypothetical protein BACCAC_03604 [Bacteroides caccae ATCC 43185]
 gi|149128062|gb|EDM19283.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           caccae ATCC 43185]
          Length = 946

 Score =  246 bits (627), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 207/735 (28%), Positives = 335/735 (45%), Gaps = 115/735 (15%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 153 RLGIPT-DFTNEGIRGVES-----------YKATNFPTQLGLGHTWNRQLIHQVGLITGR 200

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   +E V+  Q  +  
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQHNH-- 252

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         ++A  KH IAY   K         +  ++ 
Sbjct: 253 -----------------------------QVAATGKHFIAYSNNKGAREGMARVDPQMSP 283

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDC 327
           ++ E     PF+  I +     +M SYN  +G P       L  + R E GF+GY+ SD 
Sbjct: 284 REVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGEMGFRGYVVSDS 343

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDR 383
           DAV  ++      K  +++    ++AG+++ C        +   +  + +G + E+ I+ 
Sbjct: 344 DAVEYLYTKHGTAKDMKEAVRQSVEAGLNVRCTFRSPDSYVLPLRELVKEGGLSEEVIND 403

Query: 384 ALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLP 443
            + ++  V+  +GLF+   +    G    ++V   E++++AL A+R+ IVLLKN+K  LP
Sbjct: 404 RVRDILRVKFLVGLFDTPYQTDLKG--ADEEVEKKENEEVALQASRESIVLLKNEKNVLP 461

Query: 444 LNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDVP 500
           L+ + +  +A+ GP  +  S     Y  +     S+L+G++  +   +   Y  GC  V 
Sbjct: 462 LDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKADVLYTKGCDLVD 521

Query: 501 CN--------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
            N                    +AV  AK+AD  IVV G       E++ R SL LPG+Q
Sbjct: 522 ANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENKSRSSLDLPGRQ 581

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
           + L+ +V  T K PV+LVL  G PL +++A  D  + +IL   YPG  G  A+A+I+FGD
Sbjct: 582 LDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKGGIAVADILFGD 638

Query: 607 FNPGGRLPMTWYPESFTKVPMN---DMNMRADSSRQYPGRSYRFYTGT-QVYGFGHGLSY 662
           +NPGG+L +T +P++  ++P N     + + D  +  PG           +Y FG+GLSY
Sbjct: 639 YNPGGKLTVT-FPKTVGQIPFNFPCKPSSQIDGGKN-PGPDGNMSRANGALYPFGYGLSY 696

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L IS ++   + K                          +V   VTN
Sbjct: 697 TTFEY------SDLKISPAIITPNQKA-------------------------YVTCKVTN 725

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            G   G  V+ L+ R    S  T EK L GF+RVH     +KEI+F +D  + L + N  
Sbjct: 726 TGKRSGDEVIQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEITFPID-RKALELLNAD 784

Query: 783 GRRILPLGNHVLMVG 797
              ++  G+  LM+G
Sbjct: 785 MHWVVEPGDFTLMLG 799


>gi|194449077|ref|YP_002046219.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Heidelberg str. SL476]
 gi|205359117|ref|ZP_02667180.2| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Heidelberg str. SL486]
 gi|386591978|ref|YP_006088378.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Heidelberg str. B182]
 gi|417384271|ref|ZP_12149694.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Johannesburg str. S5-703]
 gi|194407381|gb|ACF67600.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Heidelberg str. SL476]
 gi|205338504|gb|EDZ25268.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Heidelberg str. SL486]
 gi|353609401|gb|EHC62716.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Johannesburg str. S5-703]
 gi|383799022|gb|AFH46104.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Heidelberg str. B182]
          Length = 755

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 207/749 (27%), Positives = 345/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 76  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP +   ++       H+K 
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 376

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 699 EKITLKPGERKTVSFPID-IEALKFWNQQ 726


>gi|383761254|ref|YP_005440236.1| beta-xylosidase [Caldilinea aerophila DSM 14535 = NBRC 104270]
 gi|381381522|dbj|BAL98338.1| beta-xylosidase [Caldilinea aerophila DSM 14535 = NBRC 104270]
          Length = 790

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 226/830 (27%), Positives = 350/830 (42%), Gaps = 198/830 (23%)

Query: 47  PHFDSYPFCNTSLSISTRAKSLISLLTLQEK----------------------------- 77
           P   +YP+ N SL +  R   LI  +T++EK                             
Sbjct: 4   PTKTNYPYQNASLPVEERVNDLIGRMTIEEKVAQLGSLWIYEIADGQGLNRAWAQARMAN 63

Query: 78  ----IQQLSDNASAIP------------------RLGIPAYEWWSESLHGIASNGPGVNF 115
               + +L+  ++  P                  RLGIPA     E   G  + G     
Sbjct: 64  GLGQVTRLAGGSTLGPVETAKLANQIQKFLIEETRLGIPAL-IHDECCSGFLAKG----- 117

Query: 116 NGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDP 175
                  T+FPQ++  A+++   L   +   +  + RA+      GL   AP ++I RDP
Sbjct: 118 ------ATNFPQIIGVASTWEPELVEAMTRVIRQQMRAVGV--HHGL---APVLDIARDP 166

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGR +ET GEDP + S   V +++  QGE+W   DG +  G                  
Sbjct: 167 RWGRTEETFGEDPYLTSTMGVAYIRGLQGESWA--DGVMATG------------------ 206

Query: 236 ELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCS 294
                   KH + Y   + G N++     A IT ++  + +  PF + +   + + IM +
Sbjct: 207 --------KHFVGYSASEGGLNWA----PAHITPRELREVYLAPFEAAVRAARLASIMPA 254

Query: 295 YNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           Y++++G P +  R  + +  R+EWGF G + SD  A+  +  Y    +    +A   L A
Sbjct: 255 YHEIDGEPCSSSRWLMTELLRHEWGFDGLVVSDYMAINQLHNYHRLARDKTQAAKLALAA 314

Query: 354 GMDINC------GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG---DPRK 404
           GMD+        G  ++     A+  G+V  + IDR++  + + +   GLF     +P  
Sbjct: 315 GMDLELPNIDAYGQALI----DAVHAGEVPLEWIDRSVRRILTFKFAFGLFEHPYVEP-- 368

Query: 405 GKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ 464
               +  P+   T   ++LA   AR+ +VLLKN+   LPL+K  +SS+A+IGP  +    
Sbjct: 369 ----EAVPEVFNTPPQRELARTIARKSLVLLKNEGNLLPLSKQ-LSSIAVIGPNADTKRN 423

Query: 465 MGGGYT-------------------------GIPCSPKSLLRGLEAY------VSKTHYA 493
           + G Y+                         G+     S+L  +EA        ++  +A
Sbjct: 424 LLGDYSYPAHIETLITLRQLGFSEHPLPESVGLVDDYGSMLSIVEAIRQVVSPETQVQHA 483

Query: 494 SGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----LDLTQETEDRDRVSLLLPGQQMS 548
            GC D+   S  GF EAV  A+KAD  IVV G     +      E RD   L LPG Q +
Sbjct: 484 KGC-DILSTSTEGFAEAVEAARKADVAIVVVGDKAGLIPECTSGEFRDSAHLTLPGVQQA 542

Query: 549 LVTSVARTSKRPVILVLTGGGPLDV-SFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
           LV ++  T   PV+LVL  G P  +    EA   +    W+  PG  GA ALAE++FGD 
Sbjct: 543 LVEAILATGT-PVVLVLVTGRPYAIPQLVEAAPAVIEA-WL--PGAEGAPALAEVLFGDV 598

Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
           NPGG+LP+T+      +VP+   + R   +R +    Y   +   ++ FG+GLSYT +++
Sbjct: 599 NPGGKLPITFL-RHVGQVPLFYAH-RPSGARSFFYGPYMDESNEPLFPFGYGLSYTQFAF 656

Query: 668 KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
           + L+   E T                                T     V + V N G   
Sbjct: 657 ENLTVTPEAT-------------------------------TTDGEVQVSVEVVNIGTRA 685

Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLS 777
           G  VV L+ RV   S   P K+L GF RVH      K + F + P E L+
Sbjct: 686 GEEVVQLYTRVDGASVTRPVKELKGFKRVHLDPGERKRVHFTL-PVELLA 734


>gi|197263061|ref|ZP_03163135.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA23]
 gi|207857601|ref|YP_002244252.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Enteritidis str. P125109]
 gi|421359420|ref|ZP_15809713.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|421364010|ref|ZP_15814248.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|421367050|ref|ZP_15817252.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|421373030|ref|ZP_15823175.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|421375395|ref|ZP_15825508.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|421382177|ref|ZP_15832228.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|421386891|ref|ZP_15836897.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|421391230|ref|ZP_15841201.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|421395600|ref|ZP_15845536.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|421397961|ref|ZP_15847870.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|421402484|ref|ZP_15852342.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|421409068|ref|ZP_15858863.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|421411752|ref|ZP_15861516.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|421418166|ref|ZP_15867872.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|421422724|ref|ZP_15872392.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|421424918|ref|ZP_15874555.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|421433210|ref|ZP_15882778.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|421434152|ref|ZP_15883702.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|421441895|ref|ZP_15891355.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|421443218|ref|ZP_15892660.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|436600421|ref|ZP_20512957.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|436686654|ref|ZP_20517957.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|436802328|ref|ZP_20525319.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|436809476|ref|ZP_20528856.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|436814772|ref|ZP_20532323.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|436844194|ref|ZP_20537952.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|436854476|ref|ZP_20544110.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|436855800|ref|ZP_20544925.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|436864301|ref|ZP_20550268.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|436870232|ref|ZP_20554038.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|436877723|ref|ZP_20558651.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|436886662|ref|ZP_20563082.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|436893559|ref|ZP_20567466.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|436901307|ref|ZP_20572217.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|436912656|ref|ZP_20578485.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|436920338|ref|ZP_20582934.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|436926675|ref|ZP_20586501.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|436936605|ref|ZP_20592045.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|436940618|ref|ZP_20594562.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|436950717|ref|ZP_20599772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|436961959|ref|ZP_20605333.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|436969147|ref|ZP_20608268.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|436976823|ref|ZP_20612073.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|436989589|ref|ZP_20616596.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|437001001|ref|ZP_20620797.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|437022365|ref|ZP_20628334.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|437036112|ref|ZP_20633844.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|437044148|ref|ZP_20637101.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|437052068|ref|ZP_20641628.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|437059125|ref|ZP_20645972.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|437064832|ref|ZP_20648606.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|437077112|ref|ZP_20655320.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|437083638|ref|ZP_20659292.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|437092016|ref|ZP_20663616.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|437113855|ref|ZP_20669057.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|437122048|ref|ZP_20672085.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|437128670|ref|ZP_20675357.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|437139171|ref|ZP_20681653.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|437144157|ref|ZP_20684771.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|437151251|ref|ZP_20689128.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|437162727|ref|ZP_20696289.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|437169975|ref|ZP_20700070.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|437179039|ref|ZP_20705157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|437181201|ref|ZP_20706372.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|437239027|ref|ZP_20714239.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13183-1]
 gi|437261953|ref|ZP_20718699.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|437267405|ref|ZP_20721157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|437280676|ref|ZP_20728053.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|437289916|ref|ZP_20731294.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|437311898|ref|ZP_20736006.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|437330435|ref|ZP_20741599.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|437346849|ref|ZP_20747003.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|437411031|ref|ZP_20752807.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|437442646|ref|ZP_20757878.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|437461006|ref|ZP_20761959.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|437479748|ref|ZP_20768095.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|437494385|ref|ZP_20772414.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|437505366|ref|ZP_20775420.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|437535206|ref|ZP_20781440.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648899 3-17]
 gi|437559607|ref|ZP_20785823.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|437572273|ref|ZP_20789035.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|437583575|ref|ZP_20792569.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|437599978|ref|ZP_20797137.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|437616848|ref|ZP_20802600.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|437631373|ref|ZP_20806367.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|437662149|ref|ZP_20813366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|437676329|ref|ZP_20816941.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|437695929|ref|ZP_20822252.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|437711846|ref|ZP_20826864.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|437813102|ref|ZP_20841687.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|438082703|ref|ZP_20857889.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|438099351|ref|ZP_20863367.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|438109484|ref|ZP_20867447.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|445165173|ref|ZP_21394056.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|445182314|ref|ZP_21398520.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|445229380|ref|ZP_21405024.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|445333762|ref|ZP_21414961.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 18569]
 gi|445344304|ref|ZP_21417576.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|445361577|ref|ZP_21423869.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
 gi|197241316|gb|EDY23936.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA23]
 gi|206709404|emb|CAR33745.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Enteritidis str. P125109]
 gi|395985043|gb|EJH94216.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|395985485|gb|EJH94655.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|395989705|gb|EJH98839.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|395998658|gb|EJI07685.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|395999277|gb|EJI08299.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|396005381|gb|EJI14360.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|396011532|gb|EJI20442.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|396012239|gb|EJI21137.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|396012640|gb|EJI21536.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|396025999|gb|EJI34772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|396032021|gb|EJI40746.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|396032157|gb|EJI40881.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|396039333|gb|EJI47961.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|396042039|gb|EJI50662.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|396045253|gb|EJI53847.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|396049426|gb|EJI57969.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|396058514|gb|EJI66975.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|396060758|gb|EJI69199.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|396062572|gb|EJI70983.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|396071002|gb|EJI79329.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|434957421|gb|ELL51060.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|434967293|gb|ELL60128.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|434972888|gb|ELL65276.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|434975403|gb|ELL67694.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|434978780|gb|ELL70772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|434983279|gb|ELL75087.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|434992014|gb|ELL83484.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|434995336|gb|ELL86652.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|435002560|gb|ELL93625.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|435003859|gb|ELL94860.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|435008607|gb|ELL99430.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|435011905|gb|ELM02608.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|435018555|gb|ELM09017.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|435020741|gb|ELM11130.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|435026901|gb|ELM17032.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|435027857|gb|ELM17949.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|435036516|gb|ELM26335.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|435039443|gb|ELM29224.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|435045523|gb|ELM35151.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|435051098|gb|ELM40602.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|435051184|gb|ELM40686.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|435059248|gb|ELM48538.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|435071156|gb|ELM60106.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|435071299|gb|ELM60247.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|435074446|gb|ELM63278.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|435075555|gb|ELM64369.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|435076993|gb|ELM65767.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|435081352|gb|ELM69994.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|435091470|gb|ELM79861.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|435094938|gb|ELM83277.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|435100574|gb|ELM88742.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|435103950|gb|ELM92024.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|435107381|gb|ELM95366.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|435112922|gb|ELN00787.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|435116153|gb|ELN03904.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|435124194|gb|ELN11661.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|435128186|gb|ELN15537.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|435132693|gb|ELN19891.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|435139148|gb|ELN26152.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|435142647|gb|ELN29534.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|435145134|gb|ELN31963.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|435147755|gb|ELN34507.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|435152456|gb|ELN39086.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|435162324|gb|ELN48508.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|435166749|gb|ELN52715.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|435170164|gb|ELN55920.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|435175921|gb|ELN61323.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|435182180|gb|ELN67212.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|435182531|gb|ELN67539.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13183-1]
 gi|435183030|gb|ELN68005.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|435189277|gb|ELN73922.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|435189619|gb|ELN74243.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|435204236|gb|ELN87933.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|435211340|gb|ELN94479.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|435219425|gb|ELO01787.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|435220698|gb|ELO02980.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|435227283|gb|ELO08792.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|435235423|gb|ELO16226.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|435238918|gb|ELO19527.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|435240567|gb|ELO20958.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648899 3-17]
 gi|435247913|gb|ELO27842.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|435261576|gb|ELO40730.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|435262896|gb|ELO41978.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|435263473|gb|ELO42520.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|435267795|gb|ELO46460.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|435276722|gb|ELO54719.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|435277229|gb|ELO55183.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|435283446|gb|ELO61011.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|435288066|gb|ELO65157.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|435297184|gb|ELO73479.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|435320361|gb|ELO93000.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|435325905|gb|ELO97749.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|435332633|gb|ELP03544.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|444865473|gb|ELX90243.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|444867011|gb|ELX91716.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|444871369|gb|ELX95805.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|444875449|gb|ELX99648.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 18569]
 gi|444880476|gb|ELY04551.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|444884583|gb|ELY08407.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
          Length = 765

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736


>gi|421449803|ref|ZP_15899183.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
 gi|396068619|gb|EJI76965.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
          Length = 765

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 346/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736


>gi|224536087|ref|ZP_03676626.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522306|gb|EEF91411.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 791

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 209/748 (27%), Positives = 331/748 (44%), Gaps = 135/748 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      +++N+ L + IG    V
Sbjct: 142 RLGIPV-DFTNEGIRGLCHD-----------RATFFPSQSGQGSTWNKELIARIGE---V 186

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           EA+    LG   +  ++P ++I +DPRWGR  E  GEDP +V     + ++S Q      
Sbjct: 187 EAKEAIALGYTNI--YSPILDICQDPRWGRSVECYGEDPYLVGQLGKQMIQSLQKHR--- 241

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                                        L +  KH   Y +   G   +   +  ++ +
Sbjct: 242 -----------------------------LVSTVKHFAVYSIPVGGRDGKTRTDPHVSPR 272

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           +    +  PFR    +  A  +M SYN  +G P         +  R E+GFKGY+ SD +
Sbjct: 273 EMRTLYLEPFRRAFCEAGALGVMSSYNDYDGEPITSSHHFLTEILRQEYGFKGYVVSDSE 332

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTC----MLRHTQSAIDKGKVQEKDIDRA 384
           AV  I    +      +  A  + AG++I          +   + AI +GKV  + I+  
Sbjct: 333 AVEFITTKHHVVSNEVEGVAQAVNAGLNIRTHFTKPEDFVLPLRQAIKEGKVSPETINSR 392

Query: 385 LLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPL 444
           + ++  ++  LGLF+ +P +G   K     V   EH+++AL+AARQ +VLLKN+ + LPL
Sbjct: 393 VADILRIKFWLGLFD-NPYRGD-EKQEEKIVHCKEHQQVALEAARQSLVLLKNENQLLPL 450

Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCN 502
            K  V S+A+IGP  N  +Q+   Y       K++ +G++  + +T   Y  GC  +  +
Sbjct: 451 KKT-VKSVAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPETEVVYRKGCEIIDSH 509

Query: 503 --------------SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMS 548
                               EAV  A+ A+ V++V G       EDR R SL LPG Q  
Sbjct: 510 FPESEILPFEKTTEEQQMLDEAVAAARNAEVVVLVLGGSELTVREDRSRTSLDLPGHQQE 569

Query: 549 LVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFN 608
           L+ ++  T K P +LVL  G    +++A  +  I +IL   +PGE    A+AE +FGD+N
Sbjct: 570 LMQAIHATGK-PTVLVLLDGRAATINYA--NQYIPAILHAWFPGEFAGTAVAEALFGDYN 626

Query: 609 PGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTG------TQVYG----FGH 658
           PGGRL +T +P+S  ++P                 ++ F  G      T VYG    FG+
Sbjct: 627 PGGRLAVT-FPKSVGQIPF----------------AFPFKPGSDEPCETAVYGALYPFGY 669

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT +SYK L    E       + G    I                 SC         
Sbjct: 670 GLSYTKFSYKNLQITPE-------EQGPQGEI---------------TVSC--------- 698

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            VTN GD  G  VV L+ R    S  T  K L GF+R+      +K+++F + P + L +
Sbjct: 699 EVTNIGDRTGDEVVQLYLRDEVSSVTTYMKVLRGFERITLNPGETKKVTFILTP-QDLGL 757

Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIE 806
            +K+ + ++  G   +M+G     + +E
Sbjct: 758 WDKNNKFVVEPGMFKVMIGAASTDIRLE 785


>gi|16761109|ref|NP_456726.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhi str. CT18]
 gi|29141195|ref|NP_804537.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|213428232|ref|ZP_03360982.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. E02-1180]
 gi|25289432|pir||AD0778 beta-glucosidase (EC 3.2.1.21) - Salmonella enterica subsp.
           enterica serovar Typhi (strain CT18)
 gi|16503407|emb|CAD02546.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi]
 gi|29136821|gb|AAO68386.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. Ty2]
          Length = 765

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 345/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G  V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736


>gi|200387658|ref|ZP_03214270.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Virchow str. SL491]
 gi|199604756|gb|EDZ03301.1| glycosyl hydrolase family 3 N domain protein [Salmonella enterica
           subsp. enterica serovar Virchow str. SL491]
          Length = 765

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 344/749 (45%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +                     P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R  + +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++      SK +SF +D  E L   N+ 
Sbjct: 709 EKLTLKPGESKTVSFPID-IEALKFWNQQ 736


>gi|417419081|ref|ZP_12159904.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Mississippi str. A4-633]
 gi|353619740|gb|EHC70042.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Mississippi str. A4-633]
          Length = 769

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 212/762 (27%), Positives = 346/762 (45%), Gaps = 151/762 (19%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 76  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 321

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434

Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +                     P +  A   EAV+ AK+AD 
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQ 639
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYY 610

Query: 640 Y-------PGRSY----------RFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISA 680
           Y       PGR Y          R++      +Y FG+GLSYT ++       S++T+S+
Sbjct: 611 YSHLNTGRPGRPYNPEKPNKYTSRYFDEANGPLYPFGYGLSYTTFTV------SDVTLSS 664

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
                     +Q+ G         +VT+         + VTN G  +G+ V+ ++ +   
Sbjct: 665 P--------TMQRDG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVT 699

Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
            S   P KQL GF+++       K +SF +D  E L   N+ 
Sbjct: 700 ASMSRPVKQLKGFEKITLKPGERKTVSFPID-IEALKFWNQQ 740


>gi|224583333|ref|YP_002637131.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
 gi|224467860|gb|ACN45690.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Paratyphi C strain RKS4594]
          Length = 765

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 347/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  +A G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILHAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++      SK +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGESKTVSFPID-IEALKFWNQQ 736


>gi|213852437|ref|ZP_03381969.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhi str. M223]
          Length = 721

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 210/749 (28%), Positives = 345/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 42  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 85

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 86  FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 139

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 140 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 173

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 174 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 228

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 229 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 287

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 288 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 342

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 343 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 400

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 401 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 460

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 461 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 517

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 518 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 576

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 577 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 621

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G  V+ ++ +    S   P KQL GF
Sbjct: 622 DG---------KVTAS--------VEVTNTGKREGETVIQMYLQDVTASMSRPVKQLKGF 664

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 665 EKITLKPGERKTVSFPID-IEALKFWNQQ 692


>gi|1483154|dbj|BAA13102.1| T-cell inhibitor(STI) [Salmonella enterica subsp. enterica serovar
           Typhimurium]
          Length = 765

 Score =  246 bits (627), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 209/748 (27%), Positives = 343/748 (45%), Gaps = 137/748 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +                     P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSS 637
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+  + +N     +
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 638 RQYPGRSYRFY---TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
            + P +    Y       +Y FG+GLSYT ++       S+ T+S+          +Q+ 
Sbjct: 621 PEKPNKYTSLYFDKPNGPLYPFGYGLSYTTFTV------SDFTLSSP--------TMQRD 666

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
           G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF+
Sbjct: 667 G---------KVTAS--------VKVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFE 709

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKH 782
           ++       K +SF +D  E L   N+ 
Sbjct: 710 KITLKPGKRKTVSFPID-IEALKFWNQQ 736


>gi|423223731|ref|ZP_17210200.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638106|gb|EIY31959.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 854

 Score =  245 bits (626), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 156/440 (35%), Positives = 229/440 (52%), Gaps = 57/440 (12%)

Query: 61  ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
           +  R   L+S LT++EKI  L   +  I RL IP Y   +E+LHG+   G          
Sbjct: 35  MHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPG---------- 84

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNIN 170
             T FPQ +  AA++N  L   + + ++ EARA +N    G          LTFW+P +N
Sbjct: 85  RFTVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVN 144

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           + RDPRWGR  ET GEDP +       FVK  QG+    DD          R LK     
Sbjct: 145 MARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGD----DD----------RYLK----- 185

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                   + +  KH  A + E     +R+  N  I+E+   + + P F +C++ GK++ 
Sbjct: 186 --------IVSTPKHFAANNEE----HNRFVCNPQISEKQLREYYLPAFEACVKDGKSAS 233

Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           IM +YN +N VP  L   L  K  R +WGFKGY+ SDC   + +     Y KT E +AA 
Sbjct: 234 IMSAYNALNDVPCTLNAWLLTKVLRKDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAAAL 293

Query: 350 VLKAGMDINCGTCMLRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
            +KAG+D+ CG  +      SA  +  V + DID A   +   ++ LGLF+    +  Y 
Sbjct: 294 SIKAGLDLECGDDVYDQPLLSAYRQYMVTDADIDSAAYRVLRARMELGLFDSG-EQNPYT 352

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
           K+ P  + ++EH+++AL+AAR+ IVLLKN KK LPLN   V S+A++G  +N  S   G 
Sbjct: 353 KISPAVIGSAEHQEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVG--INAGSSEFGD 410

Query: 469 YTGIPC-SPKSLLRGLEAYV 487
           Y+G+P  +P S+L+G++  V
Sbjct: 411 YSGLPVIAPISVLQGIKDRV 430



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 77/291 (26%), Positives = 141/291 (48%), Gaps = 45/291 (15%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +  ++ + V+ V G++ + E E +DR  + LP  Q   +  + + +   +++VL 
Sbjct: 592 YGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKVNPN-IVVVLV 650

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L +++   D  I +I+   YPGE+G KA+AE++FGD+NPGGRLP+T+Y       P
Sbjct: 651 AGSSLAINWM--DEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPP 708

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
            +D ++         GR+Y+++ G  +Y FG+GLSYT + Y  L             A  
Sbjct: 709 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYTTFKYSNLQV-----------ADG 750

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
           ++ I                        +V   + N+G   G  V  ++ ++P+  +  P
Sbjct: 751 EEEI------------------------NVSFQLKNSGKYAGDEVAQVYVKLPERDEVMP 786

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            K+L GF+RV   +  +K+++  +             + + P G++ +MVG
Sbjct: 787 IKELKGFERVTLKSGENKKVTLKLRKDLLRYWDEAKDKFVCPSGDYTIMVG 837


>gi|218132023|ref|ZP_03460827.1| hypothetical protein BACEGG_03648 [Bacteroides eggerthii DSM 20697]
 gi|217985783|gb|EEC52123.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 762

 Score =  245 bits (626), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 212/764 (27%), Positives = 349/764 (45%), Gaps = 122/764 (15%)

Query: 59  LSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGT 118
           +S ++   S +S+  +QE +  ++   +   RLGIP     +ESLHG+  +G        
Sbjct: 81  MSYASVFGSRLSVEQMQESMFAINKYMAEHNRLGIPVL-GEAESLHGLIHDG-------- 131

Query: 119 VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWG 178
               T FPQ +  +++FN  +   + + +A EA+A   + Q      +P +++ R+ RWG
Sbjct: 132 ---ATIFPQSIALSSTFNPDITHRVATVIAQEAKAT-GVDQV----LSPVLDLARELRWG 183

Query: 179 RGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELM 238
           R +ET GEDP +V    V +V +F  E                                 
Sbjct: 184 RVEETYGEDPYLVGRMGVAYVSAFNKEG-------------------------------- 211

Query: 239 LSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQV 298
           +    KH +A+     G  +  S      E+D    +  PF+  + +     +M SYN  
Sbjct: 212 VMTTLKHFLAHG-SPTGGLNLASVTG--CERDLRSLYLKPFQDVMREAMPYSVMNSYNSY 268

Query: 299 NGVP-ACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
             VP A     L    R E GFKGYI+SD  +V  +    +  K   D+A   + AG+D+
Sbjct: 269 ESVPVAASHWILDDILRGEMGFKGYISSDWGSVEMLRSLHHTAKDKADAACQAVIAGVDV 328

Query: 358 NCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCT 417
                      S +  G + EK+ID+ +  + + +  +GLF+ D    K   L    V T
Sbjct: 329 EVDGDCYETLDSLVRSGVLPEKEIDKCVSRVLTAKFAMGLFDKD--YTKRANLS-QTVHT 385

Query: 418 SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG-GGYTGIPCSP 476
            E  +LAL AAR+  +L+KN+   LPL+ N + S+A+IGP   N +Q+  G Y     + 
Sbjct: 386 PEAVELALVAARESAILVKNENSLLPLDANKLRSVAVIGP---NAAQVQFGDYMWTNSNE 442

Query: 477 KSL--LRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL----- 526
             +  L+G+EA      K +YA GC ++     +GF +AV  A+ +D  ++  G      
Sbjct: 443 YGITPLQGIEAVTQGKVKINYAKGC-EIHTQDRSGFSQAVTAARNSDVALLFVGAMSGSP 501

Query: 527 ------DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADS 580
                  ++ E+ D   +S  LPG Q +L+ +V  T K P I+VL  G P  + + + + 
Sbjct: 502 GRPWPNSVSGESFDLSDIS--LPGCQEALIRAVKATGK-PTIVVLVAGKPFAIPWVKDNC 558

Query: 581 QISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM------NDMNMRA 634
           +   + W  Y GE   +A+AEI+FG+ NP GRL ++ +P+S   +P+      +D     
Sbjct: 559 EAVIVQW--YGGEQEGRAIAEILFGEVNPSGRLNVS-FPQSTGHLPVFYNYYPSDKGFYH 615

Query: 635 D-SSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
           D  + + PGR Y F +   V+ FGHGLSYT + YK        ++  S K  +D +    
Sbjct: 616 DHGTLEKPGRDYVFSSPDPVWAFGHGLSYTTFKYK--------SMQISNKEFTDDD---- 663

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
                         +C      + + V N G  DG  VV L+      S  TP K+L  F
Sbjct: 664 --------------TC-----EITVEVANTGKRDGKEVVQLYVNDIVSSVVTPVKELRRF 704

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           ++V   A  ++ + F + P ++L++ N   + ++  G+  L VG
Sbjct: 705 EKVFIPAGETRTVKFNL-PIKELALWNTDMKEVVEPGDFELQVG 747


>gi|224535250|ref|ZP_03675789.1| hypothetical protein BACCELL_00111 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224523135|gb|EEF92240.1| hypothetical protein BACCELL_00111 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 786

 Score =  245 bits (626), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 219/742 (29%), Positives = 348/742 (46%), Gaps = 123/742 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L S I    A 
Sbjct: 137 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKELISEIAQVTAE 184

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E  GEDP +V                 
Sbjct: 185 EAKAL------GYTNIYSPILDIAQDPRWGRVVECYGEDPFLV----------------- 221

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
              G +G     KR++KG  +E        L A  KH   Y +   G  +    +  +  
Sbjct: 222 ---GELG-----KRMIKGLQQEG-------LVATPKHFAVYSIPVGGRDAGTRTDPHVAP 266

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
           ++    +  PFR    +  A  +M SYN  +G P  + G    L +  R+EWGFKGY+ S
Sbjct: 267 REMRTLYIEPFRKAFCEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRHEWGFKGYVVS 324

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM----LRHTQSAIDKGKVQEKDI 381
           D +AV  ++          D AA V+ AG+++     +    +R  + AI +GKV  + I
Sbjct: 325 DSEAVEFLYSKHQVAADAVDGAAQVVNAGLNVRTNFTLPENFIRPLRQAISEGKVSMQTI 384

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
           D  + ++  V+  +GLF+ +P KG   K     V + EH+ +++ AA + IVLLKN+   
Sbjct: 385 DSRVADVLRVKFGMGLFD-NPYKGD-AKHPEKVVHSKEHQAVSMRAALESIVLLKNENNI 442

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC--- 496
           LPL+K+ +  +A+IGP  N +  +   Y       K++ +G++ Y+  ++  YA G    
Sbjct: 443 LPLSKD-LKKVAVIGPNANEVQNLICRYGPANAPIKTVYQGIKEYLPDAEVRYAKGTDII 501

Query: 497 ---------HDVPCNSD--AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
                    ++VP + +  A   EAV +AK++D  I+V G +     E+  R +L L G+
Sbjct: 502 DKYFPESELYEVPLDQEEQAMMDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDLCGR 561

Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
           Q  L+ +V  T K PVIL+L  G    +++AE    I  I+   +PGE    A+A+++FG
Sbjct: 562 QEKLLQAVYATGK-PVILLLVDGRAATINWAE--RYIPGIVHAWFPGEFMGDAVAQVLFG 618

Query: 606 DFNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTN 664
           D+NPGG+L +T +P S  ++P        +DS      + +   TGT +Y FG+GLSYT 
Sbjct: 619 DYNPGGKLAVT-FPRSVGQIPFAFPFKPGSDS------KGFVRVTGT-LYPFGYGLSYTT 670

Query: 665 YSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAG 724
           ++Y      S+L I   +       I  Q   +L         SC          V N G
Sbjct: 671 FAY------SDLKIENLV-------IGVQGSVKL---------SC---------KVKNTG 699

Query: 725 DVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR 784
            V G  VV L+      S  T  K L GF+R+H      K I F + P ++L + NK   
Sbjct: 700 KVAGDEVVQLYLHDEMSSVTTYVKVLRGFERIHLEPGEEKVIDFVLTP-QELGLWNKDNH 758

Query: 785 RILPLGNHVLMVGELRHSLTIE 806
            ++  G   +MVG     + ++
Sbjct: 759 FVVEPGTFAVMVGSSSQDIKLQ 780


>gi|419728314|ref|ZP_14255280.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41579]
 gi|419736497|ref|ZP_14263337.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41563]
 gi|419741060|ref|ZP_14267772.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41573]
 gi|419745173|ref|ZP_14271816.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41566]
 gi|419746816|ref|ZP_14273391.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41565]
 gi|421574444|ref|ZP_16020065.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00325]
 gi|421579948|ref|ZP_16025510.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00326]
 gi|421585722|ref|ZP_16031214.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00328]
 gi|381291806|gb|EIC33035.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41563]
 gi|381293004|gb|EIC34177.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41573]
 gi|381299851|gb|EIC40919.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41579]
 gi|381306597|gb|EIC47470.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41566]
 gi|381320690|gb|EIC61232.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41565]
 gi|402521423|gb|EJW28761.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00326]
 gi|402524644|gb|EJW31941.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00325]
 gi|402529572|gb|EJW36805.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00328]
          Length = 765

 Score =  245 bits (626), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 207/749 (27%), Positives = 345/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP +   ++       H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736


>gi|205357644|ref|ZP_02572188.2| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar 4,[5],12:i:- str. CVM23701]
 gi|374981198|ref|ZP_09722528.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. TN061786]
 gi|383496875|ref|YP_005397564.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. 798]
 gi|205330573|gb|EDZ17337.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar 4,[5],12:i:- str. CVM23701]
 gi|321224818|gb|EFX49881.1| Periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. TN061786]
 gi|380463696|gb|AFD59099.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 798]
          Length = 755

 Score =  245 bits (626), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 209/749 (27%), Positives = 344/749 (45%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 76  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434

Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +                     P +  A   EAV+ AK+AD 
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 699 EKITLKPGERKTVSFPID-IEALKFWNQQ 726


>gi|16765496|ref|NP_461111.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. LT2]
 gi|378445601|ref|YP_005233233.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Typhimurium str. D23580]
 gi|378450850|ref|YP_005238209.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. 14028S]
 gi|378700080|ref|YP_005182037.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Typhimurium str. SL1344]
 gi|378989553|ref|YP_005252717.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. UK-1]
 gi|379701394|ref|YP_005243122.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. ST4/74]
 gi|422026439|ref|ZP_16372832.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|422031463|ref|ZP_16377632.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm2]
 gi|427551058|ref|ZP_18928136.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|427567333|ref|ZP_18932851.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|427587543|ref|ZP_18937641.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|427611175|ref|ZP_18942507.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|427634848|ref|ZP_18947401.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|427656473|ref|ZP_18952166.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|427661625|ref|ZP_18957079.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|427670100|ref|ZP_18961878.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
 gi|20141276|sp|Q56078.2|BGLX_SALTY RecName: Full=Periplasmic beta-glucosidase; AltName:
           Full=Beta-D-glucoside glucohydrolase; AltName:
           Full=Cellobiase; AltName: Full=Gentiobiase; AltName:
           Full=T-cell inhibitor; Flags: Precursor
 gi|16420702|gb|AAL21070.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. LT2]
 gi|261247380|emb|CBG25205.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhimurium str. D23580]
 gi|267994228|gb|ACY89113.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. 14028S]
 gi|301158728|emb|CBW18240.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Typhimurium str. SL1344]
 gi|323130493|gb|ADX17923.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. ST4/74]
 gi|332989100|gb|AEF08083.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Typhimurium str. UK-1]
 gi|414017758|gb|EKT01456.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|414018535|gb|EKT02182.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|414020221|gb|EKT03810.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm2]
 gi|414032204|gb|EKT15215.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|414033625|gb|EKT16574.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|414036785|gb|EKT19597.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|414046773|gb|EKT29088.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|414047776|gb|EKT30042.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|414052520|gb|EKT34556.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|414059327|gb|EKT40912.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
          Length = 765

 Score =  245 bits (626), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 209/749 (27%), Positives = 344/749 (45%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +                     P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736


>gi|375001990|ref|ZP_09726330.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
           enterica serovar Infantis str. SARB27]
 gi|353076678|gb|EHB42438.1| glycosyl hydrolase family 3 protein [Salmonella enterica subsp.
           enterica serovar Infantis str. SARB27]
          Length = 751

 Score =  245 bits (626), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 207/749 (27%), Positives = 345/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 72  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 115

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 116 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 169

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 170 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 203

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 204 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 258

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 259 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 317

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP +   ++       H+K 
Sbjct: 318 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 372

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 373 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 430

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 431 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 490

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 491 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 547

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 548 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 606

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 607 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 651

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 652 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 694

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 695 EKITLKPGERKTVSFPID-IEALKFWNQQ 722


>gi|375359159|ref|YP_005111931.1| putative exported hydrolase [Bacteroides fragilis 638R]
 gi|423283738|ref|ZP_17262622.1| hypothetical protein HMPREF1204_02160 [Bacteroides fragilis HMW
           615]
 gi|301163840|emb|CBW23395.1| putative exported hydrolase [Bacteroides fragilis 638R]
 gi|404580776|gb|EKA85484.1| hypothetical protein HMPREF1204_02160 [Bacteroides fragilis HMW
           615]
          Length = 859

 Score =  245 bits (626), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 215/759 (28%), Positives = 338/759 (44%), Gaps = 123/759 (16%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           LTL  ++Q+     +   RLGIP +   +ESLHG   +G            T FPQ +  
Sbjct: 95  LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
            ++FN  L   + SA+A E  A     Q       P I++ RD RWGR +E  GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELTA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPYLV 194

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S   V  V+                G+ + +V                S   KH  A+  
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222

Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
            + G N +  S      +++    +   F + +++ K   +M SYN  N  P      L 
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278

Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
            +  R+ W F+GY+ SD  A+  +  +    +   ++A   L AG+D           Q 
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
            ++ G +  K ID+A+  + + +  +GLF    P +  Y K+    V    H  LA   A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394

Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
            + IVLL+N+   LPL  N + S+A+IGP  + +    G YT    +    +LL  L+  
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452

Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
           V      +YA GC D+  +  +GF EAV +AKK+D  IVV G        D +  T  E 
Sbjct: 453 VGNQLTLNYAKGC-DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            D   L L G Q  LV ++  T K PVI+VL  G P  +S+ + +  I  I+   YPGE 
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGIVVQWYPGEQ 568

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
           G  ALA+++ G  NP G+L  + +P+S       +  +P +    R+  S+  PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
            +   ++ FGHGLSYT++ Y                       L  T S+ DY       
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATTSKEDY------- 657

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           +C  +   V I++ N GD DG  V  ++ R    S   P ++L GF++V      +K++ 
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVI 716

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             + P  +L++ NK  ++++  G   L +G     + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754


>gi|423260853|ref|ZP_17241755.1| hypothetical protein HMPREF1055_04032 [Bacteroides fragilis
           CL07T00C01]
 gi|423266988|ref|ZP_17245970.1| hypothetical protein HMPREF1056_03657 [Bacteroides fragilis
           CL07T12C05]
 gi|387774614|gb|EIK36724.1| hypothetical protein HMPREF1055_04032 [Bacteroides fragilis
           CL07T00C01]
 gi|392697691|gb|EIY90874.1| hypothetical protein HMPREF1056_03657 [Bacteroides fragilis
           CL07T12C05]
          Length = 859

 Score =  245 bits (626), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 215/759 (28%), Positives = 338/759 (44%), Gaps = 123/759 (16%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVS 131
           LTL  ++Q+     +   RLGIP +   +ESLHG   +G            T FPQ +  
Sbjct: 95  LTLMNEVQKYMREKT---RLGIPVFTL-TESLHGSVHDGS-----------TIFPQAIAL 139

Query: 132 AASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVV 191
            ++FN  L   + SA+A E  A     Q       P I++ RD RWGR +E  GEDP +V
Sbjct: 140 GSTFNPILAYEMTSAIAKELSA-----QGITQSLTPVIDVCRDLRWGRVEECFGEDPYLV 194

Query: 192 SAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDL 251
           S   V  V+                G+ + +V                S   KH  A+  
Sbjct: 195 SRMGVSQVR----------------GYLDNQV----------------SPMIKHFGAHGA 222

Query: 252 EKWG-NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF 310
            + G N +  S      +++    +   F + +++ K   +M SYN  N  P      L 
Sbjct: 223 PQGGLNLASVSCG----QRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLM 278

Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQS 369
            +  R+ W F+GY+ SD  A+  +  +    +   ++A   L AG+D           Q 
Sbjct: 279 TELLRDRWDFQGYVYSDWGAIGMLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQ 338

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD-PRKGKYGKLGPDDVCTSEHKKLALDAA 428
            ++ G +  K ID+A+  + + +  +GLF    P +  Y K+    V    H  LA   A
Sbjct: 339 LVENGMLDVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKV----VHAPAHVSLARKIA 394

Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAY 486
            + IVLL+N+   LPL  N + S+A+IGP  + +    G YT    +    +LL  L+  
Sbjct: 395 EESIVLLQNENNILPLQMNKLKSIAVIGPNADQVQ--FGDYTWSRDNKDGVTLLEALKER 452

Query: 487 VSKT---HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-------LDLTQET--ED 534
           V      +YA GC D+  +  +GF EAV +AKK+D  IVV G        D +  T  E 
Sbjct: 453 VGNQLTLNYAKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEG 511

Query: 535 RDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEA 594
            D   L L G Q  LV ++  T K PVI+VL  G P  +S+ + +  I  I+   YPGE 
Sbjct: 512 FDLSDLTLTGVQEDLVEAIHATGK-PVIVVLLSGKPFAMSWIKEN--IPGIVVQWYPGEQ 568

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPES-------FTKVPMNDMNMRADSSRQYPGRSYRF 647
           G  ALA+++ G  NP G+L  + +P+S       +  +P +    R+  S+  PG+ Y F
Sbjct: 569 GGLALADMLLGKVNPSGKLNYS-FPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVF 627

Query: 648 YTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVT 707
            +   ++ FGHGLSYT++ Y                       L  T S+ DY       
Sbjct: 628 SSPKALWAFGHGLSYTDFEY-----------------------LSATISKEDY------- 657

Query: 708 SCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEIS 767
           +C  +   V I++ N GD DG  V  ++ R    S   P ++L GF++V      +K++ 
Sbjct: 658 ACEDV-IEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVI 716

Query: 768 FGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
             + P  +L++ NK  ++++  G   L +G     + I+
Sbjct: 717 IKI-PVSELALYNKEMKKVVEPGAFELQIGRASDDIRIK 754


>gi|365121645|ref|ZP_09338561.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363645135|gb|EHL84409.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 868

 Score =  245 bits (625), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 153/463 (33%), Positives = 231/463 (49%), Gaps = 60/463 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N  LS   RA  L++ +TL+EK  Q+ +N   I RLG+  Y+WW+E+LHGIA  G  
Sbjct: 25  PYKNPELSPDERALDLLNRMTLKEKFAQMHNNTGGIERLGVRPYDWWNEALHGIARAG-- 82

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA---------MYNLGQAGLT 163
                     T FPQ +  AA+F+ +    +   V+ E RA         MYN G  GLT
Sbjct: 83  --------KATVFPQAIGLAATFDDTAVYEMFDMVSDEGRAKYHDFQRKGMYN-GYKGLT 133

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW PNINIFRDPRWGRG ET GEDP + +   +  VK  QG+  +  D            
Sbjct: 134 FWTPNINIFRDPRWGRGMETYGEDPFLTTKMGLAVVKGLQGDGTQKYDKA---------- 183

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                            AC KH   +   +W   +R+S+NA  I+ +D  +T+ P F++ 
Sbjct: 184 ----------------HACAKHYAVHSGPEW---NRHSYNAENISIRDLRETYLPAFKAL 224

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIF---EYQN 338
           + +GK   +MC+YN+  G P C    L     ++EWGF   I SDC A+A  +    ++ 
Sbjct: 225 VTEGKVKEVMCAYNRFEGEPCCSNKTLLINILKDEWGFDDVIVSDCGAIADFYTKGRHET 284

Query: 339 YTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           +    + SA  V+ +G D+ CG         A++KG + E  I+ ++  L   +  LG+F
Sbjct: 285 HASAADASADAVI-SGTDLECGGSYW-ALDEALEKGLITETKINESVFRLLRARFELGMF 342

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           + D     +  +    VC  +HK  AL+ AR+ +VLL N    LPL+K ++  +A++GP 
Sbjct: 343 DDDSLVS-WSSIPYSVVCCDKHKAKALEMARKSMVLLSNKNNTLPLSK-SIKKVAVMGPN 400

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDV 499
            N+   +   Y G P    ++L G++A + +    Y  GC  V
Sbjct: 401 ANDSVMLWANYNGTPDRSVTILEGIKAKLPEGSVIYEKGCDYV 443



 Score =  113 bits (282), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 82/273 (30%), Positives = 127/273 (46%), Gaps = 50/273 (18%)

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
           DV  +    +       K AD +I V G+  + E E+           DR ++ LP  Q 
Sbjct: 583 DVGLSRQIDYKAVAEKVKDADAIIFVGGISSSLEGEEMGVKYPGFRNGDRTNIDLPQVQK 642

Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
           +++ ++  T K PVI VL  G  + +S+   D  + +IL   YPG+ G  A+A+++FGD+
Sbjct: 643 NMMKALKETGK-PVIFVLCSGSTMALSWE--DKNMDAILQAWYPGQEGGTAVADVLFGDY 699

Query: 608 NPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSY 667
           NP GRLP+T+Y  S       + NM         GR+YR++ G  +Y FGHGLSYT +SY
Sbjct: 700 NPAGRLPLTFYASSDDLPDFENYNMSEGQ-----GRTYRYFKGKPLYPFGHGLSYTGFSY 754

Query: 668 KFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVD 727
                             S   + +++ S  D V +              +++ N G  D
Sbjct: 755 ------------------SKAKLNKKSMSVNDSVFL-------------SLNLKNTGLRD 783

Query: 728 GSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
           G  VV ++ R  +  +G P K L G+ RV   A
Sbjct: 784 GDEVVQVYIRNLQDPEG-PSKSLRGYKRVSVKA 815


>gi|256838673|ref|ZP_05544183.1| glycoside hydrolase, family 3 [Parabacteroides sp. D13]
 gi|256739592|gb|EEU52916.1| glycoside hydrolase, family 3 [Parabacteroides sp. D13]
          Length = 758

 Score =  245 bits (625), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 220/778 (28%), Positives = 347/778 (44%), Gaps = 129/778 (16%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C+ P    +   +   +I  R  S++SL+TL+EK+ Q++  +      G P      E+L
Sbjct: 20  CQQPAVSGWKSFSGDKNIERRVDSVLSLMTLEEKVGQMAQYSCNWDVTG-PVMTGDYETL 78

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
                 G   N   TV  V    ++ +S +     +    G  V    R ++ +  A   
Sbjct: 79  LKQGLVGSLFNVY-TVDGVRKMQEMALSESRLKIPVL--FGYDVVHGYRTIFPMPLAESC 135

Query: 164 FW-------------------------APNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
            W                         AP ++I RD RWGR  E  GEDP + S  A   
Sbjct: 136 SWDLERMRKSAAIAADEASASGIAWTFAPMVDISRDARWGRVMEGAGEDPYLGSLIAKAR 195

Query: 199 VKSFQGEN-WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-- 255
           V+ FQG N W+S                         D   + ACCKH  AY   + G  
Sbjct: 196 VEGFQGGNDWRS-----------------------LADVNTVLACCKHFAAYGAAEAGRD 232

Query: 256 -NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDLF 310
            N S  S N ++      + + PP+ +  E G A+  M S+N++NGVP+     L  DL 
Sbjct: 233 YNTSELSQNTLM------NYYMPPYLAAKEAGVAT-FMASFNEINGVPSTGNKWLMTDLL 285

Query: 311 QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL--KAGMDIN-CGTCMLRHT 367
           +K   +WGFKG++ +D      I E   ++    D  AG L   AG+D++  G    ++ 
Sbjct: 286 RK---DWGFKGFVVTDYTG---INEMVAHSIVRNDKEAGELAANAGIDMDMTGGIYSQYL 339

Query: 368 QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDA 427
             ++ +GKV E++IDRA+ ++  ++  LGLF+ DP +    +   + +   E  + A + 
Sbjct: 340 VQSVKEGKVSEENIDRAVASILEMKFLLGLFD-DPYRYLDNEREKNTIMKPEFLQEARET 398

Query: 428 ARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEA 485
           + + IVLLKND  F P++K+   ++A+IGP+V +     G + G     +  SL  GL  
Sbjct: 399 SARSIVLLKNDNNFFPISKDKHITVALIGPMVKDKINQNGEWAGRGEREESISLFEGLTE 458

Query: 486 YVSKTH----YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLL 541
             + T+    YA GC D+  +  + F EA+  A++AD V+   G D     E   R  L 
Sbjct: 459 KYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEAACRTDLK 517

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
           LPG Q +L+  + +T K P+ L+L  G PLD+S+   D  +  IL   Y G      +A+
Sbjct: 518 LPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--EDQHVDGILEAWYLGTMAGHGMAD 574

Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRS--------YRFYTGTQV 653
           +I GD+NP  RL M+ +P +  ++P+     +  + R  P  +        Y     T +
Sbjct: 575 VISGDYNPSARLTMS-FPRTVGQLPL--YYNQKPTGRPVPPEAPDTDYKSRYMDVPNTPL 631

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           Y FG+GLSYT ++              S+K   D+N   + G                 +
Sbjct: 632 YPFGYGLSYTTFAVN------------SMKL--DQNSFTKGG-----------------K 660

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
             V   V N G VDG  VV ++ R    S   P K+L GF++V   A   K++SF +D
Sbjct: 661 ITVTAEVENTGKVDGETVVQMYIRDLAGSVTRPVKELKGFEKVALKAGEKKQVSFTID 718


>gi|218130696|ref|ZP_03459500.1| hypothetical protein BACEGG_02285 [Bacteroides eggerthii DSM 20697]
 gi|217987040|gb|EEC53371.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 858

 Score =  245 bits (625), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 154/450 (34%), Positives = 227/450 (50%), Gaps = 57/450 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N    I  R   L+S LT++EKI  L   +  I RL IP Y   +E+LHG+   G   
Sbjct: 29  YKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPG--- 85

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  AA++N  L   + + ++ EARA +N    G          LT
Sbjct: 86  -------RFTVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLT 138

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FVK  QG               + R 
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGN--------------DSRY 184

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
           LK             + +  KH  A + E     +R+  N  I+E+   + + P F +C+
Sbjct: 185 LK-------------IVSTPKHFAANNEE----HNRFVCNPQISEKQLREYYLPAFEACV 227

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++GK++ IM +YN +N VP  L   L  K  R +WGFKGY+ SDC   A +     Y KT
Sbjct: 228 KEGKSASIMSAYNALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVKT 287

Query: 343 HEDSAAGVLKAGMDINCGTCMLRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   +KAG+D+ CG  +      +A  +  V + DID A   +   +++LGLF+  
Sbjct: 288 KEAAATLSIKAGLDLECGDDVYDAPLLNAYRQYMVTDADIDSAAYRVLRARMQLGLFDSG 347

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
                Y K+ P  + + EH+K+ALDAAR+ IVLLKN  K LPL+   + S+A++G  +N 
Sbjct: 348 -ENNPYTKISPKVIGSKEHQKVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVG--INA 404

Query: 462 ISQMGGGYTGIPC-SPKSLLRGLEAYVSKT 490
                G Y+G+P  +P S+L+G++  V  T
Sbjct: 405 GRSEFGDYSGLPVIAPVSILQGIKDRVGDT 434



 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 79/291 (27%), Positives = 144/291 (49%), Gaps = 45/291 (15%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA R+ ++ + V+ V G++   E E +DR  + LP  Q   +  + + +   +++VL 
Sbjct: 593 YGEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVLV 651

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L +++   D  I +I+   YPGE+G KA+AE++FGD+NPGGRLP+T+Y       P
Sbjct: 652 AGSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPP 709

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
            +D ++         GR+Y+++ G  +Y FG+GLSYT++ Y      S+L +       +
Sbjct: 710 FDDYDITK-------GRTYQYFKGNVLYPFGYGLSYTSFKY------SDLQV-------T 749

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
           D N               EV        +V   + N G   G  V  ++ ++P+  +  P
Sbjct: 750 DGN--------------QEV--------NVSFCLKNVGKYAGDEVAQIYVKLPERDKIMP 787

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            K+L GF+R+      S++++  +          +    + P G++ +M+G
Sbjct: 788 IKELKGFERISLKRGESRKVTIRLKKDLLRYWDEEKECFVHPSGDYTIMIG 838


>gi|254445290|ref|ZP_05058766.1| Glycosyl hydrolase family 3 C terminal domain protein
           [Verrucomicrobiae bacterium DG1235]
 gi|198259598|gb|EDY83906.1| Glycosyl hydrolase family 3 C terminal domain protein
           [Verrucomicrobiae bacterium DG1235]
          Length = 730

 Score =  245 bits (625), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 215/741 (29%), Positives = 322/741 (43%), Gaps = 121/741 (16%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF +  L    R   LI+ +TL+EK+  L      IPRL +  Y   SE  HG+A  GP
Sbjct: 27  YPFQDPDLPNEERIDDLITCMTLEEKVD-LMGFVPGIPRLDV-KYTRISEGYHGVAQGGP 84

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYN---LGQAGLTFWAPN 168
                   +  T FPQ    AA+++ +L S + +  A E R +Y      ++GL   APN
Sbjct: 85  SNWGKRNPTPTTQFPQAYGLAATWDPALISRVSANQATEVRYLYQSPKYQRSGLVVMAPN 144

Query: 169 INIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFG 228
            ++ RDPRWGR +E  GEDP +    A  F     G++               R LK   
Sbjct: 145 ADLARDPRWGRTEEVYGEDPFLTGTLAAAFASGLAGDH--------------PRYLKA-- 188

Query: 229 EESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
                      ++  KH +A   E    FS   F+    E+   + +  PF   I  G A
Sbjct: 189 -----------TSLLKHFLANSNEDDRFFSSSDFD----ERLWREYYAKPFEMAIRDGGA 233

Query: 289 SCIMCSYNQVNGVPACLRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHED--- 345
             +M +YN +NG PA +   L      EWG  G I +D   +A +    N  KT+ D   
Sbjct: 234 RSMMAAYNAINGTPAHVHPMLRDIVMGEWGLDGTICTDGGGLAHLV---NQHKTYPDLPT 290

Query: 346 SAAGVLKAGMDINCGTCMLRHTQSAID---KGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
           + A  +KAG+++     +  HTQ+A+D   +  V E +ID  +     + L LGL +  P
Sbjct: 291 ATAACIKAGINL----FLDNHTQAALDAVEQSLVTEAEIDDVIRGRIRLFLDLGLLD-PP 345

Query: 403 RKGKYGKLGPDDVCT----SEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
               Y  +G +         E      +  R+ IVLLKN+   LPL+ + ++S+AI+GPL
Sbjct: 346 ELVPYSNIGHEPGLEPWELPETHAFVREVTRKSIVLLKNENNILPLDPSKINSVAIVGPL 405

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYAS----GCHDVPCNSDAGFHEAVRIA 514
            N  + +   Y+G P        G+E Y +   + S    G + V   SD     A+ +A
Sbjct: 406 AN--TTLLDWYSGTPPYAIPPRDGIEGYANSGPFPSPAKFGSNWVADMSDT----ALEVA 459

Query: 515 KKADFVIVVAG------------LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVI 562
              D  IVV G               ++  E  DR  ++L   Q   +  V   +    I
Sbjct: 460 ASRDVAIVVVGNHPESNAGWGVVTSPSEGKEAVDRQEIILQPDQEEFIQKVYAANPN-TI 518

Query: 563 LVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESF 622
           +VL    P  + +A  ++   +I+ I +  +    ALA+++FGD+NPGG+   TW P+S 
Sbjct: 519 VVLVSNFPYAMPWAAENA--PAIVHITHASQEQGNALADVLFGDYNPGGKTVQTW-PKSL 575

Query: 623 TKV-PMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
            ++ PM D ++R        GR+Y +      Y FG+GLSYT +    L AP +L   A+
Sbjct: 576 DQLPPMMDYDIRR-------GRTYMYSQHEPQYPFGYGLSYTTFELSKLKAPKKLKADAT 628

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
                                             +++ V N G+ DG  VV L+ R P  
Sbjct: 629 AT--------------------------------IKVRVANTGERDGDEVVQLYVRYPNS 656

Query: 742 SQGTPEKQLIGFDRVHTVAKG 762
               P KQL GF RV TV  G
Sbjct: 657 KVERPSKQLKGFQRV-TVPAG 676


>gi|354585466|ref|ZP_09004353.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353188735|gb|EHB54255.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 734

 Score =  245 bits (625), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 207/697 (29%), Positives = 331/697 (47%), Gaps = 106/697 (15%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQE 182
           T FP  L  A+S+   L  +     A E  A        +TF AP  ++ RDPRWGR  E
Sbjct: 108 TVFPVPLGLASSWEPKLAEDTSRVAAAEGAA----SGINVTF-APMADLVRDPRWGRVME 162

Query: 183 TPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSAC 242
           +PGEDP +    A   V+ FQG++ K+ D                           ++AC
Sbjct: 163 SPGEDPYLNGLMAAAMVRGFQGDDLKALD--------------------------TVAAC 196

Query: 243 CKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
            KH  AY   + G      +N + ++E    + + P +++ I+ G A  IM S+N  +GV
Sbjct: 197 VKHFAAYGAAEGGR----DYNTVDMSEAALRNYYLPAYKAGIDAG-AELIMTSFNVYDGV 251

Query: 302 PACLRGDLFQKA-RNEWGFKGYITSDCDAV-ATIFEYQNYTKTHEDSAAGVLKAGMDIN- 358
           PA     L +K  R EWGF+G + SD  ++  TIF     +K  ED+A   L+AG+DI  
Sbjct: 252 PATTNSFLLRKVLREEWGFEGVVISDYTSLWETIFHMS--SKDGEDAAKQGLEAGLDIEM 309

Query: 359 CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP------RKGKYGKLGP 412
             T  + H +  +++G+V    ID A+  + +++ +LGLF+ DP       + K G L P
Sbjct: 310 ISTEYISHLEQLVERGEVDVALIDEAVRRILTLKFKLGLFD-DPYRYINIEREKKGHLKP 368

Query: 413 DDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI 472
                 E+++LA +AA++ +VLLKND   LPL K+ + S+A+IGP  +N  ++ GG++G+
Sbjct: 369 ------EYRQLAREAAQKSMVLLKND-GVLPLKKD-IKSVAVIGPFADN-GRILGGWSGL 419

Query: 473 --PCSPKSLLRGLEAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
             P    ++ +GL   +    K   A+GC D   N  +GF  A+  A  ++ VI+  G +
Sbjct: 420 GKPEEAVTVKQGLINKLGNDVKVTAAAGC-DYSSNDVSGFQAALEAAASSEVVILAMGEE 478

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
                E   R  L LPG Q  LV  V +  K P +LVL  G PL++ +      + +IL 
Sbjct: 479 DHMSGEAGSRAYLTLPGVQPKLVEEVLKLGK-PTVLVLFNGRPLELKWYH--EHVPAILE 535

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRF 647
             +PG  G  A+A+++FGD NP  +L M+ +P +  +VP+    +     + +    +RF
Sbjct: 536 AWFPGTEGGNAVADLLFGDANPSAKLTMS-FPYTVGQVPVYYNCLNTGRPKGHEDNDFRF 594

Query: 648 YT------GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYV 701
            +          Y FG+GLSYT Y+Y                  SD  ++          
Sbjct: 595 LSKYLDIPNAPFYPFGYGLSYTEYTY------------------SDAALIG--------- 627

Query: 702 HIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAK 761
             D +T   S++    ++V N GD  G  +V ++ R    S   P  +L GF ++H    
Sbjct: 628 --DTLTPGGSVK--AMVTVQNTGDYAGEEIVQMYVRDLWGSTSRPLLELKGFHKIHLAPG 683

Query: 762 GSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
               + F ++P E LS      +  +  G+  + VG+
Sbjct: 684 EKATVEFVIEP-EMLSYFTAKKQYEVEEGDFKVFVGK 719


>gi|161613146|ref|YP_001587111.1| hypothetical protein SPAB_00854 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|416529744|ref|ZP_11744511.1| hypothetical protein SEEM010_09961 [Salmonella enterica subsp.
           enterica serovar Montevideo str. LQC 10]
 gi|417466542|ref|ZP_12165045.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Montevideo str. S5-403]
 gi|161362510|gb|ABX66278.1| hypothetical protein SPAB_00854 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|353629977|gb|EHC77663.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Montevideo str. S5-403]
 gi|363552147|gb|EHL36453.1| hypothetical protein SEEM010_09961 [Salmonella enterica subsp.
           enterica serovar Montevideo str. LQC 10]
          Length = 755

 Score =  245 bits (625), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 206/749 (27%), Positives = 343/749 (45%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 76  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLPGL 321

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP +   ++       H+K 
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 376

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434

Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +                     P +  A   EAV+ AK+AD 
Sbjct: 435 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 494

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 611 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 655

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 656 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 698

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 699 EKITLKPGERKTVSFPID-IEALKFWNQQ 726


>gi|317474225|ref|ZP_07933501.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909535|gb|EFV31213.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 858

 Score =  245 bits (625), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 154/450 (34%), Positives = 227/450 (50%), Gaps = 57/450 (12%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + N    I  R   L+S LT++EKI  L   +  I RL IP Y   +E+LHG+   G   
Sbjct: 29  YKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGVVRPG--- 85

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LT 163
                    T FPQ +  AA++N  L   + + ++ EARA +N    G          LT
Sbjct: 86  -------RFTVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLLT 138

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW+P +N+ RDPRWGR  ET GEDP +       FVK  QG               + R 
Sbjct: 139 FWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQGN--------------DSRY 184

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
           LK             + +  KH  A + E     +R+  N  I+E+   + + P F +C+
Sbjct: 185 LK-------------IVSTPKHFAANNEE----HNRFVCNPQISEKQLREYYLPAFEACV 227

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           ++GK++ IM +YN +N VP  L   L  K  R +WGFKGY+ SDC   A +     Y KT
Sbjct: 228 KEGKSASIMSAYNALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVKT 287

Query: 343 HEDSAAGVLKAGMDINCGTCMLRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            E +A   +KAG+D+ CG  +      +A  +  V + DID A   +   +++LGLF+  
Sbjct: 288 KEAAATLSIKAGLDLECGDDVYDAPLLNAYRQYMVTDADIDSAAYRVLRARMQLGLFDSG 347

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
                Y K+ P  + + EH+K+ALDAAR+ IVLLKN  K LPL+   + S+A++G  +N 
Sbjct: 348 -ENNPYTKISPKVIGSKEHQKVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVG--INA 404

Query: 462 ISQMGGGYTGIPC-SPKSLLRGLEAYVSKT 490
                G Y+G+P  +P S+L+G++  V  T
Sbjct: 405 GRSEFGDYSGLPVIAPVSILQGIKDRVGDT 434



 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 78/291 (26%), Positives = 143/291 (49%), Gaps = 45/291 (15%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA R+ ++ + V+ V G++   E E +DR  + LP  Q   +  + + +   +++VL 
Sbjct: 593 YGEAGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKVNPN-IVVVLV 651

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L +++   D  I +I+   YPGE+G KA+AE++FGD+NPGGRLP+T+Y       P
Sbjct: 652 AGSSLSINWM--DEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPP 709

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
            +D ++         GR+Y+++ G  +Y FG+GLSYT++ Y      S+L ++       
Sbjct: 710 FDDYDITK-------GRTYQYFKGNVLYPFGYGLSYTSFKY------SDLQVTEG----- 751

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
                                   +   +V   + N G   G  V  ++ ++P+  +  P
Sbjct: 752 ------------------------NQEVNVSFCLKNVGKYAGDEVAQIYVKLPERDKIMP 787

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            K+L GF+R+     GS++++  +          + G  + P G++ +MVG
Sbjct: 788 IKELKGFERISLKRGGSRKVTIRLKKDLLRYWDEEKGCFVHPSGDYTIMVG 838


>gi|429745624|ref|ZP_19279029.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           380 str. F0488]
 gi|429168470|gb|EKY10301.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           380 str. F0488]
          Length = 770

 Score =  245 bits (625), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 233/827 (28%), Positives = 356/827 (43%), Gaps = 164/827 (19%)

Query: 44  CKPPHF---DSYPFCNTSLS----ISTRAKSLISLLTLQEKIQQLSD------------- 83
           C  P     D+  F   S S    I  R  S++ L+TL+EKI Q++              
Sbjct: 19  CATPAILKSDNEKFTQKSFSGDKHIEQRVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMA 78

Query: 84  ----------------NASAIP-------------RLGIPAYEWWSESLHGIASNGPGVN 114
                           NA+++              RLGIP   +  + +HG         
Sbjct: 79  DKYQPYLEKGLVGSIFNATSVAGIRKLQKIAVEQTRLGIPIL-FGQDVIHGYK------- 130

Query: 115 FNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFR 173
                   T FP  L  + S++ +L        A EA A       G+ + +AP ++I R
Sbjct: 131 --------TIFPIPLAESCSWDLALMRKTAELAAREASA------DGINWTFAPMVDITR 176

Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDGGIGFGFREKRVLKGFGEESD 232
           D RWGR  E  GEDP + S  A   VK FQG +NW+                        
Sbjct: 177 DARWGRAMEGAGEDPYLGSLIAEARVKGFQGGDNWQM----------------------- 213

Query: 233 RGDELMLSACCKHLIAYDLEKWGNFSRYSFN-AMITEQDTEDTFQPPFRSCIEQGKASCI 291
                 L AC KH   Y   + G      +N A ++     + + PP+ + +  G  S I
Sbjct: 214 LSSPHTLLACGKHFAGYGAAESGK----DYNTAELSMHTLRNVYLPPYEATLNAGVGS-I 268

Query: 292 MCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGV 350
           M S N++NGVPA     L  +  R EWGF G + SD      I E   +    +D  A  
Sbjct: 269 MASLNEINGVPATADKWLLTEVLRKEWGFNGLLVSDYTG---INELVRHGVAKDDKQAAN 325

Query: 351 LKAG----MDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK 406
           L A     MD+N G   +++  + + +GKV E  ID+A+ ++  ++  LGLF+ DP +  
Sbjct: 326 LSANAGIEMDMN-GATFIKYLSALVKEGKVTEAQIDKAVRHILEMKFLLGLFD-DPYRYL 383

Query: 407 YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG 466
                 ++  T E+ K+A  A    +VLLKN+ + LP+ K++  ++A+IGP++NN S + 
Sbjct: 384 DETRAKENTFTEEYLKVARQAVASSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDIN 443

Query: 467 GGYT--GIPCSPKSLLRGL-EAYVS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFV 520
           G +T  G      SLL GL E Y     K  YA GC     +++    EAV IA+KAD V
Sbjct: 444 GSWTCLGDGKQSVSLLTGLTEKYKGTNVKLLYAEGCGFTTISTEQ-LKEAVAIARKADRV 502

Query: 521 IVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADS 580
           +V  G   +   E   R  + LP  Q  L+ ++   +K P+ +V   G PLD+S+   + 
Sbjct: 503 LVAVGEQSSWAGESAVRTDIRLPQAQRQLLEALKAINK-PIAIVTFSGRPLDLSW--ENE 559

Query: 581 QISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMN----------DM 630
            + +IL   +PG  G   +A++I GD NP G L M+ +P S  ++P+             
Sbjct: 560 NVQAILQAWFPGTQGGNGIADVIAGDVNPSGHLTMS-FPRSVGQIPIYYNYKSTGRPVYT 618

Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
           N      R +    Y   + T +Y FG+GLSYT ++   +    +     S+K  +D  I
Sbjct: 619 NNEEVDHRPHYNAGYLDSSITPLYPFGYGLSYTTFAISNVHLNKK-----SIKRYNDSII 673

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
                                    V  SV N G  +G  VV L+ R    S   P K+L
Sbjct: 674 -------------------------VNASVQNTGTTEGEIVVQLYTRQLVASVSRPVKEL 708

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            GF ++   A  SK++ F + P E L+  + +G++       +L VG
Sbjct: 709 KGFQKISLKAGESKQVRFEL-PSEALAFYSINGKKDTEPSECLLWVG 754


>gi|146299327|ref|YP_001193918.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
           UW101]
 gi|146153745|gb|ABQ04599.1| Candidate beta-glucosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 743

 Score =  244 bits (624), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 208/698 (29%), Positives = 326/698 (46%), Gaps = 105/698 (15%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFW--APNINIFRDPRWGRG 180
           T+FP  L  AAS++         A+ + AR       A    W  AP ++I RDPRWGR 
Sbjct: 111 TTFPLPLAEAASWDLQ-------AIELAARVAATEASASGIHWTFAPMVDISRDPRWGRV 163

Query: 181 QETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLS 240
            E  GED  + S  A   VK FQG                          +  GD   + 
Sbjct: 164 MEGAGEDTYLGSKIAYARVKGFQG--------------------------NKLGDLNSVM 197

Query: 241 ACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVN 299
           AC KH  AY     G      +N++ ++E+   +T+ PPF++ ++ G A+  M S+N +N
Sbjct: 198 ACVKHFAAYGAGVGGR----DYNSVDMSERMLWETYLPPFKAALDAG-AATFMNSFNDIN 252

Query: 300 GVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
           G+PA     L+ D+    + +W F+G++ SD  ++  +  +  Y+K  +++A   + AG 
Sbjct: 253 GIPATGNAHLQRDIL---KGKWNFQGFVVSDWGSIGEMVAH-GYSKNLKEAAYSAITAGS 308

Query: 356 DINCGTCMLRHTQSAIDK-GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
           D++  +   R+  + + K G+V    ID A+  +   +  LGLF+ DP +    K     
Sbjct: 309 DMDMESNAYRYNLAQLVKEGRVSVDLIDDAVKRILRKKFELGLFD-DPYRYSDEKRAEKA 367

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQ-MGGGYTGIP 473
           +   EH+K ALD A++ IVLLKN+ + LP++K +V ++A IGP+V    + MG     +P
Sbjct: 368 LNNPEHRKAALDVAQKSIVLLKNENQTLPISK-SVKTIAFIGPMVKEYKENMGFWSVELP 426

Query: 474 CSPK-----SLLRGLEAYVSKTH---YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG 525
                    S   GL+  V K     YA GC     N D GF EAV  AK+AD VI+  G
Sbjct: 427 EVDYNKWIVSQWDGLQNKVGKNTKLLYAKGCEIEGTNKD-GFAEAVETAKQADVVILSIG 485

Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
                  E + R  + LPG Q  LV ++  T K PV++++  G PL   F      + ++
Sbjct: 486 ERRDMSGEAKSRSDIHLPGVQEDLVKAIQATGK-PVVVLINAGRPL--VFNWTADNVPAV 542

Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNM----RADSSRQ 639
           ++  + G     A+A ++FGD+NP G+LPMT +P    ++P+  N  +     + ++   
Sbjct: 543 VYTWWLGTEAGNAIANVLFGDYNPSGKLPMT-FPREVGQIPIYYNHFSTGRPAKTENETN 601

Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLD 699
           Y   +Y     +  + FG+GLSYT +SY      S+L +S S K  S++ I         
Sbjct: 602 YVS-AYIDLKNSPKFPFGYGLSYTQFSY------SDLKLS-STKIKSNETI--------- 644

Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
                           V   ++N G V G  V  L+ +    S   P  +L  F++V   
Sbjct: 645 ---------------KVSFKLSNVGKVAGEEVAQLYLKDKFGSVVRPVLELRDFEKVKLN 689

Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           A  SK I F +D  E+LS  N         G+  LM+G
Sbjct: 690 AGESKTIEFTIDK-EKLSFYNDKLEWTTEPGDFELMIG 726


>gi|256819849|ref|YP_003141128.1| glycoside hydrolase [Capnocytophaga ochracea DSM 7271]
 gi|256581432|gb|ACU92567.1| glycoside hydrolase family 3 domain protein [Capnocytophaga
           ochracea DSM 7271]
          Length = 804

 Score =  244 bits (624), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 217/743 (29%), Positives = 335/743 (45%), Gaps = 113/743 (15%)

Query: 78  IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNR 137
           I++L   A    RLGIP   +  + +HG                 T FP  L  + S++ 
Sbjct: 136 IRKLQKIAVEQTRLGIPIL-FGQDVIHGYK---------------TIFPIPLAESCSWDL 179

Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAV 196
           +L        A EA A       G+ + +AP ++I RD RWGR  E  GEDP + S  A 
Sbjct: 180 ALMRKTAELAAREASA------DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAE 233

Query: 197 EFVKSFQG-ENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
             VK FQG +NW++                             L AC KH   Y   + G
Sbjct: 234 ARVKGFQGGDNWQT-----------------------LSSPHTLLACGKHFAGYGAAESG 270

Query: 256 NFSRYSFN-AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA- 313
                 +N A ++     + + PP+ + ++ G  S IM S N++NGVPA     L  +  
Sbjct: 271 K----DYNTAELSMHTLRNVYLPPYEATLKAGVGS-IMASLNEINGVPATADKWLLTEVL 325

Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG--MDINCGTCMLRHTQSAI 371
           R EWGF G + SD   +  +  +    K  +  A     AG  MD+N G   +++  + +
Sbjct: 326 RKEWGFNGLLVSDYTGINELVRH-GVAKDDKQVANLSANAGIEMDMN-GATFIKYLSALV 383

Query: 372 DKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQG 431
            +GKV E  ID+A+ ++  ++  LGLF+ DP +        ++  T E+ K+A  A    
Sbjct: 384 KEGKVTENQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFTEEYLKVARQAVASS 442

Query: 432 IVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGL-EAYVS 488
           +VLLKN+ + LP+ KN+  ++A+IGP++NN S + G +T  G      SLL GL E Y  
Sbjct: 443 VVLLKNEAEALPIKKNSDKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLLTGLTEKYKG 502

Query: 489 ---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQ 545
              K  YA GC     +++    EAV IA+KAD V+V  G   +   E   R  + LP  
Sbjct: 503 TNVKLLYAEGCGFTTISTEQ-LKEAVAIARKADRVLVAVGEQSSWAGESAVRTDIRLPQA 561

Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
           Q  L+ ++   +K P+ ++   G PLD+S+   +  + +IL   +PG  G   +A++I G
Sbjct: 562 QRQLLEALKAINK-PIAIITFSGRPLDLSW--ENENVQAILQAWFPGTQGGNGIADVIAG 618

Query: 606 DFNPGGRLPMTWYPESFTKVPM-----------NDMNMRADSSRQYPGRSYRFYTGTQVY 654
           D NP G L M+ +P S  ++P+           +  N   D    Y    Y   + T +Y
Sbjct: 619 DVNPSGHLTMS-FPRSVGQIPIYYNYKSTGRPVHTNNEEVDHRPHYNA-GYLDSSITPLY 676

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT ++   +    +     S+K  +D  I                        
Sbjct: 677 PFGYGLSYTTFAISNVHLNKK-----SIKRYNDSII------------------------ 707

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
            V  SV N G  +G  VV L+ R    S   P K+L GF ++   A  SK++ F + P E
Sbjct: 708 -VNASVQNTGRTEGEIVVQLYTRQLVASVSRPVKELKGFQKIPLKAGESKQVRFEL-PSE 765

Query: 775 QLSIANKHGRRILPLGNHVLMVG 797
            L+    +G++       +L VG
Sbjct: 766 ALAFYGINGKKDTEPSECLLWVG 788


>gi|441500080|ref|ZP_20982250.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
 gi|441436171|gb|ELR69545.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
          Length = 704

 Score =  244 bits (624), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 213/741 (28%), Positives = 344/741 (46%), Gaps = 117/741 (15%)

Query: 72  LTLQEKIQQLSDNASAIPRLGIPAYEWWSESLH----GIASNGPGVNFNGTVSSV----- 122
           +TL+EKI Q++   S     G    E + E L     G   N   V +N  +  +     
Sbjct: 1   MTLEEKIGQMTLYTSGWDVTGPTLNENYKEDLKAGKVGALFNAHTVKYNSDLQRMAVEET 60

Query: 123 -----------------TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF- 164
                            T FP  L  A+S+N  L        A EA A      +GL + 
Sbjct: 61  RLGIPLLFGYDVIHGHRTIFPLPLAEASSWNLDLIKETARLSAKEAAA------SGLNWT 114

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           + P ++I RDPRWGR  E  GED  + S  A   V+ +QG+                   
Sbjct: 115 FNPMVDIARDPRWGRIAEGSGEDTYLGSLIAKAKVEGYQGD------------------- 155

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
                  D  D   + AC KH  AY   + G   R      ++++   +T+ PP+++ I+
Sbjct: 156 -------DLSDPFTVLACVKHFAAYGASQAG---RDYHTVDMSDRVLRETYLPPYKAAID 205

Query: 285 QGKASCIMCSYNQVNGVPAC-LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
            G A+ +M S+N+++GVPA   R  + +  R EW FKG++ +D  ++  +  +       
Sbjct: 206 AG-AATVMTSFNELHGVPASGSRYLMTEILREEWRFKGFVVTDYTSINEMVPH-GVVANE 263

Query: 344 EDSAAGVLKAGMDINC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDP 402
           +++A   L AG+D++  G     H  + +++GKV EK +D A+  +  ++ RLGLF  DP
Sbjct: 264 KEAADLALNAGVDMDMQGGVYNDHLATLVNEGKVSEKQVDEAVRRILEMKWRLGLFK-DP 322

Query: 403 RKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND----KKFLPLNKNAVSSLAIIGPL 458
            +    K     + + E    AL +AR+ IVLLKN+    KK LP+  N V S+A+IGPL
Sbjct: 323 YRYLDEKRELQVLFSKELMDHALVSARESIVLLKNEPYNNKKLLPI-ANDVKSIALIGPL 381

Query: 459 VNNISQMGGGY--TGIPCSPKSLLRGL-EAY-VSKTHYASGCHDVPCNSDAGFHEAVRIA 514
            +N   M G +  +G      ++L+GL EA+  +K  Y  G  D   +  +GF EA + A
Sbjct: 382 GDNQIDMLGTWHASGDANKVVTVLQGLKEAFPKAKITYTKGA-DFMGSDKSGFEEATKNA 440

Query: 515 KKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS 574
           + AD VI+  G +  Q  E   R  L LPG Q  LV ++ +T K P++ ++  G PL + 
Sbjct: 441 RAADLVIMAVGENHQQSGEAASRSGLDLPGVQQELVEAIYQTGK-PIVALVMAGRPLTIG 499

Query: 575 FAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP----MNDM 630
           +   D  I +I+   + G    KA+AE++ G +NP G+L +T +P +  ++P    M + 
Sbjct: 500 W--MDENIPAIVNTWHLGTMAGKAIAEVLAGKYNPSGKLTIT-FPRNVGQIPIYYSMKNT 556

Query: 631 NMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
               D+  +Y  + Y   +   +Y FG+GLSYT + Y     P                 
Sbjct: 557 GRPFDADSKYTSK-YLDVSNEPLYPFGYGLSYTTFEY---GEP----------------- 595

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
                 +L  + I E  + T     +++ V N G+ +G  VV L+ R    S   P K+L
Sbjct: 596 ------KLSKIEIKEHENLT-----IEVMVKNTGEYEGQEVVQLYVRDLVGSVTRPVKEL 644

Query: 751 IGFDRVHTVAKGSKEISFGVD 771
            GF+++      SK ++F ++
Sbjct: 645 KGFEKISLKPGESKVVTFTIN 665


>gi|421883572|ref|ZP_16314801.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Senftenberg str. SS209]
 gi|379986798|emb|CCF87074.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Senftenberg str. SS209]
          Length = 765

 Score =  244 bits (624), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 208/749 (27%), Positives = 344/749 (45%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN       Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFNA-----PYSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736


>gi|378954476|ref|YP_005211963.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Gallinarum/pullorum str. RKS5078]
 gi|357205087|gb|AET53133.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Gallinarum/pullorum str. RKS5078]
          Length = 755

 Score =  244 bits (624), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 209/748 (27%), Positives = 349/748 (46%), Gaps = 137/748 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 76  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 119

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 120 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 173

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 174 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 207

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 208 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 262

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 263 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 321

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 322 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 376

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 377 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 434

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+A+ 
Sbjct: 435 GIQNAVDDGAKILYAKGANITNDKDIVDFLNLHEEAVKIDPRSPQAMIDEAVQAAKQANV 494

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 495 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 551

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSS 637
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+  + +N     +
Sbjct: 552 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 610

Query: 638 RQYPGR-SYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
            + P + + R++      +Y FG+GLSYT ++       S++T+S+          +Q+ 
Sbjct: 611 PEKPNKYTSRYFDEANGPLYPFGYGLSYTTFTV------SDVTLSSP--------TMQRD 656

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
           G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF+
Sbjct: 657 G---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFE 699

Query: 755 RVHTVAKGSKEISFGVDPCEQLSIANKH 782
           ++       K +SF +D  E L   N+ 
Sbjct: 700 KITLKPGERKTVSFPID-IEALKFWNQQ 726


>gi|319953334|ref|YP_004164601.1| beta-glucosidase [Cellulophaga algicola DSM 14237]
 gi|319421994|gb|ADV49103.1| Beta-glucosidase [Cellulophaga algicola DSM 14237]
          Length = 756

 Score =  244 bits (624), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 217/771 (28%), Positives = 353/771 (45%), Gaps = 137/771 (17%)

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           EKI+   D A    RLGIP + + S+ +HG                 T+FP  L  ++S+
Sbjct: 83  EKIKTAQDFAVKKTRLGIPLF-FGSDIIHGYK---------------TTFPIPLGLSSSW 126

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
           +  L        A+EA A       G+ + ++P ++I RDPRWGR  E  GEDP + S  
Sbjct: 127 DMELLKRTAQVAALEATA------DGINWNFSPMVDISRDPRWGRISEGAGEDPYLGSQI 180

Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
           A   V  +QGE+  + +                           + A  KH   Y   + 
Sbjct: 181 AKAMVTGYQGEDLMAKN--------------------------TMLATVKHFALYGAAEA 214

Query: 255 GNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDL 309
           G      +N++ ++     + + PP+++ I+ G  S +M S+N ++G+PA     L  DL
Sbjct: 215 GR----DYNSVDMSRLKMYNEYLPPYKAAIDAGVGS-VMSSFNDIDGIPASGNKWLLTDL 269

Query: 310 FQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN-CGTCMLRHTQ 368
               R++W F G++ SD  +V  +  +       +  +A  LKAG+D++  G   L   +
Sbjct: 270 L---RDDWKFNGFVVSDYTSVNEMIAHG--LGDLQAVSALSLKAGLDMDMVGEGFLTTLK 324

Query: 369 SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAA 428
            ++D+GKV  ++I  A   +   + +LGLF+ DP K    K    D+   E++ LA +AA
Sbjct: 325 KSLDEGKVTAEEITTACRRILEAKFKLGLFD-DPYKYIDKKRPAKDILKDENRALAREAA 383

Query: 429 RQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGY--TGIPCSPKSLLRGLE-- 484
           ++  VLLKND K LP+NK+  S +A+IG L N+   M G +  TG P    S+L+G +  
Sbjct: 384 KKSFVLLKNDTKNLPINKS--SKIALIGDLANSKDNMLGTWAPTGDPQLSVSILQGFKNV 441

Query: 485 ---AYVSKTHYASGCHDVPC----------------NSDAGFHEAVRIAKKADFVIVVAG 525
              A ++    A+   D                   +++   +EAV +AKK+D ++ V G
Sbjct: 442 APNAQITHAKGANITDDAALAKKINVFGERVTIDKRSAEEMLNEAVELAKKSDIIVAVVG 501

Query: 526 LDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSI 585
                  E   R  + +P  Q  L+ ++A T K P++LVL  G PL     E  +  +SI
Sbjct: 502 EATEFTGESSSRTDISIPQSQKKLIRALAATGK-PLVLVLMSGRPL--VLEEELALSASI 558

Query: 586 LWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM-----NDMNMRADSSRQY 640
           L + +PG     A+A+++FGD+NP G+L  TW P +  ++P+     N    +  S  + 
Sbjct: 559 LQVWFPGVEAGNAIADVVFGDYNPSGKLTATW-PRNVGQIPIYHSIKNTGRPQLTSEFEK 617

Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDY 700
              +Y     T +  FG+GLSYT + Y      S L ++AS    ++  I          
Sbjct: 618 FTSNYLDAPNTPLLPFGYGLSYTEFEY------SNLNVNASQINQNEPLI---------- 661

Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
                          V +SVTN G+ DG  VV L+ R    S   P KQL GF +V    
Sbjct: 662 ---------------VTVSVTNTGNFDGEEVVQLYLRDVVRSITQPVKQLKGFKKVMLKK 706

Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-----ELRHSLTIE 806
             +K+++  + P + L   N +   +   G+  + VG      L+ S T++
Sbjct: 707 GETKQVTLTLTP-DDLKFYNSNLDFVAEPGDFEIYVGTNSDATLKSSFTLK 756


>gi|416539671|ref|ZP_11750078.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. SARB30]
 gi|416551719|ref|ZP_11756625.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. 29N]
 gi|416566632|ref|ZP_11763924.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. 4441 H]
 gi|417327454|ref|ZP_12112866.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Adelaide str. A4-669]
 gi|418868407|ref|ZP_13422850.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Newport str. CVM 4176]
 gi|353570549|gb|EHC34783.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Adelaide str. A4-669]
 gi|363559751|gb|EHL43903.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. SARB30]
 gi|363566134|gb|EHL50153.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. 29N]
 gi|363579468|gb|EHL63250.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Montevideo str. 4441 H]
 gi|392838313|gb|EJA93877.1| periplasmic beta-D-glucoside glucohydrolase [Salmonella enterica
           subsp. enterica serovar Newport str. CVM 4176]
          Length = 765

 Score =  244 bits (624), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 206/749 (27%), Positives = 343/749 (45%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP +   ++       H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPADTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +                     P +  A   EAV+ AK+AD 
Sbjct: 445 GIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736


>gi|387790798|ref|YP_006255863.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
           3403]
 gi|379653631|gb|AFD06687.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
           3403]
          Length = 730

 Score =  244 bits (624), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 224/774 (28%), Positives = 343/774 (44%), Gaps = 139/774 (17%)

Query: 64  RAKSLISLLTLQEKIQQLSDNAS----AIPRLGIPAYEWWSESLHGI-ASNGPGVNFNGT 118
           + + LI  +TL+EK+  +  N+S     I RLGIP     S+  HG+   +G     +  
Sbjct: 40  KIEQLIEKMTLEEKVGMIHGNSSFTSAGIERLGIPELVT-SDGPHGVRVEHGRDWTVDTN 98

Query: 119 VS-SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRW 177
           V  + T  P     AA++N  L    G+ +  EA       +       P +NI R P  
Sbjct: 99  VDDAATYLPTGNTLAATWNTDLGYQFGAVLGSEANY-----RGKDVILGPGVNIIRSPLC 153

Query: 178 GRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDEL 237
           GR  E   EDP ++S  AV ++K  Q                            D+G   
Sbjct: 154 GRNFEYLSEDPYLISKMAVGYIKGVQ----------------------------DQG--- 182

Query: 238 MLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQ 297
            +SAC KH  A + E      R + +  ++E+   + + P F++ +  G  + +M SYN+
Sbjct: 183 -VSACVKHYAANNEE----VDRNTVDVQMSERALREIYLPAFKAAVVDGGVNTVMGSYNK 237

Query: 298 VNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE-YQNYTKTHEDSAAGVLKAGM 355
             G  A     L +K  + EWGFKG + SD  AV    E  QN T     +  G+L    
Sbjct: 238 FRGQYATHNEYLVKKILKGEWGFKGVLMSDWGAVHNTMEAMQNGTDLEMGTDLGMLP--- 294

Query: 356 DINCGTCMLRHTQSAIDK-GKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
           + N     +  T  A+ K GK+ E+ ID  +  +  V  +  + +G        K  P  
Sbjct: 295 NPNYNKFFMADTVLALVKSGKLSEQLIDEKVRRILWVMFKTNMIDG--------KRQPGS 346

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPC 474
             T EH+K+AL  A +GIVLLKN+   LPL KN + S+A+IG   N  + MGGG + +  
Sbjct: 347 FNTKEHQKVALKVAEEGIVLLKNENGILPLQKNDLKSIAVIGENANRPNSMGGGSSQVKA 406

Query: 475 SPK-SLLRGLEAYVSKT---HYASGCHDVPC-NSDAGF-HEAVRIAKKADFVIVVAGL-- 526
             + +LL+GL+  +  T    YA G        +DA    EAV  A KA+  I+V G   
Sbjct: 407 KYEITLLQGLKNLLGSTVNIQYAQGYKIARGQQADAKLISEAVSAASKAEIAILVVGWTH 466

Query: 527 --------DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEA 578
                   D   + E  D+  + +P  Q  L+ +V + +   V+ VLTGGGP+DV+    
Sbjct: 467 GYDYSVWNDNAYDAEGVDKPDMDMPFGQNELIKAVLKANPHTVV-VLTGGGPIDVTQWIG 525

Query: 579 DSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSR 638
           D++   +L   Y G  G  ALA+I+FG+ NP G+LPMT +P+     P       A    
Sbjct: 526 DAK--GVLEGWYAGMEGGNALAKILFGEVNPSGKLPMT-FPKKLEDSP-------AHKFG 575

Query: 639 QYPGRS------------YRFYTGTQV---YGFGHGLSYTNYSYKFLSAPSELTISASLK 683
            +PG +            YR++   +V   + FGHGLSYT +SY+          +  + 
Sbjct: 576 DFPGVNNVAHYKEDIFVGYRYFDTYKVQPQFAFGHGLSYTTFSYE----------NMKVA 625

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
           AG DK                  T+ T       I++ N G V G+ V  L+ +  K S 
Sbjct: 626 AGDDK------------------TTAT-------ITIKNTGKVGGAEVAQLYVKQVKSSL 660

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
             PEK+L  F ++      SKEISF ++        +K  + ++  G   +++G
Sbjct: 661 KRPEKELKAFQKIFLKPGESKEISFELNDEAFHYFNDKENKWVVEPGKFDILIG 714


>gi|332184951|ref|ZP_08386700.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
 gi|332014675|gb|EGI56731.1| glycosyl hydrolase family 3 N terminal domain protein [Sphingomonas
           sp. S17]
          Length = 886

 Score =  244 bits (624), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 152/423 (35%), Positives = 212/423 (50%), Gaps = 63/423 (14%)

Query: 65  AKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTS 124
           A +L+  +T  EK+ QL + A AIPRLG+PAY WW+ESLHG     P           T+
Sbjct: 34  AATLVGKMTTDEKLYQLLNTAPAIPRLGVPAYNWWTESLHGALGPVP----------TTN 83

Query: 125 FPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---------AGLTFWAPNINIFRDP 175
           FP+ +  AA+F+ +L      A++ E R ++ L +          GL  W+PNINIFRDP
Sbjct: 84  FPEPVGLAATFDDALVGRAAQAISREVRGLHALARQTGRLGRIGTGLDSWSPNINIFRDP 143

Query: 176 RWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGD 235
           RWGRGQET GEDP + +   V FV+  QG +    D                        
Sbjct: 144 RWGRGQETYGEDPFLTARMGVAFVRGMQGTDPNRID------------------------ 179

Query: 236 ELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSY 295
              + A  KH   Y +      +R+  N  +++ D  DT+ P FR+ I +G A  IMC+Y
Sbjct: 180 ---VVATPKH---YAVHSGPESTRHEANVYVSQHDLVDTYLPAFRAAIVEGGAGSIMCAY 233

Query: 296 NQVNGVPACLRGDLFQK--ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKA 353
           N+++G PAC   DL  K   R  WGF+GY+ SDCDAV  I    +Y      + A  ++A
Sbjct: 234 NRIDGQPACA-SDLLLKDYLRGRWGFRGYVVSDCDAVKDIDANHHYAPDPATAVAAAMRA 292

Query: 354 GMDINCGTCMLRHT-------QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGK 406
           G+D  C    L  T       + A+ +G +   DIDR+L  LF+ + R G   G  R   
Sbjct: 293 GVDNECNNATLTDTDGLTNPYRDALARGLISVADIDRSLTRLFAARYRTGDLPGV-RPLS 351

Query: 407 YGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG 466
                P D+ T  H+ LAL+ A + +VLLKN K  LPL   A   +A+IGPL +    + 
Sbjct: 352 MQSSSPADIGTPAHQALALETAEKSLVLLKN-KGILPLKAQA--RIAVIGPLGDATRVLR 408

Query: 467 GGY 469
           G Y
Sbjct: 409 GNY 411



 Score =  128 bits (322), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 99/306 (32%), Positives = 150/306 (49%), Gaps = 56/306 (18%)

Query: 503 SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTS 552
           SD    E    AK +D ++ V GL    E E+           D+ +L LP  Q +L+  
Sbjct: 601 SDDPAAELRTAAKDSDALVAVVGLTSDLEAEEAPISVPGFKGGDKTTLDLPADQQALLEQ 660

Query: 553 VARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGR 612
            A+ + +P+I+VL  G P+++++A+A++      W  YPG+AG  A+  ++ G  NP GR
Sbjct: 661 -AKATGKPLIVVLMNGSPVNLAWAKANADAIVEAW--YPGQAGGLAVGNVLSGKTNPAGR 717

Query: 613 LPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLS 671
           LP+T+Y  S   +P   D  M+        GR+YR++TGT VY FG+GLSYT+++Y    
Sbjct: 718 LPLTFY-RSVEDLPAFGDYAMK--------GRTYRYFTGTPVYPFGYGLSYTSFAY---- 764

Query: 672 APSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHV 731
           AP +LT +              TG +      D +T  T +R        N G   G  V
Sbjct: 765 APVKLTPAV-------------TGVQ------DGLTVTTEVR--------NTGRRAGDEV 797

Query: 732 VMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGN 791
             L+   PKV  G P   L GF RVH      + + F +DP   LS  +  G R++  G+
Sbjct: 798 AQLYLNFPKV-DGAPRVALRGFQRVHLAPGERRMLRFTLDP-RDLSAVDADGHRLVMKGD 855

Query: 792 HVLMVG 797
           + + VG
Sbjct: 856 YRVSVG 861


>gi|423301451|ref|ZP_17279475.1| hypothetical protein HMPREF1057_02616 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472052|gb|EKJ90581.1| hypothetical protein HMPREF1057_02616 [Bacteroides finegoldii
           CL09T03C10]
          Length = 781

 Score =  244 bits (623), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 211/749 (28%), Positives = 338/749 (45%), Gaps = 132/749 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  AA+++  L + +G A+  
Sbjct: 129 RLGIPLF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPQLINEVGKAIGK 176

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R      Q G   + P +++ RDPRW R +ET GEDP++             GE  K+
Sbjct: 177 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLT------------GEIGKA 219

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
              G+G G              D        A  KH +AY + + G     SF  M   +
Sbjct: 220 MVAGLGSG--------------DLSRPYSTLATLKHFLAYGISESGQNGNPSFAGM---R 262

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           +  + F PPF   I  G A  +M SYN ++G P      L  +  R++W FKG + SD  
Sbjct: 263 ELHENFLPPFGQAINAG-ALSVMTSYNSMDGTPCTANHYLLTELLRDDWKFKGVVVSDLY 321

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  I +      T +++A   L AG+DI+ G     +   A+++ ++ ++ +D A+  +
Sbjct: 322 SIEGIHQSHFVASTMKEAAVMALSAGVDIDLGGDAYMNLMDAVNRKEISKEILDAAVSRV 381

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF     DP K K       +V + E+  LA   A+  I LLKN+   LPL+
Sbjct: 382 LRLKFEMGLFENPYVDPGKAK------KEVRSKEYVALARQVAQASITLLKNEHSLLPLD 435

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSP--KSLLRGLEAYV--SKTHYASGCHDVPC 501
           ++    +A+IGP  +N   M G YT        K++L G+ A +  S+  Y  GC  +  
Sbjct: 436 RSM--KVALIGPNADNRYNMLGDYTAPQEEENVKTVLDGIRAKLSSSQVEYVKGC-SIRD 492

Query: 502 NSDAGFHEAVRIAKKADFVIVVAG----LDLT---QET----------------EDRDRV 538
              +   +AV  A++++ VI V G     D     +ET                E  DR 
Sbjct: 493 TVTSDIEQAVAAARRSEVVIAVVGGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRA 552

Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
           +L L G+Q  L+ ++  T K P+I+V   G PLD ++A  ++   ++L   YPG+ G  A
Sbjct: 553 TLSLLGKQQELLKALKATGK-PLIVVYIEGRPLDKNWASENA--DALLTAYYPGQEGGNA 609

Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
           +A+++FG+FNP GRLP +  P S  +VP+   N +A  S  Y   S      + +Y FG+
Sbjct: 610 IADVLFGEFNPAGRLPFS-VPRSVGQVPVY-YNKKAPQSHDYVEVS-----ASPLYSFGY 662

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT + Y      S+L +SA                             T   F V  
Sbjct: 663 GLSYTTFEY------SDLHLSA----------------------------LTPHSFEVSC 688

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            + N G  DG  VV L+ R    S   P KQL  F R+       +++ F +   E  ++
Sbjct: 689 KIRNTGKYDGEEVVQLYLRDEYASVVQPLKQLKHFARLFLKCGEEQKVKFILSE-EDFAL 747

Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
            +++ +R++  G   +M+G     + ++T
Sbjct: 748 VDRNLKRVVEPGTFQVMIGAASDDIRLQT 776


>gi|189464310|ref|ZP_03013095.1| hypothetical protein BACINT_00651 [Bacteroides intestinalis DSM
           17393]
 gi|189438100|gb|EDV07085.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 864

 Score =  244 bits (623), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 156/440 (35%), Positives = 228/440 (51%), Gaps = 57/440 (12%)

Query: 61  ISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVS 120
           +  R   L+S LT++EKI  L   +  IPRL IP Y   +E+LHG+   G          
Sbjct: 35  MHERIMDLLSRLTVEEKISLLRATSPGIPRLDIPKYYHGNEALHGVVRPG---------- 84

Query: 121 SVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNIN 170
             T FPQ +  AA++N  L   + + ++ EARA +N    G          LTFW+P +N
Sbjct: 85  RFTVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVN 144

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           + RDPRWGR  ET GEDP +       FVK  QG+    DD          R LK     
Sbjct: 145 MARDPRWGRTPETYGEDPYLSGVMGTAFVKGLQGD----DD----------RYLK----- 185

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                   + +  KH  A + E     +R+  N  I+E+   + + P F +C++ GK++ 
Sbjct: 186 --------IVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSAS 233

Query: 291 IMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           IM +YN +N VP  L   L  K  R +WGFKGY+ SDC   + +     Y KT E +A  
Sbjct: 234 IMSAYNALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPSLLVNAHKYVKTKEAAATL 293

Query: 350 VLKAGMDINCGTCMLRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYG 408
            +KAG+D+ CG  +      SA  +  V   DID A   +   +++LGLF+    K  Y 
Sbjct: 294 SIKAGLDLECGDDVFDEPLLSAYRQYMVTNADIDSAAYRVLRARMQLGLFDSG-EKNPYT 352

Query: 409 KLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGG 468
           K+ P  V +++H+++AL+AAR+ IVLLKN KK LPLN   V S+A++G  +N  +   G 
Sbjct: 353 KISPAVVGSAKHQEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVG--INAGNCEFGD 410

Query: 469 YTGIPC-SPKSLLRGLEAYV 487
           Y+G P  +P S+L+G++  V
Sbjct: 411 YSGSPVIAPISVLQGIKDRV 430



 Score =  132 bits (333), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 80/291 (27%), Positives = 143/291 (49%), Gaps = 45/291 (15%)

Query: 507 FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           + EA +  ++ + V+ V G++ + E E +DR  + LP  QM  +  + + +   +++VL 
Sbjct: 592 YGEAGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQMEFLQEIYKVNPN-IVVVLV 650

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
            G  L V++   D  + +I+   YPGE+G KA+AE++FGD+NPGGRLP+T+Y       P
Sbjct: 651 AGSSLAVNWM--DEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRSLDELPP 708

Query: 627 MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGS 686
            +D ++         GR+Y+++ G  +Y FG+GLSYT + Y  L             A  
Sbjct: 709 FDDYDITK-------GRTYKYFKGDVLYPFGYGLSYTTFKYSNLQV-----------ADG 750

Query: 687 DKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTP 746
           ++ I                        +V   + NAG   G  V  ++ ++P+  +  P
Sbjct: 751 EEEI------------------------NVSFQLKNAGKYAGDEVAQVYVKLPERDEVMP 786

Query: 747 EKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            K+L GF+RV   +  +K+++  +            G+ + P G++ +MVG
Sbjct: 787 VKELKGFERVALKSGENKKMTLKLRKDLLRYWDEAKGKFVYPSGDYTIMVG 837


>gi|384914665|ref|ZP_10015417.1| Periplasmic beta-glucosidase [Methylacidiphilum fumariolicum SolV]
 gi|384527282|emb|CCG91285.1| Periplasmic beta-glucosidase [Methylacidiphilum fumariolicum SolV]
          Length = 717

 Score =  244 bits (623), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 178/646 (27%), Positives = 314/646 (48%), Gaps = 82/646 (12%)

Query: 162 LTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREK 221
           LTF AP +++ RDPRWGR  E+PGEDP + S  +  ++K FQG+N  S            
Sbjct: 129 LTF-APMLDVCRDPRWGRVAESPGEDPFLASLLSSAWIKGFQGKNLSSSS---------- 177

Query: 222 RVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFR 280
                           +L+AC KH + Y   + G      +N++ I+ +  ++ + PPF 
Sbjct: 178 ----------------LLAACPKHFVGYGAVEGGR----DYNSVTISHRLLQEIYLPPFL 217

Query: 281 SCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNY 339
              ++G A   M ++  +NG+P      L +   R + G++G + SD +AV  + ++   
Sbjct: 218 QAFKEG-AFATMAAFVAINGLPPAANPQLLKGLLRKDIGWEGPVISDWNAVRELVQH-GI 275

Query: 340 TKTHEDSAAGVLKAGMDINCGTCML-RHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           T+  + + A  + +G+DI+  + +  ++ +  + +GKV+E+ ID A L + +++ +LGLF
Sbjct: 276 TENEKTAVALAINSGIDIDMVSGLYSQYLEELLKEGKVKEETIDEAALRVLTLKKKLGLF 335

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
             +P   K  +     +  + +K++AL+AA++ +VLLKN+   LP+   ++ S+A+IGP 
Sbjct: 336 TQEPHPLKIDQNQWQTILKT-NKQIALEAAKKSVVLLKNEDGLLPIAP-SIRSIALIGPF 393

Query: 459 VNNISQMGGGYTGI-PCSPK-SLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRI 513
                +  G ++ I  C+   +L  GL+ Y    +K H   GC+    +    + +A++I
Sbjct: 394 SEERKEHLGPWSAIGDCNDVITLAEGLKRYAPRGTKIHVVEGCNKF-SHGTVDYQQALKI 452

Query: 514 AKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDV 573
           A+ +  ++   G       E   R  L LP QQ  L+  +  T   PV+L++  G PLD+
Sbjct: 453 AQSSQLIVAAVGEKANMSGEAASRAFLDLPTQQQKLIEVLLETGV-PVVLIVFSGRPLDL 511

Query: 574 SFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR 633
           S     S+  +IL   + G    +AL +I+FG +NP G+LP+T +P S  ++P+    + 
Sbjct: 512 S--RIISKTKAILQAWFLGTETGEALGQILFGLYNPSGKLPIT-FPRSVGQIPIYYAQLP 568

Query: 634 A--DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
               ++R +P   Y   + T ++ FG+GLSYT + Y F +                    
Sbjct: 569 TGRPANRSFPSCRYIDQSNTPLFPFGYGLSYTTFHYSFPTL------------------- 609

Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
               SR++Y    E+ +           + N G V G  +V L+ R P  S   P K+L 
Sbjct: 610 --NRSRINYHETLEILT----------EIENTGTVPGEEIVQLYIRDPVASISRPLKELK 657

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
           G  RV       ++I F +   + L+  +  G+R +  G   L VG
Sbjct: 658 GIRRVFLKPGERQKIKFSLSSVD-LAFIDYEGKRKIEPGRFFLWVG 702


>gi|150009653|ref|YP_001304396.1| beta-glucosidase [Parabacteroides distasonis ATCC 8503]
 gi|149938077|gb|ABR44774.1| glycoside hydrolase family 3, candidate beta-glucosidase
           [Parabacteroides distasonis ATCC 8503]
          Length = 758

 Score =  244 bits (623), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 193/632 (30%), Positives = 299/632 (47%), Gaps = 100/632 (15%)

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN-WKSDDGGIGFGFREKRV 223
           +AP ++I RD RWGR  E  GEDP + S  A   V+ FQG N W+S              
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKVRVEGFQGGNDWRS-------------- 207

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWG---NFSRYSFNAMITEQDTEDTFQPPFR 280
                      D   + ACCKH  AY   + G   N S  S N ++      + + PP+ 
Sbjct: 208 ---------LADVNTVLACCKHFAAYGAAEAGRDYNTSELSQNTLM------NYYMPPYL 252

Query: 281 SCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEY 336
           +  E G A+  M S+N++NGVP+     L  DL +K   +WGF G++ +D      I E 
Sbjct: 253 AAKEAGVAT-FMASFNEINGVPSTGNKWLMTDLLRK---DWGFNGFVVTDYTG---INEM 305

Query: 337 QNYTKTHEDSAAGVL--KAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
             ++    D  AG L   AG+D++  G    +H   ++ +GKV E++IDRA+ ++  ++ 
Sbjct: 306 VAHSIVRNDKEAGELAANAGIDMDMTGGIYSQHLVQSVKEGKVSEENIDRAVASILEMKF 365

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
            LGLF+ DP +    +   + +   E  + A + + + IVLLKND  F P++K+   ++A
Sbjct: 366 LLGLFD-DPYRYLDNEREKNTIMKPEFLQEARETSARSIVLLKNDNNFFPISKDKNITVA 424

Query: 454 IIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVSKTH----YASGCHDVPCNSDAGF 507
           +IGP+V +     G + G     +  SL  GL    + T+    YA GC D+  +  + F
Sbjct: 425 LIGPMVKDKINQNGEWAGRGEREESISLFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKF 483

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
            EA+  A++AD V+   G D     E   R  L LPG Q +L+  + +T K P+ L+L  
Sbjct: 484 AEAIATARRADIVLAAMGEDFNWSGEAACRTDLKLPGAQQALLKELKKTGK-PLGLILVN 542

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PLD+S+   D  +  IL   Y G      +A++I GD+NP  RL M+ +P +  ++P+
Sbjct: 543 GRPLDLSW--EDQHVDGILEAWYLGTMAGHGMADVISGDYNPSARLTMS-FPRTVGQLPL 599

Query: 628 NDMNMRADSSRQYPGRS--------YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
                +  + R  P  +        Y     T +Y FG+GLSYT ++             
Sbjct: 600 --YYNQKPTGRPVPPEAPDTDYKSRYMDVPNTPLYPFGYGLSYTTFAVN----------- 646

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            S+K   D+N   + G                 +  V   V N G VDG  VV ++ R  
Sbjct: 647 -SMKL--DQNSFTKGG-----------------KITVTAEVENTGKVDGETVVQMYIRDL 686

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
             S   P K+L GF++V   A   K++SF +D
Sbjct: 687 AGSVTRPVKELKGFEKVALKAGEKKQVSFTID 718


>gi|205353304|ref|YP_002227105.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Gallinarum str. 287/91]
 gi|375124143|ref|ZP_09769307.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Gallinarum str. SG9]
 gi|445128447|ref|ZP_21380240.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 9184]
 gi|205273085|emb|CAR38038.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 287/91]
 gi|326628393|gb|EGE34736.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Gallinarum str. SG9]
 gi|444854909|gb|ELX79964.1| beta-D-glucoside glucohydrolase [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 9184]
          Length = 765

 Score =  244 bits (622), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 209/749 (27%), Positives = 346/749 (46%), Gaps = 139/749 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++    
Sbjct: 273 VLRDEWGFKGITVSDHGAIKELIKHGTAADP-EDAVRVALKAGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTVLA 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  V   +K  YA G +     D+              P +  A   EAV+ AK+A+ 
Sbjct: 445 GIQNAVDDGAKILYAKGANITNDKDIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQANV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R ++ +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTNITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA----- 634
            Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++       
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 635 -DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
            +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +Q+
Sbjct: 621 PEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TMQR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGF 708

Query: 754 DRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           +++       K +SF +D  E L   N+ 
Sbjct: 709 EKITLKPGERKTVSFPID-IEALKFWNQQ 736


>gi|429756169|ref|ZP_19288778.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           324 str. F0483]
 gi|429171889|gb|EKY13478.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           324 str. F0483]
          Length = 755

 Score =  244 bits (622), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 214/749 (28%), Positives = 336/749 (44%), Gaps = 125/749 (16%)

Query: 78  IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNR 137
           I++L   A    RLGIP   +  + +HG                 T FP  L  + S++ 
Sbjct: 87  IRKLQKIAVEQTRLGIPIL-FGQDVIHGYK---------------TIFPIPLAESCSWDL 130

Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAV 196
           +L        A EA A       G+ + +AP ++I RD RWGR  E  GEDP + S  A 
Sbjct: 131 ALMRKTAELAAREATA------DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAE 184

Query: 197 EFVKSFQG-ENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
             VK FQG +NW++                             L AC KH   Y   + G
Sbjct: 185 ARVKGFQGGDNWQT-----------------------LSSPHTLLACGKHFAGYGAAESG 221

Query: 256 NFSRYSFN-AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKA 313
                 +N A ++     + + PP+ + +  G  S IM S N++NGVPA   +  L ++ 
Sbjct: 222 K----DYNTAELSMHTLRNVYLPPYEATLNAGVGS-IMASLNEINGVPATADKWLLTEEL 276

Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG----MDINCGTCMLRHTQS 369
           R EWGF G + SD      I E   +    +D  A  L A     MD+N G   +++  +
Sbjct: 277 RKEWGFNGLLVSDYTG---INELVRHGVAKDDKQAANLSANAGIEMDMN-GATFIKYLSA 332

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
            + +GK  E  ID+A+ ++  ++  LGLF+ DP +        ++  T E+ K+A  A  
Sbjct: 333 LVKEGKATEAQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFTEEYLKVARQAVA 391

Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGLEAYV 487
             +VLLKN+ + LP+ KN+  ++A+IGP++NN S + G +T  G      SLL GL    
Sbjct: 392 SSVVLLKNEAEVLPIKKNSGKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLLSGLTQKY 451

Query: 488 SKTH----YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
             T+    YA GC     +++    EAV IA+KAD V+V  G   +   E   R  + LP
Sbjct: 452 KGTNVKLLYAEGCGFTTISTEQ-LKEAVAIARKADRVLVAVGEQSSWAGESAVRTDIRLP 510

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
             Q  L+ ++   +K P+ ++   G PLD+S+   +  + +IL   +PG  G   +A++I
Sbjct: 511 QAQRQLLEALKAINK-PITIITFSGRPLDLSW--ENENVQAILQAWFPGTQGGNGIADVI 567

Query: 604 FGDFNPGGRLPMTWYPESFTKVPM---------------NDMNMRADSSRQYPGRSYRFY 648
            GD NP G L M+ +P S  ++P+                ++++R   +  Y   S    
Sbjct: 568 AGDVNPSGHLTMS-FPRSVGQIPIYYNYKNTGRPVYTNNEEVDLRPHYNAGYLDSSI--- 623

Query: 649 TGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTS 708
             T +Y FG+GLSYT ++   +    +     S+K  +D  I                  
Sbjct: 624 --TPLYPFGYGLSYTTFAISNVHLNKK-----SMKRYNDSII------------------ 658

Query: 709 CTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
                  V  SV N G  +G  V+ L+ R    S   P K+L GF ++   A  SK++ F
Sbjct: 659 -------VNASVQNTGTTEGEIVLQLYTRQLVASVSRPVKELKGFQKISLKAGESKQVRF 711

Query: 769 GVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            + P E L+    +G++       +L VG
Sbjct: 712 EL-PSEALAFYGINGKKDTEPSECLLWVG 739


>gi|445499551|ref|ZP_21466406.1| periplasmic beta-glucosidase BglX [Janthinobacterium sp. HH01]
 gi|444789546|gb|ELX11094.1| periplasmic beta-glucosidase BglX [Janthinobacterium sp. HH01]
          Length = 743

 Score =  244 bits (622), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 196/656 (29%), Positives = 304/656 (46%), Gaps = 87/656 (13%)

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           +AP ++I RDPRWGR  E  GED  + SA A   V  FQG+   + D             
Sbjct: 152 FAPMVDIARDPRWGRVMEGAGEDTFLGSAIARARVHGFQGKQLGATDS------------ 199

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
                         + A  KH  AY     G      +NA+ ++ Q   + + PPF++ +
Sbjct: 200 --------------VMATAKHFAAYGAAVAGR----DYNAVDMSNQQLFEVYLPPFKAAL 241

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQK---ARNEWGFKGYITSDCDAVATIFEYQNYT 340
           + G A+  M S+N +NG+PA   G++F +    +  W FKG++ SD  +V  +  +  Y 
Sbjct: 242 DAG-AATFMNSFNTLNGIPAT--GNVFLQRDILKGSWNFKGFVVSDWGSVREMVPH-GYA 297

Query: 341 KTHEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
               D++   + AG D++  +   ++H + A+  GKV+ K +D A+  +   +  LGLF+
Sbjct: 298 VDLADASVKAINAGSDMDMESYAHIKHLEDAVKSGKVKMKTLDDAVYRILYKKFELGLFD 357

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            DP +          +    H+  ALD A++ +VLLKN    LPL++ A   +A+IGPL 
Sbjct: 358 -DPYRYSVAAREKAVMADPSHRAAALDVAQKSLVLLKNAAGTLPLSR-AAQKIAVIGPLA 415

Query: 460 NNISQMGGGYT--GIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAGFHEAVRIAK 515
           +    + GG+   G      S+L G+ ++  K    YA  C D  C    GF +AV  A 
Sbjct: 416 DAKRDLEGGWVVEGDRAPVVSILEGIRSHAGKAEVSYAQAC-DNRCTGTEGFADAVAAAT 474

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           +AD V++V G       E + R  + LPG Q  L  ++  T K PV++V+  G PL   F
Sbjct: 475 RADVVVLVVGETWDLSGEAKSRTDITLPGHQEQLFAALKATGK-PVVVVMLAGRPL--VF 531

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMR-- 633
                Q  +I++  +PG  G  A+A ++FGD+NP G+LP+T +P S  ++P++       
Sbjct: 532 NTIADQADAIVYAWFPGSEGGNAVANVLFGDYNPSGKLPIT-FPRSVGQIPLSYAQYSTG 590

Query: 634 ---ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNI 690
               D        +Y     T  Y FGHGLSYT++ Y      S LTIS+   A S    
Sbjct: 591 RPVTDEKNVVYKSAYIDSVNTPRYAFGHGLSYTDFKY------SGLTISSKEMAPS---- 640

Query: 691 LQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQL 750
            QQ     D                    + N G  +G+ +V L+ R    S   P K+L
Sbjct: 641 -QQVTLSFD--------------------LANTGKAEGTEIVQLYLRDLVASVVRPVKEL 679

Query: 751 IGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            GF ++H +    + +SF +D  E LS  N         G+  LMVG     + +E
Sbjct: 680 KGFQKIHLLPGEQRRVSFTID-RELLSFFNSQLVWGAEAGDFKLMVGSASDDIRLE 734


>gi|317476310|ref|ZP_07935559.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316907336|gb|EFV29041.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 862

 Score =  244 bits (622), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 167/470 (35%), Positives = 239/470 (50%), Gaps = 66/470 (14%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C  P + +     TSL+   RA+ L+  LTL+EK+  + DN+ A+ RLGI  Y WW+E+L
Sbjct: 17  CSQPTYKT-----TSLAPEKRAELLLKELTLEEKVALMMDNSQAVERLGIKPYNWWNEAL 71

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--- 160
           HG+A  G            T FPQ +  AASF+      + SAV+ EARA  N   A   
Sbjct: 72  HGVARAG----------LATVFPQPIGMAASFSPESVYEVFSAVSDEARAK-NANYASQG 120

Query: 161 ------GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
                 GLT W P +NI+RDPRWGRG ET GEDP + S   V  VK  QG N    DG  
Sbjct: 121 SYARYQGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTN----DG-- 174

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTED 273
                          + D+     L AC KH   +   +W   +R+SFN   ++ +D  +
Sbjct: 175 ---------------KYDK-----LHACAKHFAVHSGPEW---NRHSFNVEDLSARDLYE 211

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVAT 332
           T+ PPF + +++ K   +MC+YN   G P C    L  Q  RN+WGF G I SDC A+A 
Sbjct: 212 TYLPPFEALVKEAKVKEVMCAYNSFEGEPCCGSNRLLMQILRNDWGFDGIILSDCGAIAD 271

Query: 333 IFE---YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
            +    ++ Y      SAA VL  G D+ CG+   +    A+  G++ EKDID+A++ L 
Sbjct: 272 FYNEHGHKAYPDAESASAAAVLN-GTDLECGSS-YKALVKAVRAGQINEKDIDKAVMRLL 329

Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
             +  LG  + DP    + K+    V ++EH  LALD AR+ + LL+N    LPL +  +
Sbjct: 330 EARFALGEMD-DPDNVSWTKIPFSVVASAEHDSLALDMARKSMTLLQNTGNILPLKRGGL 388

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
            ++A++GP  N+     G Y G+P    ++L G+   +    K  Y  GC
Sbjct: 389 -TVAVMGPNANDSVMQWGNYNGMPPHTVTILGGIRKALGADDKLIYEQGC 437



 Score =  119 bits (299), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 96/318 (30%), Positives = 146/318 (45%), Gaps = 63/318 (19%)

Query: 498 DVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR----------DRVSLL 541
           D   N D GF + V I       K AD V+  +G+    E E+           DR  + 
Sbjct: 574 DAQLNFDIGFKQKVDINASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIE 633

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
           LP  Q  L++++ R  K+ +ILV   G P  ++         +IL   YPG+ G  A+AE
Sbjct: 634 LPAIQRELISALHRAGKK-IILVNCSGSP--IALEPETKNCEAILQAWYPGQQGGTAVAE 690

Query: 602 IIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
           ++FGD+NPGGRLP+T+Y  + +++P   D NM         GR+YR+ T   ++ FG+GL
Sbjct: 691 VLFGDYNPGGRLPVTFY-RNMSQLPDFEDYNMT--------GRTYRYMTQQPLFPFGYGL 741

Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           SYT + Y  LS   E      +K G                              + +SV
Sbjct: 742 SYTTFDYGKLSLGKE-----QIKQGEP--------------------------LKLAVSV 770

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSIA 779
           TN+G  DG  VV ++ +    ++G P K L  F RV+  A  S ++ F +   E +   A
Sbjct: 771 TNSGQRDGEEVVQVYLKKQDDAEG-PGKTLRAFKRVYIPAGKSVDVEFDLKDKELEWWDA 829

Query: 780 NKHGRRILPLGNHVLMVG 797
             +  R+   G + +MVG
Sbjct: 830 QSNTMRVCA-GKYDIMVG 846


>gi|46127231|ref|XP_388169.1| hypothetical protein FG07993.1 [Gibberella zeae PH-1]
          Length = 712

 Score =  244 bits (622), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 176/534 (32%), Positives = 268/534 (50%), Gaps = 54/534 (10%)

Query: 282 CIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQ 337
           C    K   IMCSYN VNG+PAC    L+  L +K  N      +ITSDC A+  I+++ 
Sbjct: 189 CTRDAKVGSIMCSYNAVNGIPACANSYLQETLLRKHWNWTHTNNWITSDCGAMQDIWQHH 248

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGL 397
           NYTKT  ++A    + G D +C     +    + ++G + EK +DRAL  LF   +  G 
Sbjct: 249 NYTKTGAEAAKAAFENGQDSSCEYTTTKDISDSYEQGLLTEKVMDRALKRLFEGLVHTGF 308

Query: 398 FNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGP 457
           F+GD  K ++  L  DDV T   + LAL +A +G VLLKND   LPLN     S+A+IG 
Sbjct: 309 FDGD--KSEWSSLDFDDVNTRHAQDLALQSAVRGAVLLKNDNT-LPLNIKKKESVALIGF 365

Query: 458 LVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK-----THYASG----CHDVPCNSDAGFH 508
             ++ +++ GGY+G    P   +R   AY +K     T+ A G       VP N      
Sbjct: 366 WADDKTKLQGGYSG----PAPHVR-TPAYAAKMLGLNTNVAWGPTLQNSSVPDNWTTNAL 420

Query: 509 EAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGG 568
           EA   AKK+D+++ + GLD T   E+RDR  L  P  Q++L+  ++   K  V++ L  G
Sbjct: 421 EA---AKKSDYIVYLGGLDATAAGEERDRTDLDWPSTQLTLLKKLSNLGKPLVVVQL--G 475

Query: 569 GPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT-KVPM 627
             +D +    +  ++SILW+ YPG+ G  A+ E+I G   P GRLP+T YP  +T +V M
Sbjct: 476 DQVDDTPLLKNKGVNSILWVNYPGQEGGTAVMELITGRKGPAGRLPLTQYPSKYTEQVGM 535

Query: 628 NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSD 687
            +M +R   ++  PGR+YR+Y+ + V  FG G  YT +   F S   E+ I   LK    
Sbjct: 536 LEMELRP--TKSSPGRTYRWYSDS-VLPFGFGKHYTTFKAMFKSQKIEMNIQKILKG--- 589

Query: 688 KNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPE 747
                      D  ++D   +C     H  +SV N G      V ++F +     +  P 
Sbjct: 590 ----------CDATYVD---TCPLPPIH--LSVKNTGRTTSDFVSLVFIQGKVGPKPYPL 634

Query: 748 KQLIGFDRVHTV---AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVGE 798
           K L  + R H +   A    E+ + +D    ++   K+G  ++  G + L++ E
Sbjct: 635 KTLAAYSRSHDIKPRATKDVELQWTMD---NIARREKNGDLVVYPGTYTLLLDE 685



 Score =  136 bits (342), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 97/160 (60%), Gaps = 6/160 (3%)

Query: 39  KPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEW 98
           +PD  C      S   C+T+ S + RA +L+S LT +EK+  L  NA+  PR+G+P Y W
Sbjct: 28  QPD--CTKEPLRSNGICDTTASPAERAAALVSALTPREKVNNLVSNATGAPRIGLPRYNW 85

Query: 99  WSESLHGIASNGPGVNFNGT--VSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARA-MY 155
           W+E+LHG+A   PG ++N      S TSFP  L+  ++F+  L  +IG  +  EARA   
Sbjct: 86  WNEALHGVAG-APGNDYNDKPPYDSATSFPMPLLMGSTFDDDLIHDIGEVIGTEARAWNN 144

Query: 156 NLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYA 195
                G+ +W PN+N F+DPRWGRG ETPGED + VS YA
Sbjct: 145 GGWGGGVDYWTPNVNPFKDPRWGRGSETPGEDALHVSRYA 184


>gi|347536214|ref|YP_004843639.1| glycoside hydrolase family protein [Flavobacterium branchiophilum
           FL-15]
 gi|345529372|emb|CCB69402.1| Glycoside hydrolase precursor, family 3 [Flavobacterium
           branchiophilum FL-15]
          Length = 740

 Score =  244 bits (622), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 205/710 (28%), Positives = 338/710 (47%), Gaps = 109/710 (15%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
           T+FP  L  AAS++           A EA        +G+ + +AP ++I RDPRWGR  
Sbjct: 111 TTFPIPLAEAASWDVEAIEKSARVAATEA------ASSGIHWTFAPMVDISRDPRWGRVM 164

Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
           E  GED  + S  A   VK FQ                           ++ GD   + A
Sbjct: 165 EGAGEDTYLGSKIAFARVKGFQ---------------------------ANLGDVHSVMA 197

Query: 242 CCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNG 300
           C KH  AY     G      +N++ I+E+   +T+ PPF++ ++ G A+  M ++N +NG
Sbjct: 198 CVKHFAAYGAAVGGR----DYNSVDISERMLWETYLPPFKAALDAG-AATFMNAFNDING 252

Query: 301 VPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMD 356
           +PA     ++ D+    + +W F+G++ SD  ++  +  +  Y K ++ +A   L AG D
Sbjct: 253 IPATANKHIQRDIL---KGKWQFQGFVVSDWGSIGEMVAH-GYAKDYKQAAEKALLAGSD 308

Query: 357 INC-GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
           ++   +  + H  + + + KV    ID A+  +   ++ LGLF  DP K    +     +
Sbjct: 309 MDMESSAYIGHLATLVKENKVPIALIDDAVRRILRKKMELGLFE-DPFKFCNPERQNKAL 367

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
              EH K+A + A + IVLLKNDK+ LPL+K+ + ++A IGP+V +  +   G+  +   
Sbjct: 368 NNPEHTKIAREVAAKSIVLLKNDKQVLPLSKD-LKTIAFIGPMVQS-KRDNHGFWAVDLK 425

Query: 476 PK------SLLRGLEAYV---SKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGL 526
                   S   GL+  V   +K  YA GC DV   + +GF EA+ +A +AD V+V  G 
Sbjct: 426 DVDSTYIVSQWEGLQRKVGKNTKLLYAKGC-DVLSTNKSGFEEAIAVAHQADVVVVSVGE 484

Query: 527 DLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSIL 586
                 E + R SL LPG Q  L+  + +T K P+++++  G PL  ++  AD+ + +IL
Sbjct: 485 KHNMSGEAKSRSSLQLPGVQEDLIMELQKTGK-PIVVLINAGRPLIFNWT-ADN-MPTIL 541

Query: 587 WIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNM----RADSSRQY 640
           +  + G     A+A+++FGD+NP  +LP+T +P S  +VP+  N  +     ++D  + Y
Sbjct: 542 YTWWLGSEAGNAIADVLFGDYNPSAKLPIT-FPRSEGQVPIYYNHFSTGRPAKSDDDKIY 600

Query: 641 PGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDY 700
              +Y     +  + FG+GLSYT + Y      S+L +S      +D             
Sbjct: 601 KS-AYIDLQNSPKFPFGYGLSYTTFEY------SDLKLSTQKITTND------------- 640

Query: 701 VHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVA 760
                       R  VQ ++ N G   G+ +V L+ +    S   P  +L  F ++   A
Sbjct: 641 ------------RIMVQATIKNTGKYAGTEIVQLYIKDQFGSVVRPVLELKDFQKITLEA 688

Query: 761 KGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG----ELRHSLTIE 806
             SK ISF +D  E+LS  N   + +   G   +M+G    +LR   T E
Sbjct: 689 GASKTISFVIDK-EKLSFYNADLQYVAEPGTFEIMIGASAADLRLKSTFE 737


>gi|427411073|ref|ZP_18901275.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425710258|gb|EKU73280.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 791

 Score =  243 bits (621), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 226/737 (30%), Positives = 334/737 (45%), Gaps = 126/737 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  E LHG A+ G            TSFPQ +  A+S++ ++   +   +A 
Sbjct: 138 RLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPAMLRQVNQVIAR 185

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E RA     +      +P ++I RDPRWGR +ET GEDP +V    V  V+  QG     
Sbjct: 186 EIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQG----- 235

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
               +G      R L+             + A  KHL  +   + G N       A ++E
Sbjct: 236 ----VG----RSRTLQ----------PNHVFATLKHLTGHGQPESGTNIG----PAPVSE 273

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-CLRGDLFQKARNEWGFKGYITSDC 327
           ++  + F PPF   +++     +M SYN+++GVP+   R  L    R EWGF+G + SD 
Sbjct: 274 RELRENFFPPFEQVVKRTGIEAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDY 333

Query: 328 DAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSA-IDKGKVQEKDIDRALL 386
            AV  +    +     E++A   L AG+D +    +   T    + +GKV E  +D A+ 
Sbjct: 334 SAVDQLMSIHHIAANLEEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVR 393

Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTS--EHKKLALDAARQGIVLLKNDKKFLPL 444
            +  ++ R GLF     +  Y         T+  + + LA  AA++ I LLKND   LPL
Sbjct: 394 RMLELKFRAGLF-----ENPYADANAAAAITNNEDARALARTAAQRSITLLKNDG-MLPL 447

Query: 445 NKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCH---- 497
                 ++A+IGP    ++++GG Y G P    S+L G++A V   +   +A G      
Sbjct: 448 KPEG--TIAVIGPSAA-VARLGG-YYGQPPHSVSILEGIKARVGTKANIVFAQGVKITED 503

Query: 498 -----DVPCNSDAG-----FHEAVRIAKKADFVIVVAGLDLTQETEDR-------DRVSL 540
                D    SD         +AV  A+  D +I+  G D  Q + +        DR SL
Sbjct: 504 DDWWADSVTKSDPAENRKLIAQAVEAARNVDRIILTLG-DTEQSSREGWADNHLGDRPSL 562

Query: 541 LLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALA 600
            L G+Q  L  ++    K P+ +VL  G P   S  +   Q ++IL   Y GE G  A+A
Sbjct: 563 DLVGEQQELFDALKALGK-PITVVLINGRP--ASTVKVSEQANAILEGWYLGEQGGNAVA 619

Query: 601 EIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
           +I+FGD NPGG+LP+T  P S  ++P+   NM+  +      R Y F T   +Y FG GL
Sbjct: 620 DILFGDVNPGGKLPVT-VPRSAGQLPLF-YNMKPSAR-----RGYLFDTTDPLYPFGFGL 672

Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           SYT++S   LSAP                       RL    I      T  +  V + V
Sbjct: 673 SYTSFS---LSAP-----------------------RLSATRI-----GTGGKTSVSVDV 701

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
            N G  +G  VV L+ R    S   P K+L GF RV      S+ I+F V P E L + N
Sbjct: 702 RNTGAREGDEVVQLYIRDKVSSVTRPVKELKGFQRVTLKPGESRTITFTVGP-EALQMWN 760

Query: 781 KHGRRILPLGNHVLMVG 797
              RR++  G+  +M G
Sbjct: 761 DQMRRVVEPGDFEIMTG 777


>gi|260062042|ref|YP_003195122.1| beta-glucosidase [Robiginitalea biformata HTCC2501]
 gi|88783604|gb|EAR14775.1| beta-glucosidase [Robiginitalea biformata HTCC2501]
          Length = 763

 Score =  243 bits (621), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 189/633 (29%), Positives = 301/633 (47%), Gaps = 104/633 (16%)

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           +AP ++I RDPRWGR  E  GEDP + S   V  V+ FQG+                   
Sbjct: 163 FAPMVDISRDPRWGRVMEGAGEDPYLGSRVGVARVRGFQGD------------------- 203

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWG---NFSRYSFNAMITEQDTEDTFQPPFRS 281
                  D  D L ++AC KH   Y   + G   N + +  + +       +   PPF++
Sbjct: 204 -------DLSDPLTIAACLKHFAGYGFAEGGRDYNTADFGLSTLY------NVVLPPFQA 250

Query: 282 CIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQ 337
            ++ G A+ +M S+N +NG+PA     L+ D+ + A   W F+G++ SD  ++  +  + 
Sbjct: 251 GVDAG-AATVMNSFNVLNGIPATADAFLQRDILKAA---WDFQGFVVSDWGSIGEMIPH- 305

Query: 338 NYTKTHEDSAAGVLKAGMDINCGTCM-LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLG 396
            Y +   ++A     AG D++  + M L      +  GKV E  +D A+L +  ++  LG
Sbjct: 306 GYARDRNEAALRAAVAGSDMDMESGMYLTELPELVRDGKVPESLVDEAVLRILGLKYDLG 365

Query: 397 LFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIG 456
           LF  DP +    +     +      +   D AR+ IVLLKN+   LPL+KN   S+A+IG
Sbjct: 366 LF-ADPYRYADAEREKRILSNPARLETVRDMARKSIVLLKNEGGVLPLSKNG-GSIALIG 423

Query: 457 PLVNNISQMGGGY--TGIPCSPKSLLRGLEAYVSKT-HYASGC------------HDVPC 501
           PL ++     G +  T  P S  S+L G++AY   T  Y  G               +  
Sbjct: 424 PLASDKDSPLGSWRLTAEPNSAVSVLEGMQAYSGNTLAYERGVPLAEGETAFVFETKINT 483

Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
              +G   AV +A+ ++ V++V G    Q  E R R +L LPG Q  L+ +V   +   +
Sbjct: 484 TDRSGIPAAVELARSSETVVMVLGEHGFQSGEGRSRAALGLPGLQQELLEAVHAVNPN-I 542

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           +LVL  G PL +++A     + +IL   + G     A+AE+++GD+NP G+LPMT +P+S
Sbjct: 543 VLVLMNGRPLTINWAA--EHVPAILEAWHLGTESGHAIAEVLYGDYNPSGKLPMT-FPKS 599

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYT------GTQVYGFGHGLSYTNYSYKFLSAPSE 675
             ++P+   ++ A    +YPG    F++         +Y FGHGLSY+++ Y        
Sbjct: 600 VGQIPVYYSHL-ATGRPEYPGNDLVFWSHYIDQVNEPLYPFGHGLSYSDFRY-------- 650

Query: 676 LTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLF 735
               A LK       LQ T          E+    SL   V + + NA D  G+ +V L+
Sbjct: 651 ----ADLK-------LQTT----------EIRPGGSL--EVSVRLENASDTPGTEIVQLY 687

Query: 736 ARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISF 768
            R    S+  P ++L GF++V   A GS E+SF
Sbjct: 688 VRDHFGSRARPVRELKGFEKVFLEAGGSAEVSF 720


>gi|451821117|ref|YP_007457318.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
 gi|451787096|gb|AGF58064.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
          Length = 750

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 207/708 (29%), Positives = 329/708 (46%), Gaps = 106/708 (14%)

Query: 76  EKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASF 135
           EK  +L   A    RLGIP   +  + +HG                 T FP  L  A SF
Sbjct: 95  EKSNELQKIAVEESRLGIPIL-FGLDVIHGYR---------------TIFPIPLAEACSF 138

Query: 136 NRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAY 194
           +           A EA A      AGL + +AP ++I RDPRWGR  E  GEDP + S  
Sbjct: 139 DIEKIKESARIAAKEASA------AGLHWTFAPMVDISRDPRWGRVAEGAGEDPYLGSVI 192

Query: 195 AVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKW 254
           A   V+ FQGE+  + +                           + AC KH   Y     
Sbjct: 193 AKARVEGFQGESLDNPES--------------------------ILACAKHFAGYGAPDG 226

Query: 255 GNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA 313
           G      +N + ++ Q   D + PPF++  E G     M ++N +NG+P  +   L    
Sbjct: 227 GR----DYNTVDMSLQTLHDVYLPPFKAAAEAG-VGTFMSAFNDLNGIPCTVNKYLLTDV 281

Query: 314 -RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHT-QSAI 371
            R ++GF G++ SD +++  +     Y + ++ ++   L AG+D++      R+     +
Sbjct: 282 LREKFGFNGFVVSDANSIPEVV-VHGYAEDNKAASKKALNAGLDMDMSQGTYRNELPELV 340

Query: 372 DKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQG 431
            +G + E+ +D A+  +  V+  LGLF+ +P +    K     +C  EH + A D +R+ 
Sbjct: 341 KEGDILEEVLDEAVRRVLRVKFLLGLFD-NPYRTDAKKEEKTLLC-KEHLEAARDISRRS 398

Query: 432 IVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGG--GYTGIPCSPKSLLRGLEAYVS- 488
           IVLLKN+   LPL K+ +  +A++GPL  N ++M G   +TG P    +++ G++A VS 
Sbjct: 399 IVLLKNENNALPLKKD-LKKIAVVGPLAENAAEMLGTWSHTGNPSDVVTIISGIKAAVST 457

Query: 489 --KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQ 546
             +  YA GC  +       F  AVR+AK++D +I V G +     E   R+ + LPG+Q
Sbjct: 458 ETEILYAEGCK-ITGEECIDFEGAVRVAKESDVIIAVVGENSDMSGEAASRIDINLPGKQ 516

Query: 547 MSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGD 606
             L+  + +  K P+I+VL  G PL + + EA++ + +++     G     A+A+++FGD
Sbjct: 517 EELLKELRKIGK-PLIVVLINGRPLTIPW-EAEN-VDALVEAWQLGTQSGNAIADVLFGD 573

Query: 607 FNPGGRLPMTWYPESFTKVPM---NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
           +NP G+L  T +P S  +VP+   N M  R     ++  + Y       +Y FG GLSYT
Sbjct: 574 YNPSGKLVAT-FPYSVGQVPIYYNNPMTGRPAGKIKFTSK-YIDGPAEPLYPFGFGLSYT 631

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            + Y+ LS      +SA  K G                  D V         V++ VTN 
Sbjct: 632 TFKYENLS-----ILSAENKIG------------------DTVA--------VKVYVTNT 660

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
           G+V G  VV L+      S+  P K+L  F++V    K  K I F ++
Sbjct: 661 GEVSGEEVVQLYVSDVVASRVRPVKELKSFEKVLLQPKECKTIIFKLN 708


>gi|390945417|ref|YP_006409177.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
           17242]
 gi|390421986|gb|AFL76492.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
           17242]
          Length = 771

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 206/744 (27%), Positives = 323/744 (43%), Gaps = 123/744 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T+FP     A+++N  L   +G  +A 
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------ATTFPTAPGQASTWNPELIERMGKVIAA 167

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R      Q G   + P ++I RDPRW R +E+ GED  + +     +V+         
Sbjct: 168 EIRL-----QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVR--------- 213

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
              G G G              D        +  KH IAY   + G   +   + ++ E+
Sbjct: 214 ---GTGSG--------------DLSQSRHALSTLKHFIAYGASEGG---QNGGSNLLGER 253

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCD 328
           +  +T+ PPF + ++ G A  +M +YN V+G+P    R  L    R EWGF G++ SD  
Sbjct: 254 ELRETYLPPFEAAVKAG-ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLL 312

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  + E      +  ++A   L+AG+D +         + A + G V E +IDRA+  +
Sbjct: 313 SIEGLHETHGVAGSVREAAVQALRAGVDADLKGGAFASLREAAEAGDVAEAEIDRAVERV 372

Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
            +++  +GLF  +P      +    +V  + H +LAL+AARQ + LL+N    LPL+   
Sbjct: 373 LALKFEMGLFE-NP---YIDEAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRR 428

Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAG 506
           +  +A+IGP  +NI    G YT    +  ++  GLE  + +    Y+ GC  V     + 
Sbjct: 429 LRRVAVIGPNADNIYNQLGDYTAQQTAANTVRDGLEKLLGRDRVVYSRGC-TVRGGDRSE 487

Query: 507 FHEAVRIAKKADFVIVVAG----LDLTQE-------------------TEDRDRVSLLLP 543
              AV  A+  D  +VV G     D   E                    E  DR +L L 
Sbjct: 488 IAAAVSAARGTDAAVVVIGGSSARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALL 547

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
           G+Q  L+  +  T   P+I+V   G PLD+    A  Q  ++L   YPG  G  A+AE I
Sbjct: 548 GEQEELLRRIKATGT-PLIVVCIAGRPLDLR--RASEQADALLMAWYPGARGGDAVAETI 604

Query: 604 FGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
            G  NP GRLP+T  P +  ++P+     R  +        Y   T   +Y FG+GLSY+
Sbjct: 605 LGHNNPAGRLPIT-IPRAEGQIPVYYNKKRPAN------HDYTDLTAAPLYPFGYGLSYS 657

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            + Y  L A             S  N+L+               SC          + N 
Sbjct: 658 TFEYGSLEARQ-----------SGDNVLE--------------VSC---------RIRNT 683

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
            D +G  VV L+      S   P +QL GF R+       +++SF +   E LS+ +  G
Sbjct: 684 SDREGDEVVQLYISDMVASTVRPPRQLGGFRRIRLAPGEQRQVSFTLGD-EALSLIDPQG 742

Query: 784 RRILPLGNHVLMVGELRHSLTIET 807
           RR++  G+ V+ VG     + ++T
Sbjct: 743 RRVVEKGDFVIAVGSSSQDIRLQT 766


>gi|371776218|ref|ZP_09482540.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 774

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 198/685 (28%), Positives = 324/685 (47%), Gaps = 98/685 (14%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
           T+FP  L  A S++  L        A EA A      +G+ + +AP ++I RDPRWGR  
Sbjct: 128 TTFPIPLAEACSWDLQLMEKSARIAAEEATA------SGVAWNFAPMVDISRDPRWGRIM 181

Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
           E  GEDP + S  A   V+ FQG +                  K F + +       + A
Sbjct: 182 EGAGEDPFLGSLIARARVRGFQGID----------------SYKDFSKPN------TMMA 219

Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
           C KH + Y   + G   R      I+E+   +T+ PPF++ +++G AS  M ++N++NGV
Sbjct: 220 CAKHFVGYGAAQAG---RDYHTVDISERTLFETYLPPFKAAVDEGVAS-FMTAFNELNGV 275

Query: 302 PACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN-C 359
           P      +FQ   R++W F G + +D  A+  +  +  + K  + ++   + AG+D++  
Sbjct: 276 PCTGNKYIFQDILRHQWNFNGMVVTDYTAIQEMVAH-GFAKDLKQASKLAIDAGIDMDMI 334

Query: 360 GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE 419
               + + +  +++G+V EK ID A+  +  ++  LGLF+ DP K    +   + +   +
Sbjct: 335 SEGFVTYLKELVEEGQVSEKQIDVAVARILEMKFLLGLFD-DPYKYCDAEREKEVLMNPQ 393

Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPK 477
           H + A + A++ IVLLKN+   LPL K+    +A+IGP V     + G +   G      
Sbjct: 394 HLQAAREVAQRSIVLLKNENNVLPLRKDIPKRVALIGPFVKERESLNGEWAIKGDRSKSV 453

Query: 478 SLLRGLEAYVSKT----HYASGCHDVPCNSDA-------------GFHEAVRIAKKADFV 520
           +L  GL+   + T    +YA G   +P    A             GF EA+R+AK +D +
Sbjct: 454 TLWEGLQEKYADTPVRFNYAKGT-SLPLIDGATRHVSLEQGFDKSGFAEALRVAKTSDLI 512

Query: 521 IVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADS 580
           +V  G       E   R  + LPG Q  L+  + +T K P++LVL  G PLD+S+ EA++
Sbjct: 513 LVAMGEHYHWSGEAASRTDITLPGNQRELLKELKKTGK-PIVLVLFNGRPLDLSW-EAEN 570

Query: 581 QISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSSR 638
            + +I+   YPG     A+A+++ GD+NP  RL +T +P +  ++P+  N  N       
Sbjct: 571 -VDAIVEAWYPGIMAGHAVADVLSGDYNPSARLVVT-FPRNVGQIPIFYNMKNTGRPFDE 628

Query: 639 QYPG---RSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTG 695
            +P     SY     + ++ FG GLSYT++ Y         TIS+          L + G
Sbjct: 629 NHPADYKSSYIDSPNSPLFPFGFGLSYTSFQY------DNATISSQK--------LTKGG 674

Query: 696 SRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDR 755
           S +                 V + VTN G+VDG  VV L+      S   P K+L GF +
Sbjct: 675 SLI-----------------VSVDVTNTGNVDGEEVVQLYIHDKVGSVTRPVKELKGFKK 717

Query: 756 VHTVAKGSKEISFGVDPCEQLSIAN 780
           +      +K + F ++  E L + N
Sbjct: 718 IFLKKGETKTVEFTINE-EMLKMYN 741


>gi|393789623|ref|ZP_10377743.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
           CL02T12C05]
 gi|392650339|gb|EIY44008.1| hypothetical protein HMPREF1068_04023 [Bacteroides nordii
           CL02T12C05]
          Length = 863

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 154/459 (33%), Positives = 227/459 (49%), Gaps = 58/459 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N +LS   RA+ L+  LTL+EK+  + + +  I RLGI  Y+WW+E+LHG+A  G  
Sbjct: 24  PYRNPNLSPEERAEDLLGRLTLKEKVTLMQNESFPIERLGIAHYDWWNEALHGVARAG-- 81

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQ---------AGLT 163
                     T FP  +  A+SF+     +I +AV+ E RA Y+             GLT
Sbjct: 82  --------IATVFPITMGMASSFDDKAIEDIFTAVSDEVRAKYHDAHRQGRRGVRCEGLT 133

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
           FW PNINIFRDPRWGRGQET GEDP + S   V  VK  QG      D            
Sbjct: 134 FWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPADAKYD------------ 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          L AC KH   Y +       R+SFNA  I+ +D  +T+ P F++ 
Sbjct: 182 --------------KLHACAKH---YAVHSGPEAKRHSFNAENISPRDLWETYLPAFKTL 224

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++     +MC+YN+  G P C    L  Q  R++WG+K  + SDC A++  F Y +   
Sbjct: 225 VQEADVKEVMCAYNRFEGDPCCGSNRLLTQILRDDWGYKHVVVSDCGAISDFF-YSDRHA 283

Query: 342 THED---SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH++   ++A  + +G D+ CG     H   A+ +G + E+ I+ +L  L   +  LG  
Sbjct: 284 THKNAADASAAAVLSGTDLECG-IEYAHLDKAVKEGLISEERINTSLFRLLKARFELGEM 342

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           + D     + K+  D V    HK++ALD  R+ +VLL N+   LPL K   + +A++GP 
Sbjct: 343 DDDALV-PWSKISIDTVDCQMHKQMALDITRKSMVLLHNN-GVLPLGKTG-ARIAVMGPN 399

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCH 497
            N+     G Y G P    ++L G+   +    Y  GC 
Sbjct: 400 ANDSVMQWGNYEGTPSHTVTVLEGIRNKIGNVIYEKGCE 438



 Score = 98.2 bits (243), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 82/301 (27%), Positives = 140/301 (46%), Gaps = 56/301 (18%)

Query: 515 KKADFVIVVAGLDLTQETEDR-----------DRVSLLLPGQQMSLVTSVARTSKRPVIL 563
           K+AD VI V G+    E E++           DR S+ LP  Q +++ ++ +  K+ + +
Sbjct: 597 KEADVVIFVGGISPNLEGEEKNFVNCPGFVGGDRTSIELPEVQRNILKALKKAGKKVIFV 656

Query: 564 VLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 623
             +G     ++         +IL   YPG+AG  A+A+IIFGD+NP G+LP+T+Y  +  
Sbjct: 657 NCSGSA---MALVPETQSCDAILQAWYPGQAGGTAVADIIFGDYNPSGKLPVTFYKNTEQ 713

Query: 624 KVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLK 683
                D +M+        GR+YR+ T + ++ FG+GLSYT + +               K
Sbjct: 714 LPDFEDYSMK--------GRTYRYMTESPLFPFGYGLSYTTFQFS--------------K 751

Query: 684 AGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQ 743
           AG +K +                T    ++F  +++V N G  DG+ VV ++ R     +
Sbjct: 752 AGLNKRV---------------ATINQPVQF--KVNVKNTGKRDGTEVVQIYIRKADDEE 794

Query: 744 GTPEKQLIGFDRVHTVAKGSKEISFGVDP-CEQLSIANKHGRRILPLGNHVLMVGELRHS 802
           G P K L  F  V   A  S  ++  + P   +      +  RI+P G++ +M G   ++
Sbjct: 795 G-PVKSLRAFRPVTLKAGKSATVNITLSPDAFEFFDPETNTMRIVP-GDYEIMYGNSSNT 852

Query: 803 L 803
           L
Sbjct: 853 L 853


>gi|393779898|ref|ZP_10368130.1| glycosyl hydrolase family 3, N-terminal domain protein
           [Capnocytophaga sp. oral taxon 412 str. F0487]
 gi|392609318|gb|EIW92128.1| glycosyl hydrolase family 3, N-terminal domain protein
           [Capnocytophaga sp. oral taxon 412 str. F0487]
          Length = 770

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 221/788 (28%), Positives = 347/788 (44%), Gaps = 127/788 (16%)

Query: 61  ISTRAKSLISLLTLQEKIQQLS----DNASAIPRLGIPAYEWWSESLHGIASNGPGVNFN 116
           I  R  S++ L+TL+EKI Q++    D +   P +      +  + L G   N   V   
Sbjct: 43  IEQRVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMADKYQPYLEKGLVGSIFNATSVAGI 102

Query: 117 GTVSSV----------------------TSFPQVLVSAASFNRSLWSNIGSAVAVEARAM 154
             +  +                      T FP  L  + S++ +L        A EA A 
Sbjct: 103 RNLQKIAVEQTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLTLMRKTAELAAREASA- 161

Query: 155 YNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQG-ENWKSDDG 212
                 G+ + +AP ++I RD RWGR  E  GEDP + S  A   VK FQG +NW+    
Sbjct: 162 -----DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQGGDNWQM--- 213

Query: 213 GIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFN-AMITEQDT 271
                                     L AC KH   Y   + G      +N A ++    
Sbjct: 214 --------------------LSSPHTLLACGKHFAGYGAAESGK----DYNTAELSMHTL 249

Query: 272 EDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAV 330
            + + PP+ + +   +   IM S N++NGVPA     L  +  R EWGF G + SD    
Sbjct: 250 RNVYLPPYEATL-NARVGSIMASLNEINGVPATADKWLLTEVLRKEWGFNGLLVSDYTG- 307

Query: 331 ATIFEYQNYTKTHEDSAAGVLKAG----MDINCGTCMLRHTQSAIDKGKVQEKDIDRALL 386
             I E   +    +D  A  L A     MD+N G   +++  + + +GKV E  ID+A+ 
Sbjct: 308 --INELVRHGVAKDDKQAANLSANAGIEMDMN-GATFIKYLSALVKEGKVTEAQIDKAVR 364

Query: 387 NLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNK 446
           ++  ++  LGLF+ DP +        ++  T ++ K+A  A    +VLLKN+ + LP+ K
Sbjct: 365 HILEIKFLLGLFD-DPYRYLDETRAKENTFTEKYLKVARQAVASSVVLLKNEAEVLPIKK 423

Query: 447 NAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGL-EAYVS---KTHYASGCHDVP 500
           ++  ++A+IGP++NN S + G +T  G      SLL GL E Y +   K  YA GC    
Sbjct: 424 DSGKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLLTGLTEKYKATNVKLLYAEGCGFTT 483

Query: 501 CNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRP 560
            +++    EAV +A+KAD V+V  G   +   E   R  + LP  Q  L+ ++ +T  +P
Sbjct: 484 ISTEQ-LKEAVAMARKADRVLVAVGEQSSWSGESAVRTDIRLPQAQRQLLEAL-KTINKP 541

Query: 561 VILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPE 620
           + ++   G PLD+S+   +  + +IL   +PG  G   +A++I GD NP G L M+ +P 
Sbjct: 542 IAIITFSGRPLDLSW--ENENVQAILQAWFPGTQGGYGIADVIAGDVNPSGHLTMS-FPR 598

Query: 621 SFTKVPM-----------NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKF 669
           S  ++P+           +  N   D    Y    Y   + T +Y FG+GLSYT ++   
Sbjct: 599 SVGQIPIYYNYKSTGRPVHTNNEEVDHRPHYNA-GYLDSSITPLYPFGYGLSYTTFAISN 657

Query: 670 LSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGS 729
           +    +     SLK  +D  I                         V  SV N G  +G 
Sbjct: 658 VHLNKK-----SLKRYNDSII-------------------------VNASVQNTGTTEGE 687

Query: 730 HVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL 789
            VV L+ R    S   P K+L GF ++   A  SK++ F + P E L+    +G++    
Sbjct: 688 IVVQLYTRQLVASVSRPVKELKGFQKISLKAGESKQVRFEL-PSEALAFYGINGKKDTEP 746

Query: 790 GNHVLMVG 797
              +L VG
Sbjct: 747 SECLLWVG 754


>gi|430736195|gb|AGA60127.1| glycoside hydrolase [Aminobacter sp. Gsoil204]
          Length = 772

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 200/666 (30%), Positives = 313/666 (46%), Gaps = 97/666 (14%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
           T FP  L  AAS++           A EA A       G+ + +AP +++ RDPRWGR  
Sbjct: 141 TIFPISLGEAASWDLKAIEKAARISATEASA------EGIHWTFAPMVDVARDPRWGRIS 194

Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
           E  GED  + S  A   V+ FQG + K+ D                           + A
Sbjct: 195 EGAGEDVYLGSRIAEARVRGFQGNDLKAVD--------------------------TVLA 228

Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
             KH  AY   + G   R      I+E+   D + PPF++  +   A+  M S+N V+G+
Sbjct: 229 TAKHFAAYGAAQAG---RDYGTVDISERTLRDVYLPPFKAAADA-GAATFMTSFNDVDGI 284

Query: 302 PACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC- 359
           PA     L     R++WGFKG++ +D  ++  +  +  Y+K  + +    + AG+D++  
Sbjct: 285 PASGNHHLLTDVLRDKWGFKGFVVTDYTSINEMVAH-GYSKDLQQAGEQAINAGVDMDLQ 343

Query: 360 GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE 419
           G   + H   ++ +GKV    ID A+  +  ++ RLGLF  DP +          V   +
Sbjct: 344 GAVFMEHLAKSVAEGKVDVARIDAAVKAILEMKYRLGLFE-DPYRYSDEAREKATVYRPD 402

Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGI---PCSP 476
             + A D AR+ +VLLKN    LPL  +A  S+A+IGPL ++ + M G ++        P
Sbjct: 403 FLEAARDVARKSMVLLKNANNALPLAASA-KSIAVIGPLGDSKADMIGSWSAAGDRKTRP 461

Query: 477 KSLLRGLEAYVSKTHY------ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQ 530
            +LL G++A   K         AS   +    +D GF EA+ +A+K+D ++   G     
Sbjct: 462 VTLLEGMQARAPKGQSVAYVRGASYAFEDAGKTD-GFAEAIALAQKSDVIVAAMGERWDM 520

Query: 531 ETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGY 590
             E   R SL LPG Q +L+  + +T K P+ILVL  G P  + +A  D+ + +IL   Y
Sbjct: 521 TGEAASRTSLDLPGNQQALLQELKKTGK-PIILVLMSGRPNSIEWA--DANVDAILEAWY 577

Query: 591 PGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP----MNDMNMRADSSR---QYPGR 643
           PG  G  A+A++++GD+NP G+LP T +P +  +VP    M +     D ++   +Y  R
Sbjct: 578 PGTMGGHAIADVLYGDYNPSGKLPAT-FPRNVGQVPLYYDMKNTGRPIDPAKPDAKYVSR 636

Query: 644 SYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS-ASLKAGSDKNILQQTGSRLDYVH 702
            Y     T +Y FG+GLSYT+++Y      S +T+S A +K G                 
Sbjct: 637 -YLNTPNTPLYPFGYGLSYTSFTY------SPVTLSKARIKPGEP--------------- 674

Query: 703 IDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKG 762
                          ++VTN+G  DG  VV L+ R    S   P ++L GF ++      
Sbjct: 675 -----------LTASVTVTNSGARDGEEVVQLYVRDLVGSVTRPVRELKGFRKIPLKKGE 723

Query: 763 SKEISF 768
           SK +SF
Sbjct: 724 SKTVSF 729


>gi|420148909|ref|ZP_14656095.1| glycosyl hydrolase family 3, N-terminal domain protein
           [Capnocytophaga sp. oral taxon 335 str. F0486]
 gi|394754508|gb|EJF37885.1| glycosyl hydrolase family 3, N-terminal domain protein
           [Capnocytophaga sp. oral taxon 335 str. F0486]
          Length = 770

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 226/803 (28%), Positives = 351/803 (43%), Gaps = 157/803 (19%)

Query: 61  ISTRAKSLISLLTLQEKIQQLSD-----------------------------NASAI--- 88
           I  R  S++ L+TL+EKI Q++                              NA+++   
Sbjct: 43  IEQRVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMADKYQPYLEKGLVGSIFNATSVVGM 102

Query: 89  ----------PRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRS 138
                      RLGIP   +  + +HG                 T FP  L  + S++ +
Sbjct: 103 RKLQKIAVEQTRLGIPIL-FGQDVIHGYK---------------TIFPIPLAESCSWDLA 146

Query: 139 LWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVE 197
           L        A EA A       G+ + +AP ++I RD RWGR  E  GEDP + S  A  
Sbjct: 147 LMRKTAELAAREATA------DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAEA 200

Query: 198 FVKSFQG-ENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGN 256
            VK FQG +NW++                             L AC KH   Y   + G 
Sbjct: 201 RVKGFQGGDNWQT-----------------------LSSPHTLLACGKHFAGYGAAESGK 237

Query: 257 FSRYSFN-AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-CLRGDLFQKAR 314
                +N A ++     + + PP+ + +  G  S IM S N++NGVPA   +  L +  R
Sbjct: 238 ----DYNTAELSMHTLRNVYLPPYEATLNAGVGS-IMASLNEINGVPATAYKWLLTEVLR 292

Query: 315 NEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG----MDINCGTCMLRHTQSA 370
            EWGF G + SD      I E   +    +D  A  L A     MD+N G   +++  + 
Sbjct: 293 KEWGFNGLLVSDYTG---INELVRHGVAKDDKQAANLSANAGIEMDMN-GATFIKYLSAL 348

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
           + +GKV E  ID+A+ ++  ++  LGLF+ DP +        ++  T E+ K+A  A   
Sbjct: 349 VKEGKVTEAQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFTEEYLKVARQAVAS 407

Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGL-EAYV 487
            +VLLKN+ + LP+ K++  ++A+IGP++NN S + G +T  G      SLL GL E Y 
Sbjct: 408 SVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLLTGLTEKYK 467

Query: 488 S---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPG 544
               K  YA GC     +++    EAV IA+KAD V+V  G   +   E   R  + LP 
Sbjct: 468 GTNVKLLYAEGCGFTTISTEQ-LKEAVAIARKADRVLVAVGEQSSWSGESAVRTDIRLPQ 526

Query: 545 QQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIF 604
            Q  L+ ++   +K P+ ++   G PLD+S+   +  + +IL   +PG  G   +A++I 
Sbjct: 527 AQRQLLEALKAINK-PIAIITFSGRPLDLSW--ENENVQAILQAWFPGTQGGYGIADVIA 583

Query: 605 GDFNPGGRLPMTWYPESFTKVPMN----------DMNMRADSSRQYPGRSYRFYTGTQVY 654
           GD NP G+L M+ +P S  ++P+             N      R +    Y   + T +Y
Sbjct: 584 GDVNPSGQLTMS-FPRSVGQIPIYYNYKSTGRPVYTNNEEVDHRPHYNAGYLDSSITPLY 642

Query: 655 GFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRF 714
            FG+GLSYT ++   +    +     S+K  +D  I                        
Sbjct: 643 PFGYGLSYTTFAINNVHLNKK-----SIKRYNDSII------------------------ 673

Query: 715 HVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
            V  SV N G  +G  VV L+ R    S   P K+L GF ++   A  SK++ F + P E
Sbjct: 674 -VNASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFQKIPLKAGESKQVHFEL-PSE 731

Query: 775 QLSIANKHGRRILPLGNHVLMVG 797
            L+    +G++       +L VG
Sbjct: 732 ALAFYGINGKKDTEPSECLLWVG 754


>gi|156740905|ref|YP_001431034.1| glycoside hydrolase family 3 [Roseiflexus castenholzii DSM 13941]
 gi|156232233|gb|ABU57016.1| glycoside hydrolase family 3 domain protein [Roseiflexus
           castenholzii DSM 13941]
          Length = 790

 Score =  243 bits (620), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 219/761 (28%), Positives = 336/761 (44%), Gaps = 158/761 (20%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA     E   G  +NG            T+FPQ++  A+++   L   +   +  
Sbjct: 97  RLGIPAL-IHDECCSGFLANG-----------ATNFPQIIGIASAWEPELVEAMTRVIRQ 144

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + RA+      GL   AP ++I RDPRWGR +ET GEDP + S     +++  QG +W  
Sbjct: 145 QMRAVGV--HHGL---APVLDIARDPRWGRTEETFGEDPYLTSVMGAAYIRGLQGADWSE 199

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMITE 268
                                        + A  KH + Y   + G N++     A IT 
Sbjct: 200 G----------------------------VMATGKHFVGYSASEGGLNWA----PAHITT 227

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA-----CLRGDLFQKARNEWGFKGYI 323
           ++  + +  PF + +   + + IM +Y++++G P       L G L    R+EWGF+G +
Sbjct: 228 RELREVYLAPFETAVRAARLASIMPAYHEIDGEPCSGAHWLLTGIL----RDEWGFEGLV 283

Query: 324 TSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINC------GTCMLRHTQSAIDKGKVQ 377
            SD  A+  +  Y    +    +A   L+AGMDI        G  +L     A+  G++ 
Sbjct: 284 VSDYMAIDQLRNYHKLARDKAHAARLALEAGMDIELPNVEAYGQPLL----DALAAGEIP 339

Query: 378 EKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVC----TSEHKKLALDAARQGIV 433
            + +DR++  + +++   GLF           + PD V     T   ++LA + AR+ IV
Sbjct: 340 MEWVDRSVRRILTLKFAFGLFENP-------YVDPDAVPAVFDTPAQRELAREIARKSIV 392

Query: 434 LLKNDKKFLPLNKNAVSSLAIIGPLVNN-------------------ISQMGGGYTGIPC 474
           LLKN+   LPL K  +S++A+IGP  ++                   +SQ+G     +P 
Sbjct: 393 LLKNEGNRLPLPKT-LSAIAVIGPNADSKRNLLGDYSYPAHIETLITLSQLGFSEHPLPD 451

Query: 475 SPK---------SLLRGLEAYVSKTH---YASGCHDVPCNSDAGFHEAVRIAKKADFVIV 522
           S +         S++  +   VS T    YA GC DV   S  GF EA+  A+KA+  IV
Sbjct: 452 SIRLIENDSSMLSIVEAIRRTVSPTTQVLYARGC-DVNSPSTDGFAEAIEAARKAEVAIV 510

Query: 523 VAG--LDLTQET---EDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           V G    LT E    E RD   L LPG Q  LV ++  T   PV+LVL  G P  +    
Sbjct: 511 VVGDKAGLTPECTSGEFRDSAHLTLPGVQQQLVAAILATGT-PVVLVLVTGRPYAIPHLV 569

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSS 637
             +      W+  PG  GA ALAE +FGD NPGG+LP+T +P    +VP+   + R   +
Sbjct: 570 DATPAVVEAWL--PGAEGAPALAEALFGDVNPGGKLPIT-FPRHVGQVPLFYAH-RPSGA 625

Query: 638 RQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSR 697
           R +    Y   +   ++ FG GLSYT ++++ L+   ++T    ++A  D          
Sbjct: 626 RSFFYGPYMDESNQPLFPFGFGLSYTQFAFENLTVTPDVTTDGEVQASVD---------- 675

Query: 698 LDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVH 757
                                 V N G+  G  VV L+ R    S   P K+L GF RVH
Sbjct: 676 ----------------------VINTGERSGDEVVQLYTRTEGASVTRPVKELRGFKRVH 713

Query: 758 TVAKGSKEISFGVDPCEQLSIANKHGRRIL-PLGNHVLMVG 797
                 K + F + P E L+  +     ++ P+  H+ MVG
Sbjct: 714 LQPGERKRVIFTL-PVELLAYYDTTMHLVVEPVEVHI-MVG 752


>gi|393789624|ref|ZP_10377744.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
           CL02T12C05]
 gi|392650340|gb|EIY44009.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
           CL02T12C05]
          Length = 855

 Score =  243 bits (619), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 164/490 (33%), Positives = 243/490 (49%), Gaps = 69/490 (14%)

Query: 16  FRVLQLI-VVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTL 74
            ++L+LI  ++ ++ FS S    NK +            F + +     R   L++ LT+
Sbjct: 1   MKILKLIGCLIPLLVFSLSGQAQNKTEL-----------FRDMTAPQHERILDLLNRLTV 49

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           +EK+  L ++A  IPRL I  Y   +E+LHG+   G            T FPQ +  AA+
Sbjct: 50  EEKVSLLVNDAREIPRLNIDKYNHGNEALHGVVRPG----------EFTVFPQAIGLAAT 99

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNINIFRDPRWGRGQETP 184
           +N +L   + +A++ EAR  +     G          LTFW+P +N+ RDPRWGR  ET 
Sbjct: 100 WNPNLIFRVSTAISDEARGRWKELDYGKKQIAGGSDLLTFWSPTVNMARDPRWGRTPETY 159

Query: 185 GEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCK 244
           GEDP +      EFVK  QG+N               R LK               +  K
Sbjct: 160 GEDPFLSGRIGCEFVKGLQGDN--------------PRYLKTV-------------STPK 192

Query: 245 HLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC 304
           H  A + E     +R S NA ++E+D  + + P F  CI  GKA  IM +YN VN VP  
Sbjct: 193 HFAANNEEH----NRSSCNARMSERDLREYYLPAFERCIVDGKAQSIMMAYNAVNDVPCT 248

Query: 305 LRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM 363
           +   L +K  R +W F GYI SDC A   +     Y K  E +A   LKAG+D+ CG  +
Sbjct: 249 VNIYLIKKVLRGDWNFNGYIVSDCSAPEWMVTKHKYVKNLEAAATLALKAGLDLECGDRV 308

Query: 364 LRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKK 422
                  A ++  V E +ID A  ++   ++ LGLF+ DP +  Y K+ P  +   EH++
Sbjct: 309 YTAPLLKAYNEYMVSEAEIDSAAYHILRGRMLLGLFD-DPSQNPYNKIEPSVIGCKEHQE 367

Query: 423 LALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP-CSPKSLLR 481
           LAL+ ARQ +VLLKN K FLPLN+  + S+A++G  ++      G Y+G P  +P S+L 
Sbjct: 368 LALETARQSMVLLKNQKNFLPLNRKKIRSIAVVG--ISAAHCEFGDYSGNPKNTPVSVLD 425

Query: 482 GLEAYVSKTH 491
           G++ Y    +
Sbjct: 426 GIKKYAENAN 435



 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 95/306 (31%), Positives = 154/306 (50%), Gaps = 48/306 (15%)

Query: 502 NSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPV 561
           N  A F +A ++AK+ D  + V G++ + E E +DR SL LP  Q   +  + + +   V
Sbjct: 588 NRLALFGDAGKVAKECDVTVAVLGINKSIEREGQDRYSLELPIDQQEFIKELYKVNPNTV 647

Query: 562 ILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPES 621
           + VL  G  + +++   D  + +IL   YPGE G  A+AE++FGD+NPGGRLP+T+Y  S
Sbjct: 648 V-VLVAGSSMAINWM--DENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYY-NS 703

Query: 622 FTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISAS 681
             ++P  D     D S +   R+Y+++ G  +Y FG+GLSYTN+ YK             
Sbjct: 704 LDELPAFD-----DYSVK--NRTYQYFEGKPLYEFGYGLSYTNFKYK------------- 743

Query: 682 LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKV 741
                 K  + Q+   +D                +  +++N G  DG  V  ++ R P+ 
Sbjct: 744 ------KKSIMQSNDTVD----------------ITFNLSNVGKYDGDEVAQVYVRYPET 781

Query: 742 SQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR-ILPLGNHVLMVGELR 800
               P KQL GF RVH     S +I+  + P ++L   ++  R+ + P G +V  VG   
Sbjct: 782 GTYMPLKQLKGFSRVHLKKGKSADITISI-PKKELRYWDEKTRQFVTPTGEYVFQVGGSS 840

Query: 801 HSLTIE 806
            +++IE
Sbjct: 841 ENISIE 846


>gi|295132888|ref|YP_003583564.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294980903|gb|ADF51368.1| beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 855

 Score =  243 bits (619), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 152/463 (32%), Positives = 231/463 (49%), Gaps = 57/463 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N +LS   RA+ L++ LTL+EK   + D + AIPRLGI  + WWSE+LHG A+N   
Sbjct: 13  PYQNPNLSPEERAEDLVNRLTLEEKASLMFDVSEAIPRLGIKKFNWWSEALHGFANN--- 69

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY---------NLGQAGLT 163
                    VT FP+ +  AASF+  L   +  A + E RA Y         N     L+
Sbjct: 70  -------DDVTVFPEPVGMAASFDDELVYQVFDATSDEVRAKYHEALRNGEENKRFLSLS 122

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PN+NIFRDPRWGRGQET GEDP + S   V+ VK  QG     +D       + K++
Sbjct: 123 VWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVQVVKGLQG----PEDA------KYKKL 172

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
           L                AC KH   +   +W   SR+  N   ++++D  +T+ P F+  
Sbjct: 173 L----------------ACAKHYAVHSGPEW---SRHELNLNNVSQRDLWETYLPAFKVL 213

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           ++      +MC+Y +++  P C    L Q+  R +WGF+  + SDC A+   +   N + 
Sbjct: 214 VQDANVRQVMCAYQRLDDEPCCGSDRLLQQILREKWGFEHLVVSDCGAIQDFYTSHNVSS 273

Query: 342 THEDSAAGVLKAGMDINC--GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
               +AA  + AG D+ C       +    A++KG V+E+DIDR++  +   +  LG  +
Sbjct: 274 DAVHAAAKAVLAGTDVECQWDKHNYKLLPEAVEKGLVKEEDIDRSVKRVLIGRFELGEMD 333

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            D     Y ++    +   EH++LAL  AR+ + LL+N    LPL+K     +A+IGP  
Sbjct: 334 PD-EIVPYAQIPASVINNEEHRQLALKMARESMTLLQNKNNILPLSKGQ-DRIAVIGPNA 391

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGCHDVPCN 502
           ++   + G Y G P    S+L G+    SK    S  +D  C+
Sbjct: 392 DDEPMLWGNYNGTPVRTISILDGI---TSKIGEKSIVYDKACD 431



 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 85/313 (27%), Positives = 144/313 (46%), Gaps = 63/313 (20%)

Query: 498 DVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQM 547
           ++P N    + + ++  K  + VI V GL    E E+           DR  + LP  Q 
Sbjct: 577 EIPVN----YEDLLKKLKGIETVIFVGGLSTKLEGEEMPVSYPGFKGGDRTDIALPSVQR 632

Query: 548 SLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDF 607
           + + ++    K+ VI V   G    +      +   +IL   Y GE+G +A+A+++FGD+
Sbjct: 633 NCLKTLKDAGKK-VIFVNNSGSA--IGLVPETTSCDAILQAWYGGESGGQAVADVLFGDY 689

Query: 608 NPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYS 666
           NP G+LP+T+Y ++ T++P   D +M         GR+YRF     ++ FGHGLSYTN+ 
Sbjct: 690 NPSGKLPVTFYKDT-TQLPDFEDYSMN--------GRTYRFMKAEPLFPFGHGLSYTNF- 739

Query: 667 YKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDV 726
                           K G          ++LD   ID     TS   ++ IS++N G  
Sbjct: 740 ----------------KIGE---------AQLDKSEID-----TSSSVNITISISNEGKT 769

Query: 727 DGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGR-- 784
           +G  ++ ++     + +G P K L GF RV+      K ++  + P       +K  R  
Sbjct: 770 EGVEIIQVYVHKQGLEEG-PIKTLKGFKRVNLKPNEMKNVTINL-PSNSFEFYDKKARSM 827

Query: 785 RILPLGNHVLMVG 797
           +++P GN+ +  G
Sbjct: 828 KVMP-GNYEIFYG 839


>gi|56412920|ref|YP_149995.1| periplasmic beta-glucosidase [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. ATCC 9150]
 gi|197361851|ref|YP_002141487.1| beta-glucosidase [Salmonella enterica subsp. enterica serovar
           Paratyphi A str. AKU_12601]
 gi|56127177|gb|AAV76683.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Paratyphi A str. ATCC 9150]
 gi|197093327|emb|CAR58775.1| periplasmic beta-glucosidase precursor [Salmonella enterica subsp.
           enterica serovar Paratyphi A str. AKU_12601]
          Length = 768

 Score =  243 bits (619), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 209/751 (27%), Positives = 343/751 (45%), Gaps = 140/751 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMC--SYNQVNGVPACLRGDLF 310
            G      +N + ++ Q   + + PP+++ ++ G  + ++   S N +NG PA     L 
Sbjct: 218 GGK----EYNTVDMSSQRLFNDYMPPYKAGLDAGSGAVMVALNSLNSLNGTPATSDSWLL 273

Query: 311 QKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQ 368
           +   R+EWGFKG   SD  A+  + ++       ED+    LKAG+D++       ++  
Sbjct: 274 KDVLRDEWGFKGITVSDHGAIKELIKH-GTAADPEDAVRVALKAGVDMSMADEYYSKYLP 332

Query: 369 SAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HK 421
             I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+
Sbjct: 333 GLIKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHR 387

Query: 422 KLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSL 479
           K A + AR+ +VLLKN  + LPL K+   ++A++GPL ++   + G ++  G+     ++
Sbjct: 388 KEAREVARESVVLLKNRLETLPLKKSG--TIAVVGPLADSQRDVMGSWSAAGVANQSVTV 445

Query: 480 LRGLEAYV---SKTHYASGCHDV-------------------PCNSDAGFHEAVRIAKKA 517
           L G++  V   +K  YA G +                     P +  A   EAV+ AK+A
Sbjct: 446 LAGIQNAVGDGAKILYAKGANITNDKGIVDFLNLYEEAVKIDPRSPQAMIDEAVQAAKQA 505

Query: 518 DFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAE 577
           D V+ V G       E   R  + +P  Q  L+T++  T K P++LVL  G PL  +  +
Sbjct: 506 DVVVAVVGESQGMAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVK 562

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRA--- 634
            D Q  +IL   + G  G  A+A+++FGD+NP G+LP++ +P S  ++P+   ++     
Sbjct: 563 EDQQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPIS-FPRSVGQIPVYYSHLNTGRP 621

Query: 635 ---DSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNIL 691
              +   +Y  R +    G  +Y FG+GLSYT ++       S++T+S+          +
Sbjct: 622 YNPEKPNKYTSRYFDEANGP-LYPFGYGLSYTTFTV------SDVTLSSP--------TM 666

Query: 692 QQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLI 751
           Q+ G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL 
Sbjct: 667 QRDG---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLK 709

Query: 752 GFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           GF+++       K +SF +D  E L   N+ 
Sbjct: 710 GFEKITLKPGERKTVSFPID-IEALKFWNQQ 739


>gi|329956868|ref|ZP_08297436.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523625|gb|EGF50717.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 864

 Score =  243 bits (619), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 152/461 (32%), Positives = 230/461 (49%), Gaps = 62/461 (13%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGV 113
           + + S S + RA+ L+  LTL+EK+  + DN+  + RLGI  Y WW+E+LHG+A +G   
Sbjct: 24  YKDNSYSPAERAEDLVKQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSG--- 80

Query: 114 NFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--------GLTFW 165
                    T FPQ +  AASF+         AV+ EARA      A        GLT W
Sbjct: 81  -------WATVFPQPIGMAASFSPEALHTAFVAVSDEARAKNAAYSAEGSYKRYQGLTIW 133

Query: 166 APNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQ--GENWKSDDGGIGFGFREKRV 223
            P +NI+RDPRWGRG ET GEDP + S   V  VK  Q   EN K D             
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLASVMGVSVVKGLQCLDENEKYDK------------ 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                          + AC KH   +   +W   +R+SFNA  I+ +D  +T+ PPF + 
Sbjct: 182 ---------------VHACAKHFAVHSGPEW---NRHSFNAENISPRDLYETYLPPFEAL 223

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           +++GK   +MC+YN+  G P C    L     R EWG+ G + +DC A++  F      K
Sbjct: 224 VKEGKVKEVMCAYNRFEGEPCCGSNRLLNHILRREWGYDGIVVADCSAISD-FHNDKGHK 282

Query: 342 TH---EDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLF 398
           TH     +++  + +G D+ CG+   R     + KG + E DIDR++  L   +  LG  
Sbjct: 283 THADAASASSAAVLSGTDLECGS-NYRSLTEGVKKGFIDEADIDRSVKRLLQARFELGEM 341

Query: 399 NGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPL 458
           + +P + ++ ++    VC+ +H  L+LD AR+ + LL N    LPL +   +++A++GP 
Sbjct: 342 D-EPDQVRWAQIPYSVVCSDKHDSLSLDMARKSMTLLLNKNNALPLERGG-TTIAVMGPN 399

Query: 459 VNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH---YASGC 496
            N+     G Y G+P    ++L G+ + + K     Y  GC
Sbjct: 400 ANDSVMQWGNYNGLPKRTITILDGIRSAMGKDDKLIYEQGC 440



 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 82/292 (28%), Positives = 124/292 (42%), Gaps = 62/292 (21%)

Query: 498 DVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR----------DRVSLL 541
           D   N D GF E   I       K AD VI   G+    E E+           DR  + 
Sbjct: 577 DAQLNFDLGFKEEADIQRSVAKVKDADVVIFAGGISPQLEGEEMGVKLPGFRGGDRTDIE 636

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
           LP  Q  ++ ++    K+ VI V   G P  ++         +IL   YPG++G KA+AE
Sbjct: 637 LPAVQREMIKALHDAGKK-VIFVNCSGSP--IAMEPETEYCQAILQAWYPGQSGGKAVAE 693

Query: 602 IIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
           ++FGD+NP GRLP T+Y  +  ++P   D NM         G +YRF+ G  ++ FG+GL
Sbjct: 694 VLFGDYNPAGRLPATFY-RNLAQLPDFEDYNM--------AGHTYRFFNGEPLFPFGYGL 744

Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           SYT + Y  +   S      ++K                                + + V
Sbjct: 745 SYTTFKYGKIQLKSSAQTDETVK--------------------------------ITVPV 772

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDP 772
           TN G  +G  VV ++ +    + G P K L  F RV+  A  + ++   + P
Sbjct: 773 TNTGSRNGEEVVQVYLKKQGETDG-PVKTLRAFKRVYIPAGKTVKVELELTP 823


>gi|381200964|ref|ZP_09908096.1| beta-glucosidase [Sphingobium yanoikuyae XLDN2-5]
          Length = 792

 Score =  243 bits (619), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 221/749 (29%), Positives = 336/749 (44%), Gaps = 133/749 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  ESLHG  +           +  T FP  +  A SF+R L +++ S +A 
Sbjct: 140 RLGIPIL-FHEESLHGYMA-----------TDATMFPMAIGLAGSFDRQLMTDVQSVIAR 187

Query: 150 EARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           E RA       G+    +P ++I RDPRWGR +ET GEDP +     V  V   QGE+  
Sbjct: 188 EVRAR------GVHLALSPVVDIARDPRWGRIEETFGEDPYLCGEMGVAAVLGLQGES-- 239

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMIT 267
                                E    D++M  A  KH+  +   + G N +     A I+
Sbjct: 240 ---------------------EQIGPDKVM--ATLKHMTGHGQPQAGENIA----PAPIS 272

Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA----CLRGDLFQKARNEWGFKGYI 323
           E++  + F PPFR  +++   + +M SYN+++GVP+     L GD+    R EW F G +
Sbjct: 273 ERELRENFFPPFRQVVKRTGIAAVMPSYNEIDGVPSHQNKWLLGDIL---RGEWHFDGAV 329

Query: 324 TSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM-LRHTQSAIDKGKVQEKDID 382
            SD  AV  +    +     E +A   L+AG+D      +  R     +  GKV  + ++
Sbjct: 330 VSDYGAVPELDTIHHVQPDLEATARAALRAGVDCELPDGLAYRTLVEQVRAGKVPLEAVN 389

Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
            A   + +++ R GLF     +  Y  L       +E + LAL AA + IVLLKND   L
Sbjct: 390 LACTRMLTLKFRAGLFENPWPRADYDAL----TGNAEARALALKAAHKSIVLLKNDG-VL 444

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDV 499
           PL   A   +A+IGP  N      GGY+ IP    SLL G++A +   ++  +A G    
Sbjct: 445 PLKPGAHRKVAVIGP--NAAIARLGGYSSIPRQAVSLLDGVKAKLGNRAEIVHAQGVFIT 502

Query: 500 --------------PCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR-------DRV 538
                         P  +     EAV +AK AD +++  G D  Q + +        DR 
Sbjct: 503 QSEDRSVDEVLLADPAKNRQLIAEAVEVAKTADIILLAIG-DTEQTSREGFAKNHLGDRT 561

Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
           SL L G+Q  L  ++  T K  V+  + G  P   S+       ++++   YPG+ G  A
Sbjct: 562 SLDLVGEQNDLFAAMKATGKPVVVCAINGRPP---SYPAVVDGANALIECWYPGQEGGTA 618

Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
           +A+I+FGD NPG +LP+T   ++  ++P+     R  SSR    R Y F   + ++ FG 
Sbjct: 619 MADILFGDVNPGAKLPVTVARDA-GQIPI--FYNRKPSSR----RGYLFADASPLFPFGF 671

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT    KF+  P  L+ S  +  G D                            V++
Sbjct: 672 GLSYT----KFVFGPPRLSAS-RIGVGGD--------------------------VTVEV 700

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            V N G+V G  VV L+      S   P K+L GF+R+      S+ +   + P E  ++
Sbjct: 701 DVRNVGNVAGEEVVQLYVHDQTASVTRPLKELKGFERIALAPGESRTVRLTIGP-EAFAL 759

Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
            N   R ++  G   +MVG    S T++T
Sbjct: 760 WNLDMREVVEPGLFDIMVGP--DSATLQT 786


>gi|282878201|ref|ZP_06286997.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
 gi|281299619|gb|EFA91992.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
          Length = 947

 Score =  243 bits (619), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 201/756 (26%), Positives = 336/756 (44%), Gaps = 139/756 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+ S              T+FP  L    ++NR L   +G     
Sbjct: 159 RLGIPV-DFTNEGIRGVES-----------FKATNFPTQLGLGTTWNRKLIHQVGYITGR 206

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EAR +      G T  +AP +++ RD RWGR +E  GE P +V+   ++  +  Q     
Sbjct: 207 EARLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPFLVAELGIQMTRGLQ----- 255

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
                                         +++  KH  AY   K         +  ++ 
Sbjct: 256 --------------------------TNYQVASTGKHFAAYSNNKGAREGMARVDPQMSP 289

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
           ++ ++    P+   + +      M SYN  +GVP  ++G    L +  R ++GFKGY+ S
Sbjct: 290 REVQNIHLYPWGRVVREAGLLGAMSSYNDYDGVP--IQGSFHWLTEVLRQQFGFKGYVVS 347

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDI 381
           D DA+  +F         +++    + AG+++ C        +   +  + +G++  K I
Sbjct: 348 DSDALEYLFSKHRTAANMKEAVYKAVMAGLNVRCTFRSPDSFVLPLRELVKEGRIPMKVI 407

Query: 382 DRALLNLFSVQLRLGLFNGDPRKGKYGKLGPD-DVCTSEHKKLALDAARQGIVLLKNDKK 440
           D  L ++  V+  +G+F+   R  +      D +V    H+++AL A+R+ IVLLKN   
Sbjct: 408 DERLRDILRVKFMVGIFD---RPYQMNLQAADKEVDGKSHQQVALQASRESIVLLKNQNN 464

Query: 441 FLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVS---KTHYASGCH 497
            LPLNK ++  +A+ GP  N+ +     Y  +     ++  G+   V    +  Y  GC 
Sbjct: 465 TLPLNKASIKKIAVCGPNANDAAYALTHYGPLAVEVTTVFEGIRNKVGSDVEVTYTKGCD 524

Query: 498 ------------DVPCNSDAG--FHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
                       D P  +D      +AV   +++D  +VV G +     E++ R SL LP
Sbjct: 525 LVDAHWPESELVDYPMTADEQNEIDKAVEQVRQSDVAVVVLGGNSRTCGENKSRSSLELP 584

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
           G+Q+ L+ +V  T K  VILVL  G PL V++A  D  + +I+   YPG  G  A+A+++
Sbjct: 585 GRQLQLLKAVQATGK-TVILVLINGRPLSVNWA--DKFVPAIVEAWYPGSQGGTAVADVL 641

Query: 604 FGDFNPGGRLPMTWYPESFTKVPMN-------------DMNMRADSSRQYPGRSYRFYTG 650
           FGD+NPGG+L +T +P++  ++P N              + +  ++SR            
Sbjct: 642 FGDYNPGGKLTVT-FPKTVGQIPFNFPSKPAALVDGGNKLGLHGNASR----------AN 690

Query: 651 TQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCT 710
             +Y FGHGLSYT + Y      S L +SA            Q  S  D V    V SC 
Sbjct: 691 GALYYFGHGLSYTTFKY------SNLRLSA------------QNISPTDSV----VVSC- 727

Query: 711 SLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGV 770
                    +TN G   G  VV L+ +    +  T EK L GF+RVH     ++ +SF +
Sbjct: 728 --------DITNTGQRAGDEVVQLYIQDVLSTVTTYEKNLRGFERVHLKPGETRTLSFVI 779

Query: 771 DPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIE 806
            P E L + N+  + ++  G+  +M+G     + +E
Sbjct: 780 KP-EHLQLINEQYQHVVEPGDFKVMMGASSEDIRLE 814


>gi|408386263|gb|AFU63317.1| beta-glucosidase [uncultured bacterium]
          Length = 764

 Score =  243 bits (619), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 196/623 (31%), Positives = 298/623 (47%), Gaps = 87/623 (13%)

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           +AP ++I RD RWGR  E  GEDP + S  A   V  FQG                    
Sbjct: 167 FAPMVDIGRDARWGRVMEGAGEDPYLGSKVAEARVLGFQG-------------------- 206

Query: 225 KGFGEESDRGDELM-LSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCI 283
              G + +   EL  L A  KHL AY   + G   R    A ++E    + + PP+R+  
Sbjct: 207 ---GRDVNSLRELTTLLATGKHLAAYGAAESG---RDYNTAELSEHTLRNVYLPPYRAAH 260

Query: 284 EQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           E G     M S+N++NGVPA   R  + +  RNEWGFKG++ SD   V  +  +    + 
Sbjct: 261 EAG-VGTFMASFNEINGVPATADRWLMTEVLRNEWGFKGFVVSDYTGVNELVPH-GVARD 318

Query: 343 HEDSAAGVLKAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            + +A   + AG+D++  G   ++H   ++ +GKV EK ID A+  +  ++  LGLF+ D
Sbjct: 319 EKHAAELAVNAGIDMDMTGATFIKHLTESVKEGKVSEKSIDTAVRRILEMKFLLGLFD-D 377

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           P +    +    +  T E  + A  A    +VLLKND + LP+ K +  ++A+IGPL+N+
Sbjct: 378 PYRYLDEERAKQNTMTPEFMETARKAVSSSVVLLKNDNQVLPVTKESGKTVALIGPLMND 437

Query: 462 ISQMGGGYTGIPCSPKS--LLRGL----EAYVSKTHYASGCHDVPCNSDAGFHEAVRIAK 515
              + G + G+    KS  +L+GL    E    +  YA G   +   + A   EAV  A+
Sbjct: 438 SINLNGEWAGLGDRNKSVPILKGLTEKYEGSTVRLIYAEGAS-ITETTPAKIAEAVAAAR 496

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           +AD V+   G D     E   R  + LP  Q  L+ ++  T K PV+LV   G PLD+S+
Sbjct: 497 RADIVVAAMGEDFNWSGEAAVRTDIRLPEPQRELLRALKETGK-PVVLVTLSGRPLDLSW 555

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM-------- 627
              D  + +IL   +PG  G   +A++I GD+NP GRL M+ +P +  ++P+        
Sbjct: 556 --EDETMDAILQAWFPGTQGGHGIADVIAGDYNPSGRLVMS-FPRNIGQIPIYYNQKNTG 612

Query: 628 --NDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
              D+N   + S  Y   +Y   + T +Y FG+GLSYT +                  A 
Sbjct: 613 RPADLN---NESVDYKS-NYLDTSITPLYPFGYGLSYTTFEIS--------------NAQ 654

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
            DK  L+           D++T   ++R        NAG+VDG  VV L+ R    S   
Sbjct: 655 VDKKTLK--------TEDDKITVTANVR--------NAGEVDGEIVVQLYTRQLVASVTR 698

Query: 746 PEKQLIGFDRVHTVAKGSKEISF 768
           P K+L GF +V   A  SK+++F
Sbjct: 699 PVKELKGFQKVALQAGKSKQVTF 721


>gi|332671963|ref|YP_004454971.1| glycoside hydrolase family 3 domain-containing protein
           [Cellulomonas fimi ATCC 484]
 gi|332341001|gb|AEE47584.1| glycoside hydrolase family 3 domain protein [Cellulomonas fimi ATCC
           484]
          Length = 771

 Score =  243 bits (619), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 192/638 (30%), Positives = 293/638 (45%), Gaps = 98/638 (15%)

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           +AP ++I RDPRWGR  E  GEDP + SA A   V+ FQG       G IG     +RV+
Sbjct: 166 FAPMVDIARDPRWGRIVEGAGEDPYLGSAVAAAQVRGFQG-------GRIG---TPERVI 215

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIE 284
            G                 KH   Y     G   R      +++Q   + + PPF++ ++
Sbjct: 216 AG----------------PKHFAGYGAAIGG---RDYDEVDLSDQQLRNVYLPPFKAAVD 256

Query: 285 QGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTH 343
            G A  IM +Y  +NGVPA     L +   R+EWGF G++ SD +AV  +  +  Y    
Sbjct: 257 AG-AGNIMTAYMPLNGVPATANAWLLKDVLRDEWGFDGFLVSDANAVRNLVTH-GYAADL 314

Query: 344 EDSAAGVLKAGMDINCGTC--MLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
            D+A   ++ G+D+          H   A++ G V  + +D  +  +  V++RLGLF  +
Sbjct: 315 PDAAVRAVEVGLDLEMAISDPAYAHLPEAVESGAVSIEAVDACVRRILEVKVRLGLFE-E 373

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           P   +        +    H+++A +AA++  VLL+ND + LPL+ +A+ S+A+IGPL ++
Sbjct: 374 PYVDE--DRAAAVLADPAHRQVAREAAQRSAVLLRNDGRLLPLDPDALGSVAVIGPLADS 431

Query: 462 ISQMGGGYT--GIPCSPKSLLRGLEAYVSKT---HYASGCHDV-----------PCNS-- 503
                G +          ++L GL A +  T    YA G               P N+  
Sbjct: 432 KRDTLGPWVFDDDLDETVTILDGLRARLGDTTDVRYAPGVRPAQRTFPSMFEMFPGNATP 491

Query: 504 -------DAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVART 556
                  +A    AV +A+ +D  ++V G       E   R SL LPG+Q+ L+ +VA T
Sbjct: 492 DPEDFDDEAELARAVELARTSDVAVLVVGEWQGMIGEAASRSSLELPGRQLELLQAVAAT 551

Query: 557 SKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMT 616
              PV+L++  G PLD+ +A     + +IL + YPG  G  A+A+++ GD  PGG+LP T
Sbjct: 552 GT-PVVLLVMNGRPLDLRWAA--QHVPAILDVWYPGTQGGTAVADLLVGDVAPGGKLPFT 608

Query: 617 WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSEL 676
           W P +  +VPM   +       Q  GR Y     T ++ FG GL Y  ++Y  L      
Sbjct: 609 W-PRTVGQVPMI-YSHTTSFEPQNQGRRYWDEESTPLFPFGFGLGYGEFTYSDL------ 660

Query: 677 TISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFA 736
                               RLD   I    S T+      ++VTNAG  DG  VV L+ 
Sbjct: 661 --------------------RLDAETIPLDGSLTA-----SVTVTNAGQHDGDEVVQLYV 695

Query: 737 RVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE 774
                S   P ++L GF R+H  A  S+ +   + P E
Sbjct: 696 HQRHGSAARPVRELKGFQRIHLAAGESRTVELTIGPDE 733


>gi|298374050|ref|ZP_06984008.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_19]
 gi|298268418|gb|EFI10073.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_19]
          Length = 758

 Score =  242 bits (618), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 188/632 (29%), Positives = 298/632 (47%), Gaps = 100/632 (15%)

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN-WKSDDGGIGFGFREKRV 223
           +AP ++I RD RWGR  E  GEDP + S  A   V+ FQG N W+S              
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKARVEGFQGGNDWRS-------------- 207

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWG---NFSRYSFNAMITEQDTEDTFQPPFR 280
                      D   + ACCKH  AY   + G   N S  S N ++      + + PP+ 
Sbjct: 208 ---------LADVNTVLACCKHFAAYGAAEAGRDYNTSELSQNTLM------NYYMPPYL 252

Query: 281 SCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEY 336
           +  E G A+  M S+N++NGVP+     L  DL +K   +WGF G++ +D      I E 
Sbjct: 253 AAKEAGVAT-FMASFNEINGVPSTGNKWLMTDLLRK---DWGFNGFVVTDYTG---INEM 305

Query: 337 QNYTKTHEDSAAGVL--KAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
             ++    D  AG L   AG+D++  G    ++   ++ +GKV E++IDRA+ ++  ++ 
Sbjct: 306 VAHSIVRNDKEAGELAANAGIDMDMTGGIYSQYLVQSVKEGKVSEENIDRAVASILEMKF 365

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
            LGLF+ DP +    +   + +   E  + A + + + IVLLKND  F P++K+   ++A
Sbjct: 366 LLGLFD-DPYRYLDNEREKNTIMKPEFLQEARETSARSIVLLKNDNNFFPISKDKNITVA 424

Query: 454 IIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVSKTH----YASGCHDVPCNSDAGF 507
           +IGP+V +     G + G     +  SL  GL    + T+    YA GC D+  +  + F
Sbjct: 425 LIGPMVKDKINQNGEWAGRGEREESISLFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKF 483

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
            EA+  A++AD V+   G D     E   R  L LPG Q +L+  + +T K P+ L+L  
Sbjct: 484 AEAIATARRADIVLAAMGEDFNWSGEAACRTDLKLPGAQQALLKELKKTGK-PLGLILVN 542

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PLD+S+   D  +  IL   Y G      +A++I GD+NP  RL M+ +P +  ++P+
Sbjct: 543 GRPLDLSW--EDQHVDGILEAWYLGTMAGHGMADVISGDYNPSARLTMS-FPRTVGQLPL 599

Query: 628 NDMNMRADSSRQYPGRS--------YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
                +  + R  P  +        Y     T +Y FG+GLSYT ++             
Sbjct: 600 --YYNQKPTGRPVPPEAPDTDYKSRYMDVPNTPLYPFGYGLSYTTFA------------- 644

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
                             ++ + +D+ +     +  V   V N G VDG  VV ++ R  
Sbjct: 645 ------------------VNSMKLDQNSFTKGGKITVMAEVENTGKVDGETVVQMYIRDL 686

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
             S   P K+L GF++V   A   K++SF +D
Sbjct: 687 AGSVTRPVKELKGFEKVTLKAGEKKQVSFTID 718


>gi|315225249|ref|ZP_07867066.1| periplasmic beta-glucosidase [Capnocytophaga ochracea F0287]
 gi|420158631|ref|ZP_14665447.1| glycosyl hydrolase family 3, N-terminal domain protein
           [Capnocytophaga ochracea str. Holt 25]
 gi|314944932|gb|EFS96964.1| periplasmic beta-glucosidase [Capnocytophaga ochracea F0287]
 gi|394763447|gb|EJF45542.1| glycosyl hydrolase family 3, N-terminal domain protein
           [Capnocytophaga ochracea str. Holt 25]
          Length = 770

 Score =  242 bits (618), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 219/744 (29%), Positives = 330/744 (44%), Gaps = 115/744 (15%)

Query: 78  IQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNR 137
           I++L   A    RLGIP   +  + +HG                 T FP  L  + S++ 
Sbjct: 102 IRKLQKIAVEQTRLGIPIL-FGQDVIHGYK---------------TIFPIPLAESCSWDL 145

Query: 138 SLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAV 196
           +L        A EA A       G+ + +AP ++I RD RWGR  E  GEDP + S  A 
Sbjct: 146 ALMRKTTELAAREASA------DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAE 199

Query: 197 EFVKSFQG-ENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG 255
             VK FQG +NW++                             L AC KH   Y   + G
Sbjct: 200 ARVKGFQGGDNWQT-----------------------LSSPHTLLACGKHFAGYGAAESG 236

Query: 256 NFSRYSFN-AMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA- 313
                 +N A ++     + + PP+ + +  G  S IM S N++NGVPA     L  +  
Sbjct: 237 K----DYNTAELSMHTFRNVYLPPYEATLNAGVGS-IMASLNEINGVPATADKWLLTEVL 291

Query: 314 RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAG----MDINCGTCMLRHTQS 369
           R EWGF G + SD      I E   +    +D  A  L A     MD+N G   +++  +
Sbjct: 292 RKEWGFNGLLVSDYTG---INELVRHGVAKDDKQAANLSANAGIEMDMN-GATFIKYLSA 347

Query: 370 AIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAAR 429
            + +GKV E  ID+A+ ++  ++  LGLF+ DP +        ++  T E+ K+A  A  
Sbjct: 348 LVKEGKVTEAQIDKAVRHILEMKFLLGLFD-DPYRYLDETRAKENTFTEEYLKVARQAVA 406

Query: 430 QGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLRGL-EAY 486
             +VLLKN+ + LP+ K++  ++A+IGP++NN S + G +T  G      SL  GL E Y
Sbjct: 407 SSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLFTGLTEKY 466

Query: 487 VS---KTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
                K  YA GC     +++    EAV IA+KAD V+V  G       E   R  + LP
Sbjct: 467 KGTNVKLLYAEGCGFTTISTEQ-LKEAVAIARKADRVLVAVGEQSNWAGESAVRTDIRLP 525

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
             Q  L+ ++   +K P+ +V   G PLD+S+   +  + +IL   +PG  G   +A++I
Sbjct: 526 QAQRQLLEALKAINK-PIAIVTFSGRPLDLSW--ENENVQAILQAWFPGTQGGNGIADVI 582

Query: 604 FGDFNPGGRLPMT----------WYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV 653
            GD NP G L M+          +Y    T  P+   N   D    Y    Y   + T +
Sbjct: 583 AGDVNPSGHLTMSFPRNVGQIPIYYNYKSTGRPVYTNNEEVDHRPHYNA-GYLDSSITPL 641

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           Y FG+GLSYT ++   +    +     SLK  +D  I                       
Sbjct: 642 YPFGYGLSYTTFAISNVHLNKK-----SLKRYNDSII----------------------- 673

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPC 773
             V  SV N G  +G  VV L+ R    S   P K+L GF+++   A  SK++ F + P 
Sbjct: 674 --VNASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFEKISLKAGESKQVCFEL-PS 730

Query: 774 EQLSIANKHGRRILPLGNHVLMVG 797
           E L+    +G++       +L VG
Sbjct: 731 EALAFYGINGKKDTEPSECLLWVG 754


>gi|218131278|ref|ZP_03460082.1| hypothetical protein BACEGG_02889 [Bacteroides eggerthii DSM 20697]
 gi|217986495|gb|EEC52831.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 862

 Score =  242 bits (618), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 166/470 (35%), Positives = 239/470 (50%), Gaps = 66/470 (14%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C  P + +     TSL+   RA+ L+  LTL+EK+  + DN+ A+ RLGI  Y WW+E+L
Sbjct: 17  CSQPTYKT-----TSLAPEKRAELLLKELTLEEKVALMMDNSQAVERLGIKPYNWWNEAL 71

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA--- 160
           HG+A  G            T FPQ +  AASF+      + SAV+ EARA  N   A   
Sbjct: 72  HGVARAG----------LATVFPQPIGMAASFSPESVYEVFSAVSDEARAK-NANYASQG 120

Query: 161 ------GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGI 214
                 GLT W P +NI+RDPRWGRG ET GEDP + S   V  VK  QG N    DG  
Sbjct: 121 SYARYQGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQGTN----DG-- 174

Query: 215 GFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTED 273
                          + D+     L AC KH   +   +W   +R+SFN   ++ +D  +
Sbjct: 175 ---------------KYDK-----LHACAKHFAVHSGPEW---NRHSFNVEDLSARDLYE 211

Query: 274 TFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVAT 332
           T+ PPF + +++ K   +MC+YN   G P C    L  Q  RN+WGF G I SDC A+A 
Sbjct: 212 TYLPPFEALVKEAKVKEVMCAYNSFEGEPCCGSNRLLMQILRNDWGFDGIILSDCGAIAD 271

Query: 333 IFE---YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
            +    ++ Y      SAA VL  G D+ CG+   +    A+  G++ EKDID+A++ L 
Sbjct: 272 FYNEHGHKAYPDAESASAAAVLN-GTDLECGSS-YKALVKAVRAGQINEKDIDKAVMRLL 329

Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
             +  LG  + DP    + K+    V ++EH  LAL+ AR+ + LL+N    LPL +  +
Sbjct: 330 EARFALGEMD-DPDNVSWTKIPFSVVASAEHDSLALNMARKSMTLLQNTGNILPLKRGGL 388

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
            ++A++GP  N+     G Y G+P    ++L G+   +    K  Y  GC
Sbjct: 389 -TVAVMGPNANDSVMQWGNYNGMPPHTVTILGGIRKALGADDKLIYEQGC 437



 Score =  119 bits (298), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 96/318 (30%), Positives = 145/318 (45%), Gaps = 63/318 (19%)

Query: 498 DVPCNSDAGFHEAVRI------AKKADFVIVVAGLDLTQETEDR----------DRVSLL 541
           D   N D GF + V I       K AD V+  +G+    E E+           DR  + 
Sbjct: 574 DAQLNFDIGFKQKVDINASVARVKDADIVVFASGVSPVLEGEEMGVNLPGFKGGDRTDIE 633

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
           LP  Q  L++++ R  K+ +ILV   G P  ++         +IL   YPG+ G  A+AE
Sbjct: 634 LPAIQRELISALHRAGKK-IILVNCSGSP--IALEPETKNCEAILQAWYPGQQGGTAVAE 690

Query: 602 IIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
           ++FGD+NPGGRLP+T+Y  + +++P   D NM         GR+YR+ T   ++ FG+GL
Sbjct: 691 VLFGDYNPGGRLPVTFY-RNMSQLPDFEDYNMT--------GRTYRYMTQQPLFPFGYGL 741

Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           SYT + Y  LS   E      +K G                              + +SV
Sbjct: 742 SYTTFDYGKLSLDKE-----QIKQGEP--------------------------LKLAVSV 770

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSIA 779
           TN+G  DG  VV ++ +    ++G P K L  F RV+  A  S  + F +   E +   A
Sbjct: 771 TNSGQRDGEEVVQVYLKKQDDAEG-PGKTLRAFKRVYIPAGKSVNVEFDLKDKELEWWDA 829

Query: 780 NKHGRRILPLGNHVLMVG 797
             +  R+   G + +MVG
Sbjct: 830 QSNTMRVCA-GKYDIMVG 846


>gi|311031835|ref|ZP_07709925.1| glycoside hydrolase family 3 domain protein [Bacillus sp. m3-13]
          Length = 719

 Score =  242 bits (617), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 202/699 (28%), Positives = 334/699 (47%), Gaps = 92/699 (13%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQE 182
           T FP  L    S++  L        A EA    ++    +TF AP +++ RDPRWGR  E
Sbjct: 96  TIFPVPLAIGCSWDTELAEKSAEIAAKEA----SVSGVHVTF-APMVDLVRDPRWGRVME 150

Query: 183 TPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSAC 242
           + GEDP + S +A  FV+ FQGE+  +D           RV                +AC
Sbjct: 151 STGEDPYLNSVFAKAFVRGFQGEDLTNDT---------HRV----------------AAC 185

Query: 243 CKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
            KH  AY   + G      +N + ++E+   +++ P +++ +++G    +M ++N V+G+
Sbjct: 186 VKHFAAYGAPEGGR----DYNTVNMSERQLRESYLPAYKAALDEG-CEMVMTAFNTVDGI 240

Query: 302 PACLRGDLFQK-ARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG 360
           PA     L +   R EWGF G + SD  AV  +  +       E +  G L+AG+DI   
Sbjct: 241 PASGNKKLMRDLLREEWGFDGVMISDWGAVKELIPHGVAEDEREAALKG-LEAGVDIEMM 299

Query: 361 T-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE 419
           T C  ++ +  ++ G++ E  +D ++L + +++ +LGLF  +P +G   +L  + + ++E
Sbjct: 300 TACYAKNLKDLVESGELSEALVDESVLRILNLKNKLGLFE-NPYRGTSEELEKEVIMSAE 358

Query: 420 HKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMG-GGYTGIPCSPKS 478
           H+ +A + A +  VLLKND + LPLNK     +A+IGP   N   +G   + G   +   
Sbjct: 359 HRHVAKELAIKSSVLLKND-QVLPLNKK--QKIALIGPFAQNGDVLGPWSWQGSKEAAVQ 415

Query: 479 LLRGLEAYVSKTHY--ASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRD 536
           L  GL A  S+ +   A G  D+   +D    EAV  AK AD +++  G       E   
Sbjct: 416 LYDGLLAKTSEANLLVAQGS-DIEAITDEQLTEAVETAKGADVIVLALGEASEMSGEGGC 474

Query: 537 RVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FAEADSQISSILWIGYPGEA 594
           R  + LP  Q+ LV  + R   +PV+ VL  G PLD+    AE+D+ + +  W  YPG  
Sbjct: 475 RADIRLPEAQLKLVAKI-RELGKPVVTVLFNGRPLDLHGVIAESDALLEA--W--YPGTE 529

Query: 595 GAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM-----NDMNMRADSSRQYPGRSYRFYT 649
              A+A+++FGD NP GRL M+ +P S  ++P+     N    + ++ ++     Y    
Sbjct: 530 AGSAIADLLFGDANPSGRLTMS-FPYSVGQIPVYYNCFNTGRPQGENKQERYVTHYLDIP 588

Query: 650 GTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSC 709
              +  FG+GLSYT ++Y      S++++SA                       D +T  
Sbjct: 589 NEPLLPFGYGLSYTTFAY------SDMSLSA-----------------------DSLTEN 619

Query: 710 TSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFG 769
           +SL   V ++VTN G+  G   V  + R        P K+L G+++V      +K+I+F 
Sbjct: 620 SSL--DVTVNVTNTGERAGEETVQFYIRDLSGEIVRPLKELKGYEKVWLEPGETKQITFT 677

Query: 770 VDPCEQLSIANKHGRRILPLGNHVLMVGELRHSLTIETY 808
           +   EQL   +         G    M+G   + LTI+ +
Sbjct: 678 ITE-EQLRYYHSDLSFQSDKGKFHAMIGPNSNELTIKEF 715


>gi|375145746|ref|YP_005008187.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361059792|gb|AEV98783.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 866

 Score =  242 bits (617), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 152/446 (34%), Positives = 219/446 (49%), Gaps = 54/446 (12%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N  LS   RAK LI+ LTL+EK   + D + AIPRLGI  + WWSE+LHG+A+N   
Sbjct: 25  PYQNPDLSSEERAKDLITRLTLEEKASLMFDQSPAIPRLGIKKFNWWSEALHGLANN--- 81

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY---------NLGQAGLT 163
                    VT FP+ +  AASF+ +L   I  A + E RA Y         N     L+
Sbjct: 82  -------DKVTVFPEPVGMAASFDDTLVYKIFDATSDEVRAKYHDAIRNGKENRRFLSLS 134

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PNINIFRDPRWGRGQET GEDP + S   V  VK  QG               + R 
Sbjct: 135 VWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGP-------------ADARY 181

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNA-MITEQDTEDTFQPPFRSC 282
            K             L AC KH   +   +W   SR++ N   +  +D  +T+ P F+S 
Sbjct: 182 RK-------------LLACAKHFAVHSGPEW---SRHTLNLYQVRPRDLYETYLPAFKSL 225

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           + +     +MC+Y +++  P C    L Q   R+ WGFK  + SDC A+   F     + 
Sbjct: 226 VMEADVRQVMCAYQRLDDEPCCGNNRLLQTILRDNWGFKHVVVSDCGAITDFFTSHKVSS 285

Query: 342 THEDSAAGVLKAGMDINC--GTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN 399
               ++A    AG D+ C       ++   A+ +G + EK+ID+ LL + + +  LG  +
Sbjct: 286 DAVHASAKGALAGTDVECVWEGYAFKNLPEAVSRGLITEKEIDKHLLRVLAGRFDLGEMD 345

Query: 400 GDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLV 459
            D     +  +    V    H+KLALD A++ + LL+N    LPL +NA   +A+IGP  
Sbjct: 346 DDAVV-PWAAIPMSVVNNEAHRKLALDMAQESMTLLQNRNNILPLKRNA-KKIAVIGPNA 403

Query: 460 NNISQMGGGYTGIPCSPKSLLRGLEA 485
           +N   + G Y G P    +++ G+ A
Sbjct: 404 DNDPVLWGNYNGRPVRTINIVSGIRA 429



 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 73/304 (24%), Positives = 124/304 (40%), Gaps = 54/304 (17%)

Query: 504 DAGFHEAVRIAKKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSV 553
           D  F   +   K  D V+ V GL    E E+           DR  + LP  Q + + ++
Sbjct: 591 DIDFSGLISKLKGIDEVVFVGGLSTLLEGEEMPVSYPGFKGGDRTDIQLPEVQRNCLKAL 650

Query: 554 ARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRL 613
               K+ + +  +G     + F        +IL   Y GE+G +A+A+++FGD+NP G+L
Sbjct: 651 KAAGKKVIFVNCSGSA---IGFVPETESCDAILQAWYGGESGGQAVADVLFGDYNPSGKL 707

Query: 614 PMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAP 673
           P+T+Y ++       D +++        GR+YRF +   ++ FG GLSYT +S       
Sbjct: 708 PVTFYKDTTQLAGFEDYSLK--------GRTYRFMS-DPLFPFGFGLSYTTFS------- 751

Query: 674 SELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVM 733
                                   +    + +     S    + I VTN G  +G+ +V 
Sbjct: 752 ------------------------IGTAQLSKTNISNSESIELSIPVTNTGKRNGTEIVQ 787

Query: 734 LFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHV 793
           ++ R    S G P K + GF R+   A  +      + P        K G+  +  G + 
Sbjct: 788 VYVRKTGDSDG-PLKTMRGFKRIAIPAGKTTTAVINLPPAAFEFYDEKFGKMTVAAGEYE 846

Query: 794 LMVG 797
           L+ G
Sbjct: 847 LLYG 850


>gi|317474221|ref|ZP_07933497.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909531|gb|EFV31209.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 786

 Score =  242 bits (617), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 218/745 (29%), Positives = 342/745 (45%), Gaps = 129/745 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L S I    A 
Sbjct: 137 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKELISEIARVTAE 184

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E  GEDP +V                 
Sbjct: 185 EAKAL------GYTNIYSPILDIAQDPRWGRVVECYGEDPFLV----------------- 221

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
              G +G     KR++KG   E        L A  KH   Y +   G  +    +  +  
Sbjct: 222 ---GELG-----KRMIKGLQAEG-------LVATPKHFAVYSVPVGGRDAGTRTDPHVAP 266

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
           ++    +  PFR    +  A  +M SYN  +G P  + G    L +  R+EWGFKGY+ S
Sbjct: 267 REMRTLYLEPFRKAFCEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRHEWGFKGYVVS 324

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM-------LRHTQSAIDKGKVQE 378
           D +AV  ++          D AA V+ AG+++     +       LRH   AI +GK+  
Sbjct: 325 DSEAVEFLYSKHKVAADAVDGAAQVINAGLNVRTNFSLPENFIYPLRH---AISEGKISM 381

Query: 379 KDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND 438
           + ID  + ++  V+  +GLF+ +P KG   KL     C  EH+ +++ AA + IVLLKN+
Sbjct: 382 QTIDSRVADVLRVKFMMGLFD-NPYKGDTKKLEKVVHC-KEHQNVSMRAALESIVLLKNE 439

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
              LPL+K ++  +A+IGP    + ++   Y       K++ +G++ Y+  S+  YA G 
Sbjct: 440 NNILPLSK-SIKKVAVIGPNAAEVDKLICRYGPANAPIKTVYQGIKDYLPDSEVRYAKGA 498

Query: 497 ------------HDVPCN--SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       +DVP N   +    EAV +AK++D  I+V G +     E+  R +L L
Sbjct: 499 DIIDEYFPESELYDVPLNKTEETMIDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDL 558

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PVILV+  G    +++AE    I  I+   +PGE    A+A++
Sbjct: 559 CGRQEKLLQAVYATGK-PVILVMIDGRAATINWAE--RYIPGIVHAWFPGEFMGDAIAKV 615

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
           IFGD+NPGG+L +T +P S  ++P        +DS      + +   TG+ +Y FG+GLS
Sbjct: 616 IFGDYNPGGKLAVT-FPRSVGQIPFAFPFKPGSDS------KGFVRVTGS-LYPFGYGLS 667

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT + Y  L   + +             I  Q   +L         SC          V 
Sbjct: 668 YTTFDYSALKIENPV-------------IGTQGSVKL---------SC---------RVK 696

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G + G  VV L+      S  T  K L GF+R+H      K + F + P + + + NK
Sbjct: 697 NTGKIAGDEVVQLYLHDEVSSVTTYVKVLRGFERIHLQPGEEKVVDFVLTP-QDMGLWNK 755

Query: 782 HGRRILPLGNHVLMVGELRHSLTIE 806
               I+  G   +MVG     + ++
Sbjct: 756 DNHFIVEPGTFAVMVGSSSQDIRLQ 780


>gi|160891510|ref|ZP_02072513.1| hypothetical protein BACUNI_03961 [Bacteroides uniformis ATCC 8492]
 gi|156858917|gb|EDO52348.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           uniformis ATCC 8492]
          Length = 756

 Score =  242 bits (617), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 184/619 (29%), Positives = 295/619 (47%), Gaps = 81/619 (13%)

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           +AP ++I RD RWGR  E  GEDP + S  +   V+ FQGE                   
Sbjct: 161 FAPMVDISRDARWGRVMEGSGEDPYLGSLLSAARVRGFQGEK------------------ 202

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
               E+  R D+++  AC KH  AY   + G      +N   ++E+   D + PPF++  
Sbjct: 203 ---PEDLMRLDKML--ACAKHFCAYGAAEAGR----DYNTTDVSERSLRDIYFPPFKAAK 253

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           + G A+  M ++N+++GVP      L+Q   R+EW F G++ +D  A+  +  +    + 
Sbjct: 254 DAGVAT-FMTAFNEISGVPCTSSKFLYQDVLRDEWRFNGFVVTDYTAINELVPH-GVARD 311

Query: 343 HEDSAAGVLKAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
              +A     AG++++  G     H   A+ +GKV E+ ID A+  +  ++  LG+ + D
Sbjct: 312 EAHAAELAANAGIEMDMTGGVFHAHLLQAVKEGKVNEETIDNAVRRILEMKFLLGIMD-D 370

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           P +    +     +   E  + A DAAR+ +VLLKN+  F P+  +   ++A+IGP+V  
Sbjct: 371 PYRYLNEEREKATIMKPEFLEAARDAARKSVVLLKNENDFFPIQPSERKTVALIGPMVKE 430

Query: 462 ISQMGGGYTGIPCSPKS--LLRGLEAYVSKTH----YASGCHDVPCNSDAGFHEAVRIAK 515
            + + GG+ G     +S  L  GLE     ++    YA GC D+     AGF +AV +A+
Sbjct: 431 RNSVNGGWGGRGDRQRSVTLFEGLEKKYGNSNVRFLYAEGC-DLRKPGTAGFAQAVSVAR 489

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           +AD ++V AG D     E   R  + LP  Q  L+  + +T K P+ LVL  G PL++++
Sbjct: 490 QADVILVAAGEDQNWSAEAACRTDITLPASQRDLLKELKKTGK-PIGLVLMNGRPLELTW 548

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMN-- 631
              D  + +IL   YPG  G  A+A++I GD+NP G+L M+ +P S  ++P+  N  N  
Sbjct: 549 --EDENMDAILEAWYPGTMGGHAIADVIAGDYNPAGKLTMS-FPRSVGQLPLYYNHKNTG 605

Query: 632 --MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
             +  D+ +     SY     + +Y FG+GLSYT++    L    E      LK G    
Sbjct: 606 RPLPPDNPKMDYKSSYIDCPNSPLYPFGYGLSYTSFEVDNLKLDKE-----ELKKGE--- 657

Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
                                     V + V N G V G  VV L+ R    S   P K+
Sbjct: 658 -----------------------TLTVTVDVANIGKVGGEEVVQLYIRDLVGSVTRPVKE 694

Query: 750 LIGFDRVHTVAKGSKEISF 768
           L GF +++  A   K ++F
Sbjct: 695 LKGFQKLYLKAGEKKSLTF 713


>gi|393782348|ref|ZP_10370533.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673619|gb|EIY67078.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
           CL02T12C01]
          Length = 852

 Score =  242 bits (617), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 157/441 (35%), Positives = 222/441 (50%), Gaps = 57/441 (12%)

Query: 64  RAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVT 123
           R   L+S LT++EKI  L ++A  IPRL I  Y   +E+LHGI   G            T
Sbjct: 40  RLLDLLSRLTIEEKISLLVNDAREIPRLNIDKYYHGNEALHGIVRPG----------EFT 89

Query: 124 SFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAG----------LTFWAPNINIFR 173
            FPQ +  AA++N  L   + SA++ EAR  +     G          LTFW+P +N+ R
Sbjct: 90  VFPQAIGLAATWNPGLIFEVSSAISDEARGRWKELDYGKKQIAGASDLLTFWSPTVNMAR 149

Query: 174 DPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDR 233
           DPRWGR  ET GEDP +      EFVK  QG++               R LK        
Sbjct: 150 DPRWGRTPETYGEDPFLTGVIGCEFVKGLQGDH--------------PRYLKTV------ 189

Query: 234 GDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMC 293
                  +  KH  A + E     +R S NA ++E+D  + + P F  CI   KA  IM 
Sbjct: 190 -------STPKHFAANNEEH----NRSSCNARMSERDLREFYLPSFERCIVDAKAQSIMM 238

Query: 294 SYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLK 352
           +YN VNGVP  +   L +   R +WGF GYI SDC A   +     Y +  + +A   +K
Sbjct: 239 AYNAVNGVPCTVNTYLIKNVLRGDWGFNGYIVSDCSAPEWMVTKHKYVRDLDAAATLAIK 298

Query: 353 AGMDINCGTCMLRH-TQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLG 411
           AG+D+ CG  +       A ++  V + DID A   +   ++ LGLF+ DP +  Y ++ 
Sbjct: 299 AGLDLECGDRVYTAPLLKAYNESMVSKADIDSAAYRVLRGRMLLGLFD-DPSQNPYNQIE 357

Query: 412 PDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG 471
           P  +   +H++LAL+ ARQ +VLLKN K FLPLN   V S+A++G  +N      G Y+G
Sbjct: 358 PSVIGCKKHQELALETARQSMVLLKNQKNFLPLNLKKVKSIAVVG--INAGHCEFGDYSG 415

Query: 472 IP-CSPKSLLRGLEAYVSKTH 491
           IP  +P S+L G+  Y  K +
Sbjct: 416 IPKNAPVSVLDGIRKYAEKAN 436



 Score =  140 bits (353), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 90/303 (29%), Positives = 148/303 (48%), Gaps = 48/303 (15%)

Query: 505 AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           A F +A + AK+ D  + V G++ + E E +DR SL LP  Q   +  + + +   V+ V
Sbjct: 592 ALFGDAGKAAKECDVTVAVLGINKSIEREGQDRYSLELPTDQQEFIRELYKVNPNTVV-V 650

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
           L  G  L +++   D  + +IL   YPGE G  A+AE++FGD+NPGGRLP+T+Y  S  +
Sbjct: 651 LVAGSSLAINWI--DENVPAILNAWYPGEQGGTAIAEVLFGDYNPGGRLPLTYY-NSLDE 707

Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
           +P  D       +     R+Y+++ G  +Y FG+GLSYT ++YK                
Sbjct: 708 LPSFD-------NYSVQNRTYQYFKGKPLYEFGYGLSYTKFNYK---------------- 744

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
              KN+                 S  +    +   V+NAG  DG  V  ++ + P+    
Sbjct: 745 --KKNV-----------------SIANDTIDITFKVSNAGKYDGDEVAQVYVQYPETGTY 785

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRR-ILPLGNHVLMVGELRHSL 803
            P KQL GF RVH     S +++  V P ++L   ++  R+ + P G +V ++G     +
Sbjct: 786 MPLKQLRGFSRVHIKKGKSADVTISV-PKKELRYWDEKTRQFVTPEGKYVFLIGSSSEDI 844

Query: 804 TIE 806
            ++
Sbjct: 845 KLQ 847


>gi|334365132|ref|ZP_08514098.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
           sp. HGB5]
 gi|313158675|gb|EFR58064.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
           sp. HGB5]
          Length = 771

 Score =  242 bits (617), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 205/744 (27%), Positives = 323/744 (43%), Gaps = 123/744 (16%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T+FP     A+++N  L   +G  +A 
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------ATTFPTAPGQASTWNPELIERMGKVIAA 167

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R      Q G   + P ++I RDPRW R +E+ GED  + +     +V+         
Sbjct: 168 EIRL-----QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVR--------- 213

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
              G G G              D        +  KH IAY   + G   +   + ++ E+
Sbjct: 214 ---GTGSG--------------DLSQSRHALSTLKHFIAYGASEGG---QNGGSNLLGER 253

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACL-RGDLFQKARNEWGFKGYITSDCD 328
           +  +T+ PPF + ++ G A  +M +YN V+G+P    R  L    R EWGF G++ SD  
Sbjct: 254 ELRETYLPPFEAAVKAG-ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLL 312

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  + E      +  ++A   L+AG+D +         + A + G V E +IDRA+  +
Sbjct: 313 SIEGLHETHGVAGSVREAAVQALRAGVDADLKGGAFASLREAAEAGDVAEAEIDRAVERV 372

Query: 389 FSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNA 448
            +++  +GLF  +P      +    +V  + H +LAL+AARQ + LL+N    LPL+   
Sbjct: 373 LALKFEMGLFE-NP---YIDEAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRR 428

Query: 449 VSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSKTH--YASGCHDVPCNSDAG 506
           +  +A+IGP  +NI    G YT    +  ++  GLE  + +    Y+ GC  V     + 
Sbjct: 429 LRRVAVIGPNADNIYNQLGDYTAQQTAANTVRDGLEKLLGRDRVVYSRGC-TVRGGDRSE 487

Query: 507 FHEAVRIAKKADFVIVVAG----LDLTQE-------------------TEDRDRVSLLLP 543
              AV  A+  D  +VV G     D   E                    E  DR +L L 
Sbjct: 488 IAAAVSAARGTDAAVVVIGGSSARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALL 547

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
           G+Q  L+  +  T   P+I+V   G PLD+    A  Q  ++L   YPG  G  A+AE I
Sbjct: 548 GEQEELLRRIKATGT-PLIVVCIAGRPLDLR--RASEQADALLMAWYPGARGGDAVAETI 604

Query: 604 FGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
            G  NP GRLP+T  P +  ++P+     R  +        Y   T   +Y FG+GLSY+
Sbjct: 605 LGRNNPAGRLPIT-IPRAEGQIPVYYNKKRPAN------HDYTDLTAAPLYPFGYGLSYS 657

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            + Y  L A             S  N+L+               SC          + N 
Sbjct: 658 TFEYGSLEARQ-----------SGDNVLE--------------VSC---------RIRNT 683

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
            D +G  VV L+      S   P +QL GF R+       +++SF +   E L++ +  G
Sbjct: 684 SDREGDEVVQLYISDMVASTVRPPRQLGGFRRIRLAPGEQRQVSFTLGD-EALALIDPQG 742

Query: 784 RRILPLGNHVLMVGELRHSLTIET 807
           RR++  G+ V+ VG     + ++T
Sbjct: 743 RRVVEKGDFVIAVGSSSQDIRLQT 766


>gi|332982620|ref|YP_004464061.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
 gi|332700298|gb|AEE97239.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
           50-1 BON]
          Length = 753

 Score =  242 bits (617), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 216/744 (29%), Positives = 345/744 (46%), Gaps = 131/744 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIPA     E L G  ++G            T FPQ +  A++++      +   +  
Sbjct: 89  RLGIPAI-VHEECLSGFMADG-----------ATVFPQAIGLASTWDAEAIEAMAGVIRQ 136

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           + +A       GL+   P +++ RDPRWGR +ET GEDP +V++ AV +V+  QG++   
Sbjct: 137 QMKAAG--AHQGLS---PVLDVARDPRWGRVEETFGEDPYLVASMAVSYVRGLQGQD--- 188

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
                        + KG            + A  KH   +   + G   R      + E+
Sbjct: 189 -------------LTKG------------IFATLKHFAGHSFSEGG---RNCAPVHVGER 220

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVP-ACLRGDLFQKARNEWGFKGYITSDCD 328
           +  D F  PF + + +  A  +M +Y+ ++GVP A  R  L    R  +GF G + SD D
Sbjct: 221 ELWDIFLFPFEAAVREANAKSVMNAYHDIDGVPCAASRELLTDILRGHFGFDGIVVSDYD 280

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGT--CMLRHTQSAIDKGKVQEKDIDRALL 386
           A+  + +        +++A   L+AG+DI      C  +    A+ +G + E  I+ ++ 
Sbjct: 281 AIDRLRKAHFTAGNKKEAAVQALEAGIDIELPKMDCYGQPLMDAVKEGMISEATINESVE 340

Query: 387 NLFSVQLRLGLFNG-----DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKF 441
            + + +  LGLF+G     D   G +         T E ++++ D AR+ IVLLKND   
Sbjct: 341 RVLTAKFELGLFDGVYVDVDSVPGLFE--------TPEQREMSRDIARKSIVLLKNDN-V 391

Query: 442 LPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP--------CSPKSLLRGLEAYV---SKT 490
           LPL+K+ + S+A+IGP  +N   M G Y  +             ++L G++  V    + 
Sbjct: 392 LPLSKD-IKSIAVIGPNADNARNMLGDYAFMAHRSYDKTSVHIVTVLEGIKNKVLDSCRI 450

Query: 491 HYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAG-----LDLTQETEDRDRVSLLLPGQ 545
            YA GC  +  ++D GF EAV  A+ AD  IVV G            E+ DR  + LPG 
Sbjct: 451 TYAKGCDIIDPSTD-GFVEAVNAARAADAAIVVVGDNSGIFGKGTSGENDDRTDITLPGV 509

Query: 546 QMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFG 605
           QM LV ++  T K PVI+VL  G     +  E     S+++   YPGE G  A+A+++FG
Sbjct: 510 QMQLVKAIKDTGK-PVIVVLINGRAF--AAKELADNASALMEAWYPGEEGGNAVADVLFG 566

Query: 606 DFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQV---YGFGHGLSY 662
           D+NP GRLP++  P    ++P+N  N++          SY  Y  T+    + FG+G+SY
Sbjct: 567 DYNPAGRLPISL-PCEVGQIPIN-YNLKP--------ASYINYLSTETKPAFAFGYGMSY 616

Query: 663 TNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTN 722
           T + Y      S+L+I+ ++   + K                           +   VTN
Sbjct: 617 TTFGY------SDLSITPAVAPSAGK-------------------------VDISFKVTN 645

Query: 723 AGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           AG + G  VV L+ R    S   P K+L GF RV+     +KEI+F +   +QL+  +K 
Sbjct: 646 AGQLAGDEVVQLYIRDEVSSIVRPVKELKGFKRVNLQPGETKEITFTL-YADQLAFHDKD 704

Query: 783 GRRILPLGNHVLMVGELRHSLTIE 806
            R ++  G   +MVG     + +E
Sbjct: 705 MRLVVEPGTFKIMVGSSSDDIRLE 728


>gi|300726322|ref|ZP_07059774.1| beta-xylosidase B [Prevotella bryantii B14]
 gi|291292284|gb|ADD92014.1| Xyl3A [Prevotella bryantii B14]
 gi|299776347|gb|EFI72905.1| beta-xylosidase B [Prevotella bryantii B14]
          Length = 885

 Score =  242 bits (617), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 153/456 (33%), Positives = 225/456 (49%), Gaps = 52/456 (11%)

Query: 53  PFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPG 112
           P+ N +LS   RA  L   LTL+EK   + D + AIPRLGI  + WWSE+LHG A+ G  
Sbjct: 49  PYQNPNLSAYERAIDLCHRLTLEEKALLMQDESPAIPRLGIKKFFWWSEALHGAANMG-- 106

Query: 113 VNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMY-----NLGQ----AGLT 163
                   +VT+FP+ +  A+SFN +L  ++ SA + E RA Y     N G+      L+
Sbjct: 107 --------NVTNFPEPIAMASSFNPTLLKSVFSAASDEMRAQYHHRMDNGGEDEKFHSLS 158

Query: 164 FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRV 223
            W PN+NIFRDPRWGRGQET GEDP + S      V+  QG                   
Sbjct: 159 VWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGCAVVEGLQGP------------------ 200

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSC 282
                 ES +  +L   AC KH   +   +    +R++ N   I+ +D  +T+ P F+S 
Sbjct: 201 ------ESSKYRKLW--ACAKHFAVHSGPE---STRHTANLNNISPRDLYETYLPAFQST 249

Query: 283 IEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTK 341
           ++ G    +MC+Y +++  P C    L Q+  R EWGFK  + SDC AV+ I++    + 
Sbjct: 250 VQDGHVREVMCAYQRLDDEPCCSNNRLLQQILREEWGFKYLVVSDCGAVSDIWQSHKTSS 309

Query: 342 THEDSAAGVLKAGMDINCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNG 400
               ++     AG D+ CG          A+ +G + E++ID+ ++ L   +  LG  + 
Sbjct: 310 DAVHASRQATLAGTDVECGYGYTYAKIPEAVKRGLLTEEEIDKHVIRLLEGRFDLGEMD- 368

Query: 401 DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVN 460
           D +  ++ K+    +    H +LALD ARQ IVLL+N    LPL       +A+IGP  +
Sbjct: 369 DSKLVEWSKIPYSIMSCKAHAQLALDMARQSIVLLQNKGNILPLQLKKNERIAVIGPNAD 428

Query: 461 NISQMGGGYTGIPCSPKSLLRGLEAYVSKTHYASGC 496
           N   M G Y G P    S+L G+        Y   C
Sbjct: 429 NKPMMWGNYNGTPNHTVSILEGIRKQYKNVVYLPAC 464



 Score = 76.3 bits (186), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 69/294 (23%), Positives = 127/294 (43%), Gaps = 59/294 (20%)

Query: 515 KKADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILV 564
           K  D VI V G+  + E E+           DR  + +P  Q   + ++A   K+ +++ 
Sbjct: 624 KGIDKVIFVGGIAPSLEGEEMPVNIPGFKGGDRTDIEMPQVQRDFIKALAEAGKQIILVN 683

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
            +G     ++      +  +I+   YPG+ G  A+A+I+ G  NP G+LP+T+Y  +   
Sbjct: 684 CSGSA---IALTPEAQRCQAIIQAWYPGQEGGTAVADILMGKVNPMGKLPVTFYKSTQQL 740

Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKA 684
               D +M+         R+YR++    +Y FG+GLSYT                 S + 
Sbjct: 741 PDFEDYSMK--------NRTYRYFEDA-LYPFGYGLSYT-----------------SFEI 774

Query: 685 GSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQG 744
           G+ K                 + + T+    +QI VTN G  +G+ +V ++ R     +G
Sbjct: 775 GTAK-----------------LQTLTNNSITLQIPVTNTGKREGTELVQVYLRRDDDVEG 817

Query: 745 TPEKQLIGFDRVHTVAKGSKEISFGVDPCE-QLSIANKHGRRILPLGNHVLMVG 797
            P K L  F  +   A  +K+    ++  + +   A+ +  R++P G + +  G
Sbjct: 818 -PSKTLRSFAHITLKAGETKKAILKLNRNQFECWDASTNTMRVIP-GKYTIFYG 869


>gi|317478381|ref|ZP_07937545.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
 gi|316905540|gb|EFV27330.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
          Length = 756

 Score =  242 bits (617), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 184/619 (29%), Positives = 295/619 (47%), Gaps = 81/619 (13%)

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVL 224
           +AP ++I RD RWGR  E  GEDP + S  +   V+ FQGE                   
Sbjct: 161 FAPMVDISRDARWGRVMEGSGEDPYLGSLLSAARVRGFQGEK------------------ 202

Query: 225 KGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPFRSCI 283
               E+  R D+++  AC KH  AY   + G      +N   ++E+   D + PPF++  
Sbjct: 203 ---PEDLMRLDKML--ACAKHFCAYGAAEAGR----DYNTTDVSERSLRDIYFPPFKAAK 253

Query: 284 EQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKT 342
           + G A+  M ++N+++GVP      L+Q   R+EW F G++ +D  A+  +  +    + 
Sbjct: 254 DAGVAT-FMTAFNEISGVPCTSSKFLYQDVLRDEWRFNGFVVTDYTAINELVPH-GVARD 311

Query: 343 HEDSAAGVLKAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGD 401
              +A     AG++++  G     H   A+ +GKV E+ ID A+  +  ++  LG+ + D
Sbjct: 312 EAHAAELAANAGIEMDMTGGVFHAHLLQAVKEGKVNEETIDNAVRRILEMKFLLGIMD-D 370

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
           P +    +     +   E  + A DAAR+ +VLLKN+  F P+  +   ++A+IGP+V  
Sbjct: 371 PYRYLNEEREKATIMKPEFLEAARDAARKSVVLLKNENNFFPIQPSERKTVALIGPMVKE 430

Query: 462 ISQMGGGYTGIPCSPKS--LLRGLEAYVSKTH----YASGCHDVPCNSDAGFHEAVRIAK 515
            + + GG+ G     +S  L  GLE     ++    YA GC D+     AGF +AV +A+
Sbjct: 431 RNSVNGGWGGRGDRQRSVTLFEGLEKKYGNSNVRFLYAEGC-DLRKPGTAGFAQAVSVAR 489

Query: 516 KADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSF 575
           +AD ++V AG D     E   R  + LP  Q  L+  + +T K P+ LVL  G PL++++
Sbjct: 490 QADVILVAAGEDQNWSAEAACRTDITLPASQRDLLKELKKTGK-PIGLVLMNGRPLELTW 548

Query: 576 AEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMN-- 631
              D  + +IL   YPG  G  A+A++I GD+NP G+L M+ +P S  ++P+  N  N  
Sbjct: 549 --EDENMDAILEAWYPGTMGGHAIADVIAGDYNPAGKLTMS-FPRSVGQLPLYYNHKNTG 605

Query: 632 --MRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKN 689
             +  D+ +     SY     + +Y FG+GLSYT++    L    E      LK G    
Sbjct: 606 RPLPPDNPKMDYKSSYIDCPNSPLYPFGYGLSYTSFEVDNLKLDKE-----ELKKGE--- 657

Query: 690 ILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQ 749
                                     V + V N G V G  VV L+ R    S   P K+
Sbjct: 658 -----------------------TLTVTVDVANIGKVGGEEVVQLYIRDLVGSVTRPVKE 694

Query: 750 LIGFDRVHTVAKGSKEISF 768
           L GF +++  A   K ++F
Sbjct: 695 LKGFQKLYLKAGEKKSLTF 713


>gi|167765233|ref|ZP_02437346.1| hypothetical protein BACSTE_03621 [Bacteroides stercoris ATCC
           43183]
 gi|167696861|gb|EDS13440.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           stercoris ATCC 43183]
          Length = 818

 Score =  241 bits (616), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 209/777 (26%), Positives = 342/777 (44%), Gaps = 141/777 (18%)

Query: 52  YPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGP 111
           YPF N + +++T  +  +                    RLGIP  ++ +E +HG+     
Sbjct: 136 YPFSNHAEAVNTVQRWFVEET-----------------RLGIPV-DFTNEGIHGLNH--- 174

Query: 112 GVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT-FWAPNIN 170
                   +  T  P  +   +++N+ L    G     EA+A+      G T  +AP ++
Sbjct: 175 --------TKATPLPAPIAIGSTWNKELVRRAGVIAGQEAKAL------GYTNVYAPILD 220

Query: 171 IFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEE 230
           I RDPRWGR  E  GE+P +++A   E V   Q +                         
Sbjct: 221 IVRDPRWGRTLECYGEEPYLIAALGTEMVNGIQSQG------------------------ 256

Query: 231 SDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASC 290
                   ++A  KH   Y + K G       +  +  ++  + F  PF+  I+      
Sbjct: 257 --------VAATLKHYAVYSVPKGGRDGNCRTDPHVAPRELHELFLYPFKKVIQNSHPMG 308

Query: 291 IMCSYNQVNGVPACLRGD-LFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAG 349
           +M SYN  +GVP       L +  R E+GF GY+ SD +AV  +    +   T++++   
Sbjct: 309 VMSSYNDWDGVPVSASYYFLTELLREEYGFDGYVVSDSEAVEFVESKHHVADTYDEAVRQ 368

Query: 350 VLKAGMDINCG----TCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFN----GD 401
           VL+AG+++       +  +   +  +++ K+    ID+ +  +  V+ RLGLF+     D
Sbjct: 369 VLEAGLNVRTHFTPPSDFILPIRRLLEEKKISMAVIDKRVSEVLRVKFRLGLFDQPYVAD 428

Query: 402 PRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNN 461
            +     ++G  D      K++     +Q +VLLKN+   LPL+K  +  + + GPL + 
Sbjct: 429 TKAAD--RVGGADRNMDFVKQMQ----QQALVLLKNENNILPLDKRQIKKVLVTGPLADE 482

Query: 462 ISQMGGGYTGIPCSPKSLLRGLEAY---VSKTHYASGCHDVPCNSDA------------- 505
            + M   Y        ++L GL  Y   +++  YA GC  V     A             
Sbjct: 483 DNFMTSRYGPNGLETVTVLAGLRNYLKGIAEVDYAKGCDIVDAGWPATEILPAPMSEQEK 542

Query: 506 -GFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILV 564
            G  EAV  A ++D +I V G D  +  E R R SL LPG+Q  L+ ++  T K PVILV
Sbjct: 543 QGIAEAVAKAGESDVIIAVLGEDEYRTGESRSRTSLDLPGRQQQLLEALHATGK-PVILV 601

Query: 565 LTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTK 624
           L  G PL V++A A   I +IL   +PG  G   +AE +FG+ NPGG+L +T +P+S  +
Sbjct: 602 LINGQPLTVNWANA--YIPAILESWFPGCQGGTVIAETLFGEHNPGGKLTVT-FPKSVGQ 658

Query: 625 VPMNDMNMRADSSRQYPGRSYRFYTGTQV----YGFGHGLSYTNYSYKFLSAPSELTISA 680
           + +N    +  S    P         T++    Y FG GLSYT ++Y      S+L +S 
Sbjct: 659 IELN-FPFKPGSHGAQPHSGPNGSGATRIIGELYPFGFGLSYTTFAY------SDLEVSP 711

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
                     LQQ                T   + ++++VTN G   G  VV L+ R   
Sbjct: 712 ----------LQQH---------------TQGEYTIKVNVTNTGKRAGDEVVQLYVRDKV 746

Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            S  T + QL GF+RV      +++++F + P E L I +++    +  G   +M+G
Sbjct: 747 SSVITYDSQLRGFERVSLQPGETRQVTFSLKP-EDLQILDRNMNWTVEPGEFEVMIG 802


>gi|255013061|ref|ZP_05285187.1| beta-glucosidase [Bacteroides sp. 2_1_7]
 gi|410102523|ref|ZP_11297449.1| hypothetical protein HMPREF0999_01221 [Parabacteroides sp. D25]
 gi|409238595|gb|EKN31386.1| hypothetical protein HMPREF0999_01221 [Parabacteroides sp. D25]
          Length = 758

 Score =  241 bits (616), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 191/632 (30%), Positives = 298/632 (47%), Gaps = 100/632 (15%)

Query: 165 WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGEN-WKSDDGGIGFGFREKRV 223
           +AP ++I RD RWGR  E  GEDP + S  A   V+ FQG N W+S              
Sbjct: 162 FAPMVDISRDARWGRVMEGAGEDPYLGSLIAKARVEGFQGGNDWRS-------------- 207

Query: 224 LKGFGEESDRGDELMLSACCKHLIAYDLEKWG---NFSRYSFNAMITEQDTEDTFQPPFR 280
                      D   + ACCKH  AY   + G   N S  S N ++      + + PP+ 
Sbjct: 208 ---------LADVNTVLACCKHFAAYGAAEAGRDYNTSELSQNTLM------NYYMPPYL 252

Query: 281 SCIEQGKASCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEY 336
           +  E G A+  M S+N++NGVP+     L  DL    R +WGF G++ +D      I E 
Sbjct: 253 AAKEAGVAT-FMASFNEINGVPSTGNKWLMTDLL---REDWGFNGFVVTDYTG---INEM 305

Query: 337 QNYTKTHEDSAAGVL--KAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQL 393
             ++    D  AG L   AG+D++  G    ++   ++ +GKV E++IDRA+ ++  ++ 
Sbjct: 306 VAHSIVRNDKEAGELAANAGIDMDMTGGIYSQYLVQSVKEGKVSEENIDRAVASILEMKF 365

Query: 394 RLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLA 453
            LGLF+ DP +    +   + +   E  + A + + + IVLLKND  F P++K+   ++A
Sbjct: 366 LLGLFD-DPYRYLDNEREKNTIMKPEFLQEARETSARSIVLLKNDNNFFPISKDKHITVA 424

Query: 454 IIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEAYVSKTH----YASGCHDVPCNSDAGF 507
           +IGP+V +     G + G     +  SL  GL    + T+    YA GC D+  +  + F
Sbjct: 425 LIGPMVKDKINQNGEWAGRGEREESISLFEGLTEKYAGTNVKFIYAEGC-DLLTDDSSKF 483

Query: 508 HEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTG 567
            EA+  A++AD V+   G D     E   R  L LPG Q +L+  + +T K P+ L+L  
Sbjct: 484 AEAIATARRADIVLAAMGEDFNWSGEAACRTDLKLPGAQQALLKELKKTGK-PLGLILVN 542

Query: 568 GGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM 627
           G PLD+S+   D  +  IL   Y G      +A++I GD+NP  RL M+ +P +  ++P+
Sbjct: 543 GRPLDLSW--EDQHVDGILEAWYLGTMAGHGMADVISGDYNPSARLTMS-FPRTVGQLPL 599

Query: 628 NDMNMRADSSRQYPGRS--------YRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTIS 679
                +  + R  P  +        Y     T +Y FG+GLSYT ++             
Sbjct: 600 --YYNQKPTGRPVPPEAPDTDYKSRYMDVPNTPLYPFGYGLSYTTFAVN----------- 646

Query: 680 ASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVP 739
            S+K   D+N   + G                 +  V   V N G VDG  V+ ++ R  
Sbjct: 647 -SMKL--DQNSFTKGG-----------------KITVTAEVENTGKVDGETVIQMYIRDL 686

Query: 740 KVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
             S   P K+L GF++V   A   K++SF +D
Sbjct: 687 AGSVTRPVKELKGFEKVTLKAGEKKQVSFTID 718


>gi|409198859|ref|ZP_11227522.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
           [Marinilabilia salmonicolor JCM 21150]
          Length = 777

 Score =  241 bits (616), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 201/756 (26%), Positives = 331/756 (43%), Gaps = 141/756 (18%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP +    E+ HG  + G            T FP  +  A+++   L   +G  +  
Sbjct: 124 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGLASTWAPELIEEVGKVIGR 171

Query: 150 EARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKS 209
           E R      Q G   + P +++ RD RW R +ET GEDP++        V+   G+    
Sbjct: 172 EVRL-----QGGHIGYGPVLDLVRDARWSRVEETYGEDPVLAGIMGAAMVRGMAGDGLHD 226

Query: 210 DDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITEQ 269
             G                          L A  KH +AY   + G     +    I   
Sbjct: 227 KHG--------------------------LIATLKHFLAYGTPEAGQNGGIT---TIGPL 257

Query: 270 DTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCD 328
           D    + PPF+  I  G A  +M SYN ++G+P+     L     R +W F G++ SD  
Sbjct: 258 DVMQNYLPPFKEAINAG-ALSVMTSYNSIDGIPSTSNHYLLTDVLRKKWNFDGFVVSDLA 316

Query: 329 AVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNL 388
           ++  ++      ++ +++A   L AG+D++ G+       ++++ G+VQE  ID A+  +
Sbjct: 317 SIEGLYGSHYVAQSVQEAANMALSAGVDVDLGSNAFVKLVNSVNAGEVQESLIDTAVTRV 376

Query: 389 FSVQLRLGLFNG---DPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLN 445
             ++  +GLF+    DP K +  K+    V ++EH  LA + A+  +VLLKND + LPL 
Sbjct: 377 LRLKFEMGLFDNPYVDPTKAE--KM----VRSAEHIALAREVAQNTLVLLKNDHEILPLK 430

Query: 446 KNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYVSK-----THYASGCHDVP 500
           ++    + ++GP  +N   M G YT  P    +++  LE  ++K       Y  GC  + 
Sbjct: 431 RDP-GKILVVGPNADNQYNMLGDYTA-PQDSDNVVTILEGVLNKLSPEQVEYVKGCA-IR 487

Query: 501 CNSDAGFHEAVRIAKKADFVIVVAG----LDLTQET-------------------EDRDR 537
             S+    +AV  A+ +D ++ V G     D   E                    E  DR
Sbjct: 488 DTSNVNIDKAVEAAEHSDVILAVVGGSSARDFKTEYLETGAAVANNQSVSDMDAGEGFDR 547

Query: 538 VSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAK 597
            +L L G+Q  L+ ++ +T K P+++V   G P+ +++A  ++      W  YPG+ G  
Sbjct: 548 ATLDLLGKQTDLLKALKKTGK-PLVVVYIQGRPMKMNWAAENADALICAW--YPGQEGGN 604

Query: 598 ALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSSRQYPGRSYRFYTGTQVYG 655
           A+A+++FGD+NP GRLPM+  P    ++P+  N  N R           Y   + T +Y 
Sbjct: 605 AIADVLFGDYNPAGRLPMS-VPRDVGQLPIYYNKKNPRP--------HDYVELSATPLYS 655

Query: 656 FGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFH 715
           FG+GLSY++++Y  L                                  EV       F 
Sbjct: 656 FGYGLSYSSFTYSDL----------------------------------EVAPVGQEEFV 681

Query: 716 VQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQ 775
           V   V N G+ DG  VV L+ R    S   P +QL  F R+       K++SF ++  + 
Sbjct: 682 VNFKVKNTGNFDGDEVVQLYLRDEYASVVQPVQQLKHFKRLFLKKGEEKQVSFHLNE-KD 740

Query: 776 LSIANKHGRRILPLGNHVLMVG----ELRHSLTIET 807
           L+I N+    ++  G   LM+G    ++R S TI+ 
Sbjct: 741 LAILNQKKEWVVEPGTFQLMIGAASDDIRLSGTIKV 776


>gi|329963634|ref|ZP_08301109.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328528179|gb|EGF55158.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 863

 Score =  241 bits (616), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 169/470 (35%), Positives = 240/470 (51%), Gaps = 66/470 (14%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C  P + +     TSL    RA+ L+  LTL+EK+  + D A  + RLGI  Y WW+E+L
Sbjct: 18  CSQPTYKT-----TSLPPEERAELLLKELTLEEKVSLMMDGAKPVERLGIKPYNWWNEAL 72

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAM--YNLGQA- 160
           HG+A  G            T FPQ +  AASF+      +  AV+ EARA   Y   Q  
Sbjct: 73  HGVARAG----------LATVFPQPIGMAASFSPEAVYGVFDAVSDEARAKNSYYASQGS 122

Query: 161 -----GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIG 215
                GLT W P +NI+RDPRWGRG ET GEDP + S   VE VK  QG N    DG   
Sbjct: 123 YERYQGLTMWTPTVNIYRDPRWGRGIETYGEDPYLTSRMGVEVVKGLQGGN----DG--- 175

Query: 216 FGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDT 274
                            R D+  L AC KH   +   +W   +R+ FNA  I  +D  +T
Sbjct: 176 -----------------RYDK--LHACAKHFAVHSGPEW---NRHEFNAENIKPRDLYET 213

Query: 275 FQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLF-QKARNEWGFKGYITSDCDAVATI 333
           + PPF + +++GK   +MC+YN+  G P C    L  Q  R+EWGF G + SDC A+A  
Sbjct: 214 YLPPFEALVKEGKVKEVMCAYNRFEGDPCCGSNRLLMQILRDEWGFDGIVLSDCGAIADF 273

Query: 334 FEYQNYTKTHED----SAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLF 389
           +  + + KTH D    SAA VL +G D+ CG+   +    A+  GK++EK +D A+  L 
Sbjct: 274 YNDRGH-KTHPDAESASAAAVL-SGTDLECGSS-YKALVEAVKLGKIEEKAVDVAVKRLL 330

Query: 390 SVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAV 449
           + +  LG  + +P K  + K+    V +++H  LALD AR+ + LL N    LPL +  +
Sbjct: 331 TARFALGEMD-EPDKVSWTKIPFSVVASAKHDSLALDIARKSMTLLMNKNDILPLKQGGL 389

Query: 450 SSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGC 496
            ++A++GP  N+     G Y G+P    ++L G+   +    K  Y  GC
Sbjct: 390 -TVAVMGPNANDSVMQWGNYNGMPSHTVTILEGVRKALGADDKLIYEQGC 438



 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 94/333 (28%), Positives = 147/333 (44%), Gaps = 68/333 (20%)

Query: 498 DVPCNSDAGFHEAVRIAKKADFV------IVVAGLDLTQETEDR----------DRVSLL 541
           D   N D GF + V I K    V      I V G+  + E E+           DR  + 
Sbjct: 575 DAQLNFDLGFRKEVDIKKSVAVVKDADVVIFVGGISPSLEGEEMGVNLPGFKKGDRTDIE 634

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
           LP  Q  ++ ++ R+ K+ VI V   G P  +       +  +IL   YPG+AG  A+AE
Sbjct: 635 LPAVQREMIAALHRSGKK-VIFVNCSGSP--IGLEPETKECEAILQAWYPGQAGGTAVAE 691

Query: 602 IIFGDFNPGGRLPMTWYPESFTKVP-MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGL 660
           ++FG +NP GRLP+T+Y  + +++P   D +M         GR+YR+ T   ++ FG+GL
Sbjct: 692 VLFGKYNPAGRLPVTFY-RNVSQLPDFEDYDMS--------GRTYRYMTQEPLFPFGYGL 742

Query: 661 SYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISV 720
           SYT + Y  +                   +L Q          +++ +  +L   + + V
Sbjct: 743 SYTAFEYGQM-------------------VLDQ----------NKINAGQTL--ELTVPV 771

Query: 721 TNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIAN 780
           TNAG  DG  VV ++ R    + G P K L  F RV+  A  +  + F +   E      
Sbjct: 772 TNAGKRDGEEVVQVYLRKRDDTDG-PNKTLRAFKRVNIPAGKTVNVDFELKDKELEWWDT 830

Query: 781 KHGRRILPLGNHVLMVGE-------LRHSLTIE 806
                 +  G++ +MVG        L +S TIE
Sbjct: 831 ATNTMRVCAGDYDIMVGGSSQTEDLLLYSFTIE 863


>gi|218130692|ref|ZP_03459496.1| hypothetical protein BACEGG_02281 [Bacteroides eggerthii DSM 20697]
 gi|217987036|gb|EEC53367.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 786

 Score =  241 bits (615), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 218/745 (29%), Positives = 341/745 (45%), Gaps = 129/745 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP  ++ +E + G+  +             T FP      A++N+ L S I    A 
Sbjct: 137 RLGIPV-DFTNEGIRGLCHD-----------RATMFPAQCGQGATWNKELISEIARVTAE 184

Query: 150 EARAMYNLGQAGLT-FWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           EA+A+      G T  ++P ++I +DPRWGR  E  GEDP +V                 
Sbjct: 185 EAKAL------GYTNIYSPILDIAQDPRWGRVVECYGEDPFLV----------------- 221

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAMITE 268
              G +G     KR++KG   E        L A  KH   Y +   G  +    +  +  
Sbjct: 222 ---GELG-----KRMIKGLQAEG-------LVATPKHFAVYSVPVGGRDAGTRTDPHVAP 266

Query: 269 QDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGD---LFQKARNEWGFKGYITS 325
           ++    +  PFR    +  A  +M SYN  +G P  + G    L +  R+EWGFKGY+ S
Sbjct: 267 REMRTLYLEPFRKAFCEAGALGVMSSYNDYDGEP--ITGSYHFLTEILRHEWGFKGYVVS 324

Query: 326 DCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM-------LRHTQSAIDKGKVQE 378
           D +AV  ++          D AA V+ AG+++     +       LRH   AI +GK+  
Sbjct: 325 DSEAVEFLYSKHKVAADAVDGAAQVINAGLNVRTNFSLPENFIYPLRH---AISEGKISM 381

Query: 379 KDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKND 438
           + ID  + ++  V+  +GLF+ +P KG   KL     C  EH+ +++ AA + IVLLKN+
Sbjct: 382 QTIDSRVADVLRVKFMMGLFD-NPYKGDTKKLEKVVHC-KEHQNVSMRAALESIVLLKNE 439

Query: 439 KKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV--SKTHYASGC 496
              LPL+K ++  +A+IGP    +  +   Y       K++ +G++ Y+  S+  YA G 
Sbjct: 440 NNILPLSK-SIKKVAVIGPNAAEVDNLICRYGPANAPIKTVYQGIKDYLPDSEVRYAKGA 498

Query: 497 ------------HDVPCN--SDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLL 542
                       +DVP N   +    EAV +AK++D  I+V G +     E+  R +L L
Sbjct: 499 DIIDEYFPESELYDVPLNKTEETMIDEAVALAKESDVAIMVLGGNEKTVREEYSRTNLDL 558

Query: 543 PGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEI 602
            G+Q  L+ +V  T K PVILV+  G    +++AE    I  I+   +PGE    A+A++
Sbjct: 559 CGRQEKLLQAVYATGK-PVILVMIDGRAATINWAE--RYIPGIVHAWFPGEFMGDAIAKV 615

Query: 603 IFGDFNPGGRLPMTWYPESFTKVPMN-DMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLS 661
           IFGD+NPGG+L +T +P S  ++P        +DS      + +   TG+ +Y FG+GLS
Sbjct: 616 IFGDYNPGGKLAVT-FPRSVGQIPFAFPFKPGSDS------KGFVRVTGS-LYPFGYGLS 667

Query: 662 YTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVT 721
           YT + Y  L   + +             I  Q   +L         SC          V 
Sbjct: 668 YTTFDYSALKIENPV-------------IGTQGSVKL---------SC---------RVK 696

Query: 722 NAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANK 781
           N G + G  VV L+      S  T  K L GF+R+H      K + F + P + + + NK
Sbjct: 697 NTGKIAGDEVVQLYLHDEVSSVTTYVKVLRGFERIHLQPGEEKVVDFVLTP-QDMGLWNK 755

Query: 782 HGRRILPLGNHVLMVGELRHSLTIE 806
               I+  G   +MVG     + ++
Sbjct: 756 DNHFIVEPGTFAVMVGSSSQDIRLQ 780


>gi|423333917|ref|ZP_17311698.1| hypothetical protein HMPREF1075_03349 [Parabacteroides distasonis
           CL03T12C09]
 gi|409226752|gb|EKN19658.1| hypothetical protein HMPREF1075_03349 [Parabacteroides distasonis
           CL03T12C09]
          Length = 758

 Score =  241 bits (615), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 214/778 (27%), Positives = 345/778 (44%), Gaps = 129/778 (16%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C+ P    +   +   +I  R  S++SL+TL+EK+ Q++  +      G P      E+L
Sbjct: 20  CQQPAVSGWKSFSGDKNIERRVDSVLSLMTLEEKVGQMAQYSCNWDVTG-PVMTGDYETL 78

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
                 G   N   TV  V    ++ +S +     +    G  V    R ++ +  A   
Sbjct: 79  LKQGLVGSLFNVY-TVDGVRKMQEMALSESRLKIPVL--FGYDVVHGYRTIFPMPLAESC 135

Query: 164 FW-------------------------APNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
            W                         AP ++I RD RWGR  E  GEDP + S  A   
Sbjct: 136 SWDLERMRKSAAIAADEASASGIAWTFAPMVDISRDARWGRVMEGAGEDPYLGSLIAKAR 195

Query: 199 VKSFQGEN-WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-- 255
           V+ FQG N W+S                         D   + ACCKH  AY   + G  
Sbjct: 196 VEGFQGGNDWRS-----------------------LADVNTVLACCKHFAAYGAAEAGRD 232

Query: 256 -NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDLF 310
            N S  S N ++      + + PP+ +  E G A+  M S+N++NGVP+     L  DL 
Sbjct: 233 YNTSELSQNTLM------NYYMPPYLAAKEAGVAT-FMASFNEINGVPSTGNKWLMTDLL 285

Query: 311 QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL--KAGMDIN-CGTCMLRHT 367
           +K   +WGF G++ +D      I E   ++    D  AG L   AG+D++  G    ++ 
Sbjct: 286 RK---DWGFNGFVVTDYTG---INEMVAHSIVRNDKEAGELAANAGIDMDMTGGIYNQYL 339

Query: 368 QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDA 427
             ++ +GKV E++I+RA+ ++  ++  LGLF+ DP +    +   + +   E  + A + 
Sbjct: 340 VQSVKEGKVSEENINRAVASILEMKFLLGLFD-DPYRYLDNEREKNTIMKPEFLQEARET 398

Query: 428 ARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEA 485
           + + IVLLKND  F P++K+   ++A+IGP+V +     G + G     +  SL  GL  
Sbjct: 399 SARSIVLLKNDNNFFPISKDKHITVALIGPMVKDKINQNGEWAGRGEREESISLFEGLTE 458

Query: 486 YVSKTH----YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLL 541
             + T+    YA GC D+  +  + F EA+  A++AD V+   G D     E   R  L 
Sbjct: 459 KYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEAACRTDLK 517

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
           LPG Q +L+  + +T K P+ L+L  G PLD+S+   D  +  IL   Y G      +A+
Sbjct: 518 LPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--EDQHVDGILEAWYLGTMAGHGMAD 574

Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRS--------YRFYTGTQV 653
           +I GD+NP  RL M+ +P +  ++P+     +  + R  P  +        Y     T +
Sbjct: 575 VISGDYNPSARLTMS-FPRTVGQLPL--YYNQKPTGRPVPPEAPDTDYKSRYMDVPNTPL 631

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           Y FG+GLSYT ++                               ++ + +D+ +     +
Sbjct: 632 YPFGYGLSYTTFA-------------------------------VNSMKLDQNSFTKGGK 660

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
             V   V N G VDG  VV ++ R    S   P K+L GF++V   A   K++SF +D
Sbjct: 661 ITVTAEVENTGKVDGETVVQMYIRDLAGSVTRPVKELKGFEKVALKAGEKKQVSFTID 718


>gi|399025370|ref|ZP_10727374.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
 gi|398078230|gb|EJL69151.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
          Length = 775

 Score =  241 bits (615), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 232/857 (27%), Positives = 379/857 (44%), Gaps = 161/857 (18%)

Query: 19  LQLIVVVNVIAFSNSKPVLNKPDFPCKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKI 78
           ++ ++V+  +A +   PV +  +   KP    SY              +L+S +TL EKI
Sbjct: 1   MKKLIVIATLALA---PVFSAQEMVSKP--VQSYQTAQYQAKKKAFVNNLLSKMTLDEKI 55

Query: 79  QQLSDNASAIPRLGIPAY----EWWSESLHGIASNGPGVNFNGTVSSV------------ 122
            QL+  +S     G+       +   + L G   N  G +    V  V            
Sbjct: 56  GQLNLPSSGDFTTGLAQSSDIGKKIEQGLVGGLFNIKGADKIRAVQKVAVEKSRLKIPLI 115

Query: 123 ----------TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINI 171
                     T+FP  L  AAS++ +L        A EA +       G+ + ++P ++I
Sbjct: 116 FGMDVIHGYETTFPIPLGLAASWDMNLVQQSARVAAKEASS------DGINWTFSPMVDI 169

Query: 172 FRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEES 231
            R+PRWGR  E  GEDP + S  A   V  +QG++                         
Sbjct: 170 SREPRWGRVSEGSGEDPYLGSEIAKNMVYGYQGKDLSV---------------------- 207

Query: 232 DRGDELMLSACCKHLIAYDLEKWG---NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKA 288
             G+ ++  AC KH   Y   + G   N    S   M  E      + PP+++ ++ G A
Sbjct: 208 --GNTIL--ACVKHFALYGAGEAGRDYNTVDMSHVRMFNE------YFPPYKAAVDAGVA 257

Query: 289 SCIMCSYNQVNGVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHE 344
           S +M S+N+V+GVPA     L+ ++    RN+W FKG++ +D   +  + ++       +
Sbjct: 258 S-VMASFNEVDGVPATGNRWLQTEVL---RNKWNFKGFVVTDYTGINEMVDHG--MGDLQ 311

Query: 345 DSAAGVLKAGMDIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPR 403
             +A  LKAG+D++  G   L   + ++ +GK+ + DID A   +   +  LGLF+ DP 
Sbjct: 312 QVSALALKAGVDMDMVGEGFLTTLKKSLSEGKITQADIDIAAKRILEAKYDLGLFD-DPY 370

Query: 404 KGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNIS 463
           +    KL   +V   E++ +A +AA Q +VL+KND + LPL K+   ++A+IGPLVNN  
Sbjct: 371 RYGDAKLAAKEVYNMENRNIARNAAAQSMVLMKNDNQVLPLKKSG--TVAVIGPLVNNSL 428

Query: 464 QMGGGYTGIPCSPKS--LLRGLEA-----------------YVSKTHYASGCHDVPCNSD 504
            M G ++      ++  L++GL+A                 Y +K       H    + D
Sbjct: 429 NMAGTWSVAAKHDQAVNLMQGLQANFGKEVKFISAKGANIDYDAKLEDIYAAHGKKTDRD 488

Query: 505 -----AGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKR 559
                A   EAV +A KAD +++  G       E   R  + +P  Q+ L+  + +T K 
Sbjct: 489 NRPKEALLKEAVDVANKADVIVLAIGESAEMSGESSSRTEITIPQSQVDLLNELKKTGK- 547

Query: 560 PVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYP 619
           P+ +VL  G PL  +         SIL + + G     A+++++FG  NP G+LPMT +P
Sbjct: 548 PIAVVLFTGRPL--ALTNIKDTPDSILNVWFAGSEAGNAISDVLFGKVNPSGKLPMT-FP 604

Query: 620 ESFTKVPM--NDMNMRADSSRQYPGR--------SYRFYTGTQVYGFGHGLSYTNYSYKF 669
            S  +VP+  N  N     S++   +        +Y     T +Y FG+GLSYT + Y  
Sbjct: 605 RSLGQVPVYYNAKNTGRPLSKELTDKCEYQRFRSNYMDECNTPLYPFGYGLSYTKFGY-- 662

Query: 670 LSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGS 729
               S++T+S +   G+      QT                       + VTN+G+ DG+
Sbjct: 663 ----SDVTVSNANPKGN------QT-------------------IQASVMVTNSGNYDGA 693

Query: 730 HVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL 789
            VV L+ R    S   P K+L GF +V      SK+++F + P E L   N   +     
Sbjct: 694 EVVQLYIRDMVGSITRPVKELKGFQKVMLKKGESKKVTFDITP-ENLKFYNGDLKYDWES 752

Query: 790 GNHVLMVG----ELRHS 802
           G   +M+G    E++HS
Sbjct: 753 GEFDIMIGTNSEEVKHS 769


>gi|423301011|ref|ZP_17279035.1| hypothetical protein HMPREF1057_02176 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472346|gb|EKJ90874.1| hypothetical protein HMPREF1057_02176 [Bacteroides finegoldii
           CL09T03C10]
          Length = 746

 Score =  241 bits (615), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 201/712 (28%), Positives = 326/712 (45%), Gaps = 114/712 (16%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQ 181
           T FP  L  +AS++           A EA ++      GL + ++P  +I RDPRWGR  
Sbjct: 121 TIFPVNLGISASWDLEEIERFARVSAEEASSV------GLHWTFSPMCDISRDPRWGRVS 174

Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
           E  GEDP +    A   V+ +QG++   D+                           + +
Sbjct: 175 EGSGEDPYLGGLIAAAMVRGYQGDDLSKDN--------------------------TILS 208

Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
           C KH IAY   + G   R      I+E++  DT+ PPF++ I+ G A+ +M ++N+ +GV
Sbjct: 209 CAKHFIAYGAAQAG---RDYHTVDISERELRDTYLPPFKAAIDAG-AATVMSAFNEYDGV 264

Query: 302 PACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCG 360
           P      L +   R E GFKG++ SD  AV  +  +      +++ AA +   GM+    
Sbjct: 265 PCTANSFLMRDLLRKEMGFKGFVVSDYTAVKELIAHG--VAGNDEEAASL---GMNAGVN 319

Query: 361 TCM-----LRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDV 415
            CM     L H +  + +G V+E+ ++     + +++ +LGLF+   R G   K     +
Sbjct: 320 MCMVDNLYLFHGEKLVKEGVVREETVNELCAEILTMKYKLGLFDDPYRYGSEEK-EKSTI 378

Query: 416 CTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCS 475
               +   A + A++ +VLL+N+   LP+ +     +A+IGP  N+  +M G +   P  
Sbjct: 379 FNDSNLLAAKELAKKSMVLLQNNNNVLPVKEG--KKIALIGPFANHKREMLGSWVLAP-E 435

Query: 476 PK---SLLRGLEAYVSK--THYASGC---HDVPCNSDAGFHEAVRIAKKADFVIVVAGLD 527
           PK   + L GLE    K   ++A GC    ++P     GF  AV  A+KAD +    GL 
Sbjct: 436 PKRTVTFLEGLEERYGKQNIYFAPGCRIHEEIP----DGFKNAVAQAQKADVIFYTMGLS 491

Query: 528 LTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILW 587
             +  E     SL LP  QM L+  +  T K P++++L    P+++   E   ++SS+L 
Sbjct: 492 QKESGEAASMTSLKLPEVQMQLLEKLKSTGK-PIVVLLVVARPIELE--EVTEKVSSLLL 548

Query: 588 IGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--------NDMNMRADSSRQ 639
              PG     AL E++ GD+NP G+L ++ +P    +VP+          M +++ SS +
Sbjct: 549 TWNPGTMAGAALGEVVSGDYNPSGKLTLS-FPLLTGQVPLYYNMKNTGRPMGLKSVSSEK 607

Query: 640 YPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLD 699
           Y  R Y       +Y FG+GLSYT + Y  +   S               ++ + G  L 
Sbjct: 608 YKSR-YIDCPNEPLYPFGYGLSYTTFEYSDIQPSS---------------VIAEEGDTLQ 651

Query: 700 YVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTV 759
           +                 ++VTN G+VDG  VV L+ R  K S   P K+L GF ++   
Sbjct: 652 F----------------SVNVTNTGNVDGEEVVQLYIRDMKGSVTRPVKELKGFKKIMIK 695

Query: 760 AKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG-----ELRHSLTIE 806
           A  S++++F + P E LS   K        G + L +G      L  S TI+
Sbjct: 696 AGESRKVTFNLSP-EDLSFTRKDMTWGQEKGEYKLWIGGDSNANLEASFTIK 746


>gi|301307646|ref|ZP_07213603.1| periplasmic beta-glucosidase [Bacteroides sp. 20_3]
 gi|423337347|ref|ZP_17315091.1| hypothetical protein HMPREF1059_01016 [Parabacteroides distasonis
           CL09T03C24]
 gi|300834320|gb|EFK64933.1| periplasmic beta-glucosidase [Bacteroides sp. 20_3]
 gi|409237807|gb|EKN30603.1| hypothetical protein HMPREF1059_01016 [Parabacteroides distasonis
           CL09T03C24]
          Length = 758

 Score =  241 bits (614), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 217/778 (27%), Positives = 346/778 (44%), Gaps = 129/778 (16%)

Query: 44  CKPPHFDSYPFCNTSLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESL 103
           C+ P    +   +   +I  R  S++SL+TL+EK+ Q++  +      G P      E+L
Sbjct: 20  CQQPAVSGWKSFSGDKNIERRVDSVLSLMTLEEKVGQMAQYSCNWDVTG-PVMTGDYETL 78

Query: 104 HGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLT 163
                 G   N   TV  V    ++ +S +     +    G  V    R ++ +  A   
Sbjct: 79  LKQGLVGSLFNVY-TVDGVRKMQEMALSESRLKIPVL--FGYDVVHGYRTIFPMPLAESC 135

Query: 164 FW-------------------------APNINIFRDPRWGRGQETPGEDPMVVSAYAVEF 198
            W                         AP ++I RD RWGR  E  GEDP + S  A   
Sbjct: 136 SWDLERMRKSAAIAADEASASGIAWTFAPMVDISRDARWGRVMEGAGEDPYLGSLIAKAR 195

Query: 199 VKSFQGEN-WKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-- 255
           V+ FQG N W+S                         D   + ACCKH  AY   + G  
Sbjct: 196 VEGFQGGNDWRS-----------------------LADVNTVLACCKHFAAYGAAEAGRD 232

Query: 256 -NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPAC----LRGDLF 310
            N S  S N ++      + + PP+ +  E G A+  M S+N++NGVP+     L  DL 
Sbjct: 233 YNTSELSQNTLM------NYYMPPYLAAKEAGVAT-FMASFNEINGVPSTGNKWLMTDLL 285

Query: 311 QKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVL--KAGMDIN-CGTCMLRHT 367
           +K   +WGF G++ +D      I E   ++    D  AG L   AG+D++  G    ++ 
Sbjct: 286 RK---DWGFNGFVVTDYTG---INEMVAHSIVRNDKEAGELAANAGIDMDMTGGIYSQYL 339

Query: 368 QSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDA 427
             ++ +GKV E++I+RA+ ++  ++  LGLF+ DP +    +   + +   E  + A + 
Sbjct: 340 VQSVKEGKVSEENINRAVASILEMKFLLGLFD-DPYRYLDNEREKNTIMKPEFLQEARET 398

Query: 428 ARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPK--SLLRGLEA 485
           + + IVLLKND  F P++K+   ++A+IGP+V +     G + G     +  SL  GL  
Sbjct: 399 SARSIVLLKNDNNFFPISKDKHITVALIGPMVKDKINQNGEWAGRGEREESISLFEGLTE 458

Query: 486 YVSKTH----YASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLL 541
             + T+    YA GC D+  +  + F EA+  A++AD V+   G D     E   R  L 
Sbjct: 459 KYAGTNVKFIYAEGC-DLLTDDSSKFAEAIATARRADIVLAAMGEDFNWSGEAACRTDLK 517

Query: 542 LPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAE 601
           LPG Q +L+  + +T K P+ L+L  G PLD+S+   D  +  IL   Y G      +A+
Sbjct: 518 LPGAQQALLKELKKTGK-PLGLILVNGRPLDLSW--EDQHVDGILEAWYLGTMAGHGMAD 574

Query: 602 IIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRS--------YRFYTGTQV 653
           +I GD+NP  RL M+ +P +  ++P+     +  + R  P  +        Y     T +
Sbjct: 575 VISGDYNPSARLTMS-FPRTVGQLPL--YYNQKPTGRPVPPEAPDTDYKSRYMDVPNTPL 631

Query: 654 YGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLR 713
           Y FG+GLSYT ++              S+K   D+N   + G                 +
Sbjct: 632 YPFGYGLSYTTFAVN------------SMKL--DQNSFTKGG-----------------K 660

Query: 714 FHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVD 771
             V   V N G VDG  V+ ++ R    S   P K+L GF++V   A   K++SF +D
Sbjct: 661 ITVTAEVENTGKVDGETVIQMYIRDLAGSVTRPVKELKGFEKVTLKAGEKKQVSFTID 718


>gi|373461603|ref|ZP_09553342.1| hypothetical protein HMPREF9944_01606 [Prevotella maculosa OT 289]
 gi|371951907|gb|EHO69749.1| hypothetical protein HMPREF9944_01606 [Prevotella maculosa OT 289]
          Length = 739

 Score =  241 bits (614), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 195/690 (28%), Positives = 315/690 (45%), Gaps = 113/690 (16%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAM-YNLGQAGLTFWAPNINIFRDPRWGRGQ 181
           T FP  L  + +++  L        A+EA A+ YN        ++P ++I RDPRWGR  
Sbjct: 103 TVFPIPLGLSCTWDSVLIKKTARIAAIEATAIGYN------NTYSPMVDITRDPRWGRVM 156

Query: 182 ETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSA 241
           E  GEDP + S  A   VK +QGEN  +                             L A
Sbjct: 157 EGSGEDPYLGSVIARAMVKGYQGENLDASTS--------------------------LLA 190

Query: 242 CCKHLIAYDLEKWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGV 301
           C KH   Y   + G     +  + +T     + + PP+++ +E G A  +M S+N V G+
Sbjct: 191 CVKHFGMYGAIEAGRDYNTTDMSRVT---MYNVYLPPYKAAVEAG-AGSVMTSFNDVEGI 246

Query: 302 PAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDI 357
           P+     L  DL    RN+WGFKG++ +D +AV  +  +      +E++A   +KAG D+
Sbjct: 247 PSTANKWLLTDLL---RNQWGFKGFMVTDYNAVQELIPH-GVAADYEEAAEKAIKAGTDM 302

Query: 358 NCGT-CMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVC 416
           +  + C     +  + +G +    I+RA   +  V+  LGLFN DP KG    L    + 
Sbjct: 303 DMASECYSMFMEKLVTEGHIDTSYINRACRRVLEVKYDLGLFN-DPYKGYQKGLAERVIL 361

Query: 417 TSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPC 474
           + EHK +A +A ++ +VLLKN+++ LPL  +  S +A+IGP ++N ++M   ++  G+  
Sbjct: 362 SKEHKAVAREATQRAMVLLKNERQTLPLRPD--SRVALIGPFLDNKAEMLSMWSPDGVID 419

Query: 475 SPKSLLRGLEAYVSKTHYASGCHDVPCNSDAGFH---------------EAVRIAKKADF 519
           S  S++ G++    +   A G +    N D                   EAV +A ++D 
Sbjct: 420 SVVSIVAGMKKQFREVKTAVGTY---MNDDPALKSFEPFDSAQQAKMIAEAVELAHQSDV 476

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVS--FAE 577
           V+ V G       E R    + +P  Q  L+ ++ +T K PVILVL  G P+ ++     
Sbjct: 477 VVAVLGESRWLSGEARSMTKISMPACQKELLKALKKTGK-PVILVLLSGRPMVLTDIIPH 535

Query: 578 ADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM-----NDMNM 632
           AD+ I++  W   PG     A+A+++ G +NP G+L MT +P S  ++P+     N    
Sbjct: 536 ADAIIAA--W--RPGTMAGDAIADVLSGKYNPSGKLTMT-FPRSVGQIPIYYNHKNTGRP 590

Query: 633 RADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQ 692
             D  R      Y     + ++ FG+GLSYT + Y       ++ +S ++  G+ K+I  
Sbjct: 591 FKDGDRGQFKSKYLDEMNSPLFPFGYGLSYTTFRY------GDIVLSDTIVNGAKKHIT- 643

Query: 693 QTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIG 752
                                    ISVTN G   G   V L+   P  S   P K+L  
Sbjct: 644 -----------------------ASISVTNTGKYSGEETVQLYINDPVASITRPVKELKD 680

Query: 753 FDRVHTVAKGSKEISFGVDPCEQLSIANKH 782
           + +V      +K+I+F + P E L   N H
Sbjct: 681 YQKVMLAPGETKKITFNITP-EALKFYNHH 709


>gi|409195436|ref|ZP_11224099.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
           21150]
          Length = 867

 Score =  241 bits (614), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 151/448 (33%), Positives = 220/448 (49%), Gaps = 62/448 (13%)

Query: 55  CNT------SLSISTRAKSLISLLTLQEKIQQLSDNASAIPRLGIPAYEWWSESLHGIAS 108
           CNT      S S   RA  L+  LTL+EK+  + D  +AI RLGI  Y WW+E+LHG+A 
Sbjct: 21  CNTEIWKDNSYSPEERADDLLKELTLEEKVSLMVDRNTAIERLGIEEYNWWNEALHGVAR 80

Query: 109 NGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQA-------- 160
            G            T FPQ +  AA+F+R +  ++ SA + EARA ++  +         
Sbjct: 81  AG----------QATVFPQPVGMAAAFDRDMVLDVFSAASDEARAKHHFFKERGERGRYQ 130

Query: 161 GLTFWAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFRE 220
           GLT W PNIN+FRDPRWGRG E  GEDP +        VK  QG+               
Sbjct: 131 GLTMWTPNINVFRDPRWGRGMEAYGEDPFMNGVLGTAVVKGLQGDR-------------- 176

Query: 221 KRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWGNFSRYSFNAM-ITEQDTEDTFQPPF 279
                        G    L AC KH   +   +W   +R+SFNA  I  +D  +T+ P F
Sbjct: 177 ------------SGKYDKLHACAKHYAVHSGPEW---NRHSFNAENIRPRDLHETYLPAF 221

Query: 280 RSCIEQGKASCIMCSYNQVNGVPACLRGDLFQKA-RNEWGFKGYITSDCDAVATIFE--- 335
           +  +  G    +MC+YN+  G P C    L +   RNEWGF G + SDC A+   F    
Sbjct: 222 KKLVIDGDVRMVMCAYNRFEGEPCCGNNQLLRDILRNEWGFDGVVVSDCWAINDFFNKDA 281

Query: 336 YQNYTKTHEDSAAGVLKAGMDINCGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRL 395
           +  Y      S   VL AG D+NCG       + A+++G + E+ +D +L  L   +  L
Sbjct: 282 HAMYPDAKTASTDAVL-AGTDLNCGDSYPSLVE-AVEQGLITEEQLDISLRRLLIARFEL 339

Query: 396 GLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAII 455
           G  + D  + ++ K+    V +  H ++AL+AAR+ + LL N    LPL K  + ++A++
Sbjct: 340 GEMDPD-EEVEWSKIPHSVVSSPTHSEMALEAARKSMTLLMNKNGALPLKKEGL-TVAVM 397

Query: 456 GPLVNNISQMGGGYTGIPCSPKSLLRGL 483
           GP  N+     G Y G P +  ++L+G+
Sbjct: 398 GPNANDSLMQWGNYNGTPATTTTILQGI 425



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 92/293 (31%), Positives = 142/293 (48%), Gaps = 57/293 (19%)

Query: 517 ADFVIVVAGLDLTQETEDR----------DRVSLLLPGQQMSLVTSVARTSKRPVILVLT 566
           AD V+  +G+    E E+           DR  + LP  Q  ++ ++ +  K  +++  +
Sbjct: 603 ADVVVFASGISPFLEGEEMGVDLPGFKGGDRTDIALPAIQKEMLKALHKAGKEIILVNCS 662

Query: 567 GGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVP 626
           G     + F EA    S+IL   YPG+AG +A+AE++FGD+NP GRLP+T+Y +S  ++P
Sbjct: 663 GSA---IGFEEATDYSSAILQAWYPGQAGGQAVAEVLFGDYNPAGRLPVTFY-KSVDQLP 718

Query: 627 -MNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAG 685
              D NM          R+YR++ G  +Y FG+GLSYT +SY                  
Sbjct: 719 DFQDYNMT--------NRTYRYFEGEPLYPFGYGLSYTTFSY------------------ 752

Query: 686 SDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGT 745
            D+  L QT            +  T     +++SV N GD DG  VV L+ + P  ++G 
Sbjct: 753 -DQPELSQT------------SISTEEEASLKVSVANTGDYDGEEVVQLYLQKPDDTEG- 798

Query: 746 PEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPL-GNHVLMVG 797
           P   L GF RV      + E+ F +   E L   N   +R+ PL G++ L+VG
Sbjct: 799 PSLTLRGFQRVFIPKGETVEVEFQLTE-EVLEWWNADAQRMTPLAGDYRLLVG 850


>gi|339999930|ref|YP_004730813.1| periplasmic beta-glucosidase [Salmonella bongori NCTC 12419]
 gi|339513291|emb|CCC31041.1| periplasmic beta-glucosidase precursor [Salmonella bongori NCTC
           12419]
          Length = 765

 Score =  240 bits (613), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 206/737 (27%), Positives = 343/737 (46%), Gaps = 136/737 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+Q+ D   A+ RL IP + +  + +HG                 T FP  L  A++
Sbjct: 86  RQDIRQMQDQVMALSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLAST 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGR  E  GED  + S 
Sbjct: 130 FNLDAVRTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRASEGFGEDTYLTSI 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                VK+ QG++                            D   +    KH  AY   +
Sbjct: 184 MGETMVKAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSPQRLFNDYMPPYKAGLDAG-SGAVMIALNSLNGTPATSDAWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R++WGFKG   SD  A+  + ++   T   ED+    LK+G+D++       ++    
Sbjct: 273 VLRHQWGFKGITVSDHGAIKELIKHGTATDP-EDAVRVALKSGVDMSMADEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGP---DDVCTSE----HKKL 423
           I  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP   D V T+     H+K 
Sbjct: 332 IKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKESDPVDTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A + AR+ +VLLKN    LPL+KNA  ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 AREVARESLVLLKNRLNTLPLSKNA--TIAVVGPLADSKRDIMGSWSAAGVADQSVTVLT 444

Query: 482 GLEAYVS---KTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G++  +    K  YA G +     D+              P +  A   EAV+ AK+AD 
Sbjct: 445 GIKNALGERGKVLYAKGANITNDKDIVDFLNLYEEVVKIDPRSPQAMIDEAVQAAKQADV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R  + +P  Q  L+T++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGESQGMAHEASSRTDITIPQSQRDLITALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNMRADSS 637
            Q  +IL   + G  G  A+A+++FGD NP G+LPM+ +P S  ++P+  + +N     +
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDDNPSGKLPMS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 638 RQYPGR-SYRFYTGTQ--VYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQT 694
            + P + + R++      +Y FG+GLSY  +        S++T+SA          +++ 
Sbjct: 621 PEKPDKYTSRYFDEANGPLYPFGYGLSYNTFKV------SDVTLSAP--------TMKRD 666

Query: 695 GSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFD 754
           G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF+
Sbjct: 667 G---------KVTAS--------VEVTNTGKREGATVIQMYLQDVTASMSRPVKQLKGFE 709

Query: 755 RVHTVAKGSKEISFGVD 771
           ++      +K +SF +D
Sbjct: 710 KIILKPGETKTVSFPID 726


>gi|300777062|ref|ZP_07086920.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
 gi|300502572|gb|EFK33712.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
          Length = 775

 Score =  240 bits (613), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 211/717 (29%), Positives = 330/717 (46%), Gaps = 124/717 (17%)

Query: 123 TSFPQVLVSAASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQE 182
           T+FP  L  AAS++ +L   I  +  V AR   + G    TF +P ++I R+PRWGR  E
Sbjct: 126 TTFPIPLGLAASWDMNL---IQQSARVAAREAASDG-INWTF-SPMVDISREPRWGRVSE 180

Query: 183 TPGEDPMVVSAYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSAC 242
             GEDP + S  +   V  +QG+                          D  +   + AC
Sbjct: 181 GSGEDPYLGSEISKNMVYGYQGK--------------------------DLANGSNILAC 214

Query: 243 CKHLIAYDLEKWG---NFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVN 299
            KH   Y   + G   N    S   M  E      + PP+++ ++ G AS +M S+N+V+
Sbjct: 215 VKHFALYGAGEAGRDYNTVDMSHVRMFNE------YFPPYKAAVDAGVAS-VMASFNEVD 267

Query: 300 GVPAC----LRGDLFQKARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGM 355
           GVPA     L+ ++    RN+W FKG++ +D   +  + E+       +  +A  LKAG+
Sbjct: 268 GVPATGNRWLQTEVL---RNQWKFKGFVVTDYTGINEMVEHG--MGDLQQVSALALKAGV 322

Query: 356 DIN-CGTCMLRHTQSAIDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDD 414
           D++  G   L   + ++ +GKV + +ID A   +   +  LGLF+ +P K    KL   +
Sbjct: 323 DMDMVGEGFLTTLKKSLAEGKVTQAEIDMAARRILEAKYDLGLFD-NPYKHGDAKLAAKE 381

Query: 415 VCTSEHKKLALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTGIP- 473
           V   E++ +A   A Q +VLLKND + LPL K+   ++A+IGPLVNN   M G ++    
Sbjct: 382 VYNMENRNIARSTAAQSMVLLKNDNQVLPLKKSG--TVAVIGPLVNNSMNMAGTWSVATK 439

Query: 474 -CSPKSLLRGLEA-----------------YVSKTHYASGCHDVPCNSDAG-----FHEA 510
             S  SL++GL++                 Y +K       H    + D         EA
Sbjct: 440 HASSVSLMQGLQSNYGKEVKFLSAKGANIDYDAKLEEIYAAHGKKTDRDNRSKEELLKEA 499

Query: 511 VRIAKKADFVIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGP 570
           V +A KAD +++  G       E   R  + +P  Q+ L+  + +T K P+ +VL  G P
Sbjct: 500 VEVANKADVIVLAIGESAEMSGESSSRTEITIPQSQVDLLNELKKTGK-PIAMVLFTGRP 558

Query: 571 LDVSFAEADSQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--N 628
           L  +         +IL   + G     A+A+++FG  NP G+LPMT +P S  +VP+  N
Sbjct: 559 L--ALTNVKDTPDAILNAWFAGSEAGNAIADVLFGKVNPSGKLPMT-FPRSLGQVPIYYN 615

Query: 629 DMNM-RADSSRQYPGRSY-RFYTG------TQVYGFGHGLSYTNYSYKFLSAPSELTISA 680
             N  R  S  +    +Y RF +       T +Y FG+GLSYT ++Y      S++T+S 
Sbjct: 616 AKNTGRPLSQDKVDKCTYERFRSNYMDECNTPLYAFGYGLSYTKFNY------SDMTVSN 669

Query: 681 SLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPK 740
           +   G+      QT                       ++VTNAG+ DG+ VV L+ R   
Sbjct: 670 TSPKGN------QT-------------------VQASVTVTNAGNYDGAEVVQLYIRDMV 704

Query: 741 VSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHGRRILPLGNHVLMVG 797
            S   P K+L GF +V      SK+++F + P E L   N   +     G   +M+G
Sbjct: 705 GSITRPVKELKGFQKVFLKKGESKKVTFDITP-ENLKFYNGELKYDWEPGEFDIMIG 760


>gi|423303939|ref|ZP_17281938.1| hypothetical protein HMPREF1072_00878 [Bacteroides uniformis
           CL03T00C23]
 gi|423307339|ref|ZP_17285329.1| hypothetical protein HMPREF1073_00079 [Bacteroides uniformis
           CL03T12C37]
 gi|392686630|gb|EIY79933.1| hypothetical protein HMPREF1072_00878 [Bacteroides uniformis
           CL03T00C23]
 gi|392690354|gb|EIY83622.1| hypothetical protein HMPREF1073_00079 [Bacteroides uniformis
           CL03T12C37]
          Length = 736

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 209/802 (26%), Positives = 343/802 (42%), Gaps = 146/802 (18%)

Query: 54  FCNTSLSISTRAKSLISLLTLQEKIQQLSD--------------NASAIP---------- 89
           + +    I  R   L+S +TL+EK+QQL+                   IP          
Sbjct: 29  YKDAKAPIEERVNDLVSRMTLEEKVQQLNQYTLGRNNNENNRGEEVKKIPATLGSLIYFD 88

Query: 90  -----------------RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSA 132
                            RLGIP   +  + +HG                 T +P  L  A
Sbjct: 89  EDANLRNEAQRKAMEESRLGIPIL-FGYDVIHGFR---------------TIYPISLGQA 132

Query: 133 ASFNRSLWSNIGSAVAVEARAMYNLGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVVS 192
            S+N  L     +  A EAR    +     TF +P I++ RD RWGR  E  GEDP   +
Sbjct: 133 CSWNPQLVEQACAVAAQEAR----MSGVDWTF-SPMIDVARDGRWGRVAEGYGEDPYTNA 187

Query: 193 AYAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLE 252
            + V  +K +QGE                          D  D   ++AC KH I Y   
Sbjct: 188 VFGVASIKGYQGE--------------------------DMSDSKRVAACLKHYIGYGAS 221

Query: 253 KWGNFSRYSFNAMITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRG-DLFQ 311
           + G    Y+    I+ Q   DT+ PP+ + ++ G A+ +M S+N ++G P       + +
Sbjct: 222 EAGRDYVYT---EISNQTLWDTYIPPYEAGVKAG-AATLMSSFNDISGTPGSANHYTMTE 277

Query: 312 KARNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDIN-CGTCMLRHTQSA 370
             +N W   G++ SD  AV  + + Q +    +++A     AG++++  G C  +H    
Sbjct: 278 ILKNRWKHDGFVVSDWSAVPQLID-QGHAADRKEAARLAFNAGLEMDMMGHCYDKHMAKL 336

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQ 430
           +++GK+  + +D A+  +  ++ RLGLF+         K   +     +   +A   A +
Sbjct: 337 VEEGKISMQLVDDAVKRVLRIKFRLGLFDNPYTPTSTEK---ERFLLPQSLTIAEKLAEE 393

Query: 431 GIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYTG-------IPCSPKSLLRGL 483
            IVLLKN+ K LPL      ++A++GPLV N +++ G + G       +P   K  L   
Sbjct: 394 TIVLLKNENKVLPLANGNKPTIAVMGPLVQNSAELLGSWYGHGHAEDVLPI--KKALDAE 451

Query: 484 EAYVSKTHYASGCHDVPCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDRDRVSLLLP 543
            A  ++  Y  GC D   N  + F EA+ +A+KAD +++  G       E+  R  + LP
Sbjct: 452 FAGKAELIYTEGC-DFDGNDTSKFSEALAVARKADIILLCMGEKKKWSGENASRSIIELP 510

Query: 544 GQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKALAEII 603
             Q   +  + +  K P++L L  G PL +S  E        +W   PG  G K LA ++
Sbjct: 511 AIQEKFIAEMKKAGK-PIVLALANGRPLGLSKVEPLCDAIVEMW--QPGVPGGKPLAGVL 567

Query: 604 FGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGHGLSYT 663
            G  NP G+L +T +P S  ++P+   N R  ++R   G+ Y+    T +Y FG+GLSYT
Sbjct: 568 SGRVNPSGKLSIT-FPRSTGQIPIY-YNQR-KTARPQSGK-YQNIPSTPLYEFGYGLSYT 623

Query: 664 NYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQISVTNA 723
            ++Y  ++ P E     +++ G                           +  ++I VTN 
Sbjct: 624 TFNYGNINLPKE-----TIRRGE--------------------------KLVMEIPVTNV 652

Query: 724 GDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSIANKHG 783
           G  DG+ VV  F   P  +   P K+L  F++    A  +    F +DP   L+  N +G
Sbjct: 653 GKRDGAEVVHWFISDPFSTITRPCKELKHFEKQLIKAGETHIFRFEIDPMRDLAFVNANG 712

Query: 784 RRILPLGNHVLMVGELRHSLTI 805
              L  G + ++V + +   T+
Sbjct: 713 EHFLENGEYYVIVKDQKVKFTV 734


>gi|427411074|ref|ZP_18901276.1| hypothetical protein HMPREF9718_03750 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425710259|gb|EKU73281.1| hypothetical protein HMPREF9718_03750 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 792

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 220/749 (29%), Positives = 334/749 (44%), Gaps = 133/749 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  ESLHG  +           +  T FP  +  A SF+R L +++ S +A 
Sbjct: 140 RLGIPIL-FHEESLHGYMA-----------TDATMFPMAIGLAGSFDRQLMTDVQSVIAR 187

Query: 150 EARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           E RA       G+    +P ++I RDPRWGR +ET GEDP +     V  V   QGE   
Sbjct: 188 EVRAR------GVHLALSPVVDIARDPRWGRIEETFGEDPYLCGEMGVAAVLGLQGEG-- 239

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMIT 267
                           K  G      D++M  A  KH+  +   + G N +     A I+
Sbjct: 240 ----------------KQIGP-----DKVM--ATLKHMTGHGQPQAGENIA----PAPIS 272

Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA----CLRGDLFQKARNEWGFKGYI 323
           E++  + F PPFR  +++   + +M SYN+++GVP+     L GD+    R EW F G +
Sbjct: 273 ERELRENFFPPFRQVVKRTGIAAVMPSYNEIDGVPSHQNKWLLGDIL---RGEWHFDGAV 329

Query: 324 TSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM-LRHTQSAIDKGKVQEKDID 382
            SD  AV  +    +     E +A   L+AG+D      +  R     +  GKV  + ++
Sbjct: 330 VSDYGAVPELDTIHHVQPDLEATARAALRAGVDCELPDGLAYRTLVEQVRAGKVPLEAVN 389

Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
            A   + +++ R GLF     +  Y  L       +E + LAL AA + IVLLKND   L
Sbjct: 390 LACTRMLTLKFRAGLFENPWPRADYDAL----TGNAEARALALKAAHKSIVLLKNDGT-L 444

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDV 499
           PL   A   +A+IGP  N      GGY+ IP    SLL G++A +   +   +A G    
Sbjct: 445 PLKPGAHRKVAVIGP--NAAIARLGGYSSIPRQAVSLLDGVKAKLGNRADIVHAQGVFIT 502

Query: 500 --------------PCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR-------DRV 538
                         P  +     EAV +AK AD +++  G D  Q + +        DR 
Sbjct: 503 QSEDRSVDEVLLADPAKNRQLIAEAVEVAKTADIILLAIG-DTEQTSREGFAKNHLGDRT 561

Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
           SL L G+Q  L  ++  T K  V+  + G  P   S+       ++++   YPG+ G  A
Sbjct: 562 SLDLVGEQNDLFAAMKATGKPVVVCAINGRPP---SYPAVVDGANALIECWYPGQEGGTA 618

Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
           +A+I+FGD NPG +LP+T   ++  ++P+     R  SSR    R Y F   + ++ FG 
Sbjct: 619 MADILFGDVNPGAKLPVTVARDA-GQIPI--FYNRKPSSR----RGYLFADTSPLFPFGF 671

Query: 659 GLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQI 718
           GLSYT +++     P  L+ S  +  G D                            V++
Sbjct: 672 GLSYTKFAF----GPPRLSAS-RIGVGGD--------------------------VTVEV 700

Query: 719 SVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLSI 778
            V N G V G  VV L+      S   P K+L GF+R+      S+ +   + P E  ++
Sbjct: 701 DVRNVGTVAGEEVVQLYVHDQTASVTRPLKELKGFERIALAPGESRTVRLTIGP-EAFAL 759

Query: 779 ANKHGRRILPLGNHVLMVGELRHSLTIET 807
            N   R ++  G   +MVG    S T++T
Sbjct: 760 WNLDMREVVEPGLFDIMVGP--DSATLQT 786


>gi|401676349|ref|ZP_10808334.1| glucosidase [Enterobacter sp. SST3]
 gi|400216388|gb|EJO47289.1| glucosidase [Enterobacter sp. SST3]
          Length = 765

 Score =  240 bits (612), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 203/738 (27%), Positives = 339/738 (45%), Gaps = 138/738 (18%)

Query: 75  QEKIQQLSDNASAIPRLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAAS 134
           ++ I+++ D    + RL IP + +  + +HG                 T FP  L  A+S
Sbjct: 86  RQDIRKMQDQVMDLSRLKIPLF-FAYDVVHG---------------QRTVFPISLGLASS 129

Query: 135 FNRSLWSNIGSAVAVEARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSA 193
           FN      +G   A EA         GL   WAP +++ RDPRWGRG E  GED  + S 
Sbjct: 130 FNLDAVKTVGRVSAYEA------ADDGLNMTWAPMVDVSRDPRWGRGSEGFGEDTFLTST 183

Query: 194 YAVEFVKSFQGENWKSDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEK 253
                V++ QG++                            D   +    KH  AY   +
Sbjct: 184 MGKTMVEAMQGKS--------------------------PADRYSVMTSVKHFAAYGAVE 217

Query: 254 WGNFSRYSFNAM-ITEQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPACLRGDLFQK 312
            G      +N + ++ Q   + + PP+++ ++ G +  +M + N +NG PA     L + 
Sbjct: 218 GGK----EYNTVDMSPQRLFNDYMPPYKAGLDAG-SGAVMVALNSLNGTPATSDSWLLKD 272

Query: 313 A-RNEWGFKGYITSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCML-RHTQSA 370
             R++WGFKG   SD  A+  + ++   +   ED+    LK+G++++       ++    
Sbjct: 273 VLRDQWGFKGITVSDHGAIKELIKHGTASDP-EDAVRVALKSGINMSMSDEYYSKYLPGL 331

Query: 371 IDKGKVQEKDIDRALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSE-------HKKL 423
           +  GKV   ++D A  ++ +V+  +GLFN DP    Y  LGP D   ++       H+K 
Sbjct: 332 VKSGKVTMAELDDATRHVLNVKYDMGLFN-DP----YSHLGPKDSDPADTNAESRLHRKE 386

Query: 424 ALDAARQGIVLLKNDKKFLPLNKNAVSSLAIIGPLVNNISQMGGGYT--GIPCSPKSLLR 481
           A D AR+ +VLLKN    LPL K+   ++A++GPL ++   + G ++  G+     ++L 
Sbjct: 387 ARDVARESLVLLKNRLDTLPLKKSG--TIAVVGPLADSKRDVMGSWSAAGVADQSVTVLT 444

Query: 482 GLEAYV---SKTHYASGCH-----DV--------------PCNSDAGFHEAVRIAKKADF 519
           G+++ V   +K  YA G +     D+              P        EAV  AK++D 
Sbjct: 445 GIKSAVGENAKVVYAKGANVTNDKDIVTFLNQYEEAVKVDPRTPKEMIDEAVNAAKQSDV 504

Query: 520 VIVVAGLDLTQETEDRDRVSLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEAD 579
           V+ V G       E   R  + +P  Q  L+ ++  T K P++LVL  G PL  +  + D
Sbjct: 505 VVAVVGEAQGMAHEASSRTDITIPQSQRDLIAALKATGK-PLVLVLMNGRPL--ALVKED 561

Query: 580 SQISSILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFTKVPM--NDMNM----R 633
            Q  +IL   + G  G  A+A+++FGD+NP G+LPM+ +P S  ++P+  + +N      
Sbjct: 562 QQADAILETWFAGTEGGNAIADVLFGDYNPSGKLPMS-FPRSVGQIPVYYSHLNTGRPYN 620

Query: 634 ADSSRQYPGRSYRFYTGTQVYGFGHGLSYTNYSYKFLSAPSELTISASLKAGSDKNILQQ 693
           AD   +Y  R +    G  +Y FG+GLSYT +        S++ +SA          L++
Sbjct: 621 ADKPNKYTSRYFDEANGP-LYPFGYGLSYTTFKV------SDVKMSAP--------TLKR 665

Query: 694 TGSRLDYVHIDEVTSCTSLRFHVQISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGF 753
            G         +VT+         + VTN G  +G+ V+ ++ +    S   P KQL GF
Sbjct: 666 DG---------KVTAS--------VEVTNTGKREGATVIQMYVQDVTASMSRPVKQLRGF 708

Query: 754 DRVHTVAKGSKEISFGVD 771
           ++V+     +K ISF +D
Sbjct: 709 EKVNLKPGETKTISFPID 726


>gi|398386388|ref|ZP_10544390.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
 gi|397718419|gb|EJK79008.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
          Length = 792

 Score =  240 bits (612), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 218/740 (29%), Positives = 331/740 (44%), Gaps = 133/740 (17%)

Query: 90  RLGIPAYEWWSESLHGIASNGPGVNFNGTVSSVTSFPQVLVSAASFNRSLWSNIGSAVAV 149
           RLGIP   +  ESLHG  +           +  T FP  +  A SF+R L +++ S +A 
Sbjct: 140 RLGIPIL-FHEESLHGYMA-----------TDATMFPMAIGLAGSFDRQLMTDVQSVIAR 187

Query: 150 EARAMYNLGQAGLTF-WAPNINIFRDPRWGRGQETPGEDPMVVSAYAVEFVKSFQGENWK 208
           E RA       G+    +P ++I RDPRWGR +ET GEDP +     V  V   QGE+  
Sbjct: 188 EVRAR------GVHLALSPVVDIARDPRWGRIEETFGEDPYLCGEMGVAAVLGLQGES-- 239

Query: 209 SDDGGIGFGFREKRVLKGFGEESDRGDELMLSACCKHLIAYDLEKWG-NFSRYSFNAMIT 267
                           K  G      D++M  A  KH+  +   + G N +     A I+
Sbjct: 240 ----------------KQIGP-----DKVM--ATLKHMTGHGQPQAGENIA----PAPIS 272

Query: 268 EQDTEDTFQPPFRSCIEQGKASCIMCSYNQVNGVPA----CLRGDLFQKARNEWGFKGYI 323
           E++  + F PPFR  +++   + +M SYN+++GVP+     L GD+    R EW F G +
Sbjct: 273 ERELRENFFPPFRQVVKRTGIAAVMPSYNEIDGVPSHQNKWLLGDIL---RGEWHFDGAV 329

Query: 324 TSDCDAVATIFEYQNYTKTHEDSAAGVLKAGMDINCGTCM-LRHTQSAIDKGKVQEKDID 382
            SD  AV  +    +     E +A   L+AG+D      +  R     +  GKV  + ++
Sbjct: 330 VSDYGAVPELDTIHHVQPDLEATARAALRAGVDCELPDGLAYRTLVEQVRAGKVPLEAVN 389

Query: 383 RALLNLFSVQLRLGLFNGDPRKGKYGKLGPDDVCTSEHKKLALDAARQGIVLLKNDKKFL 442
            A   + +++ R GLF     +  Y  L       +E + LAL AA + IVLLKND   L
Sbjct: 390 LACTRMLTLKFRAGLFENPWPRADYDAL----TGNAEARALALKAAHKSIVLLKNDGT-L 444

Query: 443 PLNKNAVSSLAIIGPLVNNISQMGGGYTGIPCSPKSLLRGLEAYV---SKTHYASGCHDV 499
           PL   A   +A++GP  N      GGY+ IP    SLL G++A +   +   +A G    
Sbjct: 445 PLRPGAHRKVAVVGP--NAAIARLGGYSSIPRQAVSLLDGVKAKLGNRADIVHAQGVFIT 502

Query: 500 --------------PCNSDAGFHEAVRIAKKADFVIVVAGLDLTQETEDR-------DRV 538
                         P  +     EAV +AK AD +++  G D  Q + +        DR 
Sbjct: 503 QSEDRSVDEVFLADPAKNRQLIAEAVEVAKTADIILLAIG-DTEQTSREGFAKNHLGDRT 561

Query: 539 SLLLPGQQMSLVTSVARTSKRPVILVLTGGGPLDVSFAEADSQISSILWIGYPGEAGAKA 598
           SL L G+Q  L  ++  T K  V+  + G  P   S+       +++L   YPG+ G  A
Sbjct: 562 SLDLVGEQNELFAAMKATGKPVVVCAINGRPP---SYPAVVDGANALLECWYPGQEGGTA 618

Query: 599 LAEIIFGDFNPGGRLPMTWYPESFTKVPMNDMNMRADSSRQYPGRSYRFYTGTQVYGFGH 658
           +A+I+FGD NPGG+LP+T   ++  ++P+     R  SSR    R Y F   + ++ FG 
Sbjct: 619 MADILFGDVNPGGKLPVTVARDA-GQIPI--FYNRKPSSR----RGYVFEDSSPLFPFGF 671

Query: 659 GLSYTNYSYKFLSAPSELTISAS-LKAGSDKNILQQTGSRLDYVHIDEVTSCTSLRFHVQ 717
           GLSYT +++       +  +SA  +  G D                            V+
Sbjct: 672 GLSYTKFTF------GKPRLSAGRIGVGGD--------------------------VFVE 699

Query: 718 ISVTNAGDVDGSHVVMLFARVPKVSQGTPEKQLIGFDRVHTVAKGSKEISFGVDPCEQLS 777
           I V N G V G  VV L+      S   P K+L GF+R+      S+ +   + P E  S
Sbjct: 700 IEVRNVGSVAGEEVVQLYVHDQTASVTRPIKELKGFERIALAPGESRTVRLAIGP-EAFS 758

Query: 778 IANKHGRRILPLGNHVLMVG 797
           + N   + ++  G   +MVG
Sbjct: 759 LWNLDMQEVVEPGLFDIMVG 778


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.135    0.408 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,736,236,143
Number of Sequences: 23463169
Number of extensions: 545838363
Number of successful extensions: 1160191
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4767
Number of HSP's successfully gapped in prelim test: 2450
Number of HSP's that attempted gapping in prelim test: 1110655
Number of HSP's gapped (non-prelim): 18956
length of query: 808
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 657
effective length of database: 8,816,256,848
effective search space: 5792280749136
effective search space used: 5792280749136
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)